130 programs for "scrape text from html" with 1 filter applied:

  • Finance Automation that puts you in charge Icon
    Finance Automation that puts you in charge

    Tipalti delivers smart payables that elevate modern business.

    Our robust pre-built connectors and our no-code, drag-and-drop interface makes it easy and fast to automatically sync vendors, invoices, and invoice payment data between Tipalti and your ERP or accounting software.
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 2
    dude uncomplicated data extraction

    dude uncomplicated data extraction

    dude uncomplicated data extraction: A simple framework

    Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, was to easily build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from terminal/shell/command-line by supplying URLs, the output filename of your choice and the paths to your python scripts to dude scrape command.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Laravel Response Cache

    Laravel Response Cache

    Speed up a Laravel app by caching the entire response

    This Laravel package can cache an entire response. By default, it will cache all successful get-requests that return text-based content (such as HTML and json) for a week. This could potentially speed up the response quite considerably. So the first time a request comes in the package will save the response before sending it to the users. When the same request comes in again we're not going through the entire application but just respond with the saved response.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    command-output-to-html-table

    command-output-to-html-table

    A shell script to convert any file or command output into a html table

    Please watch the video below, to convert any file or a command output into a nice html table, in less than 5 Minutes time. The output html file can then be browsed from any location, using a local webserver or an internet www domain. Usage Examples: (Type them on Terminal) cd ~/Downloads/tabulate # location chmod +x *.sh cat "student_marks.csv" | { cat ; echo ; } | ./tabulate.sh -d "," -t "My School" -h "First Term" > "marks.html" # or > "/var/www/html/marks.html" -d specifies...
    Downloads: 11 This Week
    Last Update:
    See Project
  • Powering the next decade of business messaging | Twilio MessagingX Icon
    Powering the next decade of business messaging | Twilio MessagingX

    For organizations interested programmable APIs built on a scalable business messaging platform

    Build unique experiences across SMS, MMS, Facebook Messenger, and WhatsApp – with our unified messaging APIs.
  • 5
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,814 This Week
    Last Update:
    See Project
  • 6
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    .... Due to its technological architecture design, OpenKM meets the document management needs of businesses of all sizes (from SMEs to big corporations). Thanks to its elegant and intuitive interface, OpenKM transforms complex operations into easy tasks. The most relevant functions of OpenKM is the indexing of the most common types of files: text, Office, Office 2007, OpenOffice, PDF, HTML, XML, MP3, JPEG, etc. For a complete feature list take a look at http://goo.gl/au8cQy
    Leader badge
    Downloads: 1,130 This Week
    Last Update:
    See Project
  • 7
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified (zh_CN...
    Leader badge
    Downloads: 106 This Week
    Last Update:
    See Project
  • 8
    TextTest is an application-independent tool for text-based functional testing. This means running a batch-mode binary in lots of different ways, and using the text output produced as a means of controlling the behaviour of that application.
    Leader badge
    Downloads: 92 This Week
    Last Update:
    See Project
  • 9
    Writer2LaTeX and Writer2xhtml is a collection of converters from OpenDocument Format (ODF) to LaTeX/BibTeX, HTML+MathML and EPUB. It is delivered as a standalone java library, as a command line application and as extensions for LibreOffice.
    Leader badge
    Downloads: 58 This Week
    Last Update:
    See Project
  • Intelligent network automation for businesses and organizations Icon
    Intelligent network automation for businesses and organizations

    Network automation for the hybrid multi-cloud era

    BackBox seamlessly integrates with network monitoring and NetOps platforms and automates configuration backups, restores, and change detection. BackBox also provides before and after config diffs for change management, and automated remediation of discovered network security issues.
  • 10
    TinTin++ Mud Client

    TinTin++ Mud Client

    A MUD client for Android, iOS, Linux, Mac OS X and Windows

    TinTin++ is a console mud client for any type of text mud or bbs. The mud client's main assets are an easy to learn scripting language, triggers, automapping, and a split screen interface to separate mud output from client input. Installation instructions for Android, iOS, and several other operating systems are available on the website. https://tintin.mudhalla.net
    Leader badge
    Downloads: 86 This Week
    Last Update:
    See Project
  • 11
    Quiz/Survey/Test - QST

    Quiz/Survey/Test - QST

    A Free, complete, enterprise grade, open source exam management system

    QST, the worlds unparalleled open source online/lan assessment software. From a quick quiz on your phone to very large scale, high stakes, proctored desktop testing, we make it easy/secure/economical. Our intuitive design contains features (Immediate detailed results, Create/Export/Import/Convert Questions, WYSIWYG/Math-Chemistry/Basic Editors, Question/Item Bank, Multiple Question Types, Multiple Delivery Styles, Multiple Delivery/Results Options, Adaptive/Branching Questions, Randomly Chosen...
    Leader badge
    Downloads: 106 This Week
    Last Update:
    See Project
  • 12
    Wapiti

    Wapiti

    Wapiti is a web-application vulnerability scanner

    Wapiti is a vulnerability scanner for web applications. It currently search vulnerabilities like XSS, SQL and XPath injections, file inclusions, command execution, XXE injections, CRLF injections, Server Side Request Forgery, Open Redirects... It use the Python 3 programming language.
    Leader badge
    Downloads: 67 This Week
    Last Update:
    See Project
  • 13
    WIKINDX

    WIKINDX

    Virtual Research Environment / On-line Bibliography Manager

    Reference management, bibliography management, citations and a whole lot more. Designed by academics for academics, under continuous development since 2003, and used by both individuals and major research institutions worldwide, WIKINDX is a Virtual Research Environment (an enhanced on-line bibliography manager) storing searchable references, notes, files, citations, ideas, and more. An integrated WYSIWYG word processor exports formatted articles to RTF and HTML. Plugins include...
    Leader badge
    Downloads: 47 This Week
    Last Update:
    See Project
  • 14
    readpe

    readpe

    The PE file analysis toolkit

    readpe (formerly known as pev) is a multiplatform toolkit to work with PE (Portable Executable) binaries. Its main goal is to provide feature-rich tools for properly analyze binaries with a strong focus on suspicious ones.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 15
    maXbox

    maXbox

    maXbox: Precompiled Object Based Scripting Tool

    maXbox: Precompiled Object Based Scripting Tool maXbox is a free scripter tool with an inbuilt object pascal delphi engine in one exe! It is designed for teaching, develop, test and analyzing apps and algorithms and runs under Win, Mac and Linux (CLX) to set code in a box without installation and no administration. The tool for 32 and 64-bit is based on an educational program with examples and exercises (from biorhythm, REST services , form building to how math works). Pure Code for Object...
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 16

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Leader badge
    Downloads: 69 This Week
    Last Update:
    See Project
  • 17
    Custom-Puppy-Linux-For-Scripts

    Custom-Puppy-Linux-For-Scripts

    A Custom Puppy Linux Live CD For Creating & Running Shell Scripts

    This Live CD is created for the purpose of writing & running shell scripts on various computers. This is a 64-bit system for better performance and features. Please see the Images of its Desktop below. Pressing F4 from any Folder on the File Manager will launch the Terminal and will set your working directory automatically. After Downloading, Either Burn this .iso image to a DVD and boot from it on any computer OR install it to a USB drive with the multiboot-usb software, available here...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    metaf2xml

    metaf2xml

    Parse and decode METAR, TAF, SYNOP, BUOY, AMDAR and write data as XML

    metaf2xml can download, parse and decode aviation routine weather reports (METAR, SPECI, SAO), aerodrome forecasts (TAF), synoptic observations (SYNOP), observations from buoys (BUOY) and meteorological reports from aircrafts (AMDAR). Data can also be taken from decoded BUFR messages. The extracted data can be written as XML or passed to a user-defined function (all done in Perl). It also provides XSLT style sheets to convert the XML to plain language (text, HTML), or XML with different...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Ring

    Ring

    Simple and flexible programming language for applications development

    The Ring is a practical general-purpose multi-paradigm language. The supported programming paradigms are Imperative, Procedural, Object-Oriented, Functional, Meta programming, Declarative programming using nested structures, and Natural programming. The language is portable (Windows, Linux, macOS, Android, etc.) and can be used to create Console, GUI, Web, Games and Mobile applications. The language is designed to be Simple, Small and Flexible.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Rayshade

    Rayshade

    Rayshade raytracer

    now gnu-ized, gcc-4.7.4 or gcc-10.2.0 A raytracer does not require custom code for (shadows, mirrors) like GL: it uses physics to simulate light to make realistic images, leaving one to specify only what is in the scene. (at a cost of speed) Rayshade is a 1990's raytracer, a great one back then (and still useful). Rayshade has an excellent easy to read yet informative User's Guide that other's could not help but copy from. (html of guide is in rayshade-4.0-ftp-and-guide.tar.gz...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21

    myoradump

    MyOraDump is a tool for exporting data from an Oracle database

    MyOraDump is a program that will export oracle data stored in most of the basic oracle types as a text file that can for example be used for import into other databases, for example MariaDB and MySQL. The program is written in C using OCI and supports array fetching, which means good performance. In addition BLOB, CLOB, NCLOB as well as RAW, LONG and LONG RAW is supported. Output can be in several formats, like separated values, JSON, MySQL dump and HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    adx - addressbook.xml

    adx - addressbook.xml

    Minimalistic address book in web browser. No server or plugin needed.

    Minimalistic but full-featured addressbook in your web browser. adx is a standalone and portable web app (online and offline). FEATURES Contact Management, portable, small (~350KB), lightweight, contact tagging, geo mapping, web accounts, trigger phone/Skype calls, etc. EXPORT FUNCTIONALITY vCard (as file or QR code via offline generator) HOW IT WORKS Your address-book (XML file) is transformed in your web browser (via XSLT) to a full-featured web application (HTML...
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    WP2LaTeX is a program for conversion WordPerfect MAC1.x, 2.x, 3.x and PC 4.x,5.x,6.x documents to LaTeX. Converted features are: Headers, Tables, Equations, Centered+ Right+ Left text, a lot of extended characters (greek, math,cyrilic) and of course a normal text. Alternative URL: http://78.108.103.11/~fojtik/wp2latex/wp2latex.htm and also http://ftsoft.wz.cz/wp2latex/wp2latex.htm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Automated_Quiz

    Automated_Quiz

    A Shell Script for Quizzing the whole class of students and log result

    Please download the above zip file and extract it to a secret folder and run the quiz.sh script from that folder in your terminal program, as shown in the youtube videos : https://youtu.be/kQi0XlIaT9M and https://youtu.be/prwKN5DiFp0 You may change the contents of the quiz.txt file to your liking, while maintaining the format of it. Note: Press Enter/Return key after typing on your Terminal, to proceed to the next step, while testing the quiz.sh script. Also, increase the terminal font...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Create and Host Web Forms For All

    Create and Host Web Forms For All

    Only Text Editing Skills Required - Convert All Paper Based Forms

    ... people whom we share this folder link with, can then access our form. P.S. This word was generated using the free service : https://www.random.org/strings/ Note: Please use the html required attribute ( search google ) for all form elements, so that users don't skip filling up your input fields.
    Downloads: 0 This Week
    Last Update:
    See Project