Showing 22 open source projects for "scrape text from html"

View related business solutions
  • Precoro helps companies spend smarter Icon
    Precoro helps companies spend smarter

    Fully Automated Process in One Tool: From Purchase Orders to Budget Control and Reporting.

    For minor company expenses, you might utilize a spend management solution or track everything in spreadsheets. For everything more, you'll need Precoro. We help companies achieve procurement excellence and budget efficiency by building transparent, predictable, automated spending workflows.
  • Cybersecurity Management Software for MSPs Icon
    Cybersecurity Management Software for MSPs

    Secure your clients from cyber threats.

    Define and Deliver Comprehensive Cybersecurity Services. Security threats continue to grow, and your clients are most likely at risk. Small- to medium-sized businesses (SMBs) are targeted by 64% of all cyberattacks, and 62% of them admit lacking in-house expertise to deal with security issues. Now technology solution providers (TSPs) are a prime target. Enter ConnectWise Cybersecurity Management (formerly ConnectWise Fortify) — the advanced cybersecurity solution you need to deliver the managed detection and response protection your clients require. Whether you’re talking to prospects or clients, we provide you with the right insights and data to support your cybersecurity conversation. From client-facing reports to technical guidance, we reduce the noise by guiding you through what’s really needed to demonstrate the value of enhanced strategy.
  • 1
    MultiMarkdown-6

    MultiMarkdown-6

    Lightweight markup processor to produce HTML, LaTeX, and more

    Lightweight markup processor to produce HTML, LaTeX, and more. MultiMarkdown is a superset of the Markdown lightweight markup syntax with support for additional output formats and features. Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TinTin++ Mud Client

    TinTin++ Mud Client

    A MUD client for Android, iOS, Linux, Mac OS X and Windows

    TinTin++ is a console mud client for any type of text mud or bbs. The mud client's main assets are an easy to learn scripting language, triggers, automapping, and a split screen interface to separate mud output from client input. Installation instructions for Android, iOS, and several other operating systems are available on the website. https://tintin.mudhalla.net
    Leader badge
    Downloads: 86 This Week
    Last Update:
    See Project
  • 3
    readpe

    readpe

    The PE file analysis toolkit

    readpe (formerly known as pev) is a multiplatform toolkit to work with PE (Portable Executable) binaries. Its main goal is to provide feature-rich tools for properly analyze binaries with a strong focus on suspicious ones.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 4
    LPub3D

    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/ ) and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS Operating Systems. LPub3D is also...
    Downloads: 17 This Week
    Last Update:
    See Project
  • Manage your IT department more effectively Icon
    Manage your IT department more effectively

    Streamline your business from end to end with ConnectWise PSA

    ConnectWise PSA (formerly Manage) allows you to stop working in separate systems, and helps you build a more profitable business. No more duplicate data entries, inefficient employees, manual invoices, and the inability to accurately track client service issues. Get a behind the scenes look into the award-winning PSA that automates processes for each area of business: sales, help desk, support, finance, and HR.
  • 5
    Ring

    Ring

    Simple and flexible programming language for applications development

    The Ring is a practical general-purpose multi-paradigm language. The supported programming paradigms are Imperative, Procedural, Object-Oriented, Functional, Meta programming, Declarative programming using nested structures, and Natural programming. The language is portable (Windows, Linux, macOS, Android, etc.) and can be used to create Console, GUI, Web, Games and Mobile applications. The language is designed to be Simple, Small and Flexible.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Rayshade

    Rayshade

    Rayshade raytracer

    now gnu-ized, gcc-4.7.4 or gcc-10.2.0 A raytracer does not require custom code for (shadows, mirrors) like GL: it uses physics to simulate light to make realistic images, leaving one to specify only what is in the scene. (at a cost of speed) Rayshade is a 1990's raytracer, a great one back then (and still useful). Rayshade has an excellent easy to read yet informative User's Guide that other's could not help but copy from. (html of guide is in rayshade-4.0-ftp-and-guide.tar.gz...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7

    myoradump

    MyOraDump is a tool for exporting data from an Oracle database

    MyOraDump is a program that will export oracle data stored in most of the basic oracle types as a text file that can for example be used for import into other databases, for example MariaDB and MySQL. The program is written in C using OCI and supports array fetching, which means good performance. In addition BLOB, CLOB, NCLOB as well as RAW, LONG and LONG RAW is supported. Output can be in several formats, like separated values, JSON, MySQL dump and HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    wordTabulator
    Program wordTabulator is intended for text analysis. It can generate index of word elements extracted from defined text set. Word elements may be words, N-grams or phrases (syntagmes). The program can process texts as in ordinary 1-byte encoding (ANSI), as in multibyte UTF-8 encoding.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    JCppEdit v4.0

    JCppEdit v4.0

    Best IDE for Beginners

    JCppEdit is a free as well as "best IDE for Beginners" and is your one-stop IDE for all your coding needs. Whether you need to finish your Java project or submit your first HTML web page or perhaps have a need to code in C language while executing a java program into a Java IDE, JCppEdit will help you achieve your goals easily. Exploring your project and detecting an error in your code is much easier. You will not waste time detecting errors before compiling codes because you will get a real...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Translate docs, audio, and videos in real time with Google AI Icon
    Translate docs, audio, and videos in real time with Google AI

    Make your content and apps multilingual with fast, dynamic machine translation available in thousands of language pairs.

    Google Cloud’s AI-powered APIs help you translate documents, websites, apps, audio files, videos, and more at scale with best-in-class quality and enterprise-grade control and security.
  • 10

    XmHTML

    HTML Widget for Motif (for HTML 3.2)

    XmHTML is a HTML widget for Motif. It supports HTML 3.2. Since HTML 3.2 is an older version of HTML, XmHTML is primarily useful for displaying help texts or to visualize results of computations in an application. XmHTML is an old project with contributions from many people. It was abandoned for a while and was reactivated because Arahne, d.o.o. ( www.arahne.si ) ordered the following enhancements in XmHTML from Christian Linhart: *support for antialiased XFT fonts *support for UTF8...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    Gumbo

    Gumbo

    An HTML5 parsing library in pure C99

    Gumbo is an implementation of the HTML5 parsing algorithm implemented as a pure C99 library with no outside dependencies. It's designed to serve as a building block for other tools and libraries such as linters, validators, templating languages, and refactoring and analysis tools. Gumbo gains some of this by virtue of being written in C, but it is not an important consideration for the intended use-case, and was not a major design factor. Gumbo is intentionally designed to turn an HTML document...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Z Notation E-Mail Mark-up Tools

    Tools to convert Z mark-up to HTML or text.

    A small library and two command-line tools to parse and convert Z notation from the "e-mail" mark-up into HTML code, or into UTF-8 text with box-drawing graphics, or into the Z Standard text format. See the project's Wiki Home Page for details --- the "Wiki" button in the bar above, or the following link:
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    TexLexAn is an open source text analyser for Linux, able to estimate the readability and reading time, to classify and summarize texts. It has some learning abilities and accepts html, doc, pdf, ppt, odt and txt documents. Written in C and Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    If you have many documents those written in several type such as pdf, chm, doc, html... How can you fast search their contents concurrently? Rebook software solves that problems. How can you get differences things between them? Rebook software supports that problems. How can you get similar things between them? Rebook software supports that problems. While you are typing a document/text, how can you fast reference the give contents in your documents? Use Rebooks software...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Search and export numerics from any text/ascii file. Data sets (scalar, vector, matrix) are given unique names, based on file content. Results can be generated for Matlab, IDL, Scilab, Octave, XML, HTML A wrapper exists for direct usage from Matlab.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This is a text-mode program designed to compact javascript in HTML pages.. It uses libjs from the mozilla project, and thus fully understands the javascript and is able to do a better job than compressors that use regular expressions or similar kludges.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LICOM is a compression/de-compression tool that sits in-between the browser and content from the server. It aims to facilitate compression of ASCII text, RTF, DOC, PDF, HTML, GIF, JPG, BMP and similar filetypes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Papyrus is an XML reporting engine for Linux. Papyrus enables you to generate reports from a variety of different SQL databases (postgresql,mysql,etc). Your reports can be generated as PDF, PostScript, XML, HTML, DVI, Latex or straight ANSI text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    pcheck is a console proxy checker written in c language. It can extract and check HTTP proxy servers from different files. It works with any text file format(e.g. xml, html). also it can check proxy for anonymity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    GAAT means GNOME ASCII Art Tool. It is an ASCII text or HTML file format generator. It works with GNOME2, GTK2, PANGO and GdkPixbuf. You can choose the type of the output file, resize or select the characters to use in the transformation from image to t
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    RBML is similar to XML or ASP. It is a server-side tool for either generating static HTML pages or dynamic pages as CGI-BIN scripts. RBML is a macro expansion language. Text from the input file is written to the output file unmodified unless it is enclo
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    tableforms

    A HTML form manager

    A mini web content management system (CMS) for forms, which we hope mixes well with other CMSs. The objective is to develop a http(s) based service using existing http(s) servers that manages HTML forms and results from form submissions from the client (browser) side. Uses file types: PHP, HTML 5, CSS, javaScript, bash, GNU make, C, and plan text. Includes: * client side WYSIWYG (what you see is what you get), javaScript/HTML/CSS based, HTML form editor, * client side form...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next