Showing 186 open source projects for "html source extractor"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    ansifilter

    ansifilter

    ANSI sequence filter

    Ansifilter handles text files containing ANSI terminal escape codes. The command sequences may be stripped or be interpreted to generate formatted output (HTML, RTF, TeX, LaTeX, BBCode and Pango Markup).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,627 This Week
    Last Update:
    See Project
  • 3
    Tinn-R

    Tinn-R

    Tinn-R Editor - GUI for R Language and Environment

    The Tinn-R is an open source (GNU General Public License) project. It is an editor/word processor ASCII/UNICODE generic for the Windows operating system, very well integrated into the R, with characteristics of Graphical User Interface (GUI) and Integrated Development Environment (IDE). Project leader and main developer: José Cláudio Faria/UESC/DCET. LANGUAGE: Object Pascal, IDE: DELPHI 2007.
    Downloads: 47 This Week
    Last Update:
    See Project
  • 4
    Govel

    Govel

    Lightweight and lightning fast novel writing and organizing tool

    Govel is a lightweight novel writing and organizing software designed with the intention to leave your hands where they belong: on the keyboard. There are no fancy menus, submenus, dialogues, buttons, tabs, you name it. You don't even need to use the mouse, which makes your overall workflow faster than in any other program. You can still customize the visual appearance while beeing maximum productive. And the best of all: Govel is completly free.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL from any text content in instant. It's useful when you lazy enough to identify and copy-paste URL from your content one by one by yourself.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified...
    Leader badge
    Downloads: 113 This Week
    Last Update:
    See Project
  • 9
    RTextDoc

    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking and built-in free dictionaries. RTextDoc has syntax highlighting, bracket matching, folding, document structure browser for sections and labels, bookmarks, manager for LaTeX symbols, an editor for mathematical equations,integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. AsciiDoc...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    EBookGenTools

    EBookGenTools

    EBook Generation Tools - scripts to create ebook formats EPUB, DOC

    EBookGenTools is a set of GNU/Linux shell scripts to process plain text for a book into HTML and electronic book formats. It was developed to create EPUB and DOC files from book text exported from novel writing software such as Manuskript, StoryBook, or your favourite text editor. EBookGenTools builds on the power of other software to create the following ebook formats: - EPUB: Calibre - ebook management - DOC: LibreOffice - free office suite These tools can be used directly to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    SimpleTextFormatter

    STF automatically generates documentation

    STF is a system of automatically generating documentation under control of a program or a script. It is frequently used to automatically generate test reports. STF is also used to clean up the output of a process and turn it into a nice looking report.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    CNote

    CNote

    A Fast Lightweight Note & Code Editor

    CNote is a fast and lightweight note/code editor, it can help you quickly create/edit notes or write code. it has all the important features you need in one place, With Live Html Preview, Python Code Execution, and more, you can find the changelog at the Github repo.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 13
    SimplyHTML is an application and a java component for rich text processing. It stores documents as HTML files in combination with Cascading Style Sheets (CSS). SimplyHTML is not intended to be used as an editor for web pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Blackfriday

    Blackfriday

    Blackfriday, a markdown processor for Go

    Blackfriday is a Markdown processor implemented in Go. It is paranoid about its input (so you can safely feed it user-supplied data), it is fast, it supports common extensions (tables, smart punctuation substitutions, etc.), and it is safe for all utf-8 (unicode) input. HTML output is currently supported, along with Smartypants extensions. It started as a translation from C of Sundown. Blackfriday itself does nothing to protect against malicious content. If you are dealing with user-supplied...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16

    htmleditor.py

    A Python based HTML and CSS Editor

    Requires PyQt >= 5.2 QsciScintilla >= 2.8 Python >=3.4
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    wordTabulator
    Program wordTabulator is intended for text analysis. It can generate index of word elements extracted from defined text set. Word elements may be words, N-grams or phrases (syntagmes). The program can process texts as in ordinary 1-byte encoding (ANSI), as in multibyte UTF-8 encoding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Programa para generar documentos HTML con expresiones matemáticas incrustadas, procesadas con Maxima (maxima.sourceforge.net).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ChessTask is a frontend for easily creating chess tasks with LaTeX. The tasks are stored in a list which can be exported, either to a LaTeX input file, or to HTML files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Web Book Downloader

    Web Book Downloader

    Download websites as e-book: pdf, txt, epub.

    This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 23
    Rondel for Haiku

    Rondel for Haiku

    Markdown-capable text editor for Haiku

    ...Nothing special about it, you've used text editors before. But Rondel allows you to insert Markdown formatting codes, either by hand or from its menus, and to generate HTML files from its own text files. Installation: Move (don't copy) the .hpkg file into /boot/home/config/packages on any Haiku installation with Package Management enabled. If your Haiku installation is quite up-to-date, you should be able to double-click the .hpkg and have it load into HaikuDepot, ready for installation. The source files will be found in /boot/home/config/data/src after installation, in Yabadabbadoo format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ...Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 9 This Week
    Last Update:
    See Project
  • 25
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. ...
    Leader badge
    Downloads: 156 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB