Showing 34 open source projects for "html source extractor"

  • 1
    KOReader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.

    KOReader is a document viewer for E Ink devices. Supported file formats include EPUB, PDF, DjVu, XPS, CBT, CBZ, FB2, PDB, TXT, HTML, RTF, CHM, DOC, MOBI and ZIP files. It runs on embedded devices (Cervantes, Kindle, Kobo, PocketBook, reMarkable), Android and desktop Linux. Developers can run a KOReader emulator on Linux and macOS. Multi-lingual user interface with a highly customizable reader view and many typesetting...
    Downloads: 143 This Week
  • 2
    Linux command

    Linux command encyclopedia search tool

    A search tool for a Linux command encyclopedia. The content covers Linux command manuals, detailed explanations, study notes, and collections. The repository currently holds more than 570 Linux commands. It is a non-profit repository, and a web site has been generated from it for easy use; the site currently carries no advertisements. A Linux command quick reference manual well worth bookmarking. The copyright belongs to the...
    Downloads: 9 This Week
  • 3
    Pro Git

    Pro Git 2nd Edition

    Welcome to the second edition of the Pro Git book. The entire Pro Git book, written by Scott Chacon and Ben Straub and published by Apress, is available here. All content is licensed under the Creative Commons Attribution Non Commercial Share Alike 3.0 license. Print versions of the book are available on Amazon.com. What is “version control”, and why should you care? Version control is a system that records changes to a file or set of files over time so that you can recall specific versions...
    Downloads: 8 This Week
  • 4
    VideLibri

    Client for public libraries

    VideLibri lists the books you have borrowed from a public library and lets you search the library catalog from your local device. It has all the usual features of a library OPAC, i.e. viewing, renewing, searching and ordering books. It also renews all due books automatically and can do other things only a locally running app can do, like keeping a history of all books ever borrowed across multiple libraries. Developed in 2006, it is the world's first library app, and by supporting 20...
    Leader badge
    Downloads: 2,003 This Week
  • 5
    Kiwix

    Wikipedia offline & more

    Kiwix is an offline reader for Web content. It's especially intended to make Wikipedia available offline. With Kiwix, you can enjoy Wikipedia on a boat, in the middle of nowhere... or in jail. Kiwix manages to do that by reading ZIM files, a highly compressed open format with additional metadata.
    Leader badge
    Downloads: 292 This Week
  • 6
    WIKINDX

    Virtual Research Environment / On-line Bibliography Manager

    Reference management, bibliography management, citations and a whole lot more. Designed by academics for academics, under continuous development since 2003, and used by both individuals and major research institutions worldwide, WIKINDX is a Virtual Research Environment (an enhanced on-line bibliography manager) storing searchable references, notes, files, citations, ideas, and more. An integrated WYSIWYG word processor exports formatted articles to RTF and HTML. Plugins include a...
    Leader badge
    Downloads: 37 This Week
  • 7
    elibsrv

    a light OPDS/HTML server indexing EPUB and PDF files

    elibsrv is a light, standalone OPDS server for Linux. It generates an OPDS repository of EPUB and/or PDF files scanned from on-disk directories. It also provides a simple HTML interface for non-OPDS humans, which makes it a good fit for both OPDS-aware devices (like Android with FBReader or Aldiko) and browsers with EPUB/PDF capabilities (e.g. Firefox with the excellent EPUBReader plugin). It's worth noting that elibsrv is a complete solution, i.e. it doesn't rely on third...
    Downloads: 0 This Week
  • 8
    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus that scans a whole text file (in plain text, HTML, EPUB, or ODT format) and ranks all words used by frequency, performing a quantitative analysis of the text using the Shannon-Weaver information statistic and the Zipf power-law function. It counts words, sentences, characters, spaces, and syllables. It also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 3 This Week
  • 9
    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on intuitionistic fuzzy sets and possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
  • 10
    Awesome Portfolio Websites

    Personal portfolio for researchers, developers, and analysts

    A community-maintained open-source project aimed at making a personal portfolio for researchers, developers and analysts. Simple, fast and less cumbersome. We make sure you have a full-fledged website to showcase your work while you can spend time on your learning and innovative endeavors.
    Downloads: 0 This Week
  • 11

    eXeAutomaticResources

    Automatic resource generation for eXeLearning

    Bash scripts that automatically generate the code for inserting resources into eXeLearning 2.6 (Snap), and that automatically create the credits.
    Downloads: 0 This Week
  • 12
    Epub.js

    Enhanced eBooks in the browser

    Epub.js is an open-source JavaScript library that allows any web page to render EPUB documents on any device with a modern browser. Epub.js contains a flexible rendering engine and provides a simple interface for common ebook functions such as styling, persistence and pagination. We release and maintain Epub.js on GitHub, with a growing developer community. Just include a single JavaScript file and point it at a hosted EPUB. Epub.js provides an interface for common ebook functions (such...
    Downloads: 7 This Week
  • 13

    Edition Visualization Technology

    Digital edition visualization software

    EVT is a tool that creates web-based diplomatic and/or interpretative editions starting from XML TEI-encoded documents. The current release, EVT v. 1.3, builds upon the feature set of the previous releases and adds a number of new features: a new navigation bar; support for translations in multiple languages; support for VisColl style sheets; support for comment and critical <note>s; support for verse numbering and...
    Downloads: 3 This Week
  • 14
    Cool Reader

    A cross-platform XML/CSS based eBook reader

    CoolReader is a fast and small cross-platform XML/CSS based eBook reader for desktops and handheld devices. Supported formats: FB2, TXT, RTF, DOC, TCR, HTML, EPUB, CHM, PDB, MOBI. Platforms: Win32, Linux, Android. Ported to some eInk based devices.
    Leader badge
    Downloads: 391 This Week
  • 15

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
  • 16

    QuickSlide

    A tool to generate HTML-based slide shows.

    This tool generates slide shows in HTML format that run directly in the browser. It cannot replace Microsoft PowerPoint, but it lets you create a slide show by writing Java code. The generated slide show runs in the browser, so it is platform independent and needs no additional software installation, even on a freshly installed OS or a smartphone. In this first pre-alpha version, the generated code supports simple text and basic lists only, but I hope to...
    Downloads: 0 This Week
  • 17

    QuickWeb

    A tool for quickly creating the skeleton of a web site.

    QuickWeb is a tool to quickly create the skeleton of a web site. The tool is developed in Java, and creating a web site requires no HTML manipulation at the start. All you have to do is write some simple Java code that generates a web site of several pages. Created to help me document different SourceForge projects, QuickWeb does not, for now, generate any CSS or JS code, but it can generate the skeleton and the static content of a website composed of several web pages and...
    Downloads: 0 This Week
  • 18
    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is software that provides a study environment for ancient languages (esp. Coptic, Greek, Latin) along with many tools and resources (dictionaries, grammars, texts). Although Marcion focuses on the study of Gnosticism and early Christianity, it is a universal library that works with various file formats and lets you collect, organize and back up texts of any kind. Overview of Gnostic sources in the Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex...
    Leader badge
    Downloads: 21 This Week
  • 19
    xowa

    A free, open-source, offline Wikipedia application

    XOWA is a desktop application for reading and editing Wikipedia offline (XOWA has moved to http://gnosygnu.github.io/xowa/download.html)
    Downloads: 5 This Week
  • 20
    Klen-library
    Open Source program for creating electronic books and electronic manuals. Contains professional tools.
    Downloads: 0 This Week
  • 21
    مكتبة القرطاس

    An electronic library with everything you need

    مكتبة القرطاس (Al-Qirtas Library) is a free, open program that runs on all operating systems: Linux, Windows, or Mac. It lets you read and edit books, and you can also add new books.
    Downloads: 7 This Week
  • 22
    Saaghar
    “Saaghar” (ساغر) is Persian poetry software written in C++ with the Qt framework; it uses the "ganjoor" database as its data source. Both its “Viewer” and its “Search” pages support tabs, which makes it suitable for research purposes.
    Downloads: 4 This Week
  • 23
    Acumen Digital Library System
    Acumen has gone through a near-complete rewrite. Please see the GitHub page for details - https://github.com/AcumenProject [OLD Description] Acumen is a simple, robust digital library system using PHP, MySQL, and Apache. It is easy to set up and administer, does not complicate workflows, supports all browsers, and automatically creates and updates its own index database.
    Downloads: 0 This Week
  • 24
    note taking simplified
    *nts* provides a simple format for using text files to store notes, a command line interface for viewing notes in a variety of convenient ways and a cross-platform, wx(python)-based GUI for creating and modifying notes as well as viewing them.
    Downloads: 0 This Week
  • 25
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week