Showing 38 open source projects for "html source extractor"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    KOReader

    KOReader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.

    KOReader is a document viewer for E Ink devices. Supported fileformats include EPUB, PDF, DjVu, XPS, CBT, CBZ, FB2, PDB, TXT, HTML, RTF, CHM, DOC, MOBI and ZIP files. It’s available for Kindle, Kobo, PocketBook, Android and desktop Linux. Runs on embedded devices (Cervantes, Kindle, Kobo, PocketBook, reMarkable), Android and Linux computers. Developers can run a KOReader emulator in Linux and MacOS. Multi-lingual user interface with a highly customizable reader view and many typesetting...
    Downloads: 143 This Week
    Last Update:
    See Project
  • 2
    Linux command

    Linux command

    Linux command encyclopedia search tool

    Linux command encyclopedia search tool, the content includes Linux command manual, detailed explanation, study, and collection. The current warehouse has collected more than 570 Linux commands. It is a non-profit warehouse. It has generated a web site for easy use. Currently, the site does not have any advertisements. The content includes Linux command manuals, detailed explanations, and learning. Very worthy collection of Linux command quick reference manual. The copyright belongs to the...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Pro Git

    Pro Git

    Pro Git 2nd Edition

    Welcome to the second edition of the Pro Git book. The entire Pro Git book, written by Scott Chacon and Ben Straub and published by Apress, is available here. All content is licensed under the Creative Commons Attribution Non Commercial Share Alike 3.0 license. Print versions of the book are available on Amazon.com. What is “version control”, and why should you care? Version control is a system that records changes to a file or set of files over time so that you can recall specific versions...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    VideLibri

    VideLibri

    Client for public libraries

    VideLibri lists the books you have borrowed from a public library and lets you search the library catalog from your local device. It has all the usual features of a library OPAC, i.e. viewing, renewing, searching and ordering books. It also renews all due books automatically and can do other things only a locally running app can do, like keeping a history of all ever borrowed books across multiple libraries. Developed in 2006, it is the world's first library app, and by supporting 20...
    Leader badge
    Downloads: 2,003 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    Kiwix

    Kiwix

    Wikipedia offline & more

    Kiwix is an offline reader for Web content. It's especially intended to make Wikipedia available offline. With Kiwix, you can enjoy Wikipedia on a boat, in the middle of nowhere... or in Jail. Kiwix manages to do that by reading ZIM files, a highly compressed open format with additional meta-data.
    Leader badge
    Downloads: 292 This Week
    Last Update:
    See Project
  • 6
    WIKINDX

    WIKINDX

    Virtual Research Environment / On-line Bibliography Manager

    Reference management, bibliography management, citations and a whole lot more. Designed by academics for academics, under continuous development since 2003, and used by both individuals and major research institutions worldwide, WIKINDX is a Virtual Research Environment (an enhanced on-line bibliography manager) storing searchable references, notes, files, citations, ideas, and more. An integrated WYSIWYG word processor exports formatted articles to RTF and HTML. Plugins include a...
    Leader badge
    Downloads: 37 This Week
    Last Update:
    See Project
  • 7
    elibsrv

    elibsrv

    a light OPDS/HTML server indexing EPUB and PDF files

    elibsrv is a light, standalone OPDS server for Linux. It allows to generate an OPDS repository of EPUB and/or PDF files scanned from on-disk directories. It also provides a simple html interface for non-OPDS humans, which makes it a good fit for both OPDS-aware devices (like Android with FBReader or Aldiko) and browsers with EPUB/PDF capabilities (for ex. Firefox with the excellent EPUBReader plugin). It's worth noting that elibsrv is a complete solution - ie. it doesn't rely on third...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    Awesome Portfolio Websites

    Awesome Portfolio Websites

    Personal portfolio for researchers, developers, and analysts

    A community-maintained open-source project aimed at making a personal portfolio for researchers, developers and analysts. Simple, fast and less cumbersome. We make sure you have a full-fledged website to showcase your work while you can spend time on your learning and innovative endeavors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    eXeAutomaticResources

    Generación automática de recursos para eXeLearning

    Scripts en bash para la generación automática del código para insertar recursos en eXeLearning 2.6 (Snap). Y para crear de forma automática los créditos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Epub.js

    Epub.js

    Enhanced eBooks in the browser

    Epub.js is an open-source Javascript library that allows any web page to render Epub documents on any device with a modern browser. Epub.js contains a flexible rendering engine and provides a simple interface for common ebook functions such as styling, persistence and pagination. We release and maintain Epub.js on GitHub, with a growing developer community. Just include a single javascript file, and pointing it at a hosted epub. Epub.js provides an interface for common ebook functions (such...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13

    Edition Visualization Technology

    Digital edition visualization software

    EVT is a tool that creates web-based diplomatic and/or interpretative editions starting from XML TEI-encoded documents. The current release, EVT v. 1.3, builds upon the feature set of the previous releases and adds support for quite a number of new features: * added a new navigation bar; * added support for translations in multiple languages; * added support VisColl style sheets; * added support for comment and critical <note>s; * added support for verse numbering and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    Computer Glossary

    Computer Glossary

    is a rich dictionary containing multiple computer-related terms

    Computer Glossary is a rich dictionary containing multiple computer-related terms, which is useful for both students and professionals. The program integrates an offline database and offers references and descriptions for each term. Computer Glossary can be installed in just a few simple steps and does not require special skills. The program comes with a simple interface and lets you easily search any term. The results will let you see the word's definition, references to other related...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Cool Reader

    Cool Reader

    A cross-platform XML/CSS based eBook reader

    CoolReader is fast and small cross-platform XML/CSS based eBook reader for desktops and handheld devices. Supported formats: FB2, TXT, RTF, DOC, TCR, HTML, EPUB, CHM, PDB, MOBI. Platforms: Win32, Linux, Android. Ported on some eInk based devices.
    Leader badge
    Downloads: 391 This Week
    Last Update:
    See Project
  • 16

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Wiki Quick Search

    Wiki Quick Search

    Search Wikipedia in every language in this WebApp

    Wiki Quick Search & Print is software was made with the API provided by Wikipedia and allows you to enable fast searches with autocomplete input inside the famous free encyclopedia. You can print the results of your search in PDF or save it in your computer to html including tables, text, images and file audio. Finally, you can avail of a convenient input for advanced search strings contained in the result page. ========= Windows 7 and later are supported. Both x86 and amd64 (x64) binaries...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    QuickSlide

    A tool to generate HTML-based slide shows.

    This tool allow the generation of slide shows in HTML format (to run simply in the browser). It can not replace Microsoft Power Point, but, it allows you to create a slide show by writing Java code. The generated slide show can be run easily in the browser so it is platform independent and does not need any additional software installation, even on new installed OS or Smart phones. In this first pre-alpha version, the generated code supports simple text and basic lists only, but, I hope to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    QuickWeb

    This tool can be used to create quickly the skeleton of a web site.

    QuickWeb is a tool to quickly create a skeleton of a web site. The tool developed in Java and to create a web site, there's no need to manipulate HTML in the beginning. All you have to do is to write a simple Java code that will generate a web site of several pages. Created to help me with documenting different Sourceforge project, QuickWeb does not generate, for now, any CSS or JS code, but, it can generate the skeleton and the static content of a website composed of several web pages and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex...
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 21
    xowa

    xowa

    A free, open-source, offline Wikipedia application

    XOWA is a desktop application for reading and editing Wikipedia offline (XOWA has moved to http://gnosygnu.github.io/xowa/download.html)
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    Newspaper3k

    Newspaper3k

    News, full-text, and article metadata extraction in Python 3

    Inspired by requests for its simplicity and powered by lxml for its speed. Newspaper is an amazing python library for extracting & curating articles. Newspaper delivers Instapaper style article extraction. Newspaper is a Python3 library! If you are certain that an entire news source is in one language, go ahead and use the same api. Works in 10+ languages, English, Chinese, German, Arabic, and more! On python3 you must install newspaper3k, not newspaper. newspaper is our python2 library....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Klen-library
    Open Source program for creating electronic books and electronic manuals. Contains professional tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    مكتبة القرطاس

    مكتبة القرطاس

    مكتبة الكترونية بها كل ما تحتاج

    مكتبة القرطاس برنامج مجاني مفتوح تعمل على جميع أنظمة التشغيل لينكس او وندوز أو ماك تتيح لك مطالعة الكتب و تحريرها كما يمكنك إضافة كتب جديدة
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Saaghar
    “Saaghar” (ساغر) is a Persian poetry software written by C++ under Qt framework, it uses "ganjoor" database as its database. It has tab feature in both its “Viewer” and its “Search” page that cause it be suitable for research goals.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB