Showing 17 open source projects for "html source extractor"

  • 1
    HyperSQL is like Doxygen plus Javadoc for SQL, hypermapping SQL views, packages, procedures, and functions to HTML source code listings and showing all code locations where these are used.
    Downloads: 1 This Week
  • 2
    ZetaBoards topic fetcher
    Fetches topics with new posts from ZetaBoards forums and does something with the URLs, like opening them in a browser. Configurations can be stored and manipulated for quicker fetching. Development, translations, bug reports, etc. are handled at Launchpad: https://launchpad.net/zb-fetcher SourceForge is used to host released files.
    Downloads: 0 This Week
  • 3
    MedusWiki is a Python wiki engine intended to be used as a personal knowledge management system. It uses Topic Maps (XTM) to store metadata, so meaningful associations can be created between wiki pages. Zope Page Templates (ZPT) are used to produce HTML.
    Downloads: 0 This Week
  • 4
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
  • 5
    An HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi-structured data generated by the scraper component.
    Downloads: 0 This Week
  • 6
    A small collection of Python 3000 scripts/modules used to automate searching craigslist.org cities and categories for interesting stuff. These scripts currently use HTML screen scraping, since Craigslist currently has no API.
    Downloads: 0 This Week
  • 7
    Starting from various input files, we parse to find a list of addresses, then display them on a map.
    Downloads: 0 This Week
  • 8
    Wixi is a multi-platform wiki application for the desktop. It is written in Python/wxPython and does anything you might expect from a wiki application. Generating HTML from text is done by txt2tags, which is simple and easy to learn.
    Downloads: 0 This Week
  • 9
    This project aims to provide an offline version of Wikipedia, available from the web browser.
    Downloads: 0 This Week
  • 10
    zSearch is a simple Python-based crawler and search engine. Raw HTML pages are stored in bzip2 archives, the index is created using PyLucene, and Twisted is used to provide an internal HTTP server. Results are sent back as XML over HTTP.
    Downloads: 0 This Week
  • 11
    Bookmark-Manager is an advanced bookmark management utility for Windows supporting importing/exporting and merging of Internet Explorer favorites, Opera hotlists, Mozilla, Netscape, and Firefox bookmarks, XBEL, and HTML lists.
    Downloads: 1 This Week
  • 12
    System to assist with the management of a local Buddhist study library. Will download and install chosen documentation (sutras, etc.), index them, and provide a web interface. Python, HTML. Needs a web server and Namazu. Can be used for other documents, too.
    Downloads: 1 This Week
  • 13
    CVS-Summary is a program that generates an HTML summary of CVS activity, very similar to that provided by the popular ViewCVS script. CVS-Summary differs from ViewCVS in that it dumps the summary information to static HTML files which can be served by n
    Downloads: 0 This Week
  • 14
    Simple tool, written in Python, to back up LiveJournal entries. Given a username and date range, it downloads all entries in the range and places them in HTML files on the user's hard drive.
    Downloads: 0 This Week
  • 15
    MAD is an acronym for 'Monitor, Analyse and Delivery'. The project's goal is to create scripts that periodically check forums of interest for new messages and extract them into a portable text format without HTML junk, annoying advertisements, etc.
    Downloads: 0 This Week
  • 16
    A powerful, themeable image gallery generator for static HTML pages.
    Downloads: 1 This Week
  • 17
    This is a collection of REST specifications, and implementations of those specs, for very low-level information sharing and workflow operations using REST actions over HTTP. Implementations are in various languages, mainly Java, Python, and Ruby.
    Downloads: 0 This Week