Showing 6 open source projects for "html source extractor"

  • 1
    ArchiveBox

    Open source self-hosted web archiving

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data....
    Downloads: 13 This Week
  • 2
    DeepSearch5Plus

    Recursively search all files, text inside files, and bookmarks

    A search tool for files, text inside files, and HTML/JSON bookmarks. It can merge duplicate bookmark links from multiple HTML/JSON files and export the result to HTML. Every part of the user interface (scrolling lists, input fields, the menu bar, checkboxes, drop-down menus, and so on) is designed to be operated from the keyboard. It can print the search results to PDF or a system printer, or save them as a text file. It can rename a single file, or change upper/lower case and join spaces in bulk mode with multiple...
    Downloads: 0 This Week
  • 3
    pandas-datareader

    Extract data from a wide range of Internet sources

    Up-to-date remote data access for pandas. Works with multiple versions of pandas. Install using pip, then import and use one of the data readers. The project's example reads 5 years of 10-year constant maturity yields on U.S. government bonds. Stable documentation is available on github.io, and a second copy is hosted on Read the Docs.
    Downloads: 2 This Week
  • 4
    pyShowRename

    Sensible batch renaming of downloaded TV files

    pyShowRename is a semi-automated batch file renamer that allows controlled selection and maintenance/sanitisation of downloaded TV programs. The project began as a way for the author to get to grips with Python, but as of version 1.0 it is stable enough for regular use and has proven to be a very useful tool. pyShowRename interfaces directly with the free epguides.com website using HTML scraping and retains a local cache of available TV shows. This can be manually...
    Downloads: 0 This Week
  • 5
    XSDB XML is to DATA as HTML is to DOCUMENT. Publish and combine data as easily as the HTML format and web browsers let you publish and view documents. Implementations in Python, JavaScript, Java, and C#/.NET.
    Downloads: 0 This Week
  • 6
    A simple tool, written in Python, for backing up LiveJournal entries. Given a username and date range, it downloads all entries in the range and saves them as HTML files on the user's hard drive.
    Downloads: 0 This Week