Showing 19 open source projects for "web scraper extractor"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    newpipeextractor

    newpipeextractor

    Library for extracting streaming site data without official APIs

    ...It handles many low-level tasks involved in web data extraction, including parsing responses, managing platform-specific logic, and handling errors, allowing developers to focus on implementing application features rather than scraping mechanics. Each supported service is implemented through its own extractor components that conform to a common interface, enabling consistent access to data across different platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby Database - Written in Java Cross Platform See also Free Email Sender in this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Thymeleaf Property Extractor

    Thymeleaf Property Extractor

    A Java-written extractor for all your Thymeleaf localization strings

    Thymeleaf Property Extractor (ThExtractor in short) is a java-written app that given a folder or a html file, extracts every Thymeleaf localisation tags (for example: th:text="#{myString}") to a properties file, to ease and fasten the development of your Spring web app. It's really easy to use! Just run the jar, and browse for your HTML (or a folder containing them) in the first field, and browse for a target .properties file (or create one!)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Contacts Email Extractor
    This email harvester and bulk mailer is written in java and is totally free. Must have java installed to Click & Run the .jar file. Get email addresses in batches. Mac OS use Contacts Extractor.jar with the JavaMail API in a \lib folder next to it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Pattern Extractor

    a java based pattern extractor

    Pattern Extractor helps you to extract tile image from any kind of images. These tiles can be used for creating pattern in web pages or in images. You can save the tiles and export the pattern in your resolution. Tile selection is made simple for you as you can see the pattern it creates in real time. CHECK OUT AND SUPPORT MY LATEST ANDROID PROJECTS IN HERE: https://play.google.com/store/apps/developer?
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    htmlpicker

    Picks up text from a web page using a html template.

    A java html picker - text extractor Picks up text from a web page using a html template. Useful if you have regularly data to extract from the same site. You may use the same url or you may build urls having parameters. These parameters are fetch from a text file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    a web data extractor based on partial tree alignment
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A tool that can extract data from the Calibre SQLite database, including a command line tool that generates OPDS catalogs. In everyone's words, it takes the metadata out of Calibre, and generates catalogs for Stanza, Aldiko and web browsers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    RSS EXTRACTOR is a java library for generating RSS newsfeeds considering the RSS web feeds from multiple websites. It extracts the best of newsfeed entries and a produces a RSS file which is a fusion of newsfeed entries from several websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Easy program that can 1) extract all your album art from iTunes and 2) create web displays of your music library.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    JSP tag library, Java Class Library, and Dot Netand Mono(C#, VB.Net, any Dot Net Language) DLL to display stock quotes, stock charts and other stock data. (Examples for VB.Net, C#, Java, JSP Tags are included)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Java program to extract postings and comments from http://www.livejournal.com (blog) into DB and view/classify/process it. LJ loader. Components to reuse: perl-like, but efficient Web pages scraper, trees analyzer, concurrent scheduler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hirudo is a Java Swing application for downloading web content. It functions as a screen-scraper, filename generator and download manager. All this and much more in an intuitive cross-platform user interface. Hirudo requires Java 1.4.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    This project aims at developing a free open-source Scraper and Macro Interface for Web programmers, that needs no installation and will work in most of the operating systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Vertical Web Extractor is a project to extract the data of products or something else in congeneric websites. Developed with java ,using SWT/JFace/RCP technology.Suitable for windows and linux.It's somehow like a vertical search engine plus data extract
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    danaides

    danaides

    Meta data extractor based on ontologies

    Danaides stores meta-data from biological banks in a graph databases as well as OBO formatted ontologies. It also provides a web query interface to parse ontologies or extract nodes based on ontology requirements. Example: Get all known elements that are in family of Fish (species ontology) AND a RNA (sequence ontology).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB