Showing 16 open source projects for "web extractor"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    newpipeextractor

    newpipeextractor

    Library for extracting streaming site data without official APIs

    ...It handles many low-level tasks involved in web data extraction, including parsing responses, managing platform-specific logic, and handling errors, allowing developers to focus on implementing application features rather than scraping mechanics. Each supported service is implemented through its own extractor components that conform to a common interface, enabling consistent access to data across different platforms.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    java-pdf-table-extractor-lib

    java-pdf-table-extractor-lib

    Java Pdf Table extraction library

    The command line application is an example of usage of the Java library. The library is based on pdfbox library and works by looking for the layout of each selected pdf page, and looking for table structure patterns. After calling the library (passing the pdf filename, and the page range), the result is a List<PdfTextElement>. PdfTextElement is an interface that has two implementations. * A basic text (outside the tables) * And PdfTextTabulaElement, for table structures. That...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby Database - Written in Java Cross Platform See also Free Email Sender in this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 1 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Thymeleaf Property Extractor

    Thymeleaf Property Extractor

    A Java-written extractor for all your Thymeleaf localization strings

    Thymeleaf Property Extractor (ThExtractor in short) is a java-written app that given a folder or a html file, extracts every Thymeleaf localisation tags (for example: th:text="#{myString}") to a properties file, to ease and fasten the development of your Spring web app. It's really easy to use! Just run the jar, and browse for your HTML (or a folder containing them) in the first field, and browse for a target .properties file (or create one!)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Contacts Email Extractor
    This email harvester and bulk mailer is written in java and is totally free. Must have java installed to Click & Run the .jar file. Get email addresses in batches. Mac OS use Contacts Extractor.jar with the JavaMail API in a \lib folder next to it.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    Pattern Extractor

    a java based pattern extractor

    Pattern Extractor helps you to extract tile image from any kind of images. These tiles can be used for creating pattern in web pages or in images. You can save the tiles and export the pattern in your resolution. Tile selection is made simple for you as you can see the pattern it creates in real time. CHECK OUT AND SUPPORT MY LATEST ANDROID PROJECTS IN HERE: https://play.google.com/store/apps/developer?
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    htmlpicker

    Picks up text from a web page using a html template.

    A java html picker - text extractor Picks up text from a web page using a html template. Useful if you have regularly data to extract from the same site. You may use the same url or you may build urls having parameters. These parameters are fetch from a text file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    a web data extractor based on partial tree alignment
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A tool that can extract data from the Calibre SQLite database, including a command line tool that generates OPDS catalogs. In everyone's words, it takes the metadata out of Calibre, and generates catalogs for Stanza, Aldiko and web browsers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    RSS EXTRACTOR is a java library for generating RSS newsfeeds considering the RSS web feeds from multiple websites. It extracts the best of newsfeed entries and a produces a RSS file which is a fusion of newsfeed entries from several websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Easy program that can 1) extract all your album art from iTunes and 2) create web displays of your music library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Vertical Web Extractor is a project to extract the data of products or something else in congeneric websites. Developed with java ,using SWT/JFace/RCP technology.Suitable for windows and linux.It's somehow like a vertical search engine plus data extract
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    danaides

    danaides

    Meta data extractor based on ontologies

    Danaides stores meta-data from biological banks in a graph databases as well as OBO formatted ontologies. It also provides a web query interface to parse ontologies or extract nodes based on ontology requirements. Example: Get all known elements that are in family of Fish (species ontology) AND a RNA (sequence ontology).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB