With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now
Cloud tools for web scraping and data extraction
Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Switchboard is a conceptual-level interface to many web and network related functions (SOAP, REST, XML parsing, screen-scraping, FTP, network sniffing), designed for the Processing environment.
jGetFile is a command-line scriptable recursive file downloader for the web. Where other downloaders fail, jGetFile succeeds in downloading the files you want with simplicity and ease of use.
list2db reads digested email files generated by the mailman mailing list software and converts them into SQL for a relational database. The project also includes a PHP frontend for users to search and browse archived list emails.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
VDC has been superseded by DVN: https://sourceforge.net/projects/dvn/ ---- The Virtual Data Center project is building an operational, open-source, digital library to enable the sharing of quantitative research data, and the development of distribute
Cross-platform searchable CD-ROM. Vicaya is a search engine and indexing tool for use on a local file system or CDROM, written in Java and based on Apache Nutch, and Tomcat. The goal is to replicate a website on a CD-ROM to be used on any platform.
Analysis and interactive visualization of a web-based community. Supports different focuses on the given social network to present community groups to the user. Also specific information of each member is provided.
Krakatoa is search engine for your desktop with simple and advanced search capabilities. It will search on any key word, exact phrases or files. Search within a domain or site. Fast search engine switching for better results.
Pocomos is a cloud-based field service solution that caters to businesses
Built for the pest control industry, but also works great for Mosquito Control, Bin Cleaning, Window Washing, Solar Panel Cleaning, and other Home Service Businesses in need of an easy-to-use software that helps you simplify routing, scheduling, communications, payment processing, truck tracking, time tracking, and reporting.
Hoople is like attribute-oriented programming for URLs. Rather than having the configuration information and “URL Logic” spread throughout the site, you create a single XML configuration file that contains all the “logic” for each URL on the file
QuickWCM is a Web Content Manager (WCM) with a very easy to use web-based interface, seamless security model, integrated search engine and more. QuickWCM runs on JSR-170 repository and is easy to extend with JSR-168 portlets.
This project uses a combination of JSP tag, Factory pattern classes and XML to display directory structures. The directory will be specified in the JSP tag, then it will call the package to generate a XML document that describes the directory structure.
NICE is a high speed opensource ftp search engine written 100% in Java and no database required, running on any web container such as Tomcat. it uses Struts,Lucene,Quartz and provides a dynamic AJAX based Web interface and control panel.
This project is about building a small search engine. This is done using the TREC-6 document collection as a basis to provide solid and reliable evaluation.
Java program to extract postings and comments from http://www.livejournal.com (blog) into DB and view/classify/process it. LJ loader. Components to reuse: perl-like, but efficient Web pages scraper, trees analyzer, concurrent scheduler.
A web app for creating a repository of pictures (our focus is birds). Users submit pictures, with a wizard that generates RDF descriptiors. Sumissions are forwarded to Admins for aproval. Instances will export the RDF so that repositories may cooperate.
phpByteBazar is a web based, operating system independent file management and exchange application with multiple user support and comprehensive indexing and searching capabilities.
J-Obey is a Java Library/package, which allows people writing their own crawlers to have a stable Robots.txt parser, if you are writing a web crawler of some sort you can use J-Obey to take out the hassle of writing a Robots.txt parser/intrepreter.
The goal of the project is to guide developers in designing Web applications which uses various Opensource frameworks such as spring and hibernate etc to build a scaleable, efficient and reliable Web application.
SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.