Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM parser for real-world HTML.
This project is a java web spider (web crawler) with the ability to download (and resume) files. It is also highly customizable with regular expressions and download templates. All backend functionalities are also available in a separate library.
Reporting engine library written in C. Create one XML file and generate PDF, HTML, TXT, and CSV reports based on queries. Has support for MySQL, PostgreSQL, ODBC. Bindings for PHP, Java, Python.
Very Quick Wiki is a Java Server Pages based WikiWikiWeb clone. It's goal is to be as lightweight but as functional as possible. It is deployed by dropping a single war file into any J2EE web container without any extra installation processes.
OpenVPN ALS is a web-based SSL VPN server written in Java. It has a browser-based AJAX UI which allows easy access to intranet services. OpenVPN ALS is a direct descendant of Adito, which was a fork of SSL-Explorer.
JUploader is a program to uploading files to image/file hosting servers without necessary to visit their websites. Supported servers for now: ImageShack, VPX, eFotek, BankFotek, Tapetosik, YourFileLink, ZippyShare, SpeedyShare, Plikojad.
Peter's Updater is a program to synchronize local directories automatically with FTP-Directories or other local directories. Peter's Updater supports proxies, gateways, filters and binary-/text-mode depending on the file extension.
Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.
Banks, lending institutions
Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
You want to render an Excel file inside web browsers and allow users to edit information in cells... Workbook Tag has been designed to simplify this scenario. MSOffice is not required ! Ajax interact with server (Excel, Editor, Tag, HTML Converter, JSP)
Tammi is a JMX-based Java application development framework and run-time container providing a service architecture for J2EE server side Internet applications that are accessible from any device that supports HTTP including mobile (wireless) handsets.
Proxy for debugging web apps. All info from request and response with decompression. Allows to set latency, bandwidth or filtering of specific pages. To evaluate high latency,AJAX debugging,page load time,time schema or to verify submitted data.
mvnForum is a open source, powerful, easy to use/setup bulletin board (forum) built on the J2EE technology(Jsp/Servlet) and compatible with any Servlet Container which supports Jsp 1.2 and Servlet 2.3.
The official home page is www.mvnForum.com
The D2RQ API provides access to relational database content from within the Jena and Sesame RDF frameworks. D2R Server is a Semantic Web server and SPARQL endpoint based on D2RQ. Both use a declarative language for mapping database schemata to RDF.
Insight, the analysis TechTool for developers is a platform independent tool for viewing & analying run time meta data of applications. It aids in trouble shooting problems, and aids the developer in every day work through very advanced log analysis.
MaxMind GeoIP is a set of APIs for looking up the location of an IP address, including the country, region, city, latitude, and longitude. Free GeoLite databases are available at http://www.maxmind.com/app/geolitecity
Toolkit e-formulieren is een opensource toolkit voor het op een gebruikersvriendelijke manier kunnen maken en onderhouden van e-formulieren.
De Toolkit maakt gebruik van Orbeon, en ondersteunt XForms-compliant e-formulieren, evt. met voorinvulling.
jWic is a java-based development framework for developing dynamic web applications with the convenience and familiarity of 'rich client' style programming.
PLEASE NOTE: We moved with the sourcecode to GitHub: https://github.com/xwic/jWic
You will find the latest version there. The one here on SF is quite outdated....
Jix does not work any more. It was a client for FotoBilder. You used to be able to use it to browse pictures on your computer an upload them to the LiveJournal ScrapBook.
It stopped working because the image hosting on LiveJournal changed.
The gateway is an open source JavaEE application developed by the Vermont Dept of Taxes. It provides a web services framework for accepting Streamlined Sales Tax registrations and returns. It also includes a web interface for submitting transmissions.
Open-jACOB is a rapid application development tool base on Eclipse that enables the development of RichInternetApplications that are based entirely on Java. Open-jACOB application can be deployed as a stand alone app's in a Web container like Tomcat.
iFTPd, Independent FTP Daemon, is an FTP server written in Java. iFTPd features platform independence, system independence, ruggedness and a virtual file system.
A PHP library/framework for the development of websites. The main features are: database independence, template-driven content, theme-able content generation, integrated WML generation, user content management, Lucene server integration.