Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
A JBIG2 decoder written for Java under a BSD license
The JPedal JBIG2 Image Decoder is a 100% pure Java image decoder for the JBIG2 file format. The decoder takes the JBIG2 image processing technology developed for the JPedal PDF renderer in 2007 and makes it available as a generic library for more general usage.
The JDeli Image Library https://www.idrsolutions.com/jdeli contains a much enhanced and faster version and also provides a complete ImageIO replacement (see https://www.idrsolutions.com/jdeli/imageio-replacement) for Tiff, JPEG,...
A Java-based command-line tool for automatically removing old backups selectively. It can be used when you already make daily backups on disk, and want to clean them up while still keeping some of the ones made weeks, months or years ago.
A PPP client and server which is written in Java. It will support AT command modem directly by COM port, without any API (except commAPI) and JNI supported. It is pure Java PPP api. For more, please visit www.simpleteam.com.
Sequoia is a database clustering middleware offering load balancing and transparent failover. Databases are replicated over multiple nodes and Sequoia balances the queries between them. Sequoia supports online maintenance and recovery operations.
Open SOA Repository is a JEE application enabling lifecycle management and contract management of your SOA services (SOAP WebServices, BPEL...). It can interact with UDDI and ebXML to publish your services.
FOAFRealm (also called D-FOAF in version 2.0) is a distributed user profile management system based on FOAF. FOAFRealm is a set of tools that enables to manipulate FOAF (Friend-of-a-Friend) information within J2EE application and provides Realm implement
Enable your academic documents on your hard-drive to be searched using an automated solution with limited user-intervention. All this is done in a non-intrusive manner, ensuring your files are not moved.
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
The Genalyze log analyzer, a simple yet highly extensible framework for parsing log files, was designed for parsing T-Server log files generated by Genesys contact center software platform, yet is generic and easy enough to adapt for other log formats.
This eclipse plugin allows for the export of an eclipse project's runtime configurations to shell scripts in various formats. This allows for the execution of java code OUTSIDE the Eclipse environment without having to custom write the shell script code.
Jack Mail Client: jackmailclient: Mail Client in Java This is a simple mail client uses any mail server with a minimal configuration you will have a robost email client. Purely written in java.
TightVNC is an improved version of VNC, great free remote-desktop tool. The improvements include bandwidth-friendly "Tight" encoding, file transfers in the Windows version, enhanced GUI, many bugfixes, and more.
Narrows search result produced by popular Internet search engines, allowing to put extra filtering conditions, as certain words presented, certain words excluded, and so on.
Webclient is a novel client which provides web interface
to the resources provided through UNICORE 6. The Webclient takes advantage
of the UNICORE client API and technologies such as AJAX, JSP,
and JavaScript frameworks.
The CodeTime platform covers every aspect of parallel software from authoring, through distribution, to run-time. Its goals are: high programmer productivity; write once, run high performance anywhere; and wide acceptance.
zpm(c) is a Java client cooperating with database tables created by zpmd (z/VM Performance Monitor Daemon) software. It presents system usage and actual events.
open-fdc is short for "Open Framework for Distributed Computing". This framework allows developers to create "tasks" which are executed by the engine. A task is similar to a work unit type in the SETI project.
TcpMon is a Linux program written in Java that will show you detailed listings of all TCP and UDP endpoints on your system, including the local and remote addresses and state of TCP connections. Requires JAVA 6 installed to run.
Damon is a Distributed AOP Middleware on top of a p2p Overlay Network. Its main aim is to offer the necessary abstractions and services to develop distributed aspects, and to provide scalability to existent or new applications transparently.
The WAW tools provide a more automated approach to web harvesting, based on archival principles, automated process and human decision-making. The model seeks to use archival principles to preserve documents on the web.
As in Pretty Good Privacy, this is a Pretty Good Incremental Backup tool. Inspired in rsync, jnSynch provides a basic file synchronization engine written in JAVA and so executable from either Java-compatible computer.