A wrapper over fcsh (Flex Compiler Shell) that exposes fcsh.exe so it can be accessed from an Ant task. You can send mxmlc commands from Ant tasks to a wrapped fcsh instance. Information about the project is at http://fcshwrapper.blogspot.com/
InfoCrawler allows you to crawl and index various types of documents, accessing data from various resources: Intranets, public WEB sites, local or remote file systems. For product information please see our website at http://www.infocrawler.org/
...Incrementally re-compiles classes automatically as source changes for rapid application development. Can run as a signed applet in Internet Explorer. Alter-Home page at http://www.jhttpd.org.
This project automates HTTP requests, i.e. all browser activities can be logged to XML-formatted files and replayed using simple methods.
This is very useful for automating HTTP server requests, e.g. queries to search engines or external databases.
...For example, you want to record the resources that a user visited. You know what to log at development time. AuditLog is just for that. It is a plug-in of LimpidLog. See http://www.acelet.com/auditlog/
A collection of libraries aimed at speeding up Java application development. Functionality includes versioning, testing, file access, graph navigation, HTML and XML processing, HTTP utilities, etc.
An eclipse (3.3) plug-in for playing and organizing music files (mp3, ogg, few others). Defines a "Jukebox" perspective. Supports playlists, rating, DnD. Requires Java 5. Update site: http://musicplugin.sourceforge.net/update
Server Framework Objects provides a robust Web Server / Proxy Server, plus an elegant plug-in architecture for adding/modifying services. There is also a collection of utility classes for use with the project, or any Java project.
This aims to be a cross-platform GUI control panel for XAMPP. The GUI provided with XAMPP works well only under Windows; this project aims to provide a unified GUI on all operating systems. Currently, KDE and GNOME are supported. Looking for help with Mac support.
...Runs on Linux and Windows and comes with a bundled installer. Reads NDS, RAR and Zip files. Can extract directly to an NDS or Zip file. The data source is provided as an XML file from http://www.advanscene.com/
This project develops an integration of the distributed revision control system Darcs (http://darcs.net) into the Eclipse IDE (http://eclipse.org). It provides a set of plugins that enable IDE users to manage the code under development in Darcs repositor
A Java library as a wrapper for the Google Search Appliance's search protocol XML API. The XML API is publicly available at: http://code.google.com/gsa_apis/xml_reference.html The homepage and tutorial for this project is at: http://gsa-japi.sf.net
Apache Wicket is a component based Java web application framework. It has found a new home at Apache: http://wicket.apache.org Please visit us there to discover new releases, find support and learn more about Apache Wicket.
NOTICE: You can now download CandyFolds through JEdit's plugin manager. CandyFolds is now part of the jedit project (http://jedit.sf.net). Go to the forums for comments or questions.
twexter formats twin twext translations to help us learn language .. demo: http://test.twext.com .. javascript code is open at http://github.com/tudisco/twexter
jfile is a Java library that can be used to detect a file's MIME type using globs and/or magic files as specified at http://standards.freedesktop.org/shared-mime-info-spec/shared-mime-info-spec-latest.html.
CSMonkey TV Remote is a desktop TV remote for Linux. It's simple, easy-to-use and customizable. You can start TV and change channels. Uses Sun Java. http://tvremote.sourceforge.net
nBB2 is the Java counterpart of phpBB 2.0.22. It is the result of an automated migration from PHP, using the nTile PtoJ product by Numiton (http://www.numiton.com).
...It relies on the "Apache POI - Java API To Access Microsoft Format Files" project. A bundle distribution of the application can be accessed from the following url: http://mspviewer.blogspot.com/
If you like this software and are a developer, check out:
https://sourceforge.net/projects/crudzillawebapplicationbuilder/
Geoclipse is a collection of Eclipse plugins that allow you to add mapping capabilities to your Eclipse Rich Client applications. Current development happens on GitHub: http://michaelkanis.github.com/geoclipse