Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
The Original Buy Center Software.
Never Go To The Auction Again.
VAN sources private-party vehicles from over 20 platforms and provides all necessary tools to communicate with sellers and manage opportunities. Franchise and Independent dealers can boost their buy center strategies with our advanced tools and an experienced Acquisition Coaching™ team dedicated to your success.
PCP (Pattern Classification Program) is an open-source machine learning program for supervised classification of patterns. PCP is a binary executable running on Linux and Windows (under Cygwin environment).
treecat is a simple, interactive tool that helps you classify files or database entries into an arbitrary category tree. Items are automatically classified within a taxonomy quickly and accurately, requiring single key confirmation or correction.
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime
CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.
Italian labeled digits corpus, good for speech recognition.
Corpus di cifre italiane segmentato, adatto a esperimenti di riconoscimento vocale e riconoscimento fonetico.
SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.
PARASOL is a C library for implementing computer-system simulations.
PARASOL is a C library for implementing simulations of computer systems. It supports a simulated, user-configured, multiprocessor environment on which user-written software (in C or C++) is executed.
DAT Freight and Analytics operates DAT One truckload freight marketplace
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
clusterviz allows to cluster three-dimensional data. The clustering process is visualized using OpenGL. As clustering algorithms the family of k-means algorithms is implemented, including mixture models.
Banyan is an enterprise class, web based, information management system that helps people work collaboratively to solve complex problems. Banyan increases project performance and reduces risk through the power of collaborative clear thinking.
JMiner is a (not yet!) complete data mining and artificial intelligence solution written in Java. Support for neural networks, genetic algorithms/programming, decision trees, clustering, market basket analysis, link analysis, data cleansing, and others.
Mcube is designed to solve 4x4x4 Rubiks Cubes (AKA, Rubiks Revenge, AKA, Master Cube). It is multiplatform, and has both console and graphical support.
PROJECT IS BEING DEVELOPED UNDER A DIFFERENT NAME: Please visit http://sourceforge.net/projects/modelbuilder/ Datamodeler gives scientists and engineers the possiblity to find any multivariate model for any dataset, how complex or large both may be.
MultiBoost is a C++ implementation of the multi-class AdaBoost algorithm. AdaBoost is a powerful meta-learning algorithm commonly used in machine learning. The code is well documented and easy to extend, especially for adding new weak learners.
DotPlot is an Eclipse plug-in to graphically compare word sequences of any type of text. Matches will be plotted as dots on a graph. Similarities in thousands of lines of text or code will result in typical textures and diagonals in the plot.
Simple Application which allows various calculations to be done on Powerball (lotto) drawing data. Capable of generating probable sets of #'s using a combination of several algorithms and variables. Auto updates recent drawings from over the internet.
PyVisi is the Python visualisation interface: a generic Python-based interface to many different scientific visualisation packages. PyVisi is intended for use in both standard and high-performance computing environments, and with the escript package.
Thetis is a Java (OS-independent) application written to allow the linguistic and statistical analysis of the Homeric and Hesiodic Epic. Current achievement is the creation of a complete (and free) Thesaurus of the Homeric and Hesiodic poems.