Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
HOA Software
Smarter Community Management Starts Here
Simplify HOA management with software that handles everything from financials to communication.
The Unstructured Information Management Architecture (UIMA) Framework is an open, scalable and extensible platform for building analytic solutions that process unstructured information to find latent meaning, relationships and relevant facts.
Market Advisor is a project which offers a tool to achieve a better performance for your investments on stock quote market. It's focused on Italian Stock Market but it can be also easily extended to other markets.
MultiJADS is a domain independent multiagent active design documents shell. It uses multiagent technology to support activities in concurrent and distributed design systems and is based on the Active Design Documents (ADD) approach.
The Genomic Diversity and Phenotype Data Model (GDPDM) captures molecular and phenotypic diversity data. MySQL databases are used to implement the schema. This project develops software tools (written in Java, Perl, etc.) associated with this model.
PHP Growth Charts provides a simple PHP interface to generate graphical (PNG-format) pediatric growth charts, based on CDC data, and plot patient progress.
FRIDA (flexible rapid interactive data analysis) is a generic program for manipulating, fitting, and plotting x,z,y data. It is primarily aimed at spectral analysis, especially in neutron scattering. Note: project has moved away from sf.net.
The Microdata Management Toolkit is a collection of tools for documenting, disseminating and preserving survey and census microdata The project is sponsored by the International Household Survey Network with financial support from the World Bank.
The NS-Mapper ad-hoc scenario editor is improved and extended by adding more realistic strategies, such as random based node placement, movement and traffic to the ad-hoc simulation of the Network Simulator 2 (NS-2).
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Poor Man's HPC is a framework that allows distributing and running code on a server farm. pmHPC is a scaled down and simplified version of distributed computing projects such as SETI, so is a perfect fit for enthusiasts and universities.
The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.
Graphist uses PHP's GD library to produce data plots, in real time, served up as standard images for consumption by web pages (though such images could be saved for use in other document types).
the pIG bulk-processes data files generated by 'popular' scientific behavioural programs, including Med Associates MedIV-PC, Coulburn TruScan and Kinder Scientific.
Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
ISIADIS: development environment for hierarchical software systems. ISIADIS allows prototyping, development and running software systems that are based on hierarchical (tree-like) architecture.
LIK (Locate Internet Knowledge) is dedicated to find information on the internet and locate this information (web server) and the path (router) between it and you (your personal computer) on a map.
OpenSource Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine to machine transactions can use this lingua franca to transact on the users behalf.
The Citizen Privacy Service is an asynchronous component using artificial intelligence capabilities including DL decidability and first order logic provenance that provide policy decision and policy enforcement points based on the US Privacy Act of 1974.
This project consists in a simulation software of robot A.I. It is aimed at comparing the efficiency of robot intelligence against movement tasks between fixed checkpoints in a logical world.
FORce based Cluster Editing (FORCE) is a Java software heuristically solving the graph cluster editing problem on weighted edges using BLAST E-values. It further provides a training mode for heuristic parameter estimation.
A user-friendly open-source toolkit written in Java that lets you visualize and analyze the behaviour of users in the ActiveWorlds family of 3D virtual worlds by mapping them over 2D space.
geolocate is a front-end java program that works with google maps to provide dynamic maps to users. Combined with the flexibility of XML and the power of javascript, users can see various relationships on their map to draw conclusions.
JVnSegmenter is a Java-based and open-source Vietnamese word segmentation tool. The segmentation model was trained on about 8,000 sentences using Conditional Random Fields (FlexCRFs). This tool would be useful for Vietnamese NLP community.
K-automaton is a new parsing (syntactic analysis) machine isomorphous to language. Implemented in Java. Can generate Java code from grammars described in EBNF.