Fast customizable time series web database for big data like log files
Our goal is to create the world's fastest extendable, non-transactional time series database for big data (you know, for kids)!
Log file indexing is our initial focus: for example, append-only ASCII files produced by libraries like Log4J, or files containing FIX messages or JSON objects.
Occursions was built by a small team sick of creating hacks to remotely copy and/or grep through tons of large log files. We use it to index around a terabyte of new log data per day. You can use it too. Who...
visualFSA is a small tool that lets you construct NFAs and DFAs (nondeterministic and deterministic finite automata). It offers some common algorithms that can be applied to those automata, such as NFA-to-DFA conversion, the word problem, and the accepted language.
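As an illustration of the NFA-to-DFA conversion mentioned above, here is a minimal sketch of the textbook subset construction. It is not taken from visualFSA; the function names `nfa_to_dfa` and `accepts` and the transition-table layout are assumptions made for this example, and epsilon transitions are left out for brevity.

```python
from collections import deque


def nfa_to_dfa(alphabet, delta, start, accepting):
    """Subset construction for an NFA without epsilon moves.

    delta maps (state, symbol) -> set of next states (hypothetical layout).
    Returns (dfa_start, dfa_delta, dfa_accepting), where each DFA state is a
    frozenset of NFA states.
    """
    start_set = frozenset([start])
    dfa_delta = {}
    seen = {start_set}
    queue = deque([start_set])
    while queue:
        current = queue.popleft()
        for sym in alphabet:
            # Union of all NFA moves from any state in the current subset.
            target = frozenset(
                s for q in current for s in delta.get((q, sym), ())
            )
            dfa_delta[(current, sym)] = target
            if target not in seen:
                seen.add(target)
                queue.append(target)
    # A subset is accepting if it contains at least one accepting NFA state.
    dfa_accepting = {s for s in seen if s & set(accepting)}
    return start_set, dfa_delta, dfa_accepting


def accepts(dfa, word):
    """Word problem: run the DFA on word and report whether it accepts."""
    state, delta, accepting = dfa[0], dfa[1], dfa[2]
    for sym in word:
        state = delta[(state, sym)]
    return state in accepting


# Example NFA over {a, b} that accepts words ending in "ab".
nfa_delta = {(0, "a"): {0, 1}, (0, "b"): {0}, (1, "b"): {2}}
dfa = nfa_to_dfa("ab", nfa_delta, start=0, accepting={2})
```

With this example automaton, `accepts(dfa, "aab")` runs the word problem on the converted DFA.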
Genvlin is an easy-to-use graphical data plotting environment. It also provides direct access to the data via scripting and a console. You can easily import, export, change, plot, and otherwise work with all the data.
The project's purpose is to develop an application to store and analyze workout data, such as the duration of a training session and the average pulse during the session. The focus will be on endurance training.
StatCvs-XML provides statistics about CVS usage based on the CVS log. It extends the StatCvs project with fancy 3D charts, advanced report generation and customization features. It also features a plugin for seamless integration with Apache Maven.
TotalDepth is an open-source, cross-platform software collection that can process petrophysical data from the oil field, such as wireline logs and seismic data.
Conventional proprietary software for petrophysical data tends to be expensive to licence, restrictive, slow to develop for and tied to expensive hardware. TotalDepth changes all of that.
A semantic editor for describing public administration services. It produces ontologies containing a workflow description in OWL notation. It can be used for modelling processes in different fields, but was originally focused on semantically describing government services with their corresponding documents, fees and other information items.
Launch the editor with Java Web Start at http://ri.tdf.lv
Burrow-owl is a software package for visualizing multidimensional nuclear magnetic resonance (NMR) spectra, with an emphasis on spectra used in macromolecular structure determination.
Thinkedit is an open-source, web-based content and data management system written in PHP, with a strong emphasis on SQL databases, specifically MySQL. It is highly extensible through simple configuration files. Not actively maintained.
OpenDinTest is an OpenOffice Calc sheet for statistical analysis of measurement data according to the German industry standard DIN 32645.
MESH is a tool that measures distortion between two discrete surfaces (triangular meshes), using the Hausdorff distance to compute the maximum, mean and root-mean-square errors between the two surfaces. It also displays the error values on the surface.
Matrex is a lightweight vectorial spreadsheet: it calculates blocks of values, not single cells; it is strongly multithreaded; and users can work together on sheets using a server. Adapters are available for MATLAB, Scilab, Octave and R.
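To make the three error measures concrete, here is a simplified sketch that works on sampled points rather than on triangles. This is an assumption made for brevity: MESH measures point-to-surface distances, while the hypothetical helpers below (`one_sided_errors`, `error_stats`, `symmetric_hausdorff`) only compare point sets, so the numbers approximate the mesh-based values when the sampling is dense.

```python
import math


def one_sided_errors(a, b):
    """For each point in a, the distance to its nearest point in b."""
    return [min(math.dist(p, q) for q in b) for p in a]


def error_stats(a, b):
    """Maximum, mean and root-mean-square of the one-sided errors a -> b."""
    d = one_sided_errors(a, b)
    return {
        "max": max(d),
        "mean": sum(d) / len(d),
        "rms": math.sqrt(sum(x * x for x in d) / len(d)),
    }


def symmetric_hausdorff(a, b):
    """Symmetric Hausdorff distance: the larger of the two one-sided maxima."""
    return max(max(one_sided_errors(a, b)), max(one_sided_errors(b, a)))


# Two tiny point sets standing in for sampled surfaces (made-up data).
surface_a = [(0.0, 0.0, 0.0)]
surface_b = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0)]
stats_ab = error_stats(surface_a, surface_b)
stats_ba = error_stats(surface_b, surface_a)
```

The one-sided measure is not symmetric: every point of `surface_a` has an exact match in `surface_b`, but not the other way round, which is why the symmetric variant takes the larger of the two directions.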
A framework for the visualization of multi-layer networks
MuLaViTo is a multi-layer visualization framework that allows derived applications to visualize relations between multiple graphs. It also provides helpful utilities such as k-shortest path algorithms and running algorithms in a separate thread.
Intrepid Bioinformatics' Integrative Genomics Viewer (IGV) modifications designed to integrate Intrepid's repository with the Broad Institute's IGV visualization tool.
GoFigure2 is an open-source, cross-platform application for visualizing, processing and analyzing multidimensional microscopy data. Users can visualize, segment and track cells through time, detect cell division and ultimately generate lineages.
MLPlot is a lightweight plotting library written in Java. It produces vector graphics in SVG format and can also save EPS, JPG, PNG and other formats if convert and Inkscape are installed. The goal is to imitate MATLAB's 2D plotting functionality.
MouseGestureComposer is a graphical partition editor for visual artists and musicians. Basic elements are mouse movements, placed on a timeline segmented in measures, similar to classical music. Uses processing.org. Developed by numediart.org. A video tutorial is available here: http://vimeo.com/33442844
Main developer: François Zajéga - http://www.frankiezafe.org
OO PHP libraries for mining data from eBay into a MySQL database
I started this project for use in a new business and decided the development time for the end result was going to be too long. This is basically an OO PHP API to retrieve data from eBay and store it in a MySQL database for analysis.
In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
OrthoInspector is a new software system for orthology/paralogy analysis. Its purpose is to facilitate the installation and the maintenance of a database describing orthology/inparalogy relations and to provide tools to exploit these data.
Semtinel is an application framework for the semi-automatic creation, maintenance and analysis of hierarchical concept schemes (thesaurus, classification or ontology). Semtinel supports the development of new analysis methods and visualization techniques.
A tool that parses SQL SELECT statements and generates a diagram. The diagram shows parts of the underlying SQL directly, for example x=30, GROUP BY (year), HAVING MIN(age) > 18. Cartesian joins and loops are easy to spot.