Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Leverage AI to Automate Medical Coding
Medical Coding Solution
As a healthcare provider, you should be paid promptly for the services you provide to patients. Slow, inefficient, and error-prone manual coding keeps you from the financial peace you deserve. XpertDox’s autonomous coding solution accelerates the revenue cycle so you can focus on providing great healthcare.
Searches arbitrary text input (logs, etc) for regular expression(s) matching, and then executes arbitrary SQL statement(s) with matched substrings used as parameters. Offers daemon, incremental and transactional features.
A PHP based software for automatic dossier preparation of Wikipedia content. Retrieves Wikipedia content, determines its relevance to a given subject and compiles selected material into a PDF dossier.
Framework for storing and editing Forestry-related XML data, with a specialised graphical interface for each object type. Plug-in API for adding functionality. Coded in Java, uses WebStart for distribution, swing GUI, ~100k lines of code.
Thea, Tools for High-throughput Experiment Analysis, is an integrated information processing system dedicated to the annotation of data issued from classification systems with biological information coming from a knowledge base.
Secure and customizable compute service that lets you create and run virtual machines.
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
Phoenix is an information extraction engine written in java.
Controlled by rules (declared in xml), it extracts information form any XML document (unstructured XHTML/OpenOffice documents). Supports XPath, additional conditions and top-down decomposit
MicroArray Genome Imaging and Clustering Tool (MAGIC tool) is a platform-independant java program for analyzing MicroArray data (.tiff scans & .txt godlists) via graphs and clustering operations (including QT-clustering). http://www.bio.davidson.edu/magic
JSpice is a Java Wrapper (using JNI) for the CSpice library provided by NAIF.
Goal of JSpice is to provide a full portable version of CSpice wich is used to calculate geometricals parameters of spatial mission as Cassini/Huygens.
Artificial vision library. Objectives are to make an OCR, fingerprint and face identification as some applications through a general purpose learning and pattern relationships algorithm (Currently performs very basic identification).
brCluster is a class library, written in java, that implements generic clustering algorithms carefully designed to allow its aplication in any kind of data. The algorithms implemented are K-means and Hierarchical Clustering (Simple and Complete Link).
libtour is a generic tournament processing library. The rules, participants, schedule, and results of a sporting tournament can be defined in the Scheme programming language and given to the library as input.
pystats is a comprehensive Python module implementing algorithms for statistics and information theory, including probability distributions, descriptive statistics, analysis of variance, regression, and inference.
An opensource library which enables the user to take advantage of most LCD screens owning a commercial controller,
and a free software allowing the user to dispatch any kind of informations from the PC toward the LCD screen.
The Handwritten project aims to provide software for open distribution of handwritten notes. A rudimentary datastructure containg strokes made by writing devices as for example palm computers, digital pens, etc, is implemented in both Java and C.
Java Randomness Test Suite is a gui application to run randomness tests on random stream resource (file or algorithm). Application can be run in command line. Includes general statistical tests, NIST and DIEHARD tests.
CloneAnalyzer is a tool for software quality analysis. It allows you to find, display and inspect clones, which are fragments of duplicated source code resulting from lack of proper reuse. It can be used as a plugin for Eclipse and on the command line.
WebDip is a platform for the tracking of information search in a psychological decision making experiment. It enables to run parallel experiments, with different search modes and comfortabel user management.
Framework for Query Expansion for Apache Lucene. Provides implementation of Rocchio Query Expansion (QE) method. Provides implementation of pseudo feedback QE utilizing Google's web API to query the world wide web in order to acquire terms for QE.
This library is a Fortran 95 gnuplot interface for some Unix like OS'es. This provides some routines that enables direct access of a child gnuplot session from a Fortran 95 program. You will need a copy of fortranposix to make this work.
The Unit Command Climate Assessment and Survey System (UCCASS) is a PHP survey script for creating dynamic surveys. Supports unlimited questions and answers, branching logic, bar graph results with filtering, CSV export, templates, time limits and more..
phpTrafMon is a set of scripts written in php. It shows in an attractive and user-friendly way the traffic in a local network and a share in it of every user. phpTrafMon requires MySQL, crontab and a popular IPFM program.
Celerity, an open-source, Java based data-flow architecture that provides high performance with fine data granuality. Celerity offers a module based programming environment, which can parrellelize across processors, machines and architectures.
This program will calculate and output metrics on code written in Python. Reports can be generated in text- or XML-files. A plug-in system lets new metrics be added to the program. A set of built-in plug-ins with provide some of the most common metrics a
An omnifont OCR software for KDE. Due to the fact that each step of the OCR process can be visualized you can get a quick idea of how OCR works and where the problems lie. However the program may be of minor/no use for end users in its current state.