Document summarization system. Once document content has been added to the system, a user query generates a summary document containing the information available to the system.
Dessy is a DEsktop Search and SYnchronization framework for mobile devices. It supports file search by content, metadata, and custom tags, as well as synchronization of search results and entire file system subtrees. It works on MIDP 2.0 handsets and Java 2 SE.
Jake is a console-based app written in Python and Qt4. Plugins let you do almost anything, for example search Google, translate, view images, or talk with it (as an AI bot). A skinning system also lets you choose how Jake looks.
This project is abandoned. Download these files at your own risk.
For the latest version please visit http://decafbad.net/projects/piddlepodder/
This application is a podcatching client written in Perl. It is based off of the simplicity of bashpodder with a few added features. It downloads podcasts from a list of XML feeds, and puts them into a download directory by date.
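The feed-to-directory flow described above can be sketched as follows. This is a minimal illustration in Python rather than the project's Perl, and it only plans downloads instead of fetching anything. It assumes RSS 2.0 feeds with `enclosure` elements and a download directory named after the run date; `SAMPLE_FEED`, `plan_downloads`, and the `podcasts` root are hypothetical names, not taken from the project.

```python
import xml.etree.ElementTree as ET
from datetime import date

# Hypothetical sample feed; a real client would fetch each feed URL from its list.
SAMPLE_FEED = """<rss version="2.0"><channel><title>Example show</title>
<item><title>Episode 1</title>
<enclosure url="http://example.com/audio/ep1.mp3" type="audio/mpeg" length="123"/></item>
<item><title>Episode 2</title>
<enclosure url="http://example.com/audio/ep2.mp3" type="audio/mpeg" length="456"/></item>
</channel></rss>"""


def plan_downloads(feed_xml, download_root="podcasts", run_date=None):
    """Return (url, target_path) pairs for every enclosure in one feed.

    Assumption: files are grouped in a directory named after the run date.
    """
    run_date = run_date or date.today()
    target_dir = f"{download_root}/{run_date.isoformat()}"
    plan = []
    for enclosure in ET.fromstring(feed_xml).iter("enclosure"):
        url = enclosure.get("url")
        if url:
            filename = url.rsplit("/", 1)[-1]  # last path segment of the URL
            plan.append((url, f"{target_dir}/{filename}"))
    return plan


for url, path in plan_downloads(SAMPLE_FEED, run_date=date(2006, 1, 2)):
    print(url, "->", path)
```

Separating the planning step from the actual download keeps the sketch testable without network access.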
The SSAF ("Secure Search And Forwards") is a dirt-simple standalone web app for inexpensive and secure information sharing. Any uploaded record may be forwarded to an intended destination, and may also be stashed in a searchable repository.
UProm Requirement Management System (traceability and requirement coverage). Includes: an admin module to manage SCRUM projects and the tree of requirements; a multi-user web-based client; and tools for database management (HSQLDB, PostgreSQL, ...).
iMeMex is a dataspace management system. iMeMex is a research prototype. The package also provides several useful components for research such as external sorting, B+-trees, inverted indexes, content converters, query operators, and graph indexes.
Sgrep (sorted grep) is a much faster alternative to traditional Unix grep when searching large files, because sgrep searches sorted input files using a fast binary search to find matching lines.
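The idea behind searching sorted input with a binary search can be sketched in a few lines of Python. This is an in-memory illustration only, not sgrep's implementation: it assumes the lines are already sorted in ascending order and that a match means "line starts with the key". The function name `sorted_prefix_search` is hypothetical.

```python
from bisect import bisect_left


def sorted_prefix_search(lines, prefix):
    """Return all lines that start with prefix.

    Assumes lines is sorted ascending, so matching lines form one
    contiguous run that begins at the binary-search insertion point.
    """
    start = bisect_left(lines, prefix)  # O(log n) instead of scanning every line
    matches = []
    for line in lines[start:]:
        if not line.startswith(prefix):
            break  # sorted input: no later line can match
        matches.append(line)
    return matches


words = sorted(["apple", "apricot", "banana", "band", "bandana", "cherry"])
print(sorted_prefix_search(words, "ban"))  # ['banana', 'band', 'bandana']
```

A tool working on large files would binary-search over byte offsets in the file rather than load every line into a list, but the comparison logic is the same.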
Kato is an approach to bring the work done on software agents out of academia and into the public arena. Developers can create agents as easily as they can Drupal modules.
This library implements several locality-sensitive hashing (LSH) based algorithms, including indexing data structures for high-dimensional spaces and metric spaces, sketch constructions, and set embedding algorithms.
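As one illustration of a sketch construction, the snippet below builds MinHash signatures, a well-known LSH family for Jaccard similarity between sets. The description does not say which algorithms the library contains, so this is a generic Python sketch, not the library's API; `minhash_signature` and `estimated_jaccard` are hypothetical names, and non-empty input sets are assumed.

```python
import hashlib


def minhash_signature(items, num_hashes=64):
    """Build a MinHash signature for a non-empty collection of strings.

    Each of the num_hashes seeded hash functions contributes the minimum
    hash value it produces over all items.
    """
    signature = []
    for seed in range(num_hashes):
        signature.append(min(
            int.from_bytes(
                hashlib.sha1(f"{seed}:{item}".encode("utf-8")).digest()[:8], "big"
            )
            for item in items
        ))
    return signature


def estimated_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree; estimates Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


doc_a = {"fast", "binary", "search", "sorted", "files"}
doc_b = {"fast", "binary", "search", "large", "files"}
print(estimated_jaccard(minhash_signature(doc_a), minhash_signature(doc_b)))
```

The two example sets share 4 of 6 distinct words, so the printed estimate should fall near 0.67, with accuracy improving as `num_hashes` grows.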
Bible Concordance generator and viewer for UNIX/Linux/Cygwin, written in C.
Supports multiple-word searches using boolean operators and Reverse Polish notation. Unlike printed concordances, it shows the complete verse in context.
A Java implementation of a desktop search engine based on Apache Lucene. It indexes HTML, XML, OpenOffice (Writer, Calc, Impress), MS Word, and PDF documents as well as plain text files. For other, arbitrary file types, the file name can be indexed.
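A boolean query in Reverse Polish notation can be evaluated with a simple stack, as in the Python sketch below. The operator names `AND`, `OR`, and `NOT` and the function `eval_rpn_query` are assumptions for illustration; the tool's actual query syntax is not given here, and the tool itself is written in C.

```python
import re


def eval_rpn_query(tokens, verse):
    """Evaluate an RPN boolean word query against one verse.

    Operands are words (True if present in the verse); AND, OR, and NOT
    are hypothetical operator names that pop their arguments from the stack.
    """
    words = set(re.findall(r"[a-z']+", verse.lower()))
    stack = []
    for token in tokens:
        if token == "AND":
            right, left = stack.pop(), stack.pop()
            stack.append(left and right)
        elif token == "OR":
            right, left = stack.pop(), stack.pop()
            stack.append(left or right)
        elif token == "NOT":
            stack.append(not stack.pop())
        else:
            stack.append(token.lower() in words)
    if len(stack) != 1:
        raise ValueError("malformed RPN query")
    return stack[0]


verse = "In the beginning God created the heaven and the earth."
print(eval_rpn_query(["god", "earth", "AND"], verse))         # True
print(eval_rpn_query(["god", "light", "AND"], verse))         # False
print(eval_rpn_query(["light", "NOT", "god", "AND"], verse))  # True
```

Because operators follow their operands, no parentheses or precedence rules are needed.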
Combine is an open system for crawling Internet resources. It can be used both as a general and focused crawler.
If you want to download web pages pertaining to a particular topic (like 'Carnivorous Plants'), then Combine is the system for you!
This is a simple file search program that does not use indexed searching. It is written in Java and should work on any platform, but has been tested on Windows.
Requirements:
JRE 1.5 or higher
PyQFindReplace is a Python-based find and replace utility. It lets you select files and folders and replace text in several thousand files with one click. It has a pretty neat UI.
Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
Makes the academic documents on your hard drive searchable through an automated solution that needs limited user intervention. This is done non-intrusively: your files are not moved.
Narrows the search results produced by popular Internet search engines by letting you add extra filtering conditions, such as requiring certain words to be present or excluded.
A framework for transferring implicit or tacit knowledge between members of a community. It works by observing the behavior of community members, and providing them with suggestions from other community members based on their past behavior.
Finddupe is a VERY FAST command line C program to catalog very large archives, identifying duplicate files even when offline. It has many features. You can easily grep a catalog to find what you have, and locate where it is.
Create your own search engine using the Yahoo Boss API, which provides unlimited requests. It is written in PHP5. Demo - http://search.ourlk.com
Sabuesonix is a desktop search engine. It can explore your PDF, TXT and HTML files (and more in the future) and create an index for quick document search.
Mustru is a desktop Q&A search engine based on Lucene. You can search local filesystems using natural language questions or boolean queries. A list of answers or hits will be returned. A web based interface is included.