Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
FusionAuth: Authentication and User Management Software
Offer your users flexible authentication options, including passwords, passwordless, single sign-on (SSO), and multi-factor authentication (MFA).
FusionAuth adds login, registration, SSO, MFA, and a bazillion other features to your app in days - not months.
StrMatch is a simple java program, created for educational purposes, that allows user to test a great part of string matching algorithms.
An intuitive user interface shows each algorithm step accurately...
PileWorks provides the organizational structure for coordinating several different projects which approach some aspect of Pile Technology. PileWorks defines a set of interfaces and implements some basic infrastructure for Pile engines and agents.
A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
Generative Al is shaping brand discovery. AthenaHQ ensures your brand leads the conversation.
AthenaHQ is a cutting-edge platform for Generative Engine Optimization (GEO), designed to help brands optimize their visibility and performance across AI-driven search platforms like ChatGPT, Google AI, and more.
myDbSearcher is a search engine for MySQL Databases. It is written in Java. It scans several tables on different databases. A XMLRPC-Server will give you access to the Index.
Currently it runs on http://www.idowa.de/ueberblick/suche/index_html
Gumshoe Desktop Search indexes local files of various formats on a Windows desktop and provides a search GUI. The project is developed in Java. It builds on other opensource projects including Lucene, Luke, SWT, tagsoup, Jakarta POI and others.
jFinder indexates your data files, so you can find easily files corresponding to a patern, i.e. to list your albums of Marilyn Manson, you'll type "mar mans". You can then enqueue them in your favorite media player.
Create and convert pipeline at scale through industry leading SMS campaigns, automation, and conversation management.
TextUs is the leading text messaging service provider for businesses that want to engage in real-time conversations with customers, leads, employees and candidates. Text messaging is one of the most engaging ways to communicate with customers, candidates, employees and leads. 1:1, two-way messaging encourages response and engagement. Text messages help teams get 10x the response rate over phone and email. Business text messaging has become a more viable form of communication than traditional mediums. The TextUs user experience is intentionally designed to resemble the familiar SMS inbox, allowing users to easily manage contacts, conversations, and campaigns. Work right from your desktop with the TextUs web app or use the Chrome extension alongside your ATS or CRM. Leverage the mobile app for on-the-go sending and responding.
JavaCat is an app for managing files on your different drives (cds, hdds...). It's especially usefull for searching files on your exchangeable drives such as cds and dvds. Instead of digging through several disc just use JavaCat to find reqired file and
Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian allows you to easily add advanced indexing and search facilities to your applications. See www.xapian.org for more information.
BeanQuery is a Java solution that allows for querying arbitrary collections of arbitrary object types by using a criteria like API in a declarative and typesafe manner.
The application should take a directory and parse it recursively or not, and execute a given executor. The executors shuold be pluggable and rendered on the application's ui according to some strict rules.
A searcher and indexer to allow easy and fast locating of relevent information from a large collection of research papers. A Java backend with a web based frontend. Based on the Lucene indexer and searcher
Find files within other ZIP JAR WAR EAR files. Search recursively for file names or search strings. This Java based utility can save you time when you wonder in which jar or ear file a particular class is defined.
I AM File Indexing can index files in given folders and make the content search able. Written in pure java it is meant for people who need very basic web-site search or multiple files search capability for their java applications.
ScenConnect shows scenarios as networks of situation and event tag sets, for fast comparisons. It links scenarios to tags, scores, and other metadata, creating situationals suitable for search, mining, machine learning, and planning.
Looks at file names in a directory and finds common parts in them trying to search similar and repeated ones. Useful when you have multiple files which differ in checksum and somehow in names. You can collect them and make a decision afterwards.
Hyper-M is a bluetooth based DHT peer-to-peer infrastructure for J2ME (CLDC1.1/ MIDP2.0) enabled handphones. Hyper-M allows the user to create a peer-to-peer network and share and retrieve files on this network. Has been tested mainly on Nokia handphones
InfoSpace is an application which indexes and then allows you to search your personal information space, such as your email, your documents, your music, your videos, your Flickr account, your news feeds, the web pages you've visited and much more.