Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
This is a simple Java front-end for the UNIX grep. It lets you search file contents also if you are not familiar with the command line or the regular expression syntax.
Create new Your Own Search Engine using Yahoo Boss API. Yahoo Boss API provide unlimited request. This is writing on PHP5. Demo - http://search.ourlk.com
Sabuesonix is a desktop search engine. It can explore your PDF, TXT and HTML files (and more in the future) and create an index for quick documents search.
Maloney finds documents, mails, photos and many other files on your desktop computer. It comes as a standalone application or as a plug-in for your eclipse IDE.
The GIS Metadata Manager is a simple Java-based desktop application to manage files from geographic information systems (GIS) in a specified directory by grouping files into records, adding metadata and searching the metadata to find files.
Mustru is a desktop Q&A search engine based on Lucene. You can search local filesystems using natural language questions or boolean queries. A list of answers or hits will be returned. A web based interface is included.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
Regular expressions system with Lisp-styled syntax.
Also this software can be used as a library for embedding lisp-syntax regular expressions in an another programm.
AASE(Anarchy Advantage Search Engine) is a search engine that reorders search results with base in the habits of its users. By measuring user activity on the search engine result page the search engine constantly improves the search results.
Plait (pronounced "play") is a command-line jukebox and music player front-end. It understands brief queries that pick a single song, mix queries that combine works from multiple artists, and stream queries that find Shoutcast radio streams.
Single Click Real Time searching of both structured and unstructured data and information.
Simultaneous searching of Structured: databases and unstructured: documents from within a web browser, desktop application and application plugins
This project is based on the PHP Sphider search engine by Ando Saabas, published at www.sphider.eu. As he concentrates on basic functions, here developers and end-users may find additional modules, plug-ins and ready to use full versions.
Filter files in Eclipse Navigator panel using patterns and regular expressions. Apply filters to specific folders, packages, or projects! Install this plug-in now from within Eclipse using update site http://eclipseresfltr.sourceforge.net/update
Meme is a multi-agent system. It aggregates literature information gathered from different sources into a viable format. It provides a visualization search and exports the literature information for users. It also integrates JADE and Nutch.
Like Unix-Tail BUT:
- Runs with or without GUI
- Suspend and resume tailing at runtime
- Can monitor a set of Files
- Print output to a textfield, stdout or file
- Runs in "Grep" mode, too (Read files once)
- (Almost) the same options as Unix-Tail
Exostar LDAP Proxy is a specialized LDAP proxy used to look up X.509 encryption certificates for prospective recipients in secure e-mail applications. It can be used to fetch other types of end user certificates, CA certificates and CRLs
Phuzby is a small lightweight cross-platform phone book query tool for Windows and Linux which sits in the task tray and allows quick and easy access to LDAP enabled directories.
This is the "Eclipse of Web Browsers", a secure social web browsing and multi-user messaging system. You will need to run your own version of mysql. Development is active and we are seeking project leaders. Please email suprasphere___at___gmail.com.
Full text search engine - console tools and GUI frontends for users, program components and libraries for developers. Cross-platform, portable (Win32/64, .NET, Linux). Extensible architecture. Morphology of natural languages (English, Russian and French)
Didaskon will deliver a framework for assembling a curriculum from existing learning objects provided by e-Learning services. The selection of learning objects will be based on the semantically annotated specification of the user's current skills.
j-sand is an advanced search tool written in java for developers. In the current first version it searches through a directory tree to find all files of a specified type and then inside those files for the specified search string.
DuMP3 is a duplicate and similar file finder. It finds exact duplicate binaries by hash, similar text files by substring content, images (JPG, BMP, GIF, PNG, etc) by color and audio files (MP3, WAV, OGG, etc) by wave data. Future: fonts, video.