Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
Poor Man's HPC is a framework that allows distributing and running code on a server farm. pmHPC is a scaled down and simplified version of distributed computing projects such as SETI, so is a perfect fit for enthusiasts and universities.
DAT Freight and Analytics operates DAT One truckload freight marketplace
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
NetWhistler is SNMP monitoring software that offers integrated fault and performance management functionality. This Java program auto-detects networks and presents them on graphical maps, which reflect host status. Mila NNM is a pyQT fork of Netwhistler.
TimeVault is a simple GUI for making automated snapshots of files and directories. Previous versions of a file or directory can be accessed by examining the properties and selecting the 'Previous Versions' tab in GNOME's file manager.
The CCM Tools are CASE tools used for generating CORBA components, test components, and test programs based on source IDL files. Various target languages can be generated. Scripting language wrappers can also be generated to enable rapid prototyping.
Java Hibernation Package is an experimental implementation of Java thread serialization mechanism based on the Java Platform Debugger Architecture (JPDA). It is designed to support serialization of synchronizing multiple threads.
Mangage all your servers/desktops from one place. Install packages, push files, get sw/hw inventory, keep a changelog for all systems, setup auto-installation systems (kickstart and others).
All this by working with groups of hosts, not individually
Create email archives getting IMAP mailfolders or POP3 mailboxes. Search emails by sender/receiver/subject/body among already existing backup files (i.e. on dvd) in plain or compressed format. Tested on Linux-x86, Linux-PPC and Windows-XP platforms.
Simple Cifs Mount is a tool to be redistributed by System Administrators to end users, the aim to provide a simple login box for end users to mount predefined services.
webapp-config is the installer for web-based applications of the Gentoo Linux distribution. It is being used for the automatic setup of web applications in a virtual hosting environment.
OpenPavilion is a free surveillance suite made for observing, monitoring and benchmarking your applications and networks. OpenPavilion displays comprehensive relationships using simple to understand dashboard views.
Hard Search is a program for windows written entirely in Python that searches any valid hard drives. It is capable of indexing hard drives for instant search results.
Universal information crawler is a fast precise and reliable Internet crawler. Uicrawler is a program/automated script which browses the World Wide Web in a methodical, automated manner and creates the index of documents that it accesses.
Cold Launcher - Keyboard/search driven program launcher written in Python+wxPython for Windows XP/Win32. Allows quick launching of programs without using the mouse and by searching.