Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
AI-First Supply Chain Management
Supply chain managers, executives, and businesses seeking AI-powered solutions to optimize planning, operations, and decision-making across the supply
Logility is a market-leading provider of AI-first supply chain management solutions engineered to help organizations build sustainable digital supply chains that improve people’s lives and the world we live in. The company’s approach is designed to reimagine supply chain planning by shifting away from traditional “what happened” processes to an AI-driven strategy that combines the power of humans and machines to predict and be ready for what’s coming. Logility’s fully integrated, end-to-end platform helps clients know faster, turn uncertainty into opportunity, and transform the supply chain from a cost center to an engine for growth.
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project, each component is released under the BSD License, and is written in the language most suited to its task.
System to assist with the management of a local Buddhist study library. Will download and install chosen documentation (sutras, etc), index them, and provide a web interface. Python, HTML. Needs a webserver, namazu. Can be used for other documents, too.
CVS-Summary is a program that generates an HTML summary of CVS activity, very similar to that provided by the popular ViewCVS script. CVS-Summary differs from ViewCVS in that it dumps the summary information to static HTML files which can be served by n
LAMP eGovernment Database Project offers state and local governments a free open source, web-enabled system for use in developing public information sites. You can also use this system for government-to-government systems as well.
All-in-one security tool helps you prevent ransomware and breaches.
SIEM + Detection and Response for IT Teams
Blumira’s detection and response platform enables faster resolution of threats to help you stop ransomware attacks and prevent data breaches. We surface real threats, providing meaningful findings so you know what to prioritize. With our 3-step rapid response, you can automatically block known threats, use our playbooks for easy remediation, or contact our security team for additional guidance. Our responsive security team helps with onboarding, triage and ongoing consultations to continuously help your organization improve your security coverage.
Ferret CMS is a Content Management System based on Zope. It is focused on easy administration and fast deployment of a web site. It has a workflow mechanism with roles that can be assigned to backend users.
Emine is a python script that parses an email file, separates all the email elements, including words and phrases, and populates a database with file offsets for retrieval from the original file.
Open Source Application for databasing your Music Collection(s). iChoons will utilize other open source products such as MySQL, Apache Webserver and PHP as well as Python / wxPython and SQL Lite. We will also be including tools written in Python for Win3
HORUS is a system for knowledge acquisition, hypothesis generation, inference and learning. It is an interactive, internet environment accessible to a diverse community of users (public-access or membership basis) - see also UMKAILASH project for more.
Dun and Bradstreet Connect simplifies the complex burden of data management
Our self-service data management platform enables your organization to gain a complete and accurate view of your accounts and contacts.
The amount, speed, and types of data created in today’s world can be overwhelming. With D&B Connect, you can instantly benchmark, enrich, and monitor your data against the Dun & Bradstreet Data Cloud to help ensure your systems of record have trusted data to fuel growth.
This project is for credit in UIUC's CS327 Software Engineering class.
This is a class scheduler written in ocaml and python, using a web interface. The feature list will be decided as the project progresses.
Squishdot is a web-based news publishing and discussion product for Zope. It is written in Python with the ability to script in DTML and runs on Linux, *BSD, Unix and Windows 95/98/NT/200.
PySMBSearch is a crawler and search engine for SMB shares. It consists of a crawler script, which creates an index and stores it in an SQL database, and a CGI script that can be used to extract queries from the database.
Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian allows you to easily add advanced indexing and search facilities to your applications. See www.xapian.org for more information.
The NLADA E-Library is an add-on product for the Zope web application server. It is designed to be a drop-in web application and content managment system for creating web based document libraries.
This project is a Python-based HTTP web proxy server that hooks into MySQL to store a full history of your browsing. Allows you to check out statistics about your browsing habits. Creates a personal portal page, has search features, multi-user, filters.
Tyriel is an open-source search engine written in python and designed to run within a small group of sites (but potentially extensible to a greater scope).
JeeZez is a distributed information mangement/publishing framework. Currently the focus is to develop a module for software-management, but more modules will follow.
Frosttie (FROnt-end SchemaTron Text Internet Engine) takes XHTML pages and processes them with various user-definable filters such a W3C's WAI, Section 508 (US) web usability compliance, ad removal, etc. It can be used with zKnowMan.
Omseek has been renamed to Xapian. Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. It allows you to easily add advanced indexing and search facilities to your applications.
Work it! is a job post fetcher. It can fetch jobs posts on multiple websites, and store in a local database for later reviewing. Setup as an automated process for nightly (or daily) scanning. Then use viewer to scan at your leisure.
PyEsp - Enhanced/Evolving/Extensible Semantic Profiling.
This Python program will sort and filter search results by applying semantic profiling on web pages. The program will learn the user preferences and profiling will be done on the client computer.