Open source libraries and APIs to build custom preprocessing pipelines
Instill Core is a full-stack AI infrastructure tool for data
Superlinked is a Python framework for AI Engineers
Parse files for optimal RAG
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Extract schema, statistics and entities from datasets
Autonomous LLM agent for end-to-end data science workflows
Context database designed specifically for AI Agents
Claude Code skill for generating production-quality SVG+PNG technical
A fast, helpful, and open-source document parser
Central interface to connect your LLM's with external data
Vector database for scalable similarity search and AI applications
The open source mesh processing system
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Python module for parsing semi-structured text into python tables
A system for agentic LLM-powered data processing and ETL
CrateDB is a distributed and scalable SQL database
A modular graph-based Retrieval-Augmented Generation (RAG) system
No-code LLM Platform to launch APIs and ETL Pipelines
Fast and efficient unstructured data extraction
AI-data warehouse to enrich, transform and analyze unstructured data
Clean network diagrams, One-time setup, zero upkeep
Fluentd: Unified Logging Layer (project under CNCF)
Web framework designed for speed, security, and SEO
Open Source Data & Experience Management Platform