A powerful tool for creating datasets for LLM fine-tuning
Central interface to connect your LLMs with external data
The ultimate RAG for your monorepo
Fast State-of-the-Art Static Embeddings
Minimal Python framework for fast, scalable AI inference servers
Open source RAG framework for building scalable modular AI apps
High-performance inference server and API layer for text embedding models
Mastering Applied AI, One Concept at a Time
Papers integrating knowledge graphs (KGs) and large language models
RAG Search API
Ready-to-run cloud templates for RAG
A lightweight, lightning-fast, in-process vector database
Making RAG Simpler with Small and Open-Sourced Language Models
SimpleMem: Efficient Lifelong Memory for LLM Agents
A New Axis of Sparsity for Large Language Models
Advanced RAG cookbooks for building accurate LLM applications
Build production-ready AI agents in both Python and TypeScript
Ship AI Agents to Google Cloud in minutes, not months
AI-powered document analysis and tagging for Paperless-ngx
Local RAG engine for private multimodal knowledge search on devices
A collection of scientific methods, processes, algorithms
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan
Korvus is a search SDK that unifies the entire RAG pipeline
Data Infrastructure providing an approach to multimodal AI workloads