Integrate cutting-edge LLM technology quickly and easily into your app
System Level Intelligent Router for Mixture-of-Models at Cloud
Official code repo for the O'Reilly Book
Local-first semantic code search engine
SimpleMem: Efficient Lifelong Memory for LLM Agents
Search all of YouTube from the command line
Data Infrastructure providing an approach to multimodal AI workloads
Minimal reproduction of OneRec
Research project. A Memory solution for users, teams, and applications
Multilingual sentence & image embeddings with BERT
A @ClickHouse fork that supports high-performance vector search
Demystify RAG by building it from scratch
Retrieval and Retrieval-augmented LLMs
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Bringing BERT into modernity via both architecture changes and scaling
SQL-Driven RAG Engine
A tension reasoning engine over 131 S-class problems
Knowledge Graph Generation from Any Text
This repository provides an advanced RAG
Uncertainty Quantification for Language Models, a Python package
Research and application of technologies such as natural language processing
A New Axis of Sparsity for Large Language Models
AI Powered Knowledge Graph Generator
LISA: Reasoning Segmentation via Large Language Model
Open-source enterprise-level AI knowledge base and MCP