An official Qdrant Model Context Protocol (MCP) server implementation
Koog is the official Kotlin framework for building AI agents
SimpleMem: Efficient Lifelong Memory for LLM Agents
NLP Cloud serves high-performance pre-trained or custom models for NER
Official Microsoft Learn MCP Server, powering LLMs and AI agents
Bringing BERT into modernity via both architecture changes and scaling
Demystify RAG by building it from scratch
A suite of tools to develop RAG, semantic search, and other AI apps
Semantic search and document parsing tools for the command line
AI-enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek
An LLM-based Multi-agent Framework for Web Search Engines
Retrieval and Retrieval-augmented LLMs
RAG Search API
Explore large language models in 512MB of RAM
Task-oriented finetuning for better embeddings on neural search
CPU/GPU inference server for Hugging Face transformer models
A platform for building vector-based applications
BGE-Large v1.5: High-accuracy English embedding model for retrieval
Compact English sentence embedding model for semantic search tasks
Efficient English embedding model for semantic search and retrieval