AI-Driven Exploration in the Space of Code
AI search engine - self-host with local or cloud LLMs
SQL-Driven RAG Engine
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A lightweight vLLM implementation built from scratch
Tensor search for humans
A @ClickHouse fork that supports high-performance vector search
Local-first semantic code search engine
Fast Multimodal LLM on Mobile Devices
A high-throughput and memory-efficient inference and serving engine
LightLLM is a Python-based LLM (Large Language Model) inference
Alibaba's high-performance LLM inference engine for diverse apps
High-performance inference framework for large language models
Run AI models locally on your machine with node.js bindings for llama
Request recommended movies, TV shows and anime to Jellyseerr/Overseerr
Build multimodal language agents for fast prototyping and production
LLocalSearch is a completely locally running search aggregator
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Retrieval Augmented Generation (RAG) framework
Official Implementation of "Graph of Thoughts
AI-powered CLI git wrapper, boilerplate code generator, chat history
Experimental search engine for conversational AI such as parl.ai