AI-Driven Exploration in the Space of Code
AI search engine - self-host with local or cloud LLMs
A Next-Generation Training Engine Built for Ultra-Large MoE Models
SQL-Driven RAG Engine
A lightweight vLLM implementation built from scratch
Tensor search for humans
A ClickHouse fork that supports high-performance vector search
Local-first semantic code search engine
Fast Multimodal LLM on Mobile Devices
Big Model Application Development Practice 1
A high-throughput and memory-efficient inference and serving engine
LightLLM is a Python-based LLM (Large Language Model) inference
Alibaba's high-performance LLM inference engine for diverse apps
High-performance inference framework for large language models
Run AI models locally on your machine with Node.js bindings for llama
Build multimodal language agents for fast prototype and production
Request recommended movies, TV shows and anime to Jellyseerr/Overseerr
LLocalSearch is a completely locally running search aggregator
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Retrieval Augmented Generation (RAG) framework
Official Implementation of "Graph of Thoughts
AI-powered CLI git wrapper, boilerplate code generator, chat history
Experimental search engine for conversational AI such as parl.ai