A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Helping you get the most out of AWS, wherever you use MCP
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Real-World Centric Foundation GUI Agents
Self-healing browser harness that enables LLMs to complete any task
RAG Search API
Block Diffusion for Ultra-Fast Speculative Decoding
ICLR2024 Spotlight: curation/training code, metadata, distribution
Inference Llama 2 in one file of pure C
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
AI-powered semantic indexing: automating the creation of book indexes
Did you say you like data?
The unofficial python package that returns response of Google Bard
AIlice is a fully autonomous, general-purpose AI agent
AI code-writing assistant that understands data content
Self-Modifying Framework from the Future
Serving multiple LoRA finetuned LLM as one
Explore large language models in 512MB of RAM
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam
Python package for easily interfacing with chat apps
An unnecessarily tiny implementation of GPT-2 in NumPy
Large model-based chatbot builder that can quickly integrate AI models
A text generation library with pre-trained language models github.com
A Deep-Learning-Based Chinese Speech Recognition System