Agentic, Reasoning, and Coding (ARC) foundation models
Taming Stable Diffusion for Lip Sync
StreamSpeech is a seamless model for offline speech recognition
Oobabooga - The definitive Web UI for local AI, with powerful features
AIMET is a library that provides advanced quantization and compression
Efficient few-shot learning with Sentence Transformers
ChatGLM3 series: Open Bilingual Chat LLMs
20+ high-performance LLMs with recipes to pretrain and finetune at scale
Claude Code for everything except coding
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Z80-μLM is a 2-bit quantized language model
A unified, comprehensive and efficient recommendation library
A state-of-the-art open visual language model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A library for accelerating Transformer models on NVIDIA GPUs
MTEB: Massive Text Embedding Benchmark
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
PyTorch library of curated Transformer models and their components
Visual Automation IDE — automate anything you see on screen
AI Suite for upscaling, interpolating & restoring images/videos
An easy-to-use LLM quantization package with user-friendly APIs
Visual Instruction Tuning: Large Language-and-Vision Assistant
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
Basaran, an open-source alternative to the OpenAI text completion API
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)