Agentic, Reasoning, and Coding (ARC) foundation models
Taming Stable Diffusion for Lip Sync
StreamSpeech is a seamless model for offline speech recognition
Oobabooga - The definitive Web UI for local AI, with powerful features
AIMET is a library that provides advanced quantization and compression
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
20+ high-performance LLMs with recipes to pretrain and finetune at scale
A library for accelerating Transformer models on NVIDIA GPUs
Efficient few-shot learning with Sentence Transformers
MTEB: Massive Text Embedding Benchmark
ChatGLM3 series: Open Bilingual Chat LLMs
A state-of-the-art open visual language model
Z80-μLM is a 2-bit quantized language model
A unified, comprehensive and efficient recommendation library
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
PyTorch library of curated Transformer models and their components
An easy-to-use LLM quantization package with user-friendly APIs
Open platform for training, serving, and evaluating language models
Visual Automation IDE — automate anything you see on screen
AI Suite for upscaling, interpolating & restoring images/videos
Visual Instruction Tuning: Large Language-and-Vision Assistant
Meta-database application for the audio and TV broadcasts of CC2.TV
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
Basaran, an open-source alternative to the OpenAI text completion API