Definitions for AI/ML tasks like dataset creation
Open-source infrastructure for Computer-Use Agents. Sandboxes
Practical productivity tools for Claude Code, Codex-CLI
The largest collection of PyTorch image encoders / backbones
The repository provides code for running inference with SAM 2
A neural network that transforms a design mock-up into static websites
The official gpt4free repository
CLIP, Predict the most relevant text snippet given an image
Generate audiobooks from e-books, voice cloning & 1107+ languages
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Audiocraft is a library for audio processing and generation
Lets make video diffusion practical
Chinese Llama-3 LLMs) developed from Meta Llama 3
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Utilities intended for use with Llama models
Your Personal AI Assistant; easy to install, deploy on local or coud
Create videos with Stable Diffusion
Controllable and fast Text-to-Speech for over 7000 languages
Contexts Optical Compression
Sandbox for training deep learning networks
Block Diffusion for Ultra-Fast Speculative Decoding
"Big Model" trains a visual multimodal VLM with 26M parameters
Ling is a MoE LLM provided and open-sourced by InclusionAI