Uncommon Objects in 3D dataset
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
A Powerful Native Multimodal Model for Image Generation
SOTA discrete acoustic codec models with 40/75 tokens per second
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
[CVPR 2025 Best Paper Award] VGGT
The machine learning toolkit for time series analysis in Python
Flexible Photo Recrafting While Preserving Your Identity
A game-theoretic approach to explain the output of ML models
Efficient few-shot learning with Sentence Transformers
Large language model and vision-language model based on linear attention
Automatically translates the text of a video based on a subtitle file
A fast library for AutoML and tuning
Fast backend for long-term AI user memory via structured profiles
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Sharp Monocular Metric Depth in Less Than a Second
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
The easiest way to use deep metric learning in your application
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A chatbot built on a large model
Simple and easily configurable grid world environments
UI-TARS-desktop version that can operate on your local personal device
4M: Massively Multimodal Masked Modeling
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI