Tiny vision language model
Collection of Gemma 3 variants that are trained for performance
Frameworks for language model reinforcement learning environments
Build, evaluate and train General Multi-Agent Assistance with ease
Meta Agents Research Environments is a comprehensive platform
Talk to Your AI Agents from Anywhere
Inference code for scalable emulation of protein equilibrium ensembles
48 kHz stereo neural audio codec for general audio
Optax is a gradient processing and optimization library for JAX
A very simple framework for state-of-the-art NLP
Mentat - The AI Coding Assistant
Seamlessly integrate LLMs into scikit-learn
State-of-the-art Parameter-Efficient Fine-Tuning
OCR expert VLM powered by Hunyuan's native multimodal architecture
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Generate blog articles from video or audio
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Level Text-to-Speech through Style Diffusion
DeepMind model for tracking arbitrary points across videos & robotics
Code for Mesh R-CNN, ICCV 2019
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Inference Llama 2 in one file of pure C
Tool for visualizing and tracking your machine learning experiments
Train machine learning models within Docker containers
An AI agent development platform with all-in-one visual tools