The Library for LLM-based multi-agent applications
Real-time voice interactive digital human
Toolkit for audio, music, and speech generation
Set of tools to assess and improve LLM security
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Open-source large language model family from Tencent Hunyuan
Benchmarking Multimodal Agents for Open-Ended Tasks
Advanced evolutionary computation library built on top of PyTorch
Implementation of RLHF (Reinforcement Learning with Human Feedback)
Massively parallel rigidbody physics simulation
Official inference library for Mistral models
PPTAgent: Generating and Evaluating Presentations
Foundational model for human-like, expressive TTS
Renderer for the harmony response format to be used with gpt-oss
Educational framework exploring multi-agent orchestration
A neural network that transforms a design mock-up into static websites
Large Multimodal Models for Video Understanding and Editing
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
This repository contains the official implementation of FastVLM
PyTorch code and models for the DINOv2 self-supervised learning
Official implementation of DreamCraft3D
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Speech-AI-Forge is a project developed around TTS generation model
Diversity-driven optimization and large-model reasoning ability