A TTS model capable of generating ultra-realistic dialogue
Industrial-level controllable zero-shot text-to-speech system
Sharp Monocular Metric Depth in Less Than a Second
GUI/CLI tool for downloading Xiaohongshu
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Set of tools to assess and improve LLM security
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Open-source large language model family from Tencent Hunyuan
A Modular Simulation Framework and Benchmark for Robot Learning
TextWorld is a sandbox learning environment for the training
Training framework for Stable Baselines3 reinforcement learning agents
The Library for LLM-based multi-agent applications
Foundational model for human-like, expressive TTS
Renderer for the harmony response format to be used with gpt-oss
Educational framework exploring multi-agent orchestration
PPTAgent: Generating and Evaluating Presentations
Large Multimodal Models for Video Understanding and Editing
Benchmarking Multimodal Agents for Open-Ended Tasks
Advanced evolutionary computation library built on top of PyTorch
Implementation of RLHF (Reinforcement Learning with Human Feedback)
Massively parallel rigidbody physics simulation
Official inference library for Mistral models
Speech-AI-Forge is a project developed around TTS generation model
Diversity-driven optimization and large-model reasoning ability
A minimal yet professional single agent demo project