HY-Motion model for 3D character animation generation
Repo of Qwen2-Audio chat & pretrained large audio language model
High-Fidelity and Controllable Generation of Textured 3D Assets
Framework for building neural networks
This repository contains the official implementation of FastVLM
Converts text to speech in realtime
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
100–200× Acceleration for Video Diffusion Models
StreamSpeech is a seamless model for offline speech recognition
Interpretable prompting and models for NLP
MARS5 speech model (TTS) from CAMB.AI
Python framework for adversarial attacks, and data augmentation
GPT4V-level open-source multi-modal model based on Llama3-8B
An undetectable, powerful, flexible, high-performance Python library
User toolkit for analyzing and interfacing with Large Language Models
PyTorch extensions for fast R&D prototyping and Kaggle farming
An open phone agent model & framework
Multi-Agent daTa geneRation Infra and eXperimentation framework
Test Suites for validating ML models & data
Implementation of Make-A-Video, new SOTA text to video generator
Deep learning optimization library making distributed training easy
LLM Council works together to answer your hardest questions
Renderer for the harmony response format to be used with gpt-oss
Virtual AI anchor that combines state-of-the-art technology
Python library and CLI tool to interface with Google Translate