High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
A minimal yet professional single agent demo project
Real-time voice interactive digital human
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Speech-AI-Forge is a project developed around TTS generation model
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Benchmarking synthetic data generation methods
Making Enterprise Data Intelligent and Responsive for AI
AIMET is a library that provides advanced quantization and compression
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Powering Amazon custom machine learning chips
Open source machine learning framework to automate text conversations
An advanced paper search agent powered by large language models
GUI Exploration Lab. One of the best GUI agent solutions
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Free, high-quality text-to-speech API endpoint to replace OpenAI
Building a Secure and Interoperable Future for AI-Driven Payments
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
A python library for easy manipulation and forecasting of time series
Capable of understanding text, audio, vision, video