Instant voice cloning by MIT and MyShell. Audio foundation model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Evaluation suite designed to assess the performance of LLMs
TextWorld is a sandbox learning environment for the training
21 Lessons, Get Started Building with Generative AI
GUI/CLI tool for downloading Xiaohongshu
Chinese Llama-3 LLMs) developed from Meta Llama 3
Chinese XLNet pre-trained model
Multi-lingual large voice generation model, providing inference
Collection of reference environments, offline reinforcement learning
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
InvokeAI is a leading creative engine for Stable Diffusion models
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The data structure for multimodal data
Train a 26M-parameter GPT from scratch in just 2h
Build your chatbot within minutes on your favorite device
Interact with your SQL database, Natural Language to SQL using LLMs
Tools to build web AI agents that can authenticate
Optax is a gradient processing and optimization library for JAX
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Level Text-to-Speech through Style Diffusion
Provides convenient access to the Anthropic REST API from any Python 3
An MLOps framework to package, deploy, monitor and manage models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A batteries-included library for building AI-powered software