tiktoken is a fast BPE tokeniser for use with OpenAI's models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open-sourced unified customization model
Audiocraft is a library for audio processing and generation
A Powerful Native Multimodal Model for Image Generation
Collection of reference environments, offline reinforcement learning
Towards Real-World Vision-Language Understanding
Open-source platform for the machine learning lifecycle
Provides CTP stock options and Zhongtai Securities XTP
MemU is an open-source memory framework for AI companions
⚡ Building applications with LLMs through composability ⚡
The Fastest LLM Gateway with built-in OTel observability
The no-nonsense RAG chunking library
Easily turn large sets of image URLs into an image dataset
Interface for OuteTTS models
Central interface to connect your LLMs with external data
SAPIEN Manipulation Skill Framework
Simple and easily configurable grid world environments
CLIP: predict the most relevant text snippet given an image
Framework that is dedicated to making neural data processing
OpenDILab Decision AI Engine
MARS5 speech model (TTS) from CAMB.AI
Real-time voice interactive digital human
Towards Human-Level Text-to-Speech through Style Diffusion
End-to-end speech processing toolkit