Capable of understanding text, audio, vision, video
Real-time voice interactive digital human
An unsupervised and free tool for image and video dataset analysis
Document Image Parsing via Heterogeneous Anchor Prompting”
Streamline your ML workflow
VITS2 backbone with multilingual-bert
Open platform for training, serving, and evaluating language models
ChatGLM2-6B: An Open Bilingual Chat LLM
A minimal yet professional single agent demo project
Interface for OuteTTS models
Python package for AutoML on Tabular Data with Feature Engineering
Tensor search for humans
An undetectable, powerful, flexible, high-performance Python library
SWE-agent takes a GitHub issue and tries to automatically fix it
A high-performance ML model serving framework, offers dynamic batching
On-device Speech-to-Intent engine powered by deep learning
StreamSpeech is a seamless model for offline speech recognition
Open-source MCP server that gives your coding agent
Language modeling in a sentence representation space
Spatiotemporal Signal Processing with Neural Machine Learning Models
Repo of Qwen2-Audio chat & pretrained large audio language model
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Multi-Voice and Prompt-Controlled TTS Engine
Smart Thermodynamic Modeling with Graph Neural Networks
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments