AudioMuse-AI is an Open Source Dockerized environment
Sharp Monocular Metric Depth in Less Than a Second
Renderer for the harmony response format to be used with gpt-oss
kaldi-asr/kaldi is the official location of the Kaldi project
A Powerful Native Multimodal Model for Image Generation
Educational framework exploring multi-agent orchestration
The standard data-centric AI package for data quality and ML
Training data (data labeling, annotation, workflow) for all data types
AI Toolkit for Healthcare Imaging
Synchronized Translation for Videos
MOSS‑TTS Family open‑source speech and sound generation model
26m function call model that runs on incredibly small devices
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Autonomous harness engineering
All-in-one native macOS AI chat application
Open Agent Harness with a built-in personal agent, Ohmo
Workplace AI platform for enterprise search and workflow automation
OCR model for complex documents with layout-aware structured outputs
Codes/Notebooks for AI Projects
A general fine-tuning kit geared toward image/video/audio diffusion
Pluggable SOTA multi-object tracking modules for segmentation
An efficient forwarding service designed for LLMs
Maimaibot, a (more focused) multi-platform intelligent agent
Weaving the Digital Agent Galaxy
Llama Chinese community, real-time aggregation