MOSS‑TTS Family: open‑source speech and sound generation models
Bidirectional token-classification model for identifiable info
Achieving 3x+ generation speedup on reasoning tasks
Ultra-Efficient LLMs on End Devices
Zero-code platform for building AI agents from natural language input
AI multi-agent framework for automating data-driven R&D workflows
Faster and easier training and deployments
Designed for training LLM/VLM agents via RL
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
High-performance inference framework for large language models
Pretrained time-series foundation model developed by Google Research
CLI tool for configuring and monitoring Claude Code
Long-form streaming TTS system for multi-speaker dialogue generation
The power of Claude Code / Gemini CLI / Codex CLI
Lemonade helps users run local LLMs with the highest performance
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A flexible, high-performance 3D simulator for Embodied AI research
A PyTorch library for implementing flow matching algorithms
One-click local MCP server installation in desktop apps
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model