Towards Human-Level Text-to-Speech through Style Diffusion
High-quality multi-lingual text-to-speech library by MyShell.ai
OCR expert VLM powered by Hunyuan's native multimodal architecture
Pruna is a model optimization framework built for developers
Generate short videos with one click using AI LLM
Pokee Deep Research Model Open Source Repo
The leading agent orchestration platform for Claude
Agent framework and applications built upon Qwen>=3.0
airda(Air Data Agent
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The easiest, and fastest way to run AI-generated Python code safely
Reference PyTorch implementation and models for DINOv3
Go ahead and axolotl questions
SGLang is a fast serving framework for large language models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Easiest and laziest way for building multi-agent LLMs applications
The official repo of Qwen chat & pretrained large language model
Ongoing research training transformer models at scale
Supercharge Your LLM with the Fastest KV Cache Layer
text and image to video generation: CogVideoX (2024) and CogVideo
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Meta Agents Research Environments is a comprehensive platform
Portia Labs Python SDK for building agentic workflows
A coding-free framework built on PyTorch
Models for the spaCy Natural Language Processing (NLP) library