Shell command execution server implementing the Model Context Protocol
Python Stream Processing
Chat & pretrained large audio language model proposed by Alibaba Cloud
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Taming Stable Diffusion for Lip Sync
Official inference repo for FLUX.2 models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
AlphaFold 3 inference pipeline
Qwen3-omni is a natively end-to-end, omni-modal LLM
Open-source framework for intelligent speech interaction
A nearly-live implementation of OpenAI's Whisper
An experimental version of DeepSeek model
A game theoretic approach to explain the output of ml models
Oobabooga - The definitive Web UI for local AI, with powerful features
Multi-modal large language model designed for audio understanding
Large Audio Language Model built for natural interactions
Open source framework for deep learning satellite and aerial imagery
Tool for visualizing and tracking your machine learning experiments
An open sourced end-to-end VLM-based GUI Agent
Controllable & emotion-expressive zero-shot TTS
Chat & pretrained large vision language model
Qwen3-Coder is the code version of Qwen3
Dataset of GPT-2 outputs for research in detection, biases, and more
Hunyuan Translation Model Version 1.5
Qwen3-TTS is an open-source series of TTS models