Official DeiT repository
Foundational Models for State-of-the-Art Speech and Text Translation
One-click local MCP server installation in desktop apps
Analyze computation-communication overlap in V3/R1
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Repo of Qwen2-Audio chat & pretrained large audio language model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Global weather forecasting model using graph neural networks and JAX
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Clean and efficient FP8 GEMM kernels with fine-grained scaling
The ChatGPT Retrieval Plugin lets you easily find personal documents
Designed for text embedding and ranking tasks
Capable of understanding text, audio, vision, video
A Unified Framework for Text-to-3D and Image-to-3D Generation
Open-source large language model family from Tencent Hunyuan
Audio foundation model excelling in audio understanding
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Example Discord bot written in Python that uses the completions API
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
A fast, local neural text to speech system