Open Source Speech Language Model
From Vibe Coding to Agentic Engineering
Contexts Optical Compression
Large language model & vision-language model based on Linear Attention
Ultra-Efficient LLMs on End Device
Multimodal model achieving SOTA performance
A theoretical reconstruction of the Claude Mythos architecture
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-source multi-speaker long-form text-to-speech model
Large Multimodal Models for Video Understanding and Editing
Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Encoder of greater-than-word length text trained on a variety of data
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Dataset of GPT-2 outputs for research in detection, biases, and more
LLM providing reasoning and conversational capabilities
Model that fuses instruction-following, reasoning, and agentic skills
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
Omnimodal AI model for agents, coding, and long-context tasks
Efficient MoE model for million-token reasoning and coding
Google’s flagship dense multimodal model for coding and reasoning
Compact 8B multimodal instruct model optimized for edge deployment
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 14B multimodal instruct model with edge deployment and FP8