Open-Source Financial Large Language Models
My personal Claude Code configuration
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek Coder: Let the Code Write Itself
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Audio foundation model excelling in audio understanding
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Qwen3-ASR is an open-source series of ASR models
Collection of Gemma 3 variants that are trained for performance
A series of math-specific large language models of our Qwen2 series
Long-form streaming TTS system for multi-speaker dialogue generation
Pushing the Limits of Mathematical Reasoning in Open Language Models
Official DeiT repository
Learning to Act by Watching Unlabeled Online Videos
Open-source code agent designed for Lean 4
Small 3B-base multimodal model ideal for custom AI on edge hardware
Large-scale xAI model for local inference with SGLang, Grok-2.5
Omnimodal AI model for agents, coding, and long-context tasks
JetBrains’ 4B parameter code model for completions
Speculative-decoding accelerator for the 675B Mistral Large 3
Versatile 8B-base multimodal LLM, flexible foundation for custom AI
Powerful 14B-base multimodal model — flexible base for fine-tuning