Research code artifacts for Code World Model (CWM)
Open-source, high-performance AI model with advanced reasoning
Qwen3-Coder is the code version of Qwen3
Powerful AI language model (MoE) optimized for efficiency/performance
Fully automatic censorship removal for language models
Large-language-model & vision-language-model based on Linear Attention
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A high-performance ML model serving framework, offers dynamic batching
Utilities intended for use with Llama models
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Parallax is a distributed model serving framework
Framework and no-code GUI for fine-tuning LLMs
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
The official Meta Llama 3 GitHub site
Uncertainty Quantification for Language Models, is a Python package
All-in-one WebUI for AI generative image and video creation
Advanced language and coding AI model
AirLLM 70B inference with single 4GB GPU
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
StarVector is a foundation model for SVG generation
Train a 26M-parameter GPT from scratch in just 2h
Qwen3 is the large language model series developed by Qwen team
LISA: Reasoning Segmentation via Large Language Model
lightweight package to simplify LLM API calls
Open-source model for program synthesis