Collections of robotics environments
kaldi-asr/kaldi is the official location of the Kaldi project
AI Toolkit for Healthcare Imaging
Qwen3-omni is a natively end-to-end, omni-modal LLM
The data structure for multimodal data
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-weight, large-scale hybrid-attention reasoning model
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
A Pioneering Open-Source Alternative to GPT-4o
Towards Real-World Vision-Language Understanding
Release for Improved Denoising Diffusion Probabilistic Models
Images to inference with no labeling
Get a ChatGPT plugin up and running in under 5 minutes
Official release of InternLM series
RNN with great LLM performance
Meta-Transformer for Unified Multimodal Learning
Enable sending and receiving images during chatting
Headless Rasa chatbot platform with LLM integration and APIs
LLaMA: Open and Efficient Foundation Language Models
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Code for the paper "PixelCNN++: A PixelCNN Implementation..."
Drench yourself in Deep Learning, Reinforcement Learning
Fast, modular reference implementation of Instance Segmentation
A multi-modeling and simulation environment to study complex systems