"Big Model" trains a visual multimodal VLM with 26M parameters
Generate blog articles from video or audio
Controllable and fast Text-to-Speech for over 7000 languages
Tooling for the Common Objects In 3D dataset
Inference Llama 2 in one file of pure C
LLM based autonomous agent that does online comprehensive research
Superfast AI decision making and processing of multi-modal data
Images to inference with no labeling
Find the Root Cause in Your Code's Trace
Open-weight, large-scale hybrid-attention reasoning model
Real-time voice interactive digital human
Fundamentals of Machine Learning and Deep Learning
OpenMLDB is an open-source machine learning database
Examples and guides for using the OpenAI API
Taming Stable Diffusion for Lip Sync
A set of Docker images for training and serving models in TensorFlow
No-code LLM Platform to launch APIs and ETL Pipelines
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Follow along with my AI Agents Masterclass videos
Framework for building neural networks
Omnilingual ASR Open-Source Multilingual SpeechRecognition
FAIR Sequence Modeling Toolkit 2
Official DeiT repository
Anthropic's educational courses
Diffusion Transformer with Fine-Grained Chinese Understanding