OCR model for complex documents with layout-aware structured outputs
A comprehensive quantitative trading system with AI-powered analysis
Skywork-R1V is an advanced multimodal AI model series
Code and models for the ICML 2024 paper NExT-GPT
Follow along with my AI Agents Masterclass videos
gpt-oss-120b and gpt-oss-20b are two open-weight language models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
From-scratch PyTorch implementation of Google's TurboQuant
TFX is an end-to-end platform for deploying production ML pipelines
The official Python SDK for Model Context Protocol servers and clients
Open Source Differentiable Computer Vision Library
Controllable & emotion-expressive zero-shot TTS
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Renderer for the harmony response format to be used with gpt-oss
Advanced LLM-powered brute-force tool combining AI intelligence
Official inference framework for 1-bit LLMs
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Official Implementation for "Recursive Multi-Agent Systems"
ClawTeam: Agent Swarm Intelligence (One Command → Full Automation)
Deep and Machine Learning for Microscopy
AI video agents framework for next-gen video interactions
Global weather forecasting model using graph neural networks and JAX
kaldi-asr/kaldi is the official location of the Kaldi project
This repository provides an advanced RAG
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs