Port of OpenAI's Whisper model in C/C++
Code for running inference and finetuning with SAM 3 model
A Customizable Image-to-Video Model based on HunyuanVideo
A solution to build and deploy MCP agents and applications
Node.js client for the official ChatGPT API. 🔥
Benchmarking Multimodal Agents for Open-Ended Tasks
Implementation of RLHF (Reinforcement Learning with Human Feedback)
OpenAI + LINE + Vercel = GPT AI Assistant
Inference code for CodeLlama models
Educational framework exploring multi-agent orchestration
Reference PyTorch implementation and models for DINOv3
Research code artifacts for Code World Model (CWM)
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Central interface to connect your LLM's with external data
NVIDIA Federated Learning Application Runtime Environment
An experimental version of DeepSeek model
A neural network that transforms a design mock-up into static websites
SAPIEN Manipulation Skill Framework
Superduper: Integrate AI models and machine learning workflows
Volcano Engine Reinforcement Learning for LLMs
Easily turn large sets of image urls to an image dataset
Open source platform for the machine learning lifecycle