A Customizable Image-to-Video Model based on HunyuanVideo
Training framework for Stable Baselines3 reinforcement learning agents
A solution to build and deploy MCP agents and applications
Benchmarking Multimodal Agents for Open-Ended Tasks
Implementation of RLHF (Reinforcement Learning with Human Feedback)
OpenAI + LINE + Vercel = GPT AI Assistant
Inference code for CodeLlama models
Educational framework exploring multi-agent orchestration
Node.js client for the official ChatGPT API. 🔥
Research code artifacts for Code World Model (CWM)
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Reference PyTorch implementation and models for DINOv3
An experimental version of DeepSeek model
Models for object and human mesh reconstruction
A neural network that transforms a design mock-up into static websites
SAPIEN Manipulation Skill Framework
Superduper: Integrate AI models and machine learning workflows
Easily turn large sets of image urls to an image dataset
NVIDIA Federated Learning Application Runtime Environment
Open source platform for the machine learning lifecycle
Structure-from-Motion and Multi-View Stereo
⚡ Building applications with LLMs through composability ⚡