AI multi-agent framework for automating data-driven R&D workflows
Create prompt-friendly codebase digests from any Git repository URL
Faster and easier training and deployments
Designed for training LLM/VLM agents via RL
AI-Driven Exploration in the Space of Code
Traditional Mandarin LLMs for Taiwan
The SOTA Open-Source Browser Agent
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API
High-performance inference framework for large language models
Run PyTorch LLMs locally on servers, desktop and mobile
CLI tool for configuring and monitoring Claude Code
General-purpose image editing model that delivers high-fidelity
The power of Claude Code / GeminiCLI / CodexCLI
Pre-trained Deep Learning models and demos
Inference script for Oasis 500M
Fast and Universal 3D reconstruction model for versatile tasks
4M: Massively Multimodal Masked Modeling
ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for the DINOv2 self-supervised learning
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming