NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
An industrial grade federated learning framework
An efficient forwarding service designed for LLMs
Framework for building AI-powered interactive digital humans and agents
Knowledge Graph Generation from Any Text
From Paper to Presentation in One Click
SOTA discrete acoustic codec models with 40/75 tokens per second
Global weather forecasting model using graph neural networks and JAX
Tooling for the Common Objects In 3D dataset
Request recommended movies, TV shows and anime to Jellyseerr/Overseerr
Interface for OuteTTS models
A solution to build and deploy MCP agents and applications
Build cross-modal and multimodal applications on the cloud
Python binding to the Apache Tika™ REST services
ChatGPT interface with better UI
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Multi-modal large language model designed for audio understanding
A minimal yet professional single-agent demo project
Swirl queries any number of data sources with APIs
Private chat with local GPT with documents, images, video, etc.
Official Repo for ICML 2024 paper
airda (Air Data Agent)
Framework for building AI agents that automate complex web tasks
Build high-performance AI models with modular building blocks