An HTML5 video player with a parser that saves traffic
Laravel-focused MCP server for augmenting AI-powered local development
Video understanding codebase from FAIR for reproducing video models
A Unified Framework for Text-to-3D and Image-to-3D Generation
A TypeScript SSE proxy for MCP servers that use stdio transport
"Big Model" trains a visual multimodal VLM with 26M parameters
Flexible Photo Recrafting While Preserving Your Identity
ENScan_GO is an enterprise information reconnaissance tool
Foundation Models for Time Series
Official implementation of DreamCraft3D
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Foundational model for human-like, expressive TTS
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
LLM-based agent for general purpose software engineering tasks
Large Multimodal Models for Video Understanding and Editing
Please do not feed the models
Massively parallel rigid-body physics simulation
CSGHub is a brand-new open-source platform for managing LLMs
Gateway service that instantly transforms existing MCP Servers
Sharp Monocular Metric Depth in Less Than a Second
Code for the paper "Language models can explain neurons in language models"
Demo of a customer service use case implemented with the OpenAI Agents
SAPIEN Manipulation Skill Framework
Sample code and notebooks for Generative AI on Google Cloud
NVIDIA Federated Learning Application Runtime Environment