Pretrained time-series foundation model developed by Google Research
Generate Any 3D Scene in Seconds
This repository contains the official implementation of FastVLM
Hackable and optimized Transformers building blocks
CogView4, CogView3-Plus and CogView3(ECCV 2024)
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Global weather forecasting model using graph neural networks and JAX
Tooling for the Common Objects In 3D dataset
code for Mesh R-CNN, ICCV 2019
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
Renderer for the harmony response format to be used with gpt-oss
Diversity-driven optimization and large-model reasoning ability
Large Multimodal Models for Video Understanding and Editing
Large-language-model & vision-language-model based on Linear Attention
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Phi-3.5 for Mac: Locally-run Vision and Language Models
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
Programmatic access to the AlphaGenome model
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models