Multilingual Automatic Speech Recognition with word-level timestamps
Graph Neural Network Library for PyTorch
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Sparsity-aware deep learning inference runtime for CPUs
Generate audiobooks from e-books
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Standardized Serverless ML Inference Platform on Kubernetes
An Efficient Agentic Model for Computer Use
Open-source evaluation toolkit for large multi-modality models (LMMs)
Unleashing 10,000+ Word Generation from Long Context LLMs
The repository provides code for running inference with SAM 2
Agent S: an open agentic framework that uses computers like a human
Fast State-of-the-Art Static Embeddings
Gemma open-weight LLM library, from Google DeepMind
Core ML Tools contains supporting tools for Core ML model conversion
The official PyTorch implementation of Google's Gemma models
Text and image to video generation: CogVideoX and CogVideo
Official inference framework for 1-bit LLMs
The largest collection of PyTorch image encoders / backbones
A modular high-level library to train embodied AI agents
AIMET is a library that provides advanced quantization and compression
Code for Cicero, an AI agent that plays the game of Diplomacy
Python package built to ease deep learning on graphs
Test-Time Reinforcement Learning
MiroThinker is an open-source deep research agent