Fast backend for long-term AI user memory via structured profiles
Provider-agnostic, open-source evaluation infrastructure
A HTML5 video player with a parser that saves traffic
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Foundational model for human-like, expressive TTS
Sample code and notebooks for Generative AI on Google Cloud
Code for running inference with the SAM 3D Body Model 3DB
Unified Multimodal Understanding and Generation Models
A MCP for Claude Desktop / Claude Code / Windsurf / Cursor
An AI-powered security review GitHub Action using Claude
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Renderer for the harmony response format to be used with gpt-oss
kaldi-asr/kaldi is the official location of the Kaldi project
Brokk brings code intelligence to AI
CUDA Templates for Linear Algebra Subroutines
A high-level machine learning and deep learning library for PHP
Python package built to ease deep learning on graph
Face recognition with deep neural networks
SSH User Management With Add/Delete Users
Hummingbird compiles trained ML models into tensor computation
Workflow and speech recognition app
Build Vision Agents quickly with any model or video provider
An Open Source text-to-speech system built by inverting Whisper
ComfyUI integration for Microsoft's VibeVoice text-to-speech model