Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Genome modeling and design across all domains of life
TensorRT LLM provides users with an easy-to-use Python API
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
AI video generator optimized for low-VRAM and older GPUs
Traditional Mandarin LLMs for Taiwan
FlashMLA: Efficient Multi-head Latent Attention Kernels
Fast and memory-efficient exact attention
Run your own AI cluster at home with everyday devices
CV-CUDA™ is an open-source, GPU-accelerated library
Open-source alternative to ChatGPT that runs 100% offline
A nearly-live implementation of OpenAI's Whisper
Project Lyra: Open Generative 3D World Models
Document content and metadata extraction microservice
950-line, minimal, extensible LLM inference engine built from scratch
OpenShell is the safe, private runtime for autonomous AI agents
On-device wake word detection powered by deep learning
The Triton Inference Server provides an optimized cloud
A set of Docker images for training and serving models in TensorFlow
oneAPI Deep Neural Network Library (oneDNN)
InvokeAI is a leading creative engine for Stable Diffusion models
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
An open-source, end-to-end, VLM-based GUI agent
Instant neural graphics primitives: lightning-fast NeRF and more
A GPU-accelerated library containing highly optimized building blocks