Fast multimodal LLM for real-time voice interaction and AI apps
Diffusion Transformer with Fine-Grained Chinese Understanding
A simple and easy-to-use library for interacting with the Ollama API
The headless Chrome/Chromium driver on top of Puppeteer
Large language model and vision-language model based on linear attention
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A comprehensive PHP Generative AI Framework
go1pylib is a Python library designed to control the Go1 robot
Clean network diagrams. One-time setup, zero upkeep
AI tool that turns Hacker News posts into daily podcast updates
AutoGluon: AutoML for Image, Text, and Tabular Data
Online video editor built with Next.js, Remotion, and FFmpeg
Using AI models to automatically provide commentary and edit videos
Fast State-of-the-Art Tokenizers optimized for Research and Production
AI tool for automatic batch short video creation and editing
Running large language models on a single GPU
Edit videos with Claude Code
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Make writing Java HTTP clients easier
Open source AI VTuber platform with voice chat and Live2D avatars
An LLM-based presentation generation platform
Models for the spaCy Natural Language Processing (NLP) library
Ultra-Efficient LLMs on End Devices
Learn How LLM Transformer Models Work with Interactive Visualization