Build Vision Agents quickly with any model or video provider
Visual Causal Flow
Improve your resumes with Resume Matcher
Fast multimodal LLM for real-time voice interaction and AI apps
Diffusion Transformer with Fine-Grained Chinese Understanding
A simple and easy-to-use library for interacting with the Ollama API
The headless Chrome/Chromium driver on top of Puppeteer
Large-language-model & vision-language-model based on Linear Attention
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A comprehensive PHP Generative AI Framework
go1pylib is a Python library designed to control the Go1 robot
Clean network diagrams, One-time setup, zero upkeep
AI tool that turns Hacker News posts into daily podcast updates
online video editor built with nextjs, remotion and ffmpeg
AI tool for automatic batch short video creation and editing
Running large language models on a single GPU
Open source AI VTuber platform with voice chat and Live2D avatars
An LLM-based presentation generation platform
Ultra-Efficient LLMs on End Device
Learn How LLM Transformer Models Work with Interactive Visualization
Multimodal model achieving SOTA performance
A Python library for extracting structured information
Audio foundation model excelling in audio understanding
Export and Share your ChatGPT conversation history