Public opinion analysis system
A Web UI for easy subtitle using whisper model
Benchmark LLMs by fighting in Street Fighter 3
Project Lyra: Open Generative 3D World Models
An Open Source package that allows video game creators
OCR expert VLM powered by Hunyuan's native multimodal architecture
The Cradle framework is a first attempt at General Computer Control
A lightweight vision library for performing large object detection
Build cross-modal and multimodal applications on the cloud
AI-powered tool to quickly remove watermarks from videos flawlessly
An extremely simple tool for separating vocals and background music
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Based on the Disco Diffusion, version of the AI art creation software
Telegram Group Calls Streaming bot with some useful features
Face Mask Detection system based on computer vision and deep learning
We estimate dense, flicker-free, geometrically consistent depth
Pytorch implementation of our method for high-resolution