Use Microsoft Edge's online text-to-speech service from Python
Open-source MCP server that gives your coding agent
New Modpack with Gregtech, Thaumcraft and Witchery
Label Studio is a multi-type data labeling and annotation tool
Easy-to-use Python library for creating 2D arcade games
Sharp Monocular Metric Depth in Less Than a Second
Extracts semi-random frames from all MP4 videos
The Unofficial TikTok API Wrapper In Python
A beautiful, powerful, self-hosted ROM manager and player
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The data structure for multimodal data
GUI/CLI tool for downloading Xiaohongshu
Generating Immersive, Explorable, and Interactive 3D Worlds
PyTorch code and models for V-JEPA self-supervised learning from video
21 Lessons, Get Started Building with Generative AI
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
OCR expert VLM powered by Hunyuan's native multimodal architecture
Uncommon Objects in 3D dataset
InvokeAI is a leading creative engine for Stable Diffusion models
Dealing with all unstructured data, such as reverse image search
🎥 A free & open-source Python tool to remove unwanted objects from videos frame-by-frame using brush masking and AI inpainting (OpenCV + FFmpeg). EXE included.
myplayer Free Karaoke & Media Player Software (Myanmar)
A feature-rich event management system
Free Motion Capture for Everyone
Less rage, more chill