GUI for a Vocal Remover that uses Deep Neural Networks
Python scraper based on AI
The most powerful and modular diffusion model GUI, api and backend
A python library that makes AMR parsing, generation and visualization
A state-of-the-art open visual language model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
StreamSpeech is a seamless model for offline speech recognition
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Witness the aha moment of VLM with less than $3
Elyra extends JupyterLab with an AI centric approach
Image polygonal annotation with Python
An open sourced end-to-end VLM-based GUI Agent
Game Boy emulator written in Python
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Agent framework and applications built upon Qwen>=3.0
Framework and no-code GUI for fine-tuning LLMs
GUI/CLI tool for downloading Xiaohongshu
Generate audiobooks from e-books
Agents write python code to call tools and orchestrate other agents
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
The AI toolkit for the AI developer
Convert AI papers to GUI
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Turn your website into a GIF
Sharp Monocular Metric Depth in Less Than a Second