Helps data scientists define testable self-documenting dataflows
Apache OpenNLP
A free, open-source, and cross-platform big data analytics framework
A GUI Agent app based on UI-TARS to control your computer using AI
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A state-of-the-art open visual language model
Real-World Centric Foundation GUI Agents
Agent framework and applications built upon Qwen>=3.0
Open Source OCR Engine
Framework and no-code GUI for fine-tuning LLMs
UI-TARS-desktop version that can operate on your local personal device
An open sourced end-to-end VLM-based GUI Agent
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
GUI for a Vocal Remover that uses Deep Neural Networks
Generate audiobooks from e-books, voice cloning & 1107+ languages
The most powerful and modular diffusion model GUI, api and backend
Speech-to-text, text-to-speech, and speaker recognition
A self-hostable CDN for databases
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
GUI Exploration Lab. One of the best GUI agent solutions
A high-throughput and memory-efficient inference and serving engine
Testing tool for modeling GUI transitions
Fast stable diffusion on CPU and AI PC