Synthesizing and manipulating 2048x1024 images with conditional GANs
Foundation Model for Tabular Data
An Efficient Agentic Model for Computer Use
Concurrent Python made simple
Phi-3.5 for Mac: Locally-run Vision and Language Models
A framework for the creation of autonomous agent services
Agents write python code to call tools and orchestrate other agents
a pluggable app that runs a full check on the deployment
AI-data warehouse to enrich, transform and analyze unstructured data
Windows GUI Automation with Python (based on text properties)
Tools to ease the creation of snippets, syntax definitions, etc.
Utility for sending notifications, on demand and when commands finish
14-stage Fusion Pipeline for LLM token compression
A Next-Generation Training Engine Built for Ultra-Large MoE Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GUI Exploration Lab. One of the best GUI agent solutions
Qwen3-omni is a natively end-to-end, omni-modal LLM
Swirl queries any number of data sources with APIs
Pure Python FFmpeg-based live video / audio streaming to YouTube
Audio Normalization for Python/ffmpeg
Transform a cold separation into a warm Skill
Just talk to your agent
Open-source AI research assistant for biomedicine
Machine learning image inpainting task that removes watermarks
A collection of machine learning examples and tutorials