Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
Convert AI papers to GUI
Framework for building AI-powered interactive digital humans and agents
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
The standard data-centric AI package for data quality and ML
Bailing is a voice dialogue robot similar to GPT-4o
Stanford NLP Python library for many human languages
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Multi-modal large language model designed for audio understanding
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Visual Automation IDE — automate anything you see on screen
Mice speech to text with MX Cinnamon OS ISO
Graphical User Interface Face Anonymization Tool
PyPhoto - Image Editor
A subtitle generator for Japanese Adult Videos
Chat & pretrained large vision language model
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Air traffic control tower and radar simulator (solo + multi-player)
Mice speech-to-text (STT) and text-to-speech (TTS)
Run GGUF models easily with a UI or API. One File. Zero Install.
Turns the YouTube Music site into a desktop application.
A Python application to add watermarks (text or image) to PDF files