A python library for self-supervised learning on images
Build cross-modal and multimodal applications on the cloud
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
UI-TARS-desktop version that can operate on your local personal device
Multi-modal large language model designed for audio understanding
Concatenate a directory full of files into a single prompt
Swirl queries any number of data sources with APIs
Multi-Voice and Prompt-Controlled TTS Engine
Example Discord bot written in Python that uses the completions API
A free and reliable P2P BitTorrent client
Transformers4Rec is a flexible and efficient library
High quality, fast, modular reference implementation of SSD in PyTorch
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments
A deep learning toolkit for Text-to-Speech, battle-tested in research
Hewlett-Packard's Linux imaging and printing software
An Autonomous LLM Agent for Complex Task Solving
Multi-functional autoclicker with keyboard presser
Blazingly Fast & Customizable Linux distribution
32/64 bit multi-platform Ethernet S7 PLC communication suite
Optimized Workforce Learning for General Multi-Agent Assistance
Cooperative multiplayer graphical RPG and adventure game
Embed images and sentences into fixed-length vectors
Air traffic control tower and radar simulator (solo + multi-player)
Official code for Style Aligned Image Generation via Shared Attention