UI-TARS-desktop version that can operate on your local personal device
Python chatbot framework with Natural Language Understanding
An open-source RAG-based tool for chatting with your documents
Unified web UI for training and running open models locally
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Document Image Parsing via Heterogeneous Anchor Prompting”
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Agent S: an open agentic framework that uses computers like a human
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Low-level Python library used to interact with a Substra network
Open source RAG framework for building scalable modular AI apps
Text and image to video generation: CogVideoX and CogVideo
Autonomous LLM agent for end-to-end data science workflows
Enable AI to control your desktop, mobile and HMI devices
The most powerful Android RPA agent framework
Python tool for browser-based interactive data apps in one file
GUI Exploration Lab. One of the best GUI agent solutions
Deploy and share agents with open infrastructure
Language-model investigation agent with a terminal UI
Code to accompany "A Method for Animating Children's Drawings"
Improve human sleep through scientifically
Modular quant framework
Open-sourced unified customization model
The best ChatGPT that $100 can buy
The open-source data curation platform for LLMs