Automate browser-based workflows with LLMs and Computer Vision
UI-TARS-desktop version that can operate on your local personal device
Modest natural-language processing
Memory engine and app that is extremely fast, scalable
Low code project to build admin panels, internal tools, and dashboards
Open Source and Free Alternative to ChatGPT Atlas
An open phone agent model & framework
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Physical Symbolic Optimization
Stanford NLP Python library for many human languages
Qwen3-omni is a natively end-to-end, omni-modal LLM
Document Management System and Content Management System
A C library for parsing/normalizing street addresses around the world
Python library for model interpretation/explanations
Computerized guideline editor for clinical decision support
Free OMR - OCR web sofware based on javascript and PHP
DBpedia Spotlight is a tool for automatically annotating
Multimodal Transformer for document image understanding and layout
Tencent’s 36-language state-of-the-art translation model
Multimodal 7B model for image, video, and text understanding tasks