Tools like web browser, computer access and code runner for LLMs
Developer AI Persona Search Agent
Agentic LLM Vulnerability Scanner / AI red teaming kit
Time-lapse Video Generation Models as Metamorphic Simulators
Easily turn large sets of image urls to an image dataset
Implementation of a U-net complete with efficient attention
Fast image augmentation library and an easy-to-use wrapper
Open Source Computer Vision Library
Supercharge Your LLM Application Evaluations
Fault-tolerant, highly scalable GPU orchestration
Recovering the Visual Space from Any Views
Spark-TTS Inference Code
Director, Screenwriter, Producer, and Video Generator All-in-One
Browser automation for AI agents and humans
Hunyuan Translation Model Version 1.5
LTX-Video Support for ComfyUI
UI-TARS-desktop version that can operate on your local personal device
Fast and accurate AI powered file content types detection
PPTAgent: Generating and Evaluating Presentations
CLIP, Predict the most relevant text snippet given an image
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Multilingual Automatic Speech Recognition with word-level timestamps
Developers and anyone seeking an LLM solution to scan for vulnerabilit
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Uncover insights, surface problems, monitor, and fine tune your LLM