Private chat with local GPT with document, images, video, etc.
A simple screen parsing tool towards pure vision based GUI agent
A Heterogeneous Benchmark for Information Retrieval
LLM based data scientist, AI native data application
The official Python SDK for Model Context Protocol servers and clients
Build resilient language agents as graphs
Agentic LLM Vulnerability Scanner / AI red teaming kit
Medical imaging toolkit for deep learning
Graph Neural Network Library for PyTorch
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
High-Fidelity and Controllable Generation of Textured 3D Assets
Real-time voice interactive digital human
Scalable machine learning for time series forecasting
Git-based data version control for machine learning workflows
From Paper to Presentation in One Click
Bash is all you need, write a claude code with only 16 line code
Fast backend for long-term AI user memory via structured profiles
SOTA discrete acoustic codec models with 40/75 tokens per second
AI video agents framework for next-gen video interactions
Data and tools for generating and inspecting OLMo pre-training data
Efficient Retrieval Augmentation and Generation Framework
Leveraging BERT and c-TF-IDF to create easily interpretable topics
LLM-based Reinforcement Learning audio edit model
GUI Exploration Lab. One of the best GUI agent solutions
Large-language-model & vision-language-model based on Linear Attention