Evaluate your LLM's response with Prometheus and GPT4
General proxy performance testing tool based on Clash using Telegram
Helps data scientists define testable self-documenting dataflows
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Comprehensive paid advertising audit & optimization skill
Lightweight framework for evaluating large language model performance
GUI Exploration Lab. One of the best GUI agent solutions
Evaluation suite designed to assess the performance of LLMs
A framework that facilitates all stages of LLM development
One-stop solution for creating your digital avatar from chat history
A minimal yet professional single agent demo project
The open source post-building layer for agents
ComfyUI wrapper nodes for WanVideo and related models
DoWhy is a Python library for causal inference
Apple Silicon (MLX) port of Karpathy's autoresearch
Synthetic Data Generation for tabular, relational and time series data
Democratizing AI scientists with ToolUniverse
Open platform for building, deploying, and managing LLM agents
AI-powered penetration testing assistant using local LLM on linux
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Semi-Structured Agentic Framework. Workflows build themselves
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A unified, comprehensive and efficient recommendation library
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction