A Python toolbox for gaining geometric insights
Benchmarking Multimodal Agents for Open-Ended Tasks
Static Analyzer for Solidity
RAG-Anything: All-in-One RAG Framework
PaddlePaddle End-to-End Development Toolkit
Open-source evaluation toolkit for large multi-modality models (LMMs)
AI-Powered Wiki Generator for GitHub/GitLab/Bitbucket Repositories
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Multimodal Diffusion with Representation Alignment
Zero-code platform for building AI agents from natural language input
A Python library for extracting structured information
Chinese and English multimodal conversational language model
Data manipulation and transformation for audio signal processing
Gemma open-weight LLM library, from Google DeepMind
Virtual AI anchor that combines state-of-the-art technology
InvokeAI is a leading creative engine for Stable Diffusion models
3D plotting and mesh analysis through a streamlined interface
AI tool that converts GitHub repositories into interactive diagrams
Detects phishing and lookalike domains using DNS fuzzing techniques
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Open-source platform for building enterprise-grade agents
Open-source feature flagging and remote config service
State-of-the-art Image & Video CLIP, Multimodal Large Language Models