FastGPT is a knowledge-based platform built on LLMs
Towards Real-World Vision-Language Understanding
Agent S: an open agentic framework that uses computers like a human
Qwen3-omni is a natively end-to-end, omni-modal LLM
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Next-generation AI Agent Optimization Platform
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA"
Reference PyTorch implementation and models for DINOv3
An AI-based intelligent recipe generation platform
DNN && GAN && NLP && BIG DATA
Machine learning image inpainting task that removes watermarks
Roadmap to becoming an Artificial Intelligence Expert in 2022
OpenClaw as your girlfriend
Annotate and review coding agent plans visually, share with your team
Just a Better Chatbot. Powered by MCP Client & Workflows
Postman for MCPs - A tool for testing and debugging MCPs
AI tool for automatic batch short video creation and editing
Foundation model for image generation
All-in-one AI productivity platform with agents, workflows, and IM
Label Studio is a multi-type data labeling and annotation tool
Let's make video diffusion practical
A Claude Code Skill that turns prompts into magazine-style HTML decks