CogView4, CogView3-Plus and CogView3(ECCV 2024)
Just a Better Chatbot. Powered by MCP Client & Workflows
Label Studio is a multi-type data labeling and annotation tool
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Generate audiobooks from e-books
A Pioneering Open-Source Alternative to GPT-4o
A computer vision closed-loop learning platform
Gemma open-weight LLM library, from Google DeepMind
Benchmarking Multimodal Agents for Open-Ended Tasks
General-purpose image editing model that delivers high-fidelity
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A frontier, first-principles handbook
Motion-controllable Video Generation via Latent Trajectory Guidance
Python package for AutoML on Tabular Data with Feature Engineering
Qwen3-omni is a natively end-to-end, omni-modal LLM
An open phone agent model & framework
InvokeAI is a leading creative engine for Stable Diffusion models
Doom-based AI research platform for reinforcement learning
GitLab automatic code review tool based on large models
Claude code for everything except coding
From Addition, Subtraction, Multiplication, and Division to ML
Flock is a workflow-based low-code platform for building chatbots
Agent Skill for generating 2D sprite sheets and map, transparent PNG
Open multimodal web agent built by Ai2
No-code LLM Platform to launch APIs and ETL Pipelines