A modular graph-based Retrieval-Augmented Generation (RAG) system
Low-latency REST API for serving text-embeddings
Qwen3-Coder is the code version of Qwen3
Large-language-model & vision-language-model based on Linear Attention
MiniMax M2.1, a SOTA model for real-world dev & agents.
AI Browser Automation
Open-source observability for your LLM application
State-of-the-art LLM and coding model
Private Open AI on Kubernetes
A native AI PPT generation application based on Nano Banana Pro
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Seamlessly integrate LLMs into scikit-learn
AI Agent designed to interact with ROS1- and ROS2-based robotics systems
Test-Time Reinforcement Learning
Papers integrating knowledge graphs (KGs) and large language models
Helps developers deploy LangChain runnables and chains as a REST API
A @ClickHouse fork that supports high-performance vector search
Take control of your AI agents
Open-Source Analytics Infrastructure
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Open Source Deep Research Alternative to Reason and Search
Query anything (GitHub, Notion, +40 more) with SQL and let LLMs
Implement CPU from scratch and play with large model deployments
Integrate cutting-edge LLM technology quickly and easily into your app
Build ChatGPT over your data, all with natural language