GPU accelerated decision optimization
An efficient forwarding service designed for LLMs
MoBA: Mixture of Block Attention for Long-Context LLMs
Persistent context and multi-instance coordination
A Unified Framework for Image Customization
ChatGPT interface with better UI
Unleash Next-Level AI
Local AI coding agent CLI with multi-agent orchestration tools
Build and run agents you can see, understand and trust
Pruna is a model optimization framework built for developers
The most accurate natural language detection library for Python
Standardized Serverless ML Inference Platform on Kubernetes
Parallax is a distributed model serving framework
ZAPI by Adopt AI is an open-source Python library
Ultimate meta-skill for generating best-in-class Claude Code skills
Superfast AI decision making and processing of multi-modal data
Low-latency AI inference engine optimized for mobile devices
A Model Context Protocol (MCP) Gateway & Registry
Large-language-model & vision-language-model based on Linear Attention
AI Suite for upscaling, interpolating & restoring images/videos
Building Mixture-of-Experts from LLaMA with Continual Pre-training
e-Dokyumento is web-based Document Management System (DMS)
A PyTorch implementation of "Capsule Graph Neural Network"
A PyTorch implementation of the NIPS 2017 paper
The open source Algorithmic Trading System