TT-NN operator library, and TT-Metalium low-level kernel programming
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
Fast Multimodal LLM on Mobile Devices
A high-performance inference engine for AI models
Korvus is a search SDK that unifies the entire RAG pipeline
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Open-source LLM load balancer and serving platform for hosting LLMs
Recipes to train reward models for RLHF
A selection of technology stacks and tool repositories
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
An Efficient Web-enhanced Question Answering System
Open-Source Analytics Infrastructure
Bringing BERT into modernity via both architecture changes and scaling
An Easy-to-Use and High-Performance AI Deployment Framework
A Unified MCP Server Management App (MCP Manager)
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
AI Powered Knowledge Graph Generator
Make your agents learn from experience
Autoregressive Model Beats Diffusion
dLLM: Simple Diffusion Language Modeling
An agentless approach to automatically solve software development problems