DepGraph: Towards Any Structural Pruning
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
AWS Skills for Agents
Train multi-step agents for real-world tasks using GRPO
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Multimodal model achieving SOTA performance
Set of guides meant to help explain often-times complex pricing
Python open source project "The Road to Self-Study Programming"
Fast Differentiable Tensor Library in JavaScript & TypeScript with Bun
MobileLLM Optimizing Sub-billion Parameter Language Models
A high-performance distributed file system
BISHENG is an open LLM devops platform for next generation apps
High-Performance Symbolic Regression in Python and Julia
Library for Rapid (Web) Crawler and Scraper Development
Datalog variant for tool designers crafting analyses in Horn clauses
A markdown parser written in Go. Easy to extend, standard, compliant
Rapid Web Development w/ Go
The easiest way to use deep metric learning in your application
Personal mini-web in text
Cloud environment inspector
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Add expert Swift Concurrency guidance to your AI coding tool
New family of code large language models (LLMs)
The python library for real-time communication