Any model. Any hardware. Zero compromise
TensorRT LLM provides users with an easy-to-use Python API
Open source RAG framework for building scalable modular AI apps
Superduper: Integrate AI models and machine learning workflows
NLP Cloud serves high performance pre-trained or custom models for NER
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Determined, deep learning training platform
A refreshing functional take on deep learning
Build voice-based LLM agents. Modular + open source
No-code multi-agent framework to build LLM Agents, workflows
A library for deep learning end-to-end dialog systems and chatbots
The fastest way to build data pipelines
Play couplet with seq2seq model
On the Structural Pruning of Large Language Models
A simple, performant and scalable Jax LLM
A lightweight framework for building LLM-based agents
A new open-source framework to build and deploy intelligent agents
High-performance inference framework for large language models
High-performance Inference and Deployment Toolkit for LLMs and VLMs
One API call, pull Claude agent, completely sandboxed
The open-source data curation platform for LLMs
All-in-one AI productivity platform with agents, workflows, and IM
Implement CPU from scratch and play with large model deployments
Generative AI reference workflows
Build and run agents you can see, understand and trust