Netflix’s Workflow Orchestrator
Faster and easier training and deployments
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Running large language models on a single GPU
The open source post-building layer for agents
Agent framework that enables tool-use agent tasks
A @ClickHouse fork that supports high-performance vector search
An orchestration framework for agentic AI and LLM applications
Evaluate your LLM's response with Prometheus and GPT4
Alibaba's high-performance LLM inference engine for diverse apps
Fetch source code for npm packages
A large-scale model of medical consultation in Chinese
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Uncertainty Quantification for Language Models, is a Python package
Hypernetworks that adapt LLMs for specific benchmark tasks
MemoryOS is designed to provide a memory operating system
UCCL is an efficient communication library for GPUs
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
Jlama is a modern LLM inference engine for Java
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
AI-powered markdown editor - leverage LLMs with your documents
An open-source, code-first Java toolkit