High-performance inference server for text embedding models (API layer)
Fast, flexible LLM inference
High-performance, multiplayer code editor from the creators of Atom
Fast, local-first web content extraction for LLMs
Python-free Rust inference server
Rust async runtime based on io_uring
Fast ML inference & training for ONNX models in Rust
Fast and efficient unstructured data extraction
Convert codebases into structured prompts optimized for LLM analysis