OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems. It includes persistent memory, built-in web search, on-device embeddings, and sandboxing for safer execution. OpenSquilla is designed for users who want stronger agent capabilities without wasting tokens on every interaction. Its main value is combining cost-aware routing, durable context, and multi-channel agent execution in one local runtime.
Features
- Token-efficient microkernel agent runtime
- CLI, Web UI, and chat-channel support
- Local model routing across providers
- Persistent memory and on-device embeddings
- Layered sandbox and web search support
- Shared turn loop for consistent agent behavior