...Streaming responses are built in: a streaming call returns an async generator, so applications can render output progressively instead of waiting for the full response. The library also supports cloud-hosted usage by pointing the client at Ollama’s cloud endpoint with an API key, which preserves the familiar local-first workflow for developers who want to move between local and remote execution.
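
As a rough illustration of the two ideas above, the following Python sketch shows how an application might consume an async generator chunk by chunk, and how it might switch between local and cloud settings. It does not call the library itself: `fake_stream`, `render`, and `client_settings` are hypothetical names made up for this example, `fake_stream` stands in for a real streaming call, and the cloud host and `Bearer` header format are assumptions that should be checked against Ollama's documentation.

```python
import asyncio
from typing import AsyncIterator, Dict, Optional

LOCAL_HOST = "http://localhost:11434"  # Ollama's default local address
CLOUD_HOST = "https://ollama.com"      # assumed cloud endpoint; verify in the docs


def client_settings(api_key: Optional[str] = None) -> Dict[str, object]:
    """Hypothetical helper: pick cloud settings when an API key is given."""
    if api_key:
        # Assumed header format; not confirmed by the text above.
        return {"host": CLOUD_HOST, "headers": {"Authorization": f"Bearer {api_key}"}}
    return {"host": LOCAL_HOST, "headers": {}}


async def fake_stream(text: str) -> AsyncIterator[str]:
    """Stand-in for a streaming call: yields one word at a time."""
    for word in text.split():
        await asyncio.sleep(0)  # hand control back to the event loop
        yield word + " "


async def render(chunks: AsyncIterator[str]) -> str:
    """Print each chunk as it arrives and return the assembled text."""
    parts = []
    async for chunk in chunks:
        print(chunk, end="", flush=True)  # progressive output
        parts.append(chunk)
    print()
    return "".join(parts)


if __name__ == "__main__":
    asyncio.run(render(fake_stream("tokens arrive one at a time")))
```

With a real client, the argument to `render` would be whatever async generator the streaming call returns, mapped to text chunks. The rendering loop stays the same whether the settings point at the local server or the cloud endpoint, which is the point of keeping the workflow local-first.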