TensorStore is a high-performance library for reading and writing N-dimensional arrays that live in many different storage systems, from local files to cloud object stores. It separates the logical view (shape, dtype, chunking) from the physical layout so the same code can target Zarr, N5, TIFF pyramids, or custom backends. Rich indexing, slicing, and broadcasting operations make it feel like a familiar array API, while asynchronous I/O pipelines stream chunks efficiently in parallel. Transactional semantics allow atomic updates and consistent snapshots, which is essential for large, shared datasets used by ML and scientific workflows. The library is engineered for scalability—background caching, chunk sharding, and retryable operations keep throughput high even over unreliable networks. With language bindings, it fits into Python-heavy analysis pipelines while retaining a fast C++ core.
Features
- Uniform array API over many on-disk and cloud formats
- Asynchronous, parallel chunked I/O with caching
- Transactional reads and writes for atomic updates
- Flexible indexing, slicing, and dtype conversions
- Pluggable drivers for Zarr, N5, TIFF, and custom backends
- C++ core with Python bindings for ML and science workflows