Pixeltable is an open-source Python data infrastructure framework designed to support the development of multimodal AI applications. The system provides a declarative interface for managing the entire lifecycle of AI data pipelines, including storage, transformation, indexing, retrieval, and orchestration of datasets. Unlike traditional architectures that require multiple tools such as databases, vector stores, and workflow orchestrators, Pixeltable unifies these functions within a table-based abstraction. Developers define data transformations and AI operations using computed columns on tables, allowing pipelines to evolve incrementally as new data or models are added. The framework supports multimodal content including images, video, text, and audio, enabling applications such as retrieval-augmented generation systems, semantic search, and multimedia analytics.
Features
- Declarative table-based infrastructure for AI data pipelines
- Incremental storage transformation and indexing of multimodal data
- Support for images video audio documents and text datasets
- Embedding search and retrieval for RAG and semantic search systems
- Integration with machine learning inference and AI providers
- Unified orchestration of storage transformation retrieval and serving