aws-sdk-pandas (formerly AWS Data Wrangler) bridges pandas with the AWS analytics stack so DataFrames flow seamlessly to and from cloud services. With a few lines of code, you can read from and write to Amazon S3 in Parquet/CSV/JSON/ORC, register tables in the AWS Glue Data Catalog, and query with Amazon Athena directly into pandas. The library abstracts efficient patterns like partitioning, compression, and vectorized I/O so you get performant data lake operations without hand-rolling boilerplate. It also supports Redshift, OpenSearch, and other services, enabling ETL tasks that blend SQL engines and Python transformations. Operational helpers handle IAM, sessions, and concurrency while exposing knobs for encryption, versioning, and catalog consistency. The result is a productive workflow that keeps your analytics in Python while leveraging AWS-native storage and query engines at scale.

Features

  • High-level read/write of DataFrames to S3 with Parquet, CSV, JSON, and ORC
  • Tight integration with AWS Glue Catalog and Athena for schema and SQL queries
  • Convenience methods for Redshift COPY/UNLOAD and data migration patterns
  • Automatic handling of partitions, compression, and columnar formats
  • Session and IAM helpers with options for encryption and versioning
  • Scalable I/O paths optimized for large data lake workloads

Project Samples

Project Activity

See All Activity >

Categories

Data Science

License

Apache License V2.0

Follow AWS SDK for pandas

AWS SDK for pandas Web Site

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of AWS SDK for pandas!

Additional Project Details

Programming Language

Python

Related Categories

Python Data Science Tool

Registered

2 days ago