AWS SDK for pandas

aws-sdk-pandas (formerly AWS Data Wrangler, imported in Python as awswrangler) connects pandas to the AWS analytics stack so that DataFrames can be moved to and from cloud services. With a few lines of code, you can read and write Parquet, CSV, JSON, and ORC on Amazon S3, register tables in the AWS Glue Data Catalog, and load Amazon Athena query results directly into pandas. The library wraps common patterns such as partitioning, compression, and vectorized I/O, so data lake operations perform well without hand-written boilerplate. It also supports Redshift, OpenSearch, and other services, which enables ETL tasks that combine SQL engines with Python transformations. Helper functions handle IAM, sessions, and concurrency, and expose options for encryption, versioning, and catalog consistency. The result is a workflow that keeps analytics in Python while using AWS-native storage and query engines at scale.
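As a rough illustration of the S3, Glue, and Athena round trip described above, the following is a minimal sketch rather than a reference implementation. The function name write_and_query and its arguments are hypothetical. It assumes the awswrangler package is installed, AWS credentials are configured, and the Glue database already exists.

```python
def write_and_query(df, path, database, table):
    """Write df to S3 as Parquet, register it in Glue, and read it back via Athena.

    Hypothetical helper for illustration; path, database, and table are
    supplied by the caller, and the Glue database is assumed to exist.
    """
    # Imported inside the function so this sketch can be loaded even where
    # awswrangler is not installed.
    import awswrangler as wr

    # dataset=True turns on dataset-style writes; passing database and table
    # registers the schema in the Glue Data Catalog.
    wr.s3.to_parquet(
        df=df,
        path=path,
        dataset=True,
        database=database,
        table=table,
        mode="overwrite",
    )

    # Athena executes the SQL and the result is returned as a pandas DataFrame.
    return wr.athena.read_sql_query(
        sql=f'SELECT * FROM "{table}" LIMIT 10',
        database=database,
    )
```

A call might look like write_and_query(df, "s3://my-bucket/sales/", "analytics", "sales"), where the bucket, database, and table names are placeholders.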

Features

High-level read/write of DataFrames to S3 with Parquet, CSV, JSON, and ORC
Tight integration with AWS Glue Catalog and Athena for schema and SQL queries
Convenience methods for Redshift COPY/UNLOAD and data migration patterns
Automatic handling of partitions, compression, and columnar formats
Session and IAM helpers with options for encryption and versioning
Scalable I/O paths optimized for large data lake workloads
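Several of the features listed above (partitions, compression, sessions, and encryption) can be combined in a single write. The sketch below is one possible arrangement, not the project's prescribed usage. The function name write_partitioned and all argument values are hypothetical, and it assumes awswrangler and boto3 are installed with credentials that may use the given KMS key.

```python
def write_partitioned(df, path, partition_cols, kms_key_id, region_name):
    """Write df as a partitioned, compressed, KMS-encrypted Parquet dataset.

    Hypothetical helper for illustration; every argument is caller-supplied.
    Returns the list of S3 object paths that were written.
    """
    # Imported inside the function so this sketch can be loaded even where
    # the AWS libraries are not installed.
    import boto3
    import awswrangler as wr

    # An explicit session pins the region (and, if needed, a profile or role).
    session = boto3.Session(region_name=region_name)

    result = wr.s3.to_parquet(
        df=df,
        path=path,
        dataset=True,
        partition_cols=partition_cols,   # written as col=value/ prefixes
        compression="snappy",
        mode="overwrite_partitions",     # replace only the partitions present in df
        boto3_session=session,
        s3_additional_kwargs={
            "ServerSideEncryption": "aws:kms",
            "SSEKMSKeyId": kms_key_id,
        },
    )
    return result["paths"]
```

With partition_cols=["year", "month"], objects would be expected under prefixes such as year=2024/month=1/ beneath the target path, which lets Athena prune partitions when a query filters on those columns.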

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python Data Science Tool

Registered

2 days ago

AWS SDK for pandas

Easy integration with Athena, Glue, Redshift, Timestream, Neptune
