Alternatives to pandas
Compare pandas alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to pandas in 2025. Compare features, ratings, user reviews, pricing, and more from pandas competitors and alternatives in order to make an informed decision for your business.
-
1
Tumult Analytics
Tumult Analytics
Built and maintained by a team of differential privacy experts, and running in production at institutions like the U.S. Census Bureau. Runs on Spark and effortlessly supports input tables containing billions of rows. Supports a large and ever-growing list of aggregation functions, data transformation operators, and privacy definitions. Perform public and private joins, filters, or user-defined functions on your data. Compute counts, sums, quantiles, and more under multiple privacy models. Differential privacy is made easy, thanks to our simple tutorials and extensive documentation. Tumult Analytics is built on our sophisticated privacy foundation, Tumult Core, which mediates access to sensitive data and means that every program and application comes with an embedded proof of privacy. Built by composing small, easy-to-review components. Provably safe stability tracking and floating-point primitives. Uses a generic framework based on peer-reviewed research. -
2
Vaex
Vaex
At Vaex.io we aim to democratize big data and make it available to anyone, on any machine, at any scale. Cut development time by 80%, your prototype is your solution. Create automatic pipelines for any model. Empower your data scientists. Turn any laptop into a big data powerhouse, no clusters, no engineers. We provide reliable and fast data driven solutions. With our state-of-the-art technology we build and deploy machine learning models faster than anyone on the market. Turn your data scientist into big data engineers. We provide comprehensive training of your employees, enabling you to take full advantage of our technology. Combines memory mapping, a sophisticated expression system, and fast out-of-core algorithms. Efficiently visualize and explore big datasets, and build machine learning models on a single machine. -
3
OpenRefine
OpenRefine
OpenRefine (previously Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data. OpenRefine always keeps your data private on your own computer until you want to share or collaborate. Your private data never leaves your computer unless you want it to. (It works by running a small server on your computer and you use your web browser to interact with it). OpenRefine can help you explore large data sets with ease. You can find out more about this functionality by watching the video below. OpenRefine can be used to link and extend your dataset with various webservices. Some services also allow OpenRefine to upload your cleaned data to a central database, such as Wikidata.. A growing list of extensions and plugins is available on the wiki. -
4
Polars
Polars
Knowing of data wrangling habits, Polars exposes a complete Python API, including the full set of features to manipulate DataFrames using an expression language that will empower you to create readable and performant code. Polars is written in Rust, uncompromising in its choices to provide a feature-complete DataFrame API to the Rust ecosystem. Use it as a DataFrame library or as a query engine backend for your data models. -
5
List & Label
combit
List & Label is a report generator for software developers to integrate reporting functions in their web, cloud and desktop applications. Made for development environments such as .NET, C#, Delphi, C++, ASP.NET, ASP.NET MVC, .NET Core etc. It is seamless to integrate, supports a huge variety of data sources and extends applications with extensive print, export and preview functions. With the WYSIWYG Report Designer, developers or end users create or edit different print templates for printing information that originates either from a database or another data source. In the Designer, you then have all the data at your disposal to prepare it for printing in different ways. The additionally included and entirely browser-based Web Report Designer for ASP.NET MVC offers more flexibility in development and is independent from printer drivers. Reports for web applications can be designed anywhere at any time in the browser of your choice.Starting Price: €650/license -
6
statsmodels
statsmodels
statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests and statistical data exploration. An extensive list of result statistics is available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open-source Modified BSD (3-clause) license. statsmodels supports specifying models using R-style formulas and pandas DataFrames. Have a look at dir(results) to see available results. Attributes are described in results.__doc__ and results methods have their own docstrings. You can also use numpy arrays instead of formulas. The easiest way to install statsmodels is to install it as part of the Anaconda distribution, a cross-platform distribution for data analysis and scientific computing. This is the recommended installation method for most users.Starting Price: Free -
7
warcat
Python Software Foundation
Tool and library for handling Web ARChive (WARC) files. Naively join archives into one. Extract files from archive. List commands available. List contents of archive. Load archive and write it back out. Split archives into individual records. Verify digest and validate conformance. The library may not be entirely thread-safe yet. The goal of the Warcat project is to create a tool and library as easily and fast as manipulating any other archive such as tar and zip archives. Warcat is designed to handle large, gzip-ed files by partially extracting them as needed. Warcat is provided without warranty and cannot guarantee the safety of your files. Remember to make backups and test them! A WARC file contains one or more records concatenated together. Each record contains named fields, newline, a content block, newline, and newline. A Content Block may be two types, {binary data} or {Named Fields, newline, and binary data}.Starting Price: Free -
8
h5py
HDF5
The h5py package is a Pythonic interface to the HDF5 binary data format. It lets you store huge amounts of numerical data, and easily manipulate that data from NumPy. For example, you can slice into multi-terabyte datasets stored on disk, as if they were real NumPy arrays. Thousands of datasets can be stored in a single file, categorized and tagged however you want. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. For example, you can iterate over datasets in a file, or check out the .shape or .dtype attributes of datasets. You don't need to know anything special about HDF5 to get started. In addition to the easy-to-use high level interface, h5py rests on a object-oriented Cython wrapping of the HDF5 C API. Almost anything you can do from C in HDF5, you can do from h5py.Starting Price: Free -
9
Pillow
Pillow
The Python Imaging Library adds image processing capabilities to your Python interpreter. This library provides extensive file format support, an efficient internal representation, and fairly powerful image processing capabilities. The core image library is designed for fast access to data stored in a few basic pixel formats. It should provide a solid foundation for a general image processing tool. Pillow for enterprise is available via the Tidelift subscription. The Python Imaging Library is ideal for image archival and batch processing applications. You can use the library to create thumbnails, convert between file formats, print images, etc. The current version identifies and reads a large number of formats. Write support is intentionally restricted to the most commonly used interchange and presentation formats. The library contains basic image processing functionality, including point operations, filtering with a set of built-in convolution kernels, and color space conversions.Starting Price: Free -
10
Avanzai
Avanzai
Avanzai helps accelerate your financial data analysis by letting you use natural language to output production-ready Python code. Avanzai speeds up financial data analysis for both beginners and experts using plain English. Plot times series data, equity index members, and even stock performance data using natural prompts. Skip the boring parts of financial analysis by leveraging AI to generate code with relevant Python packages already installed. Further edit the code if you wish, once you're ready copy and paste the code into your local environment and get straight to business. Leverage commonly used Python packages for quant analysis such as Pandas, Numpy, etc using plain English. Take financial analysis to the next level, quickly pull fundamental data and calculate the performance of nearly all US stocks. Enhance your investment decisions with accurate and up-to-date information. Avanzai empowers you to write the same Python code that quants use to analyze complex financial data. -
11
NetworkX
NetworkX
NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. Generators for classic graphs, random graphs, and synthetic networks. Additional benefits from Python include fast prototyping, easy to teach, and multi-platform. Network structure and analysis measures.Starting Price: Free -
12
Bokeh
Bokeh
Bokeh makes it simple to create common plots, but also can handle custom or specialized use-cases. Plots, dashboards, and apps can be published in web pages or Jupyter notebooks. Python has an incredible ecosystem of powerful analytics tools: NumPy, Scipy, Pandas, Dask, Scikit-Learn, OpenCV, and more. With a wide array of widgets, plot tools, and UI events that can trigger real Python callbacks, the Bokeh server is the bridge that lets you connect these tools to rich, interactive visualizations in the browser. Microscopium is a project maintained by researchers at Monash University. It allows researchers to discover new gene or drug functions by exploring large image datasets with Bokeh’s interactive tools. Panel is a tool for polished data presentation that utilizes the Bokeh server. It is created and supported by Anaconda. Panel makes it simple to create custom interactive web apps and dashboards by connecting user-defined widgets to plots, images, tables, or text.Starting Price: Free -
13
gTTS
gTTS
gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate's text-to-speech API. Write spoken mp3 data to a file, a file-like object (bytestring) for further audio manipulation, or stdout. Or simply pre-generate Google Translate TTS request URLs to feed to an external program. Customizable speech-specific sentence tokenizer that allows for unlimited lengths of text to be read, all while keeping proper intonation, abbreviations, decimals and more. Customizable text pre-processors which can, for example, provide pronunciation corrections.Starting Price: Free -
14
JDeli
IDR Solutions
JDeli is a powerful Java SDK designed to help you easily read, write, convert, manipulate and process various image formats in Java. Here’s an overview of its features: -Wide Image Format Support: JDeli reads/writes BMP, GIF, HEIC, JPEG, JPEG2000, PNG, TIFF, and WebP. It also reads DICOM, EMF/WMF, PSD, and SGI formats. -High Performance: JDeli’s encoders and decoders outperform alternatives, making it ideal for performance-critical applications. -File Security: JDeli operates securely on your servers, with no callbacks or cloud access. Critical customer data remains secure. -Ongoing Development: JDeli offers nightly and stable builds with regular new features. It continues to expand its range of supported image formats, including AVIF, HEIC, and JPEG XL. -No Third-Party Libraries: JDeli avoids third-party dependencies, minimizing security risks and JVM crashes.Starting Price: $1600 per year -
15
broot
broot
The ROOT data analysis framework is used much in High Energy Physics (HEP) and has its own output format (.root). ROOT can be easily interfaced with software written in C++. For software tools in Python there exists pyROOT. Unfortunately, pyROOT does not work well with python3.4. broot is a small library that converts data in python numpy ndarrays to ROOT files containing trees with a branch for each array. The goal of this library is to provide a generic way of writing python numpy datastructures to ROOT files. The library should be portable and supports both python2, python3, ROOT v5 and ROOT v6 (requiring no modifications on the ROOT part, just the default installation). Installation of the library should only require a user to compile to library once or install it as a python package.Starting Price: Free -
16
websockets
Python Software Foundation
An implementation of the WebSocket Protocol (RFC 6455 & 7692). websockets is a library for building WebSocket servers and clients in Python with a focus on correctness, simplicity, robustness, and performance. Built on top of asyncio, Python’s standard asynchronous I/O framework, it provides an elegant coroutine-based API. websockets is heavily tested for compliance with RFC 6455. Continuous integration fails under 100% branch coverage. websockets is built for production. For example, it was the only library to handle backpressure correctly before the issue became widely known in the Python community. Memory usage is optimized and configurable. A C extension accelerates expensive operations. It’s pre-compiled for Linux, macOS, and Windows and packaged in the wheel format for each system and Python version. websockets takes care of everything under the hood so you can focus on your application!Starting Price: Free -
17
Pylons
Python Software Foundation
The Pylons web framework is designed for building web applications and sites in an easy and concise manner. They can range from as small as a single Python module, to a substantial directory layout for larger and more complex web applications. Pylons comes with project templates that help boot-strap a new web application project, or you can start from scratch and set things up exactly as desired. A framework to make writing web applications in Python easy. Utilizes a minimalist, component-based philosophy that makes it easy to expand on. Harness existing knowledge about Python. Extensible application design. Fast and efficient, an incredibly small per-request call stack provides top performance. Uses existing and well-tested Python packages. Pylons 1.0 series is stable and production-ready but in maintenance-only mode. The Pylons Project now maintains the Pyramid web framework for future development. Pylons 1.0 users should strongly consider using Pyramid for their next project.Starting Price: Free -
18
imageio
imageio
Imageio is a Python library that provides an easy interface to read and write a wide range of image data, including animated images, volumetric data, and scientific formats. It is cross-platform, runs on Python 3.5+, and is easy to install. Imageio is written in pure Python, so installation is easy. Imageio works on Python 3.5+. It also works on Pypy. Imageio depends on Numpy and Pillow. For some formats, imageio needs additional libraries/executables (e.g. ffmpeg), which imageio helps you to download/install. If something doesn’t work as it should, you need to know where to search for causes. The overview on this page aims to help you in this regard by giving you an idea of how things work, and - hence - where things may go sideways.Starting Price: Free -
19
DHTMLX
DHTMLX
DHTMLX is a JavaScript UI library that provides a set of highly customizable and flexible components for building modern and responsive web applications. The library includes more than 30 UI components, such as Gantt, Scheduler, Kanban, diagrams, charts, grids, spreadsheets, calendars, trees, forms, and more. DHTMLX helps to advance web development and build feature-rich data-intensive applications of any complexity. One of the key advantages of DHTMLX is its compatibility with different web frameworks, including React, Angular, and Vue.js, which makes it a great choice for developers who are already working with these frameworks. DHTMLX is also designed with performance in mind. Its components are optimized for fast rendering. A comprehensive and rich API provides a straightforward way to manipulate JavaScript/HTML5 UI widgets. The library has detailed API documentation, tutorials, and examples that help developers quickly get up to speed and start building their apps.Starting Price: $79 -
20
IMSL
Perforce
Enhance performance and save development time with IMSL numerical libraries. Achieve your strategic objectives using IMSL's build tools. Model regression, make decision trees, establish neural networks, and forecast time series with your IMSL library. Rigorously tested and proven for decades across all industries, the IMSL C Numerical Library gives companies a dependable, high-ROI solution for building cutting-edge analytics tools. From data mining and forecasting, to advanced statistical analysis, the IMSL C Numerical Library can help teams quickly add sophisticated functionality to analytic applications. The IMSL C library makes integration and deployment easy. Enjoy easy migrations, support for common platforms and platform combinations, and no added infrastructure on embed in databases or applications. -
21
Seaborn
Seaborn
Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. For a brief introduction to the ideas behind the library, you can read the introductory notes or the paper. Visit the installation page to see how you can download the package and get started with it. You can browse the example gallery to see some of the things that you can do with seaborn, and then check out the tutorials or API reference to find out how. To see the code or report a bug, please visit the GitHub repository. General support questions are most at home on StackOverflow, which has a dedicated channel for seaborn. -
22
PyQtGraph
PyQtGraph
PyQtGraph is a pure-python graphics and GUI library built on PyQt/PySide and NumPy. It is intended for use in mathematics/scientific/engineering applications. Despite being written entirely in python, the library is very fast due to its heavy leverage of NumPy for number crunching and Qt's GraphicsView framework for fast display. PyQtGraph is distributed under the MIT open-source license. Basic 2D plotting in interactive view boxes. Line and scatter plots. Data can be panned/scaled by mouse. Fast drawing for real-time data display and interaction. Displays most data types (int or float; any bit depth; RGB, RGBA, or luminance). Functions for slicing multidimensional images at arbitrary angles (great for MRI data). Rapid update for video display or real-time interaction. Image display with interactive lookup tables and level control. Mesh rendering with isosurface generation. Interactive viewports rotate/zoom with mouse. Basic 3D scenegraph for easier programming.Starting Price: Free -
23
nuvo
nuvo
nuvo offers AI-powered, secure and scalable data onboarding solutions that empower you and your customers to map, validate, and clean data effortlessly – regardless of the input format. Don't let complex data mappings and transformations burden your developers. Reduce the time you use internally for customer data imports by 90%, or shift completely to a customer self-service model and empower your customers to import their data via an intuitive user interface. With our nuvo Data Importer SDK, we support software companies like Sastrify, Prewave, and Insurwave by automating their data imports, eliminating import errors, and freeing up their development resources to focus on core product functionalities. Experience the power of nuvo yourself! Start a free trial or schedule a quick demo with our team at getnuvo.com. -
24
BuildVu
IDR Solutions
With BuildVu, you’ll unlock precise PDF-to-HTML/SVG conversion, giving you greater control and added functionality over PDF in your web application. -Optimized Content: BuildVu intelligently converts PDFs, optimizing for smaller file sizes and fast rendering in browsers. -File Metadata: Access PDF data in JSON format, including metadata, word lists, outlines (bookmarks), and annotations. -Thumbnails: Generate high-quality page thumbnails with customizable dimensions. -Annotations: Enjoy support for various annotation types (Links, Popups, Sound/Video, Text, Highlight, Underline) in easy-to-use JSON format. -search.json: Extract all text from the document alongside the HTML content. -Font Conversion: Restructure embedded fonts for compatibility across web browsers. -Office Conversion: Combine BuildVu with LibreOffice for seamless conversion from Office formats (Word, PowerPoint, Excel).Starting Price: $450 per month -
25
Wijmo
GrapeCity
High-performance JavaScript UI controls for enterprise apps. Build lightweight, high-speed HTML5/JavaScript apps with zero dependencies, fast! New in Wijmo 2022 v1: - Inventory management reference app built in Angular - Website traffic monitoring reference app built in Angular - FlexMap added to GeoDashboard reference app - Rotated Spline Chart - Re-order FlexGrid rows with drag and drop General Features of Wijmo: - CellTemplates for Angular, React and Vue FlexGrids - Use Wijmo’s single set of true JavaScript controls in any framework, including Angular, React, and Vue.js - Use controls as native HTML elements with Wijmo Web Components - Filter, group, sort, and more with FlexGrid, JavaScript’s fastest data grid - Visualize data with 80+ charts and controls - Instantly analyze and aggregate data with OLAP Pivot GridStarting Price: $695 per user per year -
26
requests
Python Software Foundation
Requests is a simple, yet elegant, HTTP library. Requests allows you to send HTTP/1.1 requests extremely easily. There’s no need to manually add query strings to your URLs, or to form-encode your PUT & POST data, but nowadays, just use the JSON method! Requests is one of the most downloaded Python packages today, pulling in around 30M downloads/week, according to GitHub, Requests is currently depended upon by 1,000,000+ repositories. You may certainly put your trust in this code. Requests is available on PyPI. Requests is ready for the demands of building robust and reliable HTTP–speaking applications, for the needs of today. Automatic content decompression and decoding. International domains and URLs. Sessions with cookie persistence. Browser-style TLS/SSL verification. Basic & digest authentication, and familiar dict–like cookies. Multi-part file uploads. SOCKS proxy support. Connection timeouts and streaming downloads.Starting Price: Free -
27
DataMelt
jWork.ORG
DataMelt (or "DMelt") is an environment for numeric computation, data analysis, data mining, computational statistics, and data visualization. DataMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. Linear, non-linear and symbolic regression are also available. Neural networks and various data-manipulation methods are integrated using Java API. Elements of symbolic computations using Octave/Matlab scripting are supported. DataMelt is a computational environment for Java platform. It can be used with different programming languages on different operating systems. Unlike other statistical programs, it is not limited to a single programming language. This software combines the world's most-popular enterprise language, Java, with the most popular scripting language used in data science, such as Jython (Python), Groovy, JRuby.Starting Price: $0 -
28
Datavore
Datavore Labs
The code-free tool for advanced data analysis. Find insights with speed and accuracy. Why Datavore? Build workflows and combine signals, faster. Discover and track indicators across datasets in order to test and validate signals. Organize. Catalog all your data in one place. Use dynamic filters to quickly find internal and external data. Explore. Build dashboards to compare, evaluate, and monitor lines. Efficiently test and track multiple indicators across datasets. Analyze. Perform deep proprietary research by constructing forecasting models and regression analyses. Platform. Excel versatility with cloud scalability. Easily perform quantitative research and automate tedious operations. Excel Syntax. Write custom functions and use pre-built time series formulas. Patented ingestion engine. Discover concepts and relations within big datasets. Calendar alignment. Match data to company fiscal calendar or predefined periods. Aggregations. -
29
FastCube VCL
Fast Reports
It is not just a new generation of “Decision Cube” and much more than Pivot Grid control, it is a powerful high-speed standalone OLAP-cube engine! Integrate FastCube OLAP-library to your application, prepare data (or several data sets) that will be necessary for your end-users, and bring Business Intelligence directly into your application! Give freedom of report creation to your end-users – support them in getting reasonable decisions! FastCube VCL is a set of components for implementation in-memory data cube, OLAP operations, can be used for creating pivot table reports. It supports Delphi 7-XE8, C++Builder 2005-XE8, Embarcadero RAD Studio 10 - 11 and Lazarus. Your customers will no longer need Excel-tables for multi-dimensional data analysis and cross-tab reporting! FastCube components can be built into the interface of host applications. FastCube end users do not require high programming skills to build reports. FastCube is a set of OLAP Desktop components for Delphi.Starting Price: $199.00/developer -
30
Stata
StataCorp
Stata is a complete, integrated software package that provides all your data science needs: data manipulation, visualization, statistics, and automated reporting. Stata is fast and accurate. It is easy to learn through the extensive graphical interface yet completely programmable. With Stata's menus and dialogs, you get the best of both worlds. You can easily point and click or drag and drop your way to all of Stata's statistical, graphical, and data management features. Use Stata's intuitive command syntax to quickly execute commands. Whether you enter commands directly or use the menus and dialogs, you can create a log of all actions and their results to ensure the reproducibility and integrity of your analysis. Stata also has complete command-line scripting and programming facilities, including a full matrix programming language. You have access to everything you need to script your analysis or even to create new Stata commands--commands that work just like those shipped with Stata.Starting Price: $48.00/6-month/student -
31
Coheris Spad
ChapsVision
Coheris Spad by ChapsVision is a self-service data analysis studio for Data Scientists from all sectors and industries. Coheris Spad by ChapsVision is taught in many major French and foreign schools and universities, giving it a great reputation in the Data Scientists community. Coheris Spad by ChapsVision provides you with a great methodological wealth covering a very broad spectrum in terms of data analysis. In a user-friendly and intuitive environment, you have all the power you need to discover, prepare and analyze your data. Coheris Spad by ChapsVision allows you to connect to many sources to prepare your data. You have a vast library of data processing functions at your disposal: filtering, stacking, aggregation, transposition, join, management of missing data, search for atypical distributions, statistical or supervised recoding, formatting. -
32
OpenTuner
OpenTuner
Program autotuning has been demonstrated in many domains to achieve better or more portable performance. However, autotuners themselves are often not very portable between projects because using a domain-informed search space representation is critical to achieving good results and because no single search technique performs best for all problems. OpenTuner is a new framework for building domain-specific multi-objective program autotuners. OpenTuner supports fully customizable configuration representations, an extensible technique representation to allow for domain-specific techniques, and an easy-to-use interface for communicating with the tuned program. A key capability inside OpenTuner is the use of ensembles of disparate search techniques simultaneously, techniques that perform well will receive larger testing budgets, and techniques which perform poorly will be disabled.Starting Price: Free -
33
yarl
Python Software Foundation
All URL parts, scheme, user, password, host, port, path, query, and fragment are accessible by properties. All URL manipulations produce a new URL object. Strings passed to constructor and modification methods are automatically encoded giving canonical representation as result. Regular properties are percent-decoded, use raw_ versions for getting encoded strings. Human-readable representation of URL is available as .human_repr(). PyPI contains binary wheels for Linux, Windows and MacOS. If you want to install yarl on another operating system (like Alpine Linux, which is not manylinux-compliant because of the missing glibc and therefore, cannot be used with our wheels) the tarball will be used to compile the library from the source code. It requires a C compiler and Python headers installed. Please note that the pure-Python (uncompiled) version is much slower. However, PyPy always uses a pure-Python implementation, and, as such, it is unaffected by this variable.Starting Price: Free -
34
Oracle Data Access Components (ODAC) are a set of Windows and .NET data access drivers and tools. They include support for .NET data access, Microsoft Visual Studio tools for developing Oracle database applications; and ASP.NET providers. ODAC provides comprehensive client support for advanced Oracle database functionality, including performance, high availability, and security, among other features. It is tightly integrated with Visual Studio to provide a seamless development experience for developers. Oracle Data Provider for .NET is an implementation of Microsoft's ADO.NET interface and provides access to Oracle databases. OLAP Data Manipulation Language (OLAP DML) is used to define and manipulate analytic workspace objects. High performance, a feature-rich mechanism for Microsoft ADO and OLE DB data access to Oracle databases, its usage, and its configuration. Provides introductory, installation, post-installation configuration, and usage information.Starting Price: Free
-
35
Beautiful Soup
Beautiful Soup
Beautiful Soup is a library that makes it easy to scrape information from web pages. It sits atop an HTML or XML parser, providing Pythonic idioms for iterating, searching, and modifying the parse tree. Beautiful Soup's support for Python 2 was discontinued on December 31, 2020: one year after the sunset date for Python 2 itself. From this point onward, new Beautiful Soup development will exclusively target Python 3. The final release of Beautiful Soup 4 to support Python 2 was 4.9.3. Beautiful Soup is licensed under the MIT license, so you can also download the tarball, drop the bs4/ directory into almost any Python application (or into your library path) and start using it immediately.Starting Price: Free -
36
Scapy
Scapy
Scapy is a powerful interactive packet manipulation program. It is able to forge or decode packets of a wide number of protocols, send them on the wire, capture them, match requests and replies, and much more. It can easily handle most classical tasks like scanning, tracerouting, probing, unit tests, attacks, or network discovery (it can replace hping, 85% of nmap, arpspoof, arp-sk, arping, tcpdump, tshark, p0f, etc.). It also performs very well at a lot of other specific tasks that most other tools can’t handle, like sending invalid frames, injecting your own 802.11 frames, combining technics (VLAN hopping+ARP cache poisoning, VOIP decoding on WEP encrypted channel), etc. Scapy runs natively on Linux, Windows, OSX, and on most Unixes with libpcap. The same code base now runs natively on both Python 2 and Python 3. Scapy development uses the Git version control system. Scapy reference repository is hosted on GitHub.Starting Price: Free -
37
Altair SLC
Altair
Many organizations have developed SAS language programs over the past 20 years that are vital to their operations. Altair SLC runs programs written in SAS language syntax without translation and without needing to license third-party products. Altair SLC reduces users’ capital costs and operating expenses thanks to its superb ability to handle high levels of throughput. Altair SLC's built-in SAS language compiler runs SAS language and SQL code, and utilizes Python and R compilers to run Python and R code and exchange SAS language datasets, Pandas, and R data frames. The software runs on IBM mainframes, in the cloud, and on servers and workstations running a variety of operating systems. It supports both remote job submission and the ability to exchange data between mainframe, cloud, and on-premises installations. -
38
Jspreadsheet
Jspreadsheet
Jspreadsheet is a robust full-stack JavaScript data grid solution that directly integrates the functionality and user-friendly experience of spreadsheet applications like Excel and Google Sheets into your web applications. It offers a smooth, efficient user interface, enabling batch actions, table manipulation, and a host of other features that ensure flawless compatibility between your web application and Excel/Sheets. This familiar environment enhances productivity, simplifies user adoption, and minimizes the need for extensive training. Jspreadsheet is a comprehensive solution designed to meet a variety of application requirements in spreadsheet and data management for web platforms. It optimizes workflow development, streamlines process automation, and facilitates the smooth transition of tasks from Excel to the web. Additionally, Jspreadsheet provides a wide range of extensions to address diverse needs within the data grid and spreadsheet ecosystem, making it a versatile choice.Starting Price: $49 per developer -
39
zope.interface
Python Software Foundation
This package is intended to be independently reusable in any Python project. It is maintained by the Zope Toolkit project. This package provides an implementation of “object interfaces” for Python. Interfaces are a mechanism for labeling objects as conforming to a given API or contract. So, this package can be considered as an implementation of the Design By Contract methodology support in Python. Interfaces are objects that specify (document) the external behavior of objects that “provide” them. An interface specifies behavior through informal documentation in a doc string, attribute definitions, and invariants, which are conditions that must hold for objects that provide the interface. Attribute definitions specify specific attributes. They define the attribute name and provide documentation and constraints of attribute values. Attribute definitions can take a number of forms.Starting Price: Free -
40
Mako
Mako
It provides a familiar, non-XML syntax that compiles into Python modules for maximum performance. Mako's syntax and API borrows from the best ideas of many others, including Django and Jinja2 templates, Cheetah, Myghty, and Genshi. Conceptually, Mako is an embedded Python (i.e. Python Server Page) language, which refines the familiar ideas of componentized layout and inheritance to produce one of the most straightforward and flexible models available, while also maintaining close ties to Python calling and scoping semantics. As templates are ultimately compiled into Python bytecode, Mako's approach is extremely efficient and was originally written to be just as fast as Cheetah. Today, Mako is very close in speed to Jinja2, which uses a similar approach and for which Mako was an inspiration. Can access variables from their enclosing scope as well as the template's request contextStarting Price: Free -
41
Polymer
Polymer
The Polymer library provides a set of features for creating custom elements. These features are designed to make it easier and faster to make custom elements that work like standard DOM elements. Similar to standard DOM elements, Polymer elements can be instantiated using a constructor or document creation element, configured using attributes or properties, populated with internal DOM inside each instance, responsive to property and attribute changes, styled with internal defaults or externally, and responsive to methods that manipulate its internal state. Registering an element associates a class with a custom element name. The element provides callbacks to manage its lifecycle. Polymer also lets you declare properties, to integrate your element's property API with the Polymer data system. Shadow DOM provides a local, encapsulated DOM tree for your element. Polymer can automatically create and populate a shadow tree for your element from a DOM template.Starting Price: Free -
42
ruffus
ruffus
Ruffus is a computation pipeline library for python. It is open-sourced, powerful and user-friendly, and widely used in science and bioinformatics. Ruffus is designed to allow scientific and other analyses to be automated with the minimum of fuss and the least effort. Suitable for the simplest of tasks. Handles even fiendishly complicated pipelines which would cause make or scons to go cross-eyed and recursive. No "clever magic", no pre-processing. Unambitious, the lightweight syntax which tries to do this one small thing well. Ruffus is available under the permissive MIT free software license. This permits free use and inclusion even within proprietary software. It is good practice to run your pipeline in a temporary, “working” directory away from your original data. Ruffus is a lightweight python module for building computational pipelines. Ruffus requires Python 2.6 or higher or Python 3.0 or higher.Starting Price: Free -
43
openpyxl
openpyxl
openpyxl is a Python library to read/write Excel 2010 xlsx/xlsm/xltx/xltm files. It was born from a lack of an existing library to read/write natively from Python in the Office Open XML format. All kudos to the PHPExcel team as openpyxl was initially based on PHPExcel. By default, openpyxl does not guard against quadratic blowup or billion laughs XML attacks. To guard against these attacks install defusedxml. Install openpyxl using pip. It is advisable to do this in a Python virtualenv without system packages. Sometimes you might want to work with the checkout of a particular version. This may be the case if bugs have been fixed but a release has not yet been made. There is no need to create a file on the filesystem to get started with openpyxl. Just import the Workbook class and start work. Sheets are given a name automatically when they are created. Once you gave a worksheet a name, you can get it as a key of the workbook.Starting Price: Free -
44
python-docx
python-docx
python-docx is a Python library for creating and updating Microsoft Word (.docx) files. Paragraphs are fundamental in Word. They’re used for body text, but also for headings and list items like bullets. You’re free to specify both width and height, but usually, you wouldn’t want to. If you specify only one, python-docx uses it to calculate the properly scaled value of the other. This way the aspect ratio is preserved and your picture doesn’t look stretched. If you don’t know what a Word paragraph style is you should definitely check it out. Basically, it allows you to apply a whole set of formatting options to a paragraph at once. python-docx allows you to create new documents as well as make changes to existing ones. Actually, it only lets you make changes to existing documents; it’s just that if you start with a document that doesn’t have any content, it might feel at first like you’re creating one from scratch.Starting Price: Free -
45
waiting
Python Software Foundation
waiting is a small library for waiting for stuff to happen. It basically waits for a function to return True, in various modes. Waiting is compatible with flux for simulated timelines. The most basic usage is when you have a function you want to wait for. Waiting forever is very simple. If your predicate returns a value, it will be returned as the result of wait(). A timeout parameter can also be specified. When a timeout expires without the predicate being fulfilled, an exception is thrown. Sleeping polls the predicate at a certain interval (by default 1 second). The interval can be changed with the sleep_seconds argument. When waiting for multiple predicates, waiting provides two simple facilities to help aggregate them, any and all. They resemble Python’s built-in any() and all(), except that they don’t call a predicate once it has been satisfied (this is useful when the predicates are inefficient and take time to complete).Starting Price: Free -
46
GDAL
GDAL
GDAL is a translator library for raster and vector geospatial data formats that is released under an MIT style Open Source License by the Open Source Geospatial Foundation. As a library, it presents a single raster abstract data model and single vector abstract data model to the calling application for all supported formats. It also comes with a variety of useful command line utilities for data translation and processing.Starting Price: Free -
47
NumPy
NumPy
Fast and versatile, the NumPy vectorization, indexing, and broadcasting concepts are the de-facto standards of array computing today. NumPy offers comprehensive mathematical functions, random number generators, linear algebra routines, Fourier transforms, and more. NumPy supports a wide range of hardware and computing platforms, and plays well with distributed, GPU, and sparse array libraries. The core of NumPy is well-optimized C code. Enjoy the flexibility of Python with the speed of compiled code. NumPy’s high level syntax makes it accessible and productive for programmers from any background or experience level. NumPy brings the computational power of languages like C and Fortran to Python, a language much easier to learn and use. With this power comes simplicity: a solution in NumPy is often clear and elegant.Starting Price: Free -
48
xlrd
Python Software Foundation
xlrd is a library for developers to extract data from Microsoft Excel (tm) .xls spreadsheet files. xlrd is a library for reading data and formatting information from Excel files in the historical .xls format. This library will no longer read anything other than .xls files. Ignores charts, macros, pictures, any other embedded object, including embedded worksheets, VBA modules, and formulas, but results of formula calculations are extracted, comments, hyperlinks, auto filters, advanced filters, pivot tables, conditional formatting, and data validation. Password-protected files are not supported and cannot be read by this library. From the command line, this will show the first, second, and last rows of each sheet in each file. xlrd is licensed under the BSD license.Starting Price: Free -
49
An easy-to-use, yet powerful package for accessing, manipulating, analyzing and presenting information, now available for cloud and on-site deployment. Why piece together niche software packages from different vendors? A consolidated portfolio that includes three of our most popular products, Base SAS, SAS/STAT and SAS/GRAPH, reduces the cost of licensing, maintenance, training, and support while ensuring that consistent information is available across your enterprise. SAS statistical procedures are constantly being updated to reflect the latest advances in statistical methodology. And technical support is provided by experienced master's- and doctorate-level statisticians who deliver a level of service and knowledge not often found with other software vendors. SAS has more than four decades of experience developing statistical analysis software that organizations around the world rely on to provide accurate answers to data questions.
-
50
WTForms
WTForms
WTForms is a flexible forms validation and rendering library for Python web development. It can work with whatever web framework and template engine you choose. It supports data validation, CSRF protection, internationalization (I18N), and more. There are various community libraries that provide closer integration with popular frameworks. WTForms tries to provide as usable an API as possible. We’ve listed here some of the known libraries to work with WTForms, but if it’s not listed, it doesn’t mean it won’t work. Pretty much any ORM or object-DB should work, as long as data objects allow attribute access to their members. WTForms uses unicode strings throughout the source code and assumes that form input has already been coerced to unicode by your framework. WTForms fields render to unicode strings by default, and therefore as long as your templating engine can work with that, you should have no unicode issues.Starting Price: Free