Alternatives to pandas

Compare pandas alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to pandas in 2024. Compare features, ratings, user reviews, pricing, and more from pandas competitors and alternatives in order to make an informed decision for your business.

  • 1
    Tumult Analytics

    Tumult Analytics

    Tumult Analytics

    Built and maintained by a team of differential privacy experts, and running in production at institutions like the U.S. Census Bureau. Runs on Spark and effortlessly supports input tables containing billions of rows. Supports a large and ever-growing list of aggregation functions, data transformation operators, and privacy definitions. Perform public and private joins, filters, or user-defined functions on your data. Compute counts, sums, quantiles, and more under multiple privacy models. Differential privacy is made easy, thanks to our simple tutorials and extensive documentation. Tumult Analytics is built on our sophisticated privacy foundation, Tumult Core, which mediates access to sensitive data and means that every program and application comes with an embedded proof of privacy. Built by composing small, easy-to-review components. Provably safe stability tracking and floating-point primitives. Uses a generic framework based on peer-reviewed research.
  • 2
    Vaex

    Vaex

    Vaex

    At Vaex.io we aim to democratize big data and make it available to anyone, on any machine, at any scale. Cut development time by 80%, your prototype is your solution. Create automatic pipelines for any model. Empower your data scientists. Turn any laptop into a big data powerhouse, no clusters, no engineers. We provide reliable and fast data driven solutions. With our state-of-the-art technology we build and deploy machine learning models faster than anyone on the market. Turn your data scientist into big data engineers. We provide comprehensive training of your employees, enabling you to take full advantage of our technology. Combines memory mapping, a sophisticated expression system, and fast out-of-core algorithms. Efficiently visualize and explore big datasets, and build machine learning models on a single machine.
  • 3
    OpenRefine

    OpenRefine

    OpenRefine

    OpenRefine (previously Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data. OpenRefine always keeps your data private on your own computer until you want to share or collaborate. Your private data never leaves your computer unless you want it to. (It works by running a small server on your computer and you use your web browser to interact with it). OpenRefine can help you explore large data sets with ease. You can find out more about this functionality by watching the video below. OpenRefine can be used to link and extend your dataset with various webservices. Some services also allow OpenRefine to upload your cleaned data to a central database, such as Wikidata.. A growing list of extensions and plugins is available on the wiki.
  • 4
    Polars

    Polars

    Polars

    Knowing of data wrangling habits, Polars exposes a complete Python API, including the full set of features to manipulate DataFrames using an expression language that will empower you to create readable and performant code. Polars is written in Rust, uncompromising in its choices to provide a feature-complete DataFrame API to the Rust ecosystem. Use it as a DataFrame library or as a query engine backend for your data models.
  • 5
    List & Label
    List & Label is a report generator for software developers to integrate reporting functions in their web, cloud and desktop applications. Made for development environments such as .NET, C#, Delphi, C++, ASP.NET, ASP.NET MVC, .NET Core etc. It is seamless to integrate, supports a huge variety of data sources and extends applications with extensive print, export and preview functions. With the WYSIWYG Report Designer, developers or end users create or edit different print templates for printing information that originates either from a database or another data source. In the Designer, you then have all the data at your disposal to prepare it for printing in different ways. The additionally included and entirely browser-based Web Report Designer for ASP.NET MVC offers more flexibility in development and is independent from printer drivers. Reports for web applications can be designed anywhere at any time in the browser of your choice. List & Label is "Made in Germany" by combit.
    Starting Price: €650/license
  • 6
    Pillow

    Pillow

    Pillow

    The Python Imaging Library adds image processing capabilities to your Python interpreter. This library provides extensive file format support, an efficient internal representation, and fairly powerful image processing capabilities. The core image library is designed for fast access to data stored in a few basic pixel formats. It should provide a solid foundation for a general image processing tool. Pillow for enterprise is available via the Tidelift subscription. The Python Imaging Library is ideal for image archival and batch processing applications. You can use the library to create thumbnails, convert between file formats, print images, etc. The current version identifies and reads a large number of formats. Write support is intentionally restricted to the most commonly used interchange and presentation formats. The library contains basic image processing functionality, including point operations, filtering with a set of built-in convolution kernels, and color space conversions.
    Starting Price: Free
  • 7
    nuvo

    nuvo

    nuvo

    nuvo offers AI-powered, secure and scalable data onboarding solutions that empower you and your customers to map, validate, and clean data effortlessly – regardless of the input format. Don't let complex data mappings and transformations burden your developers. Reduce the time you use internally for customer data imports by 90%, or shift completely to a customer self-service model and empower your customers to import their data via an intuitive user interface. With our nuvo Data Importer SDK, we support software companies like Sastrify, Prewave, and Insurwave by automating their data imports, eliminating import errors, and freeing up their development resources to focus on core product functionalities. Experience the power of nuvo yourself! Start a free trial or schedule a quick demo with our team at getnuvo.com.
  • 8
    Avanzai

    Avanzai

    Avanzai

    Avanzai helps accelerate your financial data analysis by letting you use natural language to output production-ready Python code. Avanzai speeds up financial data analysis for both beginners and experts using plain English. Plot times series data, equity index members, and even stock performance data using natural prompts. Skip the boring parts of financial analysis by leveraging AI to generate code with relevant Python packages already installed. Further edit the code if you wish, once you're ready copy and paste the code into your local environment and get straight to business. Leverage commonly used Python packages for quant analysis such as Pandas, Numpy, etc using plain English. Take financial analysis to the next level, quickly pull fundamental data and calculate the performance of nearly all US stocks. Enhance your investment decisions with accurate and up-to-date information. Avanzai empowers you to write the same Python code that quants use to analyze complex financial data.
  • 9
    statsmodels

    statsmodels

    statsmodels

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests and statistical data exploration. An extensive list of result statistics is available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open-source Modified BSD (3-clause) license. statsmodels supports specifying models using R-style formulas and pandas DataFrames. Have a look at dir(results) to see available results. Attributes are described in results.__doc__ and results methods have their own docstrings. You can also use numpy arrays instead of formulas. The easiest way to install statsmodels is to install it as part of the Anaconda distribution, a cross-platform distribution for data analysis and scientific computing. This is the recommended installation method for most users.
    Starting Price: Free
  • 10
    Bokeh

    Bokeh

    Bokeh

    Bokeh makes it simple to create common plots, but also can handle custom or specialized use-cases. Plots, dashboards, and apps can be published in web pages or Jupyter notebooks. Python has an incredible ecosystem of powerful analytics tools: NumPy, Scipy, Pandas, Dask, Scikit-Learn, OpenCV, and more. With a wide array of widgets, plot tools, and UI events that can trigger real Python callbacks, the Bokeh server is the bridge that lets you connect these tools to rich, interactive visualizations in the browser. Microscopium is a project maintained by researchers at Monash University. It allows researchers to discover new gene or drug functions by exploring large image datasets with Bokeh’s interactive tools. Panel is a tool for polished data presentation that utilizes the Bokeh server. It is created and supported by Anaconda. Panel makes it simple to create custom interactive web apps and dashboards by connecting user-defined widgets to plots, images, tables, or text.
    Starting Price: Free
  • 11
    h5py

    h5py

    HDF5

    The h5py package is a Pythonic interface to the HDF5 binary data format. It lets you store huge amounts of numerical data, and easily manipulate that data from NumPy. For example, you can slice into multi-terabyte datasets stored on disk, as if they were real NumPy arrays. Thousands of datasets can be stored in a single file, categorized and tagged however you want. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. For example, you can iterate over datasets in a file, or check out the .shape or .dtype attributes of datasets. You don't need to know anything special about HDF5 to get started. In addition to the easy-to-use high level interface, h5py rests on a object-oriented Cython wrapping of the HDF5 C API. Almost anything you can do from C in HDF5, you can do from h5py.
    Starting Price: Free
  • 12
    warcat

    warcat

    Python Software Foundation

    Tool and library for handling Web ARChive (WARC) files. Naively join archives into one. Extract files from archive. List commands available. List contents of archive. Load archive and write it back out. Split archives into individual records. Verify digest and validate conformance. The library may not be entirely thread-safe yet. The goal of the Warcat project is to create a tool and library as easily and fast as manipulating any other archive such as tar and zip archives. Warcat is designed to handle large, gzip-ed files by partially extracting them as needed. Warcat is provided without warranty and cannot guarantee the safety of your files. Remember to make backups and test them! A WARC file contains one or more records concatenated together. Each record contains named fields, newline, a content block, newline, and newline. A Content Block may be two types, {binary data} or {Named Fields, newline, and binary data}.
    Starting Price: Free
  • 13
    JDeli

    JDeli

    IDR Solutions

    JDeli is a powerful Java SDK designed to help you easily read, write, convert, manipulate and process various image formats in Java. Here’s an overview of its features: -Wide Image Format Support: JDeli reads/writes BMP, GIF, HEIC, JPEG, JPEG2000, PNG, TIFF, and WebP. It also reads DICOM, EMF/WMF, PSD, and SGI formats. -High Performance: JDeli’s encoders and decoders outperform alternatives, making it ideal for performance-critical applications. -File Security: JDeli operates securely on your servers, with no callbacks or cloud access. Critical customer data remains secure. -Ongoing Development: JDeli offers nightly and stable builds with regular new features. It continues to expand its range of supported image formats, including AVIF, HEIC, and JPEG XL. -No Third-Party Libraries: JDeli avoids third-party dependencies, minimizing security risks and JVM crashes.
    Starting Price: $1600 per year
  • 14
    IMSL

    IMSL

    Perforce

    Enhance performance and save development time with IMSL numerical libraries. Achieve your strategic objectives using IMSL's build tools. Model regression, make decision trees, establish neural networks, and forecast time series with your IMSL library. Rigorously tested and proven for decades across all industries, the IMSL C Numerical Library gives companies a dependable, high-ROI solution for building cutting-edge analytics tools. From data mining and forecasting, to advanced statistical analysis, the IMSL C Numerical Library can help teams quickly add sophisticated functionality to analytic applications. The IMSL C library makes integration and deployment easy. Enjoy easy migrations, support for common platforms and platform combinations, and no added infrastructure on embed in databases or applications.
  • 15
    PyQtGraph

    PyQtGraph

    PyQtGraph

    PyQtGraph is a pure-python graphics and GUI library built on PyQt/PySide and NumPy. It is intended for use in mathematics/scientific/engineering applications. Despite being written entirely in python, the library is very fast due to its heavy leverage of NumPy for number crunching and Qt's GraphicsView framework for fast display. PyQtGraph is distributed under the MIT open-source license. Basic 2D plotting in interactive view boxes. Line and scatter plots. Data can be panned/scaled by mouse. Fast drawing for real-time data display and interaction. Displays most data types (int or float; any bit depth; RGB, RGBA, or luminance). Functions for slicing multidimensional images at arbitrary angles (great for MRI data). Rapid update for video display or real-time interaction. Image display with interactive lookup tables and level control. Mesh rendering with isosurface generation. Interactive viewports rotate/zoom with mouse. Basic 3D scenegraph for easier programming.
    Starting Price: Free
  • 16
    BuildVu

    BuildVu

    IDR Solutions

    With BuildVu, you’ll unlock precise PDF-to-HTML/SVG conversion, giving you greater control and added functionality over PDF in your web application. -Optimized Content: BuildVu intelligently converts PDFs, optimizing for smaller file sizes and fast rendering in browsers. -File Metadata: Access PDF data in JSON format, including metadata, word lists, outlines (bookmarks), and annotations. -Thumbnails: Generate high-quality page thumbnails with customizable dimensions. -Annotations: Enjoy support for various annotation types (Links, Popups, Sound/Video, Text, Highlight, Underline) in easy-to-use JSON format. -search.json: Extract all text from the document alongside the HTML content. -Font Conversion: Restructure embedded fonts for compatibility across web browsers. -Office Conversion: Combine BuildVu with LibreOffice for seamless conversion from Office formats (Word, PowerPoint, Excel).
    Starting Price: $450 per month
  • 17
    NetworkX

    NetworkX

    NetworkX

    NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. Generators for classic graphs, random graphs, and synthetic networks. Additional benefits from Python include fast prototyping, easy to teach, and multi-platform. Network structure and analysis measures.
    Starting Price: Free
  • 18
    Datavore

    Datavore

    Datavore Labs

    The code-free tool for advanced data analysis. Find insights with speed and accuracy. Why Datavore? Build workflows and combine signals, faster. Discover and track indicators across datasets in order to test and validate signals. Organize. Catalog all your data in one place. Use dynamic filters to quickly find internal and external data. Explore. Build dashboards to compare, evaluate, and monitor lines. Efficiently test and track multiple indicators across datasets. Analyze. Perform deep proprietary research by constructing forecasting models and regression analyses. Platform. Excel versatility with cloud scalability. Easily perform quantitative research and automate tedious operations. Excel Syntax. Write custom functions and use pre-built time series formulas. Patented ingestion engine. Discover concepts and relations within big datasets. Calendar alignment. Match data to company fiscal calendar or predefined periods. Aggregations.
  • 19
    FastCube VCL

    FastCube VCL

    Fast Reports

    It is not just a new generation of “Decision Cube” and much more than Pivot Grid control, it is a powerful high-speed standalone OLAP-cube engine! Integrate FastCube OLAP-library to your application, prepare data (or several data sets) that will be necessary for your end-users, and bring Business Intelligence directly into your application! Give freedom of report creation to your end-users – support them in getting reasonable decisions! FastCube VCL is a set of components for implementation in-memory data cube, OLAP operations, can be used for creating pivot table reports. It supports Delphi 7-XE8, C++Builder 2005-XE8, Embarcadero RAD Studio 10 - 11 and Lazarus. Your customers will no longer need Excel-tables for multi-dimensional data analysis and cross-tab reporting! FastCube components can be built into the interface of host applications. FastCube end users do not require high programming skills to build reports. FastCube is a set of OLAP Desktop components for Delphi.
    Starting Price: $199.00/developer
  • 20
    gTTS

    gTTS

    gTTS

    gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate's text-to-speech API. Write spoken mp3 data to a file, a file-like object (bytestring) for further audio manipulation, or stdout. Or simply pre-generate Google Translate TTS request URLs to feed to an external program. Customizable speech-specific sentence tokenizer that allows for unlimited lengths of text to be read, all while keeping proper intonation, abbreviations, decimals and more. Customizable text pre-processors which can, for example, provide pronunciation corrections.
    Starting Price: Free
  • 21
    Coheris Spad

    Coheris Spad

    ChapsVision

    Coheris Spad by ChapsVision is a self-service data analysis studio for Data Scientists from all sectors and industries. Coheris Spad by ChapsVision is taught in many major French and foreign schools and universities, giving it a great reputation in the Data Scientists community. Coheris Spad by ChapsVision provides you with a great methodological wealth covering a very broad spectrum in terms of data analysis. In a user-friendly and intuitive environment, you have all the power you need to discover, prepare and analyze your data. Coheris Spad by ChapsVision allows you to connect to many sources to prepare your data. You have a vast library of data processing functions at your disposal: filtering, stacking, aggregation, transposition, join, management of missing data, search for atypical distributions, statistical or supervised recoding, formatting.
  • 22
    broot

    broot

    broot

    The ROOT data analysis framework is used much in High Energy Physics (HEP) and has its own output format (.root). ROOT can be easily interfaced with software written in C++. For software tools in Python there exists pyROOT. Unfortunately, pyROOT does not work well with python3.4. broot is a small library that converts data in python numpy ndarrays to ROOT files containing trees with a branch for each array. The goal of this library is to provide a generic way of writing python numpy datastructures to ROOT files. The library should be portable and supports both python2, python3, ROOT v5 and ROOT v6 (requiring no modifications on the ROOT part, just the default installation). Installation of the library should only require a user to compile to library once or install it as a python package.
    Starting Price: Free
  • 23
    websockets

    websockets

    Python Software Foundation

    An implementation of the WebSocket Protocol (RFC 6455 & 7692). websockets is a library for building WebSocket servers and clients in Python with a focus on correctness, simplicity, robustness, and performance. Built on top of asyncio, Python’s standard asynchronous I/O framework, it provides an elegant coroutine-based API. websockets is heavily tested for compliance with RFC 6455. Continuous integration fails under 100% branch coverage. websockets is built for production. For example, it was the only library to handle backpressure correctly before the issue became widely known in the Python community. Memory usage is optimized and configurable. A C extension accelerates expensive operations. It’s pre-compiled for Linux, macOS, and Windows and packaged in the wheel format for each system and Python version. websockets takes care of everything under the hood so you can focus on your application!
    Starting Price: Free
  • 24
    Pylons

    Pylons

    Python Software Foundation

    The Pylons web framework is designed for building web applications and sites in an easy and concise manner. They can range from as small as a single Python module, to a substantial directory layout for larger and more complex web applications. Pylons comes with project templates that help boot-strap a new web application project, or you can start from scratch and set things up exactly as desired. A framework to make writing web applications in Python easy. Utilizes a minimalist, component-based philosophy that makes it easy to expand on. Harness existing knowledge about Python. Extensible application design. Fast and efficient, an incredibly small per-request call stack provides top performance. Uses existing and well-tested Python packages. Pylons 1.0 series is stable and production-ready but in maintenance-only mode. The Pylons Project now maintains the Pyramid web framework for future development. Pylons 1.0 users should strongly consider using Pyramid for their next project.
  • 25
    imageio

    imageio

    imageio

    Imageio is a Python library that provides an easy interface to read and write a wide range of image data, including animated images, volumetric data, and scientific formats. It is cross-platform, runs on Python 3.5+, and is easy to install. Imageio is written in pure Python, so installation is easy. Imageio works on Python 3.5+. It also works on Pypy. Imageio depends on Numpy and Pillow. For some formats, imageio needs additional libraries/executables (e.g. ffmpeg), which imageio helps you to download/install. If something doesn’t work as it should, you need to know where to search for causes. The overview on this page aims to help you in this regard by giving you an idea of how things work, and - hence - where things may go sideways.
    Starting Price: Free
  • 26
    Molecula

    Molecula

    Molecula

    Molecula is an enterprise feature store that simplifies, accelerates, and controls big data access to power machine-scale analytics and AI. Continuously extracting features, reducing the dimensionality of data at the source, and routing real-time feature changes into a central store enables millisecond queries, computation, and feature re-use across formats and locations without copying or moving raw data. The Molecula feature store provides data engineers, data scientists, and application developers a single access point to graduate from reporting and explaining with human-scale data to predicting and prescribing real-time business outcomes with all data. Enterprises spend a lot of money preparing, aggregating, and making numerous copies of their data for every project before they can make decisions with it. Molecula brings an entirely new paradigm for continuous, real-time data analysis to be used for all your mission-critical applications.
  • 27
    Scapy

    Scapy

    Scapy

    Scapy is a powerful interactive packet manipulation program. It is able to forge or decode packets of a wide number of protocols, send them on the wire, capture them, match requests and replies, and much more. It can easily handle most classical tasks like scanning, tracerouting, probing, unit tests, attacks, or network discovery (it can replace hping, 85% of nmap, arpspoof, arp-sk, arping, tcpdump, tshark, p0f, etc.). It also performs very well at a lot of other specific tasks that most other tools can’t handle, like sending invalid frames, injecting your own 802.11 frames, combining technics (VLAN hopping+ARP cache poisoning, VOIP decoding on WEP encrypted channel), etc. Scapy runs natively on Linux, Windows, OSX, and on most Unixes with libpcap. The same code base now runs natively on both Python 2 and Python 3. Scapy development uses the Git version control system. Scapy reference repository is hosted on GitHub.
    Starting Price: Free
  • 28
    DataClarity Unlimited Analytics
    DataClarity Unlimited Analytics is the only free modern embeddable data and analytics platform in the world that provides a self-service, powerful, secure and seamless end-to-end experience. Highlights: SIMPLIFIED DATA INTEGRATION – Easily connect, join, curate, cache and catalog diverse data through drag and drop, custom SQL builder & AI-powered data profiling. | INTERACTIVE DASHBOARDS – craft compelling reports using 80 stunning visualizations, geospatial maps and flexibility to bring your own charts. | REAL-TIME ANALYSIS – Perform advanced analysis & data exploration using drill-down, drill-through, filters, built-in statistical & predictive models, or your own Python and R code. | SEAMLESS APPLICATION INTEGRATION – Achieve smooth integration with versatile APIs, tailor-made configurations & flexible embedding features. | SECURITY & GOVERNANCE – Ensure adherence to your security guidelines, governance standards, multitenancy, row-level data protection, and Single Sign-On (SSO).
  • 29
    DHTMLX

    DHTMLX

    DHTMLX

    DHTMLX is a JavaScript UI library that provides a set of highly customizable and flexible components for building modern and responsive web applications. The library includes more than 30 UI components, such as Gantt, Scheduler, Kanban, diagrams, charts, grids, spreadsheets, calendars, trees, forms, and more. DHTMLX helps to advance web development and build feature-rich data-intensive applications of any complexity. One of the key advantages of DHTMLX is its compatibility with different web frameworks, including React, Angular, and Vue.js, which makes it a great choice for developers who are already working with these frameworks. DHTMLX is also designed with performance in mind. Its components are optimized for fast rendering. A comprehensive and rich API provides a straightforward way to manipulate JavaScript/HTML5 UI widgets. The library has detailed API documentation, tutorials, and examples that help developers quickly get up to speed and start building their apps.
    Starting Price: $79
  • 30
    MyDataModels TADA

    MyDataModels TADA

    MyDataModels

    Deploy best-in-class predictive analytics models TADA by MyDataModels helps professionals use their Small Data to enhance their business with a light, easy-to-set-up tool. TADA provides a predictive modeling solution leading to fast and usable results. Shift from days to a few hours into building ad hoc effective models with our 40% reduced time automated data preparation. Get outcomes from your data without programming or machine learning skills. Optimize your time with explainable and understandable models made of easy-to-read formulas. Turn your data into insights in a snap on any platform and create effective automated models. TADA removes the complexity of building predictive models by automating the generative machine learning process – data in, model out. Build and run machine learning models on any devices and platforms through our powerful web-based pre-processing features.
    Starting Price: $5347.46 per year
  • 31
    ruffus

    ruffus

    ruffus

    Ruffus is a computation pipeline library for python. It is open-sourced, powerful and user-friendly, and widely used in science and bioinformatics. Ruffus is designed to allow scientific and other analyses to be automated with the minimum of fuss and the least effort. Suitable for the simplest of tasks. Handles even fiendishly complicated pipelines which would cause make or scons to go cross-eyed and recursive. No "clever magic", no pre-processing. Unambitious, the lightweight syntax which tries to do this one small thing well. Ruffus is available under the permissive MIT free software license. This permits free use and inclusion even within proprietary software. It is good practice to run your pipeline in a temporary, “working” directory away from your original data. Ruffus is a lightweight python module for building computational pipelines. Ruffus requires Python 2.6 or higher or Python 3.0 or higher.
    Starting Price: Free
  • 32
    Seaborn

    Seaborn

    Seaborn

    Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. For a brief introduction to the ideas behind the library, you can read the introductory notes or the paper. Visit the installation page to see how you can download the package and get started with it. You can browse the example gallery to see some of the things that you can do with seaborn, and then check out the tutorials or API reference to find out how. To see the code or report a bug, please visit the GitHub repository. General support questions are most at home on StackOverflow, which has a dedicated channel for seaborn.
  • 33
    waiting

    waiting

    Python Software Foundation

    waiting is a small library for waiting for stuff to happen. It basically waits for a function to return True, in various modes. Waiting is compatible with flux for simulated timelines. The most basic usage is when you have a function you want to wait for. Waiting forever is very simple. If your predicate returns a value, it will be returned as the result of wait(). A timeout parameter can also be specified. When a timeout expires without the predicate being fulfilled, an exception is thrown. Sleeping polls the predicate at a certain interval (by default 1 second). The interval can be changed with the sleep_seconds argument. When waiting for multiple predicates, waiting provides two simple facilities to help aggregate them, any and all. They resemble Python’s built-in any() and all(), except that they don’t call a predicate once it has been satisfied (this is useful when the predicates are inefficient and take time to complete).
    Starting Price: Free
  • 34
    NumPy

    NumPy

    NumPy

    Fast and versatile, the NumPy vectorization, indexing, and broadcasting concepts are the de-facto standards of array computing today. NumPy offers comprehensive mathematical functions, random number generators, linear algebra routines, Fourier transforms, and more. NumPy supports a wide range of hardware and computing platforms, and plays well with distributed, GPU, and sparse array libraries. The core of NumPy is well-optimized C code. Enjoy the flexibility of Python with the speed of compiled code. NumPy’s high level syntax makes it accessible and productive for programmers from any background or experience level. NumPy brings the computational power of languages like C and Fortran to Python, a language much easier to learn and use. With this power comes simplicity: a solution in NumPy is often clear and elegant.
  • 35
    Wijmo

    Wijmo

    GrapeCity

    High-performance JavaScript UI controls for enterprise apps. Build lightweight, high-speed HTML5/JavaScript apps with zero dependencies, fast! New in Wijmo 2022 v1: - Inventory management reference app built in Angular - Website traffic monitoring reference app built in Angular - FlexMap added to GeoDashboard reference app - Rotated Spline Chart - Re-order FlexGrid rows with drag and drop General Features of Wijmo: - CellTemplates for Angular, React and Vue FlexGrids - Use Wijmo’s single set of true JavaScript controls in any framework, including Angular, React, and Vue.js - Use controls as native HTML elements with Wijmo Web Components - Filter, group, sort, and more with FlexGrid, JavaScript’s fastest data grid - Visualize data with 80+ charts and controls - Instantly analyze and aggregate data with OLAP Pivot Grid
    Starting Price: $695 per user per year
  • 36
    parsel

    parsel

    Python Software Foundation

    Parsel is a BSD-licensed Python library to extract and remove data from HTML and XML using XPath and CSS selectors, optionally combined with regular expressions. Create a selector object for the HTML or XML text that you want to parse. Then use CSS or XPath expressions to select elements. CSS is a language for applying styles to HTML documents. It defines selectors to associate those styles with specific HTML elements. XPath is a language for selecting nodes in XML documents, which can also be used with HTML. You can use either CSS or XPath. CSS is usually more readable, but some things can only be done with XPath. Being built atop lxml, parsel selectors support some EXSLT extensions and come with pre-registered namespaces to use in XPath expressions. Parsel selectors allow you to chain selectors, so most of the time you can just select by class using CSS and then switch to XPath when needed.
    Starting Price: Free
  • 37
    DataViewsJS

    DataViewsJS

    GrapeCity

    DataViewsJS enables you to easily and professionally customize the presentation of your data using different layouts, row templates, data fields, calculations, and editing modes that are completely and easily customizable. Leverage the powerful calc engine to perform calculations on any set of JSON data. With our calculation engine, your computing power is optimized for large data and complex calculations. DataViewsJS was designed from the start to be a fast, full-featured, and completely customizable developer tool for efficient data display and editing to meet any of your JavaScript application’s needs. Localized resources for Chinese, Japanese, and Korean are included with DataViewsJS. Additional languages can be easily added by creating your own resource files and setting them with simple script code. Modern data presentation patterns such as those you see on social networks and other sites become easy when you use DataViewsJS interchangeable layouts.
    Starting Price: $999 per year
  • 38
    Flutter

    Flutter

    Google

    Flutter is Google’s UI toolkit for building beautiful, natively compiled applications for mobile, web, and desktop from a single codebase. Paint your app to life in milliseconds with Stateful Hot Reload. Use a rich set of fully-customizable widgets to build native interfaces in minutes. Quickly ship features with a focus on native end-user experiences. Layered architecture allows for full customization, which results in incredibly fast rendering and expressive and flexible designs. Flutter’s widgets incorporate all critical platform differences such as scrolling, navigation, icons and fonts, and your Flutter code is compiled to native ARM machine code using Dart's native compilers. Flutter's hot reload helps you quickly and easily experiment, build UIs, add features, and fix bugs faster. Experience sub-second reload times without losing state on emulators, simulators, and hardware.
  • 39
    dedupe

    dedupe

    dedupe

    Dedupe.io is a powerful tool that learns the best way to find similar rows in your data. Using cutting-edge research in machine learning we quickly and accurately identify matches in your Excel spreadsheet or database—saving you time and money. In today’s world of big data, there’s never been more information available to work with. Unfortunately, all this data is hard to use, especially if it’s been entered by hand or comes from different systems. The simple task of figuring out who is who in a spreadsheet or database can be a daunting, time-consuming task. That’s where Dedupe.io comes in. We developed the best dynamic and scalable solution for de-duplicating and linking datasets, and built a simple step-by-step wizard for anyone to use it.
    Starting Price: $9 per 1,000 rows
  • 40
    Vega-Altair

    Vega-Altair

    Vega-Altair

    The Vega-Altair open-source project is not affiliated with Altair Engineering, Inc. With Vega-Altair, you can spend more time understanding your data and its meaning. Altair’s API is simple, friendly and consistent and built on top of the powerful Vega-Lite visualization grammar. This elegant simplicity produces beautiful and effective visualizations with a minimal amount of code. The key idea is that you are declaring links between data columns and visual encoding channels, such as the x-axis, y-axis, color, etc. The rest of the plot details are handled automatically. Building on this declarative plotting idea, a surprising range of simple to sophisticated plots and visualizations can be created using relatively concise grammar.
    Starting Price: Free
  • 41
    DataMelt

    DataMelt

    jWork.ORG

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, data mining, computational statistics, and data visualization. DataMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. Linear, non-linear and symbolic regression are also available. Neural networks and various data-manipulation methods are integrated using Java API. Elements of symbolic computations using Octave/Matlab scripting are supported. DataMelt is a computational environment for Java platform. It can be used with different programming languages on different operating systems. Unlike other statistical programs, it is not limited to a single programming language. This software combines the world's most-popular enterprise language, Java, with the most popular scripting language used in data science, such as Jython (Python), Groovy, JRuby.
  • 42
    SAS Analytics Pro
    An easy-to-use, yet powerful package for accessing, manipulating, analyzing and presenting information, now available for cloud and on-site deployment. Why piece together niche software packages from different vendors? A consolidated portfolio that includes three of our most popular products, Base SAS, SAS/STAT and SAS/GRAPH, reduces the cost of licensing, maintenance, training, and support while ensuring that consistent information is available across your enterprise. SAS statistical procedures are constantly being updated to reflect the latest advances in statistical methodology. And technical support is provided by experienced master's- and doctorate-level statisticians who deliver a level of service and knowledge not often found with other software vendors. SAS has more than four decades of experience developing statistical analysis software that organizations around the world rely on to provide accurate answers to data questions.
  • 43
    Stata

    Stata

    StataCorp

    Stata is a complete, integrated software package that provides all your data science needs: data manipulation, visualization, statistics, and automated reporting. Stata is fast and accurate. It is easy to learn through the extensive graphical interface yet completely programmable. With Stata's menus and dialogs, you get the best of both worlds. You can easily point and click or drag and drop your way to all of Stata's statistical, graphical, and data management features. Use Stata's intuitive command syntax to quickly execute commands. Whether you enter commands directly or use the menus and dialogs, you can create a log of all actions and their results to ensure the reproducibility and integrity of your analysis. Stata also has complete command-line scripting and programming facilities, including a full matrix programming language. You have access to everything you need to script your analysis or even to create new Stata commands--commands that work just like those shipped with Stata.
    Starting Price: $48.00/6-month/student
  • 44
    Kibana

    Kibana

    Elastic

    Kibana is a free and open user interface that lets you visualize your Elasticsearch data and navigate the Elastic Stack. Do anything from tracking query load to understanding the way requests flow through your apps. Kibana gives you the freedom to select the way you give shape to your data. With its interactive visualizations, start with one question and see where it leads you. Kibana core ships with the classics: histograms, line graphs, pie charts, sunbursts, and more. And, of course, you can search across all of your documents. Leverage Elastic Maps to explore location data, or get creative and visualize custom layers and vector shapes. Perform advanced time series analysis on your Elasticsearch data with our curated time series UIs. Describe queries, transformations, and visualizations with powerful, easy-to-learn expressions.
  • 45
    SparkBeyond

    SparkBeyond

    SparkBeyond

    SparkBeyond Discovery autonomously analyzes complex data, finding answers to business problems where you wouldn't think to look. Seamlessly integrate external data into your search space for a better grip on the real influencers of outcomes, and get a holistic perspective of your business reality. Move analytics projects beyond the experimental by interacting with data and insights in natural language, allowing a deeper alliance between analytics and business stakeholders. Future-proof the hard-won benefits of the analytics process by keeping an evergreen cycle of inputs and outputs. The world shifts, and so should your insights. From time-series to geo-spatial data, auto-connect most data in their raw, granular form, without writing a single line of code. See the bigger picture by integrating our curated database of world knowledge (maps, Wikipedia, demographics) or network of 3rd-party data providers.
  • 46
    VertiPaq Analyzer
    VertiPaq Analyzer is useful to analyze VertiPaq storage structures for a data model in Power BI and Analysis Services Tabular. Added measures for segments and partitions: pageable, resident, refresh date, last access. Analysis Services provides many Dynamic Management Views (DMV) to collect information about memory used by a data model. For example, DISCOVER_OBJECT_MEMORY_USAGE is a DMV that provides information about all the objects in memory. You can use such a DMV also to monitor a Multidimensional instance of Analysis Services. Kasper de Jonge created a sample model (BISM Memory Report) that organizes this data in a hierarchical way, making it easy to find the most expensive databases, tables, and columns on a server. If you want to analyze a particular database, you probably want to look at more detailed information, which are available in other DMVs.
  • 47
    Zuar Portal

    Zuar Portal

    Zuar, Inc.

    The first customer portal built to change as often as your customers' requirements do. Easily create, personalize, update, and maintain a data-driven customer experience with Zuar. Behind every “adoption” problem is a series of issues with UI/UX design, slow response times, limited functionality, manual processes, and a lean overworked team tasked with increasing retention KPIs and OKRs. We know it is essential to provide a consistent, omnichannel experience across various platforms and devices. Zuar is the cornerstone of a data-driven customer experience strategy, which aims to empower businesses by leveraging interconnected data to build trust and transparency. Zuar integrates with business intelligence leaders, like Tableau, Power BI, and ThoughtSpot, and cloud database providers, like Snowflake, Google Cloud, Redshift, and Databricks.
  • 48
    Vista

    Vista

    UberMedia

    A powerful insight-as-a-service platform, Vista provides mobile location data, analytics, and media measurement for businesses of all sizes. The Vista platform offers pre-processed insights for any location, available for export in a variety of user-friendly formats. A powerful insight-as-a-service platform, Vista provides mobile location data, analytics, and media measurement for businesses of all sizes to better understand and capitalize on the behaviors of their consumers and the performance of their competitors. Multi-faceted tools designed to fit the specific needs of unique organizations. Insightful analyses tailored to all levels of analyst capabilities. Aggregated and anonymous data ready to integrate seamlessly into existing workflows. Custom reports and analysis created by our in-house team of data scientists and GIS analysts. Where visitors were seen 30 minutes before or after a visit to a location. Geographic area from which a location generates the majority of its visits.
  • 49
    Secure

    Secure

    Retransform

    Secure is an innovative platform designed to meet the emerging facility management requirements offering landlords and property managers real-time data to enable efficient decision making. With a key focus on Data Analytics, Digital Twin and ESG, Secure captures, collates and manages extensive and diverse information and delivers it in an easy-to-use format. Secure reports on your data aggregated from disparate building systems, providing insights into usage such as energy consumption, occupancy and sanitization. A 3D replica of your building, the Secure digital twin creates real-time remote visibility, providing facility managers complete visibility on operations and maintenance. It drives efficiencies and elevates operational performance. Secure makes it easier to manage your journey to net-zero carbon emissions, using smart technology to manage your energy consumption, providing a holistic view of all your properties from the platform.
  • 50
    Deepnote

    Deepnote

    Deepnote

    Deepnote is building the best data science notebook for teams. In the notebook, users can connect their data, explore, and analyze it with real-time collaboration and version control. Users can easily share project links with team collaborators, or with end-users to present polished assets. All of this is done through a powerful, browser-based UI that runs in the cloud. We built Deepnote because data scientists don't work alone. Features: - Sharing notebooks and projects via URL - Inviting others to view, comment and collaborate, with version control - Publishing notebooks with visualizations for presentations - Sharing datasets between projects - Set team permissions to decide who can edit vs view code - Full linux terminal access - Code completion - Automatic python package management - Importing from github - PostgreSQL DB connection
    Starting Price: Free