Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Machine Learning Software
Search Results

Search Results for "python data analysis" - Page 16

x

Sort By:

Relevance

Clear All Filters

OS

Mac 412
Linux 411
Windows 407
More...
BSD 136
ChromeOS 132
Desktop Operating Systems 1

Category

Artificial Intelligence 412
Software Development 61
Business 44
Scientific/Engineering 31
Multimedia 11
Education 9
System 8
Formats and Protocols 2
Internet 2
Communications 1
Database 1
Productivity 1
Social sciences 1
Text Editors 1

License

OSI-Approved Open Source 362
Creative Commons Attribution License 5
GNU Free Documentation License 3

Translations

English 12
Brazilian Portuguese 1
Chinese (Simplified) 1
German 1
More...
Russian 1

Programming Language

Python 300
C++ 23
Java 13
MATLAB 8
More...
TypeScript 7
JavaScript 5
Julia 4
C# 3
R 3
Rust 3
C 2
Go 2
F# 1
PL/SQL 1
Prolog 1
Scala 1
Unix Shell 1

Status

Beta 17
Production/Stable 14
Pre-Alpha 3
Alpha 3
More...
Planning 1
Mature 1

Showing 412 open source projects for "python data analysis"

View related business solutions

Machine Learning Mac Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
1

Tangent

Source-to-source debuggable derivatives in pure Python

Existing libraries implement automatic differentiation by tracing a program's execution (at runtime, like PyTorch) or by staging out a dynamic data-flow graph and then differentiating the graph (ahead-of-time, like TensorFlow). In contrast, Tangent performs ahead-of-time autodiff on the Python source code itself, and produces Python source code as its output. Tangent fills a unique location in the space of machine learning tools. As a result, you can finally read your automatic derivative code just like the rest of your program. ...

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
2

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative (DOCX) . Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem. We are iBooker, a large open-source community,...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
3

auto_ml

Automated machine learning for analytics & production

auto_ml is designed for production. Here's an example that includes serializing and loading the trained model, then getting predictions on single dictionaries, roughly the process you'd likely follow to deploy the trained model. Before you go any further, try running the code. Load up some data (either a DataFrame, or a list of dictionaries, where each dictionary is a row of data). Make a column_descriptions dictionary that tells us which attribute name in each row represents the value we’re...

Downloads: 0 This Week

Last Update: 2022-08-12
See Project
4

bulbea

Deep Learning based Python Library for Stock Market Prediction

bulbea is an open-source Python library designed for financial analysis and stock market prediction using machine learning and deep learning techniques. The library provides tools for retrieving financial time series data, preprocessing market data, and training predictive models that estimate future price movements. bulbea integrates common machine learning frameworks such as TensorFlow and Keras to build neural network models capable of learning patterns in historical financial data.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

SPAWNN

SPatial Analysis With self-organizing Neural Networks

The SPAWNN toolkit is an innovative toolkit for spatial analysis with self-organizing neural networks which is particularily useful for spatial analysis, visualization and geographical data mining. To run the toolkit, simply download and execute (double-click) the jar-file. Please cite: - Hagenauer, J., & Helbich, M. (2016). SPAWNN: A Toolkit for SPatial Analysis With Self-Organizing Neural Networks.

Downloads: 1 This Week

Last Update: 2017-07-12
See Project
6

H2O-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning

H2O-3 is an open-source machine learning platform designed to build scalable and distributed machine learning models across large datasets. The system operates as an in-memory computing platform that allows data scientists to train models quickly using distributed resources. It supports many machine learning algorithms including generalized linear models, gradient boosting machines, deep learning networks, and ensemble techniques. The platform provides interfaces for multiple programming languages such as Python, R, Java, and Scala, making it accessible to a wide range of developers and data scientists. ...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
7

PyDaMelo

Python-compatible Data mining elementary objects

An attempt at offering machine learning and data mining algorithms at the finest grain we are able to, easy to combine together through Python scripting to glue together the Lego-like bricks.

Downloads: 0 This Week

Last Update: 2019-02-19
See Project
8

Gait-CAD (Data Mining for MATLAB)

All future developments will be implemented in the new MATLAB toolbox SciXMiner, please visit https://sourceforge.net/projects/scixminer/ to download the newest version. The former Matlab toolbox Gait-CAD was designed for the visualization and analysis of time series and features with a special focus to data mining problems including classification, regression, and clustering.

Downloads: 0 This Week

Last Update: 2017-03-13
See Project
9

Python Machine Learning book

The book code repository and info resource

What you can expect are 400 pages rich in useful material just about everything you need to know to get started with machine learning. From theory to the actual code that you can directly put into action! This is not yet just another "this is how scikit-learn works" book. I aim to explain all the underlying concepts, tell you everything you need to know in terms of best practices and caveats, and we will put those concepts into action mainly using NumPy, scikit-learn, and Theano. This is not...

Downloads: 0 This Week

Last Update: 2021-05-20
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
10

JCLTP

A Java Class Library for Text Processing

JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...

Downloads: 0 This Week

Last Update: 2017-01-06
See Project
11

Spark Python Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis

Spark Python Notebooks is a curated collection of example Jupyter notebooks designed to help developers and data engineers learn Apache Spark using Python in an interactive environment. Rather than only providing static code files, this project uses notebooks to teach practical data processing workflows, exposing users to real Spark programming patterns like working with RDDs, DataFrames, and distributed computations.

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
12

Mass-based dissimilarity

A data dependent dissimilarity measure based on mass estimation.

This software calculates the mass-based dissimilarity matrix for data mining algorithms relying on a distance measure. References: Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure. KDD 2016 http://dx.doi.org/10.1145/2939672.2939779 The source code, presentation slide and poster are attached under "Files". The presentation video in KDD 2016 is published on https://youtu.be/eotD_-SuEoo . Since this software is licensed...

Downloads: 0 This Week

Last Update: 2018-02-26
See Project
13

ExSTraCS

Extended Supervised Tracking and Classifying System

...ExSTraCS combines a number of recent advancements into a single algorithmic platform. It can flexibly handle (1) discrete or continuous attributes, (2) missing data, (3) balanced or imbalanced datasets, and (4) binary or many classes. A complete users guide for ExSTraCS is included. Coded in Python 2.7.

1 Review

Downloads: 2 This Week

Last Update: 2015-11-04
See Project
14

Accelerated Feature Extraction Tool

A fast GPU accelerated feature extraction software for speech analysis

A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.

1 Review

Downloads: 0 This Week

Last Update: 2015-05-25
See Project
15

IMAGINE

Biological image viewer and processor

Detection, enumeration, and sizing of biological organisms by image analysis.

Downloads: 1 This Week

Last Update: 2015-03-18
See Project
16

Chordalysis

Log-linear analysis (data modelling) for high-dimensional data

===== Project moved to https://github.com/fpetitjean/Chordalysis ===== Log-linear analysis is the statistical method used to capture multi-way relationships between variables. However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. ...

Downloads: 0 This Week

Last Update: 2015-01-29
See Project
17

MODLEM

rule-based, WEKA compatible, Machine Learning algorithm

This project is a WEKA (Waikato Environment for Knowledge Analysis) compatible implementation of MODLEM - a Machine Learning algorithm which induces minimum set of rules. These rules can be adopted as a classifier (in terms of ML). It is a sequential covering algorithm, which was invented to cope with numeric data without discretization. Actually the nominal and numeric attributes are treated in the same way: attribute's space is being searched to find the best rule condition during rule induction. ...

1 Review

Downloads: 14 This Week

Last Update: 2015-01-28
See Project
18

marsyas

Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides an general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas

6 Reviews

Downloads: 4 This Week

Last Update: 2014-11-25
See Project
19

KMeansAniX

Animation of kmeans clustering using X Window System

Open source animation of kmeans clustering in X Window System using the C++ libplotter library. Supports Linux, Mac, and BSD. Includes common initialization methods such as Forgy, Macqueen, random, and angular. Sample videos are available through the Files Tab above. The SVN repo is accessible thorugh the Code Tab above. Requires a C++ compiler, libplot-dev, and libncurses5-dev Mac alternative to libplot-dev: macports plotutils +x11

Downloads: 0 This Week

Last Update: 2014-12-08
See Project
20

FineSplice

Enhanced splice junction detection and estimation from RNA-Seq data

FineSplice is a Python wrapper to TopHat2 geared towards a reliable identification of expressed exon junctions from RNA-Seq data, at enhanced detection precision with small loss in sensitivity. Following alignment with TopHat2 using known transcript annotations, FineSplice takes as input the resulting BAM file and outputs a confident set of expressed splice junctions with the corresponding read counts.

Downloads: 0 This Week

Last Update: 2014-04-01
See Project
21

ProximityForest

Efficient Approximate Nearest Neighbors for General Metric Spaces

A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer...

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
22

feed4weka

feed4weka is an open library that enriches weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds the coclustering and outlier detection frameworks

Downloads: 0 This Week

Last Update: 2013-07-01
See Project
23

vbFRET

This is a Matlab software package for single molecule FRET data analysis.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-24
See Project
24

AdPreqFr4SL

Adaptive Prequential Learning Framework

The AdPreqFr4SL learning framework for Bayesian Network Classiﬁers is designed to handle the cost / performance trade-oﬀ and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we...

Downloads: 0 This Week

Last Update: 2012-12-10
See Project
25

pyIRDG

IMDb Relational Dataset Generator

pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as backend. A graphical user interface written in pyQt allows the user to link multiple entities together as model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file.

Downloads: 0 This Week

Last Update: 2014-03-09
See Project

Previous
12
13
14
15
You're on page 16
17
Next

Related Searches

ai

self organizing maps

statistics and machine learning toolbox in matlab

predictive text

mass based dissimilarity

heart disease prediction system in python

mfcc

image analysis

project weka

marsyas

Related Categories

Artificial Intelligence

Software Development

Business

Scientific/Engineering

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise