Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Machine Learning Software
Search Results

Search Results for "python data analysis" - Page 17

x

Sort By:

Relevance

Clear All Filters

OS

Windows 433
Linux 417
Mac 407
More...
BSD 134
ChromeOS 132
Desktop Operating Systems 1
Embedded Operating Systems 1
Mobile Operating Systems 1

Category

Artificial Intelligence 433
Software Development 66
Business 48
Scientific/Engineering 41
Multimedia 13
Education 8
System 8
Internet 4
Database 2
Formats and Protocols 2
Communications 1
Productivity 1
Social sciences 1
Text Editors 1

License

OSI-Approved Open Source 382
Creative Commons Attribution License 6
GNU Free Documentation License 3

Translations

English 17
Brazilian Portuguese 1
Chinese (Simplified) 1
French 1
More...
German 1
Russian 1
Vietnamese 1

Programming Language

Python 315
C++ 26
Java 16
MATLAB 10
More...
TypeScript 7
C# 5
JavaScript 4
Julia 4
C 3
R 3
Rust 3
Go 2
F# 1
PL/SQL 1
Prolog 1
Scala 1
Unix Shell 1

Status

Beta 19
Production/Stable 18
Pre-Alpha 5
Alpha 5
More...
Planning 1
Mature 1
Inactive 1

Showing 433 open source projects for "python data analysis"

View related business solutions

Machine Learning Windows Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

JCLTP

A Java Class Library for Text Processing

JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...

Downloads: 0 This Week

Last Update: 2017-01-06
See Project
2

Spark Python Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis

Spark Python Notebooks is a curated collection of example Jupyter notebooks designed to help developers and data engineers learn Apache Spark using Python in an interactive environment. Rather than only providing static code files, this project uses notebooks to teach practical data processing workflows, exposing users to real Spark programming patterns like working with RDDs, DataFrames, and distributed computations.

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
3

Mass-based dissimilarity

A data dependent dissimilarity measure based on mass estimation.

This software calculates the mass-based dissimilarity matrix for data mining algorithms relying on a distance measure. References: Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure. KDD 2016 http://dx.doi.org/10.1145/2939672.2939779 The source code, presentation slide and poster are attached under "Files". The presentation video in KDD 2016 is published on https://youtu.be/eotD_-SuEoo . Since this software is licensed...

Downloads: 0 This Week

Last Update: 2018-02-26
See Project
4

ExSTraCS

Extended Supervised Tracking and Classifying System

...ExSTraCS combines a number of recent advancements into a single algorithmic platform. It can flexibly handle (1) discrete or continuous attributes, (2) missing data, (3) balanced or imbalanced datasets, and (4) binary or many classes. A complete users guide for ExSTraCS is included. Coded in Python 2.7.

1 Review

Downloads: 2 This Week

Last Update: 2015-11-04
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
5

Accelerated Feature Extraction Tool

A fast GPU accelerated feature extraction software for speech analysis

A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.

1 Review

Downloads: 0 This Week

Last Update: 2015-05-25
See Project
6

IMAGINE

Biological image viewer and processor

Detection, enumeration, and sizing of biological organisms by image analysis.

Downloads: 1 This Week

Last Update: 2015-03-18
See Project
7

Chordalysis

Log-linear analysis (data modelling) for high-dimensional data

===== Project moved to https://github.com/fpetitjean/Chordalysis ===== Log-linear analysis is the statistical method used to capture multi-way relationships between variables. However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. ...

Downloads: 0 This Week

Last Update: 2015-01-29
See Project
8

MODLEM

rule-based, WEKA compatible, Machine Learning algorithm

This project is a WEKA (Waikato Environment for Knowledge Analysis) compatible implementation of MODLEM - a Machine Learning algorithm which induces minimum set of rules. These rules can be adopted as a classifier (in terms of ML). It is a sequential covering algorithm, which was invented to cope with numeric data without discretization. Actually the nominal and numeric attributes are treated in the same way: attribute's space is being searched to find the best rule condition during rule induction. ...

1 Review

Downloads: 14 This Week

Last Update: 2015-01-28
See Project
9

marsyas

Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides an general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas

6 Reviews

Downloads: 4 This Week

Last Update: 2014-11-25
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

FineSplice

Enhanced splice junction detection and estimation from RNA-Seq data

FineSplice is a Python wrapper to TopHat2 geared towards a reliable identification of expressed exon junctions from RNA-Seq data, at enhanced detection precision with small loss in sensitivity. Following alignment with TopHat2 using known transcript annotations, FineSplice takes as input the resulting BAM file and outputs a confident set of expressed splice junctions with the corresponding read counts.

Downloads: 0 This Week

Last Update: 2014-04-01
See Project
11

neural network designer

a dbms for neural nets. Chatbots, DTrees, random forests, n-grams,...

...Do natural language processing, image or data analysis & interpretation,...

Downloads: 0 This Week

Last Update: 2017-03-07
See Project
12

ProximityForest

Efficient Approximate Nearest Neighbors for General Metric Spaces

A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer...

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
13

Matlab Community Detection Toolbox

CDTB is a MATLAB toolbox which performs Community Detection

We present the Community Detection Toolbox (CDTB), a MATLAB toolbox which can be used to perform community detection. The CDTB contains several functions from the following categories. 1. graph generators; 2. clustering algorithms; 2. cluster number selection functions; 4. clustering evaluation functions. Furthermore, CDTB is designed in a parametric manner so that the user can add his own functions and extensions. The CDTB can be used in at least three ways. The user can employ...

Downloads: 0 This Week

Last Update: 2014-03-14
See Project
14

DocCO

Non-disjoint groupping of Documents based on word sequence approach

This is a GUI for learning non disjoint groups of documents based on Weka machine learning framework. It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.

Downloads: 0 This Week

Last Update: 2013-08-17
See Project
15

feed4weka

feed4weka is an open library that enriches weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds the coclustering and outlier detection frameworks

Downloads: 0 This Week

Last Update: 2013-07-01
See Project
16

vbFRET

This is a Matlab software package for single molecule FRET data analysis.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-24
See Project
17

AdPreqFr4SL

Adaptive Prequential Learning Framework

The AdPreqFr4SL learning framework for Bayesian Network Classiﬁers is designed to handle the cost / performance trade-oﬀ and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we...

Downloads: 0 This Week

Last Update: 2012-12-10
See Project
18

pyIRDG

IMDb Relational Dataset Generator

pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as backend. A graphical user interface written in pyQt allows the user to link multiple entities together as model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file.

Downloads: 0 This Week

Last Update: 2014-03-09
See Project
19

Pronac MediaMonkey Extension

Recommends music based upon your current taste.

A music recommendation engine. It is meant to be an add-on for popular media players like Winamp, Amarok, Rhythmbox or Banshee. Currently supports only MediaMonkey Player. Downlaod, extract and run "pronac.exe". Play the first song from the Now Playing list, it'll recommend you next songs from the same list. NOTE: MAKE SURE THAT SONG SHUFFLE IS TURNED OFF WHILE USING PRONAC. Based upon K-Nearest Neighbor Machine Learning Algorithm, K-Fold Cross Validation and EchoNest for audio features.

Downloads: 0 This Week

Last Update: 2016-03-25
See Project
20

CRFSharp

CRFSharp is a .NET(C#) implementation of Conditional Random Field

CRFSharp(aka CRF#) is a .NET(C#) implementation of Conditional Random Fields, an machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Process (NLP) tasks, for example: word breaker, postagging, named entity recognized, query chunking and so on. CRF#'s mainly algorithm is the same as CRF++ written by Taku Kudo. It encodes model parameters by L-BFGS. Moreover, it has many significant improvement than CRF++, such as totally...

Downloads: 0 This Week

Last Update: 2015-08-03
See Project
21

RapidMiner Feature Selection Extension

This RapidMiner-plugin consists of operators for feature selection and classification - mainly on high-dimensional (microarray-) data - and some helper-classes/operators.

2 Reviews

Downloads: 0 This Week

Last Update: 2015-08-01
See Project
22

BIL++

BIL++ is a set of standalone C++ packages for data processing in Bioinformatics (Graph mining, Bayesian networks, Genetic algorithm, Discretization, Gene expression data analysis, Hypothesis testing).

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
23

MLTree

A Machine Learning and Data Retrieval Framework

Downloads: 0 This Week

Last Update: 2014-06-29
See Project
24

AODiagrams

Content Addressable Memory, Multi-Variate Statistics, Data Mining Includes analyzing datasets, extracting patterns, creating empirical expert system. Computes joint probabilities and implements a "belief" as the solution of an equilibrium equation

Downloads: 0 This Week

Last Update: 2014-06-26
See Project
25

BCI Project Triathlon

A three-step approach towards experimental brain-computer-interfaces, based on the OCZ nia device for EEG-data acquisition and artificial neural networks for signal-interpretation.

1 Review

Downloads: 0 This Week

Last Update: 2016-09-08
See Project

Previous
13
14
15
16
You're on page 17
18
Next

Related Searches

predictive text

mass based dissimilarity

heart disease prediction system in python

mfcc

image analysis

project weka

marsyas

anomaly detection

aiml

social network analysis in matlab

Related Categories

Artificial Intelligence

Software Development

Business

Scientific/Engineering

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise