Search Results for "python data analysis"

Showing 18 open source projects for "python data analysis"

1

Smile

Statistical machine intelligence and learning engine

Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers state-of-the-art performance. According to a third-party benchmark cited by the project, Smile significantly outperforms R, Python, Spark, H2O, and xgboost, running a couple of times faster than the closest competitor. Its memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster?

Downloads: 5 This Week

Last Update: 2026-03-29
2

Tribuo

Tribuo - A Java machine learning library

...Provenance data allows each model to be rebuilt verbatim from scratch and for evaluations to track the models and datasets used for each experiment.

Downloads: 1 This Week

Last Update: 2025-04-03
3

Ceka

Crowd Environment and its Knowledge Analysis

A knowledge analysis tool for crowdsourcing based on Weka. We also have a Python version of Crowdsourcing Learning: CrowdwiseKit on GitHub (https://github.com/tssai-lab/CrowdwiseKit).

Downloads: 1 This Week

Last Update: 2023-04-20
4

ModelDB

Open Source ML Model Versioning, Metadata, and Experiment Management

An open-source system for machine learning model versioning, metadata, and experiment management. ModelDB versions machine learning models together with their ingredients (code, data, config, and environment) and tracks ML metadata across the model lifecycle.

Downloads: 1 This Week

Last Update: 2024-08-15
5

DSTK - DataScience ToolKit

DSTK - DataScience ToolKit for All of Us

DSTK - DataScience ToolKit is free, open-source software for statistical analysis, data visualization, text analysis, and predictive analytics. A newer version with a smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straightforward and easy to use, and familiar to SPSS users. While JASP offers more statistical features, DSTK aims to be a broad workbench that also includes text analysis and predictive analytics features. ...

Downloads: 0 This Week

Last Update: 2018-05-08
6

H2O-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning

H2O-3 is an open-source machine learning platform designed to build scalable and distributed machine learning models across large datasets. The system operates as an in-memory computing platform that allows data scientists to train models quickly using distributed resources. It supports many machine learning algorithms including generalized linear models, gradient boosting machines, deep learning networks, and ensemble techniques. The platform provides interfaces for multiple programming languages such as Python, R, Java, and Scala, making it accessible to a wide range of developers and data scientists. ...

Downloads: 0 This Week

Last Update: 2026-03-15
7

JCLTP

A Java Class Library for Text Processing

JCLTP is a class library designed for processing text. JCLTP is free, open source, and developed in the Java programming language. It is distributed under the GNU license. It incorporates several technologies that enable it to process information while applying AI techniques, in order to build predictive models for text classification. A flexible structure of interfaces and classes makes it possible to extend, adapt, and add functionality to JCLTP. Thus, analysis of new types...

Downloads: 0 This Week

Last Update: 2017-01-06
8

Chordalysis

Log-linear analysis (data modelling) for high-dimensional data

===== Project moved to https://github.com/fpetitjean/Chordalysis ===== Log-linear analysis is the statistical method used to capture multi-way relationships between variables. However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. ...

Downloads: 0 This Week

Last Update: 2015-01-29
9

MODLEM

rule-based, WEKA compatible, Machine Learning algorithm

This project is a WEKA (Waikato Environment for Knowledge Analysis) compatible implementation of MODLEM, a machine learning algorithm that induces a minimal set of rules. These rules can be used as a classifier. It is a sequential covering algorithm, invented to cope with numeric data without discretization. Nominal and numeric attributes are treated in the same way: each attribute's space is searched to find the best rule condition during rule induction. ...

1 Review

Downloads: 14 This Week

Last Update: 2015-01-28
10

Flamingo Project

Workflow Designer, Hive Editor, Pig Editor, File System Browser

Flamingo is an open-source Big Data platform that combines an Ajax rich web interface, workflow engine, workflow designer, MapReduce, Hive editor, and Pig editor. 1. An easy tool for big data. 2. Comfortable to use with Hadoop ecosystem projects. 3. Licensed under GPL v3. It supports a Pig IDE, Hive IDE, HDFS browser, scheduler, Hadoop job monitoring, workflow engine, workflow designer, and MapReduce.

3 Reviews

Downloads: 0 This Week

Last Update: 2016-11-29
11

DocCO

Non-disjoint grouping of documents based on a word sequence approach

This is a GUI for learning non-disjoint groups of documents, based on the Weka machine learning framework. It supports non-disjoint clustering of documents using both vectorial and sequential representations (a word sequence approach based on the WSK kernel). All data formats supported by WEKA can be used in DocCO. Data can be loaded from files, from databases, or from a specified URL. All the preprocessing techniques implemented in WEKA can be applied before learning.

Downloads: 0 This Week

Last Update: 2013-08-17
12

feed4weka

feed4weka is an open library that enriches Weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds co-clustering and outlier detection frameworks.

Downloads: 0 This Week

Last Update: 2013-07-01
13

AdPreqFr4SL

Adaptive Prequential Learning Framework

The AdPreqFr4SL learning framework for Bayesian Network Classifiers is designed to handle the cost / performance trade-off and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we...

Downloads: 0 This Week

Last Update: 2012-12-10
14

MLTree

A Machine Learning and Data Retrieval Framework

Downloads: 0 This Week

Last Update: 2014-06-29
15

Java Neural Modeling Framework new GUI

A program for performing the complete cycle of neural network analysis: preparing data, choosing a neural network (CasCor, MP, LogRegression, PNN), training the network, monitoring the learning state, ROC analysis, and optimizing network parameters using a GA.

Downloads: 0 This Week

Last Update: 2015-07-04
16

Blunder

Blunder is an automated tool for analyzing chained exceptions in Java. It is useful for classifying exceptions and generating a customized error message and a list of possible solutions.

Downloads: 0 This Week

Last Update: 2013-04-26
17

Data Mining Platform

Data Mining Platform is a platform for data mining and analysis. It contains many new and sophisticated methods such as kernel-based classification, two-way clustering, Bayesian networks, pattern recognition for time series analysis and many other

Downloads: 0 This Week

Last Update: 2013-04-18
18

Cinefile

A category-based approach to exploring film data.

...It allows the user to identify abstract categories of films by providing examples of category members, learns to classify films as belonging or not belonging to those categories, and provides a graphical interface for exploring and comparing categories. Cinefile is designed to work with data retrieved from the Internet Movie Database (imdb.com). This data is used for classification and is the subject of the category-based analysis. Cinefile was developed by the University of Mary Washington's Computer Science department (http://cas.umw.edu/computerscience).

Downloads: 0 This Week

Last Update: 2016-11-18



© 2026 Slashdot Media. All Rights Reserved.
