Showing 1214 open source projects for "python data analysis"

  • 1
    The FAKE GAME tool uses natural evolution to evolve data mining models. It incorporates several preprocessing, optimization, and visualization methods that aim to streamline the knowledge discovery process. Knowledge extraction from data is being automated!
    Downloads: 2 This Week
  • 2
    This is a MATLAB software package for single-molecule FRET data analysis.
    Downloads: 1 This Week
  • 3

    AdPreqFr4SL

    Adaptive Prequential Learning Framework

    The AdPreqFr4SL learning framework for Bayesian Network Classifiers is designed to handle the cost / performance trade-off and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we...
    Downloads: 0 This Week
  • 4
    Darwin 2: Java Framework for Evolutionary Computation (genetic algorithm, GA). A true framework with out-of-the-box functionality and extensibility of all classes. Interface-based pattern with dependency-injection to configure components.
    Downloads: 0 This Week
  • 5
    pyIRDG

    IMDb Relational Dataset Generator

    pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database, with IMDbPY as the backend. A graphical user interface written in PyQt allows the user to link multiple entities together as a model for the generation process. The big four entities are Title, Person, Company, and Character. Many attributes can be selected for inclusion in the output .pl file.
    Downloads: 0 This Week
  • 6
    JBoost is a simple, robust system for classification. JBoost contains implementations of several boosting algorithms in an alternating decision tree framework. In addition, JBoost provides extensible software for adding more learning algorithms.
    Downloads: 0 This Week
  • 7
    CRFSharp

    CRFSharp is a .NET (C#) implementation of Conditional Random Fields

    CRFSharp (aka CRF#) is a .NET (C#) implementation of Conditional Random Fields, a machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Processing (NLP) tasks, for example: word breaking, POS tagging, named entity recognition, query chunking, and so on. CRF#'s main algorithm is the same as that of CRF++, written by Taku Kudo. It estimates model parameters with L-BFGS. Moreover, it has many significant improvements over CRF++, such as totally...
    Downloads: 0 This Week
  • 8

    Bamboo Engine

    Game framework on top of Python, Panda3D and Twisted

    Bamboo intends to be a complete end-to-end game framework for client/server applications, written in Python and using Twisted for data exchange and Panda3D for rendering. Support for PyPy/CPython may be considered at a later point. An Extreme/Agile development model is in use to allow for emergent design (i.e., changing requirements). The release is updated whenever a feature is added and 100% of tests pass cleanly.
    Downloads: 0 This Week
  • 9
    Open Metaheuristic (oMetah) is a library aimed at the design and rigorous testing of metaheuristics (e.g. genetic algorithms, simulated annealing, ...). The code design is separated into components: algorithms, problems, and a test report generator.
    Downloads: 0 This Week
  • 10

    SURF-nanodots

    Very basic computer vision program

    This was my first big programming project. I'd like to clean it up and improve the back-end design when time permits. See the wiki for known issues and more information. A computer vision program for analysis of magnetic data collected by a scanning probe microscope. It originated in summer 2007 as a collection of C files compiled for MATLAB (MEX) and was eventually ported to a standalone C++ application with a GUI created in Qt. This program takes atomic and magnetic force microscope (AFM/MFM) image pairs as input and uses threshold segmentation to identify magnetic nanodots by intensity in the AFM image. ...
    Downloads: 0 This Week
  • 11
    Febrl (Freely Extensible Biomedical Record Linkage) does data standardisation (segmentation and cleaning) and probabilistic record linkage ("fuzzy" matching) of one or more files or data sources which do not share a unique record key or identifier.
    Downloads: 2 This Week
  • 12
    PyBact is open source software written in Python for bacterial identification. The software generates a simulated data matrix that accurately represents the probabilistic positive/negative results of the tested biochemical tests.
    Downloads: 0 This Week
  • 13
    A toolkit for the optical recognition of 19th-century Psaltiki music notation. It is based on and requires the Gamera document image analysis framework (http://gamera.sf.net).
    Downloads: 0 This Week
  • 14
    RapidMiner Feature Selection Extension
    This RapidMiner plugin consists of operators for feature selection and classification, mainly on high-dimensional (microarray) data, along with some helper classes and operators.
    Downloads: 0 This Week
  • 15
    Ferda is a user-friendly data mining tool. It is a modular, distributed, multiplatform framework based on the Internet Communications Engine. Ferda is particularly strong at working with association rules.
    Downloads: 5 This Week
  • 16
    Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
    Downloads: 0 This Week
  • 17
    Sciengy RPF!
    Windows application for data mining with self-organizing neural networks. It works with text data files and has a convenient user interface. It solves data mining tasks of class Recognition, continuous-value Prediction, and time-series Forecast.
    Downloads: 0 This Week
  • 18
    Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for the purposes of data-set generation, testing, or visualization, optionally hooking it to your own application.
    Downloads: 0 This Week
  • 19
    A Java library for the processing and analysis of natural language texts and other sequential data. The focus is on unsupervised modeling with simple statistical methods as well as implementations of more complex algorithms.
    Downloads: 0 This Week
  • 20
    Content Addressable Memory, Multi-Variate Statistics, Data Mining. Includes analyzing datasets, extracting patterns, and creating empirical expert systems. Computes joint probabilities and implements a "belief" as the solution of an equilibrium equation.
    Downloads: 0 This Week
  • 21
    Monte is machine learning in pure Python. Monte's focus is the construction of gradient-based learning machines from many small components.
    Downloads: 0 This Week
  • 22
    Contextor
    Contextor is a lightweight, simple-to-use Java-based library to help developers and researchers work with the general concept of a resource; for example, resources can be text resources, web resources, images, and videos.
    Downloads: 0 This Week
  • 23
    Open data mining platform. Provides common architecture for algorithms of various types. Efficient processing of arbitrarily large volumes of data thanks to data streaming. Weka and Rseslib partially integrated. (www.debellor.org)
    Downloads: 0 This Week
  • 24
    Onyx is for rapid prototyping and large-scale experimentation on advanced machine-learning algorithms with an emphasis on algorithms for online or streaming analysis, modeling, and classification.
    Downloads: 0 This Week
  • 25
    BCI Project Triathlon
    A three-step approach towards experimental brain-computer interfaces, based on the OCZ nia device for EEG data acquisition and artificial neural networks for signal interpretation.
    Downloads: 0 This Week