Showing 823 open source projects for "python data analysis"

  • 1
    FineSplice

    Enhanced splice junction detection and estimation from RNA-Seq data

    FineSplice is a Python wrapper to TopHat2 geared towards a reliable identification of expressed exon junctions from RNA-Seq data, at enhanced detection precision with small loss in sensitivity. Following alignment with TopHat2 using known transcript annotations, FineSplice takes as input the resulting BAM file and outputs a confident set of expressed splice junctions with the corresponding read counts. Potential false positives arising from spurious alignments are filtered out via a semi...
    Downloads: 0 This Week
  • 2
    Matlab Community Detection Toolbox

    CDTB is a MATLAB toolbox which performs Community Detection

    ... the functions from the MATLAB command line; or they can write their own code, incorporating the CDTB functions; or they can use the Graphical User Interface (GUI), which automates the community detection and includes some data visualization options.
    Downloads: 0 This Week
  • 3

    ktree

    clustering, machine learning, algorithms

    This project has moved to GitHub at http://lmwtree.devries.ninja.
    Downloads: 0 This Week
  • 4
    Fast Artificial Neural Network Library is a free open source neural network library, which implements multilayer artificial neural networks in C with support for both fully connected and sparsely connected networks. Cross-platform execution in both fixed and floating point are supported. It includes a framework for easy handling of training data sets. It is easy to use, versatile, well documented, and fast. Bindings to more than 15 programming languages are available. An easy to read...
    Downloads: 14 This Week
  • 5
    Unsupervised TXT classifier

    Classify any two TXT documents, no training required - JAVA

    This program addresses the two most common issues with known classification algorithms: first, over-training, and second, a shortage of data for training categories. Instead, each TXT file is a category on its own, rather than an assigned category. In a way, this is similar to clustering, but it is not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (let's call them A and B). Then, the summarizer gets...
    Downloads: 0 This Week
  • 6

    SocialModeler

    A set of tools for analyzing open source social media

    SocialModeler leverages natural language processing and statistical text analysis approaches to quickly analyze and explore social media data (e.g. news articles or blogs). It uses an application-based user interface for configuration and analysis.
    Downloads: 0 This Week
  • 7

    Fish4Knowledge Project

    Analysis of undersea fish videos

    The Fish4knowledge project investigated: information abstraction and storage methods for analyzing undersea video data (from 10E+15 pixels to 10E+12 units of information), machine and human vocabularies for detecting & describing fish, flexible process architectures to process the data and scientific queries and effective specialised user query interfaces. A combination of computer vision, database storage, workflow and human computer interaction methods were used to achieve...
    Downloads: 0 This Week
  • 8

    iget

    Adaptive Parallel Data Retrieval Mechanism

    Adaptive Synchronous-Retrieval mechanism with Concurrent I/O using Reinforcement Learning. A data retrieval mechanism that can adapt to the continuous contraction and expansion of the network bottleneck so that an optimal concurrency index can be maintained at any time during the data retrieval process. Sample usage: python iget.py <target url> <output file>
    Downloads: 0 This Week
  • 9
    LExAu: Learning Expectations Autonomously. Library for on-line data driven statistical machine learning.
    Downloads: 0 This Week
  • 10
    RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. Full installation and usage instructions are given at http://sourceforge.net/p/rnnl/wiki/Home/
    Downloads: 0 This Week
  • 11
    DocCO

    Non-disjoint grouping of documents based on a word-sequence approach

    This is a GUI for learning non-disjoint groups of documents, based on the Weka machine learning framework. It supports non-disjoint clustering of documents using both vectorial and sequential representations (a word-sequence approach based on the WSK kernel). All data formats supported by WEKA can be used in DocCO. Data can be loaded from files, from databases, or from a specified URL. All the preprocessing techniques implemented in WEKA can be used before performing...
    Downloads: 0 This Week
  • 12
    Research and development of an ontology for the analysis and representation of comparative data for use in evolutionary analysis. For more information, please see www.evolutionaryontology.org
    Downloads: 0 This Week
  • 13
    feed4weka is an open library that enriches weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds the coclustering and outlier detection frameworks
    Downloads: 0 This Week
  • 14

    EMGU Face Recognition

    Using EMGU to perform Principal Component Analysis (PCA)

    ... by Sergio Andrés Gutiérrez Rojas and his original article (codeproject). Face recognition is popular not only because of its real-world applications but also because of its common use of principal component analysis (PCA). PCA is an ideal method for recognising statistical patterns in data. Face recognition is also popular because a user can apply a method easily and see whether it is working without needing to know too much about how the process works.
    Downloads: 13 This Week
  • 15
    A High-Order Multi-Variate Approximation Scheme for Arbitrary Data Sets, C implementation of the method described in http://web.mit.edu/qiqi/www/paper/interpolation.pdf, with Python and Fortran interfaces.
    Downloads: 0 This Week
  • 16
    MyNook

    A machine learning system for supervised document classification

    An open source system for supervised document classification based on statistical machine learning techniques. Unlike state-of-the-art classification techniques, MyNook requires only the title of the document, not the content itself.
    Downloads: 0 This Week
  • 17
    JProGraM (PRObabilistic GRAphical Models in Java) is a statistical machine learning library. It supports statistical modeling and data analysis along three main directions: (1) probabilistic graphical models (Bayesian networks, Markov random fields, dependency networks, hybrid random fields); (2) parametric, semiparametric, and nonparametric density estimation (Gaussian models, nonparanormal estimators, Parzen windows, Nadaraya-Watson estimator); (3) generative models for random networks (small...
    Downloads: 0 This Week
  • 18
    VDM.net So how can I call you My Dear Project?
    Downloads: 0 This Week
  • 19
    The FAKE GAME tool uses natural evolution to evolve Data Mining models. It incorporates several preprocessing, optimization and visualization methods aimed to streamline the Knowledge Discovery process. Knowledge Extraction from data is being automated!
    Downloads: 1 This Week
  • 20
    This is a Matlab software package for single molecule FRET data analysis.
    Downloads: 1 This Week
  • 21

    AdPreqFr4SL

    Adaptive Prequential Learning Framework

    The AdPreqFr4SL learning framework for Bayesian Network Classifiers is designed to handle the cost / performance trade-off and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we...
    Downloads: 0 This Week
  • 22
    Darwin 2: Java Framework for Evolutionary Computation (genetic algorithm, GA). A true framework with out-of-the-box functionality and extensibility of all classes. Interface-based pattern with dependency-injection to configure components.
    Downloads: 0 This Week
  • 23
    minder

    Monitoring your infrastructure for free.

    This software presents a flexible and configurable proposal for monitoring and managing real and virtual HPC infrastructures, compatible with the cloud computing paradigm. We help you answer: 1) What is the performance of my resources? 2) What equipment and resources do we have already? 3) What do we need to upgrade or repair? 4) What can we consolidate to reduce complexity or reduce energy use? 5) What resources would be better reused somewhere else? Status: PreAlpha, so...
    Downloads: 0 This Week
  • 24
    pyIRDG

    IMDb Relational Dataset Generator

    pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as backend. A graphical user interface written in pyQt allows the user to link multiple entities together as model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file. Three types of constraints on attributes are available to limit...
    Downloads: 0 This Week
  • 25
    JBoost is a simple, robust system for classification. JBoost contains implementations of several boosting algorithms in an alternating decision tree framework. In addition, JBoost provides extensible software for adding more learning algorithms.
    Downloads: 1 This Week