Search Results for "data" - Page 9

Showing 228 open source projects for "data"

Active filters: Machine Learning, Python

1

Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

Combining improvements in deep reinforcement learning. Results and pretrained models can be found in the releases. Data-efficient Rainbow can be run using several options (note that the "unbounded" memory is implemented here in practice by manually setting the memory capacity to be the same as the maximum number of timesteps).

Downloads: 0 This Week

Last Update: 2022-08-17
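
The note about "unbounded" memory in the Rainbow description above can be illustrated with a minimal sketch. This is not the project's code: it assumes a plain FIFO buffer, and the names `make_replay_memory`, `fill`, and `T_MAX` are hypothetical. It only shows why a capacity equal to the maximum number of timesteps behaves as if the memory had no bound.

```python
from collections import deque


def make_replay_memory(capacity):
    """Bounded FIFO buffer: once full, the oldest entry is dropped on append."""
    return deque(maxlen=capacity)


def fill(memory, num_timesteps):
    """Store one placeholder transition per timestep and count evictions."""
    evicted = 0
    for t in range(num_timesteps):
        if len(memory) == memory.maxlen:
            evicted += 1  # appending now will discard the oldest entry
        memory.append(t)
    return evicted


T_MAX = 1000  # hypothetical maximum number of timesteps

# Capacity equal to T_MAX: nothing is ever evicted, so the memory is
# effectively unbounded for the length of the run.
unbounded = make_replay_memory(capacity=T_MAX)
unbounded_evictions = fill(unbounded, T_MAX)

# A smaller capacity starts discarding old transitions once it fills up.
bounded = make_replay_memory(capacity=100)
bounded_evictions = fill(bounded, T_MAX)
```

With these assumed values, the first buffer retains every transition while the second keeps only the most recent 100.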
2

captcha_break

Identification codes

This project uses Keras to build a deep convolutional neural network that identifies captcha verification codes. Running it on a graphics card is recommended. The visualization code was all written in a Jupyter notebook. If you want to write a Python script instead, it will run normally with a little modification. Of course, you can also remove the visualization code. captcha is a library written in Python to generate verification codes. It supports image verification...

Downloads: 0 This Week

Last Update: 2022-08-08
3

CakeChat

CakeChat: Emotional Generative Dialog System

CakeChat is a backend for chatbots that are able to express emotions via conversations. The code is flexible and allows conditioning the model's responses on an arbitrary categorical variable. For example, you can train your own persona-based neural conversational model or create an emotional chatting machine. It uses a Hierarchical Recurrent Encoder-Decoder (HRED) architecture for handling deep dialog context and a multilayer RNN with GRU cells. The first layer of the utterance-level encoder is always...

Downloads: 0 This Week

Last Update: 2022-08-12
4

automl-gs

Provide an input CSV and a target field to predict, generate a model

Give an input CSV file and a target field you want to predict to automl-gs, and get a trained high-performing machine learning or deep learning model plus native Python code pipelines allowing you to integrate that model into any prediction workflow. No black box: you can see exactly how the data is processed, and how the model is constructed, and you can make tweaks as necessary. automl-gs is an AutoML tool which, unlike Microsoft's NNI, Uber's Ludwig, and TPOT, offers a zero code/model definition interface to getting an optimized model and data transformation pipeline in multiple popular ML/DL frameworks, with minimal Python dependencies (pandas + scikit-learn + your framework of choice). automl-gs is designed for citizen data scientists and engineers without a deep statistical background under the philosophy that you don't need to know any modern data preprocessing and machine learning engineering techniques to create a powerful prediction workflow.

Downloads: 1 This Week

Last Update: 2024-08-13
5

Tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed

...Uses TensorFlow in an efficient way with no extra overhead. On common CNNs, it runs training 1.2~5x faster than the equivalent Keras code. Your training can probably get faster if written with Tensorpack. A scalable data-parallel multi-GPU / distributed training strategy is available off the shelf. Squeeze the best data loading performance out of Python with tensorpack.dataflow. Symbolic programming (e.g. tf.data) does not offer the data processing flexibility needed in research. Tensorpack squeezes the most performance out of pure Python with various auto parallelization strategies. ...

Downloads: 0 This Week

Last Update: 2022-08-01
6

TensorImage

Image classification library for easily training and deploying models

(Visit our GitHub repository at https://github.com/TensorImage/tensorimage for more information) TensorImage is an open source package for image classification. It has a wide range of data augmentation operations that can be performed over training data to prevent overfitting and increase testing accuracy. TensorImage is easy to use and manage because all files, trained models and data are organized within a workspace directory, which you can change at any time in the configuration file, so you can keep any number of workspace directories for different purposes. ...

Downloads: 0 This Week

Last Update: 2019-01-13
7

Skater

Python library for model interpretation/explanations

Skater is a unified framework to enable model interpretation for all forms of models, to help one build the interpretable machine learning systems often needed for real-world use cases (we are actively working towards enabling faithful interpretability for all forms of models). It is an open-source Python library designed to demystify the learned structures of a black box model both globally (inference on the basis of a complete data set) and locally (inference about an individual prediction). The concept of model interpretability in the field of machine learning is still new, largely subjective, and, at times, controversial. Model interpretation is the ability to explain and validate the decisions of a predictive model to enable fairness, accountability, and transparency in algorithmic decision-making. ...

Downloads: 0 This Week

Last Update: 2022-08-22
8

Scikit-plot

An intuitive library to add plotting functionality to scikit-learn

Single line functions for detailed visualizations. Scikit-plot is the result of an unartistic data scientist's dreadful realization that visualization is one of the most crucial components in the data science process, not just a mere afterthought. Gaining insights is simply a lot easier when you're looking at a colored heatmap of a confusion matrix complete with class labels rather than a single-line dump of numbers enclosed in brackets.

Downloads: 1 This Week

Last Update: 2024-08-09
9

anaGo

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition

anaGo is a Python library for sequence labeling (NER, PoS tagging, ...), implemented in Keras. anaGo can solve sequence labeling tasks such as named entity recognition (NER), part-of-speech tagging (POS tagging), semantic role labeling (SRL) and so on. Unlike traditional sequence labeling solvers, anaGo does not require defining any language-dependent features. Thus, we can easily use anaGo for any language. In anaGo, the simplest type of model is the Sequence model. Sequence model includes...

Downloads: 0 This Week

Last Update: 2022-08-15
10

AerinSistemas-Noname

Elasticsearch to Pandas dataframe or CSV

API and command line utility, written in Python, for querying Elasticsearch and exporting the resulting documents to a CSV file. The search can be done using logical operators or ranges, in combination or alone. The output can be limited to the desired attributes. ToT can also load the query results into a Pandas DataFrame and/or save them in an HDF5 container (under development).

Downloads: 0 This Week

Last Update: 2018-08-07
11

DIGITS

Deep Learning GPU training system

The NVIDIA Deep Learning GPU Training System (DIGITS) puts the power of deep learning into the hands of engineers and data scientists. DIGITS can be used to rapidly train highly accurate deep neural networks (DNNs) for image classification, segmentation and object detection tasks. DIGITS simplifies common deep learning tasks such as managing data, designing and training neural networks on multi-GPU systems, monitoring performance in real-time with advanced visualizations, and selecting the best performing model from the results browser for deployment. ...

Downloads: 0 This Week

Last Update: 2022-01-31
12

Tangent

Source-to-source debuggable derivatives in pure Python

Existing libraries implement automatic differentiation by tracing a program's execution (at runtime, like PyTorch) or by staging out a dynamic data-flow graph and then differentiating the graph (ahead-of-time, like TensorFlow). In contrast, Tangent performs ahead-of-time autodiff on the Python source code itself, and produces Python source code as its output. Tangent fills a unique location in the space of machine learning tools. As a result, you can finally read your automatic derivative code just like the rest of your program. ...

Downloads: 0 This Week

Last Update: 2022-08-09
13

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative (DOCX). Open source today is not just source code, but also datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem. We are iBooker, a large open-source community,...

Downloads: 0 This Week

Last Update: 2022-02-18
14

auto_ml

Automated machine learning for analytics & production

...Here's an example that includes serializing and loading the trained model, then getting predictions on single dictionaries, roughly the process you'd likely follow to deploy the trained model. Before you go any further, try running the code. Load up some data (either a DataFrame, or a list of dictionaries, where each dictionary is a row of data). Make a column_descriptions dictionary that tells us which attribute name in each row represents the value we’re trying to predict. Pass all that into auto_ml, and see what happens! You can pass in your own function to perform feature engineering on the data.

Downloads: 0 This Week

Last Update: 2022-08-12
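
The data layout described in the auto_ml entry above (a list of dictionaries plus a column_descriptions dictionary naming the value to predict) can be sketched in plain Python. This sketch does not call auto_ml itself: the rows, the `"price"` field, the `"output"` marker value, and the helper `split_features_and_target` are all assumptions made for illustration.

```python
# Hypothetical data: each dictionary is one row.
rows = [
    {"rooms": 6, "age": 65.2, "price": 24.0},
    {"rooms": 7, "age": 78.9, "price": 21.6},
    {"rooms": 5, "age": 45.8, "price": 18.9},
]

# Marks which attribute in each row is the value to predict.
# The "output" marker is an assumption, not a confirmed API value.
column_descriptions = {"price": "output"}


def split_features_and_target(rows, column_descriptions):
    """Separate each row into input features and the target value."""
    targets = [name for name, role in column_descriptions.items()
               if role == "output"]
    if len(targets) != 1:
        raise ValueError("exactly one column must be marked as the output")
    target = targets[0]
    features = [{k: v for k, v in row.items() if k != target} for row in rows]
    labels = [row[target] for row in rows]
    return features, labels


features, labels = split_features_and_target(rows, column_descriptions)
```

Keeping the target declaration in a separate dictionary means the same rows can be reused to predict a different attribute by changing one entry.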
15

DSTK - DataScience ToolKit

DSTK - DataScience ToolKit for All of Us

...Of course you may specify JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy.

Downloads: 5 This Week

Last Update: 2018-05-08
16

PyDaMelo

Python-compatible Data mining elementary objects

An attempt at offering machine learning and data mining algorithms at the finest grain we can, as Lego-like bricks that are easy to combine through Python scripting.

Downloads: 0 This Week

Last Update: 2019-02-19
17

ExSTraCS

Extended Supervised Tracking and Classifying System

This advanced machine learning algorithm is a Michigan-style learning classifier system (LCS) developed to specialize in classification, prediction, data mining, and knowledge discovery tasks. Michigan-style LCS algorithms constitute a unique class of algorithms that distribute learned patterns over a collaborative population of individually interpretable IF:THEN rules, allowing them to flexibly and effectively describe complex and diverse problem spaces. ExSTraCS was primarily developed to address problems in epidemiological data mining to identify complex patterns relating predictive attributes in noisy datasets to disease phenotypes of interest. ...

1 Review

Downloads: 0 This Week

Last Update: 2015-11-04
18

FineSplice

Enhanced splice junction detection and estimation from RNA-Seq data

FineSplice is a Python wrapper to TopHat2 geared towards a reliable identification of expressed exon junctions from RNA-Seq data, at enhanced detection precision with small loss in sensitivity. Following alignment with TopHat2 using known transcript annotations, FineSplice takes as input the resulting BAM file and outputs a confident set of expressed splice junctions with the corresponding read counts. Potential false positives arising from spurious alignments are filtered out via a semi-supervised anomaly detection strategy based on logistic regression. ...

Downloads: 0 This Week

Last Update: 2014-04-01
19

ProximityForest

Efficient Approximate Nearest Neighbors for General Metric Spaces

A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A.

Downloads: 0 This Week

Last Update: 2015-03-26
20

MIR

A High-Order Multi-Variate Approximation Scheme for Arbitrary Data Sets, C implementation of the method described in http://web.mit.edu/qiqi/www/paper/interpolation.pdf, with Python and Fortran interfaces.

Downloads: 0 This Week

Last Update: 2016-07-23
21

pyIRDG

IMDb Relational Dataset Generator

pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as a backend. A graphical user interface written in PyQt allows the user to link multiple entities together as a model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file.

Downloads: 0 This Week

Last Update: 2014-03-09
22

BCI Project Triathlon

A three-step approach towards experimental brain-computer-interfaces, based on the OCZ nia device for EEG-data acquisition and artificial neural networks for signal-interpretation.

1 Review

Downloads: 0 This Week

Last Update: 2016-09-08
23

HypSurGent

This program generates customizable hyper-surfaces (multi-dimensional input and output) and samples data from them for use as a benchmark for response surface modeling tasks or optimization algorithms.

Downloads: 0 This Week

Last Update: 2016-09-05
24

Kickapoo

Spider that collects data from the MySpace social network. Currently, it is only designed to extract information about Native American people, because it is used for a social science study at UNAM (Universidad Nacional Autónoma de México).

Downloads: 0 This Week

Last Update: 2013-04-19
25

PyEEG, EEG Feature Extraction in Python

A Python function library to extract EEG features from EEG time series in standard Python and NumPy data structures. Features include classical spectral analysis, entropies, fractal dimensions, DFA, inter-channel synchrony and order, etc.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-16