Join/Login
Open Source Software
Business Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Open Source Software

Business Software

SourceForge Podcast

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Browse Open Source
Search Results

Search Results for "data science" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 3,709
Windows 2,950
Mac 2,617
More...
BSD 2,381
ChromeOS 1,591
Desktop Operating Systems 244
Mobile Operating Systems 40
Server Operating Systems 40
Embedded Operating Systems 7
Game Consoles 2

Category

Scientific/Engineering 2,408
Business 1,487
Software Development 606
Multimedia 468
Formats and Protocols 448
Education 285
Artificial Intelligence 283
System 254
Database 229
Internet 197
Communications 107
Games 71
Social sciences 53
Security 46
Text Editors 43
Desktop Environment 18
Printing 9
Terminals 4
Blockchain 3
Mobile 2
Religion and Philosophy 2
Productivity 1

License

OSI-Approved Open Source 3,331
Other License 78
Creative Commons Attribution License 77
Public Domain 65
More...
GNU Free Documentation License 5

Translations

Programming Language

Status

Production/Stable 1,223
Beta 1,115
Alpha 582
Pre-Alpha 334
More...
Planning 219
Mature 118
Inactive 59

Showing 3709 open source projects for "data science"

View related business solutions

Linux Clear Filters & Widen Search

Our Free Plans just got better! | Auth0 by Okta
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.

Try free now
Bright Data - All in One Platform for Proxies and Web Scraping
Say goodbye to blocks, restrictions, and CAPTCHAs

Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.

Get Started
1

Great Expectations

Always know what to expect from your data

Great Expectations helps data teams eliminate pipeline debt, through data testing, documentation, and profiling. Software developers have long known that testing and documentation are essential for managing complex codebases. Great Expectations brings the same confidence, integrity, and acceleration to data science and data engineering teams. Expectations are assertions for data. They are the workhorse abstraction in Great Expectations, covering all kinds of common data issues. Expectations...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
2

cuDF

GPU DataFrame Library

... with conda (miniconda, or the full Anaconda distribution) from the rapidsai channel. cuDF is supported only on Linux, and with Python versions 3.7 and later. The RAPIDS suite of open-source software libraries aims to enable the execution of end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization but exposing that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

Downloads: 0 This Week

Last Update: 2024-10-09
See Project
3

compromise

Modest natural-language processing

Language is complicated and there's a gazillion words. Compromise is a javascript library that interprets and pre-parses text and makes some reasonable decisions so things are way easier. Compromise tries its best to parse text. it is small, quick, and often good-enough. It is not as smart as you'd think. Conjugate and negate verbs in any tense. Play between plural, singular and possessive forms. Interpret plain-text numbers. Handle implicit terms. Use it on the client-side or as an...

Downloads: 1 This Week

Last Update: 2024-10-10
See Project
4

Kedro

A Python framework for creating reproducible, maintainable code

Kedro is an open sourced Python framework for creating maintainable and modular data science code. Provides the scaffolding to build more complex data and machine-learning pipelines. In addition, there's a focus on spending less time on the tedious "plumbing" required to maintain data science code; this means that you have more time to solve new problems. Standardises team workflows; the modular structure of Kedro facilitates a higher level of collaboration when teams solve problems together...

Downloads: 0 This Week

Last Update: 2024-10-10
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

Orchest

Build data pipelines, the easy way

Code, run and monitor your data pipelines all from your browser! From idea to scheduled pipeline in hours, not days. Interactively build your data science pipelines in our visual pipeline editor. Versioned as a JSON file. Run scripts or Jupyter notebooks as steps in a pipeline. Python, R, Julia, JavaScript, and Bash are supported. Parameterize your pipelines and run them periodically on a cron schedule. Easily install language or system packages. Built on top of regular Docker container images...

Downloads: 0 This Week

Last Update: 2023-04-03
See Project
6

DocArray

The data structure for multimodal data

... science powerhouse: greatly accelerate data scientists’ work on embedding, k-NN matching, querying, visualizing, evaluating via Torch/TensorFlow/ONNX/PaddlePaddle on CPU/GPU. Data in transit: optimized for network communication, ready-to-wire at anytime with fast and compressed serialization in Protobuf, bytes, base64, JSON, CSV, DataFrame. Perfect for streaming and out-of-memory data. One-stop k-NN: Unified and consistent API for mainstream vector databases.

Downloads: 0 This Week

Last Update: 2023-12-22
See Project
7

NannyML

Detecting silent model failure. NannyML estimates performance

... science, empowering data scientist to quickly understand and automatically detect silent model failure. By using NannyML, data scientists can finally maintain complete visibility and trust in their deployed machine learning models. When the actual outcome of your deployed prediction models is delayed, or even when post-deployment target labels are completely absent, you can use NannyML's CBPE-algorithm to estimate model performance.

Downloads: 0 This Week

Last Update: 2024-09-06
See Project
8

SIT742

SIT742: Modern Data Science

SIT742: Modern Data Science.

Downloads: 0 This Week

Last Update: 2024-07-28
See Project
9

CARTOframes

CARTO Python package for data scientists

A Python package for integrating CARTO maps, analysis, and data services into data science workflows. Python data analysis workflows often rely on the de facto standards pandas and Jupyter notebooks. Integrating CARTO into this workflow saves data scientists time and energy by not having to export datasets as files or retain multiple copies of the data. Instead, CARTOframes give the ability to communicate reproducible analysis while providing the ability to gain from CARTO's services like...

Downloads: 0 This Week

Last Update: 2023-12-21
See Project
Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
10

NVIDIA Merlin

Library providing end-to-end GPU-accelerated recommender systems

NVIDIA Merlin is an open-source library that accelerates recommender systems on NVIDIA GPUs. The library enables data scientists, machine learning engineers, and researchers to build high-performing recommenders at scale. Merlin includes tools to address common feature engineering, training, and inference challenges. Each stage of the Merlin pipeline is optimized to support hundreds of terabytes of data, which is all accessible through easy-to-use APIs. For more information, see NVIDIA Merlin...

Downloads: 0 This Week

Last Update: 2024-06-14
See Project
11

PySyft

Data science on data without acquiring a copy

Most software libraries let you compute over the information you own and see inside of machines you control. However, this means that you cannot compute on information without first obtaining (at least partial) ownership of that information. It also means that you cannot compute using machines without first obtaining control over those machines. This is very limiting to human collaboration and systematically drives the centralization of data, because you cannot work with a bunch of data without...

Downloads: 0 This Week

Last Update: 2024-09-04
See Project
12

Recommenders

Best practices on recommendation systems

...-of-the-art algorithms are included for self-study and customization in your own applications. Please see the setup guide for more details on setting up your machine locally, on a data science virtual machine (DSVM) or on Azure Databricks. Independent or incubating algorithms and utilities are candidates for the contrib folder. This will house contributions which may not easily fit into the core repository or need time to refactor or mature the code and add necessary tests.

Downloads: 0 This Week

Last Update: 2024-05-01
See Project
13

PSLab Android App

PSLab Android App

Repository for the PSLab Android App for performing experiments with the Pocket Science Lab open-hardware platform. This repository holds the Android App for performing experiments with PSLab. PSLab is a tiny pocket science lab that provides an array of equipment for doing science and engineering experiments. It can function like an oscilloscope, waveform generator, frequency counter, programmable voltage and current source and also as a data logger. PSLab is a tiny pocket science lab...

Downloads: 0 This Week

Last Update: 2024-06-30
See Project
14

SageMaker Inference Toolkit

Serve machine learning models within a Docker container

Serve machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. Once you have a trained model, you can include it in a Docker container that runs your inference code. A container provides an effectively isolated environment, ensuring a consistent runtime regardless of where...

Downloads: 0 This Week

Last Update: 2023-10-25
See Project
15

SageMaker Training Toolkit

Train machine learning models within Docker containers

Train machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime...

Downloads: 0 This Week

Last Update: 2024-09-09
See Project
16

DearPyGui

Graphical User Interface Toolkit for Python with minimal dependencies

... and demanding graphical interfaces. DPG offers a solid framework for developing scientific, engineering, gaming, data science and other applications that require fast and interactive interfaces. The Tutorials will provide a great overview and links to each topic in the API Reference for more detailed reading. Complete theme and style control. GPU-based rendering and efficient C/C++ code.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
17

tsfresh

Automatic extraction of relevant features from time series

tsfresh is a python package. It automatically calculates a large number of time series characteristics, the so called features. tsfresh is used to to extract characteristics from time series. Without tsfresh, you would have to calculate all characteristics by hand. With tsfresh this process is automated and all your features can be calculated automatically. Further tsfresh is compatible with pythons pandas and scikit-learn APIs, two important packages for Data Science endeavours in python...

Downloads: 0 This Week

Last Update: 2024-08-03
See Project
18

Mlxtend

A library of extension and helper modules for Python's data analysis

Mlxtend (machine learning extensions) is a Python library of useful tools for day-to-day data science tasks.

Downloads: 0 This Week

Last Update: 2024-08-03
See Project
19

omegaml

MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle

omega|ml is the innovative Python-native MLOps platform that provides a scalable development and runtime environment for your Data Products. Works from laptop to cloud.

Downloads: 0 This Week

Last Update: 2024-07-25
See Project
20

Molly.jl

Molecular simulation in Julia

Much of science can be explained by the movement and interaction of molecules. Molecular dynamics (MD) is a computational technique used to explore these phenomena, from noble gases to biological macromolecules. Molly.jl is a pure Julia package for MD, and for the simulation of physical systems more broadly. The package is described in a talk at Enzyme Conference 2023 and an earlier talk at the JuliaMolSim minisymposium at JuliaCon 2022. Slides are also available for a tutorial in September...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
21

Bayesian Julia

Bayesian Statistics using Julia and Turing

Bayesian statistics is an approach to inferential statistics based on Bayes' theorem, where available knowledge about parameters in a statistical model is updated with the information in observed data. The background knowledge is expressed as a prior distribution and combined with observational data in the form of a likelihood function to determine the posterior distribution. The posterior can also be used for making predictions about future events. Bayesian statistics is a departure from...

Downloads: 0 This Week

Last Update: 2023-12-07
See Project
22

ClimateTools.jl

Climate science package for Julia

Climate analysis tools in Julia. ClimateTools.jl is a collection of commonly-used tools in Climate science. Basics of climate field analysis are covered, with some forays into exploratory techniques associated with climate scenario design. The package is aimed to ease the typical steps of analysis of climate models outputs and gridded datasets (support for weather stations is a work-in-progress). Climate indices and bias correction functions are coded to leverage the use of multiple threads...

Downloads: 0 This Week

Last Update: 2023-12-04
See Project
23

TPOT

A Python Automated Machine Learning tool that optimizes ML

Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Downloads: 0 This Week

Last Update: 2024-08-01
See Project
24

Karate Club

An API Oriented Open-source Python Framework for Unsupervised Learning

Karate Club is an unsupervised machine learning extension library for NetworkX. Karate Club consists of state-of-the-art methods to do unsupervised learning on graph-structured data. To put it simply it is a Swiss Army knife for small-scale graph mining research. First, it provides network embedding techniques at the node and graph level. Second, it includes a variety of overlapping and non-overlapping community detection methods. Implemented methods cover a wide range of network science...

Downloads: 0 This Week

Last Update: 2024-08-09
See Project
25

UnionML

Build and deploy machine learning microservices

... learning methods, implement endpoints for fetching data, training models, serving predictions (and much more) to write a complete ML stack in one place. Data science, ML engineering, and MLOps practitioners can all gather around UnionML apps as a way of defining a single source of truth about your ML system’s behavior. This helps you maintain consistent code across your ML stack, from training to prediction logic.

Downloads: 0 This Week

Last Update: 2023-05-25
See Project

Previous
1
You're on page 2
3
4
5
6
Next

Related Searches

android app source code

unstructured data

lotto prediction algorithm

artificial intelligence personal assistant python

lottery prediction algorithm

Related Categories

Scientific/Engineering

Software Development

Formats and Protocols

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
225 Broadway Suite 1600
San Diego, CA 92101
+1 (858) 454-5900

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2024 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: