Showing 48 open source projects for "data processing"

  • 1
    The Grand Complete Data Science Guide

    Data Science Guide With Videos And Materials

    The Grand Complete Data Science Materials is a repository curated by a data-science educator that aggregates a wide range of learning resources — from basic programming and math foundation to advanced topics in machine learning, deep learning, natural language processing, computer vision, and deployment practices — into a structured, centralized collection aimed at learners seeking a comprehensive path to data science mastery.
    Downloads: 0 This Week
  • 2
    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...Ranked list of awesome python libraries for web development. Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 1 This Week
  • 3
    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 6 This Week
  • 4
    biblatex
    Biblatex is a LaTeX package which provides full-featured bibliographic facilities
    Downloads: 68 This Week
  • 5
    xrayutilities

    a package with useful scripts for X-ray diffraction physicists

    xrayutilities is a Python package used to analyze X-ray diffraction data. It can assist in performing diffraction experiments and handles common steps in the data analysis. It can read experimental data from several data formats (spec, edf, xrdml, ...) and convert them to reciprocal space for arbitrary goniometer geometries and different detector systems (point, linear, and area detectors); for further processing, the data can be gridded (transformed to a regular grid). ...
    Downloads: 28 This Week
  • 6

    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
  • 7
    Advanced Trigonometry Calculator

    Precision Trigonometry: Advanced Calculator for Complex Math

    Advanced Trigonometry Calculator is equipped with a user-friendly interface that allows for easy input of problems and instant computation. Professionals such as engineers who need to perform advanced trigonometric calculations in their work will find this tool extremely useful. ATC Online Alpha: https://advantrigoncalc.sourceforge.io/atc/ More info by clicking below: https://advantrigoncalc.sourceforge.io/ Advanced Trigonometry Calculator was only and always only developed by...
    Downloads: 11 This Week
  • 8
    data-science-on-gcp

    Source code accompanying book: Data Science on the GCP

    The data-science-on-gcp repository is a comprehensive collection of code examples and end-to-end workflows that accompany the book Data Science on the Google Cloud Platform, designed to teach developers how to build scalable data science and machine learning systems using Google Cloud services. It provides structured, chapter-aligned implementations that guide users through the full lifecycle of a data science project, including data ingestion, storage, processing, analysis, model training, and deployment. ...
    Downloads: 0 This Week
  • 9
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 10
    Web-based cataloging and deduplication application, highly optimized for processing journal articles. It reads MarcXML and deduplicates records using field 773 combined with a fuzzy search on the title. Written for bibnet.org.
    Downloads: 0 This Week
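
The deduplication approach described for this project (an exact match on field 773 combined with a fuzzy match on the title) might look roughly like the sketch below. This is not the project's code: the function names, the 0.9 threshold, the use of field 245 as the title, and the use of Python's difflib for fuzzy matching are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from difflib import SequenceMatcher

# Standard MARCXML namespace.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}


def field_text(record, tag):
    """Join the subfield values of the first datafield with the given tag."""
    field = record.find(f"marc:datafield[@tag='{tag}']", MARC_NS)
    if field is None:
        return ""
    subfields = field.findall("marc:subfield", MARC_NS)
    return " ".join((sf.text or "").strip() for sf in subfields)


def normalize(text):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def is_duplicate(a, b, threshold=0.9):
    """Hypothetical rule: same host item (773) and a similar title (245)."""
    host_a = normalize(field_text(a, "773"))
    host_b = normalize(field_text(b, "773"))
    if not host_a or host_a != host_b:
        return False
    title_a = normalize(field_text(a, "245"))
    title_b = normalize(field_text(b, "245"))
    return SequenceMatcher(None, title_a, title_b).ratio() >= threshold


def dedupe(records, threshold=0.9):
    """Keep the first record of every group of duplicates."""
    kept = []
    for record in records:
        if not any(is_duplicate(record, k, threshold) for k in kept):
            kept.append(record)
    return kept
```

Comparing every record against every kept record is quadratic; for a large catalog one would probably group records by their 773 value first and run the fuzzy title comparison only within each group.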
  • 11
    Python Tutorials

    Machine Learning Tutorials

    ...Created by an experienced instructor and educator, the repository covers a wide range of programming basics and advanced topics. This includes foundational Python concepts, data processing with libraries like NumPy and pandas, threading and multiprocessing for concurrency, and practical use of libraries such as Matplotlib for data visualization. It also provides tutorials on machine learning frameworks and concepts, including TensorFlow, PyTorch, Keras, Scikit-Learn, and reinforcement learning techniques. ...
    Downloads: 0 This Week
  • 12
    Quantitative-Notebooks

    Educational notebooks on quantitative finance, algorithmic trading

    ...Because quantitative analysis often requires visualization, statistics, and time series processing, these notebooks also serve as templates for real financial research and strategy prototyping. Users can adapt the examples to their own data sources, financial instruments, and modeling techniques.
    Downloads: 0 This Week
  • 13
    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
  • 14
    PRMLT

    Matlab code of machine learning algorithms in book PRML

    ...This package requires Matlab R2016b or later, since it uses a Matlab syntax called implicit expansion (a.k.a. broadcasting). It also requires the Statistics Toolbox (for some simple random number generators) and the Image Processing Toolbox (for reading image data). The code is extremely compact; minimizing code length is a major goal, so the core of each algorithm can be easily spotted. Many tricks for speeding up Matlab code are applied (e.g. vectorization, matrix factorization, etc.). Usually, functions in this package are orders of magnitude faster than Matlab builtin ones (e.g. kmeans). ...
    Downloads: 0 This Week
  • 15
    lottie vectors

    Create, display and process 2D vectors in a 3D window.

    Lottie Vectors is an application for Matlab that allows you to do some pretty neat things with vectors: more exactly, displaying them in ways that will hopefully allow you to explore and better understand your vector data. The basic idea is simple: take a vector defined in one of a few different data formats and map it on the screen. Add another vector and you start to form a 'route'. Each route or position vector can be accompanied by a 'force' vector. This can be used to...
    Downloads: 0 This Week
  • 16
    TESTIMAGES

    Testing images for scientific purposes

    ...Please cite the following papers when using any image in this archive: * ASUNI N, GIACHETTI A, "TESTIMAGES: A Large Data Archive For Display and Algorithm Testing", Journal of Graphics Tools, Volume 17, Issue 4, 2015, pages 113-125, DOI:10.1080/2165347X.2015.1024298 * ASUNI N, GIACHETTI A, "TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms", STAG - Smart Tools & Apps for Graphics Conference, 2014.
    Downloads: 111 This Week
  • 17
    RNAseq Tutorial

    Informatics for RNA-seq: A web resource for analysis on the cloud

    ...The version in that repo is deprecated, but still maintains content for those wishing to follow the original published workflow. Includes instruction on cloud computing basics and using cloud environments for large data processing.
    Downloads: 0 This Week
  • 18
    Modular application for biomedical data processing.
    Downloads: 0 This Week
  • 19
    Open Cezeri Library

    Effective Linear Algebra and Computer Vision Library with JAVA

    OCL stands for Open Cezeri Library (yet another linear algebra and matrix library). The library aims for rapid coding with Matlab-like ease of use. To learn the library, try the test examples at OpenCezeriLibrary\test\test. It was originally developed at the el-cezeri laboratory of Siirt University in order to establish a generic framework of reusable components and software tools for machine vision, machine learning, AI and robotic applications. Currently, it holds the following main concepts 1-...
    Downloads: 0 This Week
  • 20

    MARC/Perl

    Perl libraries for processing MARC records

    MARC/Perl (formerly known as MARC.pm) is a project to develop Perl libraries to process MARC (MAchine Readable Cataloging) data.
    Downloads: 0 This Week
  • 21

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete, and edit text nodes, modify their container boxes with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files but cannot take advantage of auto-save and spell checking. Note that current SVN...
    Downloads: 0 This Week
  • 22
    Yes, finally. The author has decided to set aside some of his busy time to release the "Chinese Language Formula", step by step. It will take a while, but eventually Chinese Language Processing will no longer be a problem.
    Downloads: 6 This Week
  • 23
    MatrixUser

    A Multi-functional GUI-based Program for Image Processing and Analysis

    ...CT, MRI, PET, etc.) comprises multiple frames which represent slices, phases, timing etc. of the same imaged object. Those images can be saved as multidimensional matrices in Matlab thanks to Matlab's powerful support for multidimensional data representation. However, within Matlab, most image manipulation functions are limited to or tailored for processing two-dimensional matrices. MatrixUser is a software package featuring functions designed and optimized specifically for manipulating multidimensional real or complex data matrices. It provides a graphical environment for easily performing image analysis tasks, including multidimensional image display, matrix (image stack) processing, and rendering. ...
    Downloads: 0 This Week
  • 24
    ExData Plotting1

    Plotting Assignment 1 for Exploratory Data Analysis

    ...The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and detailed sub-metering values for different household areas. For analysis, focus is placed on a two-day period in February 2007, highlighting short-term consumption trends. The data requires careful handling due to its size of more than 2 million rows and coded missing values. By processing the date and time fields into proper formats, it becomes possible to generate clear time-series plots of energy usage. The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.
    Downloads: 0 This Week
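
The preprocessing described above (combining the date and time fields into one timestamp, mapping coded missing values to a null, and keeping only a short date window) can be sketched as follows. The repository itself works in R; this Python version is only an illustration. The column names, the ";" separator, the "?" missing-value code, the day/month/year date format, and the function name `load_readings` are all assumptions that the listing does not confirm.

```python
import csv
import io
from datetime import date, datetime

MISSING = "?"  # assumed code for a missing measurement


def load_readings(handle, start, end):
    """Return (timestamp, value) pairs for rows dated between start and end.

    Assumes a ';'-separated file with 'Date', 'Time' and
    'Global_active_power' columns; these names are hypothetical.
    """
    reader = csv.DictReader(handle, delimiter=";")
    readings = []
    for row in reader:
        # Merge the separate date and time fields into one datetime.
        stamp = datetime.strptime(
            f"{row['Date']} {row['Time']}", "%d/%m/%Y %H:%M:%S"
        )
        if not (start <= stamp.date() <= end):
            continue
        raw = row["Global_active_power"]
        # Coded missing values become None instead of raising on float().
        value = None if raw == MISSING else float(raw)
        readings.append((stamp, value))
    return readings
```

Because rows are filtered while streaming through the file, only the selected window is held in memory, which matters for a file with more than 2 million rows.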
  • 25
    Data Processing Model

    Qt5 application for data linking and filtering

    This application lets the user create multiple documents with records and link those records across documents. Once in filter mode, one can filter data on the links created to find required data quickly. Install instructions: (1) sudo apt-add-repository ppa:canonical-qt5-edgers/qt5-proper; (2) sudo apt-get update; (3) install the deb package.
    Downloads: 0 This Week