Showing 652 open source projects for "data"

View related business solutions
  • Go from Data Warehouse to Data and AI platform with BigQuery Icon
    Go from Data Warehouse to Data and AI platform with BigQuery

    Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.

    BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
    Try BigQuery Free
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • 1
    Febrl (Freely Extensible Biomedical Record Linkage) does data standardisation (segmentation and cleaning) and probabilistic record linkage ("fuzzy" matching) of one or more files or data sources which do not share a unique record key or identifier.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    PyBact is an open source software written in Python for Bacterial Identification. The software generates simulated data matrix which accurately represents the probabilistic positive/negative results of the tested biochemical test.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    ftc is a python script for content-based file type classification based on an file extension and magic number database, and several computational intelligence algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    BCI Project Triathlon
    A three-step approach towards experimental brain-computer-interfaces, based on the OCZ nia device for EEG-data acquisition and artificial neural networks for signal-interpretation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    This program generates customizable hyper-surfaces (multi-dimensional input and output) and samples data from them to be used further as benchmark for response surface modeling tasks or optimization algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Spider that recollects data from MySpace Social Network. At now, it is only designed to extract information from native american people because it is used for a social science study in the UNAM (Universidad Nacional Autónoma de México).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    A Python function library to extract EEG feature from EEG time series in standard Python and numpy data structure. Features include classical spectral analysis, entropies, fractal dimensions, DFA, inter-channel synchrony and order, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Sofa is a CUDA-based reasoner
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system
    Downloads: 2 This Week
    Last Update:
    See Project
  • Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud Icon
    Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud

    Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.

    Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
    Try Cloud SQL Free
  • 10
    PyPlayground is an environment for developing algorithms involving movement in a space of up to three dimensions using Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    This project simulates a multi-agent system (swarm) behavior both graphically and not. The purpose of this project is to research the properties suggested in "stability analysis of swarms" V.Gazi & K.M.Passino. Using the vpython library for 3D modeling
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    An intelligent, web-crawling agent that builds invariant representations of data collected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SenseRank Sys: - builds the dictionaries (multidim matrices) of words’ values; - for the set utterance in certain language builds a figure in multidimensional space (in the matrix space) of values (visual schema), which is topological view of sense
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Software tool that emphasizes useful information, communication, and intent. Accomplish any conceivable computer task by transforming fuzzy abstract data-structures in a fractal vector graphics user interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Cathnet is developing the infrastructure for the Catholic Semantic Web. Technologies involved include, but are not limited to, XML, RDF, NLP, Zope, Plone and Plone products.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ...The code can be processed over multiple common machines with fault-tolerance. It is designed to distributively run any Python's script, which can be applied to a given input data set.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Robotic Manipulator Development and Simulation Environment in Python and Blender. IMPORTANT: Development moved to github. http://github.com/ajnsit/r2d3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    VFML -- Very Fast Machine Learning toolkit. A collection of tools, learners, and APIs for working with high-speed data streams and very large data sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    If you are looking for a way to get WordNetŽ 's data into your relational DBMS, wordnet2sql is what you need!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    pySPACE

    pySPACE

    Signal Processing and Classification Environment in Python using YAML

    pySPACE is a modular software for processing of large data streams that has been specifically designed to enable distributed execution and empirical evaluation of signal processing chains. Various signal processing algorithms (so called nodes) are available within the software, from finite impulse response filters over data-dependent spatial filters (e.g. CSP, xDAWN) to established classifiers (e.g.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    pyBoids is a free/open-source project that implements (in Python/TKinter) Craig Reynold's famous boids algorithm. This algorithm intelligently simulates flocking, herding, swarming, and schooling behavior as found in nature.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Cellicone is a project to develop an artificial life organism with the necessary components to make it comparable to biological life as we know it. This includes components ranging from proteins to cells to organs to limbs, and many steps between.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    nBoost is a suite of boosting algorithms designed to solve binary classification problems on data that is not linearly separable by a convex combination of base hypotheses, i.e. noisy data. WARNING: Active development. Underlying algorithm is unstable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    litersta

    litersta

    Litersta - textual analytics - software

    ...This strategy keeps your data confidential and secure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    text_summurization_abstractive_methods

    Multiple implementations for abstractive text summurization

    This repo is built to collect multiple implementations for abstractive approaches to address text summarization it is built to simply run on google colab , in one notebook so you would only need an internet connection to run these examples without the need to have a powerful machine , so all the code examples would be in a jupyter format , and you don't have to download data to your device as we connect these jupyter notebooks to google drive
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB