Search Results for "data collection algorithm" - Page 7

Showing 1921 open source projects for "data collection algorithm"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    TIGRE

    TIGRE

    TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox

    TIGRE is an open-source toolbox for fast and accurate 3D tomographic reconstruction for any geometry. Its focus is on iterative algorithms for improved image quality that have all been optimized to run on GPUs (including multi-GPUs) for improved speed. It combines the higher-level abstraction of MATLAB or Python with the performance of CUDA at a lower level in order to make it both fast and easy to use. TIGRE is free to download and distribute: use it, modify it, add to it, and share it. Our...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    ScientificTypes.jl

    ScientificTypes.jl

    An API for dispatching on the "scientific" type of data

    This package makes a distinction between machine type and scientific type of a Julia object. The machine type refers to the Julia type being used to represent the object (for instance, Float64). The scientific type is one of the types defined in ScientificTypesBase.jl reflecting how the object should be interpreted (for instance, Continuous or Multiclass).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CTGAN

    CTGAN

    Conditional GAN for generating synthetic tabular data

    CTGAN is a collection of Deep Learning based synthetic data generators for single table data, which are able to learn from real data and generate synthetic data with high fidelity. If you're just getting started with synthetic data, we recommend installing the SDV library which provides user-friendly APIs for accessing CTGAN. The SDV library provides wrappers for preprocessing your data as well as additional usability features like constraints. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Random Cut Forest by AWS

    Random Cut Forest by AWS

    An implementation of the Random Cut Forest data structure

    This repository contains implementations of the Random Cut Forest (RCF) probabilistic data structure. RCFs were originally developed at Amazon to use in a nonparametric anomaly detection algorithm for streaming data. Later new algorithms based on RCFs were developed for density estimation, imputation, and forecasting. The different directories correspond to equivalent implementations in different languages, and bindings to to those base implementations, using language-specific features for greater flexibility of use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    nkn

    nkn

    Official Go implementation of NKN full node

    NKN (New Kind of Network) is a blockchain-based network infrastructure that utilizes a novel consensus mechanism to facilitate decentralized data transmission. By incentivizing network participants with tokens, NKN aims to create a more open, efficient, and secure internet. Its architecture supports a wide range of applications, from messaging to content delivery.​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    My Python Eggs

    My Python Eggs

    Python Examples

    My Python Eggs, commonly associated with the geekcomputers Python repository, is a large collection of practical Python scripts and small programs created primarily for experimentation, automation, and educational purposes. Rather than being a single cohesive application, it functions as a repository of utilities that demonstrate how Python can be used to solve everyday problems and automate repetitive tasks. The scripts cover a wide range of topics, including file management, networking,...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    PHP Map

    PHP Map

    PHP arrays and collections made easy

    Aimeos/Map is a lightweight PHP collection utility that makes working with arrays and array-like data much easier and more expressive by offering a fluent, object-oriented interface inspired by jQuery and Laravel collections. It wraps standard PHP arrays in a “Map” object, letting developers chain powerful methods for creating, transforming, filtering, sorting, and aggregating data without verbose boilerplate code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    tsfresh

    tsfresh

    Automatic extraction of relevant features from time series

    tsfresh is a python package. It automatically calculates a large number of time series characteristics, the so called features. tsfresh is used to to extract characteristics from time series. Without tsfresh, you would have to calculate all characteristics by hand. With tsfresh this process is automated and all your features can be calculated automatically. Further tsfresh is compatible with pythons pandas and scikit-learn APIs, two important packages for Data Science endeavours in python....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    JSON Forms

    JSON Forms

    Customizable JSON Schema-based forms with React, Angular and Vue

    JSON Forms is a declarative framework for efficiently building form-based web UIs. These UIs are targeted at entering, modifying and viewing data and are usually embedded within an application. Writing HTML templates and Javascript for data binding by hand is hard, especially in applications of reasonable size. Furthermore, a form is often more than just a collection of input fields and more advanced functionality is required, e.g. validation or conditional visibility. JSON Forms utilizes the capabilities of JSON and JSON schema and provides a simple and declarative way of describing forms. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Prodigy Hack

    Prodigy Hack

    One of the best Prodigy hacks

    ...It is typically implemented as a collection of scripts that can be executed in real time while the game is running, enabling immediate changes without modifying server infrastructure. The tool is often used for experimentation, reverse engineering, or demonstrating vulnerabilities in browser-based applications. It also reflects broader concepts in web security, particularly the importance of server-side validation and secure state management.
    Downloads: 111 This Week
    Last Update:
    See Project
  • 11
    OpenNMS

    OpenNMS

    Enterprise-Grade Open-Source Network Management Platform

    OpenNMS is an open-source network monitoring platform that helps you visualize and monitor everything on your local and distributed networks. OpenNMS offers comprehensive fault, performance, and traffic monitoring with alarm generation in one place. Highly customizable and scalable, OpenNMS integrates with your core business applications and workflows.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    xhs-spider

    xhs-spider

    Desktop tool for collecting and exporting Xiaohongshu post data

    XHS-Spider is a desktop data collection tool designed to gather content and metadata from the Xiaohongshu platform. It provides a graphical interface that allows users to explore posts, collect information, and download media such as images and videos from individual notes or search results. It was developed primarily as a learning project to demonstrate approaches to building web crawlers and experimenting with technologies such as WebView2 and WPF UI.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    DynamicHMC

    DynamicHMC

    Implementation of robust dynamic Hamiltonian Monte Carlo methods

    Implementation of robust dynamic Hamiltonian Monte Carlo methods in Julia. In contrast to frameworks that utilize a directed acyclic graph to build a posterior for a Bayesian model from small components, this package requires that you code a log-density function of the posterior in Julia. Derivatives can be provided manually, or using automatic differentiation. Consequently, this package requires that the user is comfortable with the basics of the theory of Bayesian inference, to the extent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Claude Scientific Skills

    Claude Scientific Skills

    A set of ready to use Agent Skills for research, science, engineering

    Claude Scientific Skills is a large open source collection of ready-to-use scientific capabilities that extend AI coding agents into full research assistants. The project provides more than 170 curated skills covering domains such as genomics, drug discovery, medical imaging, physics, and advanced data analysis. Each skill bundles documentation, examples, and tool integrations so agents can reliably execute complex multi-step scientific workflows.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 15
    Jitsu

    Jitsu

    Jitsu is an open-source Segment alternative

    Jitsu is a fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days. Installing Jitsu is a matter of selecting your framework and adding few lines of code to your app. Jitsu is built to be framework agnostic, so regardless of your stack, we have a solution that'll work for your team. Connect data warehouse (Snowflake, Clickhouse, BigQuery, S3, Redshift ot Postgres) and query your data instantly. Jitsu can either stream data in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Video-subtitle-remover (VSR)

    Video-subtitle-remover (VSR)

    AI tool that removes hardcoded subtitles and text from videos locally

    ...In addition to video processing, the project supports removing text-like watermarks from images through similar techniques. The processing runs locally without requiring any external API services, enabling offline use and greater control over the data being processed.
    Downloads: 96 This Week
    Last Update:
    See Project
  • 17
    vitae

    vitae

    R Markdown Résumés and CVs

    vitae is an R package that streamlines resume and CV creation via R Markdown. It includes a collection of LaTeX and HTML templates along with helper functions to dynamically populate content from data sources such as ORCID or spreadsheets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    VikingDB MCP Server

    VikingDB MCP Server

    A mcp server for vikingdb store and search

    An MCP server that interfaces with VikingDB, a high-performance vector database developed by ByteDance, enabling efficient vector storage and search capabilities. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Apache Drill

    Apache Drill

    Apache Drill is a distributed MPP query layer for self describing data

    ...Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files. A single query can join data from multiple datastores. For example, you can join a user profile collection in MongoDB with a directory of event logs in Hadoop.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    syslog-ng

    syslog-ng

    Log management solution that improves the performance of SIEM

    ...Instead of deploying multiple agents on hosts, organizations can unify their log data collection and management. syslog-ng Store Box provides automated archiving, tamper-proof encrypted storage, granular access controls to protect log data. The largest appliance can store up to 10TB of raw logs.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 21
    Machine Learning Tutorials Repository

    Machine Learning Tutorials Repository

    Dive deep into the realms of Machine Learning and other topics

    The Machine Learning Tutorials Repository is a comprehensive collection of resources, examples, and implementations designed to help users understand and apply machine learning concepts. It covers a wide range of topics, including supervised learning, unsupervised learning, neural networks, and data preprocessing techniques. The project is structured to provide both theoretical explanations and practical code examples, making it suitable for learners at different levels. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    Weibo Crawler

    Weibo Crawler

    Python crawler for collecting and downloading Sina Weibo user data

    weibo-crawler is a Python-based data collection tool designed to retrieve information from Sina Weibo user accounts. It automates the process of gathering posts, user profile details, and engagement metrics from one or more target accounts. weibo-crawler can extract comprehensive information about users, including profile attributes such as nickname, follower count, following count, and account metadata.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    DALI

    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Custom Erase Algorithm by Mariano Ortu

    Custom Erase Algorithm by Mariano Ortu

    Secure file deletion algorithm with customizable overwrite patterns.

    This project provides a secure file deletion algorithm designed to permanently erase data by overwriting files with customizable patterns. It ensures that deleted files cannot be recovered by standard recovery tools, enhancing data privacy and security. The algorithm supports multiple overwrite passes with user-defined sequences, adapting to different security standards and user needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Stan.jl

    Stan.jl

    Stan.jl illustrates the usage of the 'single method' packages

    A collection of example Stan Language programs demonstrating all methods available in Stan's cmdstan executable (as an external program) from Julia. For most applications one of the "single method" packages, e.g. StanSample.jl, StanDiagnose.jl, etc., is a better choice for day-to-day use. To execute the most important method in Stan ("sample"), use StanSample.jl. Some Pluto notebook examples can be found in the repository.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB