Search Results for "data collection algorithm" - Page 4

Showing 1711 open source projects for "data collection algorithm"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 1
    Argilla

    Argilla

    The open-source data curation platform for LLMs

    ...Argilla is free, open-source, and 100% compatible with major NLP libraries (Hugging Face transformers, spaCy, Stanford Stanza, Flair, etc.). In fact, you can use and combine your preferred libraries without implementing any specific interface. Most annotation tools treat data collection as a one-off activity at the beginning of each project. In real-world projects, data collection is a key activity of the iterative process of ML model development. Once a model goes into production, you want to monitor and analyze its predictions, and collect more data to improve your model over time. Argilla is designed to close this gap, enabling you to iterate as much as you need.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Live-Forensicator

    Live-Forensicator

    A suite of Tools to aid Incidence Response and Live Forensics

    Live-Forensicator is a toolkit intended for live forensic collection and initial triage on Windows machines. It automates the capture of volatile information—running processes, network connections, loaded drivers, account sessions, and in-memory artifacts—into a consistent artifact set that investigators can analyze offline. The tool tries to be non-invasive while collecting sensitive data quickly and logs the collection steps to preserve chain-of-custody details and to help auditors understand potential collection side effects. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    PeriodicTable.jl

    PeriodicTable.jl

    Periodic Table for Julians

    ...PeriodicTable.jl provides a Julia interface to a small database of element properties for all of the elements in the periodic table. In particular PeriodicTable exports a global variable called elements, which is a collection of Element data structures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Feather

    Feather

    Simply beautiful open source icons

    Feather is a collection of simply beautiful open source icons. Each icon is designed on a 24x24 grid with an emphasis on simplicity, consistency, and flexibility. At its core, Feather is a collection of SVG files. This means that you can use Feather icons in all the same ways you can use SVGs (e.g. img, background-image, inline, object, embed, iframe).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 5
    ClickHouse

    ClickHouse

    A fast open-source OLAP database management system

    ClickHouse® is a fast, open-source column-oriented database management system that can generate analytical data reports through SQL queries in real time. According to several independent benchmarks, it far exceeds other comparable column-oriented database management systems, working even up to 1000 times faster. It is able to process hundreds of millions to more than a billion rows and tens of gigabytes of data per single server per second. Apart from its blazing speed, ClickHouse is highly...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 6
    Homemade Machine Learning

    Homemade Machine Learning

    Python examples of popular machine learning algorithms

    homemade-machine-learning is a repository by Oleksii Trekhleb containing Python implementations of classic machine-learning algorithms done “from scratch”, meaning you don’t rely heavily on high-level libraries but instead write the logic yourself to deepen understanding. Each algorithm is accompanied by mathematical explanations, visualizations (often via Jupyter notebooks), and interactive demos so you can tweak parameters, data, and observe outcomes in real time. The purpose is pedagogical: you’ll see linear regression, logistic regression, k-means clustering, neural nets, decision trees, etc., built in Python using fundamentals like NumPy and Matplotlib, not hidden behind API calls. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Helium Browser

    Helium Browser

    Private, fast, and honest web browser

    ...Helium blocks ads and trackers by default through an integrated, unbiased uBlock Origin extension prepackaged as a native browser component. Its UI and feature set emphasize minimalism, no “smart” recommendations, account sync, or background data collection, resulting in a distraction-free browsing experience that respects user autonomy. The browser is available across macOS, Linux, and Windows, each version built from a fully open source pipeline for reproducibility and trust. Development focuses on maintaining compatibility with modern web standards while decoupling Chromium from its Google dependencies and services.
    Downloads: 150 This Week
    Last Update:
    See Project
  • 8
    google-java-format

    google-java-format

    Reformats Java source code to comply with Google Java Style

    google-java-format is a program that reformats Java source code to comply with Google Java Style. The formatter can act on whole files, on limited lines, on specific offsets, passing through to standard-out (default) or altered in-place. There is no configurability as to the formatter's algorithm for formatting. This is a deliberate design decision to unify our code formatting on a single format. The formatter can be used in software that generates java to output more legible java code. Just...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    hctsa

    hctsa

    Highly comparative time-series analysis

    hctsa is a Matlab software package for running highly comparative time-series analysis. It extracts thousands of time-series features from a collection of univariate time series and includes a range of tools for visualizing and analyzing the resulting time-series feature matrix.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Corcel

    Corcel

    Use WordPress backend with Laravel or any PHP application

    A collection of Model classes that allows you to get data directly from a WordPress database. Corcel is a collection of PHP classes built on top of Eloquent ORM (from Laravel framework), that provides a fluent interface to connect and get data directly from a WordPress database. You can use WordPress as the backend (administration panel) or CMS, for inserting posts, custom types, etc, and any other PHP app in the other side querying those data (as a Model layer). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DataFrame

    DataFrame

    C++ DataFrame for statistical, Financial, and ML analysis

    ...You can multi-column sort, custom pick, and delete the data. DataFrame also includes a large collection of analytical algorithms in the form of visitors. These are from basic stats such as Mean, and Std Deviation and return, … to more involved analysis such as Affinity Propagation, Polynomial Fit, and Fast Fourier transform of arbitrary length … including a good collection of trading indicators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Tulip.jl

    Tulip.jl

    Interior-point solver in pure Julia

    Tulip is an open-source interior-point solver for linear optimization, written in pure Julia. It implements the homogeneous primal-dual interior-point algorithm with multiple centrality corrections and therefore handles unbounded and infeasible problems. Tulip’s main feature is that its algorithmic framework is disentangled from linear algebra implementations. This allows to seamless integration of specialized routines for structured problems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Loggie

    Loggie

    A lightweight, cloud-native data transfer agent and aggregator

    Loggie is a lightweight, high-performance, cloud-native agent and aggregator based on Golang. Loggie includes LogConfig/ClusterLogConfig/Interceptor/Sink CRDs, allowing for the creation of data collection, transfer, processing, and sending pipelines through simple YAML file creation. Supports deployment as an independent intermediate machine, which can receive aggregated data sent by Loggie Agent and can also be used to consume and process various data sources. Configure Filebeat and Loggie to collect logs, and send them to a Kafka topic without using client compression, with the Kafka topic partition configured as 3. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    OpenNetMeter

    OpenNetMeter

    A simple program to monitor your network/data usage

    ...It lives in the system tray, making it accessible and minimally intrusive for everyday use. Since it's open-source, users can inspect how data collection is done and trust that it’s not sending data elsewhere. While still evolving, it serves a practical niche for users who want more transparency than what built-in OS tools provide.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    etcd

    etcd

    Distributed reliable key-value store for the most critical data

    A distributed, reliable key-value store for the most critical data of a distributed system. etcd is a strongly consistent, distributed key-value store that provides a reliable way to store data that needs to be accessed by a distributed system or cluster of machines. It gracefully handles leader elections during network partitions and can tolerate machine failure, even in the leader node. etcd is a consistent distributed key-value store. Mainly used as a separate coordination service, in...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    KCloud‑Platform‑IoT

    KCloud‑Platform‑IoT

    KCloud-Platform-IoT

    KCloud-Platform-IoT is a comprehensive open-source IoT management platform built with Spring Cloud and Vue.js. It supports device registration, data collection, rule-based processing, and dashboard visualization. Designed for scalability and modularity, the platform is ideal for managing large IoT fleets in industrial or smart city environments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    MoshiX

    MoshiX

    Extensions for Moshi including IR plugins, moshi-sealed, and more

    Extensions for Moshi. Moshi is a modern JSON library for Android, Java and Kotlin. It makes it easy to parse JSON into Java and Kotlin classes.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    My Brain

    My Brain

    Open-source, All-in-one productivity app for Tasks, Notes, Calendar

    My Brain is an Open source, All-in-one productivity app for Tasks, Notes, Calendars, Diary, and Bookmarks. Private with no data collection and no internet permission at all. Create tasks with priority, sub-tasks, descriptions, and due dates and reminders. Create Notes that support markdown which enables you to use Headers, lists, links, etc. Record your mood daily and view your mood summary with beautiful graphs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Apache DevLake

    Apache DevLake

    Apache DevLake is an open-source dev data platform

    ...You can ask Apache DevLake many questions regarding your development process. Just connect and query. Your Dev Data lives in many silos and tools. DevLake brings them all together to give you a complete view of your Software Development Life Cycle (SDLC). From DORA to scrum retros, DevLake implements metrics effortlessly with prebuilt dashboards supporting common frameworks and goals. DevLake fits teams of all shapes and sizes, and can be readily extended to support new data sources, metrics, and dashboards, with a flexible framework for data collection and transformation.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    HugeGraph

    HugeGraph

    A graph database that supports more than 100+ billion data

    ...HugeGraph supports fast import performance in the case of more than 10 billion Vertices and Edges Graph, millisecond-level OLTP query capability, and can be integrated into big data platforms like Hadoop or Spark for OLAP analysis. The main scenarios of HugeGraph include correlation search, fraud detection, and knowledge graph. Not only supports Gremlin graph query language and RESTful API but also provides commonly used graph algorithm APIs. To help users easily implement various queries and analyses, HugeGraph has a full range of accessory tools, such as supporting distributed storage, data replication, scaling horizontally, and supports many built-in backends of storage engines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    SecLists

    SecLists

    The Pentester’s Companion

    SecLists is the ultimate security tester’s companion. It is a collection of various types of lists commonly used during security assessments, all in one place. SecLists helps to increase efficiency and productivity in security testing by conveniently providing all the lists a security tester may need in one repository. List types include those for usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and many more.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 22
    Whirlwind

    Whirlwind

    Ridiculously fast, fully asynchronous, sharded hashmap for Rust

    Collection of thread-safe, asynchronous data structures. Whirlwind is a lightweight CSS framework focused on performance and simplicity, offering developers a minimal approach to styling web applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    MultivariatePolynomials.jl

    MultivariatePolynomials.jl

    Multivariate polynomials interface

    MultivariatePolynomials.jl is an implementation-independent library for manipulating multivariate polynomials. It defines abstract types and an API for multivariate monomials, terms, and polynomials and gives default implementation for common operations on them using the API. On the one hand, This packages allows you to implement algorithms on multivariate polynomials that will be independant on the representation of the polynomial that will be chosen by the user. On the other hand, it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    RStudio Cheatsheets

    RStudio Cheatsheets

    Curated collection of official cheat sheets for data science tools

    The cheatsheets repository from RStudio is a curated collection of official cheat sheets for R, RStudio, the tidyverse, Shiny, and related data science tools. Each cheat sheet is a single (or double) page PDF that condenses important syntax, functions, workflows, and best practices into a visually organized format ideal for quick reference. The repository contains source files (R Markdown or LaTeX) that generate the cheat sheets, version history, and metadata (title, author, description) for each. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Supabase Wrappers

    Supabase Wrappers

    Postgres Foreign Data Wrapper development framework in Rust

    Supabase Wrappers is a collection of PostgreSQL extensions (written in Rust) that allow Postgres to interact with external systems like APIs, files, and other databases. It uses Foreign Data Wrappers (FDWs) to treat external resources as if they were native SQL tables, enabling seamless integration with external services directly from SQL queries.
    Downloads: 1 This Week
    Last Update:
    See Project