Showing 149 open source projects for "data structure"

  • 1
    Cookiecutter Data Science

    Project structure for doing and sharing data science work

    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. When we think about data analysis, we often think just about the resulting reports, insights, or visualizations. While these end products are generally the main event, it's easy to focus on making the products look nice and ignore the quality of the code that generates them. Because these end products are created programmatically, code quality is still important! ...
    Downloads: 0 This Week
  • 2
    CausalInference.jl

    Causal inference, graphical models and structure learning in Julia

    Julia package for causal inference and analysis, graphical models and structure learning. This package contains code for the PC algorithm and the extended FCI algorithm, the score-based greedy equivalence search (GES) algorithm, the Bayesian Causal Zig-Zag sampler and a function suite for adjustment set search.
    Downloads: 1 This Week
  • 3
    MinerU

    A high-quality tool for converting PDFs to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 18 This Week
  • 4
    PlantUML

    Generate diagrams from textual description

    Generate UML diagrams from textual descriptions. PlantUML is not affected by the log4j vulnerability. The easiest way to test PlantUML is in an online solution that has PlantUML embedded, such as our online server. After testing, you may want to install PlantUML locally. Run (or have your software call) PlantUML, using sequenceDiagram.txt as input. The output is an image, which either appears in the other software or is written to an image file on disk. Diagrams are defined using a simple and...
    Downloads: 43 This Week
  • 5
    MultilayerGraphs.jl

    Julia package for the creation and analysis of multilayer graphs

    MultilayerGraphs.jl is a Julia package for the creation, manipulation and analysis of the structure, dynamics and functions of multilayer graphs. A multilayer graph is a graph consisting of multiple standard subgraphs called layers which can be interconnected through bipartite graphs called interlayers composed of the vertex sets of two different layers and the edges between them. The vertices in each layer represent a single set of nodes, although not all nodes have to be represented in...
    Downloads: 0 This Week
  • 6
    OpenProject

    OpenProject is the leading open source project management software

    Open source project management software. Efficient classic, agile or hybrid project management in a secure environment. Take control of your data and stay secure. Take back ownership of your data: we give you the power to protect your privacy in a secure environment with the highest standards of data security and privacy. OpenProject can be installed on-premises in your own infrastructure, giving you full control and 100% data ownership. You know where your data is stored and what happens to it, without external influence. ...
    Downloads: 80 This Week
  • 7
    MarketData.jl

    Time series market data

    The MarketData package provides open-source financial data for research and testing. The data is from Quandl and is free end-of-day stock data. It is public domain without restrictions. The TimeSeries TimeArray data structure is used to store the data, but conversion to other data structures, including DataFrames and AxisArrays, is supported.
    Downloads: 0 This Week
  • 8
    MolecularGraph.jl

    Graph-based molecule modeling toolkit for cheminformatics

    MolecularGraph.jl is a graph-based molecule modeling and chemoinformatics analysis toolkit fully implemented in Julia.
    Downloads: 0 This Week
  • 9
    ProbabilisticCircuits.jl

    Probabilistic Circuits from the Juice library

    This module provides a Julia implementation of Probabilistic Circuits (PCs), tools to learn the structure and parameters of PCs from data, and tools to do tractable exact inference with them. Probabilistic Circuits provide a unifying framework for several families of tractable probabilistic models. PCs are represented as computational graphs that define a joint probability distribution as recursive mixtures (sum units) and factorizations (product units) of simpler distributions (input units). ...
    Downloads: 0 This Week
  • 10
    Milvus

    Vector database for scalable similarity search and AI applications

    Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment. Milvus 2.0 is a cloud-native vector database with storage and computation separated by design. All components in this refactored version of Milvus are stateless to enhance elasticity and flexibility. Average latency measured in milliseconds on trillion...
    Downloads: 8 This Week
  • 11
    Outerbase

    A lightweight Database GUI in your browser

    Outerbase Studio is an open-source, browser-based database GUI designed for simplicity and versatility. It supports connecting to various relational databases, providing an intuitive interface for managing and querying data without the need for complex setups.
    Downloads: 1 This Week
  • 12
    Symfony PropertyInfo

    Extracts information about PHP class' properties using metadata

    Symfony PropertyInfo is a component that extracts information about the properties of PHP classes, such as their names, types, visibility, and documentation. It is particularly useful in scenarios like serialization, form generation, and validation, where understanding the structure of an object is essential. PropertyInfo can fetch data from PHPDoc annotations, reflection, and type hints, offering flexible integration with Symfony and other systems.
    Downloads: 0 This Week
  • 13
    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 1 This Week
  • 14
    AI Hedge Fund

    An AI Hedge Fund Team

    This repository demonstrates how to build a simplified, automated hedge fund strategy powered by AI/ML. It integrates financial data collection, preprocessing, feature engineering, and predictive modeling to simulate decision-making in trading. The code shows workflows for pulling stock or market data, applying machine learning algorithms to forecast trends, and generating buy/sell/hold signals based on the predictions. Its structure is educational: intended more as a proof-of-concept than a ready-to-use financial product, giving learners insight into the mechanics of quantitative finance automation. ...
    Downloads: 1 This Week
  • 15
    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT native database with high performance for data management and analysis, deployable on the edge and the cloud. Due to its light-weight architecture, high performance and rich feature set together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in the IoT industrial fields. In the scene of factories, there...
    Downloads: 3 This Week
  • 16
    city-roads

    Visualization of all roads within any city

    city-roads is a data visualization and mapping project that renders street networks of cities as vector paths, offering an interactive, zoomable experience that highlights how cities are stitched together by their road infrastructure. It typically fetches open map data (such as from OpenStreetMap) and then processes that data into geometry suited for rendering in the browser, allowing users to explore intricate road layouts at various scales.
    Downloads: 1 This Week
  • 17
    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components....
    Downloads: 0 This Week
  • 18
    CellTypist

    A tool for semi-automatic cell type classification, harmonization

    CellTypist is an automated tool for cell type classification, harmonization, and integration. Classification: transfer cell type labels from the reference to the query dataset. Harmonization: match and harmonize cell types defined by independent datasets. Integration: integrate cells and cell types with supervision from harmonization. CellTypist recapitulates cell type structure and biology of independent datasets. Regularised linear models with Stochastic Gradient Descent provide a fast and...
    Downloads: 0 This Week
  • 19
    EAGO.jl

    A development environment for robust and global optimization

    EAGO is an open-source development environment for robust and global optimization in Julia. EAGO is a deterministic global optimizer designed to address a wide variety of optimization problems, emphasizing nonlinear programs (NLPs), by propagating McCormick relaxations along the factorable structure of each expression in the NLP. Most operators supported by modern automatic differentiation (AD) packages (e.g., +, sin, cosh) are supported by EAGO and a number of utilities for sanitizing...
    Downloads: 0 This Week
  • 20
    Common Core Ontologies

    The Common Core Ontology Repository

    The Common Core Ontologies (CCO) comprise twelve ontologies that are designed to represent and integrate taxonomies of generic classes and relations across all domains of interest. CCO is a mid-level extension of Basic Formal Ontology (BFO), an upper-level ontology framework widely used to structure and integrate ontologies in the biomedical domain (Arp, et al., 2015). BFO aims to represent the most generic categories of entity and the most generic types of relations that hold between them,...
    Downloads: 1 This Week
  • 21
    LinearSolve.jl

    High-Performance Unified Interface for Linear Solvers in Julia

    LinearSolve.jl is a unified interface for the linear solving packages of Julia. It interfaces with other packages of the Julia ecosystem to make it easy to test alternative solver packages and pass small types to control algorithm swapping. It also interfaces with the ModelingToolkit.jl world of symbolic modeling to allow for automatically generating high-performance code. Performance is key: the current methods are made to be highly performant on scalar and statically sized small problems,...
    Downloads: 0 This Week
  • 22
    Strapi

    API creation made simple, secure and fast

    ...All your favorite dev tools, from static site generators and databases to hosting platforms, work with Strapi, so you're never locked in. Strapi is designed to be developer-first, with powerful features that allow you to create a flexible data structure in just minutes. Strapi also makes it easy for editors to manage their content and distribute it anywhere. With Strapi, you always have full control of your data.
    Downloads: 4 This Week
  • 23
    Discord.SortedSet

    Elixir SortedSet backed by a Rust-based NIF

    SortedSet NIF is a performant and reliable sorted set data structure for Elixir, implemented in Rust using the Rustler crate to take advantage of native performance while maintaining seamless integration with the BEAM ecosystem. It provides ordering and uniqueness guarantees, with all terms stored according to Elixir’s built-in sorting rules. Internally, it uses a vector of vectors layout rather than a single vector to minimize costly reallocations, allowing efficient bucket pointer copying instead of expensive term copying during growth. ...
    Downloads: 1 This Week
  • 24
    PowerSimulations.jl

    Julia for optimization, simulation, and modeling of PowerSystems

    PowerSimulations.jl is a Julia package for modeling and simulating power system operations. Its goals are to provide a flexible modeling framework that can accommodate problems of different complexity and at different time scales, to streamline the construction of large-scale optimization problems to avoid repetition of work when adding or modifying model details, and to exploit Julia's capabilities to improve the computational performance of large-scale power system quasi-static simulations. The flexible...
    Downloads: 0 This Week
  • 25
    brms

    brms R package for Bayesian generalized multivariate models using Stan

    brms is an R package by Paul Bürkner which provides a high-level interface for fitting Bayesian multilevel (i.e. mixed effects) models, generalized linear / non-linear / multivariate models using Stan as the backend. It allows R users to specify complex Bayesian models using formula syntax similar to lme4 but with far more flexibility (distributions, link functions, hierarchical structure, nonlinear terms, etc.). It supports model diagnostics, posterior predictive checking, model comparison,...
    Downloads: 0 This Week