Showing 28 open source projects for "dataset"

  • 1
    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    ...The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. Beyond dataset creation, Easy DataSet also provides a built-in evaluation system with model testing and blind-test features, helping teams validate model performance using curated test sets.
    Downloads: 1 This Week
  • 2
    IPFS GeoIP

    GeoIP lookup over DAG-CBOR dataset loaded from IPFS

    GeoIP lookup over IPFS, using a DAG-CBOR dataset loaded from IPFS.
    Downloads: 0 This Week
  • 3
    AI Deadlines

    AI conference deadline countdowns

    ...The repository powers a website that displays countdown timers and structured information for top research conferences across subfields such as computer vision, natural language processing, machine learning, and robotics. The project maintains a curated dataset of conferences that includes metadata such as submission deadlines, abstract deadlines, event dates, conference locations, and related information. Researchers and students use the platform to plan their paper submissions and manage academic schedules without manually tracking multiple conference announcements. The repository includes configuration files and data sources that allow contributors to add or update conferences through pull requests, enabling community-driven maintenance.
    Downloads: 0 This Week
  • 4
    emojilib

    Emoji keyword library

    Make emoji searchable with this keyword library. If you are looking for the Unicode emoji dataset, including version, grouping, ordering, and skin tone support flag, check out unicode-emoji-json.
    Downloads: 0 This Week
  • 5
    Magda

    A federated data catalog for all your big and small data

    Magda is an open-source data catalog system designed to make datasets easier to find, access, and use. Built for government and enterprise use, it supports harvesting metadata from multiple sources, managing data access policies, and integrating with data APIs. Magda is highly customizable and ideal for building open data portals or internal data discovery tools.
    Downloads: 1 This Week
  • 6
    Kiln

    Open source platform for managing, testing, and deploying AI apps

    Kiln is an open source platform designed to help developers build, evaluate, and deploy AI-powered applications with greater structure and reliability. It provides a unified environment for managing prompts, datasets, and evaluation workflows, allowing teams to iterate on AI behavior in a controlled and measurable way. Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic...
    Downloads: 4 This Week
  • 7
    dejavu

    The missing web UI for Elasticsearch

    dejavu is the missing web UI for Elasticsearch. Existing web UIs leave much to be desired or are built with server-side page rendering techniques that make them less responsive and bulkier to run (I am looking at you, Kibana). We started building dejavu with the goal of creating a modern web UI (no page reloads, infinite scroll, filtered views, realtime updates, search UI builder) for Elasticsearch with 100% client-side rendering so one can easily run it as a hosted app on GitHub Pages, as a...
    Downloads: 3 This Week
  • 8
    GeoNode

    GeoNode is an open source platform for geospatial data

    ...It brings together mature and stable open-source software projects under a consistent and easy-to-use interface allowing non-specialized users to share data and create interactive maps. Data management tools built into GeoNode allow for integrated creation of data, metadata, and map visualization. Each dataset in the system can be shared publicly or restricted to allow access to only specific users. Social features like user profiles and commenting and rating systems allow for the development of communities around each platform to facilitate the use, management, and quality control of the data the GeoNode instance contains. It is also designed to be a flexible platform that software developers can extend, modify or integrate against to meet requirements in their own applications.
    Downloads: 0 This Week
  • 9
    LLM Datasets

    Curated list of datasets and tools for post-training

    ...Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. Licensing and provenance are surfaced to encourage compliant usage and to guide dataset selection in commercial settings. For practitioners, the repo is a practical “starting pantry” that accelerates experimentation and helps keep data wrangling from dominating the project timeline.
    Downloads: 1 This Week
  • 10
    4gen

    A password generator for Windows

    4gen is a secure, offline, lightweight password generator for Windows, packaged as an HTA. It offers features such as mouse-movement entropy, full charset control (lowercase, uppercase, digits, symbols, Unicode), and character exclusions. You can save and reuse multiple custom generation profiles with persistent local storage to quickly generate passwords for anything. It can be used as a portable app or as an installed program.
    Downloads: 0 This Week
  • 11
    RetroPie BIOS

    Full BIOS collection for RetroPie

    ...It also includes checksum validation mechanisms, allowing users to verify file integrity and ensure that BIOS files match expected standards. By maintaining a structured and verified dataset, the repository reduces common issues associated with missing or incorrect BIOS files.
    Downloads: 1 This Week
  • 12
    Frappe Charts

    Simple, responsive, modern SVG Charts with zero dependencies

    GitHub-inspired simple and modern SVG charts for the web with zero dependencies. An axis chart is generally a 2D rendition of data, where a set of values corresponds to every point in a dataset. That is why data is the most important component of a chart. A chart can have multiple datasets. In an axis chart, every dataset is represented individually. Frappe Charts are responsive, as they rerender all the data in the currently available container width. To set the bar width, instead of defining it and the space between the bars independently, we simply define the ratio of the space between bars to the bar width. ...
    Downloads: 0 This Week
  • 13
    Rent Qualify
    ...Version 1.2.0 will find affordable cities based on your calculations. The affordable city matching feature requires a free API key from HUD, available at https://www.huduser.gov/portal/dataset/fmr-api.html
    Downloads: 0 This Week
  • 14
    Universal Data Tool

    Collaborate on and label any type of data: images, text, documents, etc.

    ...Simplicity without sacrificing any powerful developer features and integrations. Use the Universal Data Tool directly from a web browser or with a Windows, Mac or Linux desktop application. Follow a link to join a collaborative session and watch team members complete dataset samples in real time. Import from your S3 buckets easily with IAM or Cognito authentication. Working together, we can accomplish more. The Universal Data Tool was built to bring together the best ideas from different machine learning communities. Upload your dataset to Courses to create a training course. ...
    Downloads: 0 This Week
  • 15

    SBA Public Datasets

    Provides a list of all the datasets available in the Public Data Inven

    Based on the data.json file obtained from https://catalog.data.gov/dataset/sba-public-datasets. THIS APPLICATION IS DEVELOPED BY A PRIVATE COMPANY AND NOT THE SMALL BUSINESS ADMINISTRATION. THE SMALL BUSINESS ADMINISTRATION PROVIDES ONLY THE DATA.JSON FROM DATA.GOV.
    Downloads: 0 This Week
  • 16
    DCGAN in Tensorflow

    Deep Convolutional Generative Adversarial Networks

    DCGAN-tensorflow is a classic TensorFlow implementation of Deep Convolutional Generative Adversarial Networks, intended to demonstrate and reproduce the stabilized GAN architecture described in the original research. The repository provides complete training scripts, model definitions, and utilities for generating synthetic images from datasets such as MNIST and CelebA. It serves both as an educational reference and as a practical starting point for developers experimenting with generative...
    Downloads: 0 This Week
  • 17
    Supervised Reptile

    Code for the paper "On First-Order Meta-Learning Algorithms"

    ...Because Reptile is a first-order algorithm, it avoids computing second derivatives or full meta-gradients, making it computationally simpler while retaining good performance. The repo includes training scripts, dataset fetchers (Omniglot, Mini-ImageNet), and modules for defining the Reptile update logic, variables, and hyperparameters.
    Downloads: 0 This Week
  • 18
    CuteReport

    Qt-based report solution

    CuteReport is a report solution like JasperReports, Crystal Reports or FastReport, but based on the Qt framework. It can be easily used with any Qt application. In general, CuteReport consists of two parts: a core library and a template designer. Both are fully modular, and their functionality can be easily extended by writing additional modules. It is agnostic about the data it uses and can use a file system, database, version control system, etc. as storage. The project's goal is to provide...
    Downloads: 10 This Week
  • 19

    js-snail

    An innovative way to visualise a matrix

    This is an offspring of an idea I had back in 2004 for a way to visualise and navigate within a dataset. The snail is like a matrix with rows and columns, but visualised as sectors and rows with a central level indicator that provides an aggregated value. One can switch views (global by sector, global by rows, details) by clicking the central indicator or selecting rows/sectors, and of course drill down. This is based on D3 (d3js.org); the original was in Flash. ...
    Downloads: 0 This Week
  • 20
    Roomba

    A Node.js tool to examine the correctness of Open Data Metadata

    Linked Open Data (LOD) has emerged as one of the largest collections of interlinked datasets on the web. Benefiting from this mine of data requires descriptive information about each dataset in the accompanying metadata. Such meta-information is currently limited to a few data portals, where it is usually provided manually and therefore gives little or poor-quality insight. To address this issue, we propose a scalable automatic approach for extracting, validating and generating descriptive linked dataset profiles. This approach applies several techniques to check the validity of the attached metadata and to provide descriptive and statistical information about a single dataset as well as a whole data portal. ...
    Downloads: 0 This Week
  • 21
    ConvNetJS

    Deep learning in JavaScript to train convolutional neural networks

    ConvNetJS is a JavaScript library for training Deep Learning models (Neural Networks) entirely in your browser. Open a tab and you're training. No software requirements, no compilers, no installations, no GPUs, no sweat. ConvNetJS is an implementation of Neural Networks, together with nice browser-based demos. It currently supports common Neural Network modules (fully connected layers, non-linearities), classification (SVM/Softmax) and Regression (L2) cost functions, ability to specify and...
    Downloads: 0 This Week
  • 22

    tedo

    DOM-oriented JavaScript template for dynamic pages

    ...Example usage: tedo.my_list.addLast(). The goal is compatibility with recent versions of all major browsers, both mobile and desktop, but it requires some newer features such as classList, dataset and elementNode references.
    Downloads: 0 This Week
  • 23
    VCL.JS

    TypeScript component-based framework for enterprise web applications

    ...
    //Simple dbgrid bound to a query
    import V = require("VCL/VCL");

    export class PageHome extends V.TPage {
        constructor() {
            super();
            //create a backend query
            var qur = new V.TQuery(this);
            qur.SQL = "SELECT CustomerKey, FirstName, LastName FROM Customers";
            qur.open();
            //create a grid on the screen
            var grd = new V.TDBGrid(this, "grid");
            grd.Dataset = qur; //bind the grid to the dataset
            grd.PageSize = 15;
            var col = grd.createColumn("FirstName");
            var col = grd.createColumn("Lastname", "Last Name");
        }
    }
    Downloads: 0 This Week
  • 24
    Document Analysis and Exploitation
    The Document Analysis and Exploitation Platform is a Drupal-based web interface to a cloud-enabled Document Analysis resource set.
    Downloads: 0 This Week
  • 25
    Messi is a REST Ajax framework for Java and PHP. The database is manipulated transparently via SQL statements using JavaScript. Records are fetched to the browser as a scrollable dataset linked to data-aware JS widgets such as an editable grid and various fields.
    Downloads: 0 This Week