Showing 53 open source projects for "index data"

  • 1
    The Hypersim Dataset

    Photorealistic Synthetic Dataset for Holistic Indoor Scene

    ...The dataset spans diverse furniture layouts, room types, and camera trajectories, enabling robust training for geometry, segmentation, and SLAM-adjacent tasks. Rendering pipelines and utilities allow researchers to reproduce sequences, generate novel views, or extract task-specific supervision. Because the data are perfectly labeled and controllable, Hypersim is well suited for pretraining and for studying domain transfer to real imagery. The repository acts as both a dataset index and a set of scripts for downloading, managing, and evaluating on standardized splits.
    Downloads: 0 This Week
  • 2
    ZeusDB Vector Database

    Blazing-fast vector DB with similarity search and metadata filtering

    ZeusDB is a vector database built for fast, scalable similarity search with strong production ergonomics. It combines high-performance approximate nearest neighbor indexes with clean APIs and metadata filtering so applications can retrieve semantically relevant items at low latency. The storage layer is designed for durability and growth, supporting sharding, replication, and background compaction while keeping query tails predictable. Developers get multiple ingestion paths—batch,...
    Downloads: 0 This Week
  • 3
    Weather Cast

    A desktop weather app powered by AI

    Weather Cast is a desktop weather app for Windows that shows detailed weather information for a searched city. The dashboard shows the city's current temperature, a temperature description, pressure, wind, humidity, dew point, UV index, local time, and air pollution index.
    Downloads: 0 This Week
  • 4

    RSM

    Radiation Spectrum Method: a modal BPM (Beam Propagation Method)

    RSM (Radiation Spectrum Method) is a rigorous 2D tool for solving Maxwell's equations for the propagation of light in integrated optics or photonic devices. It uses an EigenMode Expansion (EME) method to solve the electromagnetic problem. The software runs on Windows and macOS and comes with a GUI that lets you define arbitrary, complex waveguide geometries with the aid of files or scripts, so that any waveguide geometry can be handled. Several plots are available...
    Downloads: 4 This Week
  • 5

    GromacsProSuite

    Graphical User Interface for Gromacs

    This tool is an integrated graphical interface that simplifies molecular dynamics simulations using GROMACS. It provides a structured, tab-based environment to set up, execute, and analyze simulation data without complex command-line operations. The software automates tasks such as topology generation, solvation, ion addition, minimization, equilibration, and production runs while executing GROMACS commands in the background. Built-in monitoring tracks CPU, RAM, and disk usage to ensure...
    Downloads: 1 This Week
  • 6
    dirhunt

    Web crawler that finds hidden web directories without brute force

    Dirhunt is an open source security tool designed to discover web directories and analyze website structures without relying on brute-force techniques. Instead of sending large numbers of guess-based requests, it operates as a specialized crawler that intelligently explores websites to identify accessible or hidden directories. Dirhunt can detect directories that expose “Index Of” listings, which may reveal files and other resources that were not intended to be publicly visible. It can also...
    Downloads: 6 This Week
  • 7
    PRM800K

    800,000 step-level correctness labels on LLM solutions to MATH problems

    ...The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that can contain many step-level labels and rich metadata such as labeler UUIDs, timestamps, generation identifiers, and quality-control flags. Each labeled step can include multiple candidate completions with ratings of -1, 0, or +1, optional human-written corrections (phase 1), and a chosen completion index, along with a final finish reason such as found_error, solution, bad_problem, or give_up.
    Downloads: 2 This Week
  • 8
    whiteboxgui

    An interactive GUI for WhiteboxTools in a Jupyter-based environment

    ...WhiteboxTools also contains advanced tooling for spatial hydrological analysis (e.g. flow accumulation, watershed delineation, stream network analysis, sink removal) and terrain analysis (e.g. common terrain indices such as slope, curvatures, wetness index, hillshading; hypsometric analysis; etc.).
    Downloads: 5 This Week
  • 9
    Compose

    A machine learning tool for automated prediction engineering

    ...It allows you to structure prediction problems and generate labels for supervised learning. An end user defines an outcome of interest by writing a labeling function, then runs a search to automatically extract training examples from historical data. Its result is then provided to Featuretools for automated feature engineering and subsequently to EvalML for automated machine learning. Prediction problems are structured by using a label maker and a labeling function. The label maker automatically extracts data along the time index to generate labels. The process starts by setting the first cutoff time after the minimum amount of data. ...
    Downloads: 4 This Week
  • 10
    Smart Contract Sanctuary

    A home for Ethereum smart contracts

    A home for Ethereum smart contracts verified on Etherscan. This is the index repository for the smart contract sanctuary. Bookmark this repo. Chain-specific sub-repos and the index are updated twice a day. Expect a full, recursive check-out to take 2 GB+ of disk space. If you have an existing repository whose submodules were never initialized, check out the submodules and update all chain-specific sub-repositories. Contains smart contract sources for various networks, grouped by the first two characters of the contract address. ...
    Downloads: 0 This Week
  • 11
    Aquila DB

    An easy to use Neural Search Engine

    Aquila DB is a neural search engine. In other words, it is a database that indexes latent vectors generated by ML models, along with JSON metadata, to perform k-NN retrieval. It is dead simple to set up, language-agnostic, and a drop-in addition to your machine learning applications. With its current features, Aquila DB is a ready solution for machine learning engineers and data scientists to build neural information retrieval applications out of the box with minimal dependencies.
    Downloads: 0 This Week
  • 12
    DrQA

    Reading Wikipedia to Answer Open-Domain Questions

    ...The retriever relies on classic IR features (like TF-IDF and n-gram statistics) to remain lightweight and scalable to millions of documents. The reader is a neural model trained on supervised QA data to estimate start and end positions within a paragraph, and it can be adapted to new domains through fine-tuning or distant supervision. The repository includes scripts to build the Wikipedia index, train the reader, and evaluate end-to-end performance. DrQA popularized a practical recipe for combining IR and neural reading, and it remains a strong baseline for open-domain QA research and production prototypes.
    Downloads: 0 This Week
  • 13
    MachineLearningStocks

    Using Python and scikit-learn to make stock predictions

    MachineLearningStocks is a Python-based template project that demonstrates how machine learning can be applied to predicting stock market performance. The project provides a structured workflow that collects financial data, processes features, trains predictive models, and evaluates trading strategies. Using libraries such as pandas and scikit-learn, the repository shows how historical financial indicators can be transformed into machine learning features. The model attempts to predict whether specific stocks will outperform a benchmark index such as the S&P 500. ...
    Downloads: 0 This Week
  • 14
    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP) whose goal is to quickly implement NLP tasks and build complex models. A unified tabular data container simplifies data preprocessing. Built-in Loader and Pipe support for multiple datasets eliminates the need for preprocessing code. It offers various convenient NLP tools, such as embedding loading (including ELMo and BERT) and intermediate data caching. It provides a variety of neural network components and recurrence models...
    Downloads: 0 This Week
  • 15
    Invenio

    Invenio digital library framework

    Invenio is a highly customizable open-source framework for building large-scale digital repositories and research data platforms. Developed by CERN, it is designed to manage, index, and provide access to metadata-rich content such as publications, datasets, and multimedia files. Invenio provides a modular architecture, making it suitable for libraries, archives, and research institutions.
    Downloads: 0 This Week
  • 16
    pyFileSearcher

    simple searching tool for big fileservers

    pyFileSearcher was designed to be a lightweight, easy-to-use tool that is still capable of handling a large volume of files: a tool that I personally could use on large corporate servers to find out which files have taken all my space in the last few days. It's free, it's open source, and it runs on Linux and Windows. The program is written in Python 3 using Qt5.
    Downloads: 0 This Week
  • 17
    VividSTORM

    Correlated confocal and SMLM data visualization and analysis

    VividSTORM is free and open-source standalone software with a graphical user interface for the correlated visualization and analysis of super-resolution single molecule localization microscopy (SMLM) molecule lists and conventional pixel intensity-based images. The localization points (LPs) within a region of interest (ROI) can be analyzed using the selected built-in functions. NOTE: If you encounter issues not addressed by the user guide, please get in touch by message on this site or via e-mail for...
    Downloads: 0 This Week
  • 18
    diskover

    File system crawler and disk space usage software

    diskover is file system crawler and disk space usage software that uses Elasticsearch to index your file metadata. diskover crawls and indexes your files on a local computer or on a remote storage server over network mounts. It helps manage your storage by identifying old and unused files and gives better insight into data change ("hotfiles"), file duplication ("dupes"), and wasted space. It is designed to help manage large amounts of data growth and provide detailed storage analytics. ...
    Downloads: 0 This Week
  • 19
    Tushare

    TuShare is a utility for crawling historical data of China stocks

    Tushare is a Python library that provides access to a wide range of financial data focused on the Chinese stock market. It allows users to retrieve real-time and historical market data, financial reports, index data, and macroeconomic indicators. Tushare is widely used in quantitative trading, data analysis, and academic research. It supports both free and premium data tiers via Tushare Pro, which requires an API token.
    Downloads: 0 This Week
  • 20
    2DMED

    Phase Sensitive Detection and Modulated Enhanced Diffraction Software

    2DMED is the second of two programs that implement Phase Sensitive Detection (PSD) and/or Modulated Enhanced Diffraction (MED) techniques on in situ data. The software takes normalized and averaged in situ data in flat file format (FLT) and performs the PSD transformation described in Urakawa, A. et al. Chem. Eng. Science 2008, 63, 4902. The user can choose the demodulation index k and perform several demodulation calculations. This is extremely important, since demodulation calculations at k = 2 on in situ powder diffraction data represent an implementation of the MED method, which has been described in Chernyshov, D. et al. ...
    Downloads: 0 This Week
  • 21
    ImpraStorage

    ImpraStorage provides private IMAP access to store large files

    ...ImpraStorage automatically downloads the parts, then merges them in the appropriate way to rebuild the original file. Conversely, a file to upload is split into several parts (with noise data added), and ImpraStorage randomly uploads each part, then updates the index.
    Downloads: 0 This Week
  • 22
    Meresco is both an OAI Data Provider and a Service Provider. SourceForge is only used to host the source control (subversion). Sources: http://sources.meresco.org/ Binaries: http://repository.cq2.org/ Mail: http://groups.google.com/group/meresco
    Downloads: 0 This Week
  • 23
    openBMD

    A cross-platform transcription application.

    openBMD is a multi-platform desktop application for the transcription of Birth, Marriage and Death (BMD) index records as part of the FreeBMD project to transcribe the Civil Registration index for England and Wales.
    Downloads: 0 This Week
  • 24

    ndexr

    Indexes directories and outputs to JSON

    A tool for use from the Python interactive interpreter or from a script. Use it to make JSON indexes of all files in a directory, or of files of a certain type. It could be used to observe activity in a server directory, to generate JSON to send to clients searching remotely, or to keep an index of files in a directory when the user of that directory does not have permission to probe the file system for files without knowing their names. Uses a configuration file, or optionally you can pass configuration...
    Downloads: 0 This Week
  • 25
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week