75 projects for "index data" with 1 filter applied:

  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    Index

    Index

    The SOTA Open-Source Browser Agent

    ...The project is built to integrate easily with applications through a simple programming interface, allowing developers to embed browser automation capabilities directly into their software systems. Index can perform tasks such as navigating pages, filling forms, collecting data, and analyzing web content without requiring manual scripting for each website.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    LlamaIndex

    LlamaIndex

    Central interface to connect your LLM's with external data

    LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. LlamaIndex is a simple, flexible interface between your external data and LLMs. It provides the following tools in an easy-to-use fashion. Provides indices over your unstructured and structured data for use with LLM's. These indices help to abstract away common boilerplate and pain points for in-context learning.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    LEANN

    LEANN

    Local RAG engine for private multimodal knowledge search on devices

    LEANN is an open source system designed to enable retrieval-augmented generation (RAG) and semantic search across personal data while running entirely on local devices. It focuses on dramatically reducing the storage overhead typically required for vector search and embedding indexes, enabling efficient large-scale knowledge retrieval on consumer hardware. LEANN introduces a storage-efficient approximate nearest neighbor index combined with on-the-fly embedding recomputation to avoid storing large embedding vectors. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    tinysearch

    tinysearch

    Tiny, full-text search engine for static websites built with Rust

    tinysearch is a lightweight full-text search engine designed specifically for static websites, offering fast and efficient search capabilities without requiring a backend server. It is written in Rust and compiled to WebAssembly, allowing it to run entirely in the browser while maintaining a very small footprint. The engine uses compact data structures such as XOR filters to efficiently index and query text, significantly reducing memory usage compared to traditional search libraries. TinySearch is particularly well-suited for blogs and documentation sites generated by static site generators, where simplicity and performance are critical. It processes content into a serialized binary index that can be loaded quickly and searched client-side, eliminating the need for external search services. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Obsidian Visual Skills Pack

    Obsidian Visual Skills Pack

    Generate Canvas, Excalidraw, and Mermaid diagrams from text

    LLM-TLDR is a Python-based tool designed to dramatically reduce the amount of code a large language model needs to read by extracting the essential structure and context from a codebase and presenting only the most relevant parts to the model. Traditional approaches often dump entire files into a model’s context, which quickly exceeds token limits; LLM-TLDR instead indexes project structure, traces dependencies, and summarizes code in a way that preserves semantic relevance while shrinking...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Rapid YAML

    Rapid YAML

    Library to parse and emit YAML, and do it fast

    ...(If you are looking to use your programs with a YAML tree as a configuration tree with override facilities, take a look at c4conf). ryml parses both read-only and in-situ source buffers; the resulting data nodes hold only views to sub-ranges of the source buffer. No string copies or duplications are done, and no virtual functions are used. The data tree is a flat index-based structure stored in a single array. Serialization happens only at your direct request, after parsing / before emitting. Internally, the data tree representation stores only string views and has no knowledge of types, but of course, every node can have a YAML type tag. ryml makes it easy and fast to read and modify the data tree.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    Spider

    Spider

    High-performance Rust web crawler and scraper for large-scale data

    ...Spider also provides mechanisms for subscribing to crawl events so developers can process page data such as URLs, status codes, or HTML content as it is discovered. It supports advanced capabilities such as headless browser rendering, background crawling tasks, and configurable rules that control crawl depth or ignored paths. These capabilities make the project suitable for building search indexers, data extraction pipelines, & SEO analysis tools.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    YourInfo

    YourInfo

    Real-time browser fingerprinting demo with cross-browser tracking

    YourInfo is a personal information management tool designed to let users securely store, structure, and retrieve their key data — such as contacts, credentials, personal notes, and preferences — while also enabling AI-assisted queries or reminders using that data. The platform prioritizes privacy by focusing on local storage or user-controlled databases, ensuring sensitive data stays under the user’s control rather than in third-party servers. Users can define types of information, tag...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OP Vault

    OP Vault

    Give ChatGPT long-term memory using the OP Stack

    ...It combines a backend written in Go with a React frontend, allowing users to upload files such as PDFs, text documents, and books to create a searchable repository of information. The system uses vector databases like Pinecone alongside OpenAI models to index and retrieve relevant content, enabling precise question-answering grounded in the uploaded materials. Users can query the system in natural language and receive answers that include references to specific files and sections, improving transparency and trust in the responses. The project is designed to handle large volumes of data, making it suitable for personal knowledge management, research archives, or enterprise documentation systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Public APIs

    Public APIs

    A collective list of free APIs

    public-apis is a collaboratively maintained repository that provides an extensive, categorized list of publicly available APIs for developers. Curated by community contributors and the team at APILayer, it serves as a centralized resource for discovering APIs across a wide range of domains, including data, machine learning, weather, entertainment, and finance. The project aims to make API exploration and integration more accessible by offering a single, organized index of open and free-to-use APIs. Developers can leverage this list to enhance their products, prototypes, or research projects without the need to build data sources from scratch. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    Anomaly Detection Learning Resources

    Anomaly Detection Learning Resources

    Anomaly detection related books, papers, videos, and toolboxes

    Anomaly Detection Learning Resources is a curated open-source repository that collects educational materials, tools, and academic references related to anomaly detection and outlier analysis in data science. The project serves as a centralized index for researchers and practitioners who want to explore algorithms, datasets, and publications associated with detecting unusual patterns in data. The repository organizes resources into structured categories such as books, tutorials, academic papers, datasets, benchmark frameworks, and open-source toolkits. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    OpenArchiver

    OpenArchiver

    An open-source platform for legally compliant email archiving

    OpenArchiver is a comprehensive, self-hosted email archiving and compliance platform built to help organizations ingest, index, store, and search email communication data across diverse sources like Gmail, Microsoft 365, IMAP, PST, and more. It’s designed for scenarios where reliable, tamper-proof archiving and full-text search across both emails and attachments are essential for legal discovery, compliance, or long-term records retention. The platform combines a modern web UI with powerful backend services, including fast indexing, deduplication, encryption at rest, and asynchronous ingestion workflows, making it suitable for both small teams and enterprise deployments. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    Kernel Memory

    Kernel Memory

    Research project. A Memory solution for users, teams, and applications

    ...Applications can then query these indexed data sources to retrieve relevant information and include it as context for AI responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TONL

    TONL

    TONL (Token-Optimized Notation Language)

    TONL is a cutting-edge data platform built around a production-ready serialization format designed to be both compact and powerful, combining human readability with performance features that make it suitable for large-scale applications and AI workflows. It provides a serialization format that significantly reduces token usage compared with traditional JSON, which can result in lower costs and more efficient prompt size utilization in LLM-driven systems. TONL isn’t just a format — it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Engram

    Engram

    A New Axis of Sparsity for Large Language Models

    Engram is a high-performance embedding and similarity search library focused on making retrieval-augmented workflows efficient, scalable, and easy to adopt by developers building search, recommendation, or semantic matching systems. It provides utilities to generate embeddings from text or other structured data, index them using efficient approximate nearest neighbor algorithms, and perform real-time similarity queries even on large corpora. Engineered with speed and memory efficiency in mind, Engram supports batched indexing, incremental updates, and custom distance metrics so developers can tailor search behaviors to their domain’s needs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    clip-retrieval

    clip-retrieval

    Easily compute clip embeddings and build a clip retrieval system

    clip-retrieval is an open-source toolkit designed to build large-scale semantic search systems for images and text by leveraging CLIP embeddings to enable multimodal retrieval. It allows developers to compute embeddings for both images and text efficiently and then index them for fast similarity search across massive datasets. The system is optimized for performance and scalability, capable of processing tens or even hundreds of millions of embeddings using GPU acceleration. It includes...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    type-fest

    type-fest

    A collection of essential TypeScript types

    type-fest is a TypeScript utility types library that offers a curated, battle-tested suite of type definitions and type transformations that aren’t included in the TypeScript standard library. It provides types like Except, Merge, LiteralUnion, Writable, Promisable, PartialDeep, JsonObject, and many others that solve everyday typing needs in complex TypeScript codebases. Developers pull in just the types they need, which makes code more expressive and safer without reinventing tricky type...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    CS-Books

    CS-Books

    Collection of computer science textbooks, learning materials

    CS-Books is a massive curated collection of computer science textbooks, learning materials, and resource links that covers a wide range of topics from programming languages like C/C++ and Python to core subjects such as data structures, algorithms, operating systems, databases, networks, and design patterns. The repository aggregates over a thousand classic reference books and educational resources into a single index, making it a valuable starting point for self-learners, students preparing for technical interviews, and professionals deepening their knowledge across different CS domains. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The Hypersim Dataset

    The Hypersim Dataset

    Photorealistic Synthetic Dataset for Holistic Indoor Scene

    ...The dataset spans diverse furniture layouts, room types, and camera trajectories, enabling robust training for geometry, segmentation, and SLAM-adjacent tasks. Rendering pipelines and utilities allow researchers to reproduce sequences, generate novel views, or extract task-specific supervision. Because the data are perfectly labeled and controllable, Hypersim is well suited for pretraining and for studying domain transfer to real imagery. The repository acts as both a dataset index and a set of scripts for downloading, managing, and evaluating on standardized splits.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    gonic

    gonic

    music streaming server / free-software subsonic server API

    ...Its modular configuration allows users to customize transcoding, storage paths, and playback behavior. Overall, it provides a flexible and efficient alternative to commercial music streaming services with full control over data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Software Engineering Blogs

    Software Engineering Blogs

    A curated list of engineering blogs

    Software Engineering Blogs is a curated collection of engineering-focused blog posts and resources aggregated from leading technology companies and developers. The repository serves as a centralized index that helps users discover high-quality technical content across a wide range of topics, including software engineering, system design, infrastructure, and data engineering. It is organized in a structured manner, making it easy to browse and find articles from specific companies or domains of interest. The project is continuously updated by the community, ensuring that the content remains relevant and reflective of current industry practices. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    KG-LLM-Papers

    KG-LLM-Papers

    Papers integrating knowledge graphs (KGs) and large language models

    KG-LLM-Papers is a curated academic resource that collects and organizes research papers exploring the intersection between knowledge graphs and large language models. The repository functions as a continuously updated index of scholarly work that investigates how structured knowledge representations can enhance the reasoning, factual accuracy, and interpretability of language models. It includes surveys, benchmark studies, and cutting-edge research that examine topics such as knowledge graph-guided prompting, retrieval-augmented generation, reasoning over structured data, and hybrid architectures combining symbolic and neural systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ZeusDB Vector Database

    ZeusDB Vector Database

    Blazing-fast vector DB with similarity search and metadata filtering

    ZeusDB is a vector database built for fast, scalable similarity search with strong production ergonomics. It combines high-performance approximate nearest neighbor indexes with clean APIs and metadata filtering so applications can retrieve semantically relevant items at low latency. The storage layer is designed for durability and growth, supporting sharding, replication, and background compaction while keeping query tails predictable. Developers get multiple ingestion paths—batch,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AI_Tutorial

    AI_Tutorial

    A selection of learning materials, search, recommendation, advertising

    AI_Tutorial is a large curated repository that aggregates high-quality learning resources related to artificial intelligence, machine learning, deep learning, natural language processing, and data engineering. The project functions as a centralized knowledge base designed to help engineers and researchers discover tutorials, technical articles, algorithm explanations, and architecture discussions from across the AI ecosystem. Rather than focusing on a single framework or course, the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Chordii

    Chordii

    Easy lead sheets from text input

    ChordPro creates elegant, stafless lead sheets for musicians needing only chords and lyrics. It processes plain text input in ChordPro format and it is a rewrite of the old though still popular Chord/Chordii programs.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB