Showing 51 open source projects for "collections"

View related business solutions
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Postman MCP Server

    Postman MCP Server

    An MCP server that provides access to Postman

    The Postman MCP Server is a TypeScript-based Model Context Protocol (MCP) server that integrates with the Postman API, providing comprehensive management of Postman collections, environments, etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    QMD

    QMD

    mini cli search engine for your docs, knowledge bases, etc.

    QMD is a powerful and lightweight command-line tool that acts as an on-device search engine for your personal knowledge base, allowing you to index and search files like Markdown notes, meeting transcripts, technical documentation, and other text collections without depending on cloud services. Designed to keep all search activity local, it combines classic full-text search techniques with modern semantic features such as vector similarity and hybrid ranking so that queries return not just literal matches but conceptually relevant results. Users can organize content into named collections, embed documents for semantic retrieval, and then perform keyword searches, semantic searches, or hybrid natural-language queries to quickly surface the most useful information across all indexed sources. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    PaperAI

    PaperAI

    Semantic search and workflows for medical/scientific papers

    PaperAI is an open-source framework for searching and analyzing scientific papers, particularly useful for researchers looking to extract insights from large-scale document collections.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    AI Notes

    AI Notes

    Curated AI engineering notes on LLMs, generative models, and tools

    ...It functions as a living knowledge base composed of numerous markdown files that organize topics such as text generation, image generation, AI infrastructure, and code generation models. These notes include observations, references, experiments, and summaries of important research and industry developments in AI. ai-notes also contains collections of prompts, curated learning materials, and categorized resources intended to help developers explore AI capabilities and practical applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    COLMAP

    COLMAP

    Structure-from-Motion and Multi-View Stereo

    COLMAP is a general-purpose Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline with a graphical and command-line interface. It offers a wide range of features for the reconstruction of ordered and unordered image collections. The software is licensed under the new BSD license.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 6
    PapersGPT

    PapersGPT

    A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.1, Claude

    PapersGPT is an AI-powered plugin that integrates directly into Zotero to transform how researchers interact with academic papers and literature collections. It enables users to chat with individual PDFs or entire collections, allowing them to extract insights, generate summaries, and explore connections between documents without leaving the Zotero environment. The plugin supports a wide range of state-of-the-art language models, including GPT, Claude, Gemini, and open-source alternatives, giving users flexibility in choosing performance, cost, and privacy trade-offs. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    ...It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ai-renamer

    ai-renamer

    A Node.js CLI that uses Ollama and LM Studio models

    ...It integrates with local and cloud-based AI providers such as Ollama, LM Studio, and OpenAI, allowing users to choose between offline and API-based workflows depending on their needs. The tool supports batch processing, making it particularly useful for organizing large collections of files quickly and efficiently. It also provides customization options such as naming conventions, language preferences, and prompt modifications to tailor the output to specific use cases. By leveraging AI for semantic understanding, it significantly reduces the time spent on manual file organization and improves discoverability.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Semantra

    Semantra

    Multi-tool for semantic search

    Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 10
    MetaScreener

    MetaScreener

    AI-powered tool for efficient abstract and PDF screening

    MetaScreener is an open-source AI-assisted tool designed to streamline the screening process in systematic literature reviews and academic research workflows. The system helps researchers analyze large collections of academic abstracts and research papers to determine which studies are relevant for inclusion in evidence synthesis projects. Instead of manually reviewing hundreds or thousands of documents, researchers can use MetaScreener to apply machine learning techniques that assist with classification and prioritization of candidate papers. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DocETL

    DocETL

    A system for agentic LLM-powered data processing and ETL

    DocETL is an open-source system designed to build and execute data processing pipelines powered by large language models, particularly for analyzing complex collections of documents and unstructured datasets. The platform allows developers and researchers to construct structured workflows that extract, transform, and organize information from sources such as reports, transcripts, legal documents, and other text-heavy data. Instead of relying on single prompts or ad-hoc scripts, DocETL provides a declarative pipeline framework that breaks complex document analysis tasks into manageable operations that can be optimized and orchestrated automatically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    NVIDIA NeMo Framework

    NVIDIA NeMo Framework

    Scalable generative AI framework built for researchers and developers

    NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system, replacing YAML with more flexible, programmable configs that can be versioned and composed for different experiments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Director

    Director

    AI video agents framework for next-gen video interactions

    Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ViMax

    ViMax

    Director, Screenwriter, Producer, and Video Generator All-in-One

    ViMax is an open-source framework for performing large-scale multi-modal vision-language modeling and reasoning by combining powerful image encoders with advanced language models to solve complex visual tasks. It integrates components like visual encoders, cross-modal fusion techniques, and reasoning modules so that users can go beyond simple captioning or classification to perform tasks such as visual question answering, multi-image inference, and structured scene understanding. ViMax’s...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    WeKnora

    WeKnora

    LLM framework for document understanding and semantic retrieval

    WeKnora is an open source framework developed for deep document understanding and semantic information retrieval using large language models. It focuses on analyzing complex and heterogeneous documents by combining multiple processing stages such as multimodal document parsing, vector indexing, and intelligent retrieval. It follows the Retrieval-Augmented Generation (RAG) paradigm, where relevant document segments are retrieved and used by language models to generate accurate, context-aware...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16
    GBrain

    GBrain

    Garry's Opinionated OpenClaw/Hermes Agent Brain

    GBrain is an open-source AI memory system designed to give autonomous agents persistent, structured, and scalable long-term memory across interactions and workflows. It operates by transforming large collections of markdown documents, personal notes, and external data into a searchable knowledge base backed by PostgreSQL and vector embeddings, enabling both semantic and keyword-based retrieval. The system is tightly integrated with agent frameworks such as OpenClaw and Hermes, allowing AI agents to read from and write to memory continuously, effectively evolving their understanding over time. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    SemTools

    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead. SemTools can parse documents, build semantic embeddings, and perform similarity searches across datasets, making it useful for research, knowledge management, and AI-assisted coding workflows. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Sinas

    Sinas

    Open-source platform for building AI agents and serverless automation

    ...The platform supports isolated container execution for functions, which helps separate automation logic from the rest of the system. It also includes reusable skills, state stores, document collections, database connections, and embeddable UI components. Sinas can be managed through a web console or declarative YAML configuration, making it suitable for both interactive administration and GitOps-style workflows. Its main value is combining agents, functions, permissions, storage, and automation into one self-hosted AI application platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    NVIDIA NeMo

    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    AliceVision

    AliceVision

    3D Computer Vision Framework

    AliceVision is an open-source photogrammetric computer vision framework designed to reconstruct detailed 3D scenes and camera motion from collections of images or videos. It provides a complete pipeline for structure-from-motion (SfM), multi-view stereo (MVS), and mesh generation, allowing users to convert 2D imagery into accurate 3D models. The framework is built with a strong emphasis on research-grade algorithms while maintaining the robustness required for production environments, making it suitable for industries such as visual effects, cultural heritage preservation, and robotics. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    AgentGuide

    AgentGuide

    AI Agent Development Guide, LangGraph in Action, Advanced RAG

    AgentGuide is an open-source learning resource designed to provide a structured pathway for understanding and building AI agents. The project aggregates tutorials, research papers, frameworks, and practical resources related to agent development with large language models. Instead of presenting scattered resources, the repository organizes them into a systematic learning roadmap that guides learners from foundational concepts to advanced AI agent systems. The guide covers topics such as...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LLM-Aided OCR Project

    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    In-The-Wild Jailbreak Prompts on LLMs

    In-The-Wild Jailbreak Prompts on LLMs

    A dataset consists of 15,140 ChatGPT prompts from Reddit

    ...Researchers analyze these prompts to identify patterns, attack strategies, and techniques commonly used to trick language models into producing restricted or harmful outputs. The dataset includes thousands of prompts collected across multiple platforms and represents one of the largest collections of jailbreak attempts available for research.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    OpenHome Abilities

    OpenHome Abilities

    Open-source abilities for OpenHome agents

    OpenHome Abilities is an open-source repository of modular voice AI plugins created for OpenHome agents, giving developers a lightweight way to extend what an agent can do through spoken triggers. Each ability is intentionally simple in structure, centering on a single main.py file that contains the core Python logic, which lowers the barrier to building and sharing custom behaviors. The system is meant to support a wide range of voice-driven actions, from API calls and media playback to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SimpleMem

    SimpleMem

    SimpleMem: Efficient Lifelong Memory for LLM Agents

    ...Unlike monolithic systems where memory management is ad-hoc, SimpleMem formalizes a memory lifecycle—write, index, retrieve, refine—so applications can handle user history, document collections, or dynamic contextual state systematically. It supports customizable embedding models, efficient vector indexes, and relevance weighting, making it practical for building assistants, personal agents, or domain-specific retrieval systems that need persistent knowledge.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo