Search Results for "document search engine" - Page 2

Showing 121 open source projects for "document search engine"

  • 1
    Mini Agent

    A minimal yet professional single agent demo project

    ...The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during long, complex tasks. It includes a basic toolset for file-system operations and shell commands, plus integrations with MCP tools such as web search and knowledge graph access. Mini-Agent also comes with "Claude Skills"-style predefined skills for tasks like document processing, design work, and testing, packaged as reusable behaviors that can be invoked by the agent as needed.
    Downloads: 0 This Week
  • 2
    ChatGPT Academic

    ChatGPT extension for scientific research work

    ChatGPT extension for scientific research work, with an experience optimized for polishing academic papers. It supports custom shortcut buttons, custom function plug-ins, Markdown table display, dual display of TeX formulas, and complete code display. Newer functions include project-tree analysis and source-code self-translation for local Python/C++/Go projects, batch summarization of PDF and Word documents, and full-text translation of PDF papers. All buttons are...
    Downloads: 1 This Week
  • 3
    Mezzanine

    CMS framework for Django

    Mezzanine is a powerful open source content management platform built using the Django framework. In many ways it is like many other content management tools, offering an intuitive interface for managing all of your content. But Mezzanine is different in that it provides most of its functionality by default. While other platforms rely heavily on modules or reusable applications, Mezzanine comes ready with all the functionality you need, making it the more efficient choice. Mezzanine has a...
    Downloads: 1 This Week
  • 4
    Superduper

    Superduper: Integrate AI models and machine learning workflows

    Superduper is a Python-based framework for building end-to-end AI-data workflows and applications on your own data, integrating with major databases. It supports the latest technologies and techniques, including LLMs, vector search, RAG, and multimodality, as well as classical AI and ML paradigms. Developers can use Superduper by building compositional and declarative objects that outsource the details of deployment, orchestration, versioning, and more to the Superduper engine. This allows developers to completely avoid implementing MLOps, ETL pipelines, model deployment, data migration, and synchronization. ...
    Downloads: 1 This Week
  • 5
    FLAML

    A fast library for AutoML and tuning

    FLAML is a lightweight Python library that finds accurate machine learning models automatically, efficiently and economically. It frees users from selecting learners and hyperparameters for each learner. For common machine learning tasks like classification and regression, it quickly finds quality models for user-provided data with low computational resources. It supports both classical machine learning models and deep neural networks. It is easy to customize or extend. Users can find their...
    Downloads: 6 This Week
  • 6
    PySR

    High-Performance Symbolic Regression in Python and Julia

    ...Over a period of several years, PySR has been engineered from the ground up to be (1) as high-performance as possible, (2) as configurable as possible, and (3) easy to use. PySR is developed alongside the Julia library SymbolicRegression.jl, which forms the powerful search engine of PySR. The details of these algorithms are described in the PySR paper. Symbolic regression works best on low-dimensional datasets, but one can also extend these approaches to higher-dimensional spaces by using "Symbolic Distillation" of Neural Networks, as explained in 2006.11287, where we apply it to N-body problems. Here, one essentially uses symbolic regression to convert a neural net to an analytic equation. ...
    Downloads: 1 This Week
  • 7
    qBittorrent

    A free and reliable P2P BitTorrent client

    An advanced and multi-platform BitTorrent client with a nice Qt user interface as well as a Web UI for remote control and an integrated search engine. qBittorrent aims to meet the needs of most users while using as little CPU and memory as possible.
    Downloads: 445,878 This Week
  • 8
    sqlmap

    Automatic SQL injection and database takeover tool

    sqlmap is a powerful, feature-filled, open source penetration testing tool. It makes detecting and exploiting SQL injection flaws and taking over the database servers an automated process. sqlmap comes with a great range of features that along with its powerful detection engine make it the ultimate penetration tester. It offers full support for MySQL, Oracle, PostgreSQL, Microsoft SQL Server, Microsoft Access, IBM DB2, SQLite, Firebird, and many other database management systems. It also...
    Downloads: 21 This Week
  • 9
    Langdesk

    Windows application to search multiple PDFs and chat with them

    Langdesk is a desktop application for Windows that lets the user assemble a knowledge base from multiple PDF documents, retrieve information from them, and chat with the retrieved content. Currently in beta. Feel free to reach us with any request at info@tecnoesis.gr. We are currently seeking user scenarios and are also open to customizations, additions, and cooperation.
    Downloads: 1 This Week
  • 10
    The Sashimi project hosts the Trans-Proteomic Pipeline (TPP), a mature suite of tools for mass-spec (MS, MS/MS) based proteomics: statistical validation, quantitation, visualization, and converters from raw MS data to the open mzML/mzXML formats.
    Downloads: 39 This Week
  • 11
    KeyParaStocX

    Set styles to words and create a Table of Contents in a click

    KeyParaStocX (Keyword-based Paragraph Styling and Table of Contents eXtension) is a LibreOffice/Apache OpenOffice/OpenOffice.org extension that searches for the configured keywords in a text, changes their style and builds a Table of Contents for them, up to 7 levels. The keywords and their target styles can be configured by the users and used for every document they open. The extension integrates into Writer options and is independent of the operating system (should work on all). See...
    Downloads: 0 This Week
  • 12
    Create Index from PDF

    PDF Indexing Script: Searches PDF for words, records page numbers

    This Python script helps automate the process of creating an index for a PDF document. It reads a list of words from a text file, searches through each page of the PDF, and records the page numbers where each word appears. The script accounts for the first 24 pages of the PDF that use Roman numerals (i-xxiv) and adjusts the page numbers accordingly. It is designed to be case-insensitive, ensuring that variations in capitalization do not affect the search results.
    Downloads: 2 This Week
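
    The indexing approach described for this entry can be sketched as follows. This is not the project's code: it is a minimal illustration that assumes the page texts have already been extracted into a list of strings, and the names `ROMAN_PAGES`, `to_roman`, `page_label`, and `build_index` are hypothetical. Whole-word matching is also an assumption; the description only states that the search is case-insensitive.

```python
import re

# Assumption taken from the description: the first 24 pages are numbered i-xxiv.
ROMAN_PAGES = 24


def to_roman(n: int) -> str:
    """Convert a small positive integer (1-39) to a lowercase Roman numeral."""
    numerals = [(10, "x"), (9, "ix"), (5, "v"), (4, "iv"), (1, "i")]
    out = []
    for value, symbol in numerals:
        while n >= value:
            out.append(symbol)
            n -= value
    return "".join(out)


def page_label(physical_page: int) -> str:
    """Map a 1-based physical page number to the label printed on the page."""
    if physical_page <= ROMAN_PAGES:
        return to_roman(physical_page)
    return str(physical_page - ROMAN_PAGES)


def build_index(pages, words):
    """Return {word: [page labels]} using case-insensitive whole-word matching."""
    index = {}
    for word in words:
        pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
        index[word] = [
            page_label(number)
            for number, text in enumerate(pages, start=1)
            if pattern.search(text)
        ]
    return index
```

    Given a list of page texts and a word list, `build_index(pages, ["search"])` returns the printed page labels for each word, with front-matter pages reported as Roman numerals.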
  • 13
    4allDownloader Converter

    Video, audio, and file downloader and converter with a built-in browser and AI.

    ...Features a built-in browser with JavaScript injection support that remembers logins while maintaining complete privacy. Five powerful tabs streamline your workflow: Home tab for pasting URLs, channels, playlists, and direct search; Browser tab for seamless web navigation; Downloads tab with resume/cancel support and download history; Utilities tab offering lossless conversion for all file formats plus URL analysis tools; Settings tab for proxy configuration, search limits, download engine selection, and advanced customization. Experience lightning-fast downloads, universal format support, and privacy-focused browsing in one revolutionary application. ...
    Downloads: 6 This Week
  • 14
    Eva AI

    Eva is an A.I. assistant that helps users multi-task.

    ...It also has the purpose of helping people with disabilities use the computer with greater ease. Eva can open and close system-related and non-system-related applications, search content on web applications, set timers, and take screenshots. Tell Eva "Listen" or "Hey listen" followed by a command. For more instructions, check the instruction manual included in the application. [Update] * 🆕 Removed paged memory cleanup * 🆕 Re-added physical model switch-up * 🆕 Added automatic microphone audio level maximisation * 🆕 Re-calibrated the * 🐞 Re-added the wake word engine reset mechanism * 🐞 Fixed UI related issues regarding threading * 🐞 Fixed thread synchronisation bugs
    Downloads: 0 This Week
  • 15
    Django-CRM - Free Open Source Software

    Enterprise CRM software + Tasks management, Email marketing and more!

    Django CRM system is easy to use and can be run on a personal laptop for a private entrepreneur (for example) or on a cloud web server for a large enterprise. Written in Python, CRM is an open-source software package for managing customer interactions and sales. This free CRM software is a powerful tool designed to optimize workflows, and support data-driven decisions. Key features include role-based access control, intuitive navigation, powerful filtering, and search functionalities. The...
    Downloads: 9 This Week
  • 16
    Sweep AI

    Sweep: AI-powered Junior Developer for small features and bug fixes

    Let Sweep handle your tech debt so you can focus on the exciting problems. Sweep is an AI junior developer that transforms bug reports & feature requests into code changes. Describe bugs, small features, and refactors like you would to a junior developer and Sweep.
    Downloads: 0 This Week
  • 17
    garysfm

    An advanced file manager with QSS themes and ISO and folder previews

    ...The exe works on Windows 10 and 11 on 64-bit AMD and Intel systems; it is not for ARM systems. Version 1.2.4 adds a full theming engine with QSS support for advanced theming. To use themes other than the default, download gsfmt.zip and qss.zip and extract them to the directory where garysfm is located (version 1.2.3 and above for gsfmt, 1.2.4 and above for qss). Version 1.2.9 has cloud syncing, bookmarks, tags, and collections, and cleaner
    Downloads: 2 This Week
  • 18
    mindflow

    AI-powered CLI git wrapper, boilerplate code generator, chat history

    AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow. The ChatGPT-powered Swiss Army knife for the modern developer! We provide an AI-powered CLI git wrapper, boilerplate code generator, code search engine, a conversation history manager, and much more! Configure the model used for generating responses by running mf config and selecting either GPT 3.5 Turbo (default) or GPT 4. ...
    Downloads: 0 This Week
  • 19
    Prime QA

    State-of-the-art Multilingual Question Answering research

    PrimeQA is a public open source repository that enables researchers and developers to train state-of-the-art models for question answering (QA). By using PrimeQA, a researcher can replicate the experiments outlined in a paper published in the latest NLP conference while also enjoying the capability to download pre-trained models (from an online repository) and run them on their own custom data. PrimeQA is built on top of the Transformers toolkit and uses datasets and models that are directly...
    Downloads: 0 This Week
  • 20
    Image Downloader

    Download images from Google, Bing, Baidu

    Crawl and download images using Selenium, Python 3, and PyQt5. Supported search engines: Google, Bing, Baidu. Keywords can be entered from the keyboard or read from a line-separated keyword list file for batch processing. Images are downloaded using a customizable number of threads. Conditional search is fully supported (e.g. filetype:, site:). Includes a switch for Google safe mode and proxy configuration (SOCKS, HTTP). Both command-line and GUI interfaces are provided.
    Downloads: 0 This Week
  • 21
    Alphafold

    Open source code for AlphaFold

    ...The total download size for the full databases is around 415 GB and the total size when unzipped is 2.2 TB. Please make sure you have enough hard drive space, bandwidth, and time to download. We recommend using an SSD for better genetic search performance.
    Downloads: 0 This Week
  • 22
    cerche

    Experimental search engine for conversational AI such as parl.ai

    This is an experimental search engine for conversational AI such as parl.ai, large language models such as OpenAI GPT3, and humans (maybe).
    Downloads: 0 This Week
  • 23
    searx

    Privacy-respecting metasearch engine

    Searx is a free and open source internet metasearch engine that respects your privacy. It aggregates results from over 70 search services, providing accurate, relevant results with absolutely no user tracking or profiling going on. It's like having your very own private search engine. It has secure, encrypted connections and does not use cookies by default, giving you the assurance that your searches are 100% private and secure.
    Downloads: 24 This Week
  • 24
    Webifier

    A GitHub Action to deploy Notebooks, Markdowns

    Webifier is a stand-alone build tool for converting any repository into a deployable Jekyll website. You can define your pages via YAML files and provide notebooks, Markdown, PDF, and other files for Webifier to render. It uses Python-Markdown, providing additional control over attributes and other extensive functionalities. It lets you define and direct how your web pages feel and automatically manages your assets, making it a perfect solution for fast static website development and a...
    Downloads: 0 This Week
  • 25
    Rank-BM25

    A Collection of BM25 Algorithms in Python

    A collection of algorithms for querying a set of documents and returning the ones most relevant to the query. The most common use case for these algorithms is, as you might have guessed, to create search engines.
    Downloads: 0 This Week
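
    To illustrate what ranking documents against a query with BM25 involves, here is a minimal pure-Python sketch of Okapi BM25 scoring. It is not the Rank-BM25 library's code or API: the function name `bm25_scores` is hypothetical, the defaults `k1=1.5` and `b=0.75` are common choices rather than values taken from the listing, and the IDF term uses the non-negative `log(1 + ...)` variant. It assumes a non-empty corpus of pre-tokenized documents.

```python
import math
from collections import Counter


def bm25_scores(corpus, query, k1=1.5, b=0.75):
    """Score each tokenized document in `corpus` against the tokenized `query`."""
    n_docs = len(corpus)
    avg_len = sum(len(doc) for doc in corpus) / n_docs

    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(doc))

    scores = []
    for doc in corpus:
        term_freq = Counter(doc)
        score = 0.0
        for term in query:
            tf = term_freq.get(term, 0)
            if tf == 0:
                continue
            # Rare terms get a higher weight; the +1 keeps the weight non-negative.
            idf = math.log((n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5) + 1.0)
            # Term frequency saturates with k1 and is normalized by document length via b.
            length_norm = k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf * (k1 + 1) / (tf + length_norm)
        scores.append(score)
    return scores
```

    Sorting document indices by descending score gives the ranking; documents sharing no term with the query score zero.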