Showing 107 open source projects for "web indexing"

  • 1
    Zinc Search Engine

    ZincSearch: a lightweight alternative to Elasticsearch

    Zinc is a search engine that does full-text indexing. It is a lightweight alternative to Elasticsearch and runs using a fraction of the resources. It uses bluge as the underlying indexing library. It is simple and easy to operate, unlike Elasticsearch, which requires understanding and tuning a couple dozen knobs; Zinc can be up and running in 2 minutes. You can index large amounts of text in Zinc and search it instantly. Zinc is a batteries-included setup. Embedded UI provides an easy...
    Downloads: 7 This Week
  • 2
    ACHE Focused Crawler

    ACHE is a web crawler for domain-specific search

    ACHE is a focused web crawler. It collects web pages that satisfy some specific criteria, e.g., pages that belong to a given domain or that contain a user-specified pattern. ACHE differs from generic crawlers in the sense that it uses page classifiers to distinguish between relevant and irrelevant pages in a given domain. A page classifier can be defined as a simple regular expression (e.g., one that matches every page that contains a specific word) or a machine-learning-based classification model...
    Downloads: 6 This Week
  • 3
    Just the Class

    A modern, highly customizable, responsive Jekyll template

    A modern, highly customizable, responsive Jekyll template for course websites. Just the Class is a GitHub Pages template developed for the purpose of quickly deploying course websites. In addition to serving plain web pages and files, it provides a boilerplate for announcements, course calendar, etc. Just the Class is a template that extends the popular Just the Docs theme, which provides a robust and thoroughly-tested foundation for your website.
    Downloads: 0 This Week
  • 4
    RBush

    High-performance JavaScript R-tree-based 2D spatial index

    RBush is a high-performance JavaScript library for 2D spatial indexing of points and rectangles. It's based on an optimized R-tree data structure with bulk insertion support. A spatial index is a special data structure for points and rectangles that allows you to perform queries like "all items within this bounding box" very efficiently (e.g. hundreds of times faster than looping over all items). It's most commonly used in maps and data visualizations. The demos contain visualization of trees...
    Downloads: 0 This Week
  • 5
    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM is an electronic document and records management system (EDRMS; DMS, RMS, CMS). It is a powerful, scalable, multiplatform application with a modern, flexible architecture that meets today's IT demands, based on open technology (Java, Tomcat, GWT, Lucene, Hibernate, Spring and jBPM). OpenKM is a Web 2.0 application that works with Internet Explorer, Firefox, Safari and Opera. It can be configured with major DBMSs such as Oracle, PostgreSQL and MySQL, among others...
    Downloads: 961 This Week
  • 6
    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Downloads: 203 This Week
  • 7
    Koha + DSpace 8.0 Live DVD ISO (2024.06)

    Koha 24.05.01 + DSpace 8.0 Live ISO Installer based on Ubuntu 22.04.4

    Koha + DSpace Live is a live bootable and installer ISO based on Ubuntu 22.04.4 (Koha version: 24.05.01; DSpace version: 8.0). This ISO boots only in Legacy BIOS mode and not in Secure Boot mode. After booting from the Live DVD/USB, use the following login credentials: Login: library (displayed as 'Open Digital Library'); Password: library. This Live ISO contains additional desktop environments (D.E.) providing different user interfaces. If you wish to choose a Lightweight...
    Downloads: 64 This Week
  • 8
    SLiMS Library Management System

    Free & Open Community Edition Server in a Complete Virtual Machine

    This VM was created for two reasons: 1. Very little initial setup work is required to make a Library Management System live, within minutes. 2. The system should keep running for years without requiring updates or suffering breakages. If you are new to virtual machines, please watch the video below (taken from my other project; just replace td with lm wherever mentioned). After starting this VM, please access these websites (just accept any warnings): Public Website Address:...
    Downloads: 19 This Week
  • 9
    Moodle, DSpace 7, Koha, Calibre Live ISO

    Moodle 4.2 + DSpace 7.6 + Koha 23.11 Live ISO based on Ubuntu 22.04.3

    Moodle, DSpace, Koha, Calibre-Web is a live bootable and installer ISO based on Ubuntu 22.04.3. Moodle version: 4.2.1; DSpace version: 7.6; Koha version: 23.11.01; Calibre version: 6.26. This ISO boots only in Legacy BIOS mode and not in Secure Boot mode. After booting from the Live DVD/USB, use the following login credentials: Login: library (displayed as 'Open Learning Platform'); Password: admin123. This Live ISO contains additional desktop environments (D.E.) providing...
    Downloads: 7 This Week
  • 10
    eXist-db

    eXist-db is a feature-rich open source native XML database

    eXist-db is a native XML database featuring efficient, index-based XQuery processing, extensions for keyword search, XUpdate support, XSLT support, XForms support, REST and tight integration with existing XML development tools. The project has moved to GitHub: https://www.github.com/exist-db/exist
    Downloads: 7 This Week
  • 11

    Apache Allura

    Forge software for hosting software projects

    Apache Allura's website and resources are all at https://allura.apache.org/ Allura was originally developed here, which is why this project exists.
    Downloads: 0 This Week
  • 12
    ReconSpider

    Most Advanced Open Source Intelligence (OSINT) Framework

    ... the capabilities of Wave, Photon and Recon Dog to do a comprehensive enumeration of attack surfaces. Reconnaissance is a mission to obtain information by various detection methods, about the activities and resources of an enemy or potential enemy, or geographic characteristics of a particular area. A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering).
    Downloads: 11 This Week
  • 13
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have...
    Downloads: 1 This Week
  • 14
    Records Management System

    Save a digital copy of your personal or business records

    Your personal and business records are considered private documents. You should avoid using cloud providers, even Google Drive! Records Management System is a localized data store that uses SQLite and integrates with any connected scanner through the TWAIN toolkit (a license may be needed). RMS is based on the book Filing Systems and Records Management (College series), 3rd Edition, available on Amazon at https://www.amazon.com/dp/0070614717
    Downloads: 10 This Week
  • 15
    YaCy Peer-to-Peer Search Engine

    Decentralized Web Search Engine

    YaCy is a free search engine that anyone can use to search the internet (www and ftp) or to create a search portal for others (internet or intranet). The scale of YaCy is limited only by the number of users, and it can index billions of web pages. In p2p mode it is fully decentralized: all users of the search engine network are equal, and it is not possible for anyone to censor the content of the distributed index.
    Downloads: 0 This Week
  • 16
    hordes

    WordPress plugin that curates a list of links with titles, icons and categories.

    ... for a title or a link or any of the tags you can do so using the search box or Horde's convenient alphabetic search tool. It works like an indexing system that shows you everything under "A", for example. Store all this information with quick-search reminder tags, such as the name of the site or the subject for which you wanted to remember the site. Demo at https://cginvoice.com/info/hordes-list/
    Downloads: 0 This Week
  • 17
    Lens

    Lens is an opt-in search engine and data collection tool

    Lens is an opt-in search engine and data collection tool to aid content discovery on the distributed web. It exposes a simple, minimal API for intelligently indexing and querying content on IPFS. Initially integrated with Temporal, Lens will allow users to optionally have the data they upload indexed and made searchable, and to be awarded RTC for participating in the data collection process. Users can then search for content using a simple-to-use API. Searching through Lens will be facilitated...
    Downloads: 0 This Week
  • 18
    Tera

    An Internet-scale database

    Tera is a high-performance distributed NoSQL database, inspired by Google's Bigtable and designed for real-time applications. Tera can easily scale to petabytes of data across thousands of commodity servers. Tera is also widely used in many Baidu products with varied demands, ranging from throughput-oriented applications to latency-sensitive services, including web indexing, WebPage DB, LinkBase DB, etc. Tera is the collection of many sparse, distributed, multidimensional...
    Downloads: 0 This Week
  • 19

    owl-indexer

    Full-text index generator of OWL literals

    owl-indexer generates a full-text index of OWL literals using either Apache Lucene or Sphinx Search. It is based on the OWL API (https://github.com/owlcs/owlapi).
    Downloads: 0 This Week
  • 20
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    OpenSearchServer is a powerful, enterprise-class search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API, Ruby, Rails, Node.js, PHP, Perl), you will be able to quickly and easily integrate advanced full-text search capabilities into your application: full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows...
    Downloads: 11 This Week
  • 21
    SSEP - Site Search Engine PHP-Ajax

    A free site search engine script built with PHP and Ajax.

    A site search engine script that uses MySQL to store your website's indexed pages, adding search functionality to your web site. It is built with PHP and JavaScript, and the search results are loaded via Ajax. The search system combines MySQL full-text search with SQL regexp, and weights words according to their location in the HTML elements, to determine the relevance of the search results. It can be included in any web site.
    Downloads: 0 This Week
  • 22
    jspforum-simple

    A simplified forum application based on Java EE: struts+spring+ibatis

    A simplified forum application based on full, professional Java EE technology: struts+spring+ibatis (ssb) / hibernate (ssh). It is implemented as a prototype of a web forum / BBS and provides all of its functionality. Note: the WAR file can be imported directly into Eclipse or other Java EE IDEs, and it also includes all the source code. IMPORTANT: when launching the app on the server, please reconfigure the fields in log4j.properties
    Downloads: 0 This Week
  • 23

    mds-utils

    General purpose utilities for C++ and Python developers

    ...++ classes that help in treating Python file objects as C++ streams. 6. a review and refactor of the indexing support in Python extensions. Access in write mode is now supported too. More details are in the Doxygen documentation. Documentation is available through doxygen. Once downloaded and uncompressed, issue the "doxygen" command from the root folder. The documentation will be generated in "doc/html". An online version of this documentation is available at the link below (mds-utils Web Site).
    Downloads: 0 This Week
  • 24
    geog-server-embedded

    GeoG Embedded Server

    GeoG Embedded Server with GeoG's Own Database.
    Downloads: 0 This Week
  • 25
    IndexFile (IFile)

    IFile, a PHP-based framework for indexing and searching documents

    ...); OpenOffice.org Calc (.ods); Adobe Portable Document Format (.pdf); Text file (.txt); Web page (.htm - .html)
    Downloads: 1 This Week