Showing 39 open source projects for "file indexing"

View related business solutions
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • Shift, the browser that merges all of your web apps into one powerful window. Icon
    Shift, the browser that merges all of your web apps into one powerful window.

    Your power browser.

    Streamline everything you do online when you install Shift and access thousands of apps without leaving your browser. Connect all of your Gmail, Outlook, and Office 365 accounts and manage everything from one centralized window. Build out your Shift browser with apps that integrate seamlessly so you have ultra-fast access to all the tools you use to stream, shop, work, browse, and stay connected. Shift brings it all together.
    Try for Free
  • 1
    Apache Druid

    Apache Druid

    A high performance real-time analytics database

    Druid is designed for workflows where fast ad-hoc analytics, instant data visibility, or supporting high concurrency is important. As such, Druid is often used to power UIs where an interactive, consistent user experience is desired. Druid streams data from message buses such as Kafka, and Amazon Kinesis, and batch load files from data lakes such as HDFS, and Amazon S3. Druid supports most popular file formats for structured and semi-structured data. Druid has been benchmarked to greatly...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Apache Hudi

    Apache Hudi

    Upserts, Deletes And Incremental Processing on Big Data

    ... efficient upserts, by mapping a given hoodie key (record key + partition path) consistently to a file id, via an indexing mechanism. This mapping between record key and file group/file id, never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    File System Crawler for Elasticsearch

    File System Crawler for Elasticsearch

    Elasticsearch File System Crawler (FS Crawler)

    This crawler helps to index binary documents such as PDF, Open Office, MS Office. Local file system (or a mounted drive) crawling and indexing new files, updating existing ones, and removing old ones. Remote file system over SSH/FTP crawling. REST interface to let you “upload” your binary documents to elastic search.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    .... Due to its technological architecture design, OpenKM meets the document management needs of businesses of all sizes (from SMEs to big corporations). Thanks to its elegant and intuitive interface, OpenKM transforms complex operations into easy tasks. The most relevant functions of OpenKM is the indexing of the most common types of files: text, Office, Office 2007, OpenOffice, PDF, HTML, XML, MP3, JPEG, etc. For a complete feature list take a look at http://goo.gl/au8cQy
    Leader badge
    Downloads: 934 This Week
    Last Update:
    See Project
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 5
    Hypernomicon

    Hypernomicon

    Hypertext-infused philosophy personal database software

    Hypernomicon is a personal productivity/database application for researchers that combines structured note-taking, mind-mapping, management of files (e.g., PDFs) and folders, and reference management into an integrated environment that organizes all of the above into semantic networks or hierarchies in terms of debates, positions, arguments, labels, terminology/concepts, and user-defined keywords by means of database relations and automatically generated hyperlinks (hence ‘Hyper’ in the...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 6

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    eXist-db

    eXist-db

    eXist-db is a feature rich Open Source native XML database

    eXist-db is a native XML database featuring efficient, index-based XQuery processing, extensions for keyword search, XUpdate support, XSLT support, XForms support, REST and tight integration with existing XML development tools. Moved to Github - https://www.github.com/exist-db/exist
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Disk Indexer
    ... descending order of file sizes. And hence it can greatly help in identifying the files those can be deleted. You can list down files for multiple folders by running scan n number of times. All the history scans will be available for viewing unless user purges a scan. It can be useful for both kind of users - individual users and enterprise users(system admins) It can be scaled up for enterprise use. Check out wiki pages. Start with reading Readme.txt file in the downloaded package.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows...
    Downloads: 21 This Week
    Last Update:
    See Project
  • Free CRM Software With Something for Everyone Icon
    Free CRM Software With Something for Everyone

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
    Get free CRM
  • 10
    YouSeer is an open source search engine framework, which was built on top of other open source components. It’s part of the general SeerSuite framework. YouSeer utilizes Hereitrix as a crawler and solr as an indexing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    jspforum-simple

    jspforum-simple

    A simplified forum application based on Java EE: strust+spring+ibatis

    A simplified forum application based on full and professional Java EE technology: struts+spring+ibatis (ssb) / hibernate (ssh). Which be implemented as the prototype of web forum / BBS, and provided for all the functionalities. Note: the WAR file can be directly imported from Eclipse or other Java EE IDEs and it is also including all the source code inside. IMPORTANT: while launching the app in the server, please re-configure the fields in the files of log4j.properties
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Personalized Search Engine

    Personalized Search Engine for Your Files

    MySearchEngine (Personalized Search Engine) is a Java software to search files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary can...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    eLibrary

    Personalized Search Engine for Commonly Used Files

    eLibrary (electric library) is a Java software to search files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary can also extract...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    COAR-DMS

    COAR-DMS

    DMS for linux, C++ library, server, webUI , SOAP

    COAR-DMS is document management system for 32/64 bit. linux. Acts as library, server and tools. Library features: - storage management, free pages recycling - transaction log - indexing: full text, tags, metadata, document attributes - inverted index - versioning, collaboration - document trees, trees versionning - folders - plugins for auth (PAM,LDAP), db, file types plugins - tags - metadata (key value pairs) - object level security, folders documents ACL, - unix like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Omega Base

    Omega Base

    Web-based knowledge base template.

    A Knowledge Base and document management system (DMS). With strong user management, security, and file indexing for search.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    JOPS
    Java Objects Persistence Service (JOPS) is a simple and very useful library dedicated to developers who don't need all the power of traditional DBs (MySQL,Oracle.etc) but still need to be able to persist and manipulate their persisted objects easily
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MySQL to PostgreSQL

    MySQL to PostgreSQL

    Convert MySQL dumps into PostgreSQL dumps

    Given a simple MySQL dump file, converts into PostgreSQL format with some optimizations including not indexing until after data load and loading data via the PostgreSQL COPY command from CSV files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Lotus Notes Mail Query

    Lotus Notes Mail Query

    Notes Mail Query allows you to search arcoss multiple notes archives

    Many a time, we would remember a couple of words in our email archives but not the subjects.. and we would fumble around with multiple archives each taking so much time to open and then sift through the subjects... Notes Mail Query is written in java and can run in Mac, Linux and Windows helping you to index your email archives and help you find data from all the text fields in the emails. Double clicking on the results opens the mail in our native email client (Notes). Its is a basic...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    PatentX - EPOScan extra utilities

    EPOScan ext folder utilities

    This is a software to operate some functions over the "ext" folder created by EPOScan(European Patent Office software for indexing and scanning patent document images) when the downloading option is selected. This folder is usually used by the ST33 software to convert the indexed images into ST33 standard.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Personal Document Manager DMS
    Personal Document Manager is a tool for storing, indexing and finding documents. In short: a simple, small Document Management System. Userguide at Project Webpage. Currently I am looking for people willing to join the project. Needed are: - Java developers. - Testers - Help writers Especially developers working on windows would be needet to synchronise some window management things which are different on UX based systems and Windows. And of course.... working alone...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Linoratix Intranet Search (lIntraSearch) is an Desktopsearch alike application indexing the content of many office- and other files in a network so you can search, download and modify all the documents in your network from your computer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ANts P2P
    ANts P2P realizes a third generation P2P net. It protects your privacy while you are connected and makes you not trackable, hiding your identity (ip) and crypting everything you are sending/receiving from others.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    A fulltext indexing engine designed for xml files on a slow readonly medium.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Puggle
    Puggle is a graphical desktop search engine written exclusively in Java. It provides full text and metadata search over files, folders, music, photos, web pages and more that are stored locally on your computer.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next