Showing 16 open source projects for "document search"

View related business solutions
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 1
    Floki

    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Nokogiri

    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. "Native...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    XRender

    XRender

    Easy-to-use middle and back-end "form/table/chart" solution

    ..., and to bring the ultimate development experience with a series of peripheral products such as form editors, plug-ins, and custom components. On the road to develop 1.0, we made a series of trade-offs, see the 0.x - 1.0 migration document for details
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    QXmlEdit

    QXmlEdit

    Simple XML editor and XSD viewer

    QXmlEdit is a simple XML editor written in qt. Its main features are unusual data visualization modes, nice XML manipulation and presentation and it is multi platform. It can split very big XML files into fragments, compare XML and XSD files, and has a graphical XSD viewers. Project site: http://qxmledit.org Source code hosted at GitHub (moved from Google Code) https://github.com/lbellonda/qxmledit Report issues at: https://github.com/lbellonda/qxmledit/issues Discussion...
    Leader badge
    Downloads: 157 This Week
    Last Update:
    See Project
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
    Get started free
  • 5

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    SubEthaEdit

    SubEthaEdit

    General purpose plain text editor for macOS

    The app that started the collaborative editing revolution is back. Write articles, code, notes or meeting minutes with friends – wherever they are. Ideal for extreme programming sessions, tutoring, and creative writing. And now it is free and open source! Have as many windows open of one document as you like. Great for comparing multiple places, keeping a reference section open, or following others. Use your desired flavor of regex for the search. From simple shell to full power ruby style...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    cquery

    cquery

    C/C++ language server supporting multi-million line code base

    C/C++ language server supporting multi-million line code base, powered by libclang. Emacs, Vim, VSCode, and others with language server protocol support. Cross-references, completion, diagnostics, semantic highlighting, and more. cquery is a highly-scalable, low-latency language server for C/C++/Objective-C. It is tested and designed for large codebases like Chromium. cquery provides accurate and fast semantic analysis without interrupting workflow. cquery implements almost the entire...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Synder

    Synder

    Atom/RSS/SearchSuggestion/OPML processing framework

    Synder is an Atom/RSS/SearchSuggestion/OPML processing framework It is Optimized For Android, but works with Java SE/EE. Synder has a small-footprint, low memory consumption. The only dependency for parsing is a SAX2 implementation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • A new approach to fast data transfer | IBM Aspera Icon
    A new approach to fast data transfer | IBM Aspera

    For organizations interested in a file transfer and streaming solution

    IBM Aspera takes a different approach to tackling the challenges of big data movement over global WANs. Rather than optimize or accelerate data transfer, Aspera eliminates underlying bottlenecks by using a breakthrough transport technology that fully utilizes available network bandwidth to maximize speed and quickly scale up with no theoretical limit.
    Learn More
  • 10
    XmlView
    GUI utility in pure Java for viewing and editing XML content; example of application built with Superficial http://superficial.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PHP-Index uses a plain text file as an index for a efficient search on data. The index is a simple ordered list, so a binary search can be performed. Current implementation supports a XML document as database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Joomla! component to achieve a semantic extensión of the WCMS It converts the infrastructure of Joomla! into the semantically equivalent. It allows both manual and automatic semantic annotations as well as enhanced search capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any textual and web document. Mainly designed to work with HTML pages, it has also an optimization support for English or Italian documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Grep tool for OASIS Open Document Format (ODF) files. Search content and meta streams for regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DXIE is a Dynamic XML Instance Editor that provides dynamic GUI editing of XML documents. DXIE reads a Document (XML) and its Schema (XSD) and produces a dynamic UI for editing (search, add, edit, and delete) of the Document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next