Showing 29 open source projects for "html search engine"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    XML Editor/Validator/Designer with CAMV

    XML Editor/Validator/Designer with CAMV

    CAM XML Editor for XML+JSON+Hibernate+SQL Open-XDX sponsored by Oracle

    ...Do SQL data import/map+ Open Data API delivery+ Hibernate JPA + UI Forms/Pencil Makes XSD schema (default, flatten, NIEM, & OASIS modes) + JAXB bindings; Mindmap FreeMind or UML models(XMI); XML unit test & live SQL data; HTML docs + spreadsheets (NIEM IEPDs). Canonical component dictionaries from schema sets, SQL, JSON, ERwin XSD, or spreadsheets. The XML CAM templates (OASIS standard) store the exchange structure, content model, code lists, DBMappings, SQL lookups+business rules (XPath). Java CAMV XML/JSON validation engine is a complete exchange test framework [XMLUnit, TEAM(Schematron)]. ...
    Downloads: 82 This Week
    Last Update:
    See Project
  • 4
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    ...With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. ...
    Downloads: 107 This Week
    Last Update:
    See Project
  • 99.99% Uptime for MySQL and PostgreSQL Databases Icon
    99.99% Uptime for MySQL and PostgreSQL Databases

    Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

    Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.
    Try Free
  • 5
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    CSVboard

    CSV editor to open CSV files with minimum effort

    CSVboard is a tool for loading CSV files with minimum effort. Since I placed a great importance on its easy of use I implemented a search and filtering engine which provides you with the facility to efficiently find specified rows within a table. DON'T FORGET TO READ THE QUICK TUTORIAL!! Features: Lightweight and portable Set and reset title with Ctrl+q and Ctrl+w Auto-set column widths Auto-detection of delimiters Load files by Dragging&Dropping XML export Powerful search and filtering engine CSVboard was written in Java and JavaFX and actually it is a result of some recycled code collections. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    thymeleaf
    Thymeleaf is a java web template engine designed for XML/XHTML/HTML5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 63 This Week
    Last Update:
    See Project
  • 11
    Jaxe
    Jaxe is a free Java XML editor with a configurable GUI, using XML schemas for validation and XSL for exports in HTML or XML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13

    SwingBox

    Java Swing HTML / CSS rendering component

    SwingBox is a Java Swing component that allows displaying the (X)HTML documents including the CSS support. It is designed as a JEditorPane replacement with considerably better rendering results. SwingBox is pure Java and it is using the CSSBox rendering engine for rendering the documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Researchers Ontology

    Researchers Ontology Search Engine

    This project is a search engine that gathers data from an ontology. We took as exemple an ontology of researchers. The search uses the properties and works with partial queries and substrings. The client searching doesn't have to use the DLQuery nor the Manchester OWL Syntax, the engine we made builds the query in these syntaxes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    knowceans

    Utility classes from maps to search engine to random samplers

    .... --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Java browser and wysiwyg | source editor of html SFI(Structure-Fragment-Identifier)-files: 1) creates dynamically Table-Of-Contents 2) One-to-One mapping ToC - browser|editors 3) indexes the words 4) developers can use it for their help-system.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    tinyTiM is a small and lightweight implementation of the Topic Maps API (TMAPI). The engine is implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Book Shepherd helps to manage ebooks, articles, links and other electronic documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Glue 2 is a Semantic Web Service discovery engine fully compatible with the WSMO meta-model and the WSML language that aims at solving polarization problems by using mediators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    JeCARS (Java Extendable Contents And Rights System) is a RESTful webservice which delivers pluggable output formats, e.g. Atom feeds or HTML. Third party applications can be plugged in. A JCR (JSR-170) repository (Jackrabbit) is used for storage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Transforms the markup of a text, based on customizable tables, such as tags from html 3.2 to html+css (or vice)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Glue is a WSMO compliant discovery engine that aims at developing an efficient system for the management of semantically described Web Services and their discovery.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Generates RDF and RDFS ontology documents automatically from HTML pages once given a set of rules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This is an expandable RSS retrieval engine, it has contained the spider, the full text index and so on
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A search engine based on the concept of "Resource" to treat the ordinay "files and dirctories" in a different way in order to develop a new type of resources research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo