Search Results for "java html parser" - Page 2

Showing 2102 open source projects for "java html parser"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    markdown-rs

    markdown-rs

    CommonMark compliant markdown parser in Rust with ASTs and extensions

    markdown-rs is an open-source markdown parser written in Rust. It’s implemented as a state machine (#![no_std] + alloc) that emits concrete tokens, so that every byte is accounted for, with positional info. The API then exposes this information as an AST, which is easier to work with, or it compiles directly to HTML. While most markdown parsers work towards compliancy with CommonMark (or GFM), this project goes further by following how the reference parsers (cmark, cmark-gfm) work, which...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Karate

    Karate

    Test automation made simple

    Karate is the only open-source tool to combine API test-automation, mocks, performance-testing and even UI automation into a single, unified framework. The BDD syntax popularized by Cucumber is language-neutral, and easy for even non-programmers. Assertions and HTML reports are built-in, and you can run tests in parallel for speed. There’s also a cross-platform stand-alone executable for teams not comfortable with Java. You don’t have to compile code. Just write tests in a simple, readable...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Intellij rainbow brackets

    Intellij rainbow brackets

    Rainbow Brackets for IntelliJ based IDEs/Android Studio/HUAWEI DevEco

    Compatible with IntelliJ IDEA, PhpStorm, WebStorm, PyCharm, RubyMine, AppCode, CLion, Gogland, DataGrip, Rider, MPS, Android Studio, HUAWEI DevEco Studio, DataSpell, Code With Me. Supports Java, Scala, Clojure, Kotlin, Python, Haskell, Agda, Rust, JavaScript, TypeScript, Erlang, Go, Groovy, Ruby, Elixir, ObjectiveC, PHP, HTML, XML, SQL, Apex language, C#, Dart, Pug/Jade, Bash, Vue.js, C# Razor Pages, GLSL(the OpenGL Shading Language), Go Template, C++, C, etc. Eliminate context switching...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Turn Your Content into Interactive Magic - For Free Icon
    Turn Your Content into Interactive Magic - For Free

    From Canva to Slides, Desmos to YouTube, Lumio works with the tech tools you are already using.

    Transform anything you share into an engaging digital experience - for free. Instantly convert your PDFs, slides, and files into dynamic, interactive sessions with built-in collaboration tools, activities, and real-time assessment. From teaching to training to team building, make every presentation unforgettable. Used by millions for education, business, and professional development.
    Start Free Forever
  • 5
    Quarkdown

    Quarkdown

    Markdown with superpowers, from ideas to papers, and presentations

    Quarkdown is a lightweight Markdown processor and static site generator written in Java. It converts Markdown files into styled HTML pages with customizable themes, supporting blog creation and simple documentation websites. Quarkdown emphasizes simplicity and speed, providing an out-of-the-box experience for minimal personal sites.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    capacitor

    capacitor

    Build cross-platform native progressive web apps for iOS and Android

    Capacitor is an open source native runtime for building Web Native apps. Create cross-platform iOS, Android, and Progressive Web Apps with JavaScript, HTML, and CSS. Capacitor’s native plugin APIs make it extremely easy to access and invoke common device functionality across multiple platforms. Build web-based applications that run equally well across iOS, Android, and as Progressive Web Apps. Access the full Native SDKs on each platform, and easily deploy to the App Stores (and the web). Add...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    Opal

    Opal

    Opal is a Ruby to JavaScript source-to-source compiler

    Opal is a Ruby to JavaScript source-to-source compiler. It comes packed with the Ruby corelib you know and love. It is both fast as a runtime and small in its footprint. The lib directory holds the Opal parser/compiler used to compile Ruby into JavaScript. It is also built ready for the browser into opal-parser.js to allow compilation in any JavaScript environment. This directory holds the Opal runtime and corelib implemented in Ruby and JavaScript. opal-parser allows you to eval Ruby code...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    crawley

    crawley

    The unix-way web crawler

    Crawls web pages and prints any link it can find. Fast HTML SAX-parser (powered by golang.org/x/net/html) Small (below 1500 SLOC), idiomatic, 100% test-covered codebase. Grabs most of useful resources URLs (pics, videos, audios, forms, etc...) Found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted) Scan depth (limited by starting host and path, by default - 0) can be configured. Can crawl rules and sitemaps from robots.txt. Brute mode - scan HTML comments for URLs...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Agents-Flex

    Agents-Flex

    Agents-Flex is an elegant LLM Application Framework like LangChain

    ... definitions, parsing, callbacks through LLMs, and executing local methods to obtain results. Agents-Flex offers Loader, Parser, and Splitter components for the Document. Each component has multiple implementations, making it easy to load data from the web, local files, databases, and various data types.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 10
    Cheerio

    Cheerio

    Implementation of core jQuery designed for the server

    Fast, flexible & lean implementation of core jQuery designed specifically for the server. Cheerio implements a subset of core jQuery. Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API. Cheerio works with a very simple, consistent DOM model. As a result parsing, manipulating, and rendering are incredibly efficient. Cheerio wraps around parse5 parser and can optionally use @FB55's forgiving htmlparser2. Cheerio can parse nearly...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    JSqlParser

    JSqlParser

    JSqlParser parses an SQL statement and translate it into a hierarchy

    JSqlParser is a SQL statement parser. It translates SQLs in a traversable hierarchy of Java classes. JSqlParser is not limited to one database but provides support for a lot of specials of Oracle, SqlServer, MySQL, PostgreSQL. To name some, it has support for Oracles join syntax using (+), PostgreSQLs cast syntax using ::, relational operators like != and so on. Then the result can be accessed in a structured way. The generated Java class hierarchy can be navigated using the Visitor Pattern.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    JavaParser

    JavaParser

    Java 1-17 Parser and Abstract Syntax Tree for Java

    This project contains a set of libraries implementing a Java 1.0 - Java 17 Parser with advanced analysis functionalities. The project binaries are available in Maven Central. We strongly advise users to adopt Maven, Gradle or another build system for their projects. If you are not familiar with them we suggest taking a look at the maven quickstart projects. Since Version 3.5.10, the JavaParser project includes the JavaSymbolSolver. While JavaParser generates an Abstract Syntax Tree...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    ngx-markdown

    ngx-markdown

    Angular markdown component/directive/pipe/service to parse static

    Angular markdown component/directive/pipe/service to parse static, dynamic or remote content to HTML with syntax highlight and more. As the library is using Marked parser you will need to add node_modules/marked/marked.min.js to your application. To use the line numbers plugin that shows line numbers in code blocks, in addition to Prism.js configuration files, you will need to include the following files from prismjs/plugins/line-numbers directory to your application. Using markdown component...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    WebMagic

    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a scalable crawler framework. It covers the whole lifecycle of crawler, downloading, url management, content extraction and persistent. It can simplify the development of a specific crawler. WebMagic is a simple but scalable crawler framework. You can develop a crawler easily based on it. WebMagic has a simple core with high flexibility, a simple API for html extracting. It also provides annotation with POJO to customize a crawler, and no configuration is needed. Some other features...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    JSON Schema Validator

    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09 and v2020-12(partial) specification for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help. The default JSON parser is the Jackson which is the most popular one. As it is a key component in our light-4j microservices framework to validate...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    DocFX

    DocFX

    Static site generator for .NET API documentation

    DocFX can produce documentation from source code (including C#, F#, Visual Basic, REST, JavaScript, Java, Python and TypeScript) as well as raw Markdown files. DocFX can run on Linux, macOS, and Windows. The generated static website can be deployed to any host such as GitHub Pages or Azure Websites with no additional configuration. DocFX provides a flexible way to customize templates and themes. DocFX makes it extremely easy to generate your developer hub with a landing page, API reference...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Kryo

    Kryo

    Java binary serialization and cloning, fast, efficient, automatic

    Kryo is a fast and efficient binary object graph serialization framework for Java. The goals of the project are high speed, low size, and an easy-to-use API. The project is useful any time objects need to be persisted, whether to a file, database or over the network. Kryo can also perform automatic deep and shallow copying/cloning. This is direct copying from object to object, not object to bytes to object. Kryo has three sets of methods for reading and writing objects. If the concrete class...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    AngleSharp

    AngleSharp

    The ultimate angle brackets parser library parsing HTML5, MathML, SVG

    AngleSharp follows the W3C specifications and gives you the same results as state of the art browsers. Besides the official API AngleSharp adds some useful extension methods on top. This makes working with the DOM convenient. AngleSharp integrates everything you need to explore and mutate the DOM tree. Node retrieval is straight forward by using powerful CSS query selectors. The CSS queries in AngleSharp are super fast and very simple to use. AngleSharp respects the relationship of HTML...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Byte Buddy

    Byte Buddy

    Runtime code generation for the Java virtual machine

    Byte Buddy is a code generation and manipulation library for creating and modifying Java classes during the runtime of a Java application and without the help of a compiler. Other than the code generation utilities that ship with the Java Class Library, Byte Buddy allows the creation of arbitrary classes and is not limited to implementing interfaces for the creation of runtime proxies. Furthermore, Byte Buddy offers a convenient API for changing classes either manually, using a Java agent...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    sakura

    sakura

    A minimal CSS framework/theme

    ..., especially when working on backend sites and can't yet be bothered to fidget with CSS/HTML. Building a quick (but pretty) site/blog for your best friend or aunt! No need to remember tons of different class names for every other CSS framework. Works amazingly with markdown generated HTML pages (eliminates the need of hacks like including .img img-responsive in markdown-parser generated <img></img> tags).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor and Toolchain written with JavaFX 19

    Asciidoc FX is a WYSIWYG editor for the Asciidoc markup language. You can build PDF, Epub, and HTML books, documents, and slides. Supported Operating Systems and Builds shows the list of available builds with links for reference. If you are looking for the very latest version, visit the link in the note above to be guaranteed of downloading the latest and greatest version of AsciidocFX. AsciidocFX converts documents via the AsciidoctorJ library.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    apiDoc

    apiDoc

    RESTful web API documentation Generator

    apiDoc creates a documentation from API annotations in your source code. apiDoc gives you the ability to attach a version number to an API so you can easily track changes between versions. Creates an apiDoc of all files within dir myapp/, uses template from dir mytemplate/ and put all output to dir apidoc/. Without any parameter, apiDoc generate a documentation from all .cs .dart .erl .go .java .js .php .py .rb .ts files in current dir (incl. subdirs) and writes the output to ./doc/. apiDoc...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Apollo Kotlin

    Apollo Kotlin

    A strongly-typed, caching GraphQL client for the JVM, Android, etc.

    A strongly-typed, caching GraphQL client for Java and Kotlin multiplatform. Apollo Kotlin (formerly Apollo Android) is a GraphQL client that generates Kotlin and Java models from GraphQL queries. Apollo Kotlin executes queries and mutations against a GraphQL server and returns results as query-specific Kotlin types. This means you don't have to deal with parsing JSON, or passing around Maps and making clients cast values to the right type manually. You also don't have to write model types...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    JSON implementation for Ruby

    JSON implementation for Ruby

    JSON implementation for Ruby

    This is an implementation of the JSON specification according to RFC 4627. You can think of it as a low-fat alternative to XML if you want to store data on disk or transmit it over a network rather than use a verbose markup language. Both variants of the JSON generator generate UTF-8 character sequences by default. If an:ascii_only option with a true value is given, they escape all non-ASCII and control characters with \uXXXX escape sequences, and support UTF-16 surrogate pairs in order to...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Sanitize

    Sanitize

    Ruby HTML and CSS sanitizer

    ... that you don't explicitly allow will be removed. Sanitize is based on the Nokogiri HTML5 parser, which parses HTML the same way modern browsers do, and Crass, which parses CSS the same way modern browsers do. As long as your allowlist config only allows safe markup and CSS, even the most malformed or malicious input will be transformed into safe output.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.