THTMLdom is a (Delphi) class with functions to read a HTML source file and dissect it into a tree of THTMLelement. The attributes of the HTML tags are stored in the elements. Functions are provided to select elements on the basis of the attribute values or tag names. The structure of the tree can be shown and it can be rendered as plain text.
The source is plain Delphi pascal, requiring a version that supports Tdictionary. There is no dependency on 3rd party units.
The file to be parsed must have valid HTML4/5 tags. It is not necessary that the HTML is ‘correct’ in the sense that end tags may be wrongly placed or be absent altogether. The speed of processing (reading+parsing) is formidable: 15-40 msec per Mbyte or around 1 msec per 1000 HTML tags.

Features

  • HTML 4 & 5, also with incorrect tag placings
  • Parsed into a tree with the tag attributes in the nodes
  • Text parts in separate elements
  • Javascript lookalike functions for retrieval
  • Fast

Project Activity

See All Activity >

License

MIT License

Follow HTML parser in Delphi

HTML parser in Delphi Web Site

Other Useful Business Software
Level Up Your Cyber Defense with External Threat Management Icon
Level Up Your Cyber Defense with External Threat Management

See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
Try for Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • hello
Read more reviews >

Additional Project Details

Operating Systems

Windows

Languages

English

Intended Audience

Developers, Information Technology

Programming Language

Delphi/Kylix

Related Categories

Delphi/Kylix HTML XHTML, Delphi/Kylix Software Development Software

Registered

2019-11-24