HTML-hul Module

1 Introduction

The HTML-hul module recognizes and validates the HTML (Hypertext Markup Language) format [HTML].

The module is invoked with the command-line option:

    jhove ... -m HTML-hul ...
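
The invocation above could be scripted, for example from Python. This is only a sketch: it assumes a `jhove` executable is on the PATH, and the helper names `build_jhove_command` and `run_html_module` are hypothetical. Only the `-m HTML-hul` option comes from the text above; any other options hidden behind the `...` are left out.

```python
import subprocess


def build_jhove_command(path, executable="jhove"):
    """Build the argument list that selects the HTML-hul module for one file.

    The executable name and the file argument are assumptions; only the
    "-m HTML-hul" option is taken from the module documentation.
    """
    return [executable, "-m", "HTML-hul", path]


def run_html_module(path):
    """Run JHOVE on `path` and return whatever it writes to standard output."""
    result = subprocess.run(
        build_jhove_command(path),
        capture_output=True,  # collect stdout/stderr instead of printing them
        text=True,            # decode the output as text
        check=False,          # do not raise on a non-zero exit status
    )
    return result.stdout
```

Calling `run_html_module("page.html")` would then return the module's report for a hypothetical file `page.html`.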

The HTML-hul module recognizes XHTML 1.0 (including transitional, frameset and strict) and 1.1, making use of the XML-hul module. If the XML-hul module is not available, only limited information will be provided on XHTML documents.

This module can be configured with the following parameters:

2 Coverage

The HTML-hul module recognizes and validates the following public profiles:

3 Well-Formedness

For the HTML profiles JHOVE uses the criteria for HTML well-formedness defined by [HTML 3.2, HTML 4.0, HTML 4.01]; for the XHTML profiles, JHOVE uses the criteria defined by [XML]. Specifically, a well-formed HTML document must have no syntactic errors, and must contain at least one of the tags HTML, HEAD, BODY or TITLE.
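
The tag-presence part of this criterion can be illustrated with a short Python sketch. It is not JHOVE's implementation, and it does not check for syntactic errors; it only tests whether at least one of the HTML, HEAD, BODY or TITLE tags appears. The names `TagCollector` and `has_structural_tag` are hypothetical.

```python
from html.parser import HTMLParser

# The four tags named in the well-formedness criterion, in lower case
# because HTMLParser reports tag names lower-cased.
REQUIRED_TAGS = {"html", "head", "body", "title"}


class TagCollector(HTMLParser):
    """Record the name of every start tag seen in the document."""

    def __init__(self):
        super().__init__()
        self.seen = set()

    def handle_starttag(self, tag, attrs):
        self.seen.add(tag)


def has_structural_tag(text):
    """Return True if `text` contains at least one of the required tags."""
    collector = TagCollector()
    collector.feed(text)
    collector.close()
    return bool(collector.seen & REQUIRED_TAGS)
```

Under this sketch, a fragment such as `<p>hello</p>` fails the tag-presence test, while `<HTML><BODY>hello</BODY></HTML>` passes it.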

4 Validity

For the HTML profiles JHOVE uses the criteria for HTML validity defined by [HTML 3.2, HTML 4.0, HTML 4.01]; for the XHTML profiles JHOVE uses the criteria defined by [XHTML 1.0, XHTML 1.1].

5 Representation Information

The MIME type is reported as: text/html [RFC 2854]

In addition to the standard JHOVE representation information, the following HTML-specific properties are reported:

6 Additional Module Properties