UTF8-hul Module

1 Introduction

The UTF8-hul module recognizes and validates content streams encoded with the Unicode UTF-8 encoding.

The module is invoked with the -m command line option:

      jhove ... -m UTF8-hul ...
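For readers who want a feel for what checking a UTF-8 content stream involves, the following is a minimal sketch that uses only the Java standard library. It is not JHOVE's implementation and does not reproduce the module's own criteria; the class name Utf8Check and the method isWellFormedUtf8 are hypothetical names chosen for illustration. The sketch asks a strict decoder to report malformed byte sequences instead of silently replacing them.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of JHOVE.
final class Utf8Check {

    // Returns true if the bytes decode as UTF-8 without any malformed sequence.
    static boolean isWellFormedUtf8(byte[] data) {
        // REPORT makes the decoder throw instead of substituting U+FFFD.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(data));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        byte[] ok = "h\u00e9llo".getBytes(StandardCharsets.UTF_8);
        // 0xC3 starts a two-byte sequence, but 0x28 is not a continuation byte.
        byte[] bad = {(byte) 0xC3, (byte) 0x28};
        System.out.println(isWellFormedUtf8(ok));   // prints "true"
        System.out.println(isWellFormedUtf8(bad));  // prints "false"
    }
}
```

This sketch decodes the whole input in memory, which is convenient for small files; a streaming check over large content would feed the decoder in chunks instead.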

This module can be configured with the following parameters:

Coverage

Well-Formedness

The following criteria must be met by a UTF-8 content stream for JHOVE to consider it well-formed:

Validity

The following criteria must be met by a UTF-8 encoded file for JHOVE to consider it valid:

Representation Information

The MIME type is reported as: text/plain; charset=UTF-8

In addition to the standard JHOVE representation information, the module defines the following properties: