public class Utf8Module extends ModuleBase
Modifier and Type | Field and Description |
---|---|
protected Map<Integer,String> |
_controlCharMap |
protected ChecksumInputStream |
_cstream
PRIVATE INSTANCE FIELDS.
|
protected DataInputStream |
_dstream |
protected boolean |
_lineEndCR |
protected boolean |
_lineEndCRLF |
protected boolean |
_lineEndLF |
protected int |
_prevChar |
protected TextMDMetadata |
_textMD |
protected boolean |
_withTextMD |
protected Utf8BlockMarker |
blockMarker |
protected int[] |
initialBytes |
_app, _bigEndian, _checksumFinished, _countStream, _coverage, _crc32, _date, _defaultParams, _features, _format, _init, _isRandomAccess, _je, _logger, _md5, _mimeType, _name, _nByte, _note, _param, _release, _repInfoNote, _rights, _sha1, _signature, _specification, _validityNote, _vendor, _verbosity, _wellFormedNote
MAXIMUM_VERBOSITY, MINIMUM_VERBOSITY
Constructor and Description |
---|
Utf8Module()
Creates a Utf8Module.
|
Modifier and Type | Method and Description |
---|---|
protected void |
checkLineEnd(int ch)
Accumulate information about line endings.
|
protected boolean |
checkMark(int byt,
RepInfo info) |
void |
checkSignatures(File file,
InputStream stream,
RepInfo info)
Check if the digital object conforms to this Module's internal signature
information.
|
int |
parse(InputStream stream,
RepInfo info,
int parseIndex)
Parse the content of a stream digital object and store the results in
RepInfo.
|
addIntegerProperty, addIntegerProperty, applyDefaultParams, calcRAChecksum, checkSignatures, getApp, getBase, getBufferedDataStream, getCoverage, getCRC32, getDate, getDefaultParams, getFeatures, getFormat, getMimeType, getName, getNByte, getNote, getRelease, getRepInfoNote, getRights, getSignature, getSpecification, getValidityNote, getVendor, getWellFormedNote, hasFeature, init, initFeatures, initParse, isBigEndian, isRandomAccess, param, parse, readByteBuf, readDouble, readDouble, readDouble, readFloat, readFloat, readSignedByte, readSignedByte, readSignedByte, readSignedInt, readSignedInt, readSignedInt, readSignedLong, readSignedRational, readSignedRational, readSignedShort, readSignedShort, readSignedShort, readUnsignedByte, readUnsignedByte, readUnsignedByte, readUnsignedInt, readUnsignedInt, readUnsignedInt, readUnsignedRational, readUnsignedRational, readUnsignedRational, readUnsignedShort, readUnsignedShort, readUnsignedShort, resetParams, setApp, setBase, setChecksums, setCRC32, setDefaultParams, setMD5, setNByte, setSHA1, setValidityNote, setVerbosity, show, skipBytes, skipBytes, vectorToPropArray
protected ChecksumInputStream _cstream
protected DataInputStream _dstream
protected boolean _lineEndCR
protected boolean _lineEndLF
protected boolean _lineEndCRLF
protected int _prevChar
protected int[] initialBytes
protected Utf8BlockMarker blockMarker
protected boolean _withTextMD
protected TextMDMetadata _textMD
public final int parse(InputStream stream, RepInfo info, int parseIndex) throws IOException
parse
in interface Module
parse
in class ModuleBase
stream
- An InputStream, positioned at its beginning, which is
generated from the object to be parsed. If multiple calls to
parse
are made on the basis of a nonzero value
being returned, a new InputStream must be provided each time.info
- A fresh (on the first call) RepInfo object which will be
modified to reflect the results of the parsing If multiple
calls to parse
are made on the basis of a nonzero
value being returned, the same RepInfo object should be passed
with each call.parseIndex
- Must be 0 in first call to parse
. If
parse
returns a nonzero value, it must be called
again with parseIndex
equal to that return value.IOException
public void checkSignatures(File file, InputStream stream, RepInfo info) throws IOException
checkSignatures
in interface Module
checkSignatures
in class ModuleBase
file
- A File object for the object being parsedstream
- An InputStream, positioned at its beginning, which is
generated from the object to be parsedinfo
- A fresh RepInfo object which will be modified to reflect the
results of the testIOException
protected void checkLineEnd(int ch)
ch
- Current characterprotected boolean checkMark(int byt, RepInfo info)
Copyright © 2008–2017 The Open Preservation Foundation. All rights reserved.