Performs unicode normalization on all XML documents in a fileset using one of the four standard normalization forms provided by the Unicode Consortium.
For more information on the reasons for and practice of Unicode normalization, see:
The transformer is written to work on any file/fileset that can be represented by the org.daisy.util.fileset package.
Normalization will only be done on XML members of the input fileset; all other types of members pass through untouched.
If no file in the fileset is of type XML, then the whole fileset will pass through untouched. It is therefore safe to place this transformer in contexts whose dataflow varies considerably.
A file/fileset whose XML members has been normalized using one of the four Unicode normalization algorithms. See parameters
No specific recovery scheme. On error, this transformer will send a fatal message, then throw an exception and abort.
Selects normalization form to use. Allowed values: NFD|NFKD|NFC|NFKC. Default: NFC, which is the one recommended in Character Model for the World Wide Web.
None.
No known refactoring wishes at the time of writing.
Markus Gylling, Daisy Consortium
LGPL