[sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
DM Smith
dmsmith at crosswire.org
Wed Aug 6 17:20:19 EDT 2025
I’ve just checked in a change for osis2mod.
MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
Also:
* Fixed a bug in hex numeric entities which defined &xHHHH; rather than &#xHHHH;.
* Added entity sanity check of maximum length of 32.
* Refactored entity handling into handleEntities and comment handling into handleComments.
* Changed t_entitytype and t_commentstate into class enums EntityType and CommentState.
* Added -d 1024 for entity and comment parsing.
Note: The coding allows for 0 padding of the numeric entities.
Note: The 5 need to be treated specially.
& or & → &
< or < → <
> or > → >
" or " → " or "
' or ' → ' or '
When converted to these forms, " should be transformed into " except in attributes using " and likewise ' into ' except in attributes having ‘.
I need to update the wiki to match.
In Him,
DM Smith
More information about the sword-devel
mailing list