[sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".

DM Smith dmsmith at crosswire.org
Wed Aug 6 17:20:19 EDT 2025


I’ve just checked in a change for osis2mod.

MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".

Also:
 * Fixed a bug in hex numeric entities which defined &xHHHH; rather than &#xHHHH;.
 * Added entity sanity check of maximum length of 32.
 * Refactored entity handling into handleEntities and comment handling into handleComments.
 * Changed t_entitytype and t_commentstate into class enums EntityType and CommentState.
 * Added -d 1024 for entity and comment parsing.

Note: The coding allows for 0 padding of the numeric entities.
Note: The 5 need to be treated specially.
	& or &#x26; → &
	< or &#x3C; → <
	> or &#x3E; → >
	" or &#x22; → " or "
	' or &#x27; → ' or '
When converted to these forms, " should be transformed into " except in attributes using " and likewise ' into ' except in attributes having ‘.

I need to update the wiki to match.

In Him,
	DM Smith



More information about the sword-devel mailing list