[sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
David Haslam
dfhdfh at protonmail.com
Thu Aug 7 00:24:41 EDT 2025
- Previous message (by thread): [sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
- Next message (by thread): [sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
- Messages sorted by:
[ date ]
[ thread ]
[ subject ]
[ author ]
Hi DM,
I’m puzzled.
You seems to have thought there was a bug which actually wasn’t.
Please refer to [https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references](https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references?wprov=sfti1#)
The # was not a bug !
Regards
David
Sent from [Proton Mail](https://proton.me/mail/home) for iOS
On Wed, Aug 6, 2025 at 22:20, DM Smith < [dmsmith at crosswire.org](mailto:On Wed, Aug 6, 2025 at 22:20, DM Smith <<a href=)> wrote:
> I’ve just checked in a change for osis2mod.
>
> MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
>
> Also:
> * Fixed a bug in hex numeric entities which defined &xHHHH; rather than &#xHHHH;.
> * Added entity sanity check of maximum length of 32.
> * Refactored entity handling into handleEntities and comment handling into handleComments.
> * Changed t_entitytype and t_commentstate into class enums EntityType and CommentState.
> * Added -d 1024 for entity and comment parsing.
>
> Note: The coding allows for 0 padding of the numeric entities.
> Note: The 5 need to be treated specially.
> & or & → &
> < or < → <
> > or > → >
> " or " → " or "
> ' or ' → ' or '
> When converted to these forms, " should be transformed into " except in attributes using " and likewise ' into ' except in attributes having ‘.
>
> I need to update the wiki to match.
>
> In Him,
> DM Smith
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://crosswire.org/pipermail/sword-devel/attachments/20250807/14fdc631/attachment.htm>
- Previous message (by thread): [sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
- Next message (by thread): [sword-devel] MODTOOLS-17 To osis2mod, added conversion of hex and decimal numeric entities to UTF-8, with special handling of <, >, &, ', and ".
- Messages sorted by:
[ date ]
[ thread ]
[ subject ]
[ author ]
More information about the sword-devel
mailing list