[sword-devel] Questions about usfm2osis.pl

johnduffy at cgcf.net johnduffy at cgcf.net
Wed Oct 28 15:56:27 MST 2009


Sorry - I meant poetry not quotation marks.  I've been using Unicode
quotations in UTF-8 instead of <<, < etc.  I read somewhere that the Unicode
quotations in UTF-8 would not require osis markup, which would avoid the
problem nested quotes or quotes going across chapter or section divisions.
The original text does not make good use of quotation marks too which
requires this approach, as it often inserts an additional opening quotation
mark at each new paragraph within a quoted section, while only having one
closing quotation mark and the very end.  I trust that using Unicode
characters will therefore avoid problems in osis and module creation.

John Duffy

-----Original Message-----
From: Chris Little [mailto:chrislit at crosswire.org] 
Sent: 28 October 2009 21:12
To: SWORD Developers' Collaboration Forum
Subject: Re: [sword-devel] Questions about usfm2osis.pl

johnduffy at cgcf.net wrote:
> Does usfm2osis.pl process USFM \q quotation marks?  Bibledit converts them
> to paragraph marks, but I would be keen to keep them as quotation markup
> the osis file, if possible.

\q marks a poetic line, not quotations. usfm2osis.pl correctly encodes 
these as OSIS poetry in all cases that I know of.

USFM offers \qt to mark OT quotations found in the NT, if that's what 
your thinking of. If you just mean quoted speech, there are two standard 
conventions: Modern USFM editions, especially those that are encoded in 
UTF-8, simply encode the marks themselves in UTF-8. Older SFM sometimes 
encodes quotation marks as <<, >>, <, and > for double left & right and 
single left and right quotation marks respectively. usfm2osis.pl decodes 
these and marks them with OSIS <q>.


sword-devel mailing list: sword-devel at crosswire.org
Instructions to unsubscribe/change your settings at above page

More information about the sword-devel mailing list