[sword-devel] Questions about usfm2osis.pl

DM Smith dmsmith at crosswire.org
Wed Oct 28 09:19:31 MST 2009


On Oct 28, 2009, at 12:04 PM, <johnduffy at cgcf.net> wrote:

> Hi Daniel,
>
> Yes, it works even using a relative path if only one file is  
> specified but not the *.sfm.

Expansion of * is something that the os needs to do. I don't know if  
or how Windows does it.
>
> It is interesting that an unrecognised character is seen at the  
> start of the file, which appears as a square box in front of (or  
> here on top of) the first backslash.
This sounds like a BOM (byte order mark). FEFF, I think.  It can be  
deleted if the file is utf-8.

>  It also keeps the \id as in the extract below, but misses the book  
> div and all book abbreviations in the verseIDs.
>
> From USFM2osis.pl for single book of Psalms in Windows:
> </header>
> \id PSA
> <title type="main">Leabhar na SALM</title>
> <div type="majorSection">
> <title>An Chéad Leabhar</title>
> <chapter sID=".1" osisID=".1"/>
> <div type="section">
> <title>Fíorshonas</title>
> <lg>
> <l level="1">
> <verse sID=".1.1" osisID=".1.1"/>
> Is aoibhinn don fhear nach...
>
> From USFM2osis.pl in Linux Psalms extract from whole Bible (Note no  
> poetry markup was present when this was done, as well as combined  
> titles):
> <div type="book" osisID="Ps">
>   <title short="Leabhar na SALM"/>
>   <div>
>    <title type="main">Leabhar na SALM</title>
>    <chapter sID="Ps.1" osisID="Ps.1" n="1"/>
>   </div>
>   <div type="section" canonical="true">
>    <title level="1">An Chéad Leabhar -- Fíorshonas</title>
>    <verse sID="Ps.1.1" osisID="Ps.1.1"/>
>    <p> Is aoibhinn don fhear nach...
>
> In the process, I've seen on individual files that USFM2osis.pl does  
> handle poetry as well as paragraph markup.  If it handled the  
> verseIDs correctly, I'd be tempted to do each Bible book separately  
> and then try to put them all together into a single osis file.
>
> Alternatively, since I'm happy with the USFM content now that I've  
> got both the paragraph and poetry markup in place, as well as  
> headings separated out into different levels, do you think that  
> you'd have time to compile it again in Linux?
>
> Thanks
>
> John Duffy
>
> When I have used usfm2osis.pl successfully it has always been in  
> Linux.
> I'd be willing to bet that Chris knows the answer to this one, but the
> problem is not the USFM files but how usfm2osis.pl grabs the input  
> from
> them. It is possible to process one file if you give an argument  
> with an
> absolute path. However, if you use wildcards like *.sfm, it doesn't
> work. I don't know why. Again, I'm guessing Chris knows how to do  
> this,
> but I've never figured it out. I've just used Linux.
>
> Daniel
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page



More information about the sword-devel mailing list