[sword-devel] Scripture Reference Tagging Tool
Don A. Elbourne Jr.
Wed, 12 Feb 2003 12:14:39 -0600
Thanks Chris. I just toyed around with vs2osisref. Very cool. Its a good
start. I tried all sorts of abbreviations and verse spans and it correctly
interpreted all but the most obscure non-standard ones I threw at it. Now if
we could just feed it a whole text file and have it loop through looking for
everything it recognized and generate a logfile so you could eyeball the
thing for mistakes, that would rock. I need to learn Perl.
by grace alone,
Don A. Elbourne Jr.
----- Original Message -----
From: "Chris Little" <firstname.lastname@example.org>
Sent: Wednesday, February 12, 2003 11:23 AM
Subject: Re: [sword-devel] Scripture Reference Tagging Tool
> On Wed, 12 Feb 2003, Don A. Elbourne Jr. wrote:
> > Is there a scripture reference tagging tool? It seems I remember Chris
> > mentioning one once. What I mean is a tool that would suck up an input
> > recognize verse reference strings like "John 3:16" and wrap some sort of
> > ThML/OSIS/whatever markup around it.
> Not currently. There are two parts to the problem:
> First is recognizing the reference. This is generally very difficult
> because of the hundreds of ways you can identify a passage, plus the
> myriad book abbreviations/names that can be used, plus the different
> possibilities for ranges/lists of verses. I wrote something in Perl to do
> this for the Catholic Encyclopedia a while back, but haven't been able to
> locate it. It was extremely specific to that text anyway. Perl seems
> like the best way handle recognizing references because of its regular
> expression support.
> Second, you need to convert the reference to OSIS. For simple references
> this is pretty simple, but anything with dis-contiguous ranges/lists will
> require that you decide where the parts are located in the reference and
> mark them. There's a vs2osisref utility in the utilities directory that
> you may use to generate the actual OSIS tags, but it's a long way from a
> complete solution.
> This would be a good area for someone looking to have a little project to
> work in. :) If the Perl/regex part could be converted to C, that would be
> even better.
> sword-devel mailing list