[sword-devel] sword add-on

chrislit at crosswire.org chrislit at crosswire.org
Tue May 31 21:40:09 MST 2005


Kirk Lowery is probably the best person to answer how to encode morphology 
in OSIS since he is working on the Linguistic Annotation module, using his 
Leningrad Codex as a partial basis.

On Tue, 31 May 2005, Martin Gruner wrote:

> Hi,
>
> I have a question. The WLC source text I work from includes morphological
> separation markers, which is _very_ helpful when reading the Old Testament.
>
> a) How do I encode them in OSIS? With <seg>some</seg><seg>thing</seg>?

<seg> is the tag you would use in OSIS 2.x as it is currently. But you'll 
need to add some kind of type information, e.g. <seg type="x-morpheme">. 
It works fine even if you have segs embedded within <w> elements.

> b) Should we add an OptionFilter (OSISMorphSegmentation) to turn this on and
> off in the display, as well as a corresponding .conf file entry?

Sounds good to me. What do you have in mind to differentiate 
display/hiding of morphological segmentation? OLB has two BHS modules, one 
with morphemes divided by spaces. As a result, there's no way to 
differentiate between morpheme and word breaks. Pipes, dashes, etc. might 
be better to express morpheme breaks. (Dashes happen to be the linguistic 
standard.)

If you intend to put other grammatical information in the morpheme tags, 
it would probably be better to go with <w> instead of <seg>. It isn't 
really the intended purpose of <w>, but <w> does have attributes for lemma 
& morphological data, which <seg> doesn't. Kirk might be able to advise 
better on this particular point.

--Chris


More information about the sword-devel mailing list