[sword-devel] Chinese Strongs markup

Daniel Owens dhowens at pmbx.net
Wed Apr 25 07:34:51 MST 2012


This issue crops up in the WHM module because prefixed and suffixed 
morphemes are parsed separately. Here is an example: <w 
lemma="strong:lemma1 strong:lemma2" morph="whmmorph:morph1 
whmmorph:morph2" src="1 2">וַיְהִ֣י</w>. The waw conjunction is parsed 
separately from the verb. I do not think that @src is essential, but it 
might be useful information some day.


On 04/25/2012 08:49 AM, Troy A. Griffitts wrote:
> David,
> I'm sorry no (including me) has responded to you on this yet.
> We used to use '|' to separate lists in osis in the first rev or so 
> (or at least we (CrossWire) used them, until it was pointed out that 
> lists in XML (at least in TEI) were delineated with spaces, so the 
> lemma and morph attributes on <w> elements are space separated lists.  
> fine, but this doesn't solve your problem.  With '|', I would have 
> encoded your example like this:
> > eg: Gen.3.8: <w lemma="strong:H8085 strong:H853 strong:H3068"
> > morph="StrongsMorph:H8799">[hear]</w>
> <w lemma="strong:H8085|strong:H853|strong:H3068" 
> morph="StrongsMorph:H8799||">[hear]</w>
> I suppose you could still do that with 2 spaces instead of of the '|', 
> but this isn't that it's not simply extraneous spaces in your 
> attribute value.
> Anyway, the morph list should mirror the lemma list.  How you do that? 
> Well, maybe:
> <w lemma="strong:H8085 strong:H853 strong:H3068" 
> morph="StrongsMorph:H8799 StrongsMorph:H00 StrongsMorph:H00">[hear]</w>
> (your email to jsword reminding me of this old question from you and 
> sparked a possible solution)
> Maybe someone else has a better suggestion,
> Troy
> On 04/12/2012 10:16 AM, David Instone-Brewer wrote:
>> I'm starting work on correcting some errors in the Chinese Strongs 
>> markup,
>> but I want to step carefully cos I'm new to this.
>> I couldn't find a Wiki page outlining what the rules are for Strongs
>> markup.
>> There's a note at
>> http://crosswire.org/wiki/Modules_in_the_beta_repository/ChiUns
>> pointing out one problem, but there are many, though it is basically a
>> very good markup.
>> It appears to be based on the free KJV markup, though it also notes
>> every occurrence of the Hebrew article /eth
>> /This isn't very useful, but it shows that they were keen to include
>> everything and that they did a lot of work on it.
>> In order to make sure they include every Hebrew word, they often
>> combined words in one entry,
>> eg: Gen.1.2: <w lemma="strong:H5921 strong:H6440">[on the surface 
>> of]</w>
>> Sometimes this becomes complicated when one or more of the words is a 
>> verb,
>> eg: Gen.2.2: <w lemma="strong:H4480 strong:H3605 strong:H834
>> strong:H6213" morph="StrongsMorph:H8804">[rested him from all the]</w>
>> - the numbers represent the words for "from" "all" "/eth/" and "working"
>> - in this case the morphology goes with the last lemma (H6213) but this
>> isn't always the case.
>> eg: Gen.3.8: <w lemma="strong:H8085 strong:H853 strong:H3068"
>> morph="StrongsMorph:H8799">[hear]</w>
>> - the numbers represent: "hear" "/eth" /"LORD" and "Qal Imperfect"
>> -in this case the morphology relates to the first of the three lemmas
>> listed.
>> I can sort this out, but I'd like to know how to record it
>> - ie how can I indicate in correct coding that a morphology links with a
>> particular lemma when more than one lemma is translated by a single 
>> word?
>> I notice that the same thing happens in the KJV strongs (though less 
>> often),
>> eg Gen.6.3 <w lemma="strong:H07683 strong:H01571"
>> morph="strongMorph:TH8800 strongMorph:TH8677">for that he also</w>
>> Is it OK to mark such occurrences as follows?
>> Gen.6.3 <w lemma="strong:H07683" morph="strongMorph:TH8800"
>> lemma="strong:H01571" morph="strongMorph:TH8677">for that he also</w>
>> This isn't good XML, but perhaps it is allowable?
>> Or is there some other way to indicate which lemma the morphology
>> belongs to?
>>     David IB
>>     /// Dr David Instone-Brewer
>>     dib Senior Research Fellow in Rabbinics and the New Testament
>>     ^ Tyndale House, 36 Selwyn Gardens, Cambridge, CB3 9BA, UK
>>     \=/ Rabbinics at Tyndale.cam.ac.uk www.TyndaleHouse.com
>> <http://www.tyndalehouse.com/>
>> _______________________________________________
>> sword-devel mailing list: sword-devel at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page

More information about the sword-devel mailing list