[osis-core] morph regex error

Troy A. Griffitts osis-core@bibletechnologieswg.org
Thu, 04 Dec 2003 11:30:18 -0700


:)

Unless I'm going senile-- which I've been suspecting for some time now-- 
I believe that the last discussion on this subject, before release of 
2.0, concluded that lemma, xlit, gloss, and morph WOULD NOT be 
restricted by osisRef syntax.  We would make a separate complexType for 
them, which basically would allow: prefix:any_string

I think I wanted to allow spaces (expecially for gloss), Patrick found 
real world occurances of other systems that used prohibiting characters, 
as well.

So the conclusion was either:

prefix:any_string

or

prefix:any string

I think Steve may have made some push for replacing the 'space' but 
don't remember the conclusion on that one.

But regardless, there are no spaces in my offending line that I quoted 
earlier, and yet I still get an error.

If I have to remove the cobwebs to defend this again, I will try, but 
think it's just a mis-sight in the .xsd.

	-Troy.




Chris Little wrote:
> Okay, okay.  No need to shout.  Don't kill the messenger.  Etc. :)
> 
> The problem with changing the format is that we can no longer use morph, 
> lemma, etc. values as osisRefs.  As it stands, any of these attributes 
> could double as an osisRef/osisID.  So your lexicon, organized by lemma, 
> could have divisions with osisIDs that are the same as their lemma 
> values.  Likewise, if you organize the Robinson morphology scheme as a 
> sort of lexicon, you can look up entries and tag them with osisIDs that 
> are identical to your morph value.
> 
> --Chris
> 
> Troy A. Griffitts wrote:
> 
>> NO!
>>
>>
>> Chris Little wrote:
>>
>>> Troy A. Griffitts wrote:
>>>
>>>> Hey guys.  It seems we may have messed up the regex on the morph 
>>>> attribute of <w>.
>>>>
>>>> Here my line:
>>>>
>>>> <w xml:lang="grc" lemma="strongs:15" morph="robinsons:V-PAM-2P" 
>>>> xlit="la:agaqopoieite">GREEK UTF8 TEXT HERE</w>
>>>>
>>>>
>>>>
>>>> Here's the MSV error output:
>>>>
>>>> Error at line:279, column:117 of 
>>>> file:///space/home/scribe/msv/./lexcounts
>>>>   attribute "morph" has a bad value: the value does not match the 
>>>> regular expression 
>>>> "((((\p{L}|\p{N}|_)+)(\.(\p{L}|\p{N}|_))*:)((((\p{L})|(\p{N})|_)+)(((\.(\p{L}|\p{N}|_)+)*))?))". 
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> The value you give has never been valid.  Hyphens have never been 
>>> allowed in morph or lemma attributes (nor have spaces and various 
>>> other characters).  I think the decision we made before releasing 2.0 
>>> was to force folks to transcode these as '_'.
>>>
>>> Does that work for you?
>>>
>>> --Chris
> 
> 
> 
> _______________________________________________
> osis-core mailing list
> osis-core@bibletechnologieswg.org
> http://www.bibletechnologieswg.org/mailman/listinfo/osis-core