[osis-core] OSIS work regex

Patrick Durusau osis-core@bibletechnologieswg.org
Wed, 14 Aug 2002 11:16:44 -0400


Steve,

Yes, not a hyphen since it is the range delimiter.

Patrick

Steve DeRose wrote:

> At 09:30 AM -0400 08/14/02, Patrick Durusau wrote:
>
>> Harry,
>>
>> Harry Plantinga wrote:
>>
>>> If schema RegExps behave as they do in Perl, the ? is superfluous.
>>> Perhaps
>>>
>>>  [\L\N][\.\L\N]*
>>
>> Unfortunately, XML Schemas are deliberately inconsistent with 
>> customary regex expressions. Well, the "?" operator is familiar from 
>> SGML DTD syntax but the use of "^" as negation, for example, is 
>> contrary to its usual role as an anchor (the reasoning being that in 
>> XML Schemas, matches are always anchored at the start of the line. no 
>> sure why that justifies being inconsistent but there you have it)
>>
>>>
>>> The underscore character (_) is pretty commonly used in names and 
>>> may be
>>> present in documents converted to OSIS. I can't see that it would do 
>>> any
>>> harm. Could it be included?  Perhaps
>>> [\L\N_][\.\L\N_]*
>>>
>> Actually I think:
>>
>> (\p{L}\p{N}_)((\.(\p{L}\p{N}_)*)
>>
>> (untested)
>>
>> Steve: comments on adding the underscore?
>>
>
> I'm fine with adding underscore; presumably not hyphen, though, since 
> it's our range delimiter, yes?
>
>
>
>

-- 
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
pdurusau@emory.edu