[osis-core] OSIS work regex

Steve DeRose osis-core@bibletechnologieswg.org
Wed, 14 Aug 2002 11:22:16 -0400


At 09:30 AM -0400 08/14/02, Patrick Durusau wrote:
>Harry,
>
>Harry Plantinga wrote:
>
>>If schema RegExps behave as they do in Perl, the ? is superfluous.
>>Perhaps
>>
>>  [\L\N][\.\L\N]*
>Unfortunately, XML Schemas are deliberately inconsistent with 
>customary regular expressions. Well, the "?" operator is familiar from 
>SGML DTD syntax, but the use of "^" as negation, for example, is 
>contrary to its usual role as an anchor (the reasoning being that in 
>XML Schemas, matches are always anchored: a pattern must match the 
>entire value. Not sure why that justifies being inconsistent, but 
>there you have it).
>
>>
>>The underscore character (_) is pretty commonly used in names and may be
>>present in documents converted to OSIS. I can't see that it would do any
>>harm. Could it be included?  Perhaps
>>[\L\N_][\.\L\N_]*
>>
>Actually I think:
>
>[\p{L}\p{N}_][\.\p{L}\p{N}_]*
>
>(untested)
>
>Steve: comments on adding the underscore?
>

I'm fine with adding underscore; presumably not hyphen, though, since 
it's our range delimiter, yes?
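
As a rough illustration of the pattern being discussed (first character a letter, digit, or underscore; later characters may also be periods; no hyphen), here is a minimal sketch in Python rather than in XML Schema syntax. It rests on assumptions: the names WORK_PATTERN and is_valid_work are hypothetical, and Python's \w is used as an approximate stand-in for [\p{L}\p{N}_] because the standard re module has no \p{...} escapes. It is not the regex from the OSIS schema itself.

```python
import re

# Assumption: \w (Unicode letters, digits, underscore) approximates the
# XML Schema class [\p{L}\p{N}_]; Python's re module lacks \p{...} escapes.
# Hypothetical name, not taken from the OSIS schema.
WORK_PATTERN = re.compile(r"\w[.\w]*")


def is_valid_work(name: str) -> bool:
    """Return True if the entire string matches the candidate pattern."""
    # fullmatch mirrors XML Schema, where a pattern must match the whole value.
    return WORK_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["Bible.KJV", "my_work.v2", "Gen-Exod", ".KJV"]:
        print(candidate, is_valid_work(candidate))
```

Under these assumptions, a name containing a hyphen is rejected, so the hyphen stays available as the range delimiter, while underscores are accepted anywhere in the name.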




-- 

Steve DeRose -- http://www.stg.brown.edu/~sjd
Chair, Bible Technologies Group -- http://www.bibletechnologies.net
Email: sderose@speakeasy.net
Backup email: sjd@stg.brown.edu