[osis-core] Supplemental on regex

Todd Tillinghast osis-core@bibletechnologieswg.org
Wed, 2 Jul 2003 10:50:14 -0600


Patrick,

I still get a problem with <div type="x-abc">.

What did you find with the IBM validator?

I have to admit I am not a regex expert.

Does the #x0000-#x10FFFF range include only ASCII characters or is it
broader than that?  We should allow "x-someNonRomanCharacters".

Do we want to allow all type of punctuation?  I agree that "." and "-"
are good to include.

Todd

> -----Original Message-----
> From: osis-core-admin@bibletechnologieswg.org [mailto:osis-core-
> admin@bibletechnologieswg.org] On Behalf Of Patrick Durusau
> Sent: Wednesday, July 02, 2003 4:42 AM
> To: osis-core@bibletechnologieswg.org
> Subject: [osis-core] Supplemental on regex
> 
> Guys,
> 
> Still tinkering with the regex.
> 
> What is odd is that:
> 
> <xs:pattern value="x-([x0000-x10FFFF-[\p{P}\p{Z}\p{C}]])+"/>
> 
> Works but:
> 
> <xs:pattern value="x-([#x0000-#x10FFFF-[\p{P}\p{Z}\p{C}]])+"/>
> 
> As found at the W3C site:  [#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}]
> (http://www.w3.org/TR/xmlschema-2/) does not.
> 
> Note that with \p{P} included, then "x-ab.c" becomes invalid, so it
> appear that the regex is working.
> 
> What is really odd is that I can't get "x-ab.c" to validate even if
the
> regex is: "x-([x0000-x10FFFF])+". Validates "x-abc" but not "x-ab.c".
> 
> Very odd.
> 
> Suggestions?
> 
> Thanks!
> 
> Hope everyone is waking up to a great day!
> 
> Patrick
> 
> --
> Patrick Durusau
> Director of Research and Development
> Society of Biblical Literature
> Patrick.Durusau@sbl-site.org
> Chair, V1 - Text Processing: Office and Publishing Systems Interface
> Co-Editor, ISO 13250, Topic Maps -- Reference Model
> 
> Topic Maps: Human, not artificial, intelligence at work!
> 
> 
> _______________________________________________
> osis-core mailing list
> osis-core@bibletechnologieswg.org
> http://www.bibletechnologieswg.org/mailman/listinfo/osis-core