[osis-core] 1.9.7 attached!

Patrick Durusau osis-core@bibletechnologieswg.org
Sat, 18 Oct 2003 04:56:52 -0400


Todd,

The regex is probably not correct. Written after I was very tired. Up 
too early now but after some more sleep I will do it again and this time 
follow my usual course of checking it in my regex checking XML file.

Sorry 'bout that!

Hope you are having a great day!

Patrick

Todd Tillinghast wrote:
> Patrick,
> 
> Looking at the schema and trying out an example, I think the new regex
> for osisGenType is not quite right.
> 
> I was expecting the following to be valid: <w lemma="x:34 x:554">xyz</w>
> did I miss something?
> 
> Todd
> 
> 
>>Greetings!
>>
>>osisCore.1.9.7 is attached!
>>
>>xml:lang:
>>
>>Now required on osisTextCT.
>>
>>XML Spec says the following:
>>
>>
>>>[IETF RFC 1766] tags are constructed from two-letter language codes
>>
> as
> 
>>defined by [ISO 639], from two-letter country codes as defined by [ISO
>>3166], or from language identifiers registered with the Internet
> 
> Assigned
> 
>>Numbers Authority [IANA-LANGCODES]. It is expected that the successor
> 
> to
> 
>>[IETF RFC 1766] will introduce three-letter language codes for
> 
> languages
> 
>>not presently covered by [ISO 639].
>>
>>This means you get three bites:
>>
>>1. ISO 639 - http://sunsite.berkeley.edu/amher/iso_639.html
>>
>>2. RFC 1766 - http://www.ietf.org/rfc/rfc1766.txt
>>
>>3. User-defined language code (must begin with "x-"
>>
>>Note that attempting to add it to language (as an attribute) produces
> 
> a
> 
>>fist full of errors.
>>
>>Since xml:lang should be sufficient, deleted all the osisLanguage
> 
> stuff
> 
>>that we added for ISO 639 and the like. If need more, use IETF or
> 
> x-/X-.
> 
>>(does that work for everybody? I somehow screwed up the syntax and had
>>to redo the edits with periodic validation. Really tired so we can
>>revisit tomorrow if need be.)
>>
>>osisGenType:
>>
>>Used on subject, osisIdentifier, POS, lemma and morph.
>>
>>Structure: prefix refers to work, followed by any combination of
>>letters, numbers, plus _ (underscore), - (hyphen), . (period), or :
>>(colon). (NameChar)
>>
>>Regex: <xs:pattern
>>value="(((\p{L}|\p{N}|_)+)((\.(\p{L}|\p{N}|_)+)*)+:)(\c)*"/>
>>
>>Other mods:
>>
>>Added base attribute to osisLanguageUsage.
>>
>>
>>Added default value to cell role to be data (can be label or data)
>>
>>
>>Added encoder as note type (needs note in users manual to suppress
> 
> when
> 
>>producing a printed or electronic version for viewing)
>>
>>Well, virus scan about to run so shutting down for the night!
>>
>>Thanks guys!
>>
>>Will poll the congregation again tomorrow!
>>
>>Hope everyone is having a great evening!
>>
>>Patrick
>>
>>
>>
>>
>>--
>>Patrick Durusau
>>Director of Research and Development
>>Society of Biblical Literature
>>Patrick.Durusau@sbl-site.org
>>Chair, V1 - Text Processing: Office and Publishing Systems Interface
>>Co-Editor, ISO 13250, Topic Maps -- Reference Model
>>
>>Topic Maps: Human, not artificial, intelligence at work!
> 
> 
> _______________________________________________
> osis-core mailing list
> osis-core@bibletechnologieswg.org
> http://www.bibletechnologieswg.org/mailman/listinfo/osis-core
> 


-- 
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
Patrick.Durusau@sbl-site.org
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model

Topic Maps: Human, not artificial, intelligence at work!