[sword-devel] Dotted and dotless I in Turkish & Azerbaijani, etc.

David Haslam d.haslam at ukonline.co.uk
Sat Feb 20 02:58:44 MST 2010


Please first read this article.

http://en.wikipedia.org/wiki/Dotted_and_dotless_I

Languages such as Turkish and Northern Azeri have both a dotted and a dotless
letter I in their Latin-based alphabets.

This has implications for letter case.
Such alphabets break the familiar case relationship between uppercase I and
lowercase i.
Instead, they have the following uppercase and lowercase pairs:

I and ı  
İ and i
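
To make the difference concrete, here is a minimal Python sketch (not SWORD
code; the helper names turkic_lower and turkic_upper are made up for
illustration). Python's built-in str.lower() and str.upper() are not
locale-aware, so they follow the default Unicode mappings rather than the
Turkic ones. The helpers below special-case the two pairs above. Input is
normalized to NFC first so that a decomposed "I" + combining dot is treated
as İ. The per-character mapping ignores context-sensitive rules such as Greek
final sigma, which I assume does not matter for Turkish or Azerbaijani text.

```python
import unicodedata

# Turkic case pairs: I <-> ı (U+0131) and İ (U+0130) <-> i.
_TURKIC_LOWER = {"I": "\u0131", "\u0130": "i"}
_TURKIC_UPPER = {"\u0131": "I", "i": "\u0130"}


def turkic_lower(text: str) -> str:
    """Lowercase text using the Turkic I mappings (illustrative helper)."""
    text = unicodedata.normalize("NFC", text)
    return "".join(_TURKIC_LOWER.get(ch, ch.lower()) for ch in text)


def turkic_upper(text: str) -> str:
    """Uppercase text using the Turkic I mappings (illustrative helper)."""
    text = unicodedata.normalize("NFC", text)
    return "".join(_TURKIC_UPPER.get(ch, ch.upper()) for ch in text)


if __name__ == "__main__":
    # Default mappings: I -> i, and i -> I (the dotted/dotless pairing is lost).
    print("I".lower(), "i".upper())
    # Turkic mappings: I -> ı, and i -> İ.
    print(turkic_lower("I"), turkic_upper("i"))
```

The point of the sketch is only that a generic, locale-independent case
mapping gives the wrong pairs for these languages, so the mapping has to be
chosen according to the language of the text.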

Questions: 
Does sword_icu properly address this in terms of case folding? 
How does each front-end application address these issues, e.g. in
case-insensitive searches?
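
As a rough illustration of what is at stake for searching, here is a small
Python sketch of case-insensitive matching with an optional Turkic fold. It
is not how sword_icu or any front-end works (that is the open question); the
names fold and contains_ci and the turkic flag are hypothetical. The Turkic
branch applies the two special mappings (I to ı, İ to i) before the default
Unicode case folding done by str.casefold().

```python
import unicodedata

# Turkic special folds: I -> ı (U+0131), İ (U+0130) -> i.
_TURKIC_FOLD = str.maketrans({"I": "\u0131", "\u0130": "i"})


def fold(text: str, turkic: bool = False) -> str:
    """Return a case-folded key for comparison (illustrative helper)."""
    text = unicodedata.normalize("NFC", text)
    if turkic:
        # Apply the Turkic I mappings before the default fold.
        text = text.translate(_TURKIC_FOLD)
    return text.casefold()


def contains_ci(haystack: str, needle: str, turkic: bool = False) -> bool:
    """Case-insensitive substring test using the chosen fold."""
    return fold(needle, turkic) in fold(haystack, turkic)


if __name__ == "__main__":
    # Turkish text: only the Turkic fold matches the two spellings.
    print(contains_ci("DİYARBAKIR", "diyarbakır"))               # False
    print(contains_ci("DİYARBAKIR", "diyarbakır", turkic=True))  # True
    # English text: the Turkic fold breaks the match instead.
    print(contains_ci("TITLE", "title"))                         # True
    print(contains_ci("TITLE", "title", turkic=True))            # False
```

If this sketch is representative, neither fold is right for every module:
the choice would have to depend on the language of the module being
searched.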

Note that we already have two Turkish Bible modules, and work is about to
start on a Bible module for Northern Azeri.

Working on the Go Bible for the Azerbaijani translation is how I became
aware of this issue.

David


-- 
View this message in context: http://n4.nabble.com/Dotted-and-dotless-I-in-Turkish-Azerbaijani-etc-tp1562704p1562704.html
Sent from the SWORD Dev mailing list archive at Nabble.com.


