[sword-devel] usfm2osis.py

Chris Little chrislit at crosswire.org
Sun Aug 5 17:19:32 MST 2012



On Aug 5, 2012, at 11:37 AM, David Haslam <dfhmch at googlemail.com> wrote:

> FWIW, I just came across this Python Regular Expression Testing Tool:
> http://www.pythonregex.com/
> 
> Does Python support the full 21-bit Unicode range?
> 
> cf. Many other regular expression engines only support the Basic
> Multilingual Plane.
> 

Yes, Python regex supports non-BMP characters. The language tag characters are in Plane 14 (the Tags block, U+E0000 to U+E007F), I believe. An engine that supports only the BMP can't be said to support Unicode and is probably just processing bytes.
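
A quick way to check this is the sketch below. It assumes Python 3.3 or later, where a str is a sequence of code points; on the narrow Python 2 builds that were common in 2012, non-BMP characters were stored as surrogate pairs, so results may differ there. The function name strip_tag_chars and the sample strings are made up for illustration and are not taken from usfm2osis.py.

```python
import re

# Character class covering the Plane 14 Tags block (U+E0000 to U+E007F).
# The \U escapes are resolved by the string literal, so the pattern
# contains the actual non-BMP characters.
TAG_CHARS = re.compile("[\U000E0000-\U000E007F]+")


def strip_tag_chars(text):
    """Remove any run of Plane 14 tag characters from text (hypothetical helper)."""
    return TAG_CHARS.sub("", text)


# U+E0001 LANGUAGE TAG followed by U+E0065 TAG LATIN SMALL LETTER E,
# embedded in ordinary ASCII text.
sample = "abc\U000E0001\U000E0065def"
match = TAG_CHARS.search(sample)

print(match.span())             # offsets are counted in code points
print(strip_tag_chars(sample))  # the ASCII text with the tag characters removed

# A single non-BMP character is matched by "." as one character.
print(re.fullmatch(".", "\U0001F600") is not None)
```

If the engine were working on bytes or on 16-bit code units instead, the character class would not behave as a range over Plane 14 and "." would match only part of the character.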

--Chris



