[sword-devel] for the love of unicode

Chris Little sword-devel@crosswire.org
Fri, 15 Jun 2001 21:52:18 -0700


Bob Pritchett replied to me privately suggesting that we use UTF-8 for
everything and I tend to agree.  Since the UTF-8 encoded characters for
a given module will have relatively close unicode values, there will be
a lot of repeated bytes that will compress very well.  The resulting
compressed modules should be only marginally larger than compressed
versions of our current modules.

So I would suggest that we start moving EVERYTHING over to UTF-8 after
1.5.2 is finished and made public.  It's nice to support multiple
encodings, but much simpler to support just UTF-8.

--Chris