[sword-devel] conf utf-8

Adrian Korten adrian_korten at sil.org
Mon Feb 14 19:36:23 MST 2005


Thanks for explaining this with the proper terminology. The explanation 
agrees with my understanding as received from others. I think too that 
it can be read and thrown away -- but it should be handled.


Chris Little wrote:
> Troy A. Griffitts wrote:
>>     My guess about the characters which keep the .conf file from being 
>> recognized... try adding a few newlines to the beginning of the file.  
>> I would guess that XXX[Section Name] at the beginning is just causing 
>> our .conf reader to not recognize the "Section Name".
> The three characters are the Unicode byte-order mark (BOM). See 
> http://www.unicode.org/faq/utf_bom.html#BOM for full details. But, 
> basically, it's the codepoint U+FEFF, encoded at the beginning of a 
> file. From this character, you can tell whether you have UTF-16 
> big-endian, UTF-16 little-endian, or UTF-8.
> I would recommend we go ahead and support it (to the extent that we 
> check for it and throw it away) since it's not something that just 
> notepad adds to file. (No need to fix before the trip, though, I think.)
> --Chris
> _______________________________________________
> sword-devel mailing list
> sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel

More information about the sword-devel mailing list