[sword-devel] ChiUns module contains superfluous spaces (ANSI 32)

Leo So leoso at usa.net
Wed Oct 20 09:38:01 MST 2010


Agree. Those ANSI space character in both simplified and traditional versions should be gone.

Immanuel,
leo.
--
  Leo So <leoso at usa.net> http://leo4jc.spaces.live.com
  Jesus said, "Therefore do not be anxious, saying, 'What shall
  we eat?' or 'What shall we drink?' or 'What shall we wear?'
  But seek first the kingdom of God and His righteousness, and
  all these things will be added to you." (Matt 6:31,33)




From: David Haslam 
Sent: Wednesday, October 20, 2010 8:08 AM
To: sword-devel at crosswire.org 
Subject: [sword-devel] ChiUns module contains superfluous spaces (ANSI 32)



While displaying the simplified Chinese Union Version module ChiUns using
Xiphos, I have observed that there is at least one ordinary ANSI space
between each CJK ideogram.

This is confirmed as being module content, rather than a rendering artefact.

I exported the contents using MOD2IMP, and then removed all XML tags (e.g.
for Strongs & Morph) from a copy of the raw IMP text file.

It has many such spaces on every line.  e.g.

$$$Genesis 1:1
起 初  ,   神  创 造   天  地  。
$$$Genesis 1:2
地  是   空 虚  混 沌  , 渊  面  黑 暗  ;   神  的 灵  运 行   在 水  面  上  。
$$$Genesis 1:3
  神  说   : 要 冇   光  , 就 冇 了  光  。
$$$Genesis 1:4
  神  看   光  是 好 的  , 就 把 光  暗  分 幵 了   。
$$$Genesis 1:5
  神  称   光  为 昼  , 称   暗  为 夜  。 冇  晚 上  , 冇  早 晨  , 这 是 头 一  日  。
The only spaces that should be in any Chinese text are the ideographic space
U+3000, 
as the general style is for the CJK ideograms to appear as on a grid
pattern. 
cf. The ideographic space is largely used to precede the Divine Name.  i.e.
"  神"

The ordinary spaces probably originated from the preprocessing used to add
tags for Strongs & Morph markup.
They shouldn't be there.  The same passage should appear like this:

$$$Genesis 1:1
起初 神创造天地。
$$$Genesis 1:2
地是空虚混沌,渊面黑暗; 神的灵运行在水面上。
$$$Genesis 1:3
 神说:“要有光。”就有了光。
$$$Genesis 1:4
 神看光是好的,就把光暗分开了。
$$$Genesis 1:5
 神称光为“昼”,称暗为“夜”。有晚上,有早晨,这是头一日。
I imagine the same symptom occurs for the traditional Chinese Union module,
and for our other Chinese modules, though I haven't yet researched this.

I know that Chris intends eventually to remake these modules from a better
text source (see JIRA bugs for modules), yet I thought this aspect warrants
a wider understanding.

btw.  I'm not an expert on Chinese, but I have acquired quite a few Chinese
friends and contacts in the course of my work on Go Bible.

David Haslam




-- 
View this message in context: http://sword-dev.350566.n4.nabble.com/ChiUns-module-contains-superfluous-spaces-ANSI-32-tp3004072p3004072.html
Sent from the SWORD Dev mailing list archive at Nabble.com.

_______________________________________________
sword-devel mailing list: sword-devel at crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20101020/6900f185/attachment-0001.html>


More information about the sword-devel mailing list