[sword-devel] Den Norsk Bibelen - possible digitization artefacts (-)
    David Haslam 
    d.haslam at ukonline.co.uk
       
    Sat Jul 12 11:07:44 MST 2008
    
    
  
I have just discovered that there are 1286 occurrences of the token (-) in
the text of the Norwegian Bible module Den Norsk Bibelen.
I suspect that these are artefacts of a file format conversion performed
since the original digitization.
I wonder whether they should all be replaced by the codepoint for an ndash
or an mdash, or even a horizontal bar ?
If someone has access to a printed copy of Den Norsk Bibelen (1906 / 1930),
perhaps they could advise what these should be?
cf. The text of the Unbound Bible project file for Det Norsk Bibelselskap
(1930) just has a hyphen in these places.
-- David
-- 
View this message in context: http://www.nabble.com/Den-Norsk-Bibelen---possible-digitization-artefacts-%28-%29-tp18422148p18422148.html
Sent from the SWORD Dev mailing list archive at Nabble.com.
    
    
More information about the sword-devel
mailing list