[sword-devel] French ligatures in Louis SÉGOND’s text
dmsmith555 at yahoo.com
Sun Jul 15 20:08:27 MST 2007
The KJV has ae ligatures. The Webster's dictionary has ligatures but
I don't remember which. There may be other modules.
WRT to the KJV the tradition is to reduce it to one character.
My guess is that alternate representations of ligatures and accented
characters (e.g. umlauts) is locale sensitive.
It is easy enough to use "char map" or some other virtual keyboard to
enter them. Even cut and paste.
IMHO, our software should allow for the text to be robustly
represented and our search flexible enough to find the most
appropriate match. With regard to lucene, this can be done by
additionally indexing normalized words and additionally searching the
On Jul 15, 2007, at 9:31 PM, Leandro GuimarÃ£es Faria Corcete DUTRA
> Chris Little <chrislit at crosswire.org> writes:
>> We could change oe to oe-ligature where appropriate in Louis Segond.
>> That would be simple enough since editions exist online that use
>> oe-ligature correctly.
> Also, it is not that many words using that… cœur, sœur, mœur…
> Is there anyone to do it already, or should I do it?
>> However, since we won't be doing language-specific search tweaks
> That is not what I meant — I mean a general fix, where ligatures at
> the search box would find expanded characters, and vice‐versa.
> Just like
> Google does it, with all kind of European ligatures.
> In the end it is an Unicode question, I guess?
>> since oe-ligature basicallly can't be typed on French keyboards
> Yes, but regardless of keyboards us GNU/Linux users who love
> typography (admittedly a small subset) have it mapped and used it
> quite often.
> More seriously, now that computers are more capable there
> has been a
> slow movement back to fine typography, with Unicode, OpenType and
> even updates
> to venerable Τεχ software helping. And as I see it, it is not a
> language-specific issue. I have no idea of how difficult would be to
> implement proper search semantics, so feel free to ignore me, but
> it would be
>> the change would make these words unsearchable. While the ligature
>> might be
>> helpful to non-native speakers needing cues to pronunciation, I
>> don't think
>> any native French speakers would have any issues with reading the
>> exactly as it is now.
> Indeed, but it is ugly. Just as unaccented uppercase in French was
> considered acceptable in the typewriter age but now is frowned upon.
>> Do you still think the change would be beneficial?
> I do.
>> Do any users who use French as their primary language consider the
>> change an improvement?
> Are there enough of us here for a poll?
> I count myself in in the absence of Portuguese texts…
> +55 (11) 9406 7191 http://br.geocities.com./lgcdutra/
> +55 (11) 5685 2219 gTalk: xmpp:leandrod at jabber.org
> +55 (11) 3040 7300 r151 Yahoo!: ymsgr:sendIM?lgcdutra
> +55 (11) 5686 9607 ICQ/AIM: aim:GoIM?screenname=61287803
> MSN: msnim:chat?contact=leandro at dutra.fastmail.fm
> sword-devel mailing list: sword-devel at crosswire.org
> Instructions to unsubscribe/change your settings at above page
More information about the sword-devel