[sword-devel] testing for diacritics

Peter Von Kaehne refdoc at gmx.net
Fri Aug 28 08:29:53 MST 2015


That is an option, but I do not like it. Reason is that it requires continous maintenance - adding new diacritic characters to the strip filters to expand their range is an ongoing effort. This would mean two places need constant attention. I am trying as much as possible to take the human factor out of module making.

Peter

 

> Gesendet: Freitag, 28. August 2015 um 15:42 Uhr
> Von: "David Troidl" <DavidTroidl at aol.com>
> An: sword-devel at crosswire.org
> Betreff: Re: [sword-devel] testing for diacritics
>
> How about regular expressions:
> 
> Modern Greek Accented
> [\u0370-\u0390 \u03AA-\u03B0 \u03CA-\u03D4]
> 
> Polytonic Greek Accented
> [\u1F00-\u1FFE]
> 
> Hebrew Vowel Points
> [\u05BB-\u05B0]
> 
> Hebrew Cantillation
> [\u0591-\u05AE]
> 
> I don't know about Arabic.
> 
> Peace,
> 
> David
> 
> On 8/28/2015 4:21 AM, Peter von Kaehne wrote:
> > On Fri, 2015-08-28 at 01:27 +0200, Matěj Cepl wrote:
> >> iconv -f utf8 -t us-ascii//translit file.xml \
> >>          |diff -u - file.xml
> > Thanks Matej,
> >
> > This would probably work on latin scripts with diacritics, but not on
> > the scripts I am interested in - Hebrew, Arabic derrived and Greek.
> >
> > Peter
> >
> > _______________________________________________
> > sword-devel mailing list: sword-devel at crosswire.org
> > http://www.crosswire.org/mailman/listinfo/sword-devel
> > Instructions to unsubscribe/change your settings at above page
> 
> 
> ---
> This email has been checked for viruses by Avast antivirus software.
> https://www.avast.com/antivirus
> 
> 
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page



More information about the sword-devel mailing list