[sword-devel] Comming soon: new improved sword searching

Chris Little sword-devel@crosswire.org
Sun, 8 Sep 2002 13:12:03 -0700 (MST)


On Sun, 8 Sep 2002, Jerry Hastings wrote:

> At 12:48 AM 9/9/2002 +0800, Leon Brooks wrote:
> 
> >All verses containing two or more of God, Good or Greed: (g[ore]*d){2,}
> 
> I don't believe that gives the desired result. At least not in BibleCS. I 
> don't know if that is a front end issue or a lib issue. In the past I have 
> found that parentheses do not work as expected, if at all. I have not 
> looked at this for some time though. If someone can show me that they do 
> work, I will document these advanced features in the BibleCS helps.

FWIW, we need to upgrade our regexp engine.  The current one (from GNU) 
has a couple of problems that I was aware of.  First it is GPL--this is 
the last GPL component in the library.  If it were replaced with something 
else, we could license Sword under non-GPL licenses to other entities 
(e.g. Bible societies that don't want to deal with GPL's restrictions) or 
put it out publicly under a license that we write that better meets our 
needs than the GPL.  Second (and probably more immediately important) it 
doesn't handle UTF-8.

Perl Regexp fixes both of these problems.  We can use it under the
Artistic License & it's got nice UTF-8 support built in.  I nominate
switching to this as a .5 priority. :)

If there are other quirks in the GNU Regexp implementation like you
mention, we can pray that Perl Regexp fixes those also.

--Chris