[bt-devel] Fwd: Re: clucene crash when searching

Martin Gruner mg.pub at gmx.net
Tue Nov 18 10:21:03 MST 2008


Hi Eeli.

Unfortunately I do not agree with you, as you may expect.

Regarding Sword, there is no work going on for the search engine(s), and the 
cluene implementation of Sword has (at least) the same problems our 
implementation has. I second your attitude of collaboration in Sword, but am 
still pessimistic about the speed of its development.

I do not see that we would gain much by adding support of Sword's non-indexed 
search engines, except for the ability to search for phrases.

Searching in BT should be simple and consistent. That means that we should 
not, in my opinion, offer different search syntaxes to the user. Maybe one 
exception: a regexp-based search for power users, but all "normal" users 
should have one single search to work with (from a user's point of view).

Using a search engine that works with modern, index-based technologies is the 
proper way to go.

> > The problem is that CLucene is almost unmaintained and crashes on certain
> > kinds of systems (got reports about crashes on BSD for example). Java
> > Lucene is much, much better. We've hoped that CLucene developes like
> > JLucene, but sadly that didn't work that way...
>
> Ok. That's one more reason to support many engines.

No. It is one reason to choose the search engine wisely.

My suggestion would be to talk about the search engine we do use, clucene. I 
just checked - they released a bugfix 0.9.21 version recently, and 0.9.23, 
which is a beta-quality preview release of their next development branch, 
which is supposed to improve Lucene compatibility/feature coverage. Ben also 
told me that he was going to implement the wildcard operator in the beginning 
of words (like "*minded").
But nobody can say how long this will take. So we may want to use another open 
source search engine which suits our needs better.

We could start a wiki page listing the specific problems that we see with 
clucene, and investigate if they can be solved. At the same time we can 
collect information about other search engines in a matrix of 
features/properties that we do need. Maybe we come up with something better, 
more stable and feature-rich than clucene?

A major problem that I see: What about our release roadmap? We should not 
start changing the search engine in the 1.7.x branch/release cycle. I'm 
unhappy with the status quo, we cannot stay in beta state for a long time and 
continue changing the internals of our software. We should release FIRST, and 
THEN start making major changes.

What do you think, Eeli, and all others?

God bless,

mg





More information about the bt-devel mailing list