[sword-devel] Search bug & New Arabic Bible, Not Shaped SVD Version
dmsmith at crosswire.org
Mon Dec 10 12:37:34 MST 2012
IIRC, the StandardAnalyzer that SWORD uses doesn't allow for that. It has its own handling of the punctuation that is fixed. I've said before, the analyzer is only good for English like languages.
On Dec 10, 2012, at 11:17 AM, David Haslam <dfhmch at googlemail.com> wrote:
> There are some languages in which the apostrophe is used a letter of the
> alphabet rather than an item of punctuation.
> e.g. Somali, in which the apostrophe represents the /Alef/.
> See http://en.wikipedia.org/wiki/Somali_alphabet
> Guessing that our Lucene indexing method generally strips out such
> punctuation marks, it would be a useful enhancement in SWORD to be able to
> specify in the conf file that a particular punctuation mark should be parsed
> as a letter, such that the search index would then include the words
> containing this letter.
> PS. There is a related issue in the SomKQA module that I'm researching with
> the providers of the source text.
> It's conceivable that all the single right quotation marks should really be
> Their inclusion in the text may easily have been due to an artifact of their
> original editing environment.
> View this message in context: http://sword-dev.350566.n4.nabble.com/Re-Search-bug-New-Arabic-Bible-Not-Shaped-SVD-Version-tp4651330p4651383.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
> sword-devel mailing list: sword-devel at crosswire.org
> Instructions to unsubscribe/change your settings at above page
More information about the sword-devel