Page MenuHomePhabricator

1st search result does not have the word I entered into the search box
Closed, ResolvedPublic

Description

Author: michaelk

Description:
Do a search for 'commodity' at en.wikipedia.org.

The first two results are:

1: Commode
2: Commodity.

The article for 'Commode' doesn't even have the word 'commodity' in it, but it's
rated at 100% relevancy (the page hasn't been edited for nearly a month).
'Commodity' should come up first and be rated at 100% relevancy.


Version: unspecified
Severity: minor

Details

Reference
bz2511

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 21 2014, 8:35 PM
bzimport set Reference to bz2511.
bzimport added a subscriber: Unknown Object (MLST).

jeluf wrote:

Should this be filed against Lucene?

It looks like a stemming issue.

rainman wrote:

Fixed in Lucene Search 2. Original words are indexes alongside with stemmed words.
The queries are rewritten to have original words with higher boost, and stemmed words with lower boost. This ensures that the unstemmed words are preferred.

[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]