Page MenuHomePhabricator

CirrusSearch: Index non-expanded article content as well
Closed, ResolvedPublic

Description

It'd be useful for folks looking for errors to be able to search non-expanded text. This will take up more hard drive space but we should investigate it. I wonder if we can turn off some of the analyzers and the FVH for this text? After all, if the whole point is to find errors then you don't really want stemming to "help" you. Without stemming we no longer need the FVH's field combination magic.


Version: unspecified
Severity: major

Details

Reference
bz60487

Event Timeline

bzimport raised the priority of this task from to High.Nov 22 2014, 3:06 AM
bzimport added a project: CirrusSearch.
bzimport set Reference to bz60487.

Setting to low priority because we don't have the space right now. Will raise when we can save space/find more hardware.

Change 110604 had a related patch set uploaded by Chad:
Begin indexing unexpanded text forms

https://gerrit.wikimedia.org/r/110604

Change 110604 abandoned by Chad:
Begin indexing unexpanded text forms

Reason:
We can always restore later if need be. I'd like us to go the bug 43652 route anyway.

https://gerrit.wikimedia.org/r/110604

Change 110604 restored by Chad:
Begin indexing unexpanded text forms

https://gerrit.wikimedia.org/r/110604

Change 110604 merged by jenkins-bot:
Begin indexing unexpanded text forms

https://gerrit.wikimedia.org/r/110604

Meringing that isn't really good enough to solve the bug, but its a start.

Resolved by implementing insource: and insource://.