Search should support searching all old revisions of the wiki pages
Closed, Declined (Public)

Description

Author: anon.hui

Description:
MWSearch should support searching all old revisions of the wiki pages.

This will help the task of investigating the historical pages.


Version: unspecified
Severity: enhancement

Details

Reference
bz20784

Event Timeline

bzimport raised the priority of this task to Lowest. (Nov 21 2014, 10:47 PM)
bzimport added a project: CirrusSearch.
bzimport set Reference to bz20784.
bzimport added a subscriber: Unknown Object (MLST).

anon.hui wrote:

Lucene-search should support searching all old revisions of the wiki pages.

Marking this one 'later'; it would depend on having the general infrastructure for full-history search in place, both in the UI and in index updating (bug 10643).

As an interim solution, you can use Diffdb (https://github.com/whym/diffindexer) and Wikihadoop (https://github.com/whym/wikihadoop) for this. Contact me if you want to know more.

[Removing RESOLVED LATER as discussed in
http://lists.wikimedia.org/pipermail/wikitech-l/2012-November/064240.html .
Reopening and setting priority to "Lowest".
For future reference, please use either RESOLVED WONTFIX (for issues that will
not be fixed), or simply set lowest priority. Thanks a lot!]

[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]

We're migrating from Lucene to CirrusSearch. Moving this bug accordingly.

This feature probably makes the most sense on a per-page basis, e.g. the user story "As an editor, I want to be able to search the history of a specific page so I can find out whether certain information was added to the page and later removed."

Also, title fix.

Actually the more I think about it, this is going to be a WONTFIX for MediaWiki & Cirrus. Search is inherently about finding information that's currently on the wiki. Plus storing all the revisions of the page would basically double our storage requirements for what's essentially a power-user feature.

For history searching, we have dumps.
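For anyone who wants to try the dump route, here is a rough sketch of what scanning a full-history XML dump for a phrase could look like. It is only an illustration: the function name `search_history`, the helper `local`, and the tiny `SAMPLE` document are made up for this example, and it assumes the usual export layout of `<page>` elements containing `<title>` and a series of `<revision>` elements with `<id>`, `<timestamp>` and `<text>` children. A real dump is far larger and is normally compressed, so you would stream it from a decompressor rather than from a string.

```python
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for a full-history dump (hypothetical data, not real wiki content).
SAMPLE = """<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <id>1</id>
    <revision>
      <id>10</id>
      <timestamp>2009-01-01T00:00:00Z</timestamp>
      <contributor><username>A</username><id>5</id></contributor>
      <text>The cat sat.</text>
    </revision>
    <revision>
      <id>11</id>
      <timestamp>2009-02-01T00:00:00Z</timestamp>
      <text>The dog sat.</text>
    </revision>
  </page>
</mediawiki>"""


def local(tag):
    """Strip the XML namespace, e.g. '{ns}revision' -> 'revision'."""
    return tag.rsplit("}", 1)[-1]


def search_history(dump, needle):
    """Yield (title, revision_id, timestamp) for revisions whose text contains needle."""
    title = None
    # iterparse streams the file, so the whole dump never sits in memory at once.
    for _event, elem in ET.iterparse(dump, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "revision":
            rev_id = timestamp = None
            text = ""
            # Only direct children are inspected, so the contributor's <id> is ignored.
            for child in elem:
                child_name = local(child.tag)
                if child_name == "id":
                    rev_id = child.text
                elif child_name == "timestamp":
                    timestamp = child.text
                elif child_name == "text":
                    text = child.text or ""
            if needle in text:
                yield title, rev_id, timestamp
            elem.clear()  # release the revision text once it has been checked
        elif name == "page":
            elem.clear()


if __name__ == "__main__":
    for hit in search_history(io.BytesIO(SAMPLE.encode("utf-8")), "cat"):
        print(hit)
```

This is a plain substring scan, not an index, so every query rereads the whole dump. That is workable for one-off investigations of a page's history but is no substitute for real search.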

> essentially a power-user feature.

power users don't matter?