Page MenuHomePhabricator

Create an __NOINDEX__ equivalent to prevent indexing by internal search engine
Closed, DuplicatePublic

Description

Author: stigmj

Description:
Shouldn't Lucene-search also respect the NOINDEX tag? If not, there should exist another magic word to stop certain pages from being indexed and searchable. Alternatively a "robots.txt" file (maybe a better solution to be able to restrict this exclusion-feature to admins only) in the MediaWiki-namespace should be consulted.


Version: unspecified
Severity: enhancement

Details

Reference
bz24169

Event Timeline

bzimport raised the priority of this task from to Low.Nov 21 2014, 11:06 PM
bzimport added a project: MediaWiki-Search.
bzimport set Reference to bz24169.
bzimport added a subscriber: Unknown Object (MLST).

NOINDEX is primarily to stop google et al. from finding it. If this was to be implemented, I would say it should have a different magic word (or at the very least a config option).

Changing component to search, because if this was to be implemented, it should prevent indexing on any of mediawiki's internal search backends, not just lucene. (not 100% sure compoenent change is right. please revert if not)

rainman wrote:

What is a rationale for a change like this? Why should we have content that cannot be found in any way except from direct link?

stigmj wrote:

Well, one use-case would be in situations like this:
http://no.wikipedia.org/wiki/Wikipedia:Statistikk/1000_mest_bes%C3%B8kte
where we have a large collection of pages with potential searchwords and we don't want to create a new ns...

rd232 wrote:

*** This bug has been marked as a duplicate of bug 22251 ***

I wouldn't want a new magic word. May a config option, if NOINDEX should also apply to internal search would do.