Page MenuHomePhabricator

Add iw_regex field
Closed, DuplicatePublic

Description

As Dantman points out, "Interwiki patterns also do not support generic URLs very well. They are primarily intended for other wikis and hence MediaWiki currently does some encoding of the value after the prefix: in ways that make it unusable for adding new sites generically, for example see what it does to a google: interwiki link." This would present a barrier to using the interwiki table as a spam whitelist for those URLs. https://www.mediawiki.org/wiki/Alternatives_to_applying_the_same_nofollow/dofollow_value_to_all_external_links#Make_more_use_of_interwiki_links

We could add a serialized field to the interwiki table that uses regular expressions to change, say, spaces to plus signs, if that's how the website formats its URLs. I'm not all that skilled with regular expressions, and my use cases are usually pretty simple, so I just use https://www.mediawiki.org/wiki/Extension:InterwikiRegex , which uses str_replace. Regex would be more powerful, though.


Version: 1.23.0
Severity: enhancement
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=15274

Details

Reference
bz57054

Event Timeline

bzimport raised the priority of this task from to Low.Nov 22 2014, 2:39 AM
bzimport set Reference to bz57054.
bzimport added a subscriber: Unknown Object (MLST).

This is not about a problem with database, but I have no idea of a better component. Mabye add 'schema-changes' key word to reflect to need on the database.

bug 15274 is about a problem with + vs. space

  • This bug has been marked as a duplicate of bug 41618 ***