Page MenuHomePhabricator

Flag/exclude events originating from WMF's IP range
Closed, DeclinedPublic

Description

Given the amount of testing happening from WMF IP addresses we should exclude/flag these events so as not to pollute the data.


Version: unspecified
Severity: enhancement

Details

Reference
bz43639

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 22 2014, 1:20 AM
bzimport set Reference to bz43639.

swalling wrote:

To note it from our discussion in IRC:

The only case where we potentially _do_ want WMF IP events collected is on test and test2. So if it's not a pain, excluding all but those would be one of acceptable solutions AFAIK.

The potential skews from bugs that we're not able to diagnose as a result of being in a special-cased IP block seems larger than the skew from having our test events contaminate the data, so I don't want to filter out WMF-generated events altogether. Flagging them should be doable. We need to find out our IP range. Steven, Dario: can one of you check? I think James F. blocked the IP range from anon-editing Wikipedia, so if you look at current blocks you should be able to get the correct range from there.

swalling wrote:

(In reply to comment #2)

The potential skews from bugs that we're not able to diagnose as a result of
being in a special-cased IP block seems larger than the skew from having our
test events contaminate the data, so I don't want to filter out WMF-generated
events altogether. Flagging them should be doable. We need to find out our IP
range. Steven, Dario: can one of you check? I think James F. blocked the IP
range from anon-editing Wikipedia, so if you look at current blocks you
should
be able to get the correct range from there.

He only blocked on MediaWiki.org, IIRC.

(In reply to comment #4)

He only blocked on MediaWiki.org, IIRC.

Are you able to see the ranges? If so, could you paste them here or (if they are sensitive) e-mail them to me?

I think the solution to this might simply be to provide a good opt-out mechanism.

[moving from MediaWiki extensions to Analytics product - see bug 61946]

We've lived without this since 2013, so I'm declining this.