Page MenuHomePhabricator

Raw webrequest partitions for 2014-10-20T10/1H not marked successful
Closed, ResolvedPublic

Description

The bits partition [1] for 2014-10-20T02/1H has not been marked
successful.

What happened?

[1]


qchris@stat1002 jobs: 0 time: 09:04:05 // exit code: 0
cwd: ~
~/cluster-scripts/dump_webrequest_status.sh

+------------------+--------+--------+--------+--------+
| Date             |  bits  | mobile |  text  | upload |
+------------------+--------+--------+--------+--------+

[...]

| 2014-10-20T08/1H |    .   |    .   |    .   |    .   |    
| 2014-10-20T09/1H |    .   |    .   |    .   |    .   |    
| 2014-10-20T10/1H |    X   |    .   |    .   |    .   |    
| 2014-10-20T11/1H |    .   |    .   |    .   |    .   |    
| 2014-10-20T12/1H |    .   |    .   |    .   |    .   |

[...]

+------------------+--------+--------+--------+--------+

Statuses:

. --> Partition is ok
M --> Partition manually marked ok
X --> Partition is not ok (duplicates, missing, or nulls)

Version: unspecified
Severity: normal
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=72252

Details

Reference
bz72295

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 3:52 AM
bzimport set Reference to bz72295.
bzimport added a subscriber: Unknown Object (MLST).

The affected period is 10:37:16--10:37:20, which nicely matches the
manual kafka leader re-election from 10:38.

Mismatching data is minimal:

+----------------------------+-----------+--------------+
| Host                       | # missing | # duplicates |
+----------------------------+-----------+--------------+
| cp3019.esams.wikimedia.org |         0 |          252 |
| cp3020.esams.wikimedia.org |         0 |          252 |
| cp3021.esams.wikimedia.org |       525 |            0 |
| cp4004.ulsfo.wmnet         |       220 |            0 |
+----------------------------+-----------+--------------+

Total worth of mismatched data <<1 second.