Page MenuHomePhabricator

Jenkins: Parser tests fail intermittently due to concurrent I/O access to the same tmpfs file
Closed, DeclinedPublic

Description

Parser tests interminnently fail on mediawiki/core REL1_22.

"filesize(): stat failed for /var/lib/jenkins-slave/tmpfs/mediawiki-core-phpunit-parser/Foobar.svg"

https://gerrit.wikimedia.org/r/#/c/121042/
https://integration.wikimedia.org/ci/job/mediawiki-core-phpunit-parser/21604/console


Version: unspecified
Severity: normal

Details

Reference
bz63106

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 3:03 AM
bzimport set Reference to bz63106.
bzimport added a subscriber: Unknown Object (MLST).

That sounds like a race condition with two jobs running on the same host, one of them deleting the file while the other job attempt to access it later. Looking at the build history, that job had only one instance which dismiss the race condition.

My bet is on a weird disk issue that failed the stat() call, although the file is on a tmpfs (memory disk).

These now use concurrent workspaces, and that directory names is also used for tmpfs.

https://integration.wikimedia.org/ci/job/mediawiki-core-phpunit-parser/28179/console

Workspace:
/srv/ssd/jenkins-slave/workspace/mediawiki-core-phpunit-parser@2

Tmpdir:
/var/lib/jenkins-slave/tmpfs/mediawiki-core-phpunit-parser@2