Page MenuHomePhabricator

[trebuchet] salt commands exceed timeout
Closed, ResolvedPublic

Description

deployment-bastion has a bunch of idling python commands. When running puppet I noticed some timeout being exceeded:

Error: /Stage[main]/Role::Salt::Minions/Salt::Grain[instanceproject]/Exec[ensure_instanceproject_deployment-prep]/unless: Check "/usr/local/sbin/grain-ensure contains instanceproject deployment-prep" exceeded timeout

Error: /Stage[main]/Deployment::Deployment_server/Salt::Grain[deployment_repo_user]/Exec[ensure_deployment_repo_user_trebuchet]/unless: Check "/usr/local/sbin/grain-ensure contains deployment_repo_user trebuchet" exceeded timeout

Error: /Stage[main]/Role::Trebuchet/Salt::Grain[trebuchet_master]/Exec[ensure_trebuchet_master_deployment-bastion.eqiad.wmflabs]/unless: Check "/usr/local/sbin/grain-ensure contains trebuchet_master deployment-bastion.eqiad.wmflabs" exceeded timeout

...

Same happens on deployment-salt:

64 python /usr/local/sbin/grain-ensure contains instanceproject deployment-prep
62 python /usr/local/sbin/grain-ensure contains trebuchet_master deployment-bastion.eqiad.wmflabs

Might be similar as bug 70858 "ferm policy on deployment-bastion prevents scap rsync from mw hosts"


Version: unspecified
Severity: normal
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=70858

Details

Reference
bz70863

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 22 2014, 3:44 AM
bzimport set Reference to bz70863.
bzimport added a subscriber: Unknown Object (MLST).

Antoine: is this still happening?

greg raised the priority of this task from Medium to Needs Triage.Nov 25 2014, 9:39 PM
greg set Security to None.
hashar claimed this task.

deployment-salt and deployment-bastion have long uptime and no idling grain-ensure process.

I originally referenced a ferm rule issue: T72858 , was most probably related.