
Error mails from SGE are encoded as application/octet-stream
Closed, Resolved, Public

Description

Per http://permalink.gmane.org/gmane.comp.clustering.opengridengine.user/7134:

This is done by your MTA (AFAICS not all are doing it), as
it detects a "0-backspace-INFINITY" and encodes the
"backspace". What can be done: remove the 0-backspace in a
wrapper and set "mailer" in SGE's configuration to this:

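#!/bin/bash
# SGE calls the mailer as "mailer -s <subject> <recipient>", so $2 is the subject and $3 the recipient.
sed "s/0"$'\b'"INFINITY/INFINITY/g" | mail -s "$2" "$3"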
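
To see what the filter does without sending any mail, the same substitution can be run on a sample line. This is only a sketch: the function name strip_backspace and the sample text are made up for illustration, and the backspace is produced with printf instead of $'\b' so that it also works in a plain POSIX shell.

```shell
#!/bin/sh
# Hypothetical helper; the wrapper above pipes the same sed output into mail(1).
bs=$(printf '\b')   # a literal backspace character (0x08)

strip_backspace() {
    sed "s/0${bs}INFINITY/INFINITY/g"
}

# Made-up sample line that imitates the 0-backspace-INFINITY pattern.
printf 'h_rt 0\bINFINITY\n' | od -c                    # before: contains \b
printf 'h_rt 0\bINFINITY\n' | strip_backspace | od -c  # after: no \b left
```

Lines that do not contain the pattern pass through unchanged, so the filter should be safe to apply to every mail body.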

So:

  • install /usr/local/bin/gridengine-mailer on all exec nodes, master and shadow,
  • set "mailer" in master config,
  • test.
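
The "test" step could start with a quick check that the configuration points at the wrapper and that the wrapper is executable on the host. A minimal sketch, assuming that qconf -sconf prints the global configuration as one "name value" pair per line; both function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read configuration text on stdin, print the mailer value.
mailer_from_conf() {
    awk '$1 == "mailer" { print $2 }'
}

# Hypothetical check: succeeds if the configuration text on stdin
# names a mailer that exists and is executable on this host.
check_mailer() {
    path=$(mailer_from_conf)
    [ -n "$path" ] && [ -x "$path" ]
}

# On a grid host this might be used as (assumes qconf is on PATH):
#   qconf -sconf | check_mailer && echo "mailer wrapper in place"
```

This only verifies the installation; whether the mails arrive as plain text still has to be checked by letting a job send one.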

Event Timeline

bzimport raised the priority of this task to Needs Triage. (Nov 22 2014, 2:57 AM)
bzimport added a project: Toolforge.
bzimport set Reference to bz61160.

Done that; needs puppetization now.

scfc triaged this task as Low priority. (Feb 21 2015, 5:07 AM)
scfc updated the task description.
scfc set Security to None.

I've added:

# Debugging to see on which instances mailer is called by whom; T63160.  --scfc
id -u > $(mktemp /var/tmp/gridengine-mailer-called.XXXXXXX)

and deployed the script on all instances to see whether exec, master and shadow hosts are "enough". The script was also missing on tools-exec-20 and similar hosts, so the debugging will show whether that had any negative effects.
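
Once some jobs have sent mail, the marker files left by that debugging line can be collected on each instance. A minimal sketch; the function name and the optional directory argument are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: list each marker file together with the uid stored in it.
list_mailer_calls() {
    dir=${1:-/var/tmp}
    for f in "$dir"/gridengine-mailer-called.*; do
        [ -e "$f" ] || continue   # the glob matched nothing
        printf '%s uid=%s\n' "$f" "$(cat "$f")"
    done
}

list_mailer_calls
```

An instance that prints nothing has not called the mailer since the debugging line was deployed.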

The debugging showed that it was indeed called (only) on tools-exec-* and tools-master. To be on the safe side, generalize those to "all hosts that execute jobs" and "all hosts that master the grid engine".

Change 203656 had a related patch set uploaded (by Tim Landscheidt):
gridengine: Puppetize gridengine-mailer

https://gerrit.wikimedia.org/r/203656

Change 203656 merged by Yuvipanda:
gridengine: Puppetize gridengine-mailer

https://gerrit.wikimedia.org/r/203656