We all make heavy use of web.archive.org and we're expanding it ([[mw:Archived Pages]]), so let's use it also for Etherpad.
Akosiaris tells me the current robots.txt is just the default, so this is IMHO a trivially desirable change.
Hopefully, adding this should be enough (https://webarchive.jira.com/browse/HER-1):
User-agent: ia_archiver
Allow: /
Allow: /p/
But once deployed it's easy to check with their new live-retrieving/on-demand saving feature.
More background from #wikimedia-tech:
akosiaris> [...] I must say etherpad.wikimedia.org never was intended for permanent storage. Preservation of a pad is up to the people interested in preserving that pad in another format. The software is well known to corrupt pads (hopefully the latest issues are resolved with 1.3.0 but we never know when others might show up) and restoring a pad from database backups is neigh to impossible. [...]
Nemo_bis> akosiaris: that's what I'm saying :) if we don't plan to make archives, let's let others do so
Version: unspecified
Severity: enhancement
URL: http://etherpad.wikimedia.org/robots.txt