# Copyright (c) 2004 The Trustees of Princeton University (Trustees).
# Faiyaz Ahmed <faiyaza@cs.princeton.edu>
# $Id: emailTxt.py,v 1.10 2007/08/29 17:26:50 soltesz Exp $
# This file contains the texts of the automatically generated
# emails sent to techs and PIs.
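# Illustrative sketch only, not part of the original module: every template
# in this file is a (subject, body) tuple whose strings carry %(name)s
# placeholders, so a caller can presumably fill both with one dictionary.
# The helper name `render_template` and the sample keys below are
# hypothetical; the real sender may format the templates differently.
def render_template(template, values):
    """Return (subject, body) with the %(name)s placeholders filled in."""
    subject, body = template
    # A missing key raises KeyError, and %(site_id)d-style placeholders need
    # a number, so formatting problems surface before any mail is sent.
    return subject % values, body % values

# Hypothetical usage with a cut-down template of the same shape:
#   example = ("Node %(hostname)s down.", "%(hostname)s down for %(days)s days.")
#   render_template(example, {"hostname": "node1.example.org", "days": 3})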
newdown_one=("""PlanetLab node(s) down: %(loginbase)s""",

As part of PlanetLab node monitoring, we noticed the following nodes were down at your site:

We're writing because we need your help returning them to their regular operation.

To help, please confirm that a recent BootCD (version 3.0 or greater) is installed in the machine. Then, after checking that the node is properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we are seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network. It may take several minutes before CoMon registers your node. Until that time, visiting the link below will return an 'Internal Server Error'.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=nodes/table_%(hostname)s&limit=50

If the machine has booted successfully, you may check it more quickly by logging in with your site_admin account, and running:

If you have a BootCD older than 3.0, you will need to create a new BootCD and configuration file. You can find instructions for this in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If, after following these directions, you find your machine reported by CoMon, there is no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

After a week, we will disable your site's ability to create new slices. Because this action will directly affect your site's registered PI, we will also CC the PI for help at that time.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
newdown_two=("""PlanetLab node(s) down: %(loginbase)s""",

As part of PlanetLab node monitoring, we noticed the following nodes were down at your site:

We're writing again because our previous correspondence, sent only to the registered Technical Contact, has gone unacknowledged for at least a week, and we need your help returning these machines to their regular operation. We understand that machine maintenance can take time. So, while we wait for the machines to return to their regular operation, slice creation has been suspended at your site. No new slices may be created, but the existing slices and services running within them will be unaffected.

To help, please confirm that a recent BootCD (version 3.0 or greater) is installed in the machine. Then, after checking that the node is properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we are seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network. It may take several minutes before CoMon registers your node.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=nodes/table_%(hostname)s&limit=50

If the machine has booted successfully, you may check it more quickly by logging in with your site_admin account, and running:

If you have a BootCD older than 3.0, you will need to create a new BootCD and configuration file. You can find instructions for this in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If, after following these directions, you find your machine reported by CoMon, there is no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

After another week, we will disable all slices currently running on PlanetLab. Because this action will directly affect all users of these slices, these users will also be notified at that time.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
newdown_three=("""PlanetLab node(s) down: %(loginbase)s""",

As part of PlanetLab node monitoring, we noticed the following nodes were down at your site:

We understand that machine maintenance can take time. We're writing again because our previous correspondence, sent first to the registered Technical Contact and then to the Site PI, has gone unacknowledged for at least two weeks, and we need your help returning these machines to their regular operation. This is our third attempt to contact someone regarding these machines at your site. So, while we wait for the machines to return to their regular operation, all current slice activity will be suspended. Current experiments will be stopped and will not be able to start again until there is evidence that you have begun to help with the maintenance of these machines.

To help, please confirm that a recent BootCD (version 3.0 or greater) is installed in the machine. Then, after checking that the node is properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we are seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network. It may take several minutes before CoMon registers your node.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=nodes/table_%(hostname)s&limit=50

If the machine has booted successfully, you may check it more quickly by logging in with your site_admin account, and running:

If you have a BootCD older than 3.0, you will need to create a new BootCD and configuration file. You can find instructions for this in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If, after following these directions, you find your machine reported by CoMon, there is no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
newbootcd_one=("""PlanetLab nodes need a new BootCD: %(loginbase)s""", # : %(hostname)s""",
"""As part of PlanetLab node monitoring, we noticed the following nodes have an outdated BootCD:

This usually implies that you need to update the BootCD and node configuration file stored on the read-only media (either the all-in-one ISO CD, floppy disk, or write-protected USB stick).

To check the status of these and any other machines that you manage, please visit:

http://comon.cs.princeton.edu/status

Instructions for the steps necessary for a BootCD upgrade are available in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If your node returns to normal operation after following these directions, then there's no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

After a week, we will disable your site's ability to create new slices. Because this action will directly affect your site's registered PI, we will also CC the PI for help at that time.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
newbootcd_two=("""PlanetLab nodes need a new BootCD: %(loginbase)s""", # : %(hostname)s""",
"""As part of PlanetLab node monitoring, we noticed the following nodes have an outdated BootCD:

This usually implies that you need to update the BootCD and node configuration file stored on the read-only media (either the all-in-one ISO CD, floppy disk, or write-protected USB stick).

We're writing again because our previous correspondence, sent only to the registered Technical Contact, has gone unacknowledged for at least a week, and we need your help returning these machines to their regular operation. We understand that machine maintenance can take time. So, while we wait for the machines to return to their regular operation, slice creation has been suspended at your site. No new slices may be created, but the existing slices and services running within them will be unaffected.

To check the status of these and any other machines that you manage, please visit:

http://comon.cs.princeton.edu/status

Instructions for the steps necessary for a BootCD upgrade are available in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If your node returns to normal operation after following these directions, then there's no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

After another week, we will disable all slices currently running on PlanetLab. Because this action will directly affect all users of these slices, these users will also be notified at that time.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
newbootcd_three=("""PlanetLab nodes need a new BootCD: %(loginbase)s""", # : %(hostname)s""",
"""As part of PlanetLab node monitoring, we noticed the following nodes have an outdated BootCD:

This usually implies that you need to update the BootCD and node configuration file stored on the read-only media (either the all-in-one ISO CD, floppy disk, or write-protected USB stick).

We understand that machine maintenance can take time. We're writing again because our previous correspondence, sent first to the registered Technical Contact and then to the Site PI, has gone unacknowledged for at least two weeks, and we need your help returning these machines to their regular operation. This is our third attempt to contact someone regarding these machines at your site. So, while we wait for the machines to return to their regular operation, all current slice activity will be suspended. Current experiments will be stopped and will not be able to start again until there is evidence that you have begun to help with the maintenance of these machines.

To check the status of these and any other machines that you manage, please visit:

http://comon.cs.princeton.edu/status

Instructions for the steps necessary for a BootCD upgrade are available in the Technical Contact's Guide:

https://www.planet-lab.org/doc/guides/tech#NodeInstallation

If your node returns to normal operation after following these directions, then there's no need to respond to this message. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue. Including this message in your reply will help us coordinate our records with the actions you've taken.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
thankyou=("""Thank you for helping maintain your PlanetLab nodes - %(loginbase)s""",

While monitoring your site, we noticed that the following nodes *improved*

Often, system administration is a thankless job, but not today. :-)

-- PlanetLab Central (support@planet-lab.org)
PROD- This is the production state, where the node can contact PlanetLab
and install slices from users.
DEBUG- This state designates a node that could not boot successfully.
OLDBOOTCD- This state corresponds to the situation where an old BootCD prevented
the normal operation of the node.
ERROR- This is an error state, where there is absolutely no contact
nmreset=("""NM Reset at %(loginbase)s""",

Monitor restarted NM on the following machines:
# TODO: need reminder versions for repeats...
newdown=[newdown_one, newdown_two, newdown_three]
newbootcd=[newbootcd_one, newbootcd_two, newbootcd_three]
newthankyou=[thankyou,thankyou,thankyou]
NMReset=[nmreset,nmreset,nmreset]
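# Illustrative sketch only, not part of the original module: the lists above
# order each notice by escalation step (first, second, third contact). One
# way a sender might use them is to index by the number of earlier notices
# and keep reusing the last entry once the list runs out. The names
# `pick_notice` and `times_contacted` are hypothetical.
def pick_notice(notices, times_contacted):
    """Return the template for this escalation step, clamped to the list."""
    index = max(0, min(times_contacted, len(notices) - 1))
    return notices[index]

# Hypothetical usage: pick_notice(newdown, 1) would select newdown_two, and
# any count of 2 or more would keep selecting newdown_three.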
down=("""PlanetLab node %(hostname)s down.""", """As part of PlanetLab node monitoring, we noticed %(hostname)s has been down for %(days)s days.

Please check the node's connectivity and, if properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we're seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=table_nodeviewshort&select='address==%(hostbyteorder)s'

http://www.planet-lab.org/db/sites/index.php?id=%(site_id)d

There's no need to respond to this message if CoMon reports that your machine is accessible. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can resolve the issue.

-- PlanetLab Central (support@planet-lab.org)
dbg=("""PlanetLab node %(hostname)s requires reboot.""", """As part of PlanetLab node monitoring, we noticed %(hostname)s is in debug mode. This usually implies the node was rebooted unexpectedly and could not come up cleanly.

Please check the node's connectivity and, if properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we're seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=table_nodeviewshort&select='address==%(hostbyteorder)s'

There's no need to respond to this message if CoMon reports that your machine is accessible. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can resolve the issue.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
planet_cnf=("""PlanetLab node %(hostname)s needs an updated configuration file""", """As part of PlanetLab node monitoring, we noticed %(hostname)s has an outdated planet.cnf file with no NODE_ID. This can happen after an upgrade and requires your assistance to correct. All that is needed is to visit:

https://www.planet-lab.org/db/nodes/index.php?id=%(node_id)d

and follow the "Download conf file" link to generate a new configuration file for each node. Copy this file to the appropriate read-only media, either floppy or USB stick, and reboot the machines.

There's no need to respond to this message if you're able to update the configuration files without difficulty and your node returns to normal operation. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
bootcd=("""PlanetLab node %(hostname)s needs a new BootCD""",
"""As part of PlanetLab node monitoring, we noticed %(hostname)s has an outdated BootCD: "%(version)s". This usually implies that you need to both update the BootCD and regenerate the planet.cnf file stored on the read-only floppy (or the read-only USB stick that stores the content of the BootCD and planet.cnf).

Instructions for the steps necessary for a BootCD upgrade are available in the Technical Contact's Guide:
https://www.planet-lab.org/doc/guides/tech#NodeInstallation

There's no need to respond to this message if you're able to follow the directions without difficulty and your node returns to normal operation. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can help resolve the issue.

Thank you for your help,
-- PlanetLab Central (support@planet-lab.org)
ssh=("""PlanetLab node %(hostname)s down.""", """As part of PlanetLab node monitoring, we noticed node %(hostname)s is not available for ssh.

Please check the node's connectivity and, if properly networked, power cycle the machine. Note that rebooting the machine may not fully resolve the problems we're seeing. Once the machine has come back up, please visit the CoMon status page to verify that your node is accessible from the network.

http://summer.cs.princeton.edu/status/tabulator.cgi?table=table_nodeviewshort&select='address==%(hostbyteorder)s'

There's no need to respond to this message if CoMon reports that your machine is accessible. However, if there are any console messages relating to the node's failure, please report them to PlanetLab support (support@planet-lab.org) so we can resolve the issue.

-- PlanetLab Central (support@planet-lab.org)
dns=("""PlanetLab node %(hostname)s down.""", """As part of PlanetLab node monitoring, we noticed the DNS servers used by %(hostname)s are not responding to queries.

Please verify that the DNS information used by the node is correct. You can find directions on how to update the node's network information in the PlanetLab Technical Contact's Guide (http://www.planet-lab.org/doc/TechsGuide.php#id268898).

-- PlanetLab Central (support@planet-lab.org)
filerw=("""PlanetLab node %(hostname)s has a bad disk.""", """As part of PlanetLab node monitoring, we noticed %(hostname)s has a read-only filesystem.

Please verify the integrity of the disk and email the site if a replacement is needed.

-- PlanetLab Central (support@planet-lab.org)
clock_drift=("""PlanetLab node %(hostname)s and NTP.""", """As part of PlanetLab node monitoring, we noticed %(hostname)s cannot reach our NTP server.

Please verify that the NTP port (udp/123) is not blocked by your site.

-- PlanetLab Central (support@planet-lab.org)
removedSliceCreation=("""PlanetLab slice creation/renewal suspension.""","""As part of PlanetLab node monitoring, we noticed the %(loginbase)s site has fewer than 2 nodes up. We have attempted to contact the PI and Technical Contacts %(times)s times and have not received a response.

Slice creation and renewal are now suspended for the %(loginbase)s site. Please be aware that failure to respond will result in the automatic suspension of all running slices on PlanetLab.

-- PlanetLab Central (support@planet-lab.org)
suspendSlices=("""PlanetLab slices suspended.""","""As part of PlanetLab node monitoring, we noticed the %(loginbase)s site has fewer than 2 nodes up. We have attempted to contact the PI and Technical Contacts %(times)s times and have not received a response.

All %(loginbase)s slices are now suspended.

-- PlanetLab Central (support@planet-lab.org)
pcu_broken=("""%(hostname)s failed to reinstall""","""Hello,

%(hostname)s was remotely rebooted via your power control unit but has not contacted PlanetLab since. It should contact PlanetLab on every boot, so we believe that the node has hardware problems, is not properly connected to the power control unit, or has network connectivity issues. Could you please reboot the node and watch the console for error messages?

-- PlanetLab Central (support@planet-lab.org)
We have set %(hostname)s to reinstall, but because your site does not have a power control unit, we are unable to power cycle the node. Please

-- PlanetLab Central (support@planet-lab.org)