mom.git
11 years agoSetting tag mom-2.3-5 master mom-2.3-5
Thierry Parmentelat [Thu, 19 Jul 2012 21:57:40 +0000 (23:57 +0200)]
Setting tag mom-2.3-5
use bwlimit from plnode

11 years agoremove old svn-keywords & stuff
Thierry Parmentelat [Wed, 18 Jul 2012 12:47:18 +0000 (14:47 +0200)]
remove old svn-keywords & stuff

11 years agofind bwlimit at plnode-utils
Thierry Parmentelat [Wed, 18 Jul 2012 12:46:29 +0000 (14:46 +0200)]
find bwlimit at plnode-utils

11 years agore-add Marco's .gitignore after re-running the svn2git as per git-clone-svn.sh
Thierry Parmentelat [Thu, 14 Jun 2012 15:39:05 +0000 (18:39 +0300)]
re-add Marco's .gitignore after re-running the svn2git as per git-clone-svn.sh

12 years agoSetting tag Mom-2.3-4
Scott Baker [Tue, 10 May 2011 18:05:01 +0000 (18:05 +0000)]
Setting tag Mom-2.3-4
Avoid writing error messages to stdout that are due to errors
reported from vps.

12 years agoignore lines with ERR messages in them from vps
Scott Baker [Tue, 10 May 2011 03:03:25 +0000 (03:03 +0000)]
ignore lines with ERR messages in them from vps

13 years agoSetting tag Mom-2.3-3
Scott Baker [Wed, 20 Apr 2011 17:31:33 +0000 (17:31 +0000)]
Setting tag Mom-2.3-3
Fix parsing of vps in swapmon, add swapmon error messages when unparseable
lines encountered, add stuck rpm checking to pl_mop

13 years agolook for rpm instead of yum
Scott Baker [Wed, 20 Apr 2011 16:26:27 +0000 (16:26 +0000)]
look for rpm instead of yum

13 years agofix error output
Scott Baker [Wed, 20 Apr 2011 16:25:21 +0000 (16:25 +0000)]
fix error output

13 years agofix problem parsing output of vps; add --memstatus command
Scott Baker [Wed, 20 Apr 2011 15:51:21 +0000 (15:51 +0000)]
fix problem parsing output of vps; add --memstatus command

13 years agocheck for stuck RPM processes
Scott Baker [Wed, 9 Mar 2011 05:02:52 +0000 (05:02 +0000)]
check for stuck RPM processes

14 years agoSetting tag Mom-2.3-2
Daniel Hokka Zakrisson [Sun, 13 Dec 2009 15:06:29 +0000 (15:06 +0000)]
Setting tag Mom-2.3-2
Don't restart disabled services.

14 years agoOnly restart services which are enabled.
Daniel Hokka Zakrisson [Sun, 13 Dec 2009 14:58:02 +0000 (14:58 +0000)]
Only restart services which are enabled.

15 years agoTagging module Mom - Mom-2.3-1
Faiyaz Ahmed [Tue, 7 Apr 2009 15:57:12 +0000 (15:57 +0000)]
Tagging module Mom - Mom-2.3-1
Merge from trunk.

15 years agovserver is no longer needed.
Faiyaz Ahmed [Thu, 5 Feb 2009 15:51:26 +0000 (15:51 +0000)]
vserver is no longer needed.

15 years agoMerge from trunk. Kills procs in sliver rather than restarting context.
Faiyaz Ahmed [Wed, 19 Nov 2008 17:11:53 +0000 (17:11 +0000)]
Merge from trunk.  Kills procs in sliver rather than restarting context.

15 years agoKill procs in vserver using most memory.
Faiyaz Ahmed [Wed, 10 Sep 2008 18:24:47 +0000 (18:24 +0000)]
Kill procs in vserver using most memory.

15 years agoJust kill the processes and stop screwing with vservers.
Faiyaz Ahmed [Wed, 10 Sep 2008 01:01:17 +0000 (01:01 +0000)]
Just kill the processes and stop screwing with vservers.

15 years agoOn vserver create, the slice goes to init 1 which causes an extra column to get print...
Faiyaz Ahmed [Tue, 29 Jul 2008 19:46:19 +0000 (19:46 +0000)]
On vserver create, the slice goes to init 1 which causes an extra column to get printed in the output of ps.  Just pass over this.

15 years agoRemoved functions that didn't do anything, like update vserver and check netflow...
Faiyaz Ahmed [Tue, 29 Jul 2008 19:38:18 +0000 (19:38 +0000)]
Removed functions that didn't do anything, like update vserver and check netflow.  Deprecated.

15 years agoUse hostname instead of planetlab.org if config file can't be found.
Faiyaz Ahmed [Thu, 24 Apr 2008 20:39:29 +0000 (20:39 +0000)]
Use hostname instead of planetlab.org if config file can't be found.

15 years agoFixed bug where vps was causing slice info to dict was failing.
Faiyaz Ahmed [Thu, 24 Apr 2008 20:23:57 +0000 (20:23 +0000)]
Fixed bug where vps was causing slice info to dict was failing.
Updated to use new vserver.py.

15 years agoIncrease channel width in vps so regex doesn't fail.
Faiyaz Ahmed [Wed, 23 Apr 2008 20:47:55 +0000 (20:47 +0000)]
Increase channel width in vps so regex doesn't fail.

16 years agotrying to bring some sense in the numbering scheme
Thierry Parmentelat [Fri, 28 Mar 2008 17:37:13 +0000 (17:37 +0000)]
trying to bring some sense in the numbering scheme

16 years agoMust use vkill to kill outside of root.
Faiyaz Ahmed [Wed, 19 Mar 2008 19:54:50 +0000 (19:54 +0000)]
Must use vkill to kill outside of root.

16 years agobump version.
Faiyaz Ahmed [Wed, 12 Mar 2008 18:41:50 +0000 (18:41 +0000)]
bump version.

16 years agoBug in debug code. Also now sends to mom list address.
Faiyaz Ahmed [Thu, 6 Mar 2008 18:11:41 +0000 (18:11 +0000)]
Bug in debug code.  Also now sends to mom list address.

16 years agoMade DATAFILE global. Same for VERSION.
Faiyaz Ahmed [Tue, 19 Feb 2008 22:54:43 +0000 (22:54 +0000)]
Made DATAFILE global.  Same for VERSION.

16 years agoready for tagging
Thierry Parmentelat [Tue, 8 Jan 2008 11:49:43 +0000 (11:49 +0000)]
ready for tagging
reviewed URLs and %{plrelease}

16 years agoAdded License file. Also updated readme. Just house keeping.
Faiyaz Ahmed [Thu, 3 Jan 2008 19:51:33 +0000 (19:51 +0000)]
Added License file.  Also updated readme.  Just house keeping.

16 years agoFixed indenting.
Faiyaz Ahmed [Tue, 4 Dec 2007 21:16:56 +0000 (21:16 +0000)]
Fixed indenting.

16 years agoversions were off from branch. Bumped back.
Faiyaz Ahmed [Tue, 4 Dec 2007 20:55:43 +0000 (20:55 +0000)]
versions were off from branch.  Bumped back.

16 years agofixed version
Faiyaz Ahmed [Tue, 4 Dec 2007 20:52:02 +0000 (20:52 +0000)]
fixed version

16 years agoKills and restarts NM when too many found running.
Faiyaz Ahmed [Tue, 4 Dec 2007 20:50:44 +0000 (20:50 +0000)]
Kills and restarts NM when too many found running.

16 years agoCopy head to branch. ALso bump spec. 2.1
Faiyaz Ahmed [Thu, 29 Nov 2007 21:57:02 +0000 (21:57 +0000)]
Copy head to branch.  ALso bump spec.

16 years agoSomewhere in the CVS to SVN conversion, or maybe it was just me using ViM incorrectly...
Faiyaz Ahmed [Thu, 29 Nov 2007 21:50:10 +0000 (21:50 +0000)]
Somewhere in the CVS to SVN conversion, or maybe it was just me using ViM incorrectly, tabbig got all messed up.  Also cleaned up the abortion that was main.

..This was never written correctly and is badly in need of a rewrite.

16 years agoAdded keyword ID propset.
Faiyaz Ahmed [Thu, 29 Nov 2007 20:17:46 +0000 (20:17 +0000)]
Added keyword ID propset.

16 years agokill NM processes running in a slice.
Faiyaz Ahmed [Thu, 29 Nov 2007 20:06:03 +0000 (20:06 +0000)]
kill NM processes running in a slice.

16 years agotabs != spaces. I hate CVS.
Faiyaz Ahmed [Mon, 12 Nov 2007 21:18:23 +0000 (21:18 +0000)]
tabs != spaces. I hate CVS.

16 years agoTest checkin.
Faiyaz Ahmed [Wed, 7 Nov 2007 20:35:38 +0000 (20:35 +0000)]
Test checkin.

16 years agoUpdated Readme
Faiyaz Ahmed [Tue, 25 Sep 2007 19:12:02 +0000 (19:12 +0000)]
Updated Readme

16 years agoNo more pl_conf.
Faiyaz Ahmed [Tue, 25 Sep 2007 18:18:39 +0000 (18:18 +0000)]
No more pl_conf.

16 years ago/usr/local should never be a path for packaged modules. Moving to /usr
Faiyaz Ahmed [Tue, 14 Aug 2007 16:31:32 +0000 (16:31 +0000)]
/usr/local should never be a path for packaged modules.  Moving to /usr

16 years agoRemoved bwmon.
Faiyaz Ahmed [Wed, 25 Jul 2007 17:10:16 +0000 (17:10 +0000)]
Removed bwmon.

17 years agoBump version number.
Faiyaz Ahmed [Wed, 28 Feb 2007 19:47:53 +0000 (19:47 +0000)]
Bump version number.

17 years agoBump release
Faiyaz Ahmed [Tue, 27 Feb 2007 23:14:57 +0000 (23:14 +0000)]
Bump release

17 years agoUnneeded any longer.
Faiyaz Ahmed [Tue, 27 Feb 2007 23:09:06 +0000 (23:09 +0000)]
Unneeded any longer.

17 years ago* remove reference to pl_nm
Faiyaz Ahmed [Wed, 14 Feb 2007 20:48:46 +0000 (20:48 +0000)]
* remove reference to pl_nm

17 years ago* Bump revision and version
Faiyaz Ahmed [Wed, 14 Feb 2007 19:31:37 +0000 (19:31 +0000)]
* Bump revision and version

17 years agoRemoved nm_mom and bwmon.
Faiyaz Ahmed [Wed, 14 Feb 2007 19:24:06 +0000 (19:24 +0000)]
Removed nm_mom and bwmon.

17 years agosatisfy FC6 rpm requirements for python files
Marc Fiuczynski [Thu, 18 Jan 2007 18:57:29 +0000 (18:57 +0000)]
satisfy FC6 rpm requirements for python files

17 years agoBump release, not version. Was a minor change.
Faiyaz Ahmed [Wed, 10 Jan 2007 16:56:10 +0000 (16:56 +0000)]
Bump release, not version.  Was a minor change.

17 years agoBump revision.
Faiyaz Ahmed [Wed, 10 Jan 2007 16:53:27 +0000 (16:53 +0000)]
Bump revision.

17 years ago* bwlimit.set() expects bits/s. Was giving it bytes/s.
Faiyaz Ahmed [Wed, 10 Jan 2007 16:51:04 +0000 (16:51 +0000)]
* bwlimit.set() expects bits/s.  Was giving it bytes/s.

17 years ago* Removed debug statement that prints out extra NM crap because I don't trust NM...
Faiyaz Ahmed [Mon, 8 Jan 2007 21:58:13 +0000 (21:58 +0000)]
* Removed debug statement that prints out extra NM crap because I don't trust NM any more than I can throw steve.

17 years ago* NM rate values are in bits/s. Fixed Byte limits.
Faiyaz Ahmed [Mon, 8 Jan 2007 21:32:45 +0000 (21:32 +0000)]
*  NM rate values are in bits/s.  Fixed Byte limits.

17 years ago* Bumps version to 0.7
Faiyaz Ahmed [Wed, 3 Jan 2007 20:15:36 +0000 (20:15 +0000)]
*  Bumps version to 0.7

17 years ago* time.time() returns a float which python doesn't like to divide. Fixed.
Faiyaz Ahmed [Wed, 3 Jan 2007 20:15:06 +0000 (20:15 +0000)]
* time.time() returns a float which python doesn't like to divide.  Fixed.
* Made email slightly more readable.

17 years ago- bump release to incorporate bwmon.py low limit fix
Mark Huang [Thu, 28 Dec 2006 22:37:52 +0000 (22:37 +0000)]
- bump release to incorporate bwmon.py low limit fix

17 years ago* Capped rate can only go as low as default_MinRate which is 8bits/s
Faiyaz Ahmed [Tue, 19 Dec 2006 16:52:24 +0000 (16:52 +0000)]
* Capped rate can only go as low as default_MinRate which is 8bits/s

17 years ago* Bump release to 9.
Faiyaz Ahmed [Wed, 13 Dec 2006 21:50:37 +0000 (21:50 +0000)]
*  Bump release to 9.

17 years ago* Fixed syntax error with low bw thresholds.
Faiyaz Ahmed [Wed, 13 Dec 2006 21:39:23 +0000 (21:39 +0000)]
* Fixed syntax error with low bw thresholds.

17 years ago* Fixed output of -s -v and -d flags.
Faiyaz Ahmed [Wed, 6 Dec 2006 19:22:45 +0000 (19:22 +0000)]
* Fixed output of -s -v and -d flags.

Still testing.

17 years ago* Byte limits are in KB when passed or requested via XMLRPC to NM. Bytes otherwise.
Faiyaz Ahmed [Tue, 5 Dec 2006 21:46:56 +0000 (21:46 +0000)]
*  Byte limits are in KB when passed or requested via XMLRPC to NM.  Bytes otherwise.

17 years agoRemove slice email notification. (commented. will remove notification code after...
Faiyaz Ahmed [Tue, 5 Dec 2006 16:46:58 +0000 (16:46 +0000)]
Remove slice email notification.  (commented.  will remove notification code after testing)

17 years ago* Queries NM for: "nm_net_max_byte",
Faiyaz Ahmed [Mon, 4 Dec 2006 21:53:09 +0000 (21:53 +0000)]
* Queries NM for: "nm_net_max_byte",
"nm_net_max_exempt_byte",
"nm_net_max_thresh_byte",
"nm_net_max_thresh_exempt_byte"
* Backwards compatible with avgrate and avgexemptrate slice attributes.

17 years ago- add SZ ("potential" memory usage) to e-mails to see if this can be
Mark Huang [Sat, 2 Dec 2006 19:11:47 +0000 (19:11 +0000)]
- add SZ ("potential" memory usage) to e-mails to see if this can be
  used as a metric
- memtotal: return SwapTotal as well
- summary: completely broken when used in the emergency reboot case, fix
- parse --min-thresh
- just warn system slices once (again)

17 years ago* Fixed floating point arith error. tc likes whole numbers.
Faiyaz Ahmed [Fri, 1 Dec 2006 22:02:01 +0000 (22:02 +0000)]
* Fixed floating point arith error.  tc likes whole numbers.
* Added extra debug statements.

17 years ago* bwmon uses byte limits instead of average rates (slice attributes). After reachin...
Faiyaz Ahmed [Fri, 1 Dec 2006 20:05:30 +0000 (20:05 +0000)]
*  bwmon uses byte limits instead of average rates (slice attributes).  After reaching threshold bytelimit, cap to (bytelimit - threshold)/time_left_in_period
*  Still need appropriate slice attribute to be added to NM.  Using hardcoded values until new NM is ready.

Still testing.  Not for public consumption yet.

17 years ago- disable delivery status notifications
Mark Huang [Sat, 18 Nov 2006 18:00:14 +0000 (18:00 +0000)]
- disable delivery status notifications

17 years ago* Bumped revision.
Faiyaz Ahmed [Wed, 11 Oct 2006 17:48:31 +0000 (17:48 +0000)]
* Bumped revision.

17 years ago* change nm_mom.py to run once at noon every day. Sometimes NM is being restarted...
Faiyaz Ahmed [Wed, 11 Oct 2006 16:14:10 +0000 (16:14 +0000)]
* change nm_mom.py to run once at noon every day.  Sometimes NM is being restarted while querying pl_conf and pl_conf dies. Running that once a day instead of every 15 should fix that.

17 years ago- put nm_mom.py in the right place
Mark Huang [Tue, 10 Oct 2006 19:19:08 +0000 (19:19 +0000)]
- put nm_mom.py in the right place

17 years ago- install and add nm_mom.py to spec
Mark Huang [Tue, 10 Oct 2006 19:16:36 +0000 (19:16 +0000)]
- install and add nm_mom.py to spec

17 years ago* Incremented version of RPM.
Faiyaz Ahmed [Tue, 10 Oct 2006 18:08:12 +0000 (18:08 +0000)]
* Incremented version of RPM.
* Included nm_mom.py in RPM.

17 years ago- bump release number for swapmon fixes
Mark Huang [Thu, 24 Aug 2006 20:15:53 +0000 (20:15 +0000)]
- bump release number for swapmon fixes

17 years agoSleep for 10 seconds after stoppiing NM; in the off chance NM is working, it takes...
Faiyaz Ahmed [Thu, 17 Aug 2006 20:21:09 +0000 (20:21 +0000)]
Sleep for 10 seconds after stoppiing NM; in the off chance NM is working, it takes a bit to stop.

17 years agoRemoved unneeded imports. There are probably more, but I'm still uncertain of python...
Faiyaz Ahmed [Thu, 17 Aug 2006 20:12:09 +0000 (20:12 +0000)]
Removed unneeded imports.  There are probably more, but I'm still uncertain of python's inheritence of imports.

17 years agoAdded nm_mom to cron.
Faiyaz Ahmed [Thu, 17 Aug 2006 20:06:26 +0000 (20:06 +0000)]
Added nm_mom to cron.

17 years agoChecks NM to see if still alive by querying XMLRPC. If time out, restart NM.
Faiyaz Ahmed [Thu, 17 Aug 2006 20:05:24 +0000 (20:05 +0000)]
Checks NM to see if still alive by querying XMLRPC.  If time out, restart NM.

17 years ago* Added hard limit of 200MB to be considered a hog.
Faiyaz Ahmed [Wed, 16 Aug 2006 16:18:45 +0000 (16:18 +0000)]
* Added hard limit of 200MB to be considered a hog.
* If you've been reset more than 3 times, kill until you behave.
* Update slice statistics after every slice reset/kill.

17 years agoStops growing after reaching specified size and waits..
Faiyaz Ahmed [Tue, 25 Jul 2006 18:09:19 +0000 (18:09 +0000)]
Stops growing after reaching specified size and waits..

17 years agoRemoved debug statement.
Faiyaz Ahmed [Wed, 19 Jul 2006 19:40:55 +0000 (19:40 +0000)]
Removed debug statement.

17 years ago* Wed Jul 19 2006 Faiyaz Ahmed <faiyaza@cs.princeton.edu> 0.4-7.planetlab
Mark Huang [Wed, 19 Jul 2006 19:04:06 +0000 (19:04 +0000)]
* Wed Jul 19 2006 Faiyaz Ahmed <faiyaza@cs.princeton.edu> 0.4-7.planetlab
- swapmon: Added state info so same reset email does not keep getting
  sent. If slice is already being killed, will wait for a timeout
  before trying to kill again.

17 years agowaitpid for forked processes. Was zombie'ing before.
Faiyaz Ahmed [Wed, 19 Jul 2006 18:55:06 +0000 (18:55 +0000)]
waitpid for forked processes.   Was zombie'ing before.

17 years agoAdded state info so same reset email does not keep getting sent. If slice is already
Faiyaz Ahmed [Wed, 19 Jul 2006 18:24:37 +0000 (18:24 +0000)]
Added state info so same reset email does not keep getting sent.  If slice is already
being killed, will wait for a timeout before trying to kill again.

17 years ago- bump release number
Mark Huang [Mon, 17 Jul 2006 21:34:39 +0000 (21:34 +0000)]
- bump release number

17 years agoAdded state information per slice. Slices are now killed after consuming more
Faiyaz Ahmed [Mon, 17 Jul 2006 19:31:27 +0000 (19:31 +0000)]
Added state information per slice.  Slices are now killed after consuming more
memory than the threshold more than twice.  Also fixed the slice restart issue where
slices would be restarted and repeated restart emails would be sent.

17 years agoRemoved unused class variables.
Faiyaz Ahmed [Mon, 10 Jul 2006 20:22:12 +0000 (20:22 +0000)]
Removed unused class variables.

17 years ago- bump release for initialization bugfix
Mark Huang [Mon, 10 Jul 2006 19:23:36 +0000 (19:23 +0000)]
- bump release for initialization bugfix

17 years agoInitted default limits before connecting to NM for first time in case NM except'ns.
Faiyaz Ahmed [Mon, 10 Jul 2006 19:19:07 +0000 (19:19 +0000)]
Initted default limits before connecting to NM for first time in case NM except'ns.

17 years ago- fix changelog date
Mark Huang [Mon, 10 Jul 2006 16:00:59 +0000 (16:00 +0000)]
- fix changelog date

17 years ago- bump release number
Mark Huang [Mon, 10 Jul 2006 15:51:01 +0000 (15:51 +0000)]
- bump release number
- start a changelog

17 years agoStores last values reported by NM. Uses cached vals if NM stops responding instead of
Faiyaz Ahmed [Mon, 10 Jul 2006 15:19:35 +0000 (15:19 +0000)]
Stores last values reported by NM.  Uses cached vals if NM stops responding instead of
global defaults.

17 years ago- nm_net parameters are now in bps
Mark Huang [Mon, 26 Jun 2006 20:20:17 +0000 (20:20 +0000)]
- nm_net parameters are now in bps

17 years ago- deal with error results from NM correctly, and don't give up after a single error
Mark Huang [Fri, 2 Jun 2006 04:01:41 +0000 (04:01 +0000)]
- deal with error results from NM correctly, and don't give up after a single error

17 years ago- stop old version of pl_mom before upgrading
Mark Huang [Tue, 30 May 2006 17:42:00 +0000 (17:42 +0000)]
- stop old version of pl_mom before upgrading

17 years ago- add a fixit script to fill the gap between the current situation and
Mark Huang [Tue, 9 May 2006 22:26:31 +0000 (22:26 +0000)]
- add a fixit script to fill the gap between the current situation and
  the desired situation. Ideally, nodes' configuration files and GPG
  keys would be initially installed by the Boot Manager, then pushed to
  the reference images and system slices by vserver-reference
  init/cron. However, we don't want to re-install nodes, and we don't
  want to upgrade vserver-reference, so between PlanetLabConf,
  NodeUpdate, and this script, we should be able to get to a point where
  we can enable GPG checking of RPMs.

17 years agovps sometimes prints ERR instead of a context ID if it
Mark Huang [Tue, 9 May 2006 03:23:57 +0000 (03:23 +0000)]
vps sometimes prints ERR instead of a context ID if it
cannot identify the context of an orphaned (usually dying)
process. Skip these processes.

17 years ago- only print a single warning per run if NM query times out
Mark Huang [Tue, 9 May 2006 03:22:25 +0000 (03:22 +0000)]
- only print a single warning per run if NM query times out