mom.git
17 years agomerge changes from HEAD planetlab-3_3-branch
Faiyaz Ahmed [Wed, 10 Jan 2007 16:59:48 +0000 (16:59 +0000)]
merge changes from HEAD

17 years agomerge changes from HEAD
Faiyaz Ahmed [Mon, 8 Jan 2007 22:01:36 +0000 (22:01 +0000)]
merge changes from HEAD

17 years agomerge changes from head
Mark Huang [Fri, 5 Jan 2007 17:56:58 +0000 (17:56 +0000)]
merge changes from head

17 years agomerge to head
Mark Huang [Thu, 28 Dec 2006 22:39:43 +0000 (22:39 +0000)]
merge to head

17 years agomerge changes from HEAD
Mark Huang [Fri, 15 Dec 2006 20:43:10 +0000 (20:43 +0000)]
merge changes from HEAD

17 years agoThis commit was manufactured by cvs2svn to create branch
Planet-Lab Support [Fri, 1 Dec 2006 20:05:31 +0000 (20:05 +0000)]
This commit was manufactured by cvs2svn to create branch
'planetlab-3_3-branch'.

17 years ago* Bump revision.
Faiyaz Ahmed [Wed, 11 Oct 2006 17:49:52 +0000 (17:49 +0000)]
* Bump revision.

17 years ago* change nm_mom.py to run once at noon every day. Sometimes NM is being restarted...
Faiyaz Ahmed [Wed, 11 Oct 2006 17:20:10 +0000 (17:20 +0000)]
* change nm_mom.py to run once at noon every day.  Sometimes NM is being restarted while querying pl_conf and pl_conf dies. Running that once a day instead of every 15 should fix that.

17 years ago* change nm_mom.py to run once at noon every day. Sometimes NM is being restarted...
Faiyaz Ahmed [Wed, 11 Oct 2006 16:22:39 +0000 (16:22 +0000)]
* change nm_mom.py to run once at noon every day.  Sometimes NM is being restarted while querying pl_conf and pl_conf dies. Running that once a day instead of every 15 should fix that.

17 years ago- merge revision 1.32
Mark Huang [Tue, 10 Oct 2006 19:19:33 +0000 (19:19 +0000)]
- merge revision 1.32
date: 2006/10/10 19:19:08;  author: mlhuang;  state: Exp;  lines: +1 -1
- put nm_mom.py in the right place

17 years ago- merge revision 1.31
Mark Huang [Tue, 10 Oct 2006 19:17:23 +0000 (19:17 +0000)]
- merge revision 1.31
date: 2006/10/10 19:16:36;  author: mlhuang;  state: Exp;  lines: +2 -1
- install and add nm_mom.py to spec

17 years ago- merge revision 1.30
Mark Huang [Tue, 10 Oct 2006 19:13:01 +0000 (19:13 +0000)]
- merge revision 1.30
date: 2006/10/10 18:08:12;  author: faiyaza;  state: Exp;  lines: +4 -1
* Incremented version of RPM.
* Included nm_mom.py in RPM.

17 years ago- merge changes from HEAD for swapmon fixes
Mark Huang [Thu, 24 Aug 2006 20:16:14 +0000 (20:16 +0000)]
- merge changes from HEAD for swapmon fixes

17 years agoThis commit was manufactured by cvs2svn to create branch
Planet-Lab Support [Thu, 17 Aug 2006 20:21:11 +0000 (20:21 +0000)]
This commit was manufactured by cvs2svn to create branch
'planetlab-3_3-branch'.

17 years agomerge 0.4-7.planetlab changes
Mark Huang [Wed, 19 Jul 2006 19:47:03 +0000 (19:47 +0000)]
merge 0.4-7.planetlab changes

17 years ago- merge revision 1.27
Mark Huang [Mon, 17 Jul 2006 21:35:19 +0000 (21:35 +0000)]
- merge revision 1.27
date: 2006/07/17 21:34:39;  author: mlhuang;  state: Exp;  lines: +1 -1
- bump release number

17 years ago- merge revision 1.6
Mark Huang [Mon, 17 Jul 2006 21:34:20 +0000 (21:34 +0000)]
- merge revision 1.6
date: 2006/07/17 19:31:27;  author: faiyaza;  state: Exp;  lines: +139 -25
Added state information per slice.  Slices are now killed after consuming more
memory than the threshold more than twice.  Also fixed the slice restart issue where
slices would be restarted and repeated restart emails would be sent.

17 years agomerge 0.4-5.planetlab changes
Mark Huang [Mon, 10 Jul 2006 21:12:50 +0000 (21:12 +0000)]
merge 0.4-5.planetlab changes

17 years agomerge 0.4-4.planetlab changes
Mark Huang [Mon, 10 Jul 2006 16:01:13 +0000 (16:01 +0000)]
merge 0.4-4.planetlab changes

17 years agomerge 0.4-4.planetlab changes
Mark Huang [Mon, 10 Jul 2006 15:59:20 +0000 (15:59 +0000)]
merge 0.4-4.planetlab changes

17 years ago- deal with error results from NM correctly, and don't give up after a single error
Mark Huang [Fri, 2 Jun 2006 04:01:41 +0000 (04:01 +0000)]
- deal with error results from NM correctly, and don't give up after a single error

17 years ago- merge revision 1.22
Mark Huang [Tue, 30 May 2006 17:43:02 +0000 (17:43 +0000)]
- merge revision 1.22
date: 2006/05/30 17:42:00;  author: mlhuang;  state: Exp;  lines: +8 -1
- stop old version of pl_mom before upgrading

17 years agomerge to 0.4-1 (HEAD as of 2006-05-09
Mark Huang [Tue, 9 May 2006 22:28:48 +0000 (22:28 +0000)]
merge to 0.4-1 (HEAD as of 2006-05-09

17 years agoThis commit was manufactured by cvs2svn to create branch
Planet-Lab Support [Tue, 9 May 2006 03:23:58 +0000 (03:23 +0000)]
This commit was manufactured by cvs2svn to create branch
'planetlab-3_3-branch'.

17 years agovps sometimes prints ERR instead of a context ID if it
Mark Huang [Tue, 9 May 2006 03:23:57 +0000 (03:23 +0000)]
vps sometimes prints ERR instead of a context ID if it
cannot identify the context of an orphaned (usually dying)
process. Skip these processes.

17 years ago- only print a single warning per run if NM query times out
Mark Huang [Tue, 9 May 2006 03:22:25 +0000 (03:22 +0000)]
- only print a single warning per run if NM query times out

17 years agoremove call to non-existent close() function
Mark Huang [Mon, 8 May 2006 17:37:28 +0000 (17:37 +0000)]
remove call to non-existent close() function

17 years ago- getting a weird exception very occasionally while parsing vps output,
Mark Huang [Tue, 2 May 2006 17:23:14 +0000 (17:23 +0000)]
- getting a weird exception very occasionally while parsing vps output,
  print it

17 years agoadd stupid test program that leaks memory at a configurable rate
Mark Huang [Mon, 1 May 2006 18:28:22 +0000 (18:28 +0000)]
add stupid test program that leaks memory at a configurable rate

17 years agosupport one-shot --status argument to get quick summary
Mark Huang [Mon, 1 May 2006 18:28:01 +0000 (18:28 +0000)]
support one-shot --status argument to get quick summary

18 years ago- query nm for the right average rate parameters
Mark Huang [Fri, 28 Apr 2006 20:25:19 +0000 (20:25 +0000)]
- query nm for the right average rate parameters

18 years ago- remove unnecessary textwrap import
Mark Huang [Fri, 28 Apr 2006 19:32:18 +0000 (19:32 +0000)]
- remove unnecessary textwrap import

18 years ago- bump version number
Mark Huang [Fri, 28 Apr 2006 19:30:11 +0000 (19:30 +0000)]
- bump version number
- add necessary Requires: lines for vps, bwlimit.py, and tc
- improve description
- renamed pl_mom to swapmon
- install bwmon

18 years ago- standard initscript for swapmon
Mark Huang [Fri, 28 Apr 2006 19:29:16 +0000 (19:29 +0000)]
- standard initscript for swapmon

18 years ago- renamed pl_mom daemon to swapmon
Mark Huang [Fri, 28 Apr 2006 19:28:59 +0000 (19:28 +0000)]
- renamed pl_mom daemon to swapmon

18 years ago- utility functions common to swapmon and bwmon
Mark Huang [Fri, 28 Apr 2006 19:27:43 +0000 (19:27 +0000)]
- utility functions common to swapmon and bwmon

18 years ago- rewrite pl_mom.pl
Mark Huang [Fri, 28 Apr 2006 19:26:59 +0000 (19:26 +0000)]
- rewrite pl_mom.pl
- now single-threaded, several other improvements
- pick up support and slice e-mail addresses, and system slice prefix,
  from PLC configuration in /etc/planetlab
- use vserver.py module to reset slices instead of exec()ing chcontext
- use vps instead of slicestat; slicestat may not run on all nodes, in
  particular, does not run on private PlanetLab nodes. Calling vps is
  much faster than querying slicestat.
- list slices active before reboot, in the reboot notification
- list all slice processes running before reset, in the reset
  notification
- previously, only the reboot threshold was checked every second. Now,
  both reboot and reset thresholds are checked every second. Since it is
  relatively expensive and can take a while to run under heavy load, vps
  is still run only every 30 seconds, or when a large change in swap
  utilization is detected. This cuts down on the time necessary to
  detect fast growing hogs, before critical utilization is reached.

18 years agorename to pl_mom.cron since bwmon is run from cron now
Mark Huang [Fri, 28 Apr 2006 19:12:09 +0000 (19:12 +0000)]
rename to pl_mom.cron since bwmon is run from cron now

18 years ago- renamed BandwidthMonitor.py
Mark Huang [Fri, 28 Apr 2006 19:11:20 +0000 (19:11 +0000)]
- renamed BandwidthMonitor.py

18 years ago- rewrite, rename to swapmon.py and swapmon.init
Mark Huang [Fri, 28 Apr 2006 19:11:08 +0000 (19:11 +0000)]
- rewrite, rename to swapmon.py and swapmon.init

18 years ago- rename to bwmon.py
Mark Huang [Fri, 28 Apr 2006 19:10:32 +0000 (19:10 +0000)]
- rename to bwmon.py

18 years agoRewrite bandwidth monitoring to use bwlimit.py module and to manage exempt limits...
Mark Huang [Tue, 25 Apr 2006 14:40:28 +0000 (14:40 +0000)]
Rewrite bandwidth monitoring to use bwlimit.py module and to manage exempt limits as well

18 years agoUndo bad checkin
Steve Muir [Wed, 15 Mar 2006 15:46:43 +0000 (15:46 +0000)]
Undo bad checkin

18 years agouse BWAVGRATE parameter rather than BWMAXRATE
Steve Muir [Tue, 14 Mar 2006 15:08:00 +0000 (15:08 +0000)]
use BWAVGRATE parameter rather than BWMAXRATE

18 years ago- bump release number to differentiate from branch
Mark Huang [Fri, 10 Mar 2006 02:07:29 +0000 (02:07 +0000)]
- bump release number to differentiate from branch

18 years ago- start pl_conf in case it's dead
Mark Huang [Mon, 6 Mar 2006 20:40:33 +0000 (20:40 +0000)]
- start pl_conf in case it's dead

18 years ago- bwlimit script now sets up capped and exempt subclasses; capped
Mark Huang [Mon, 27 Feb 2006 02:41:43 +0000 (02:41 +0000)]
- bwlimit script now sets up capped and exempt subclasses; capped
  buckets all begin with 0x1000. Ignore the root (0x1000) and default
  (0x1fff) buckets, as well as exempt buckets.
- don't send mail when debugging
- log bandwidth caps and actually cap bandwidth when debugging

18 years ago- bump release number for hex minor fix
Mark Huang [Wed, 22 Feb 2006 23:48:06 +0000 (23:48 +0000)]
- bump release number for hex minor fix

18 years ago- fix a minor (haha) bug: tc reports minor class numbers in hexadecimal,
Mark Huang [Wed, 22 Feb 2006 23:47:37 +0000 (23:47 +0000)]
- fix a minor (haha) bug: tc reports minor class numbers in hexadecimal,
  not decimal. Fix things so that it works with VNET, which classifies
  packets from xid 500 into class 1:1f4, *not* 1:500.
- set the "default" minor class number to ffff to emphasize this point.

18 years ago- bump release number (shutdown instead of remounting /vservers after fixing)
Mark Huang [Thu, 26 Jan 2006 19:28:58 +0000 (19:28 +0000)]
- bump release number (shutdown instead of remounting /vservers after fixing)

18 years ago- shutdown instead of remounting /vservers after fixing
Mark Huang [Thu, 26 Jan 2006 19:26:20 +0000 (19:26 +0000)]
- shutdown instead of remounting /vservers after fixing

18 years agosupport pldistro "variants"
Mark Huang [Thu, 15 Dec 2005 16:50:39 +0000 (16:50 +0000)]
support pldistro "variants"

18 years ago- bump release number for fibbed rc6 or planned rc7
Mark Huang [Thu, 1 Dec 2005 23:39:31 +0000 (23:39 +0000)]
- bump release number for fibbed rc6 or planned rc7

18 years ago- no need to restart autofs
Mark Huang [Thu, 1 Dec 2005 23:38:58 +0000 (23:38 +0000)]
- no need to restart autofs

18 years ago- fix how pl_conf is checked and restarted
Mark Huang [Thu, 3 Nov 2005 17:23:44 +0000 (17:23 +0000)]
- fix how pl_conf is checked and restarted

18 years ago- whoops, list pl_mop job in %files
Mark Huang [Tue, 11 Oct 2005 17:36:54 +0000 (17:36 +0000)]
- whoops, list pl_mop job in %files

18 years ago- change Copyright to License
Mark Huang [Tue, 11 Oct 2005 17:36:15 +0000 (17:36 +0000)]
- change Copyright to License

18 years ago- install pl_mop job
Mark Huang [Tue, 11 Oct 2005 17:35:10 +0000 (17:35 +0000)]
- install pl_mop job

18 years agoRuns once a day to "fix" nodes in various ways
Mark Huang [Tue, 11 Oct 2005 17:34:57 +0000 (17:34 +0000)]
Runs once a day to "fix" nodes in various ways

18 years agoremove deprecated spec file
Mark Huang [Tue, 11 Oct 2005 17:34:23 +0000 (17:34 +0000)]
remove deprecated spec file

18 years ago- check $PL_BOOTCD instead of runlevel
Mark Huang [Sun, 28 Aug 2005 16:20:57 +0000 (16:20 +0000)]
- check $PL_BOOTCD instead of runlevel

18 years agoBug fix: default rate cap was always applied
Andy Bavier [Tue, 7 Jun 2005 22:24:57 +0000 (22:24 +0000)]
Bug fix: default rate cap was always applied

18 years agoBug fix to handle change in output of 'tc -s -d qdisc show' in 2.6.10 kernel
Andy Bavier [Fri, 3 Jun 2005 17:42:39 +0000 (17:42 +0000)]
Bug fix to handle change in output of 'tc -s -d qdisc show' in 2.6.10 kernel

19 years agoBump release number
Andy Bavier [Thu, 7 Apr 2005 19:42:58 +0000 (19:42 +0000)]
Bump release number

19 years agoWhoops, turn off debug flag
Andy Bavier [Thu, 7 Apr 2005 18:36:59 +0000 (18:36 +0000)]
Whoops, turn off debug flag

19 years agoSupport per-slice daily bandwidth caps
Andy Bavier [Thu, 7 Apr 2005 18:00:48 +0000 (18:00 +0000)]
Support per-slice daily bandwidth caps

19 years agoWhoops, turn off debug flag
Andy Bavier [Thu, 24 Feb 2005 14:47:13 +0000 (14:47 +0000)]
Whoops, turn off debug flag

19 years agoRemove bandwidth caps when pl_mom restarts
Andy Bavier [Tue, 22 Feb 2005 18:06:47 +0000 (18:06 +0000)]
Remove bandwidth caps when pl_mom restarts

19 years agoSend bw cap mail to slices
Andy Bavier [Thu, 10 Feb 2005 16:13:24 +0000 (16:13 +0000)]
Send bw cap mail to slices

19 years agoFixed problem with 'top' by including full paths
Andy Bavier [Thu, 27 Jan 2005 15:27:53 +0000 (15:27 +0000)]
Fixed problem with 'top' by including full paths

19 years agoTurn on/off bandwidth caps
Andy Bavier [Fri, 21 Jan 2005 14:07:04 +0000 (14:07 +0000)]
Turn on/off bandwidth caps

19 years agoBandwidth limiting added
Andy Bavier [Thu, 6 Jan 2005 21:55:56 +0000 (21:55 +0000)]
Bandwidth limiting added

19 years agoBeginning of changes to allow pl_moom to monitor, adjust bandwidth
Andy Bavier [Wed, 22 Dec 2004 18:57:48 +0000 (18:57 +0000)]
Beginning of changes to allow pl_moom to monitor, adjust bandwidth

19 years ago- change URL to repository location
Mark Huang [Wed, 17 Nov 2004 20:32:07 +0000 (20:32 +0000)]
- change URL to repository location

19 years agoMinor changes for PlanetLab V3
Andy Bavier [Wed, 17 Nov 2004 20:24:11 +0000 (20:24 +0000)]
Minor changes for PlanetLab V3

19 years agoUse curl for PlanetLab v3 deployment
Andy Bavier [Wed, 17 Nov 2004 18:23:32 +0000 (18:23 +0000)]
Use curl for PlanetLab v3 deployment

19 years agoAdded 'top' output to slice reset message, added -f option to sendmail
Andy Bavier [Fri, 5 Nov 2004 20:42:54 +0000 (20:42 +0000)]
Added 'top' output to slice reset message, added -f option to sendmail

19 years agoREADME file
Andy Bavier [Fri, 29 Oct 2004 18:17:34 +0000 (18:17 +0000)]
README file

19 years agoThis commit was generated by cvs2svn to compensate for changes in r359,
Andy Bavier [Fri, 29 Oct 2004 18:03:29 +0000 (18:03 +0000)]
This commit was generated by cvs2svn to compensate for changes in r359,
which included commits to RCS files with non-trunk default branches.

20 years agoNew repository initialized by cvs2svn.
Planet-Lab Support [Wed, 4 Jun 2003 20:07:28 +0000 (20:07 +0000)]
New repository initialized by cvs2svn.