monitor.git
14 years agochanges for 3.0 1.0 1.0
Stephen Soltesz [Mon, 1 Jun 2009 19:00:19 +0000 (19:00 +0000)]
changes for 3.0

14 years agotypo
Stephen Soltesz [Fri, 15 May 2009 23:22:36 +0000 (23:22 +0000)]
typo

14 years agoadd a better bmlog history ; better late than never.
Stephen Soltesz [Fri, 15 May 2009 23:22:21 +0000 (23:22 +0000)]
add a better bmlog history ; better late than never.
plc try to bail if we get a series of errors, but this doens't seem to work...

14 years agoreduce false exceptions that should be ignored or handled better in the code.
Stephen Soltesz [Thu, 30 Apr 2009 21:10:11 +0000 (21:10 +0000)]
reduce false exceptions that should be ignored or handled better in the code.

15 years agoTagging module Monitor - Monitor-1.0-16
Stephen Soltesz [Mon, 13 Apr 2009 19:08:27 +0000 (19:08 +0000)]
Tagging module Monitor - Monitor-1.0-16
add email exceptions.
add better timeouts for commands even once ssh has opened a connection

15 years agoadded two new sequences to rins a node if not installed.
Stephen Soltesz [Fri, 27 Mar 2009 17:09:51 +0000 (17:09 +0000)]
added two new sequences to rins a node if not installed.

15 years agocatch the 'cable error' status in ipals
Stephen Soltesz [Tue, 17 Mar 2009 18:58:23 +0000 (18:58 +0000)]
catch the 'cable error' status in ipals

15 years agomore meaningful exception messages to help track down problems.
Stephen Soltesz [Mon, 16 Mar 2009 16:42:06 +0000 (16:42 +0000)]
more meaningful exception messages to help track down problems.

15 years agowait a day before rebooting a node.
Stephen Soltesz [Fri, 13 Mar 2009 22:32:08 +0000 (22:32 +0000)]
wait a day before rebooting a node.
add default message for grouprins

15 years agomerge with changes to 2.0 branch, since it will help with a timely completion.
Stephen Soltesz [Wed, 11 Mar 2009 21:15:14 +0000 (21:15 +0000)]
Merge ... changes to 2.0 branch, since it will help with a timely completion.

15 years agoadd email_exception() to all except: statements.
Stephen Soltesz [Tue, 10 Mar 2009 20:25:50 +0000 (20:25 +0000)]
add email_exception() to all except: statements.

15 years agoTagging module Monitor - Monitor-1.0-15
Stephen Soltesz [Thu, 26 Feb 2009 16:53:43 +0000 (16:53 +0000)]
Tagging module Monitor - Monitor-1.0-15
fixed bug that prevented penalties from escalating and PIs from getting
emails.

15 years agofix bug that prevented penalty escallation from occurring.
Stephen Soltesz [Mon, 23 Feb 2009 22:33:01 +0000 (22:33 +0000)]
fix bug that prevented penalty escallation from occurring.

15 years agoTagging module Monitor - Monitor-1.0-14
Stephen Soltesz [Fri, 20 Feb 2009 21:34:26 +0000 (21:34 +0000)]
Tagging module Monitor - Monitor-1.0-14
updated printbadnodes.py to include diff_time()
changed eamils to use literal addresses not tech-* and pi-* aliases
updates to automation script to prevent false errors.

15 years agotypos and wrong variable names.
Stephen Soltesz [Fri, 13 Feb 2009 23:35:45 +0000 (23:35 +0000)]
typos and wrong variable names.

15 years agoerror caused by older versions of python on newer code.
Stephen Soltesz [Fri, 13 Feb 2009 17:24:28 +0000 (17:24 +0000)]
error caused by older versions of python on newer code.

15 years agomissed a formatting char.
Stephen Soltesz [Wed, 26 Nov 2008 20:04:45 +0000 (20:04 +0000)]
missed a formatting char.

15 years agoaddress issues found on TP myplc. default values in getsshkeys, generalize
Stephen Soltesz [Tue, 25 Nov 2008 22:22:13 +0000 (22:22 +0000)]
address issues found on TP myplc.  default values in getsshkeys, generalize
email addresses in bootman, and add extra param to monitor-default.conf

15 years agotweak to allow for whitelisted nodes.
Stephen Soltesz [Wed, 12 Nov 2008 22:49:38 +0000 (22:49 +0000)]
tweak to allow for whitelisted nodes.

15 years agoremoving an inappropriate import
Guthemberg Silvestre [Mon, 10 Nov 2008 04:09:03 +0000 (04:09 +0000)]
removing an inappropriate import

15 years agolast contact information added to python html output
Guthemberg Silvestre [Wed, 5 Nov 2008 14:32:43 +0000 (14:32 +0000)]
last contact information added to python html output

15 years agoupdate on automated default script to became easier to be added as a cron job
Guthemberg Silvestre [Wed, 5 Nov 2008 14:31:22 +0000 (14:31 +0000)]
update on automated default script to became easier to be added as a cron job

15 years agoTagging module Monitor - Monitor-1.0-13
Stephen Soltesz [Tue, 4 Nov 2008 16:21:02 +0000 (16:21 +0000)]
Tagging module Monitor - Monitor-1.0-13

Fixed for email change in unified_model.py tweaks in others.

15 years agofixed bug in unified_model regarding the new email routines.
Stephen Soltesz [Tue, 4 Nov 2008 16:19:45 +0000 (16:19 +0000)]
fixed bug in unified_model regarding the new email routines.
two other tweaks in monitor-server.cron and added a new condition for rins in
bootman.py

15 years agoTagging module Monitor - Monitor-1.0-12
Stephen Soltesz [Wed, 29 Oct 2008 15:57:46 +0000 (15:57 +0000)]
Tagging module Monitor - Monitor-1.0-12

Improvements in web pages, and default installation options, and RT queue
setup.

Send direct emails rather than using tech- & pi- aliases

15 years agoupdates to improve generalization and auto-installation.
Stephen Soltesz [Wed, 29 Oct 2008 15:55:12 +0000 (15:55 +0000)]
updates to improve generalization and auto-installation.
fixed links in web pages
added config option for monitor-default for RT_QUEUE
changed unified_model to return literal email addresses not to use the tech- &
pi- aliases.

15 years agorestructure some of the output, tweak links, ignore comon output for myplcs.
Stephen Soltesz [Tue, 28 Oct 2008 17:52:03 +0000 (17:52 +0000)]
restructure some of the output, tweak links, ignore comon output for myplcs.

15 years agoTagging module Monitor - Monitor-1.0-11
Stephen Soltesz [Tue, 21 Oct 2008 18:48:23 +0000 (18:48 +0000)]
Tagging module Monitor - Monitor-1.0-11
send direct email, not to tech-* and pi-* aliases.

15 years agochange mode on monitor.cron. cron doesn't work if the file is executable.
Stephen Soltesz [Wed, 15 Oct 2008 16:53:39 +0000 (16:53 +0000)]
change mode on monitor.cron.  cron doesn't work if the file is executable.

15 years agoChanges emails sent from tech- and pi- aliases to the actual registered email
Stephen Soltesz [Tue, 14 Oct 2008 19:06:59 +0000 (19:06 +0000)]
Changes emails sent from tech- and pi- aliases to the actual registered email
addresses of the users.

added functions for fetching these addresses to plc.py

added a check to create act_all db if it does not already exist in
automate-default.sh script.

15 years agoTagging module Monitor - Monitor-1.0-10
Stephen Soltesz [Tue, 14 Oct 2008 17:49:32 +0000 (17:49 +0000)]
Tagging module Monitor - Monitor-1.0-10
an update to the latest tag.  looks like I actually needed to update the tags
file more than this.

15 years agotweaks to improve the automated, rpm installation of monitor-server.
Stephen Soltesz [Tue, 14 Oct 2008 17:40:19 +0000 (17:40 +0000)]
tweaks to improve the automated, rpm installation of monitor-server.

15 years agoadds checks for readonly fs, dns errors, resets message timer if
Stephen Soltesz [Mon, 29 Sep 2008 19:19:16 +0000 (19:19 +0000)]
adds checks for readonly fs, dns errors, resets message timer if
stage==improvement, and minor tweaks to nodehistory/common

15 years agoTagging module Monitor - Monitor-1.0-9
Stephen Soltesz [Thu, 25 Sep 2008 19:55:02 +0000 (19:55 +0000)]
Tagging module Monitor - Monitor-1.0-9
includes all removals of 'monitorconfig'

15 years agogenerate shell version of config options
Stephen Soltesz [Wed, 24 Sep 2008 19:29:22 +0000 (19:29 +0000)]
generate shell version of config options

15 years agoremove dependency on the deprecated module, 'monitorconfig.py'
Stephen Soltesz [Wed, 24 Sep 2008 19:17:42 +0000 (19:17 +0000)]
remove dependency on the deprecated module, 'monitorconfig.py'

15 years agoBranch 1.1 for module Monitor created from tag Monitor-1.0-8
Stephen Soltesz [Wed, 24 Sep 2008 15:49:22 +0000 (15:49 +0000)]
Branch 1.1 for module Monitor created from tag Monitor-1.0-8

15 years agoTagging module Monitor - Monitor-1.0-8
Stephen Soltesz [Wed, 24 Sep 2008 15:48:06 +0000 (15:48 +0000)]
Tagging module Monitor - Monitor-1.0-8

These are all changes in the latest Monitor code.  I will branch this version
next, before making additional large changes.

15 years agoM emailTxt.py
Stephen Soltesz [Tue, 23 Sep 2008 19:53:34 +0000 (19:53 +0000)]
M    emailTxt.py
updated description of error message from CoMon since it has changed.
M    showlatlon.py
updated hardware spec thresholds to include more machines.
M    clean_policy.py
stricter activation of the 'action-levels' that each event triggers.
Previously things were out of sorts.
M    unified_model.py
works with the 'action-level' changes above.
M    nodesets.py
M    grouprins.py
add a site option
M    nodecommon.py
add missing module
M    bootman.py
M    rtinfo.py
M    todo

15 years agoadds last_contact column
Stephen Soltesz [Tue, 23 Sep 2008 19:49:03 +0000 (19:49 +0000)]
adds last_contact column

15 years agoTagging module Monitor - Monitor-1.0-7
Thierry Parmentelat [Mon, 1 Sep 2008 12:54:52 +0000 (12:54 +0000)]
Tagging module Monitor - Monitor-1.0-7
Checkpointing current version for 4.2-rc21 - many many changes

15 years agoupdate of all changes in the last week that fine-tuned the behavior of Monitor
Stephen Soltesz [Tue, 26 Aug 2008 02:02:06 +0000 (02:02 +0000)]
update of all changes in the last week that fine-tuned the behavior of Monitor
with the new clean_policy and unified_model.

15 years ago(no commit message)
Stephen Soltesz [Fri, 15 Aug 2008 22:37:53 +0000 (22:37 +0000)]

15 years agorequire nmap, and add a place holder for the monitorconfig.php file generated
Stephen Soltesz [Fri, 15 Aug 2008 22:36:56 +0000 (22:36 +0000)]
require nmap, and add a place holder for the monitorconfig.php file generated
after monitor.conf is filled in.

15 years agowww/database.php
Stephen Soltesz [Fri, 15 Aug 2008 22:07:49 +0000 (22:07 +0000)]
www/database.php
add hack to conver monitor.conf into monitorconfig.php
findbad.py
use new util.file rather than old config.*
nodebad.py
added --site argument
testapi.py
accept site from cmd line.
nodecommon.py
added get_nodeset to consolidate the code for getting a set of nodes from
cmd line.
findbadpcu.py
added --site argument
database.py
use package's version of PHPUnserialize and PHPSerialize.
monitor-default.conf
filled in usual default values.
phpconfig.py
load monitor.conf not old monitorconfig.py
todo

15 years agouse config module rather than monitorconfig.py
Stephen Soltesz [Fri, 15 Aug 2008 21:07:45 +0000 (21:07 +0000)]
use config module rather than monitorconfig.py

15 years agofixes
Stephen Soltesz [Fri, 15 Aug 2008 21:07:23 +0000 (21:07 +0000)]
fixes

15 years agolousy brute force install of python package.
Stephen Soltesz [Fri, 15 Aug 2008 19:57:40 +0000 (19:57 +0000)]
lousy brute force install of python package.

15 years ago(no commit message)
Stephen Soltesz [Fri, 15 Aug 2008 19:57:08 +0000 (19:57 +0000)]

15 years agoadd files to repository.
Stephen Soltesz [Fri, 15 Aug 2008 19:56:42 +0000 (19:56 +0000)]
add files to repository.

15 years agowww/printbadnodes.py
Stephen Soltesz [Fri, 15 Aug 2008 19:13:35 +0000 (19:13 +0000)]
www/printbadnodes.py
www/runlevels.py
use the new site-package module to pull in configuration information and
locate database files.

config.py
uses ConfigParser now.

nodequery.py
doesn't unconfitionally load findbad db.  sets up fbpcus for reboot module
printbadcsv.py
pull from monitor module for database and configuration
findbad.py
change default db name, and perform node_selects correctly with fb
fetch.py
debug statements to track parser.py error.
mailmonitor.py
simplify content.

nodebad.py
remove unnecessary db loads, and import from 'const' module.
parser.py
set nodeselect default to None rather than 'False'.
expected changes to parse_args() based on new config.py
pcubad.py
import constants from correct module
clean_policy.py
don't import from www modules
don't import from policy module
unified_model.py
remove unconditional db load
add two definitions from nodecommon and model.py
grouprins.py
use normal config.py
fix pcu identification logic.  really need to have a better fix for this.
monitor-server.spec
add requirements, the monitor python module,
nodecommon.py
moved node_end_record to unified_model.py
bootman.py
load from 'const.py'
additional boot sequence ids.
policy.py
removed various functions for clean policy.
findbadpcu.py
update use of parser module
nodeinfo.py
remove unconditional loads
add node.last_changed output
database.py
conflate config and monitorconfig
reboot.py
remove unconditional load
monitor_policy.py
add funcs from policy.py

15 years agothirdparty package for creating threadpools.
Stephen Soltesz [Thu, 14 Aug 2008 21:12:59 +0000 (21:12 +0000)]
thirdparty package for creating threadpools.

15 years agorename to default value.
Stephen Soltesz [Thu, 14 Aug 2008 20:44:06 +0000 (20:44 +0000)]
rename to default value.

15 years agodefault config file
Stephen Soltesz [Thu, 14 Aug 2008 20:43:28 +0000 (20:43 +0000)]
default config file

15 years agoa directory for the python module files. this will help separate libraries
Stephen Soltesz [Wed, 13 Aug 2008 18:27:29 +0000 (18:27 +0000)]
a directory for the python module files.  this will help separate libraries
from commands, as well as make them globally accessible for www scripts as
well as command line cmds.

15 years agoout.
Stephen Soltesz [Wed, 13 Aug 2008 18:19:00 +0000 (18:19 +0000)]
out.

15 years agosvn renamed it.
Stephen Soltesz [Wed, 13 Aug 2008 18:18:42 +0000 (18:18 +0000)]
svn renamed it.

15 years agochanged my mind.
Stephen Soltesz [Wed, 13 Aug 2008 18:17:27 +0000 (18:17 +0000)]
changed my mind.

15 years agorename the old 'monitor.py' script.
Stephen Soltesz [Wed, 13 Aug 2008 18:17:14 +0000 (18:17 +0000)]
rename the old 'monitor.py' script.

15 years agotweaks
Stephen Soltesz [Tue, 12 Aug 2008 19:59:30 +0000 (19:59 +0000)]
tweaks

15 years agoadded a new type of disk error pattern.
Stephen Soltesz [Tue, 12 Aug 2008 19:59:10 +0000 (19:59 +0000)]
added a new type of disk error pattern.

15 years agospecifies a different tmpdir for g++ to use; and completes build.
Stephen Soltesz [Tue, 12 Aug 2008 19:57:58 +0000 (19:57 +0000)]
specifies a different tmpdir for g++ to use; and completes build.

15 years agoTry to build the AMT command. I expect the /tmp fs to be too small.
Stephen Soltesz [Tue, 12 Aug 2008 19:38:23 +0000 (19:38 +0000)]
Try to build the AMT command. I expect the /tmp fs to be too small.

15 years agono need to do this during make
Stephen Soltesz [Tue, 12 Aug 2008 19:24:18 +0000 (19:24 +0000)]
no need to do this during make

15 years agoabsolute version of RT for reference.
Stephen Soltesz [Mon, 11 Aug 2008 21:30:26 +0000 (21:30 +0000)]
absolute version of RT for reference.

15 years agodoes not belong in svn any more
Stephen Soltesz [Mon, 11 Aug 2008 21:24:30 +0000 (21:24 +0000)]
does not belong in svn any more

15 years agodoesn't belong in svn any more.
Stephen Soltesz [Mon, 11 Aug 2008 21:21:45 +0000 (21:21 +0000)]
doesn't belong in svn any more.

15 years agodefault automation script for running Monitor
Stephen Soltesz [Mon, 11 Aug 2008 21:21:15 +0000 (21:21 +0000)]
default automation script for running Monitor

15 years agoTagging module Monitor - Monitor-1.0-6
Stephen Soltesz [Mon, 11 Aug 2008 19:21:40 +0000 (19:21 +0000)]
Tagging module Monitor - Monitor-1.0-6
This is a major tag of every thing.  probably needs a very different release
number.

15 years agofix problem with spec file error. didn't reference an installed file in
Stephen Soltesz [Mon, 11 Aug 2008 15:44:06 +0000 (15:44 +0000)]
fix problem with spec file error.  didn't reference an installed file in
%files section.

15 years agoupdates to monitor-server.spec and notes in 'todo' based on first attempt to
Stephen Soltesz [Fri, 8 Aug 2008 22:37:35 +0000 (22:37 +0000)]
updates to monitor-server.spec and notes in 'todo' based on first attempt to
install and run this package within a myplc.

15 years agocreate a php config file from the python config file monitorconfig.py
Stephen Soltesz [Fri, 8 Aug 2008 19:28:54 +0000 (19:28 +0000)]
create a php config file from the python config file monitorconfig.py

15 years agoupdate spec file; I will test it within a myplc
Stephen Soltesz [Fri, 8 Aug 2008 19:28:12 +0000 (19:28 +0000)]
update spec file; I will test it within a myplc

15 years agosimple script to verify basic sanity of the api before proceeding with the
Stephen Soltesz [Fri, 8 Aug 2008 18:04:18 +0000 (18:04 +0000)]
simple script to verify basic sanity of the api before proceeding with the
collection of node information

15 years agoThe parser module, derived from config.py
Stephen Soltesz [Fri, 8 Aug 2008 18:03:39 +0000 (18:03 +0000)]
The parser module, derived from config.py

15 years agoignore pyc
Stephen Soltesz [Fri, 8 Aug 2008 18:02:48 +0000 (18:02 +0000)]
ignore pyc

15 years agofile operations for reading and writing lists of nodes/sites/pcus, etc.
Stephen Soltesz [Fri, 8 Aug 2008 18:02:12 +0000 (18:02 +0000)]
file operations for reading and writing lists of nodes/sites/pcus, etc.

15 years agosplit config.py and parser.py into two modules. updated all files to use the
Stephen Soltesz [Fri, 8 Aug 2008 18:01:33 +0000 (18:01 +0000)]
split config.py and parser.py into two modules.  updated all files to use the
parser module for option parsing and config for the resulting values.

one extra variable added to monitorconfig-default.py

15 years agoshow days up to 14, display time up or down in nodegroup_dislplay
Stephen Soltesz [Mon, 4 Aug 2008 21:56:36 +0000 (21:56 +0000)]
show days up to 14, display time up or down in nodegroup_dislplay

15 years agouse new monitorconfig.py format
Stephen Soltesz [Mon, 4 Aug 2008 21:53:47 +0000 (21:53 +0000)]
use new monitorconfig.py format

15 years agono explicit auth structures in python to enable easier parsing by bash and php
Stephen Soltesz [Mon, 4 Aug 2008 21:53:26 +0000 (21:53 +0000)]
no explicit auth structures in python to enable easier parsing by bash and php

15 years agoadd paths based on monitorconfig values
Stephen Soltesz [Mon, 4 Aug 2008 21:42:51 +0000 (21:42 +0000)]
add paths based on monitorconfig values

15 years agouse new name of data access module.
Stephen Soltesz [Mon, 4 Aug 2008 16:45:41 +0000 (16:45 +0000)]
use new name of data access module.

15 years agotake out of repository.
Stephen Soltesz [Mon, 4 Aug 2008 16:44:53 +0000 (16:44 +0000)]
take out of repository.

15 years agorenamed php version of data access module
Stephen Soltesz [Mon, 4 Aug 2008 16:43:23 +0000 (16:43 +0000)]
renamed php version of data access module

15 years agoignore line update.
Stephen Soltesz [Mon, 4 Aug 2008 16:22:09 +0000 (16:22 +0000)]
ignore line update.

15 years agoChanged 'import auth' statements to use plc.py or monitorconfig.py
Stephen Soltesz [Mon, 4 Aug 2008 16:18:20 +0000 (16:18 +0000)]
Changed 'import auth' statements to use plc.py or monitorconfig.py

15 years agoAdded a check for bad dns on the node that prevents bootmanager from booting.
Stephen Soltesz [Mon, 4 Aug 2008 15:32:45 +0000 (15:32 +0000)]
Added a check for bad dns on the node that prevents bootmanager from booting.

added a findbad.py check after running grouprins in automate_pl03.sh

additional todos.

15 years agowrong path on install
Stephen Soltesz [Fri, 1 Aug 2008 22:09:25 +0000 (22:09 +0000)]
wrong path on install

15 years agohandle monitorconfig.py correctly
Stephen Soltesz [Fri, 1 Aug 2008 22:08:06 +0000 (22:08 +0000)]
handle monitorconfig.py correctly

15 years agolatest addition of /var/lib/ for current and archived data files
Stephen Soltesz [Fri, 1 Aug 2008 22:03:31 +0000 (22:03 +0000)]
latest addition of /var/lib/ for current and archived data files

15 years agoThis commit changes the 'soltesz.py' module into 'moncommands.py' and
Stephen Soltesz [Fri, 1 Aug 2008 20:48:32 +0000 (20:48 +0000)]
This commit changes the 'soltesz.py' module into 'moncommands.py' and
'database.py'

Also, findbad*.py include a timeout that should allow them to exit even if the
'futex' bug gets hung somewhere.

Also, the mailer, rt, and others are updated to use monitorconfig.py as the
source of their username and password information rather than random files
here and there.  This also allows us to keep this information out of svn.

15 years agorenamed from 'soltesz' to reflect it's function and content. Should be able
Stephen Soltesz [Fri, 1 Aug 2008 20:37:43 +0000 (20:37 +0000)]
renamed from 'soltesz' to reflect it's function and content.  Should be able
to hide the re-implementation of some of the backend ultimately.

15 years agogone.
Stephen Soltesz [Fri, 1 Aug 2008 20:36:50 +0000 (20:36 +0000)]
gone.

15 years agoso avoid getting this messed up with my local copy all the time.
Stephen Soltesz [Fri, 1 Aug 2008 20:36:28 +0000 (20:36 +0000)]
so avoid getting this messed up with my local copy all the time.

15 years agoallow reference to the data dir and the install path.
Stephen Soltesz [Fri, 1 Aug 2008 20:34:56 +0000 (20:34 +0000)]
allow reference to the data dir and the install path.

15 years agothe propername for this file
Stephen Soltesz [Fri, 1 Aug 2008 20:31:52 +0000 (20:31 +0000)]
the propername for this file

15 years agoadding to maintain history from 'soltesz.py' but it's a bad name choice.
Stephen Soltesz [Fri, 1 Aug 2008 20:30:23 +0000 (20:30 +0000)]
adding to maintain history from 'soltesz.py' but it's a bad name choice.

15 years agolatest todo items
Stephen Soltesz [Fri, 1 Aug 2008 17:56:01 +0000 (17:56 +0000)]
latest todo items

15 years agoignore fields
Stephen Soltesz [Fri, 1 Aug 2008 17:55:25 +0000 (17:55 +0000)]
ignore fields