Stephen Soltesz [Mon, 17 Aug 2009 22:03:47 +0000 (22:03 +0000)]
changed 'monitordebug' to failboot
added run_level to nodebad record
added site_id to database
added DRAC6 support
added GeorgeTown PCU to BayTech to catch their wacky PCU.
add extra sequences to bootman to catch fsck error
remove setting nodes to 'disabled', leave them in failboot
send exception and Unknown Error messages to config.exception_email
disable synccheck in automate-default.sh script
Stephen Soltesz [Mon, 17 Aug 2009 21:49:31 +0000 (21:49 +0000)]
added advanced query
added simple and detailed view rather than single, 'pcuview'
simplified nodelist
clarified some labels
added error page for quickjumps without a target
added site:* and node:* queries for quickjump:
now you can see all sites that match a pattern:
site:mlab* or node:*clara*
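The site:* and node:* quickjump patterns above read like shell-style globs. A minimal sketch of that kind of matching, assuming Python's fnmatch module. The helper name quickjump_matches and the sample site and node names are hypothetical and are not taken from the Monitor code:

```python
from fnmatch import fnmatchcase


def quickjump_matches(query, sites, nodes):
    """Return the names matching a 'site:<glob>' or 'node:<glob>' query.

    Hypothetical helper; the real quickjump handler may work differently.
    """
    kind, sep, pattern = query.partition(":")
    if not sep or not pattern:
        # Mirrors the "quickjump without a target" error case.
        raise ValueError("quickjump query needs a target, e.g. site:mlab*")
    if kind == "site":
        candidates = sites
    elif kind == "node":
        candidates = nodes
    else:
        raise ValueError("unknown quickjump prefix: %r" % kind)
    # fnmatchcase does case-sensitive glob matching on plain strings.
    return [name for name in candidates if fnmatchcase(name, pattern)]


# Made-up sample data for illustration only.
sites = ["mlab1", "mlab2", "princeton"]
nodes = ["planetlab1.santaclara.example.org", "node1.example.org"]

print(quickjump_matches("site:mlab*", sites, nodes))
print(quickjump_matches("node:*clara*", sites, nodes))
```

A query with no target, such as "site:", raises ValueError here; the error page mentioned above would be the web-facing equivalent.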
Stephen Soltesz [Tue, 28 Jul 2009 22:22:07 +0000 (22:22 +0000)]
added firewall checks and notices
added extra views for a simple and detailed view for sites
added firewall_notice to emailTxt
added extra fields for firewall, external_dns_status, as well as several other
fields reserved for future use, such as traceroute, uptime and rpms.
updated policy to act on firewalled nodes
add dependency on 'nc' (netcat) as a more reliable port-probing utility than nmap.
reorganized scanapi.py, hopefully can generalize this further in the future.
Stephen Soltesz [Tue, 28 Jul 2009 22:16:17 +0000 (22:16 +0000)]
add email notice for 'failed twice' nodes
added extra bootmanager sequence string
Stephen Soltesz [Mon, 27 Jul 2009 19:58:58 +0000 (19:58 +0000)]
an upgrade script to extend and convert existing databases
Stephen Soltesz [Mon, 27 Jul 2009 16:32:15 +0000 (16:32 +0000)]
setup default values if they're not already set.
Stephen Soltesz [Mon, 27 Jul 2009 16:23:41 +0000 (16:23 +0000)]
add a 'delete_recent' function to ActionRecord
collect 'uptime'
Stephen Soltesz [Mon, 27 Jul 2009 16:22:43 +0000 (16:22 +0000)]
make zabbix import conditional
Stephen Soltesz [Mon, 27 Jul 2009 16:22:22 +0000 (16:22 +0000)]
add code to automatically handle manual FSCK errors.
Stephen Soltesz [Mon, 27 Jul 2009 16:21:57 +0000 (16:21 +0000)]
add better node configuration message.
Stephen Soltesz [Mon, 27 Jul 2009 16:21:26 +0000 (16:21 +0000)]
ugg. output hangs on large file contents. truncate data. need a better
solution.
Stephen Soltesz [Mon, 27 Jul 2009 16:20:13 +0000 (16:20 +0000)]
enable zabbix option before importing zabbix db
Stephen Soltesz [Mon, 27 Jul 2009 16:19:22 +0000 (16:19 +0000)]
add links for length of query for nodescanhistory view
Stephen Soltesz [Thu, 23 Jul 2009 00:34:31 +0000 (00:34 +0000)]
disable zabbix_enabled flag by default
copy monitor log files for archives
Barış Metin [Tue, 21 Jul 2009 10:47:59 +0000 (10:47 +0000)]
don't create/enable X_monitor user.
Stephen Soltesz [Mon, 20 Jul 2009 23:04:43 +0000 (23:04 +0000)]
sleep could probably be reduced, but the extra newlines are required.
Stephen Soltesz [Thu, 9 Jul 2009 22:04:11 +0000 (22:04 +0000)]
add a timeout command to the client package
new boot manager sequence to bootman.py
use timeout with scanapi operations
Stephen Soltesz [Wed, 8 Jul 2009 16:41:59 +0000 (16:41 +0000)]
correct expired() logic. was reversed.
Barış Metin [Thu, 2 Jul 2009 14:27:13 +0000 (14:27 +0000)]
use sortLastContact from plekit
Stephen Soltesz [Thu, 2 Jul 2009 01:22:40 +0000 (01:22 +0000)]
give noop myops xmlrpc call an extra parameter to allow it to work with the
PLC() object which passes an auth struct to every call.
clear penalties for blacklisted sites.
fixed string in grep in RunlevelAgent
added crazy fix for new BayTech sequence.
Stephen Soltesz [Thu, 2 Jul 2009 01:20:16 +0000 (01:20 +0000)]
place more default values in /etc/planetlab/*.xml
Barış Metin [Wed, 1 Jul 2009 16:15:15 +0000 (16:15 +0000)]
export PATH for cronjob to work and redirect script's error output to the log file too
Stephen Soltesz [Wed, 1 Jul 2009 00:24:24 +0000 (00:24 +0000)]
improved post server setup code in Monitor.spec
improved blacklist.py
added module path to exception in plc.py
included correct functions file for monitor-server.init
Stephen Soltesz [Mon, 29 Jun 2009 00:12:44 +0000 (00:12 +0000)]
Tagging module Monitor - Monitor-3.0-18
bug fixes.
improved templates and views
cleaned controller code for web
added IPMI requirement to pcucontrol package.
Stephen Soltesz [Mon, 29 Jun 2009 00:10:21 +0000 (00:10 +0000)]
bug fix in summary template
new access method for fbnode
removed fs_status check b/c it caused false negatives for down nodes that
weren't due to the check hanging.
added a new reboot check.
Stephen Soltesz [Mon, 29 Jun 2009 00:05:55 +0000 (00:05 +0000)]
remove rt3 from monitor
Stephen Soltesz [Mon, 29 Jun 2009 00:05:20 +0000 (00:05 +0000)]
remove rt from monitor package.
Stephen Soltesz [Sun, 28 Jun 2009 23:58:30 +0000 (23:58 +0000)]
add an entry for IPMI in case db-config doesn't update.
Stephen Soltesz [Sun, 28 Jun 2009 23:44:32 +0000 (23:44 +0000)]
require OpenIPMI-tools to provide ipmitool command.
Stephen Soltesz [Fri, 26 Jun 2009 23:42:15 +0000 (23:42 +0000)]
make work with new libraries
Stephen Soltesz [Fri, 26 Jun 2009 23:40:44 +0000 (23:40 +0000)]
add a link to both types of node history, status and data
add a quick search at the top of sitemenu for hostnames and loginbases
Stephen Soltesz [Thu, 25 Jun 2009 23:03:12 +0000 (23:03 +0000)]
simplify pcu bad logic
ignore additional fields in sitehistory record
just show days down rather than weeks. weeks is confusing before 30 days
Stephen Soltesz [Thu, 25 Jun 2009 23:01:59 +0000 (23:01 +0000)]
added references to google gadgets
added sitesummary and summary controls for gadgets
clarified sitehistory, and other history views
simplified controllers.py in many places, trying to make node() faster.
Stephen Soltesz [Thu, 25 Jun 2009 04:45:09 +0000 (04:45 +0000)]
convert non-ascii character strings to just ascii
Stephen Soltesz [Thu, 25 Jun 2009 04:44:33 +0000 (04:44 +0000)]
update LogoLinkURL in template.
Stephen Soltesz [Thu, 25 Jun 2009 04:43:30 +0000 (04:43 +0000)]
custom templates for RT
Stephen Soltesz [Thu, 25 Jun 2009 04:43:00 +0000 (04:43 +0000)]
use https by default
add Custom Logo fields.
Stephen Soltesz [Thu, 25 Jun 2009 04:40:27 +0000 (04:40 +0000)]
remove unnecessary scrips and move queues to plc-template.
Stephen Soltesz [Thu, 25 Jun 2009 04:39:45 +0000 (04:39 +0000)]
make custom names make sense.
Stephen Soltesz [Thu, 25 Jun 2009 04:38:41 +0000 (04:38 +0000)]
add template translation on all custom templates in rtconf.d/
copy custom templates from local dir into rt3 configuration dirs
Stephen Soltesz [Wed, 24 Jun 2009 23:00:54 +0000 (23:00 +0000)]
collect CCs from original email, with extra settings in RT_SiteConfig
no output from adduserstort.pl
move some scrips and templates from initialdata to planetlab.pl
Stephen Soltesz [Wed, 24 Jun 2009 22:21:49 +0000 (22:21 +0000)]
only send unknownsequence messages to admin list
just use the PCU IP address if the hostname doesn't match the IP.
Stephen Soltesz [Wed, 24 Jun 2009 22:20:09 +0000 (22:20 +0000)]
add filters to catch google gadgets
http://host.name/monitor/gadget.xml
http://host.name/monitor/sitemonitor.xml
add googlegadget templates.
simplify controllers.py by removing old code
add action* functions to controllers.py
Stephen Soltesz [Wed, 24 Jun 2009 22:17:23 +0000 (22:17 +0000)]
fill-in action* pages
title pages correctly
remove nodeview and siteview since these are replaced by pcuview
add information to home page
Stephen Soltesz [Wed, 24 Jun 2009 20:11:00 +0000 (20:11 +0000)]
ensure that runlevelagent starts when a new package is installed.
Stephen Soltesz [Wed, 24 Jun 2009 19:18:36 +0000 (19:18 +0000)]
Repeatedly try to AuthCheck() to avoid transient DNS errors and temporary
session variables out-of-sync with plc.
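The retry described above can be sketched as a small loop. This assumes a zero-argument callable standing in for AuthCheck(); the name retry_auth_check, the attempt count, and the delay are all hypothetical and do not come from the Monitor source:

```python
import time


def retry_auth_check(auth_check, attempts=3, delay=1.0, sleep=time.sleep):
    """Call auth_check() until it succeeds or the attempts run out.

    auth_check is a hypothetical stand-in for the AuthCheck() call. Any
    exception is treated as a transient failure (DNS error, stale session).
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(attempts):
        try:
            return auth_check()
        except Exception as err:
            last_error = err
            # Wait before the next try, but not after the final one.
            if attempt < attempts - 1:
                sleep(delay)
    # Every attempt failed; surface the most recent error.
    raise last_error
```

Passing sleep as a parameter keeps the sketch easy to exercise without real delays.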
Stephen Soltesz [Mon, 22 Jun 2009 19:53:38 +0000 (19:53 +0000)]
provide sitename to person query
install updates.
Stephen Soltesz [Mon, 22 Jun 2009 19:51:09 +0000 (19:51 +0000)]
run certain scripts periodically
run certain scripts at startup
use plcquery instead of getusersfromplc
give 'priv' argument to adduserstort.pl
Stephen Soltesz [Mon, 22 Jun 2009 18:51:28 +0000 (18:51 +0000)]
simplify
Stephen Soltesz [Mon, 22 Jun 2009 18:33:05 +0000 (18:33 +0000)]
rename init to conf for initial configuration data
Stephen Soltesz [Mon, 22 Jun 2009 18:18:06 +0000 (18:18 +0000)]
allow deployment-specific settings in rtinit.d/*.pl
Stephen Soltesz [Mon, 22 Jun 2009 15:52:46 +0000 (15:52 +0000)]
typo city.
Stephen Soltesz [Mon, 22 Jun 2009 15:44:09 +0000 (15:44 +0000)]
include PlanetLab-like scrips and templates by default for all queues.
Stephen Soltesz [Sun, 21 Jun 2009 20:48:19 +0000 (20:48 +0000)]
better formatted auto-reply, with custom-field default value.
Stephen Soltesz [Sun, 21 Jun 2009 20:12:25 +0000 (20:12 +0000)]
add cron script to sync plc db and rt every hour
install cron script.
Stephen Soltesz [Sun, 21 Jun 2009 19:49:27 +0000 (19:49 +0000)]
sync users from PLC to RT db.
add sendmail configuration to monitor-rt3.init
Stephen Soltesz [Sun, 21 Jun 2009 00:15:26 +0000 (00:15 +0000)]
wrong pattern.
Stephen Soltesz [Sun, 21 Jun 2009 00:04:18 +0000 (00:04 +0000)]
cleaned and improved monitor-rt3.init
removed unnecessary info in initialdata
added ENABLED checks to init scripts
added extra xml to Monitor.spec for default setup.
this should be fine until it's ready to go into myplc
Stephen Soltesz [Sat, 20 Jun 2009 22:03:42 +0000 (22:03 +0000)]
own new files
Stephen Soltesz [Sat, 20 Jun 2009 21:57:06 +0000 (21:57 +0000)]
typo
Stephen Soltesz [Sat, 20 Jun 2009 21:54:47 +0000 (21:54 +0000)]
break apart init scripts for monitor, zabbix, and rt3
add a monitor-rt rpm package
add templates for rt configs
Stephen Soltesz [Sat, 20 Jun 2009 19:07:42 +0000 (19:07 +0000)]
directory for rt3 setup and configuration files
Stephen Soltesz [Sat, 20 Jun 2009 00:47:14 +0000 (00:47 +0000)]
added templates for emailTxt
added links in welcome page to site/node/pcus
added stub for actionsummary ; need to add links
fixed policy logic for disabled nodes
add an option to save an action record for messages that don't go through rt
Stephen Soltesz [Fri, 19 Jun 2009 03:42:29 +0000 (03:42 +0000)]
Tagging module Monitor - Monitor-3.0-17
added bootmanager log links
addressed root cause of IntegrityErrors ; big deal
adjusted templates to accommodate fix for IntegrityErrors
added session.flush() to bootman.py to write out ActionsRecords
fixed policy to either pause penalties or apply them ; not both.
Stephen Soltesz [Fri, 19 Jun 2009 03:39:59 +0000 (03:39 +0000)]
no idea how I clobbered this.
Stephen Soltesz [Fri, 19 Jun 2009 03:36:49 +0000 (03:36 +0000)]
pause penalties or apply them, not both...
plcquery tool frame.
Stephen Soltesz [Fri, 19 Jun 2009 01:44:06 +0000 (01:44 +0000)]
added bootmanager log for monitor-collected log files to be referenced by web
added keyboard interrupt exception to nodequery
addressed root cause of IntegrityErrors from lousy code in controllers.py
no elements of the Objects are assigned.
this patch addresses functionality, need to clean up for clarity.
adjusted templates to use new aggregate object of nodes, pcus, and
'pre-processed' values
updated emailTxt to use just 'pattern' in www node urls
added session.flush() to bootman.py to write out ActionsRecords
Stephen Soltesz [Thu, 18 Jun 2009 17:41:02 +0000 (17:41 +0000)]
add a directory
Stephen Soltesz [Wed, 17 Jun 2009 21:06:11 +0000 (21:06 +0000)]
Tagging module Monitor - Monitor-3.0-16
Added Rpyc from 1.0 branch.
add pcuhistory
add setup-agent for password protected keys.
other minor improvements.
Stephen Soltesz [Wed, 17 Jun 2009 20:38:04 +0000 (20:38 +0000)]
copy Rpyc from 1.0 branch into trunk
Stephen Soltesz [Wed, 17 Jun 2009 20:32:41 +0000 (20:32 +0000)]
add a directory to run automated scripts by default
add a script to setup the ssh-agent to allow for password protected keys
Stephen Soltesz [Wed, 17 Jun 2009 20:31:36 +0000 (20:31 +0000)]
add a view for pcu history.
Stephen Soltesz [Wed, 17 Jun 2009 20:29:40 +0000 (20:29 +0000)]
updating with tweaks to live deployment.
Stephen Soltesz [Wed, 17 Jun 2009 17:10:36 +0000 (17:10 +0000)]
Tagging module Monitor - Monitor-3.0-15
automate install
auto-close tickets
Stephen Soltesz [Tue, 16 Jun 2009 20:35:34 +0000 (20:35 +0000)]
enable monitor in plc-config
add easy_install packages to update critical packages for monitor
auto-close a ticket whose status was changed in RT
Stephen Soltesz [Fri, 12 Jun 2009 20:21:01 +0000 (20:21 +0000)]
Tagging module Monitor - Monitor-3.0-14
update web
update policy
added statistics dir
Stephen Soltesz [Fri, 12 Jun 2009 19:17:01 +0000 (19:17 +0000)]
convenience, maintenance scripts
Stephen Soltesz [Fri, 12 Jun 2009 19:15:21 +0000 (19:15 +0000)]
apply penalties
send notices to users.
Stephen Soltesz [Fri, 12 Jun 2009 16:27:07 +0000 (16:27 +0000)]
add scrappy statistics gathering scripts
Stephen Soltesz [Fri, 12 Jun 2009 16:22:49 +0000 (16:22 +0000)]
use 'pattern' argument to site and node pages.
add a link to ssl to allow the racadm binary to work.
add c++ compat library for racadm binary
Stephen Soltesz [Wed, 10 Jun 2009 21:54:45 +0000 (21:54 +0000)]
add plc_nodeid and plc_siteid to history records so that the web templates can
reference sites and nodes by id rather than name, since the site_pattern and
nodepattern references do not exist any more in the new 4.3 wwwgui.
Stephen Soltesz [Wed, 10 Jun 2009 18:45:09 +0000 (18:45 +0000)]
remove unnecessary information on pcuview page
add history for pcu
change 'increase penalty' to 'reduce privilege'
Stephen Soltesz [Tue, 9 Jun 2009 22:34:42 +0000 (22:34 +0000)]
fixed what looked like a typo in pculist.kid, restores 'view all'
use explicit model name, rather than translate names in pcucontrol/reboot.py
use TurboGears from f8 as base for install.
disable crazy pam module to let crond work correctly.
there is no pcu_id for messages.
run syncwithplc in automate-default
Stephen Soltesz [Mon, 8 Jun 2009 18:37:34 +0000 (18:37 +0000)]
Tagging module Monitor - Monitor-3.0-13
remove plccache from controllers, all lookups from db.
reformat emailTxt messages
updated bootstates in bootman.py
Stephen Soltesz [Sat, 6 Jun 2009 00:57:38 +0000 (00:57 +0000)]
properties and emailTxt
Stephen Soltesz [Sat, 6 Jun 2009 00:56:12 +0000 (00:56 +0000)]
actually delete site from db
Stephen Soltesz [Fri, 5 Jun 2009 20:42:35 +0000 (20:42 +0000)]
updates to emailTxt
typos in bootman and scanapi
fixed headers in nodehistory
removed missing pcu notices from policy
Stephen Soltesz [Wed, 3 Jun 2009 23:00:03 +0000 (23:00 +0000)]
controllers should allow refreshes while findall is running.
add description to RunlevelAgent
something is failing in the monitor-server-deps post install process for
easy_setup dependencies.
Stephen Soltesz [Wed, 3 Jun 2009 21:36:43 +0000 (21:36 +0000)]
stop using the cached values since they mess things up and keep the server out
of sync with the db. not sure what I was thinking ...
Stephen Soltesz [Tue, 2 Jun 2009 21:33:54 +0000 (21:33 +0000)]
Tagging module Monitor - Monitor-3.0-12
tag of latest changes.
need to test end to end.
Stephen Soltesz [Tue, 2 Jun 2009 21:30:28 +0000 (21:30 +0000)]
added comonquery command-line tool.
added flush and clear commands to the beginning of each web entry point in
controllers.py; I think this will help address the IntegrityErrors seen here
and at PLE.
moved plccache to local functions to speed invocation of some calls (nodequery)
added several tags to bootman.py to help with new 3.0 BootManager issue.
moved bootman import in monitor/database/info/interface.py due to import
error. still need to investigate this
added extra RPM checks to node environment checks. This isn't put in the db
but the log files can be queried over time.
Stephen Soltesz [Tue, 2 Jun 2009 17:57:30 +0000 (17:57 +0000)]
Something bizarre occurs with the dc7800 display after this runs. adding the
delay helps, but only slightly. Apparently, disconnecting the display cable
and reattaching it clears up the problem. No idea why.
Barış Metin [Tue, 2 Jun 2009 11:48:35 +0000 (11:48 +0000)]
in nodes page: print the status computed by monitor instead of the status retrieved from PLC
--debug lines
Thierry Parmentelat [Sat, 30 May 2009 10:27:07 +0000 (10:27 +0000)]
Tagging module Monitor - Monitor-3.0-11
big merge from the 2.0 branch
Stephen Soltesz [Thu, 28 May 2009 15:25:13 +0000 (15:25 +0000)]
merges from v2
Barış Metin [Wed, 27 May 2009 09:09:27 +0000 (09:09 +0000)]
quote values
Thierry Parmentelat [Tue, 26 May 2009 10:53:06 +0000 (10:53 +0000)]
Tagging module Monitor - Monitor-3.0-10
minor improvements in rendering with sortable tables
Barış Metin [Tue, 19 May 2009 17:21:59 +0000 (17:21 +0000)]
add bootcd version to nodes table
Barış Metin [Tue, 19 May 2009 15:36:53 +0000 (15:36 +0000)]
cosmetic fixes
Barış Metin [Tue, 19 May 2009 09:05:46 +0000 (09:05 +0000)]
Tagging module Monitor - Monitor-3.0-9