Date: Sat, 23 Oct 1999 04:44:37 -0400 (EDT) From: Mike Nowlin <mike@argos.org> To: Gong Wei <ccegongw@nus.edu.sg> Cc: freebsd-stable@freebsd.org Subject: Re: 3.3 Stable Performance Monitoring Message-ID: <Pine.LNX.4.05.9910230437200.18308-200000@jason.argos.org> In-Reply-To: <762388C091FAD01180FF00A02462137801AC5BFD@exchange.nus.edu.sg>
next in thread | previous in thread | raw e-mail | index | archive | help
[-- Attachment #1 --]
> We also have a few Solaris machines around. We've purchased a SNMP agent
> from Empire Technology (www.empiretech.com) which can report various system
> performance related parameters, like swap usage, system load, cpu
> utilization, number of open file descriptor, number of processes, etc.
>
> The bad news is that their product doesn't support FreeBSD, although it does
> support Linux. So we cannot use this tool to monitor the system
> performance. Instead, we need something else which can do roughly the same
> thing.
>
> Among so many parameters our immediate interests is the following:
> * CPU utilization, % used in Kernel space vs % used in user space
> * RAM utilization
> * SWAP utilization
> * Network bandwidth usage
> * number of file descriptors used
>
> As ususal, any hints/comments are more than welcomed. Please do mail a copy
> of your response to me directly. Thanks!
>
I've been writing a program to monitor various values dealing with SNMP --
it's not finished, but it works. Basically, you tell it what to watch,
and if the values go outside defined thresholds or certain values are/are
not met, it triggers an "alert" -- mail, paging (both TAP and SNPP), etc.
Right now, it's running under Linux with ucd-snmp, but porting it over to
FreeBSD should be simple -- the errors I'm getting are dumb ones that are
easily fixed. If anybody's interested, let me know -- it's not available
to the general public (I'm sorta embarresed by the code), but the geeks of
the world can get their hands on what I have so far by asking.....
mike
(I'll include one of the config files for your browsing and commentary.)
[-- Attachment #2 --]
# Machines at SML
#doublewalk {
# name "r2d2_processlist"
# host "r2d2.smlab.com"
# community "Mlx-20L"
# fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
# tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
# frequency 41
# mode 0
# alert "mike_pager"
#}
#doublewalk {
# name "r2d2_df"
# host "r2d2.smlab.com"
# community "Mlx-20L"
# fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
# tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
# frequency 42
# mode 0
# alert "mike_pager"
#}
doublewalk {
name "palpatine_pslist"
host "palpatine.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 41
mode 0
alert "mike_pager"
}
doublewalk {
name "palpatine_df"
host "palpatine.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 42
mode 0
alert "mike_pager"
}
doublewalk {
name "watto_pslist"
host "watto.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 41
mode 0
alert "mike_pager"
}
doublewalk {
name "watto_df"
host "watto.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 42
mode 0
alert "mike_pager"
}
# check owen's transmit.LOCK lockfile for the reporting/paging system to make
# sure it's not too long.......
reportchain {
name "owen_txlockfile"
host "owenpub.smlab.com"
community "Mlx-20L"
oidroot ".1.3.6.1.4.enterprises.ucdavis.50.101"
frequency 60
alert "mike_pager"
}
doublewalk {
name "owen_processload"
host "owenpub.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.loadtable.laentry.laerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.loadtable.laentry.laerrmessage"
frequency 51
mode 0
alert "mike_pager"
}
doublewalk {
name "owen_df"
host "owenpub.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 52
mode 0
alert "mike_pager"
alert "mike_mail"
}
doublewalk {
name "owen_processlist"
host "owenpub.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 53
mode 0
alert "mike_pager"
}
doublewalk {
name "tarkin_processload"
host "www.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.loadtable.laentry.laerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.loadtable.laentry.laerrmessage"
frequency 51
mode 0
alert "mike_pager"
}
doublewalk {
name "tarkin_df"
host "www.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 52
mode 0
alert "mike_pager"
}
doublewalk {
name "tarkin_processlist"
host "www.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 53
mode 0
alert "mike_pager"
}
# check newproc.LOCK file on xwing
reportchain {
name "xwing_faxoutlock"
host "xwing.smlab.com"
community "Mlx-20L"
oidroot ".1.3.6.1.4.enterprises.ucdavis.50.101"
frequency 65
alert "mike_pager"
}
doublewalk {
name "xwing_df"
host "xwing.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 61
mode 0
alert "mike_pager"
}
doublewalk {
name "xwing_processlist"
host "xwing.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 62
mode 0
alert "mike_pager"
}
doublewalk {
name "anakin_df"
host "anakin.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.disktable.dskentry.dskerrormsg"
frequency 71
mode 0
alert "mike_pager"
}
doublewalk {
name "anakin_processlist"
host "anakin.smlab.com"
community "Mlx-20L"
fromoid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrorflag"
tooid ".1.3.6.1.4.enterprises.ucdavis.proctable.prentry.prerrmessage"
frequency 72
mode 0
alert "mike_pager"
}
# HP switch-1.smlab.com info
valuechange {
name "switch-1_watto"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.2"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_palpatine"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.3"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_ohsosf-192"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.5"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_ohsosf-208"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.6"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_tarkin-192"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.7"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_srv-6"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.8"
frequency 60
alert "mike_pager"
}
valuechange {
name "switch-1_lab-hub"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.11"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_billing-hub"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.12"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_anakin"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.13"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_ewok"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.16"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_owen"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.17"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_wookie"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.18"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_SO-hub"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.20"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_mike-extra"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.21"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_mike-hub"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.22"
frequency 45
alert "mike_pager"
}
valuechange {
name "switch-1_mainnet"
host "switch-1.smlab.com"
community "Mlx-20L"
oid "interfaces.ifTable.ifEntry.ifOperStatus.24"
frequency 45
alert "mike_pager"
}
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.LNX.4.05.9910230437200.18308-200000>
