Date:      Fri, 11 Jul 1997 07:54:34 -0500 (CDT)
From:      Terry Todd <tlt@badger.tltodd.com>
To:        freebsd-questions@freebsd.org
Subject:   It did it again.
Message-ID:  <199707111254.HAA09444@badger.tltodd.com>

My system got sick again.  The symptoms were the same as before: no
TCP/IP traffic was going out or coming in.  This time I was able
to capture a fair amount of data.  I am running FreeBSD 2.1.6
and using it to host several mailing lists with the
procmail/SmartList package.  I have maxconcur set to 16 in
rc.init for SmartList.  This time I was able to send kill -1 (SIGHUP)
to the pppd process and get it to restart, which leads me to believe
this may be a bug in pppd.  A couple of other commands had been
suggested for me to run if this happened again.  I ran them, but
they just sat there and produced no output, so I ^C'ed out.  I think
lowering maxconcur would make the problem less likely to recur, but
the OS/pppd should be able to recover from this by itself.  Any
pointers or help with this problem would be appreciated.

Thanks,
Terry Todd

Here's the result of running ping just before it went totally
offline...
tlt@badger> ping ns1.megsinet.net
PING ns1.megsinet.net (208.150.60.2): 56 data bytes
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
64 bytes from 208.150.60.2: icmp_seq=0 ttl=252 time=16395.492 ms
64 bytes from 208.150.60.2: icmp_seq=1 ttl=252 time=15433.823 ms
64 bytes from 208.150.60.2: icmp_seq=2 ttl=252 time=14434.907 ms
64 bytes from 208.150.60.2: icmp_seq=3 ttl=252 time=13450.170 ms
64 bytes from 208.150.60.2: icmp_seq=4 ttl=252 time=12814.616 ms
64 bytes from 208.150.60.2: icmp_seq=5 ttl=252 time=11818.169 ms
64 bytes from 208.150.60.2: icmp_seq=6 ttl=252 time=10819.124 ms
64 bytes from 208.150.60.2: icmp_seq=7 ttl=252 time=9846.337 ms
^C
--- ns1.megsinet.net ping statistics ---
18 packets transmitted, 8 packets received, 55% packet loss
round-trip min/avg/max = 9846.337/13126.579/16395.492 ms
tlt@badger>
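
The steadily falling round-trip times above suggest the replies did not
trickle in one by one but arrived in a burst.  As a rough check, the
sketch below adds each packet's send offset to its reported RTT to
estimate when the reply came back.  It assumes ping used its default
one-second send interval, which the output does not state; the names
rtt_ms, SEND_INTERVAL_MS and arrival_times are made up for illustration.

```python
# Round-trip times (ms) copied from the ping output above, indexed by icmp_seq.
rtt_ms = [16395.492, 15433.823, 14434.907, 13450.170,
          12814.616, 11818.169, 10819.124, 9846.337]

# Assumption: ping sent one request per second (its default interval).
SEND_INTERVAL_MS = 1000.0


def arrival_times(rtts, interval_ms=SEND_INTERVAL_MS):
    """Return when each reply arrived, in ms after the first request was sent."""
    return [seq * interval_ms + rtt for seq, rtt in enumerate(rtts)]


arrivals = arrival_times(rtt_ms)
spread_ms = max(arrivals) - min(arrivals)

for seq, t in enumerate(arrivals):
    print(f"icmp_seq={seq} reply arrived at about {t / 1000:.2f} s")
print(f"all replies arrived within {spread_ms:.0f} ms of each other")
```

Under that assumption the eight replies land within about half a second
of each other, roughly 16.4 to 16.8 seconds after the first request.
That looks more like traffic being held somewhere and then released at
once than like a uniformly slow link.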

and after it was dead....
(after it gets in this state it will never recover)
tlt@badger> ping ns1.megsinet.net
PING ns1.megsinet.net (208.150.60.2): 56 data bytes
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
ping: sendto: No buffer space available
ping: wrote ns1.megsinet.net 64 chars, ret=-1
^C
--- ns1.megsinet.net ping statistics ---
5 packets transmitted, 0 packets received, 100% packet loss
tlt@badger>

Here's the result of various netstat commands...
netstat -m
381 mbufs in use:
	151 mbufs allocated to data
	156 mbufs allocated to packet headers
	68 mbufs allocated to protocol control blocks
	6 mbufs allocated to socket names and addresses
65/124 mbuf clusters in use
295 Kbytes allocated to network (60% in use)
0 requests for memory denied
0 requests for memory delayed
0 calls to protocol drain routines
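
The netstat -m figures above can be cross-checked with a little
arithmetic.  The sketch below assumes 128-byte mbufs and 2048-byte mbuf
clusters; those sizes are not shown in the output and are labeled as
assumptions in the code (MSIZE and MCLBYTES).

```python
# Figures copied from the netstat -m output above.
mbufs_by_use = {
    "data": 151,
    "packet headers": 156,
    "protocol control blocks": 68,
    "socket names and addresses": 6,
}
mbufs_total = 381
clusters_in_use, clusters_allocated = 65, 124

# Assumptions (not shown in the output): 128-byte mbufs, 2048-byte clusters.
MSIZE = 128
MCLBYTES = 2048

# Memory allocated to the network versus memory actually in use, in bytes.
allocated = mbufs_total * MSIZE + clusters_allocated * MCLBYTES
in_use = mbufs_total * MSIZE + clusters_in_use * MCLBYTES

kbytes_allocated = allocated // 1024
percent_in_use = 100 * in_use // allocated

print(sum(mbufs_by_use.values()) == mbufs_total)
print(f"{kbytes_allocated} Kbytes allocated, {percent_in_use}% in use")
```

With those sizes the arithmetic matches the reported 295 Kbytes and 60%.
Together with zero denied or delayed requests, this suggests the mbuf
pool was not exhausted when the snapshot was taken, so the "No buffer
space available" error may have another cause, for example a full
interface output queue.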

netstat -nr
Routing tables

Internet:
Destination        Gateway            Flags     Refs     Use     Netif Expire
default            208.133.80.1       UGc       114    61968      ppp0
127.0.0.1          127.0.0.1          UH          3   659000       lo0
208.133.80.1       208.133.92.209     UH        114        0      ppp0
208.133.92.208/29  link#1             UC          0        0 
208.133.92.211     0:60:97:26:50:b5   UHLW        2     4684       ed0   1104
208.133.92.212     0:20:af:d8:a:4e    UHLW        0    11123       ed0   1066

netstat -nrs
routing:
	0 bad routing redirects
	0 dynamically created routes
	0 new gateways due to redirects
	546 destinations found unreachable
	0 uses of a wildcard route

netstat -s
ip:
	2275262 total packets received
	0 bad header checksums
	0 with size smaller than minimum
	0 with data size < data length
	0 with header length < data size
	0 with data length < header length
	0 with bad options
	0 with incorrect version number
	172 fragments received
	0 fragments dropped (dup or out of space)
	0 fragments dropped after timeout
	86 packets reassembled ok
	2141054 packets for this host
	14278 packets for unknown/unsupported protocol
	118491 packets forwarded
	940 packets not forwardable
	0 redirects sent
	2135189 packets sent from this host
	39 packets sent with fabricated ip header
	0 output packets dropped due to no bufs, etc.
	272 output packets discarded due to no route
	0 output datagrams fragmented
	0 fragments created
	0 datagrams that can't be fragmented
icmp:
	5296 calls to icmp_error
	59 errors not generated 'cuz old message was icmp
	Output histogram:
		echo reply: 731
		destination unreachable: 3960
		source quench: 858
		time exceeded: 413
	77 messages with bad code fields
	0 messages < minimum length
	0 bad checksums
	0 messages with bad length
	Input histogram:
		echo reply: 120
		destination unreachable: 13277
		source quench: 8
		echo: 731
		time exceeded: 1012
	731 message responses generated
igmp:
	0 messages received
	0 messages received with too few bytes
	0 messages received with bad checksum
	0 membership queries received
	0 membership queries received with invalid field(s)
	0 membership reports received
	0 membership reports received with invalid field(s)
	0 membership reports received for groups to which we belong
	0 membership reports sent
tcp:
	1322206 packets sent
		785862 data packets (271454502 bytes)
		42641 data packets (26059172 bytes) retransmitted
		2991 resends initiated by MTU discovery
		354992 ack-only packets (98931 delayed)
		0 URG only packets
		1374 window probe packets
		3386 window update packets
		156471 control packets
	1410062 packets received
		853894 acks (for 272958170 bytes)
		210110 duplicate acks
		0 acks for unsent data
		600327 packets (45329669 bytes) received in-sequence
		92607 completely duplicate packets (4754201 bytes)
		0 old duplicate packets
		376 packets with some dup. data (36534 bytes duped)
		71558 out-of-order packets (635751 bytes)
		5 packets (0 bytes) of data after window
		0 window probes
		3764 window update packets
		322 packets received after close
		753 discarded for bad checksums
		0 discarded for bad header offset fields
		0 discarded because packet too short
	82635 connection requests
	7639 connection accepts
	43 bad connection attempts
	0 listen queue overflows
	74818 connections established (including accepts)
	90155 connections closed (including 3183 drops)
		2049 connections updated cached RTT on close
		2049 connections updated cached RTT variance on close
		636 connections updated cached ssthresh on close
	14961 embryonic connections dropped
	728447 segments updated rtt (of 754105 attempts)
	47182 retransmit timeouts
		62 connections dropped by rexmit timeout
	1428 persist timeouts
		0 connections dropped by persist timeout
	6441 keepalive timeouts
		11 keepalive probes sent
		6430 connections dropped by keepalive
	90173 correct ACK header predictions
	263041 correct data packet header predictions
udp:
	730122 datagrams received
	0 with incomplete header
	1 with bad data length field
	3 with bad checksum
	3943 dropped due to no socket
	7200 broadcast/multicast datagrams dropped due to no socket
	0 dropped due to full socket buffers
	403651 not for hashed pcb
	718975 delivered
	737376 datagrams output
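
A couple of rough health ratios can be pulled from the tcp: counters
above.  This is only a sketch: the counters are cumulative since boot,
so they describe the whole uptime rather than the moment of failure, and
the dictionary keys and the pct helper are names made up for
illustration.

```python
# Counters copied from the tcp: section of the netstat -s output above.
tcp = {
    "data_packets_sent": 785862,
    "data_packets_retransmitted": 42641,
    "connection_requests": 82635,
    "embryonic_connections_dropped": 14961,
}


def pct(part, whole):
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Retransmitted data packets relative to data packets sent.
retransmit_pct = pct(tcp["data_packets_retransmitted"],
                     tcp["data_packets_sent"])

# Embryonic (never fully established) connections dropped, relative to
# connection requests; a rough indicator only.
embryonic_pct = pct(tcp["embryonic_connections_dropped"],
                    tcp["connection_requests"])

print(f"retransmitted data packets: {retransmit_pct}%")
print(f"embryonic drops per connection request: {embryonic_pct}%")
```

Both ratios come out fairly high (about 5% and 18%), which fits a link
that stalls periodically, though on their own they do not show where
the stall happens.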

and the result of running ifconfig...
ifconfig -a
ed0: flags=8863<UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST> mtu 1500
	inet 208.133.92.210 netmask 0xfffffff8 broadcast 208.133.92.215
lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
	inet 127.0.0.1 netmask 0xff000000 
ppp0: flags=8051<UP,POINTOPOINT,RUNNING,MULTICAST> mtu 1524
	inet 208.133.92.209 --> 208.133.80.1 netmask 0xfffffff8 
sl0: flags=c010<POINTOPOINT,LINK2,MULTICAST> mtu 552
tun0: flags=8010<POINTOPOINT,MULTICAST> mtu 1500

and the result of pstat....
pstat -s
Device      512-blocks     Used    Avail Capacity  Type
/dev/wd0s1b     286624        0   286496     0%    Interleaved
/dev/vn0c       262144        0   262016     0%    Interleaved
Total           548512        0   548512     0%

here's vmstat....
vmstat -s
  5427559 cpu context switches
273249653 device interrupts
  3525240 software interrupts
 20682765 traps
 60302738 system calls
        0 swap pager pageins
        0 swap pager pages paged in
        0 swap pager pageouts
        0 swap pager pages paged out
      417 vnode pager pageins
     1775 vnode pager pages paged in
        0 vnode pager pageouts
        0 vnode pager pages paged out
        0 page daemon wakeups
        0 pages examined by the page daemon
   264612 pages reactivated
  4582333 copy-on-write faults
  5736908 zero fill pages zeroed
        2 intransit blocking page faults
 21349301 total VM faults taken
 18359250 pages freed
        0 pages freed by daemon
  8548611 pages freed by exiting processes
    11884 pages active
      558 pages inactive
     9615 pages in VM cache
     3387 pages wired down
     2241 pages free
     4096 bytes per page
  5926643 total name lookups
          cache hits (76% pos + 4% neg) system 3% per-process
          deletions 2%, falsehits 0%, toolong 0%

here's what was running at the time...
ps aux
USER       PID %CPU %MEM   VSZ  RSS  TT  STAT STARTED       TIME COMMAND
tlt      27264  0.0  0.3   500  312  p0  R+   10:25PM    0:00.01 ps -aux
root         1  0.0  0.2   436  208  ??  Is   29Jun97    0:10.54 /sbin/init --
root         2  0.0  0.0     0   12  ??  DL   29Jun97    0:00.00  (pagedaemon)
root         3  0.0  0.0     0   12  ??  DL   29Jun97    0:00.00  (vmdaemon)
root         4  0.0  0.0     0   12  ??  DL   29Jun97   19:19.57  (update)
root        22  0.0  0.1   216   80  ??  Is   29Jun97    0:00.01 adjkerntz -i
root        54  0.0  0.2   192  244  ??  Is   29Jun97    0:09.64 routed -q
root        70  0.0  0.3   192  324  ??  Ss   29Jun97    1:30.04 syslogd
daemon      78  0.0  0.2   176  240  ??  Is   29Jun97    0:00.01 portmap
root        88  0.0  0.3   220  308  ??  Is   29Jun97    0:18.43 inetd
root        95  0.0  0.4   432  472  ??  Ss   29Jun97    3:42.17 /usr/local/htt
nobody      97  0.0  0.6   484  612  ??  I    29Jun97    0:00.22 /usr/local/htt
nobody      98  0.0  0.6   484  612  ??  I    29Jun97    0:00.20 /usr/local/htt
nobody      99  0.0  0.5   484  608  ??  I    29Jun97    0:00.24 /usr/local/htt
nobody     100  0.0  0.6   496  612  ??  I    29Jun97    0:00.27 /usr/local/htt
nobody     101  0.0  0.6   496  616  ??  I    29Jun97    0:00.24 /usr/local/htt
root       102  0.0  0.3   268  320  ??  Is   29Jun97    0:18.57 cron
root       109  0.0  0.3   560  340  ??  Is   29Jun97    0:20.92 sendmail: acce
root       155  0.0  0.4   156  480  v2  Is+  29Jun97    0:00.02 /usr/libexec/g
root       198  0.0  0.5   260  600  a2  Is+  29Jun97    0:00.06 /usr/sbin/pppd
nobody   11830  0.0  0.6   496  620  ??  I    29Jun97    0:00.27 /usr/local/htt
nobody    1329  0.0  0.6   484  612  ??  I     1Jul97    0:00.18 /usr/local/htt
nobody   11879  0.0  0.5   484  604  ??  I    Mon12AM    0:00.09 /usr/local/htt
nobody   11880  0.0  0.6   496  616  ??  I    Mon12AM    0:00.13 /usr/local/htt
root      4319  0.0  1.1  1176 1192  ??  Ss   Tue06AM    1:58.25 named -b /etc/
root      4228  0.0  0.5   156  524  v0  Is+  Wed07PM    0:00.02 /usr/libexec/g
tlt       8188  0.0  0.6   344  708  v1  Is+  10:48PM    0:00.32 -ksh (ksh)
root     22118  0.0  1.0   752 1148  ??  I     5:44PM    0:00.99 sendmail: RAA2
root     22213  0.0  1.0   760 1140  ??  I     5:50PM    0:00.98 sendmail: RAA2
root     22255  0.0  1.0   736 1140  ??  I     5:50PM    0:00.56 sendmail: RAA2
root     22715  0.0  1.0   764 1124  ??  I     6:46PM    0:00.82 sendmail: SAA2
root     22717  0.0  1.0   752 1128  ??  I     6:46PM    0:00.81 sendmail: SAA2
root     23139  0.0  1.0   760 1144  ??  I     6:59PM    0:00.80 sendmail: SAA2
root     23141  0.0  1.0   744 1140  ??  I     6:59PM    0:00.70 sendmail: SAA2
root     23280  0.0  1.0   760 1120  ??  S     7:10PM    0:00.83 sendmail: TAA2
root     23282  0.0  1.0   744 1120  ??  I     7:10PM    0:00.70 sendmail: TAA2
root     23530  0.0  1.0   756 1140  ??  I     7:13PM    0:00.72 sendmail: TAA2
root     23532  0.0  1.0   744 1140  ??  I     7:13PM    0:00.71 sendmail: TAA2
root     23631  0.0  1.0   760 1128  ??  S     7:22PM    0:00.76 sendmail: TAA2
root     23633  0.0  1.0   744 1108  ??  I     7:22PM    0:00.64 sendmail: TAA2
root     23727  0.0  1.0   756 1140  ??  S     7:28PM    0:00.69 sendmail: TAA2
root     23729  0.0  1.0   744 1140  ??  I     7:28PM    0:00.69 sendmail: TAA2
root     23873  0.0  1.0   744 1120  ??  S     7:47PM    0:00.63 sendmail: TAA2
root     23876  0.0  1.0   736 1112  ??  I     7:47PM    0:00.60 sendmail: TAA2
root     24331  0.0  1.0   740 1140  ??  I     8:17PM    0:00.51 sendmail: UAA2
root     24334  0.0  1.0   736 1140  ??  S     8:17PM    0:00.45 sendmail: UAA2
root     24335  0.0  1.0   712 1104  ??  S     8:17PM    0:00.34 sendmail: UAA2
root     24360  0.0  1.0   720 1128  ??  I     8:17PM    0:00.41 sendmail: UAA2
root     24584  0.0  1.0   756 1096  ??  I     8:24PM    0:00.57 sendmail: UAA2
root     24586  0.0  1.0   744 1116  ??  S     8:24PM    0:00.52 sendmail: UAA2
root     24666  0.0  0.8   776  860  ??  I     8:41PM    0:00.52 sendmail: UAA2
root     24685  0.0  1.0   756 1124  ??  I     8:43PM    0:00.57 sendmail: UAA2
root     24687  0.0  1.0   744 1104  ??  S     8:43PM    0:00.49 sendmail: UAA2
root     25117  0.0  1.0   700 1076  ??  I     9:01PM    0:00.28 sendmail: VAA2
root     25128  0.0  1.0   732 1104  ??  I     9:01PM    0:00.41 sendmail: VAA2
root     25131  0.0  1.0   732 1128  ??  I     9:01PM    0:00.35 sendmail: VAA2
root     25132  0.0  1.0   708 1088  ??  I     9:01PM    0:00.28 sendmail: VAA2
root     25173  0.0  1.0   700 1076  ??  I     9:01PM    0:00.27 sendmail: VAA2
root     25397  0.0  1.0   756 1124  ??  I     9:08PM    0:00.52 sendmail: VAA2
root     25399  0.0  1.0   744 1100  ??  S     9:08PM    0:00.48 sendmail: VAA2
root     25464  0.0  0.7   744  820  ??  I     9:11PM    0:00.38 sendmail: VAA2
root     25505  0.0  1.0   736 1108  ??  S     9:21PM    0:00.46 sendmail: VAA2
root     25508  0.0  1.0   736 1108  ??  I     9:21PM    0:00.34 sendmail: VAA2
root     25509  0.0  0.9   704 1060  ??  S     9:21PM    0:00.25 sendmail: VAA2
root     25901  0.0  1.0   732 1100  ??  I     9:27PM    0:00.37 sendmail: VAA2
root     25904  0.0  1.0   732 1104  ??  I     9:27PM    0:00.33 sendmail: VAA2
root     25905  0.0  0.9   704 1056  ??  S     9:27PM    0:00.25 sendmail: VAA2
root     25942  0.0  1.0   700 1064  ??  I     9:27PM    0:00.23 sendmail: VAA2
root     26150  0.0  0.9   700 1052  ??  S     9:27PM    0:00.22 sendmail: VAA2
root     26153  0.0  1.0   748 1112  ??  S     9:27PM    0:00.42 sendmail: VAA2
root     26155  0.0  1.0   740 1100  ??  I     9:27PM    0:00.40 sendmail: VAA2
root     26255  0.0  1.0   736 1108  ??  S     9:35PM    0:00.42 sendmail: VAA2
root     26258  0.0  1.0   736 1108  ??  I     9:35PM    0:00.34 sendmail: VAA2
root     26259  0.0  0.9   700 1052  ??  S     9:35PM    0:00.24 sendmail: VAA2
root     26654  0.0  0.8   776  860  ??  S     9:41PM    0:00.56 sendmail: VAA2
root     26688  0.0  0.9   700 1060  ??  I     9:51PM    0:00.20 sendmail: VAA2
root     26691  0.0  1.0   752 1096  ??  I     9:51PM    0:00.43 sendmail: VAA2
root     26693  0.0  1.0   736 1112  ??  S     9:51PM    0:00.35 sendmail: VAA2
root     26694  0.0  0.9   644 1020  ??  I     9:51PM    0:00.13 sendmail: VAA2
root     26695  0.0  1.0   700 1072  ??  I     9:51PM    0:00.21 sendmail: VAA2
root     26825  0.0  0.9   700 1052  ??  S    10:10PM    0:00.18 sendmail: WAA2
root     26827  0.0  0.9   644 1008  ??  I    10:10PM    0:00.13 sendmail: WAA2
root     26832  0.0  0.9   644 1016  ??  S    10:10PM    0:00.13 sendmail: WAA2
root     26835  0.0  0.9   644 1016  ??  I    10:10PM    0:00.14 sendmail: WAA2
root     26836  0.0  0.9   680 1044  ??  I    10:10PM    0:00.31 sendmail: WAA2
root     26838  0.0  0.9   696 1044  ??  I    10:10PM    0:00.15 sendmail: WAA2
root     26839  0.0  1.0   676 1072  ??  I    10:10PM    0:00.25 sendmail: WAA2
root     26840  0.0  0.9   700 1048  ??  I    10:10PM    0:00.17 sendmail: WAA2
root     26867  0.0  0.9   648 1008  ??  I    10:10PM    0:00.15 sendmail: WAA2
root     27068  0.0  0.8   764  852  ??  S    10:11PM    0:00.36 sendmail: VAA2
root     27069  0.0  0.9   644 1016  ??  I    10:11PM    0:00.13 sendmail: WAA2
root     27072  0.0  0.5   216  592  ??  S    10:13PM    0:10.99 telnetd
tlt      27073  0.0  0.6   348  680  p0  Ss   10:13PM    0:00.58 -ksh (ksh)
root     27114  0.0  0.4   600  472  ??  I    10:22PM    0:00.02 sendmail: serv
root     27256  0.0  0.4   600  484  ??  I    10:24PM    0:00.02 sendmail: serv
root         0  0.0  0.0     0    0  ??  DLs  -          0:00.00  (swapper)

Here's my mailq, shortened for the sake of brevity...
		Mail Queue (309 requests)
--Q-ID-- --Size-- -----Q-Time----- ------------Sender/Recipient------------
WAA27565*     318 Thu Jul 10 22:30 rcg-forum@tltodd.com
				   Eric.Walter@quickmail.llnl.gov
WAA26833      230 Thu Jul 10 22:10 rcg-forum@tltodd.com
                 (host map: lookup (hyphe.com): deferred)
				   jutson@hyphe.com
VAA26253*     318 Thu Jul 10 21:35 rcg-forum@tltodd.com
                 (wat@uniserve.com,ctolmie@uniserve.com... timeout waiting for)
				   ctolmie@uniserve.com
				   wat@uniserve.com
VAA26257*     318 Thu Jul 10 21:35 rcg-forum@tltodd.com
                 (dbedwell@shastalink.k12.ca.us... timeout waiting for input d)
				   dbedwell@shastalink.k12.ca.us
				   jhendric@cabin.llcc.cc.il.us
.........much more snipped off here.....
