From owner-freebsd-current@FreeBSD.ORG Wed Oct 20 22:25:36 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id E81A116A4CF for ; Wed, 20 Oct 2004 22:25:36 +0000 (GMT) Received: from cicero2.cybercity.dk (cicero2.cybercity.dk [212.242.40.53]) by mx1.FreeBSD.org (Postfix) with ESMTP id BE52443D3F for ; Wed, 20 Oct 2004 22:25:35 +0000 (GMT) (envelope-from tom@motd.dk) Received: from bart.motd.dk (port95.ds1-ro.adsl.cybercity.dk [212.242.60.98]) by cicero2.cybercity.dk (Postfix) with ESMTP id 9C28F18F5BF for ; Thu, 21 Oct 2004 00:25:33 +0200 (CEST) Received: from localhost (localhost.motd.dk [127.0.0.1]) by bart.motd.dk (Postfix) with ESMTP id 511CE62EA for ; Thu, 21 Oct 2004 00:32:21 +0200 (CEST) Received: from bart.motd.dk ([127.0.0.1]) by localhost (bart.motd.dk [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 00850-05 for ; Thu, 21 Oct 2004 00:32:17 +0200 (CEST) Received: from home03 (unknown [192.168.10.3]) by bart.motd.dk (Postfix) with ESMTP id 65AC460D3 for ; Thu, 21 Oct 2004 00:32:17 +0200 (CEST) From: "Tom Jensen" To: Date: Thu, 21 Oct 2004 00:23:48 +0200 MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_NextPart_000_0000_01C4B704.3BEDAA00" X-Mailer: Microsoft Office Outlook, Build 11.0.6353 Thread-Index: AcS283hB6hFpZrXCQmSeYQJNDEMCfw== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1441 Message-Id: <20041020223217.65AC460D3@bart.motd.dk> X-Virus-Scanned: by amavisd-new at motd.dk Subject: Machine hangs(Beta7), only reset button works X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 20 Oct 2004 22:25:37 -0000 This is a multi-part message in MIME format. ------=_NextPart_000_0000_01C4B704.3BEDAA00 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Hi I've been seeing a pretty strange problem lately with my server. The box completely freeze typically when it's done running the first part of my backup script, resulting in no possibility to login on the console or by SSH, the freeze even happens when I'm sitting in a terminal and working. There is no indication in log files etc. about what's causing the problem and it's not breaking into debugger either :-( The backup script is really simple, creating a .tgz file of a given directory, mounting a windows share (mount_smbfs) and copying the file. The script is run by cron six times (start at the same time) in six different directories, this results in the box freezes after the tar processes finishes. Attached is the dmesg.boot and the latest top, don't know if it's any use but it's seems rather strange that a lot of processes are in a STATE usf (not sure what this means but I don't sees this when the box is running normally) The kernel is mostly a generic with the following modifications: options IPFIREWALL options IPFIREWALL_VERBOSE options IPFIREWALL_VERBOSE_LIMIT=400 options IPDIVERT options IPSEC options IPSEC_ESP options IPSEC_DEBUG device ath device ath_hal options KDB options DDB bash-2.05b# uname -a FreeBSD bart.motd.dk 5.3-BETA7 FreeBSD 5.3-BETA7 #6: Tue Oct 19 00:36:59 CEST 2004 root@bart.motd.dk:/usr/obj/usr/src/sys/GW i386 Any more info needed please let me know. Best regards - Tom ------=_NextPart_000_0000_01C4B704.3BEDAA00 Content-Type: text/plain; name="top.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="top.txt" last pid: 27710; load averages: 0.06, 4.54, 6.90 up = 1+00:00:50 00:06:27 211 processes: 1 running, 210 sleeping CPU states: 0.8% user, 0.0% nice, 0.0% system, 0.4% interrupt, 98.8% = idle Mem: 64M Active, 48M Inact, 58M Wired, 9292K Cache, 28M Buf, 568K Free Swap: 491M Total, 118M Used, 372M Free, 24% Inuse PID USERNAME PRI NICE SIZE RES STATE TIME WCPU CPU = COMMAND 806 www -4 0 110M 3764K ufs 5:55 0.00% 0.00% java 382 root 96 0 1348K 204K select 0:43 0.00% 0.00% natd 1156 tom 4 0 6092K 344K select 0:34 0.00% 0.00% sshd 50265 root 96 0 2616K 1084K RUN 0:33 0.00% 0.00% top 682 tom 96 0 6092K 632K select 0:20 0.00% 0.00% sshd 853 vscan 20 0 28440K 124K kserel 0:18 0.00% 0.00% clamd 549 dnslog -8 0 1216K 256K piperd 0:07 0.00% 0.00% = multilog 839 root 96 0 7156K 544K select 0:06 0.00% 0.00% httpd 832 vscan 4 0 31544K 808K select 0:06 0.00% 0.00% perl 532 root -8 0 1180K 184K piperd 0:05 0.00% 0.00% = readproctitle 531 root -4 0 1236K 148K ufs 0:05 0.00% 0.00% svscan 85831 vscan 4 0 32620K 12K select 0:04 0.00% 0.00% perl 87607 vscan 20 0 32676K 12K lockf 0:04 0.00% 0.00% perl 49767 root 96 0 572K 124K select 0:04 0.00% 0.00% make 1032 root 96 0 4876K 228K select 0:04 0.00% 0.00% master 1691 tom 4 0 6092K 352K select 0:03 0.00% 0.00% sshd 49992 root 4 0 1216K 104K kqread 0:03 0.00% 0.00% tail 439 root 96 0 1312K 380K select 0:03 0.00% 0.00% = syslogd 25940 root 96 0 9080K 8960K select 0:03 0.00% 0.00% make 541 dnscache 4 0 2544K 532K select 0:03 0.00% 0.00% = dnscache 1034 postfix 4 0 4968K 312K select 0:02 0.00% 0.00% qmgr 49814 root 96 0 792K 244K select 0:02 0.00% 0.00% make 1712 root -4 0 2224K 548K ufs 0:01 0.00% 0.00% bash 27424 root -8 0 1356K 688K piperd 0:01 0.00% 0.00% cron 22110 root 96 0 3148K 1484K select 0:01 0.00% 0.00% make 25339 root 96 0 3772K 3640K select 0:01 0.00% 0.00% make 1164 root 8 0 2224K 12K wait 0:01 0.00% 0.00% bash 580 root -4 0 1356K 188K ufs 0:01 0.00% 0.00% cron 27591 root -4 0 7324K 5920K ufs 0:00 0.00% 0.00% cc1 18691 root 96 0 788K 272K select 0:00 0.00% 0.00% make 718 root 8 0 2224K 12K wait 0:00 0.00% 0.00% bash 25312 root 96 0 1476K 780K select 0:00 0.00% 0.00% make 27607 root -4 0 6424K 4992K ufs 0:00 0.00% 0.00% cc1 27484 root -8 0 4884K 1872K piperd 0:00 0.00% 0.00% = sendmail 23359 root 96 0 776K 268K select 0:00 0.00% 0.00% make 27534 root -8 0 4864K 1720K piperd 0:00 0.00% 0.00% = postdrop 27646 root -4 0 5744K 4352K ufs 0:00 0.00% 0.00% cc1 26158 root 96 0 892K 768K select 0:00 0.00% 0.00% make 18707 root 96 0 552K 268K select 0:00 0.00% 0.00% make 879 root 4 0 1260K 176K select 0:00 0.00% 0.00% = couriertcpd 26624 root 96 0 972K 844K select 0:00 0.00% 0.00% make 27659 root -4 0 5220K 3712K ufs 0:00 0.00% 0.00% cc1 870 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 919 dhcpd 4 0 2252K 12K select 0:00 0.00% 0.00% dhcpd 872 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 873 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 538 root 4 0 1188K 92K select 0:00 0.00% 0.00% = supervise 868 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 866 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 869 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 537 root 4 0 1188K 92K select 0:00 0.00% 0.00% = supervise 871 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 874 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 864 root 4 0 1376K 64K select 0:00 0.00% 0.00% = authdaemond.plain 536 root 4 0 1188K 92K select 0:00 0.00% 0.00% = supervise 867 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 875 root 4 0 1420K 64K select 0:00 0.00% 0.00% = authdaemond.plain 943 root 4 0 1232K 12K select 0:00 0.00% 0.00% moused 535 root 4 0 1188K 92K select 0:00 0.00% 0.00% = supervise 18768 root 96 0 536K 264K select 0:00 0.00% 0.00% make 849 www 20 0 7312K 12K lockf 0:00 0.00% 0.00% httpd ------=_NextPart_000_0000_01C4B704.3BEDAA00 Content-Type: application/octet-stream; name="dmesg.boot" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="dmesg.boot" Copyright (c) 1992-2004 The FreeBSD Project.=0A= Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994=0A= The Regents of the University of California. All rights reserved.=0A= FreeBSD 5.3-BETA7 #6: Tue Oct 19 00:36:59 CEST 2004=0A= root@bart.motd.dk:/usr/obj/usr/src/sys/GW=0A= WARNING: debug.mpsafenet forced to 0 as ipsec requires Giant=0A= WARNING: MPSAFE network stack disabled, expect reduced performance.=0A= Timecounter "i8254" frequency 1193182 Hz quality 0=0A= CPU: Pentium III/Pentium III Xeon/Celeron (548.74-MHz 686-class CPU)=0A= Origin =3D "GenuineIntel" Id =3D 0x673 Stepping =3D 3=0A= = Features=3D0x383f9ff=0A= real memory =3D 201326592 (192 MB)=0A= avail memory =3D 187334656 (178 MB)=0A= npx0: [FAST]=0A= npx0: on motherboard=0A= npx0: INT 16 interface=0A= acpi0: on motherboard=0A= acpi0: Power Button (fixed)=0A= Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000=0A= acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0=0A= cpu0: on acpi0=0A= pcib0: port 0xcf8-0xcff on acpi0=0A= pci0: on pcib0=0A= agp0: mem = 0xf8000000-0xfbffffff at device 0.0 on pci0=0A= pcib1: at device 1.0 on pci0=0A= pci1: on pcib1=0A= pci1: at device 0.0 (no driver attached)=0A= isab0: at device 7.0 on pci0=0A= isa0: on isab0=0A= atapci0: port = 0x1420-0x142f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 7.1 on pci0=0A= ata0: channel #0 on atapci0=0A= ata1: channel #1 on atapci0=0A= uhci0: port 0x1400-0x141f irq = 9 at device 7.2 on pci0=0A= uhci0: [GIANT-LOCKED]=0A= usb0: on uhci0=0A= usb0: USB revision 1.0=0A= uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1=0A= uhub0: 2 ports with 2 removable, self powered=0A= pci0: at device 7.3 (no driver attached)=0A= pci0: at device 12.0 (no driver attached)=0A= xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x1000-0x107f mem = 0xf4018000-0xf401807f irq 11 at device 13.0 on pci0=0A= miibus0: on xl0=0A= xlphy0: <3c905C 10/100 internal PHY> on miibus0=0A= xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto=0A= xl0: Ethernet address: 00:50:04:d0:46:f5=0A= xl0: [GIANT-LOCKED]=0A= pci0: at device 16.0 (no driver attached)=0A= xl1: <3Com 3c905C-TX Fast Etherlink XL> port 0x1080-0x10ff mem = 0xf4018400-0xf401847f irq 10 at device 17.0 on pci0=0A= miibus1: on xl1=0A= xlphy1: <3c905C 10/100 internal PHY> on miibus1=0A= xlphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto=0A= xl1: Ethernet address: 00:50:04:d6:d6:3e=0A= xl1: [GIANT-LOCKED]=0A= fdc0: port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on = acpi0=0A= fdc0: [FAST]=0A= fd0: <1440-KB 3.5" drive> on fdc0 drive 0=0A= sio0 port 0x3f8-0x3ff irq 4 on acpi0=0A= sio0: type 16550A=0A= ppc0 port 0x378-0x37b irq 7 on acpi0=0A= ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode=0A= ppbus0: on ppc0=0A= plip0: on ppbus0=0A= lpt0: on ppbus0=0A= lpt0: Interrupt-driven port=0A= ppi0: on ppbus0=0A= atkbdc0: port 0x64,0x60 irq 1 on acpi0=0A= atkbd0: irq 1 on atkbdc0=0A= kbd0 at atkbd0=0A= atkbd0: [GIANT-LOCKED]=0A= psm0: irq 12 on atkbdc0=0A= psm0: [GIANT-LOCKED]=0A= psm0: model IntelliMouse, device ID 3=0A= orm0: at iomem = 0xe4000-0xeffff,0xe0000-0xe3fff,0xca000-0xca7ff,0xc9800-0xc9fff,0xc0000-0= xc97ff on isa0=0A= pmtimer0 on isa0=0A= sc0: at flags 0x100 on isa0=0A= sc0: VGA <16 virtual consoles, flags=3D0x300>=0A= sio1: configured irq 3 not in bitmap of probed irqs 0=0A= sio1: port may not be enabled=0A= vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0=0A= Timecounter "TSC" frequency 548738654 Hz quality 800=0A= Timecounters tick every 10.000 msec=0A= IPsec: Initialized Security Association Processing.=0A= ipfw2 initialized, divert enabled, rule-based forwarding disabled, = default to deny, logging limited to 400 packets/entry by default=0A= ad0: 8693MB [17662/16/63] at ata0-master UDMA33=0A= ata1-slave: FAILURE - SETFEATURES SET TRANSFER MODE status=3D1 = error=3D4=0A= acd0: CDROM at ata1-master UDMA33=0A= afd0: REMOVABLE at ata1-slave BIOSPIO=0A= Mounting root from ufs:/dev/ad0s1a=0A= WARNING: / was not properly dismounted=0A= WARNING: /tmp was not properly dismounted=0A= WARNING: /usr was not properly dismounted=0A= WARNING: /var was not properly dismounted=0A= ------=_NextPart_000_0000_01C4B704.3BEDAA00--