Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 29 Mar 2000 05:17:10 +0100 (BST)
From:      Jasper Wallace <jasper@ivision.co.uk>
To:        freebsd-stable@freebsd.org
Cc:        Nick Ludlam <nick@ivision.co.uk>, sysadmin@ivision.co.uk
Subject:   Repeted panics with 3.4
Message-ID:  <Pine.GSO.4.21.0003290507040.18891-200000@avengers.ivision.co.uk>

next in thread | raw e-mail | index | archive | help

[-- Attachment #1 --]

We've got a pair of rackmount webservers, both running 3.4, one has been
fine, the other has just started panic's soon after boot, normally
associated with file system access - i've seen it when trying to mount a
file sys, when trying to run a peice of software amd when doing 'viunm
start':

here is one of the panics: (cut and pasted, the machine has a serial
console).
------------------------------------------------------------------------
# mount /usr/local                                                                                                        
                                                                                                                          
                                                                                                                          
Fatal trap 12: page fault while in kernel mode                                                                            
fault virtual address   = 0x56204e49                                                                                      
fault code              = supervisor read, page not present                                                               
instruction pointer     = 0x8:0xc014b2b8                                                                                  
stack pointer           = 0x10:0xd1bfecb4                                                                                 
frame pointer           = 0x10:0xd1bfecd8                                                                                 
code segment            = base 0x0, limit 0xfffff, type 0x1b                                                              
                        = DPL 0, pres 1, def32 1, gran 1                                                                  
processor eflags        = interrupt enabled, resume, IOPL = 0                                                             
current process         = 57 (mount)                                                                                      
interrupt mask          = net tty bio cam                                                                                 
trap number             = 12                                                                                              
panic: page fault                                                                                                         
                                                                                                                          
syncing disks... done                                                                                                     
Automatic reboot in 15 seconds - press a key on the console to abort                                                      
Rebooting...                                                                                                              
------------------------------------------------------------------------

All the panics have been at instruction pointer 0x8:0xc014b2b8, running nm
on the kernel gives:

# nm /kernel | sort | grep c014b                                                                                          
c014b038 T malloc                                                                                                         
c014b038 t gcc2_compiled.                                                                                                 
c014b348 T free                                                                                                           
c014b488 t kmeminit                                                                                                       

Which dosn't make much sense to me...


questions:

these machines have ecc ram - if the ram fails the ecc checks what does
freebsd do? whould it panic as above?

Is this some irq masking/spl thing caused by using a serial console? (i.e.
is our use of a serial console sufficienty unusual that we are walking
into a less tested area?)

dmesg attached.

-- 
Internet Vision          Internet Consultancy           Tel: 0171 589 4500
60 Albert Court            & Web development            Fax: 0171 589 4522
Prince Consort Road                                   vision@ivision.co.uk
London SW7 2BE                                   http://www.ivision.co.uk/



[-- Attachment #2 --]
Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.4-RELEASE #0: Fri Mar 10 03:08:56 GMT 2000
    root@:/usr/src/sys/compile/UMS-WEB
Timecounter "i8254"  frequency 1193182 Hz
Timecounter "TSC"  frequency 551252595 Hz
CPU: Pentium III (551.25-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x673  Stepping = 3
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,<b25>>
real memory  = 1073741824 (1048576K bytes)
avail memory = 1042788352 (1018348K bytes)
vinum: loaded
Pentium Pro MTRR support enabled
Probing for devices on PCI bus 0:
chip0: <Intel 82443GX host to PCI bridge> rev 0x00 on pci0.0.0
chip1: <Intel 82443GX host to AGP bridge> rev 0x00 on pci0.1.0
chip2: <Intel 82371AB PCI to ISA bridge> rev 0x02 on pci0.7.0
ide_pci0: <Intel PIIX4 Bus-master IDE controller> rev 0x01 on pci0.7.1
chip3: <Intel 82371AB Power management controller> rev 0x02 on pci0.7.3
ahc0: <Adaptec aic7896/97 Ultra2 SCSI adapter> rev 0x00 int a irq 10 on pci0.11.0
ahc0: aic7896/97 Wide Channel A, SCSI Id=7, 16/255 SCBs
ahc1: <Adaptec aic7896/97 Ultra2 SCSI adapter> rev 0x00 int a irq 10 on pci0.11.1
ahc1: aic7896/97 Wide Channel B, SCSI Id=7, 16/255 SCBs
fxp0: <Intel EtherExpress Pro 10/100B Ethernet> rev 0x08 int a irq 11 on pci0.13.0
fxp0: Ethernet address 00:e0:81:10:89:05
Probing for devices on PCI bus 1:
vga0: <Trident model 9750 VGA-compatible display device> rev 0xf3 int a irq 0 on pci1.0.0
Probing for PnP devices:
Probing for devices on the ISA bus:
vt0 on isa
vt0: unknown trident, 80 col, color, 8 scr, unknown kbd, [R3.20-b24]
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
psm0 not found
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A, console
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 not found at 0x1f0
wdc1 at 0x170-0x177 irq 15 on isa
wdc1: unit 0 (atapi): <TOSHIBA CD-ROM XM-6702B/1005>, removable, accel, dma, iordis
acd0: drive speed 8268KB/sec, 128KB cache
acd0: supported read types: CD-R, CD-RW, CD-DA
acd0: Audio: play, 16 volume levels
acd0: Mechanism: ejectable tray
acd0: Medium: no/blank disc inside, unlocked
ppc0 at 0x378 irq 7 flags 0x40 on isa
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppi0: <generic parallel i/o> on ppbus 0
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
Waiting 5 seconds for SCSI devices to settle
chanda0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE ST39175LW 0001> Fixed Direct Access SCSI-2 device 
da0: 80.000MB/s transfers (40.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da0: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C)
da1 at ahc0 bus 0 target 1 lun 0
da1: <SEAGATE ST39175LW 0001> Fixed Direct Access SCSI-2 device 
da1: 80.000MB/s transfers (40.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da1: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C)
ging root device to da0s1a
WARNING: / was not properly dismounted
vinum: no drives found


Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.GSO.4.21.0003290507040.18891-200000>