Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 11 Dec 2008 18:00:56 +0900
From:      Pyun YongHyeon <pyunyh@gmail.com>
To:        Victor Balada Diaz <victor@bsdes.net>
Cc:        freebsd-stable@freebsd.org, freebsd-amd64@freebsd.org
Subject:   Re: [ATA] and re(4) stability issues
Message-ID:  <20081211090056.GH42370@cdnetworks.co.kr>
In-Reply-To: <20081211081045.GJ1320@alf.bsdes.net>
References:  <20081209185236.GA1320@alf.bsdes.net> <20081210061226.GC37837@cdnetworks.co.kr> <20081210085934.GB1320@alf.bsdes.net> <20081210102800.GH37837@cdnetworks.co.kr> <20081210113225.GD1320@alf.bsdes.net> <20081210120719.GK37837@cdnetworks.co.kr> <20081211075707.GH1320@alf.bsdes.net> <20081211081045.GJ1320@alf.bsdes.net>

next in thread | previous in thread | raw e-mail | index | archive | help
On Thu, Dec 11, 2008 at 09:10:45AM +0100, Victor Balada Diaz wrote:
 > On Thu, Dec 11, 2008 at 08:57:07AM +0100, Victor Balada Diaz wrote:
 > > On Wed, Dec 10, 2008 at 09:07:19PM +0900, Pyun YongHyeon wrote:
 > > > On Wed, Dec 10, 2008 at 12:32:25PM +0100, Victor Balada Diaz wrote:
 > > >  > Also i didn't see any problem with interfaces going up and down,
 > > >  > but that usually happen after some hours of uptime, so i'll let
 > > >  > you know if the error happens again.
 > > >  > 
 > > 
 > > After writing to the HD with dd for a few hours and using
 > > stress -i 10 -d 10 the machine lost connectivity. I waited until
 > > today to be sure if the machine hung, paniced or just lost network
 > > connectivity. I don't have local access or serial access, so this
 > > is the only way i could do it. I've seen in the logs during the
 > > night various messages of:
 > > 
 > > 
 > > Dec 10 00:33:49 yac kernel: re0: watchdog timeout
 > > Dec 10 00:33:49 yac kernel: re0: link state changed to DOWN
 > > Dec 10 00:33:52 yac kernel: re0: link state changed to UP
 > > 
 > > The interface never recovered and i wasn't able to ping the machine
 > > until i rebooted. Nagios was checking all the time and no recovery
 > > happened.
 > > 
 > > The netstat -i in daily scripts shows just one Oerrs. I'm used to
 > > have a lot of them, but seems this time the card didn't recover from
 > > the only one. I also want to say that this is not a regression, as
 > > it happened before with 7.1 -BETA 2 code.
 > > 
 > > Is there anything more i can try?
 > 
 > Sorry it's too early in the morning and i thought today was 10
 > instead of 11. I don't even know the day i'm today.
 > 
 > Looking at today's log i see no link state changed messages
 > but i see this other messages that started happening more or
 > less at the same time i lost connectivity to the server:
 > 
 > Dec 10 18:20:32 yac kernel: re0: link state changed to DOWN
 > Dec 10 18:20:32 yac kernel: re0: PHY read failed
 > 

I've reverted r185756 which caused GMII access issues on some
controllers. If you are brave enough to try beta code, you can
get latest re(4) in the following URL. Note, I don't have PCIe
based RealTek controllers so the code was not tested at all.

http://people.freebsd.org/~yongari/re/if_re.c
http://people.freebsd.org/~yongari/re/if_rlreg.h

-- 
Regards,
Pyun YongHyeon



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20081211090056.GH42370>