From owner-freebsd-hackers@freebsd.org Sun Jan 31 15:27:58 2016 Return-Path: Delivered-To: freebsd-hackers@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 227E5A73F97 for ; Sun, 31 Jan 2016 15:27:58 +0000 (UTC) (envelope-from zhao6014@gmail.com) Received: from mail-yk0-x22c.google.com (mail-yk0-x22c.google.com [IPv6:2607:f8b0:4002:c07::22c]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id D6DF99E4 for ; Sun, 31 Jan 2016 15:27:57 +0000 (UTC) (envelope-from zhao6014@gmail.com) Received: by mail-yk0-x22c.google.com with SMTP id r207so76764270ykd.2 for ; Sun, 31 Jan 2016 07:27:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=aq1K9AQo77b6imKmChXzdQAFcC1UBQGTCX4afusxFy8=; b=sCs+nHFX7I+x7Z6MSyf/85kQDgHVhYKkKujVlhcKvAkopynh7yzz4XCVWiISIlzifS OfSRZNv4QGA3RExnkS4Ag0g2DP2j5jWfDUdGLYZflMUu3XBB6DddPh1Q5aa+87hcD1I1 UyklQZxGPcFywLmQHgqW4Z67Krvn/tvXrKbveDcoi64yAHVdmKMyidEYddsWMW4skae1 fmVy46l6dY4AeptnztGIsose/a1EradFb8N0DxzUo7fpIIBEBQN3dwUBZNecrN3153MG 1g1NJ3OtUxV7rAYJrOP0q+RTOKYqM9Mnsn8NtfMAaoK5s22Wf/rqxEUHPfsY+2HFY+QE d+jg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=aq1K9AQo77b6imKmChXzdQAFcC1UBQGTCX4afusxFy8=; b=l4d9vPNR2zg+eZ3nEw1FG0A8oV9I+H6J1jZvm0wxkBMHTPAXfnJFGtiwwz9r6xH/xC 6aunA5iV1Rb8XlmqUvWwMG6xgl6TAlQDm0MOMP90UCr/6nf+BINXaoN5MDwYSDJM8uyw W5jK6QWLZRcXUzJudP+uKg8eZa4aSfIIZlTsh/QyIWvk0PvZZPToIweDJVx3pqQeq0d9 phzOCNAbxPm6lqHpHHIbW3YcLCdRF1S7LpzalqFhEk5Wux3yGy+9s8rX9ZcAukTeNQhn bH8yOxyPlCP8F2CyOM8k9c8/OQubniuT0bmjidZRdxEJL99z5XbuarQGFn33NCG05XNE kSBQ== X-Gm-Message-State: AG10YORk4K7fudCDefuC0YzN75RfhwUZX+OKWDe8IrNx0IdjRi1M2jdidLlEFWB3rqeu3/AvadbBaZBDZx0lmw== MIME-Version: 1.0 X-Received: by 10.129.72.70 with SMTP id v67mr7672703ywa.156.1454254077053; Sun, 31 Jan 2016 07:27:57 -0800 (PST) Received: by 10.37.79.6 with HTTP; Sun, 31 Jan 2016 07:27:56 -0800 (PST) Received: by 10.37.79.6 with HTTP; Sun, 31 Jan 2016 07:27:56 -0800 (PST) In-Reply-To: <20160130071346.31022.37189@wrigleys.postgresql.org> References: <20160130071346.31022.37189@wrigleys.postgresql.org> Date: Sun, 31 Jan 2016 23:27:56 +0800 Message-ID: Subject: Fwd: [BUGS] BUG #13900: stop standby failed with writer process hang(happen 3 times in 2 days) From: Jov To: freebsd-hackers@freebsd.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.20 X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 31 Jan 2016 15:27:58 -0000 how can I help to debug this problem?the process still there in Ds state. ---------- =E8=BD=AC=E5=8F=91=E7=9A=84=E9=82=AE=E4=BB=B6 ---------- =E5=8F=91=E4=BB=B6=E4=BA=BA=EF=BC=9A =E6=97=A5=E6=9C=9F=EF=BC=9A2016=E5=B9=B41=E6=9C=8830=E6=97=A5 3:14 PM =E4=B8=BB=E9=A2=98=EF=BC=9A[BUGS] BUG #13900: stop standby failed with writ= er process hang(happen 3 times in 2 days) =E6=94=B6=E4=BB=B6=E4=BA=BA=EF=BC=9A =E6=8A=84=E9=80=81=EF=BC=9A The following bug has been logged on the website: Bug reference: 13900 Logged by: Jov Email address: amutu@amutu.com PostgreSQL version: 9.3.7 Operating system: FreeBSD 10.2 amd64 Description: I am updating my 3 database from pg9.3 to pg9.5,but may find a bug for the bgwriter of pg9.3.I can't stop all the stand by process,even for immediate stop mode and kill -9,the writer process still there,with ps state "Ds" (D Marks a process in disk (or other short term, uninterruptible) wait) .googl= e say the only method to clean the "Ds" process is rebooting the system. truss say no info for the process,and procstat say the process is calling the poll system call in the kernel. These is the detail info: pg_ctl -D ./slave stop -m fast waiting for server to shut down............................................................... failed pg_ctl: server does not shut down psql postgres psql: FATAL: the database system is shutting down pg_ctl -D ./slave stop -m immediate waiting for server to shut down.... done server stopped ps auxwww | grep postgres jovz 976 0.0 0.3 28840 5232 - Is 17 116 0:00.04 postgres: logger process (postgres) jovz 979 0.0 0.7 196940 13552 - Ds 17 116 0:06.03 postgres: writer process (postgres) log: 2016-01-30 14:23:22.350 CST,,,947,,569b1bc2.3b3,3,,2016-01-17 12:42:42 CST,,0,LOG,00000,"received fast shutdown request",,,,,,,,,"" 2016-01-30 14:23:22.350 CST,,,947,,569b1bc2.3b3,4,,2016-01-17 12:42:42 CST,,0,LOG,00000,"aborting any active transactions",,,,,,,,,"" 2016-01-30 14:25:35.271 CST,,,64815,"",56ac575f.fd2f,1,"",2016-01-30 14:25:35 CST,,0,LOG,00000,"connection received: host=3D[local]",,,,,,,,,"" 2016-01-30 14:25:35.274 CST,"jovz","f",64815,"[local]",56ac575f.fd2f,2,"",2016-01-30 14:25:35 CST,,0,FATAL,57P03,"the database system is shutting down",,,,,,,,,"" 2016-01-30 14:25:38.324 CST,,,64817,"",56ac5762.fd31,1,"",2016-01-30 14:25:38 CST,,0,LOG,00000,"connection received: host=3D[local]",,,,,,,,,"" 2016-01-30 14:25:38.324 CST,"jovz","f",64817,"[local]",56ac5762.fd31,2,"",2016-01-30 14:25:38 CST,,0,FATAL,57P03,"the database system is shutting down",,,,,,,,,"" 2016-01-30 14:47:36.727 CST,,,65457,"",56ac5c88.ffb1,1,"",2016-01-30 14:47:36 CST,,0,LOG,00000,"connection received: host=3D[local]",,,,,,,,,"" 2016-01-30 14:47:36.727 CST,"jovz","postgres",65457,"[local]",56ac5c88.ffb1,2,"",2016-01-30 14:47:3= 6 CST,,0,FATAL,57P03,"the database system is shutting down",,,,,,,,,"" 2016-01-30 14:50:04.564 CST,,,947,,569b1bc2.3b3,5,,2016-01-17 12:42:42 CST,,0,LOG,00000,"received immediate shutdown request",,,,,,,,,"" truss -p 979 ^Ctruss: Unexpect stop in waitpid: Interrupted system call root@fblax:~ # procstat -kk 979 PID TID COMM TDNAME KSTACK 979 100688 postgres - mi_switch+0xe1 sleepq_timedwait_sig+0x8b _cv_timedwait_sig_sbt+0x18b seltdwait+0xa4 kern_poll+0x464 sys_poll+0x61 amd64_syscall+0x357 Xfast_syscall+0xfb root@fb:~ # kill -9 979 root@fb:~ # procstat -kk 979 PID TID COMM TDNAME KSTACK 979 100688 postgres - mi_switch+0xe1 sleepq_timedwait_sig+0x8b _cv_timedwait_sig_sbt+0x18b seltdwait+0xa4 kern_poll+0x464 sys_poll+0x61 amd64_syscall+0x357 Xfast_syscall+0xfb -- Sent via pgsql-bugs mailing list (pgsql-bugs@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-bugs