Article ID: 118311, created on Oct 30, 2013, last review on Jun 17, 2016

  • Applies to:
  • Virtuozzo 6.0
  • Virtuozzo containers for Linux 4.7
  • Virtuozzo hypervisor

Symptoms

The hardware server with Parallels Cloud Server can crash while resuming a container. In this case, the kernel buffer extracted from the crash dump file or the output of the serial console is as follows:

<1>[ 3814.733281] BUG: unable to handle kernel NULL pointer dereference at 000000000000005d
<1>[ 3814.733508] IP: [<ffffffff814a3232>] tcp_enter_frto+0x102/0x290
<4>[ 3814.733629] PGD 1035747067 PUD 10374f0067 PMD 0
<4>[ 3814.733756] Oops: 0000 [#1] SMP
<4>[ 3814.733877] last sysfs file: /sys/devices/system/node/node1/meminfo
<4>[ 3814.733999] CPU 20
<4>[ 3814.734006] Modules linked in: ...
<4>[ 3814.736070]
<4>[ 3814.736173] Pid: 0, comm: swapper veid: 0 Tainted: P        W  ---------------    2.6.32-042stab081.5 #1 042stab081_5 Dell Inc. PowerEdge R620/0PXXHP
<4>[ 3814.736413] RIP: 0010:[<ffffffff814a3232>]  [<ffffffff814a3232>] tcp_enter_frto+0x102/0x290
<4>[ 3814.736628] RSP: 0018:ffff880069a83d90  EFLAGS: 00010246
<4>[ 3814.736740] RAX: 0000000000000000 RBX: ffff880faad1a2c0 RCX: 0000000000000000
<4>[ 3814.736857] RDX: 0000000000000000 RSI: 0000000000000038 RDI: ffff880faad1a2c0
<4>[ 3814.736975] RBP: ffff880069a83da0 R08: 0000000000000000 R09: 0000000000000000
<4>[ 3814.737093] R10: 0000000000000005 R11: 0000000000000096 R12: ffff881ff5f62000
<4>[ 3814.737207] R13: ffff880faad1a388 R14: 000000000000000f R15: ffff882039420d70
<4>[ 3814.737321] FS:  0000000000000000(0000) GS:ffff880069a80000(0000) knlGS:0000000000000000
<4>[ 3814.737528] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
<4>[ 3814.737742] CR2: 000000000000005d CR3: 000000102d444000 CR4: 00000000000407e0
<4>[ 3814.737867] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
<4>[ 3814.737981] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
<4>[ 3814.738095] Process swapper (pid: 0, veid: 0, threadinfo ffff8810396ea000, task ffff882039420d70)
<4>[ 3814.738302] Stack:
<4>[ 3814.738402]  ffff880faad1a2c0 ffff881ff5f62000 ffff880069a83dd0 ffffffff814ae668
<4>[ 3814.738528] <d> ffff880faad1a2c0 ffff880faad1a308 ffffffff81aa8b40 ffff882039420d70
<4>[ 3814.738757] <d> ffff880069a83e10 ffffffff814aec68 0000000000000086 ffff8810396c4000
<4>[ 3814.739090] Call Trace:
<4>[ 3814.739200]  <IRQ>
<4>[ 3814.739312]  [<ffffffff814ae668>] tcp_retransmit_timer+0x2d8/0x6e0
<4>[ 3814.739429]  [<ffffffff814aec68>] tcp_write_timer+0x1f8/0x270
<4>[ 3814.739545]  [<ffffffff8108533c>] run_timer_softirq+0x1bc/0x380
<4>[ 3814.739661]  [<ffffffff8105bda5>] ? scheduler_tick+0x125/0x270
<4>[ 3814.739777]  [<ffffffff814aea70>] ? tcp_write_timer+0x0/0x270
<4>[ 3814.739894]  [<ffffffff81012ca9>] ? read_tsc+0x9/0x20
<4>[ 3814.740007]  [<ffffffff8107ac7d>] __do_softirq+0x10d/0x250
<4>[ 3814.740123]  [<ffffffff810ae365>] ? tick_dev_program_event+0x65/0xc0
<4>[ 3814.740239]  [<ffffffff810a1ecb>] ? hrtimer_interrupt+0x14b/0x260
<4>[ 3814.740355]  [<ffffffff8100c2ac>] call_softirq+0x1c/0x30
<4>[ 3814.740469]  [<ffffffff8100df05>] do_softirq+0x65/0xa0
<4>[ 3814.740582]  [<ffffffff8107aa9d>] irq_exit+0xcd/0xd0
<4>[ 3814.740696]  [<ffffffff81520460>] smp_apic_timer_interrupt+0x70/0x9b
<4>[ 3814.740812]  [<ffffffff8100bc53>] apic_timer_interrupt+0x13/0x20
<4>[ 3814.740926]  <EOI>
<4>[ 3814.741038]  [<ffffffff812e00ee>] ? intel_idle+0xde/0x170
<4>[ 3814.741159]  [<ffffffff812e00d1>] ? intel_idle+0xc1/0x170
<4>[ 3814.741275]  [<ffffffff81422cc7>] cpuidle_idle_call+0xa7/0x140
<4>[ 3814.741392]  [<ffffffff8100a026>] cpu_idle+0xb6/0x110
<4>[ 3814.741506]  [<ffffffff8150fa0d>] start_secondary+0x2ac/0x2ef
<4>[ 3814.741619] Code: 06 00 00 48 8b 83 c8 00 00 00 48 8d 93 c8 00 00 00 c7 83 44 06 00 00 00 00 00 00 48 39 c2 ba 00 00 00 00 48 0f 44 c2 48 8d 70 38 <0f> b6 56 25 0f b6 ca f6 c1 82 74 11 c7 83 40 06 00 00 00 00 00
<1>[ 3814.742253] RIP  [<ffffffff814a3232>] tcp_enter_frto+0x102/0x290
<4>[ 3814.742372]  RSP <ffff880069a83d90>
<4>[ 3814.742479] CR2: 000000000000005d

Cause

The current investigation in the assigned request PSBM-23227 suggests that there was incomplete information put about a TCP socket in the memory dump file on the stage of suspending a container which lead to the kernel panic shortly after resuming such container.

Resolution

The fix is expected to be available in the future kernel updates, 2.6.38-042stab084.x or later.

See also

Search Words

tcp_enter_frto+0x102/0x290

BUG: unable to handle kernel NULL pointer dereference at 000000000000005d

PSBM-23227

c62e8726973f80975db0531f1ed5c6a2 2897d76d56d2010f4e3a28f864d69223 0dd5b9380c7d4884d77587f3eb0fa8ef a26b38f94253cdfbf1028d72cf3a498b e8e50b42231236b82df27684e7ec0beb d02f9caf3e11b191a38179103495106f 0c05f0c76fec3dd785e9feafce1099a9

Email subscription for changes to this article
Save as PDF