[Eisfair] E1 friert ein wg. Speicherleck

Jürgen Witt j-witt at web.de
Mo Feb 8 19:54:24 CET 2016


Hallo Marcus,

Am 08.02.2016 um 17:17 schrieb Marcus Roeckrath:
> Hallo Jürgen,
> 
> Wie Thomas im anderen Thread schon erläutert hat, braucht
> brute-force-Blocking Speicher und ruft den oom-Killer auf, um seine
> Anforderung durchsetzen zu können.
> 
> Ob der Amok läuft, läßt sich nun daraus nicht sagen.
> 
> Ich würde als ersten Test mal den 2.16.0 booten; dann könnte man sich auch
> mal bei laufendem 2.18.0 mittels top (sooft die > Taste drücken, bis nach
> der %Mem Spalte sortiert wird.
> 
> Da kannst Du dann mal sehen, wer viel Mem (bei mir krallt sich der clamd
> locker 25% davon) verbraucht oder einen steigenden Membedarf hat.

jetzt ist der nächste Server an einem anderen Standort eingefroren. BFB
ist dort nicht aktiviert. Der Server hat auch am Sonntag sein
Kernel-Update bekommen. Lief auf bis dahin unauffällig. Hat kein Java,
dafür aber Fax/Capi.


System:    Host: server Kernel: 3.2.75-eisfair-1-SMP i686 (32 bit)
Console: tty 0 Distro: eisfair-1
Machine:   System: Gigabyte product: N/A
           Mobo: Gigabyte model: Z77M-D3H v: x.x Bios: American
Megatrends v: F5 date: 03/29/2012
CPU:       Dual core Intel Core i3-3225 (-HT-MCP-) cache: 3072 KB
           clock speeds: max: 3329 MHz 1: 3329 MHz 2: 3329 MHz 3: 3329
MHz 4: 3329 MHz
Graphics:  Card: Intel Xeon E3-1200 v2/3rd Gen Core processor Graphics
Controller
           Display Server: N/A driver: N/A tty size: 154x63 Advanced
Data: N/A for root out of X
Network:   Card-1: Intel 82574L Gigabit Network Connection driver: e1000e
           IF: eth0 state: up speed: 1000 Mbps duplex: full mac:
68:05:ca:0d:58:05
           Card-2: AVM Fritz!Card PCI v2.0 ISDN driver: f1pci
           IF: N/A state: N/A speed: N/A duplex: N/A mac: N/A
Drives:    HDD Total Size: 4000.8GB (44.2% used) ID-1: /dev/sda model:
WDC_WD10EFRX size: 1000.2GB
           ID-2: /dev/sdb model: WDC_WD10EFRX size: 1000.2GB ID-3:
/dev/sdc model: WDC_WD20EARX size: 2000.4GB
Partition: ID-1: / size: 9.9G used: 1.3G (14%) fs: ext3 dev: /dev/md3
           ID-2: /boot size: 53M used: 18M (36%) fs: ext3 dev: /dev/md1
           ID-3: swap-1 size: 0.54GB used: 0.00GB (0%) fs: swap dev:
/dev/md2
RAID:      Device-1: /dev/md4 - active raid: 1 components: online: 2/2 -
sda4 sdb4
           Device-2: /dev/md3 - active raid: 1 components: online: 2/2 -
sda3 sdb3
           Device-3: /dev/md2 - active raid: 1 components: online: 2/2 -
sda2 sdb2
           Device-4: /dev/md1 - active raid: 1 components: online: 2/2 -
sdb1 sda1
Sensors:   None detected - is lm-sensors installed and configured?
Info:      Processes: 124 Uptime: 1:19 Memory: 156.0/3487.1MB Init:
SysVinit runlevel: 2
           Client: Shell (hwdiag-list) inxi: 2.2.28

Auszug aus /var/log/messages

Feb  8 16:24:55 server kernel: INFO: rcu_sched detected stall on CPU 3
(t=15001 jiffies)
Feb  8 16:24:56 server kernel: Pid: 2544, comm: smbd Tainted: P
  O 3.2.75-eisfair-1-SMP #1
Feb  8 16:24:56 server kernel: Call Trace:
Feb  8 16:24:56 server kernel:  [__rcu_pending+0x64/0x28f]
__rcu_pending+0x64/0x28f
Feb  8 16:24:56 server kernel:  [rcu_check_callbacks+0x6d/0x98]
rcu_check_callbacks+0x6d/0x98
Feb  8 16:24:56 server kernel:  [update_process_times+0x2d/0x58]
update_process_times+0x2d/0x58
Feb  8 16:24:56 server kernel:  [tick_sched_timer+0x13f/0x166]
tick_sched_timer+0x13f/0x166
Feb  8 16:24:56 server kernel:  [__run_hrtimer.isra.27+0x3d/0x91]
__run_hrtimer.isra.27+0x3d/0x91
Feb  8 16:24:56 server kernel:  [hrtimer_interrupt+0xe2/0x1cb]
hrtimer_interrupt+0xe2/0x1cb
Feb  8 16:24:56 server kernel:  [smp_apic_timer_interrupt+0x67/0x7a]
smp_apic_timer_interrupt+0x67/0x7a
Feb  8 16:24:56 server kernel:  [apic_timer_interrupt+0x2a/0x30]
apic_timer_interrupt+0x2a/0x30
Feb  8 16:24:56 server kernel:  [any_slab_objects+0x15/0x1b] ?
any_slab_objects+0x15/0x1b
Feb  8 16:24:56 server kernel:  [_raw_spin_lock+0x18/0x1c] ?
_raw_spin_lock+0x18/0x1c
Feb  8 16:24:56 server kernel:  [unix_state_double_lock+0x3d/0x41]
unix_state_double_lock+0x3d/0x41
Feb  8 16:24:56 server kernel:  [unix_dgram_connect+0x83/0x153]
unix_dgram_connect+0x83/0x153
Feb  8 16:24:56 server kernel:  [sys_connect+0x63/0x88]
sys_connect+0x63/0x88
Feb  8 16:24:56 server kernel:  [sys_socketcall+0x76/0x192]
sys_socketcall+0x76/0x192
Feb  8 16:24:56 server kernel:  [syscall_after_call+0x0/0x04]
syscall_call+0x7/0x7
Feb  8 16:24:56 server kernel:  [mcheck_cpu_init+0x137/0x2d2] ?
mcheck_cpu_init+0x137/0x2d2
Feb  8 16:25:19 server kernel: capidrv-1: controller dead ??
Feb  8 16:25:19 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:26:19 server kernel: capidrv-1: controller dead ??
Feb  8 16:26:19 server kernel: capidrv-1: listen_change_state state=3
event=1 ????


Feb  8 16:27:19 server kernel: capidrv-1: controller dead ??
Feb  8 16:27:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:27:56 server kernel: INFO: rcu_sched detected stall on CPU 3
(t=60032 jiffies)
Feb  8 16:27:56 server kernel: Pid: 2544, comm: smbd Tainted: P
  O 3.2.75-eisfair-1-SMP #1
Feb  8 16:27:56 server kernel: Call Trace:
Feb  8 16:27:56 server kernel:  [__rcu_pending+0x64/0x28f]
__rcu_pending+0x64/0x28f
Feb  8 16:27:56 server kernel:  [rcu_check_callbacks+0x6d/0x98]
rcu_check_callbacks+0x6d/0x98
Feb  8 16:27:56 server kernel:  [update_process_times+0x2d/0x58]
update_process_times+0x2d/0x58
Feb  8 16:27:56 server kernel:  [tick_sched_timer+0x13f/0x166]
tick_sched_timer+0x13f/0x166
Feb  8 16:27:56 server kernel:  [__run_hrtimer.isra.27+0x3d/0x91]
__run_hrtimer.isra.27+0x3d/0x91
Feb  8 16:27:56 server kernel:  [hrtimer_interrupt+0xe2/0x1cb]
hrtimer_interrupt+0xe2/0x1cb
Feb  8 16:27:56 server kernel:  [smp_apic_timer_interrupt+0x67/0x7a]
smp_apic_timer_interrupt+0x67/0x7a
Feb  8 16:27:56 server kernel:  [apic_timer_interrupt+0x2a/0x30]
apic_timer_interrupt+0x2a/0x30
Feb  8 16:27:56 server kernel:  [any_slab_objects+0x15/0x1b] ?
any_slab_objects+0x15/0x1b
Feb  8 16:27:56 server kernel:  [_raw_spin_lock+0x16/0x1c] ?
_raw_spin_lock+0x16/0x1c
Feb  8 16:27:56 server kernel:  [unix_state_double_lock+0x3d/0x41]
unix_state_double_lock+0x3d/0x41
Feb  8 16:27:56 server kernel:  [unix_dgram_connect+0x83/0x153]
unix_dgram_connect+0x83/0x153
Feb  8 16:27:56 server kernel:  [sys_connect+0x63/0x88]
sys_connect+0x63/0x88
Feb  8 16:27:56 server kernel:  [sys_socketcall+0x76/0x192]
sys_socketcall+0x76/0x192
Feb  8 16:27:56 server kernel:  [syscall_after_call+0x0/0x04]
syscall_call+0x7/0x7
Feb  8 16:27:56 server kernel:  [mcheck_cpu_init+0x137/0x2d2] ?
mcheck_cpu_init+0x137/0x2d2
Feb  8 16:28:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:28:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:29:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:29:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:30:00 server fcron[16586]: Job '/root/prfweg.sh &> /dev/null'
started for user root (pid 16587)
Feb  8 16:30:02 server fcron[16586]: Job '/root/prfweg.sh &> /dev/null'
completed
Feb  8 16:30:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:30:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:30:56 server kernel: INFO: rcu_sched detected stall on CPU 3
(t=105062 jiffies)
Feb  8 16:30:56 server kernel: Pid: 2544, comm: smbd Tainted: P
  O 3.2.75-eisfair-1-SMP #1

Feb  8 16:31:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:31:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:32:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:32:20 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:33:20 server kernel: capidrv-1: controller dead ??
Feb  8 16:33:21 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:33:56 server kernel: INFO: rcu_sched detected stall on CPU 3
(t=150093 jiffies)
Feb  8 16:33:56 server kernel: Pid: 2544, comm: smbd Tainted: P
  O 3.2.75-eisfair-1-SMP #1
Feb  8 16:33:56 server kernel: Call Trace:
Feb  8 16:33:56 server kernel:  [__rcu_pending+0x64/0x28f]
__rcu_pending+0x64/0x28f
Feb  8 16:33:56 server kernel:  [rcu_check_callbacks+0x6d/0x98]
rcu_check_callbacks+0x6d/0x98
Feb  8 16:33:56 server kernel:  [update_process_times+0x2d/0x58]
update_process_times+0x2d/0x58
Feb  8 16:33:56 server kernel:  [tick_sched_timer+0x13f/0x166]
tick_sched_timer+0x13f/0x166
Feb  8 16:33:56 server kernel:  [__run_hrtimer.isra.27+0x3d/0x91]
__run_hrtimer.isra.27+0x3d/0x91
Feb  8 16:33:56 server kernel:  [hrtimer_interrupt+0xe2/0x1cb]
hrtimer_interrupt+0xe2/0x1cb
Feb  8 16:33:56 server kernel:  [smp_apic_timer_interrupt+0x67/0x7a]
smp_apic_timer_interrupt+0x67/0x7a
Feb  8 16:33:56 server kernel:  [apic_timer_interrupt+0x2a/0x30]
apic_timer_interrupt+0x2a/0x30
Feb  8 16:33:56 server kernel:  [any_slab_objects+0x15/0x1b] ?
any_slab_objects+0x15/0x1b
Feb  8 16:33:56 server kernel:  [_raw_spin_lock+0x10/0x1c] ?
_raw_spin_lock+0x10/0x1c
Feb  8 16:33:56 server kernel:  [unix_state_double_lock+0x3d/0x41]
unix_state_double_lock+0x3d/0x41
Feb  8 16:33:56 server kernel:  [unix_dgram_connect+0x83/0x153]
unix_dgram_connect+0x83/0x153
Feb  8 16:33:56 server kernel:  [sys_connect+0x63/0x88]
sys_connect+0x63/0x88
Feb  8 16:33:56 server kernel:  [sys_socketcall+0x76/0x192]
sys_socketcall+0x76/0x192
Feb  8 16:33:57 server kernel:  [syscall_after_call+0x0/0x04]
syscall_call+0x7/0x7
Feb  8 16:33:57 server kernel:  [mcheck_cpu_init+0x137/0x2d2] ?
mcheck_cpu_init+0x137/0x2d2
Feb  8 16:34:21 server kernel: capidrv-1: controller dead ??
Feb  8 16:34:21 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:35:21 server kernel: capidrv-1: controller dead ??
Feb  8 16:35:21 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:36:21 server kernel: capidrv-1: controller dead ??
Feb  8 16:36:21 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 16:36:56 server kernel: INFO: rcu_sched detected stall on CPU 3
(t=195124 jiffies)
Feb  8 16:36:56 server kernel: Pid: 2544, comm: smbd Tainted: P
  O 3.2.75-eisfair-1-SMP #1

Feb  8 18:04:35 server kernel: capidrv-1: controller dead ??
Feb  8 18:04:35 server kernel: capidrv-1: listen_change_state state=3
event=1 ????
Feb  8 18:05:00 server fcron[19701]: Job
'/var/install/bin/smartmon-plot' started for user root (pid 19702)
Feb  8 18:05:04 server kernel: smbd invoked oom-killer: gfp_mask=0xd0,
order=1, oom_adj=0, oom_score_adj=0
Feb  8 18:05:04 server kernel: Pid: 25920, comm: smbd Tainted: P
   O 3.2.75-eisfair-1-SMP #1
Feb  8 18:05:04 server kernel: Call Trace:
Feb  8 18:05:05 server kernel:  [dump_header.isra.8+0x55/0x156]
dump_header.isra.8+0x55/0x156
Feb  8 18:05:06 server kernel:  [___ratelimit+0x93/0xac] ?
___ratelimit+0x93/0xac
Feb  8 18:05:06 server kernel:
[oom_kill_process.constprop.13+0x26/0x1ef]
oom_kill_process.constprop.13+0x26/0x1ef
Feb  8 18:05:06 server kernel:  [has_capability_noaudit+0x19/0x25] ?
has_capability_noaudit+0x19/0x25
Feb  8 18:05:06 server kernel:  [out_of_memory+0x1f1/0x259]
out_of_memory+0x1f1/0x259
Feb  8 18:05:07 server kernel:  [__alloc_pages_nodemask+0x458/0x536]
__alloc_pages_nodemask+0x458/0x536
Feb  8 18:05:07 server kernel:  [copy_process.part.44+0x56/0xd05]
copy_process.part.44+0x56/0xd05
Feb  8 18:05:07 server kernel:  [do_fork+0x10e/0x287] do_fork+0x10e/0x287
Feb  8 18:05:07 server kernel:  [set_current_blocked+0x27/0x38] ?
set_current_blocked+0x27/0x38
Feb  8 18:05:08 server kernel:  [copy_to_user+0x23/0x2c] ?
copy_to_user+0x23/0x2c
Feb  8 18:05:08 server kernel:  [sys_clone+0x1b/0x20] sys_clone+0x1b/0x20
Feb  8 18:05:08 server kernel:  [ptregs_clone+0x15/0x40]
ptregs_clone+0x15/0x40
Feb  8 18:05:08 server kernel:  [syscall_after_call+0x0/0x04] ?
syscall_call+0x7/0x7
Feb  8 18:05:08 server kernel:  [mcheck_cpu_init+0x137/0x2d2] ?
mcheck_cpu_init+0x137/0x2d2

18:05 Uhr ging dann nichts mehr.

Gruß
Jürgen





Mehr Informationen über die Mailingliste Eisfair