[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Kernel Oops during/after rsync



Hello,

recently, I _sometimes_ encounter a kernel oops during a backup run.
This did not occur until a few days ago, furthermore, all the crashes
so far (a backup takes place every half hour and I’ve only seen three
during the last five days) occurred while I was playing Patrician III
with Wine. I don’t know whether Wine/Patrician III has anything to do
with that; if anything, I suspect temperature issues, but the same
machine survives five hours of video encoding just fine (though the
CPU frequency is sometimes automatically lowered due to high
temperature).

The backup script, attached below, consists of the following stages:

a) log that the backup started
b) rsync some data
c) log that the backup finished
d) call sync
e) log that the sync finished

a) and c) are logged:

Aug  7 23:30:01 ares Backup_Alix[638]: Backup started.
Aug  7 23:30:05 ares Backup_Alix[648]: Backup finished.

but not e). At the same time, the following is
written to kern.log, the screen shows part of that message and the
computer powers off after a while.

Aug  7 23:30:04 ares kernel: [75763.117797] BUG: unable to handle kernel paging request at ffff8812e5463620
Aug  7 23:30:04 ares kernel: [75763.117844] IP: [<ffffffff810d6985>] __d_lookup_rcu+0x136/0x156
Aug  7 23:30:04 ares kernel: [75763.117880] PGD 1a0c063 PUD 0 
Aug  7 23:30:04 ares kernel: [75763.117901] Oops: 0000 [#1] SMP 
Aug  7 23:30:04 ares kernel: [75763.117924] CPU 3 
Aug  7 23:30:04 ares kernel: [75763.117934] Modules linked in: e1000e tp_smapi(O) thinkpad_ec(O) iwlwifi [last unloaded: e1000e]
Aug  7 23:30:04 ares kernel: [75763.117987] 
Aug  7 23:30:04 ares kernel: [75763.117996] Pid: 641, comm: rsync Tainted: G           O 3.4.1.a2017.1 #1 LENOVO 291239G/291239G
Aug  7 23:30:04 ares kernel: [75763.118042] RIP: 0010:[<ffffffff810d6985>]  [<ffffffff810d6985>] __d_lookup_rcu+0x136/0x156
Aug  7 23:30:04 ares kernel: [75763.118082] RSP: 0018:ffff880143349c48  EFLAGS: 00010282
Aug  7 23:30:04 ares kernel: [75763.118107] RAX: 0000000000000008 RBX: ffff8812e5463608 RCX: 0000000000000014
Aug  7 23:30:04 ares kernel: [75763.118139] RDX: 0000000000000f40 RSI: ffffc90000002000 RDI: ffff88011c927a40
Aug  7 23:30:04 ares kernel: [75763.118171] RBP: ffff88011c927a40 R08: 0000000000000008 R09: 2f2f2f2f2f2f2f2f
Aug  7 23:30:04 ares kernel: [75763.118203] R10: 00000000ef94b111 R11: ffffffff8d9093ff R12: ffff880143349e18
Aug  7 23:30:04 ares kernel: [75763.118235] R13: ffff880143349d70 R14: ffff880143349d70 R15: ffff880143349e18
Aug  7 23:30:04 ares kernel: [75763.118268] FS:  00007fa33fd9f700(0000) GS:ffff88023bd80000(0000) knlGS:0000000000000000
Aug  7 23:30:04 ares kernel: [75763.118303] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug  7 23:30:04 ares kernel: [75763.118330] CR2: ffff8812e5463620 CR3: 00000001427fd000 CR4: 00000000000007e0
Aug  7 23:30:04 ares kernel: [75763.118362] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug  7 23:30:04 ares kernel: [75763.118393] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug  7 23:30:04 ares kernel: [75763.118425] Process rsync (pid: 641, threadinfo ffff880143348000, task ffff88023073d980)
Aug  7 23:30:04 ares kernel: [75763.118460] Stack:
Aug  7 23:30:04 ares kernel: [75763.118471]  ffff880115b097d0 0000000000000081 ffff8801e8ce6015 ffff88023073d980
Aug  7 23:30:04 ares kernel: [75763.118511]  ffff88023073d980 ffff8801e8ce6015 ffff880143349cf4 0000000000000008
Aug  7 23:30:04 ares kernel: [75763.118551]  ffff880115b097d0 ffff880143349e08 ffff880143349d48 ffff88011c927a40
Aug  7 23:30:04 ares kernel: [75763.118590] Call Trace:
Aug  7 23:30:04 ares kernel: [75763.118606]  [<ffffffff810cdac8>] ? do_lookup+0x41/0x291
Aug  7 23:30:04 ares kernel: [75763.118632]  [<ffffffff810cf160>] ? path_lookupat+0xe0/0x670
Aug  7 23:30:04 ares kernel: [75763.118660]  [<ffffffff810e709c>] ? __getblk+0x24/0x245
Aug  7 23:30:04 ares kernel: [75763.118684]  [<ffffffff810cf70e>] ? do_path_lookup+0x1e/0x9a
Aug  7 23:30:04 ares kernel: [75763.118711]  [<ffffffff810cd876>] ? getname_flags+0x148/0x1e2
Aug  7 23:30:04 ares kernel: [75763.118738]  [<ffffffff810d0fcb>] ? user_path_at_empty+0x47/0x7b
Aug  7 23:30:04 ares kernel: [75763.118768]  [<ffffffff81158eb2>] ? jbd2_journal_stop+0x1f0/0x202
Aug  7 23:30:04 ares kernel: [75763.118797]  [<ffffffff8113a266>] ? __ext4_journal_stop+0x6e/0x74
Aug  7 23:30:04 ares kernel: [75763.118825]  [<ffffffff81136aa0>] ? ext4_link+0x121/0x156
Aug  7 23:30:04 ares kernel: [75763.118851]  [<ffffffff810c86e7>] ? vfs_fstatat+0x32/0x60
Aug  7 23:30:04 ares kernel: [75763.118876]  [<ffffffff810ce9e8>] ? vfs_link+0x161/0x17f
Aug  7 23:30:04 ares kernel: [75763.118901]  [<ffffffff810c883c>] ? sys_newlstat+0x12/0x2b
Aug  7 23:30:04 ares kernel: [75763.118928]  [<ffffffff810db57d>] ? mntput_no_expire+0x10/0xf4
Aug  7 23:30:04 ares kernel: [75763.118955]  [<ffffffff810d1672>] ? sys_linkat+0x1b8/0x1c7
Aug  7 23:30:04 ares kernel: [75763.118983]  [<ffffffff8159cca2>] ? system_call_fastpath+0x16/0x1b
Aug  7 23:30:04 ares kernel: [75763.119012] Code: 00 00 00 00 49 83 c8 ff 48 31 f2 49 d3 e0 49 f7 d0 49 85 d0 75 0d 48 8b 44 24 30 44 89 28 4d 89 1e eb 16 48 8b 1b 48 85 db 74 0b <44> 39 53 18 75 f2 e9 24 ff ff ff 45 31 ff 48 83 c4 48 4c 89 f8 
Aug  7 23:30:04 ares kernel: [75763.119236] RIP  [<ffffffff810d6985>] __d_lookup_rcu+0x136/0x156
Aug  7 23:30:04 ares kernel: [75763.119266]  RSP <ffff880143349c48>
Aug  7 23:30:04 ares kernel: [75763.119282] CR2: ffff8812e5463620
Aug  7 23:30:05 ares kernel: [75763.550822] ---[ end trace f053761f106b0def ]---

The most curious thing here is that apparently, the kernel oops
occurred at 23:30:04, while the script was able to continue until
23:30:05 (when it logged that the backup finished). However, the
backup did not make it to the disk (i. e. neither of the directories
inc_alix-2012-08-07T23_30 (temporary) nor alix-2012-08-07T23_30
(final) exists).

Some data about the computer: 8 GB of RAM, Intel Core i5 520M;
self-compiled kernel 3.4.1, Debian Testing/Sid, rsync at version
3.0.9-3. Wifi, Bluetooth, UMTS or GPS have not been used during the
crashes, but the directory whereto rsync was supposed to store the
backup was located on an external hard drive with LUKS encryption.

Unfortunately, googling for anything from the above trace or just
kernel oopses in relation to rsync turned out unsuccessful, I would
hence be very thankful if someone could provide me with additional
pointers :)

Best regards,

Claudius

---- backup script: ----
#!/bin/bash

if [ -e /var/run/backup_alix ]; then
    exit 1;
fi

if [ -e /tmp/disbackup ]; then
    exit 2;
fi

# ok, this is certainly rather ugly...
if [ ! $(mount | grep -c "datapriv") -eq 1 ]; then
    exit 3;
fi

logger -t Backup_Alix -i -p cron.info Backup started.

date=`date "+%Y-%m-%dT%H_%M"`

touch /var/run/backup

rsync -e "ssh -o VisualHostKey=no" --stats -aq --link-dest=/datapriv/backup/alix/current --delete --delete-excluded --exclude=/sys/* --exclude=/proc/* --exclude=/dev/* --exclude=/tmp/* alix.home.chubig.net:/ /datapriv/backup/alix/inc_alix-$date

mv /datapriv/backup/alix/inc_alix-$date /datapriv/backup/alix/alix-$date
rm -f /datapriv/backup/alix/current
ln -s /datapriv/backup/alix/alix-$date /datapriv/backup/alix/current

rm /var/run/backup
logger -t Backup_Alix -i -p cron.info Backup finished.
sync
logger -t Backup_Alix -i -p cron.info Sync finished.
---- :backup script ----
-- 
              A board is the planck unit of boredom.                
http://chubig.net                          telnet nightfall.org 4242

Attachment: signature.asc
Description: PGP signature


Reply to: