[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

cron not running jobs after crash (potato)



last friday there was a power outage at one of the sites
i run, it was long enough that the UPSs didn't last and
much of the equipment crashed. it all came back up ok,
but on one system(haven't noticed it on others yet)
crontab does not appear to be running the jobs anymore.

I have a cronjob that runs twice a day, everyday and
emails me output. it has not done this since friday night.
if i run the script manually it works fine, cron is running,
running an strace on it, shows activity (doing a bunch of
stat calls).

the crontab entries have not changed:

0 7,16 * * * /usr/local/bin/sav-update.sh



i had 1>/dev/null 2>/dev/null on the first one but
took them off to try to get it to generate output to
email. it did not.

runinng strace on the cron process i do not see it
even attempting to run the script. i set it to run
16:41, and watched the clock on the server as well
as watched strace and all it did was:

mail:~# tail -f debug.log
--- SIGCHLD (Child exited) ---
wait4(-1, [WIFEXITED(s) && WEXITSTATUS(s) == 0], WNOHANG, NULL) = 16594
wait4(-1, 0xbffff9f8, WNOHANG, NULL)    = -1 ECHILD (No child processes)
sigreturn()                             = ? (mask now [])
time(NULL)                              = 1014770402
time(NULL)                              = 1014770402
rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0
rt_sigaction(SIGCHLD, NULL, {0x8049d98, [], SA_RESTART|0x4000000}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0
nanosleep({59, 0}, {59, 0})             = 0
time(NULL)                              = 1014770461
stat("crontabs", {st_mode=S_IFDIR|0700, st_size=4096, ...}) = 0
stat("/etc/crontab", {st_mode=S_IFREG|0644, st_size=596, ...}) = 0
stat("/etc/cron.d", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
stat("/etc/cron.d/sendmail", {st_mode=S_IFREG|0644, st_size=2324,
...}) = 0stat("/etc/cron.d/mailman", {st_mode=S_IFREG|0644, st_size=260, ...}) = 0
stat("/etc/cron.d/sysstat", {st_mode=S_IFREG|0644, st_size=236, ...}) = 0
time(NULL)                              = 1014770461
rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0
rt_sigaction(SIGCHLD, NULL, {0x8049d98, [], SA_RESTART|0x4000000}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0
nanosleep({60, 0}, {60, 0})             = 0
time(NULL)                              = 1014770521


this is driving me up the wall. i modified the script
to create a file in /root when it runs, just incase
something else in it is broken, but the file is not
created so clearly cron is not working right anymore

im gonna try to reinstall cron, does anyone have any
ideas what could be going wrong ?

the system was up for 331 days before the outage ...

nate






Reply to: