climateprediction.net home page
Task 15769240

Task 15769240

Name hadcm3n_4kzl_1940_40_008306329_1
Workunit 8457464
Created 9 May 2013, 16:57:32 UTC
Sent 9 May 2013, 16:57:34 UTC
Report deadline 9 Aug 2013, 0:24:45 UTC
Received 4 Sep 2013, 14:32:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1291528
Run time 3 days 23 hours 56 min 57 sec
CPU time 3 days 19 hours 11 min 58 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 2.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
00:55:15 (9428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:59:35 (11686): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:03:44 (11954): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:03:45 (11954): No heartbeat from core client for 30 sec - exiting
01:03:46 (11954): No heartbeat from core client for 30 sec - exiting
01:03:47 (11954): No heartbeat from core client for 30 sec - exiting
01:07:31 (12077): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:12:08 (12170): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:16:15 (12276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:20:15 (12368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:26:25 (12469): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:30:10 (12689): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:30:11 (12689): No heartbeat from core client for 30 sec - exiting
03:30:12 (12689): No heartbeat from core client for 30 sec - exiting
03:30:13 (12689): No heartbeat from core client for 30 sec - exiting
03:30:14 (12689): No heartbeat from core client for 30 sec - exiting
03:30:15 (12689): No heartbeat from core client for 30 sec - exiting
03:30:16 (12689): No heartbeat from core client for 30 sec - exiting
03:30:17 (12689): No heartbeat from core client for 30 sec - exiting
03:30:18 (12689): No heartbeat from core client for 30 sec - exiting
03:41:30 (12779): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:44:55 (12906): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:45:36 (12986): No heartbeat from core client for 30 sec - exiting
03:45:38 (12986): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:49:39 (13069): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:53:26 (13153): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:20:45 (13233): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:33 (13364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:38 (13364): No heartbeat from core client for 30 sec - exiting
04:28:35 (13444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:36:44 (13536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:44:29 (13647): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:47:09 (13731): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:12:17 (13880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:55:34 (14818): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:35 (15026): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:36 (15026): No heartbeat from core client for 30 sec - exiting
05:59:37 (15026): No heartbeat from core client for 30 sec - exiting
06:03:30 (15133): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:09:58 (15247): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:13:42 (16032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:17:24 (16112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:21:14 (16193): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:25:12 (16275): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:29:16 (16353): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:32:55 (16435): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:36:34 (16523): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:36:52 (16523): No heartbeat from core client for 30 sec - exiting
18:40:34 (16611): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:19:55 (16685): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:23:21 (16860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:23:24 (16860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:48:57 (3907): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:48:58 (3907): No heartbeat from core client for 30 sec - exiting
15:48:59 (3907): No heartbeat from core client for 30 sec - exiting
15:49:00 (3907): No heartbeat from core client for 30 sec - exiting
15:49:01 (3907): No heartbeat from core client for 30 sec - exiting
15:49:02 (3907): No heartbeat from core client for 30 sec - exiting
15:49:03 (3907): No heartbeat from core client for 30 sec - exiting
15:49:04 (3907): No heartbeat from core client for 30 sec - exiting
15:49:05 (3907): No heartbeat from core client for 30 sec - exiting
15:49:06 (3907): No heartbeat from core client for 30 sec - exiting
15:49:07 (3907): No heartbeat from core client for 30 sec - exiting
15:49:08 (3907): No heartbeat from core client for 30 sec - exiting
15:49:09 (3907): No heartbeat from core client for 30 sec - exiting
15:49:10 (3907): No heartbeat from core client for 30 sec - exiting
15:49:11 (3907): No heartbeat from core client for 30 sec - exiting
16:39:51 (4274): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:42:09 (4911): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:50:59 (5017): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:51:00 (5017): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
17:00:45 (5236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:00:46 (5236): No heartbeat from core client for 30 sec - exiting
17:49:36 (5607): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:54:20 (35863): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:18:22 (3494): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:29:55 (3744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:29:56 (3744): No heartbeat from core client for 30 sec - exiting
13:29:57 (3744): No heartbeat from core client for 30 sec - exiting
13:40:00 (4475): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
14:28:24 (2928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:28:25 (2928): No heartbeat from core client for 30 sec - exiting
14:28:26 (2928): No heartbeat from core client for 30 sec - exiting
14:28:27 (2928): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7fdb400]
[0xf7fdb430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf7df81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7dfb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf7de34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 May 2013 20:11:56 1282911 15769240 hadcm3n_4kzl_1940_40_008306329_1 129,600 296,439 2.2873
12 May 2013 17:31:26 1281428 15769240 hadcm3n_4kzl_1940_40_008306329_1 103,680 237,770 2.2933
11 May 2013 23:12:31 1281428 15769240 hadcm3n_4kzl_1940_40_008306329_1 77,760 176,746 2.2730
11 May 2013 04:05:57 1281428 15769240 hadcm3n_4kzl_1940_40_008306329_1 51,840 115,165 2.2215
10 May 2013 08:51:15 1281428 15769240 hadcm3n_4kzl_1940_40_008306329_1 25,920 54,706 2.1106


©2024 cpdn.org