Task 15284468

Name hadcm3n_zlu8_1880_40_008199818_3
Workunit 8354942
Created 14 Sep 2012, 10:28:35 UTC
Sent 14 Sep 2012, 10:43:32 UTC
Report deadline 14 Dec 2012, 18:10:43 UTC
Received 13 Oct 2012, 16:05:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1212677
Run time 28 days 23 hours 48 min 43 sec
CPU time 25 days 21 hours 58 min 16 sec
Validate state Invalid
Credit 11,508.48
Device peak FLOPS 3.21 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
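
The exit status above and the message in the stderr block report the same code three ways: 22, 0x16 and -234. The following is a minimal sketch of arithmetic that reproduces all three values. Reading -234 as the one-byte code with every higher bit set is an assumption that happens to fit the numbers; the page itself does not say how the value is derived, and `describe_exit_code` is a hypothetical helper name.

```python
def describe_exit_code(code):
    """Return the decimal, hex and sign-extended forms of a one-byte exit code."""
    # Assumption: the negative form is the code OR-ed into a value whose
    # bits above the low byte are all set, i.e. (-1 << 8) | code.
    sign_extended = (-1 << 8) | code
    return code, hex(code), sign_extended


print(describe_exit_code(22))  # (22, '0x16', -234)
```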
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
21:10:46 (14789): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:10 (31069): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:19:51 (31147): No heartbeat from core client for 30 sec - exiting
21:20:56 (31147): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:26:30 (31267): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:31:23 (31361): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:33:18 (710): No heartbeat from core client for 30 sec - exiting
21:34:19 (710): No heartbeat from core client for 30 sec - exiting
21:34:20 (710): No heartbeat from core client for 30 sec - exiting
21:34:21 (710): No heartbeat from core client for 30 sec - exiting
21:34:22 (710): No heartbeat from core client for 30 sec - exiting
21:34:23 (710): No heartbeat from core client for 30 sec - exiting
21:34:24 (710): No heartbeat from core client for 30 sec - exiting
21:34:25 (710): No heartbeat from core client for 30 sec - exiting
21:34:26 (710): No heartbeat from core client for 30 sec - exiting
21:34:27 (710): No heartbeat from core client for 30 sec - exiting
21:34:28 (710): No heartbeat from core client for 30 sec - exiting
21:34:29 (710): No heartbeat from core client for 30 sec - exiting
21:34:30 (710): No heartbeat from core client for 30 sec - exiting
21:34:31 (710): No heartbeat from core client for 30 sec - exiting
21:34:32 (710): No heartbeat from core client for 30 sec - exiting
21:35:02 (710): No heartbeat from core client for 30 sec - exiting
21:35:03 (710): No heartbeat from core client for 30 sec - exiting
21:35:04 (710): No heartbeat from core client for 30 sec - exiting
21:35:05 (710): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:38:07 (782): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:38:08 (782): No heartbeat from core client for 30 sec - exiting
21:38:09 (782): No heartbeat from core client for 30 sec - exiting
21:42:19 (928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:44:30 (1085): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:02:06 (1148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:06:32 (25890): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:20:03 (26054): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:29:26 (26170): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:34:35 (26330): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:34:36 (26330): No heartbeat from core client for 30 sec - exiting
12:34:37 (26330): No heartbeat from core client for 30 sec - exiting
12:34:38 (26330): No heartbeat from core client for 30 sec - exiting
12:34:39 (26330): No heartbeat from core client for 30 sec - exiting
12:34:41 (26330): No heartbeat from core client for 30 sec - exiting
12:34:42 (26330): No heartbeat from core client for 30 sec - exiting
12:34:43 (26330): No heartbeat from core client for 30 sec - exiting
12:38:04 (26360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 1 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:44:31 (7515): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:44:32 (7515): No heartbeat from core client for 30 sec - exiting
14:44:33 (7515): No heartbeat from core client for 30 sec - exiting
14:44:34 (7515): No heartbeat from core client for 30 sec - exiting
14:44:35 (7515): No heartbeat from core client for 30 sec - exiting
14:44:36 (7515): No heartbeat from core client for 30 sec - exiting
14:44:37 (7515): No heartbeat from core client for 30 sec - exiting
14:54:20 (3698): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:13:58 (3872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:36:49 (4045): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf755e8c5]
/lib/libc.so.6(abort+0x175)[0xf75601d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf754a003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf76128c5]
/lib/libc.so.6(abort+0x175)[0xf76141d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf75fe003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf761a8c5]
/lib/libc.so.6(abort+0x175)[0xf761c1d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf7606003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf75988c5]
/lib/libc.so.6(abort+0x175)[0xf759a1d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf7584003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf75f08c5]
/lib/libc.so.6(abort+0x175)[0xf75f21d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf75dc003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xffffe400]
/lib/libc.so.6(gsignal+0x45)[0xf76308c5]
/lib/libc.so.6(abort+0x175)[0xf76321d5]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/andy/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf761c003]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5434, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
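
The stderr output above is long and repetitive, so a tally per event type can be easier to read than the raw lines. The sketch below counts lost-heartbeat messages, SIGABRT reports and restart attempts in a block of text pasted from such a log. The function name `summarize` and the short `SAMPLE` excerpt are illustrative only, and treating the number in parentheses as a process ID is an assumption.

```python
import re
from collections import Counter

# Matches lines such as:
# 21:10:46 (14789): No heartbeat from core client for 30 sec - exiting
HEARTBEAT = re.compile(r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client")


def summarize(stderr_text):
    """Count event types and collect the parenthesised IDs (assumed to be PIDs)."""
    counts = Counter()
    pids = set()
    for raw in stderr_text.splitlines():
        line = raw.strip()
        match = HEARTBEAT.match(line)
        if match:
            counts["heartbeat_lost"] += 1
            pids.add(int(match.group(2)))
        elif line.startswith("SIGABRT"):
            counts["sigabrt"] += 1
        elif "Model crash detected" in line:
            counts["restart_attempts"] += 1
    return counts, pids


# A few lines copied from the log above, used only as a small demonstration.
SAMPLE = """\
21:10:46 (14789): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:10 (31069): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

counts, pids = summarize(SAMPLE)
print(dict(counts), sorted(pids))
```

Run against the full stderr text, the same function would show how many distinct process IDs logged a lost heartbeat and how many restarts were attempted before the task gave up.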
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
13 Oct 2012 11:12:45 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 959,040 | 2,225,120 | 2.3202
12 Oct 2012 14:41:18 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 933,120 | 2,161,251 | 2.3162
11 Oct 2012 18:39:42 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 907,200 | 2,098,608 | 2.3133
10 Oct 2012 22:48:42 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 881,280 | 2,035,967 | 2.3102
10 Oct 2012 04:25:24 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 855,360 | 1,975,178 | 2.3092
09 Oct 2012 10:10:48 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 829,440 | 1,918,369 | 2.3128
08 Oct 2012 15:52:30 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 803,520 | 1,859,683 | 2.3144
07 Oct 2012 22:13:19 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 777,600 | 1,802,470 | 2.3180
07 Oct 2012 04:08:36 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 751,680 | 1,744,833 | 2.3212
06 Oct 2012 10:07:57 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 725,760 | 1,686,419 | 2.3237
05 Oct 2012 15:28:52 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 699,840 | 1,626,026 | 2.3234
04 Oct 2012 19:21:50 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 673,920 | 1,565,036 | 2.3223
04 Oct 2012 01:06:25 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 648,000 | 1,506,534 | 2.3249
03 Oct 2012 06:16:00 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 622,080 | 1,446,618 | 2.3255
02 Oct 2012 11:17:31 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 596,160 | 1,386,357 | 2.3255
01 Oct 2012 16:22:10 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 570,240 | 1,324,942 | 2.3235
30 Sep 2012 22:23:08 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 544,320 | 1,264,668 | 2.3234
30 Sep 2012 03:40:37 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 518,400 | 1,203,708 | 2.3220
29 Sep 2012 09:20:59 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 492,480 | 1,142,620 | 2.3201
28 Sep 2012 15:42:44 | 1212677 | 15284468 | hadcm3n_zlu8_1880_40_008199818_3 | 466,560 | 1,081,125 | 2.3172
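
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That reading is inferred from the figures, not stated on the page. The sketch below checks it against three rows copied from the table; `seconds_per_timestep` is a hypothetical helper name.

```python
def seconds_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_seconds / timestep


# (timestep, CPU time in seconds, reported average) taken from the table above.
rows = [
    (959_040, 2_225_120, 2.3202),
    (933_120, 2_161_251, 2.3162),
    (466_560, 1_081_125, 2.3172),
]

for timestep, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Reported values are rounded to four decimals, so allow a small tolerance.
    assert abs(computed - reported) < 1e-4
    print(f"{timestep:>9,} timesteps: {computed:.4f} sec/TS (reported {reported:.4f})")
```

The same rows also show a steady interval between trickles: consecutive timestep values differ by 25,920 (for example 959,040 minus 933,120).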


©2024 climateprediction.net