climateprediction.net home page
Task 15784702

Task 15784702

Name hadcm3n_zkg6_1920_40_008361063_2
Workunit 8511922
Created 15 May 2013, 1:58:51 UTC
Sent 15 May 2013, 1:59:00 UTC
Report deadline 14 Aug 2013, 9:26:11 UTC
Received 30 May 2013, 16:49:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1282401
Run time 15 days 8 hours 2 min 54 sec
CPU time 15 days 0 hours 12 min 35 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 2.01 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
15:42:45 (23013): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
17:25:14 (2334): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:35:20 (2991): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:54:28 (13123): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:40:35 (10374): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:45:35 (38887): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:45:36 (38887): No heartbeat from core client for 30 sec - exiting
06:45:37 (38887): No heartbeat from core client for 30 sec - exiting
06:45:38 (38887): No heartbeat from core client for 30 sec - exiting
06:45:39 (38887): No heartbeat from core client for 30 sec - exiting
06:45:40 (38887): No heartbeat from core client for 30 sec - exiting
06:45:41 (38887): No heartbeat from core client for 30 sec - exiting
06:45:42 (38887): No heartbeat from core client for 30 sec - exiting
06:45:43 (38887): No heartbeat from core client for 30 sec - exiting
06:45:44 (38887): No heartbeat from core client for 30 sec - exiting
06:45:45 (38887): No heartbeat from core client for 30 sec - exiting
06:45:46 (38887): No heartbeat from core client for 30 sec - exiting
06:45:47 (38887): No heartbeat from core client for 30 sec - exiting
06:45:48 (38887): No heartbeat from core client for 30 sec - exiting
06:45:49 (38887): No heartbeat from core client for 30 sec - exiting
06:45:50 (38887): No heartbeat from core client for 30 sec - exiting
10:36:02 (39089): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:40:27 (41722): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:40:28 (41722): No heartbeat from core client for 30 sec - exiting
10:40:29 (41722): No heartbeat from core client for 30 sec - exiting
10:40:30 (41722): No heartbeat from core client for 30 sec - exiting
10:40:31 (41722): No heartbeat from core client for 30 sec - exiting
10:40:32 (41722): No heartbeat from core client for 30 sec - exiting
10:40:33 (41722): No heartbeat from core client for 30 sec - exiting
10:40:34 (41722): No heartbeat from core client for 30 sec - exiting
10:40:35 (41722): No heartbeat from core client for 30 sec - exiting
10:40:36 (41722): No heartbeat from core client for 30 sec - exiting
10:40:37 (41722): No heartbeat from core client for 30 sec - exiting
10:40:38 (41722): No heartbeat from core client for 30 sec - exiting
10:40:39 (41722): No heartbeat from core client for 30 sec - exiting
14:03:47 (41934): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:13:25 (44003): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:13:26 (44003): No heartbeat from core client for 30 sec - exiting
14:13:27 (44003): No heartbeat from core client for 30 sec - exiting
14:13:28 (44003): No heartbeat from core client for 30 sec - exiting
14:13:29 (44003): No heartbeat from core client for 30 sec - exiting
14:51:27 (44376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:51:28 (44376): No heartbeat from core client for 30 sec - exiting
14:51:29 (44376): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
15:05:46 (44887): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:47:25 (45302): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:26:39 (45872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:31:38 (46362): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:31:39 (46362): No heartbeat from core client for 30 sec - exiting
16:31:40 (46362): No heartbeat from core client for 30 sec - exiting
17:29:13 (46568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:29:14 (46568): No heartbeat from core client for 30 sec - exiting
17:29:15 (46568): No heartbeat from core client for 30 sec - exiting
17:29:16 (46568): No heartbeat from core client for 30 sec - exiting
17:29:17 (46568): No heartbeat from core client for 30 sec - exiting
17:29:18 (46568): No heartbeat from core client for 30 sec - exiting
17:29:19 (46568): No heartbeat from core client for 30 sec - exiting
17:29:20 (46568): No heartbeat from core client for 30 sec - exiting
17:29:21 (46568): No heartbeat from core client for 30 sec - exiting
17:29:22 (46568): No heartbeat from core client for 30 sec - exiting
17:29:23 (46568): No heartbeat from core client for 30 sec - exiting
17:29:24 (46568): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7714400]
[0xf7714425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75311df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7534825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751c4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf778c400]
[0xf778c425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a91df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75ac825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75944d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf774a400]
[0xf774a425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75671df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf756a825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75524d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77d9400]
[0xf77d9425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f61df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75f9825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e14d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77bf400]
[0xf77bf425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75dc1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75df825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c74d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7726400]
[0xf7726425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75431df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7546825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf752e4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47436, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 May 2013 12:21:02 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 466,560 1,283,343 2.7506
29 May 2013 14:50:00 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 440,640 1,214,413 2.7560
28 May 2013 20:11:33 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 414,720 1,148,039 2.7682
28 May 2013 00:30:38 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 388,800 1,081,594 2.7819
27 May 2013 05:42:52 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 362,880 1,015,766 2.7992
26 May 2013 10:55:48 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 336,960 950,920 2.8221
25 May 2013 15:44:22 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 311,040 884,296 2.8430
24 May 2013 20:36:55 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 285,120 817,239 2.8663
24 May 2013 00:09:54 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 259,200 749,344 2.8910
23 May 2013 04:36:40 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 233,280 681,741 2.9224
22 May 2013 07:02:39 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 207,360 604,692 2.9161
21 May 2013 06:42:13 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 181,440 519,340 2.8623
20 May 2013 06:55:15 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 155,520 435,560 2.8007
19 May 2013 07:16:41 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 129,600 352,125 2.7170
18 May 2013 08:53:39 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 103,680 273,518 2.6381
17 May 2013 12:16:41 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 77,760 203,127 2.6122
16 May 2013 16:52:02 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 51,840 132,185 2.5499
15 May 2013 21:17:18 1282401 15784702 hadcm3n_zkg6_1920_40_008361063_2 25,920 66,055 2.5484


©2024 climateprediction.net