Task 15595753

Name hadcm3n_4bjw_1940_40_008308841_0
Workunit 8459976
Created 7 Feb 2013, 18:54:12 UTC
Sent 7 Feb 2013, 19:45:43 UTC
Report deadline 10 May 2013, 3:12:54 UTC
Received 19 Feb 2013, 0:35:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1219011
Run time 5 days 4 hours 12 min 57 sec
CPU time 4 days 22 hours 16 min 58 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 4.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:53:40 (3483): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
00:57:33 (14684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:57:34 (14684): No heartbeat from core client for 30 sec - exiting
00:57:35 (14684): No heartbeat from core client for 30 sec - exiting
00:57:36 (14684): No heartbeat from core client for 30 sec - exiting
00:57:37 (14684): No heartbeat from core client for 30 sec - exiting
00:57:38 (14684): No heartbeat from core client for 30 sec - exiting
00:57:39 (14684): No heartbeat from core client for 30 sec - exiting
00:57:40 (14684): No heartbeat from core client for 30 sec - exiting
00:57:41 (14684): No heartbeat from core client for 30 sec - exiting
00:57:42 (14684): No heartbeat from core client for 30 sec - exiting
00:57:43 (14684): No heartbeat from core client for 30 sec - exiting
00:57:44 (14684): No heartbeat from core client for 30 sec - exiting
00:57:45 (14684): No heartbeat from core client for 30 sec - exiting
00:57:46 (14684): No heartbeat from core client for 30 sec - exiting
00:57:47 (14684): No heartbeat from core client for 30 sec - exiting
00:57:48 (14684): No heartbeat from core client for 30 sec - exiting
00:57:49 (14684): No heartbeat from core client for 30 sec - exiting
00:57:50 (14684): No heartbeat from core client for 30 sec - exiting
00:57:51 (14684): No heartbeat from core client for 30 sec - exiting
00:57:52 (14684): No heartbeat from core client for 30 sec - exiting
00:57:53 (14684): No heartbeat from core client for 30 sec - exiting
00:57:54 (14684): No heartbeat from core client for 30 sec - exiting
00:57:55 (14684): No heartbeat from core client for 30 sec - exiting
00:57:56 (14684): No heartbeat from core client for 30 sec - exiting
00:57:57 (14684): No heartbeat from core client for 30 sec - exiting
00:57:58 (14684): No heartbeat from core client for 30 sec - exiting
00:57:59 (14684): No heartbeat from core client for 30 sec - exiting
00:58:00 (14684): No heartbeat from core client for 30 sec - exiting
00:58:01 (14684): No heartbeat from core client for 30 sec - exiting
00:58:02 (14684): No heartbeat from core client for 30 sec - exiting
00:58:03 (14684): No heartbeat from core client for 30 sec - exiting
00:58:04 (14684): No heartbeat from core client for 30 sec - exiting
00:58:05 (14684): No heartbeat from core client for 30 sec - exiting
00:58:06 (14684): No heartbeat from core client for 30 sec - exiting
00:58:07 (14684): No heartbeat from core client for 30 sec - exiting
00:58:08 (14684): No heartbeat from core client for 30 sec - exiting
00:58:09 (14684): No heartbeat from core client for 30 sec - exiting
00:58:10 (14684): No heartbeat from core client for 30 sec - exiting
00:58:11 (14684): No heartbeat from core client for 30 sec - exiting
00:58:12 (14684): No heartbeat from core client for 30 sec - exiting
00:58:13 (14684): No heartbeat from core client for 30 sec - exiting
00:58:14 (14684): No heartbeat from core client for 30 sec - exiting
00:58:15 (14684): No heartbeat from core client for 30 sec - exiting
00:58:16 (14684): No heartbeat from core client for 30 sec - exiting
00:58:17 (14684): No heartbeat from core client for 30 sec - exiting
00:58:18 (14684): No heartbeat from core client for 30 sec - exiting
00:58:19 (14684): No heartbeat from core client for 30 sec - exiting
00:58:20 (14684): No heartbeat from core client for 30 sec - exiting
00:58:21 (14684): No heartbeat from core client for 30 sec - exiting
01:30:57 (14928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:30:58 (14928): No heartbeat from core client for 30 sec - exiting
01:30:59 (14928): No heartbeat from core client for 30 sec - exiting
01:31:00 (14928): No heartbeat from core client for 30 sec - exiting
01:31:01 (14928): No heartbeat from core client for 30 sec - exiting
01:31:02 (14928): No heartbeat from core client for 30 sec - exiting
01:31:03 (14928): No heartbeat from core client for 30 sec - exiting
01:31:04 (14928): No heartbeat from core client for 30 sec - exiting
01:31:05 (14928): No heartbeat from core client for 30 sec - exiting
01:31:06 (14928): No heartbeat from core client for 30 sec - exiting
01:31:07 (14928): No heartbeat from core client for 30 sec - exiting
01:31:08 (14928): No heartbeat from core client for 30 sec - exiting
01:31:09 (14928): No heartbeat from core client for 30 sec - exiting
01:31:10 (14928): No heartbeat from core client for 30 sec - exiting
01:31:11 (14928): No heartbeat from core client for 30 sec - exiting
01:31:12 (14928): No heartbeat from core client for 30 sec - exiting
01:31:13 (14928): No heartbeat from core client for 30 sec - exiting
01:31:14 (14928): No heartbeat from core client for 30 sec - exiting
01:31:15 (14928): No heartbeat from core client for 30 sec - exiting
01:31:16 (14928): No heartbeat from core client for 30 sec - exiting
01:31:17 (14928): No heartbeat from core client for 30 sec - exiting
01:31:18 (14928): No heartbeat from core client for 30 sec - exiting
01:31:19 (14928): No heartbeat from core client for 30 sec - exiting
01:31:20 (14928): No heartbeat from core client for 30 sec - exiting
01:31:21 (14928): No heartbeat from core client for 30 sec - exiting
01:31:22 (14928): No heartbeat from core client for 30 sec - exiting
01:31:23 (14928): No heartbeat from core client for 30 sec - exiting
01:31:24 (14928): No heartbeat from core client for 30 sec - exiting
01:31:25 (14928): No heartbeat from core client for 30 sec - exiting
01:31:26 (14928): No heartbeat from core client for 30 sec - exiting
01:31:27 (14928): No heartbeat from core client for 30 sec - exiting
01:31:28 (14928): No heartbeat from core client for 30 sec - exiting
01:31:29 (14928): No heartbeat from core client for 30 sec - exiting
01:31:30 (14928): No heartbeat from core client for 30 sec - exiting
01:31:31 (14928): No heartbeat from core client for 30 sec - exiting
01:31:32 (14928): No heartbeat from core client for 30 sec - exiting
01:31:33 (14928): No heartbeat from core client for 30 sec - exiting
01:31:34 (14928): No heartbeat from core client for 30 sec - exiting
01:31:35 (14928): No heartbeat from core client for 30 sec - exiting
01:31:36 (14928): No heartbeat from core client for 30 sec - exiting
01:31:37 (14928): No heartbeat from core client for 30 sec - exiting
01:31:38 (14928): No heartbeat from core client for 30 sec - exiting
01:31:39 (14928): No heartbeat from core client for 30 sec - exiting
01:31:40 (14928): No heartbeat from core client for 30 sec - exiting
01:31:41 (14928): No heartbeat from core client for 30 sec - exiting
01:31:42 (14928): No heartbeat from core client for 30 sec - exiting
01:31:43 (14928): No heartbeat from core client for 30 sec - exiting
01:31:44 (14928): No heartbeat from core client for 30 sec - exiting
01:31:45 (14928): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf773a400]
[0xf773a430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf754c1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf754f825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75374d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7715400]
[0xf7715430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75271df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752a825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75124d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76dd400]
[0xf76dd430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74ef1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74f2825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74da4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77c7400]
[0xf77c7430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d91df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75dc825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c44d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf772e400]
[0xf772e430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75401df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7543825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf752b4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7733400]
[0xf7733430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75451df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7548825]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75304d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Feb 2013 03:02:00  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  336,960   402,949         1.1958
17 Feb 2013 18:11:37  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  311,040   372,511         1.1976
14 Feb 2013 01:54:12  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  285,120   343,643         1.2053
13 Feb 2013 13:20:13  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  259,200   311,736         1.2027
13 Feb 2013 03:35:15  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  233,280   278,186         1.1925
12 Feb 2013 05:43:58  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  207,360   244,546         1.1793
11 Feb 2013 04:00:10  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  181,440   214,374         1.1815
10 Feb 2013 18:17:08  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  155,520   182,387         1.1728
09 Feb 2013 16:45:57  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  129,600   149,017         1.1498
09 Feb 2013 07:06:25  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  103,680   118,798         1.1458
08 Feb 2013 22:29:02  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  77,760    89,075          1.1455
08 Feb 2013 13:55:25  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  51,840    59,184          1.1417
08 Feb 2013 05:20:50  1219011  15595753   hadcm3n_4bjw_1940_40_008308841_0  25,920    29,293          1.1301
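
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against the first and last trickle rows. The function name `seconds_per_timestep` and the four-place rounding are assumptions made for illustration, not part of the CPDN or BOINC software.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Hypothetical helper: it mirrors how the table's last column seems
    to be derived, not any actual CPDN server code.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs copied from the earliest and latest trickles
trickles = [(25_920, 29_293), (336_960, 402_949)]

for timestep, cpu_time_sec in trickles:
    print(timestep, seconds_per_timestep(cpu_time_sec, timestep))
```

Under this reading, the per-timestep cost rises from roughly 1.13 s in the first trickle to about 1.20 s in the last, consistent with the values reported in the table.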