Task 13552204

Name hadcm3n_ymld_1900_40_007523255_0
Workunit 7720730
Created 28 Oct 2011, 13:21:48 UTC
Sent 31 Oct 2011, 16:12:50 UTC
Report deadline 30 Jan 2012, 23:40:01 UTC
Received 29 Dec 2011, 22:26:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1160890
Run time 5 days 1 hours 11 min 11 sec
CPU time 4 days 7 hours 47 min 22 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 2.29 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3420, iMonCtr=1
Model crash detected, will try to restart...
08:24:55 (3548): No heartbeat from core client for 30 sec - exiting
08:24:56 (3548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:30:50 (3240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:50:08 (3260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:51:10 (3208): No heartbeat from core client for 30 sec - exiting
07:51:11 (3208): No heartbeat from core client for 30 sec - exiting
07:51:12 (3208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:34:04 (840): No heartbeat from core client for 30 sec - exiting
14:34:05 (840): No heartbeat from core client for 30 sec - exiting
14:34:06 (840): No heartbeat from core client for 30 sec - exiting
14:34:07 (840): No heartbeat from core client for 30 sec - exiting
14:34:08 (840): No heartbeat from core client for 30 sec - exiting
14:34:09 (840): No heartbeat from core client for 30 sec - exiting
14:34:10 (840): No heartbeat from core client for 30 sec - exiting
14:34:11 (840): No heartbeat from core client for 30 sec - exiting
14:34:12 (840): No heartbeat from core client for 30 sec - exiting
14:34:14 (840): No heartbeat from core client for 30 sec - exiting
14:34:15 (840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3552, iMonCtr=1
Model crash detected, will try to restart...
22:33:54 (3248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3364, iMonCtr=1
Model crash detected, will try to restart...
22:19:28 (3520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:25:27 (3120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:35:26 (3392): No heartbeat from core client for 30 sec - exiting
09:35:27 (3392): No heartbeat from core client for 30 sec - exiting
09:35:29 (3392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2312, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
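The stderr text above interleaves a few recurring event types: model crash restarts, heartbeat timeouts reported by the BOINC API, heartbeat timeouts reported by the CPDN monitor, and suspend requests. The following is a minimal Python sketch for tallying them, assuming the log has been saved as plain text. The function name `tally_events` and the labels in `PATTERNS` are illustrative and not part of BOINC or CPDN; `SAMPLE` holds a few lines copied from the log, not the whole log.

```python
import re
from collections import Counter

# Hypothetical labels mapped to substrings that appear verbatim in the log.
PATTERNS = {
    "model_crash_restart": re.compile(r"Model crash detected, will try to restart"),
    "client_no_heartbeat": re.compile(r"No heartbeat from core client for 30 sec"),
    "monitor_no_heartbeat": re.compile(r"CPDN Monitor - No 'heartbeat' from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
}


def tally_events(stderr_text: str) -> Counter:
    """Count how many lines match each known event pattern."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts


# A short excerpt copied from the stderr text, used only for illustration.
SAMPLE = """\
Model crash detected, will try to restart...
08:24:55 (3548): No heartbeat from core client for 30 sec - exiting
08:24:56 (3548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
"""

if __name__ == "__main__":
    for label, n in tally_events(SAMPLE).items():
        print(label, n)
```

Running the same function over the full stderr text would show how the heartbeat timeouts compare in number with the model crash restarts.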
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
19 Dec 2011 15:19:15  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  207,360   356,456         1.7190
11 Dec 2011 14:13:11  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  181,440   310,279         1.7101
03 Dec 2011 18:44:23  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  155,520   263,779         1.6961
29 Nov 2011 11:41:13  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  129,600   216,944         1.6740
26 Nov 2011 13:30:45  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  103,680   170,180         1.6414
03 Nov 2011 17:20:21  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  77,760    126,215         1.6231
03 Nov 2011 05:51:40  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  51,840    85,131          1.6422
02 Nov 2011 18:47:50  1160890  13552204   hadcm3n_ymld_1900_40_007523255_0  25,920    42,969          1.6578
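
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The Python sketch below checks that reading against the figures in the trickle table; the tuple layout and the function name `seconds_per_timestep` are illustrative assumptions, not part of the CPDN site.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
TRICKLES = [
    (207_360, 356_456, 1.7190),
    (181_440, 310_279, 1.7101),
    (155_520, 263_779, 1.6961),
    (129_600, 216_944, 1.6740),
    (103_680, 170_180, 1.6414),
    (77_760, 126_215, 1.6231),
    (51_840, 85_131, 1.6422),
    (25_920, 42_969, 1.6578),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by cumulative model timesteps."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu, reported in TRICKLES:
        computed = seconds_per_timestep(cpu, timestep)
        print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

The timestep values also show that consecutive trickles are 25,920 timesteps apart, so eight trickles had been received by the last row.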

