climateprediction.net home page
Task 13625683

Task 13625683

Name hadcm3n_ymq0_1940_40_007543038_0
Workunit 7740270
Created 10 Nov 2011, 0:06:06 UTC
Sent 10 Nov 2011, 0:11:08 UTC
Report deadline 9 Feb 2012, 7:38:19 UTC
Received 23 Nov 2011, 20:51:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1080897
Run time 4 days 7 hours 48 min 48 sec
CPU time 4 days 7 hours 48 min 48 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 2.35 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
19:49:48 (3300): No heartbeat from core client for 30 sec - exiting
19:49:49 (3300): No heartbeat from core client for 30 sec - exiting
19:49:50 (3300): No heartbeat from core client for 30 sec - exiting
19:49:51 (3300): No heartbeat from core client for 30 sec - exiting
19:49:52 (3300): No heartbeat from core client for 30 sec - exiting
19:49:54 (3300): No heartbeat from core client for 30 sec - exiting
19:49:55 (3300): No heartbeat from core client for 30 sec - exiting
19:49:56 (3300): No heartbeat from core client for 30 sec - exiting
19:49:57 (3300): No heartbeat from core client for 30 sec - exiting
19:49:58 (3300): No heartbeat from core client for 30 sec - exiting
19:49:59 (3300): No heartbeat from core client for 30 sec - exiting
19:50:00 (3300): No heartbeat from core client for 30 sec - exiting
19:50:01 (3300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2920, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
08:09:28 (1420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:10:20 (1420): No heartbeat from core client for 30 sec - exiting
15:35:30 (3016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
08:20:09 (2944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Nov 2011 00:11:32 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 155,520 342,366 2.2014
20 Nov 2011 14:24:05 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 129,600 286,021 2.2070
19 Nov 2011 22:35:23 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 103,680 229,373 2.2123
18 Nov 2011 09:49:26 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 77,760 172,554 2.2191
17 Nov 2011 00:16:11 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 51,840 116,440 2.2461
15 Nov 2011 20:38:32 1080897 13625683 hadcm3n_ymq0_1940_40_007543038_0 25,920 57,424 2.2154


©2024 cpdn.org