climateprediction.net home page
Task 16079786

Task 16079786

Name hadcm3n_7zqq_1980_40_008457605_2
Workunit 8608461
Created 12 Nov 2013, 20:53:32 UTC
Sent 12 Nov 2013, 20:53:40 UTC
Report deadline 12 Feb 2014, 4:20:51 UTC
Received 17 Nov 2013, 0:46:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1298350
Run time 3 days 13 hours 6 min 17 sec
CPU time 3 days 12 hours 53 min 32 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 4.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:29:03 (5420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Nov 2013 13:46:51 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 570,240 303,024 0.5314
16 Nov 2013 10:00:41 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 544,320 289,550 0.5319
16 Nov 2013 06:09:43 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 518,400 275,780 0.5320
16 Nov 2013 02:08:49 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 492,480 261,609 0.5312
15 Nov 2013 17:56:11 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 466,560 247,865 0.5313
15 Nov 2013 14:07:16 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 440,640 234,202 0.5315
15 Nov 2013 10:21:07 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 414,720 220,532 0.5318
15 Nov 2013 06:30:24 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 388,800 206,875 0.5321
15 Nov 2013 02:44:33 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 362,880 193,202 0.5324
14 Nov 2013 22:58:41 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 336,960 179,524 0.5328
14 Nov 2013 19:07:45 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 311,040 165,817 0.5331
14 Nov 2013 15:22:31 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 285,120 152,112 0.5335
14 Nov 2013 11:29:07 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 259,200 138,348 0.5338
14 Nov 2013 07:38:02 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 233,280 124,624 0.5342
14 Nov 2013 03:52:08 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 207,360 110,938 0.5350
14 Nov 2013 00:05:20 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 181,440 97,270 0.5361
13 Nov 2013 20:09:58 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 155,520 83,379 0.5361
13 Nov 2013 16:17:41 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 129,600 69,480 0.5361
13 Nov 2013 12:26:58 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 103,680 55,565 0.5359
13 Nov 2013 08:31:03 1298350 16079786 hadcm3n_7zqq_1980_40_008457605_2 77,760 41,668 0.5359


©2024 cpdn.org