climateprediction.net (CPDN) home page
Task 15806245

Task 15806245

Name hadcm3n_o3c5_2140_40_008269454_4
Workunit 8424578
Created 30 May 2013, 7:54:55 UTC
Sent 30 May 2013, 8:10:55 UTC
Report deadline 29 Aug 2013, 15:38:06 UTC
Received 5 Jun 2013, 22:09:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1211982
Run time 6 days 9 hours 56 min 18 sec
CPU time 5 days 21 hours 56 min 35 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 3.54 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
11:22:39 (9984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jun 2013 15:18:13 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 440,640 501,231 1.1375
05 Jun 2013 07:14:40 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 414,720 471,697 1.1374
04 Jun 2013 21:14:29 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 388,800 441,929 1.1366
04 Jun 2013 12:21:27 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 362,880 412,432 1.1366
04 Jun 2013 04:17:36 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 336,960 382,749 1.1359
03 Jun 2013 19:14:47 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 311,040 353,109 1.1353
03 Jun 2013 10:25:01 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 285,120 323,459 1.1345
03 Jun 2013 01:22:45 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 259,200 293,734 1.1332
02 Jun 2013 15:46:16 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 233,280 264,059 1.1319
02 Jun 2013 07:58:28 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 207,360 234,502 1.1309
01 Jun 2013 23:05:00 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 181,440 205,130 1.1306
01 Jun 2013 13:46:19 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 155,520 176,153 1.1327
01 Jun 2013 04:40:52 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 129,600 146,864 1.1332
31 May 2013 20:15:39 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 103,680 117,259 1.1310
31 May 2013 11:17:14 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 77,760 87,805 1.1292
31 May 2013 02:18:43 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 51,840 58,527 1.1290
30 May 2013 17:12:51 1211982 15806245 hadcm3n_o3c5_2140_40_008269454_4 25,920 29,259 1.1288


©2024 cpdn.org