Task 15652880

Name hadcm3n_4fsr_1940_40_008302757_2
Workunit 8453892
Created 7 Mar 2013, 5:37:41 UTC
Sent 7 Mar 2013, 5:37:46 UTC
Report deadline 6 Jun 2013, 13:04:57 UTC
Received 15 Apr 2013, 13:12:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1251647
Run time 4 days 23 hours 16 min 26 sec
CPU time 4 days 14 hours 22 min 39 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 3.61 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:20:12 (15532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:17:46 (6904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9856, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:16:11 (12268): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10832, iMonCtr=1
Model crash detected, will try to restart...
11:38:48 (10832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6236, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6236, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12864, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
15 Mar 2013 06:22:50  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  388,800   391,159         1.0061
14 Mar 2013 22:20:22  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  362,880   366,121         1.0089
14 Mar 2013 15:10:33  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  336,960   341,061         1.0122
12 Mar 2013 05:00:28  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  311,040   314,156         1.0100
11 Mar 2013 19:08:43  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  285,120   280,940         0.9853
10 Mar 2013 17:26:02  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  259,200   251,686         0.9710
10 Mar 2013 09:56:41  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  233,280   226,293         0.9700
10 Mar 2013 02:30:03  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  207,360   200,815         0.9684
09 Mar 2013 18:34:04  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  181,440   175,460         0.9670
09 Mar 2013 10:52:49  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  155,520   150,309         0.9665
08 Mar 2013 19:09:49  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  129,600   125,398         0.9676
08 Mar 2013 12:12:33  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  103,680   101,111         0.9752
08 Mar 2013 05:10:17  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  77,760    76,713          0.9865
07 Mar 2013 21:42:15  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  51,840    51,784          0.9989
07 Mar 2013 13:40:11  1251647  15652880   hadcm3n_4fsr_1940_40_008302757_2  25,920    25,900          0.9992
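
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That is an inference from the numbers shown, not something the page states. A minimal Python sketch that checks it against two rows copied from the table (the function name is illustrative only):

```python
# Recompute "Average (sec/TS)" for two trickle rows from the table.
# Assumption: average = cumulative CPU seconds / cumulative timesteps.

def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per timestep, rounded to 4 decimal places."""
    return round(cpu_time_sec / timestep, 4)

rows = [
    # (timestep, cpu_time_sec, reported_average)
    (25_920, 25_900, 0.9992),    # first trickle, 07 Mar 2013
    (388_800, 391_159, 1.0061),  # last trickle, 15 Mar 2013
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree for every row, and the gradual rise from about 0.97 to about 1.01 sec/TS reflects the running average rather than the speed of any single interval.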