Task 15490332

Name hadcm3n_3ain_1940_40_008262001_0
Workunit 8417125
Created 21 Dec 2012, 0:08:29 UTC
Sent 21 Dec 2012, 0:17:23 UTC
Report deadline 22 Mar 2013, 7:44:34 UTC
Received 14 Jan 2013, 16:48:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1002677
Run time 16 days 18 hours 36 min 22 sec
CPU time 15 days 9 hours 40 min 20 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 3.27 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2528, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:32:54 (4900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6696, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
13 Jan 2013 23:41:24 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 881,280 | 1,328,051 | 1.5070
13 Jan 2013 00:23:05 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 855,360 | 1,289,584 | 1.5077
12 Jan 2013 13:50:17 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 829,440 | 1,251,958 | 1.5094
12 Jan 2013 02:38:10 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 803,520 | 1,214,653 | 1.5117
10 Jan 2013 19:39:49 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 777,600 | 1,176,866 | 1.5135
08 Jan 2013 21:47:25 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 751,680 | 1,139,423 | 1.5158
06 Jan 2013 23:34:03 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 725,760 | 1,101,819 | 1.5182
06 Jan 2013 09:06:09 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 699,840 | 1,065,597 | 1.5226
05 Jan 2013 21:37:33 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 673,920 | 1,026,526 | 1.5232
05 Jan 2013 08:35:41 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 648,000 | 987,579 | 1.5240
04 Jan 2013 21:10:45 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 622,080 | 948,886 | 1.5253
03 Jan 2013 17:51:41 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 596,160 | 908,511 | 1.5239
01 Jan 2013 21:11:27 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 570,240 | 868,260 | 1.5226
01 Jan 2013 09:26:29 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 544,320 | 828,011 | 1.5212
31 Dec 2012 21:23:16 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 518,400 | 787,878 | 1.5198
31 Dec 2012 09:23:33 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 492,480 | 747,888 | 1.5186
30 Dec 2012 12:10:11 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 466,560 | 707,768 | 1.5170
30 Dec 2012 00:17:15 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 440,640 | 667,646 | 1.5152
29 Dec 2012 12:10:07 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 414,720 | 628,823 | 1.5163
29 Dec 2012 00:08:12 | 1002677 | 15490332 | hadcm3n_3ain_1940_40_008262001_0 | 388,800 | 589,300 | 1.5157
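
The Average (sec/TS) figures in the trickle table appear to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. This is an inference from the numbers, not something the page states. A minimal sketch that checks this reading against three of the rows above (the function name sec_per_timestep is hypothetical):

```python
def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cumulative CPU time in seconds), copied from the trickle table.
rows = [
    (881_280, 1_328_051),  # 13 Jan 2013 23:41:24
    (855_360, 1_289_584),  # 13 Jan 2013 00:23:05
    (388_800, 589_300),    # 29 Dec 2012 00:08:12
]

averages = [sec_per_timestep(cpu, ts) for ts, cpu in rows]

for (ts, cpu), avg in zip(rows, averages):
    print(f"timestep {ts:>7,}  cpu {cpu:>9,} s  average {avg:.4f} s/TS")
```

For these three rows the computed values agree with the listed 1.5070, 1.5077 and 1.5157.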

