climateprediction.net home page
Task 13295898

Task 13295898

Name hadcm3n_p455_1940_40_007420633_2
Workunit 7618268
Created 25 Aug 2011, 12:20:30 UTC
Sent 25 Aug 2011, 12:24:39 UTC
Report deadline 24 Nov 2011, 19:51:50 UTC
Received 24 Sep 2011, 14:02:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1210909
Run time 15 days 20 hours 37 min 21 sec
CPU time 15 days 9 hours 14 min 33 sec
Validate state Invalid
Credit 9,953.28
Device peak FLOPS 3.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:01:15 (3784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
12:54:48 (4380): No heartbeat from core client for 30 sec - exiting
12:54:49 (4380): No heartbeat from core client for 30 sec - exiting
12:54:50 (4380): No heartbeat from core client for 30 sec - exiting
12:54:51 (4380): No heartbeat from core client for 30 sec - exiting
12:54:52 (4380): No heartbeat from core client for 30 sec - exiting
12:54:53 (4380): No heartbeat from core client for 30 sec - exiting
12:54:54 (4380): No heartbeat from core client for 30 sec - exiting
12:54:55 (4380): No heartbeat from core client for 30 sec - exiting
12:54:57 (4380): No heartbeat from core client for 30 sec - exiting
12:54:58 (4380): No heartbeat from core client for 30 sec - exiting
12:54:59 (4380): No heartbeat from core client for 30 sec - exiting
12:55:00 (4380): No heartbeat from core client for 30 sec - exiting
12:55:01 (4380): No heartbeat from core client for 30 sec - exiting
12:55:02 (4380): No heartbeat from core client for 30 sec - exiting
12:55:03 (4380): No heartbeat from core client for 30 sec - exiting
12:55:04 (4380): No heartbeat from core client for 30 sec - exiting
12:55:05 (4380): No heartbeat from core client for 30 sec - exiting
12:55:06 (4380): No heartbeat from core client for 30 sec - exiting
12:55:07 (4380): No heartbeat from core client for 30 sec - exiting
12:55:09 (4380): No heartbeat from core client for 30 sec - exiting
12:55:10 (4380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1
Model crash detected, will try to restart...
07:10:58 (4808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1504, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1504, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Sep 2011 14:50:22 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 829,440 1,300,628 1.5681
10 Sep 2011 02:51:30 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 803,520 1,259,720 1.5678
09 Sep 2011 13:15:44 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 777,600 1,219,984 1.5689
09 Sep 2011 01:35:36 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 751,680 1,179,935 1.5697
08 Sep 2011 13:02:09 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 725,760 1,139,507 1.5701
08 Sep 2011 01:43:22 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 699,840 1,099,240 1.5707
07 Sep 2011 14:24:14 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 673,920 1,059,115 1.5716
07 Sep 2011 03:09:03 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 648,000 1,018,939 1.5724
06 Sep 2011 15:51:20 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 622,080 978,862 1.5735
06 Sep 2011 00:01:04 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 596,160 938,326 1.5739
05 Sep 2011 12:22:10 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 570,240 897,881 1.5746
05 Sep 2011 00:53:54 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 544,320 857,602 1.5755
04 Sep 2011 13:15:51 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 518,400 817,292 1.5766
04 Sep 2011 01:39:47 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 492,480 776,995 1.5777
03 Sep 2011 14:03:46 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 466,560 736,614 1.5788
03 Sep 2011 02:26:45 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 440,640 696,049 1.5796
02 Sep 2011 14:49:17 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 414,720 655,460 1.5805
02 Sep 2011 03:00:40 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 388,800 614,528 1.5806
01 Sep 2011 15:20:44 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 362,880 573,543 1.5805
01 Sep 2011 03:28:39 1091571 13295898 hadcm3n_p455_1940_40_007420633_2 336,960 532,298 1.5797


©2024 cpdn.org