climateprediction.net home page
Task 15726539

Name hadcm3n_o4ja_2140_40_008269979_3
Workunit 8425103
Created 16 Apr 2013, 8:32:38 UTC
Sent 16 Apr 2013, 8:32:44 UTC
Report deadline 16 Jul 2013, 15:59:55 UTC
Received 10 May 2013, 8:26:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1269992
Run time 14 days 15 hours 14 min 49 sec
CPU time 14 days 3 hours 39 min 4 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.64 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:56:56 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:59:53 PM	No files match the supplied pattern.
MainError:	04:59:53 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:55:23 PM	No files match the supplied pattern.
MainError:	04:55:23 PM	No files match the supplied pattern.
MainError:	02:23:27 PM	No files match the supplied pattern.
MainError:	02:23:27 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:40:39 PM	No files match the supplied pattern.
MainError:	01:40:39 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:03:05 PM	No files match the supplied pattern.
MainError:	01:03:05 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	11:26:15 AM	No files match the supplied pattern.
MainError:	11:26:15 AM	No files match the supplied pattern.
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 May 2013 11:37:07 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 648,000 1,193,420 1.8417
08 May 2013 13:55:04 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 622,080 1,146,789 1.8435
07 May 2013 14:40:28 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 596,160 1,101,148 1.8471
06 May 2013 14:25:24 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 570,240 1,054,061 1.8485
05 May 2013 17:11:32 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 544,320 1,007,301 1.8506
04 May 2013 17:01:17 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 518,400 961,076 1.8539
03 May 2013 18:55:23 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 492,480 915,321 1.8586
02 May 2013 19:58:35 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 466,560 868,152 1.8608
01 May 2013 21:42:04 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 440,640 821,245 1.8638
01 May 2013 09:08:59 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 414,720 774,863 1.8684
30 Apr 2013 11:31:20 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 388,800 728,171 1.8729
29 Apr 2013 12:38:55 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 362,880 682,550 1.8809
28 Apr 2013 13:22:31 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 336,960 637,540 1.8920
27 Apr 2013 14:30:24 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 311,040 589,074 1.8939
26 Apr 2013 15:21:27 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 285,120 540,268 1.8949
25 Apr 2013 17:21:38 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 259,200 491,659 1.8968
24 Apr 2013 17:37:18 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 233,280 445,570 1.9100
23 Apr 2013 18:51:27 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 207,360 396,859 1.9139
22 Apr 2013 21:24:52 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 181,440 348,909 1.9230
22 Apr 2013 06:48:09 1269992 15726539 hadcm3n_o4ja_2140_40_008269979_3 155,520 298,718 1.9208
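
The "Average (sec/TS)" column in the table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is inferred from the figures themselves and is not stated on the page. A minimal Python sketch that checks this for three rows copied from the table (the function name `avg_sec_per_timestep` is hypothetical, chosen for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, rounded to 4 places.
rows = [
    (648_000, 1_193_420, 1.8417),
    (622_080, 1_146_789, 1.8435),
    (155_520, 298_718, 1.9208),
]


def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = avg_sec_per_timestep(cpu_time_sec, timestep)
    # Print the computed value next to the value reported in the table.
    print(f"{timestep:>8} {computed:.4f} {reported:.4f}")
```

For these three rows the computed value matches the reported one. Consecutive rows in the table also differ by 25,920 timesteps.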

