climateprediction.net home page
Task 13770838

Task 13770838

Name hadcm3n_ybrs_1900_40_007526700_4
Workunit 7724175
Created 12 Dec 2011, 22:06:04 UTC
Sent 14 Dec 2011, 0:46:29 UTC
Report deadline 14 Mar 2012, 8:13:40 UTC
Received 31 Aug 2012, 21:10:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1171759
Run time 12 days 18 hours 35 min 14 sec
CPU time 12 days 7 hours 17 min 18 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:25:13 (4864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:23:53 (4608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ybrsko.pja8c10
Error converting file to netcdf: dataout/ybrsko.pia8c10
Error converting file to netcdf: dataout/ybrsko.pfa8c10
Error converting file to netcdf: dataout/ybrska.pha8c10
Error converting file to netcdf: dataout/ybrska.pga8c10
Error converting file to netcdf: dataout/ybrska.pea8c10
Error converting file to netcdf: dataout/ybrska.pda8c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:34:36 (716): No heartbeat from core client for 30 sec - exiting
23:34:37 (716): No heartbeat from core client for 30 sec - exiting
23:34:38 (716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:22:30 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7020, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7020, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7020, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Aug 2012 10:05:44 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 725,760 1,050,871 1.4480
03 Apr 2012 08:03:28 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 699,840 1,012,409 1.4466
01 Apr 2012 08:49:38 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 673,920 975,349 1.4473
31 Mar 2012 18:27:47 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 648,000 937,672 1.4470
30 Mar 2012 04:11:26 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 622,080 898,897 1.4450
16 Mar 2012 04:40:28 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 596,160 860,535 1.4435
15 Mar 2012 17:27:53 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 570,240 822,018 1.4415
14 Mar 2012 23:36:01 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 544,320 783,389 1.4392
14 Mar 2012 09:40:35 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 518,400 745,282 1.4377
18 Jan 2012 09:34:21 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 492,480 707,508 1.4366
15 Jan 2012 16:01:57 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 466,560 669,629 1.4352
15 Jan 2012 05:19:43 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 440,640 631,567 1.4333
14 Jan 2012 19:01:10 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 414,720 593,260 1.4305
14 Jan 2012 07:26:53 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 388,800 554,500 1.4262
13 Jan 2012 20:57:33 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 362,880 517,214 1.4253
13 Jan 2012 10:23:00 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 336,960 479,675 1.4235
12 Jan 2012 23:35:38 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 311,040 441,454 1.4193
30 Dec 2011 05:29:05 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 285,120 403,103 1.4138
29 Dec 2011 05:22:56 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 259,200 364,827 1.4075
20 Dec 2011 15:50:58 1171759 13770838 hadcm3n_ybrs_1900_40_007526700_4 233,280 327,849 1.4054


©2024 cpdn.org