Task 13105118

Name hadcm3n_ydpi_1900_40_007350624_1
Workunit 7548054
Created 6 Jul 2011, 14:07:35 UTC
Sent 16 Jul 2011, 22:16:51 UTC
Report deadline 16 Oct 2011, 5:44:02 UTC
Received 13 Sep 2011, 18:22:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1340931
Run time 19 days 6 hours 40 min 35 sec
CPU time 15 days 7 hours 33 min 46 sec
Validate state Invalid
Credit 9,642.24
Device peak FLOPS 2.84 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:45:31 (5872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:45:33 (5872): No heartbeat from core client for 30 sec - exiting
14:45:34 (5872): No heartbeat from core client for 30 sec - exiting
14:45:35 (5872): No heartbeat from core client for 30 sec - exiting
14:45:36 (5872): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8688, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ydpiko.pjc2c10
Error converting file to netcdf: dataout/ydpiko.pic2c10
Error converting file to netcdf: dataout/ydpiko.pfc2c10
Error converting file to netcdf: dataout/ydpika.phc2c10
Error converting file to netcdf: dataout/ydpika.pgc2c10
Error converting file to netcdf: dataout/ydpika.pec2c10
Error converting file to netcdf: dataout/ydpika.pdc2c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:06:12 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13804, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
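The stderr output above is long and repetitive, so tallying it by message type can make it easier to read. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the function name `summarize_stderr`, the labels in `PATTERNS`, and the short `SAMPLE` excerpt are all assumptions made for illustration, and the substrings are copied from messages that appear in the log.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that appear in the stderr text above.
PATTERNS = {
    "quit_request": "Quit request from BOINC",
    "suspend_request": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "io_error": "BUFFIN: C I/O Error",
    "netcdf_error": "Error converting file to netcdf",
    "gave_up": "too many model crashes",
}


def summarize_stderr(text: str) -> Counter:
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts


# A short excerpt for demonstration only; paste the full <stderr_txt> body to
# summarize the whole task.
SAMPLE = """\
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:45:31 (5872): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
Error converting file to netcdf: dataout/ydpiko.pjc2c10
Sorry, too many model crashes! :-(
"""

if __name__ == "__main__":
    for label, n in summarize_stderr(SAMPLE).most_common():
        print(f"{label}: {n}")
```

Run against the full log, a tally like this would show how many restart attempts preceded the final "too many model crashes" message.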
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
11 Sep 2011 11:30:56  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  803,520   1,284,328       1.5984
06 Sep 2011 21:44:22  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  777,600   1,242,804       1.5983
06 Sep 2011 04:59:31  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  751,680   1,202,834       1.6002
31 Aug 2011 05:21:14  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  725,760   1,160,664       1.5992
27 Aug 2011 09:18:19  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  699,840   1,119,492       1.5996
24 Aug 2011 09:17:19  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  673,920   1,080,343       1.6031
23 Aug 2011 22:07:24  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  648,000   1,040,291       1.6054
23 Aug 2011 10:43:00  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  622,080   1,000,242       1.6079
22 Aug 2011 19:28:17  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  596,160   959,919         1.6102
18 Aug 2011 20:18:26  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  570,240   918,496         1.6107
17 Aug 2011 02:37:57  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  544,320   874,624         1.6068
16 Aug 2011 00:55:36  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  518,400   830,908         1.6028
15 Aug 2011 10:30:08  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  492,480   790,497         1.6051
14 Aug 2011 18:35:32  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  466,560   749,396         1.6062
14 Aug 2011 03:13:33  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  440,640   708,555         1.6080
11 Aug 2011 19:31:40  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  414,720   668,295         1.6114
11 Aug 2011 08:14:03  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  388,800   628,461         1.6164
10 Aug 2011 18:42:06  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  362,880   587,344         1.6186
10 Aug 2011 03:15:32  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  336,960   546,269         1.6212
09 Aug 2011 09:39:45  1097785  13105118   hadcm3n_ydpi_1900_40_007350624_1  311,040   503,743         1.6195
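
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the three most recent rows; the helper name `average_sec_per_timestep` is hypothetical, and the formula is inferred from the figures rather than taken from any CPDN documentation.

```python
# (timestep, cpu_time_sec, reported_average) copied from the three newest trickles.
TRICKLES = [
    (803_520, 1_284_328, 1.5984),
    (777_600, 1_242_804, 1.5983),
    (751_680, 1_202_834, 1.6002),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed formula: cumulative CPU seconds per cumulative model timestep."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu, reported in TRICKLES:
        computed = average_sec_per_timestep(timestep, cpu)
        print(f"timestep={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Consecutive rows in the table also differ by 25,920 timesteps, so each trickle marks a fixed amount of model progress.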

