climateprediction.net home page
Task 15633505

Task 15633505

Name hadcm3n_zgzu_1920_40_008319229_0
Workunit 8470364
Created 24 Feb 2013, 8:41:20 UTC
Sent 24 Feb 2013, 8:42:59 UTC
Report deadline 26 May 2013, 16:10:10 UTC
Received 14 Apr 2013, 19:24:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1074182
Run time 12 days 13 hours 1 min 53 sec
CPU time 6 days 10 hours 47 min 31 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 1.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Ocean Restart file copy failed on zgzuko.dac42h0
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zgzuko.pjc4c10
Error converting file to netcdf: dataout/zgzuko.pic4c10
Error converting file to netcdf: dataout/zgzuko.pfc4c10
Error converting file to netcdf: dataout/zgzuka.phc4c10
Error converting file to netcdf: dataout/zgzuka.pgc4c10
Error converting file to netcdf: dataout/zgzuka.pec4c10
Error converting file to netcdf: dataout/zgzuka.pdc4c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3628, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
18:27:53 (3804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:00:53 (3360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1216, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
08:17:17 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:17:18 (2680): No heartbeat from core client for 30 sec - exiting
08:17:19 (2680): No heartbeat from core client for 30 sec - exiting
08:17:20 (2680): No heartbeat from core client for 30 sec - exiting
08:17:21 (2680): No heartbeat from core client for 30 sec - exiting
08:17:23 (2680): No heartbeat from core client for 30 sec - exiting
08:17:24 (2680): No heartbeat from core client for 30 sec - exiting
08:17:25 (2680): No heartbeat from core client for 30 sec - exiting
08:17:26 (2680): No heartbeat from core client for 30 sec - exiting
08:17:27 (2680): No heartbeat from core client for 30 sec - exiting
08:17:28 (2680): No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3332, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Mar 2013 01:17:37 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 233,280 524,148 2.2469
05 Mar 2013 23:38:38 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 207,360 465,548 2.2451
04 Mar 2013 19:10:05 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 181,440 405,185 2.2332
03 Mar 2013 16:51:00 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 155,520 348,017 2.2378
02 Mar 2013 11:54:27 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 129,600 289,875 2.2367
01 Mar 2013 09:12:35 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 103,680 230,991 2.2279
27 Feb 2013 21:34:13 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 77,760 172,592 2.2195
26 Feb 2013 13:20:19 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 51,840 114,237 2.2036
25 Feb 2013 10:50:26 1074182 15633505 hadcm3n_zgzu_1920_40_008319229_0 25,920 57,586 2.2217


©2024 cpdn.org