climateprediction.net home page
Task 15820348

Task 15820348

Name hadcm3n_zj65_1960_40_008321121_4
Workunit 8472256
Created 3 Jun 2013, 2:12:25 UTC
Sent 3 Jun 2013, 2:25:27 UTC
Report deadline 2 Sep 2013, 9:52:38 UTC
Received 14 Aug 2013, 16:41:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1239712
Run time 30 days 7 hours 55 min 27 sec
CPU time 28 days 15 hours 52 min 31 sec
Validate state Invalid
Credit 10,886.40
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
Zariadenie nepozná tento príkaz.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:12:38 (19788): No heartbeat from core client for 30 sec - exiting
21:12:39 (19788): No heartbeat from core client for 30 sec - exiting
21:12:40 (19788): No heartbeat from core client for 30 sec - exiting
21:12:41 (19788): No heartbeat from core client for 30 sec - exiting
21:12:42 (19788): No heartbeat from core client for 30 sec - exiting
21:12:43 (19788): No heartbeat from core client for 30 sec - exiting
21:12:44 (19788): No heartbeat from core client for 30 sec - exiting
21:12:45 (19788): No heartbeat from core client for 30 sec - exiting
21:12:46 (19788): No heartbeat from core client for 30 sec - exiting
21:12:47 (19788): No heartbeat from core client for 30 sec - exiting
21:12:48 (19788): No heartbeat from core client for 30 sec - exiting
21:12:49 (19788): No heartbeat from core client for 30 sec - exiting
21:12:50 (19788): No heartbeat from core client for 30 sec - exiting
21:12:51 (19788): No heartbeat from core client for 30 sec - exiting
21:12:52 (19788): No heartbeat from core client for 30 sec - exiting
21:12:53 (19788): No heartbeat from core client for 30 sec - exiting
21:12:54 (19788): No heartbeat from core client for 30 sec - exiting
21:12:55 (19788): No heartbeat from core client for 30 sec - exiting
21:12:56 (19788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5192, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zj65ko.pji2c10
Error converting file to netcdf: dataout/zj65ko.pii2c10
Error converting file to netcdf: dataout/zj65ko.pfi2c10
Error converting file to netcdf: dataout/zj65ka.phi2c10
Error converting file to netcdf: dataout/zj65ka.pgi2c10
Error converting file to netcdf: dataout/zj65ka.pei2c10
Error converting file to netcdf: dataout/zj65ka.pdi2c10
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6704, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6908, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6908, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6352, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7472, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zj65ko.pji8c10
Error converting file to netcdf: dataout/zj65ko.pii8c10
Error converting file to netcdf: dataout/zj65ko.pfi8c10
Error converting file to netcdf: dataout/zj65ka.phi8c10
Error converting file to netcdf: dataout/zj65ka.pgi8c10
Error converting file to netcdf: dataout/zj65ka.pei8c10
Error converting file to netcdf: dataout/zj65ka.pdi8c10
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6216, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1488, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6544, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6696, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6216, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5560, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6316, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2013 16:46:06 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 907,200 2,433,586 2.6825
14 Aug 2013 16:46:06 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 881,280 2,366,264 2.6850
14 Aug 2013 16:46:06 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 855,360 2,293,932 2.6818
14 Aug 2013 16:46:05 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 829,440 2,228,537 2.6868
14 Aug 2013 16:46:05 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 803,520 2,159,305 2.6873
14 Aug 2013 16:46:05 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 777,600 2,089,933 2.6877
26 Jul 2013 13:47:55 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 751,680 2,014,238 2.6796
24 Jul 2013 16:25:39 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 725,760 1,934,682 2.6657
23 Jul 2013 21:54:28 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 699,840 1,860,629 2.6586
11 Jul 2013 09:36:22 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 673,920 1,787,179 2.6519
09 Jul 2013 16:28:09 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 648,000 1,713,876 2.6449
08 Jul 2013 11:48:10 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 622,080 1,643,290 2.6416
07 Jul 2013 09:56:18 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 596,160 1,583,636 2.6564
04 Jul 2013 14:25:05 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 570,240 1,519,802 2.6652
02 Jul 2013 18:44:00 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 544,320 1,449,344 2.6627
02 Jul 2013 11:48:22 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 518,400 1,378,528 2.6592
02 Jul 2013 09:47:46 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 492,480 1,308,214 2.6564
27 Jun 2013 12:00:24 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 466,560 1,236,551 2.6504
26 Jun 2013 13:47:02 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 440,640 1,165,145 2.6442
25 Jun 2013 16:21:00 1239712 15820348 hadcm3n_zj65_1960_40_008321121_4 414,720 1,095,080 2.6405


©2024 cpdn.org