climateprediction.net home page
Task 15895105

Task 15895105

Name hadcm3n_u5mq_2020_40_008337061_4
Workunit 8487922
Created 18 Jul 2013, 12:17:54 UTC
Sent 18 Jul 2013, 12:59:23 UTC
Report deadline 17 Oct 2013, 20:26:34 UTC
Received 14 Aug 2013, 16:02:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1286530
Run time 14 days 14 hours 13 min 23 sec
CPU time 13 days 9 hours 27 min 58 sec
Validate state Invalid
Credit 11,819.52
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:49:43 (5908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:25:02 (4140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/u5mqko.pjp0c10
Error converting file to netcdf: dataout/u5mqko.pip0c10
Error converting file to netcdf: dataout/u5mqko.pfp0c10
Error converting file to netcdf: dataout/u5mqko.pcp0c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:19:24 (2852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1464, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1464, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1464, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 984,960 1,142,053 1.1595
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 933,120 1,082,834 1.1604
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 907,200 1,053,341 1.1611
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 881,280 1,027,451 1.1659
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 855,360 997,821 1.1666
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 803,520 938,603 1.1681
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 777,600 908,948 1.1689
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 751,680 878,768 1.1691
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 725,760 849,467 1.1705
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 699,840 819,747 1.1713
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 673,920 789,411 1.1714
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 648,000 760,285 1.1733
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 622,080 730,751 1.1747
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 596,160 701,395 1.1765
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 570,240 671,366 1.1773
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 544,320 641,336 1.1782
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 518,400 612,009 1.1806
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 492,480 582,735 1.1833
14 Aug 2013 16:11:04 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 466,560 552,510 1.1842
30 Jul 2013 09:47:16 1286530 15895105 hadcm3n_u5mq_2020_40_008337061_4 440,640 521,464 1.1834


©2024 climateprediction.net