climateprediction.net home page
Task 15679062

Name hadcm3n_o5ct_2060_40_008335623_0
Workunit 8486484
Created 23 Mar 2013, 2:25:34 UTC
Sent 23 Mar 2013, 2:29:52 UTC
Report deadline 22 Jun 2013, 9:57:03 UTC
Received 2 Apr 2013, 22:43:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1259079
Run time 9 days 10 hours 53 min 20 sec
CPU time 9 days 4 hours 54 min 35 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 3.63 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
03:49:18 (7688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:49:22 (7688): No heartbeat from core client for 30 sec - exiting
03:49:25 (7688): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:42:45 (3092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:42:46 (3092): No heartbeat from core client for 30 sec - exiting
14:42:49 (3092): No heartbeat from core client for 30 sec - exiting
14:42:52 (3092): No heartbeat from core client for 30 sec - exiting
14:42:55 (3092): No heartbeat from core client for 30 sec - exiting
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o5ctko.pjr6c10
Error converting file to netcdf: dataout/o5ctko.pir6c10
Error converting file to netcdf: dataout/o5ctko.pfr6c10
Error converting file to netcdf: dataout/o5ctka.phr6c10
Error converting file to netcdf: dataout/o5ctka.pgr6c10
Error converting file to netcdf: dataout/o5ctka.per6c10
Error converting file to netcdf: dataout/o5ctka.pdr6c10
CPDN Monitor - Quit request from BOINC...
16:56:50 (5428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:56:53 (5428): No heartbeat from core client for 30 sec - exiting
16:56:56 (5428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
01:01:54 (3016): No heartbeat from core client for 30 sec - exiting
01:01:58 (3016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:02:01 (3016): No heartbeat from core client for 30 sec - exiting
01:11:55 (1364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:26:58 (1792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:27:00 (1792): No heartbeat from core client for 30 sec - exiting
01:27:03 (1792): No heartbeat from core client for 30 sec - exiting
01:27:06 (1792): No heartbeat from core client for 30 sec - exiting
01:27:09 (1792): No heartbeat from core client for 30 sec - exiting
01:27:11 (1792): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
02 Apr 2013 13:25:32  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  673,920   785,692         1.1659
02 Apr 2013 04:21:56  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  648,000   754,892         1.1650
01 Apr 2013 18:55:52  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  622,080   723,691         1.1633
01 Apr 2013 08:44:24  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  596,160   692,382         1.1614
31 Mar 2013 23:46:06  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  570,240   660,764         1.1587
31 Mar 2013 14:45:31  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  544,320   629,195         1.1559
31 Mar 2013 05:53:24  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  518,400   597,901         1.1534
30 Mar 2013 21:15:14  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  492,480   567,591         1.1525
30 Mar 2013 12:40:39  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  466,560   537,441         1.1519
30 Mar 2013 04:12:58  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  440,640   507,199         1.1511
29 Mar 2013 19:47:21  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  414,720   477,718         1.1519
29 Mar 2013 10:16:28  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  388,800   448,146         1.1526
29 Mar 2013 00:56:39  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  362,880   418,190         1.1524
28 Mar 2013 09:23:53  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  336,960   388,704         1.1536
28 Mar 2013 01:05:57  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  311,040   359,651         1.1563
27 Mar 2013 16:50:52  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  285,120   330,463         1.1590
27 Mar 2013 08:22:11  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  259,200   300,539         1.1595
26 Mar 2013 23:53:10  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  233,280   270,598         1.1600
26 Mar 2013 07:15:36  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  207,360   240,803         1.1613
25 Mar 2013 22:42:04  1259079  15679062   hadcm3n_o5ct_2060_40_008335623_0  181,440   211,068         1.1633
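
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The following is a minimal sketch for checking that on a single row. It assumes whitespace-separated fields in the order shown in the table header; the names `parse_trickle` and `recomputed_average` are hypothetical helpers for illustration and are not part of BOINC or the CPDN software.

```python
def parse_trickle(line):
    """Split one trickle row into named fields (assumed column order)."""
    parts = line.split()
    return {
        # The first four tokens form the UTC timestamp, kept as text here.
        "sent_utc": " ".join(parts[:4]),
        "host_id": int(parts[4]),
        "result_id": int(parts[5]),
        "result_name": parts[6],
        # Timestep and CPU time use thousands separators in the page.
        "timestep": int(parts[7].replace(",", "")),
        "cpu_sec": int(parts[8].replace(",", "")),
        "avg_sec_per_ts": float(parts[9]),
    }


def recomputed_average(row):
    """CPU seconds per timestep, rounded like the table (four decimals)."""
    return round(row["cpu_sec"] / row["timestep"], 4)


row = parse_trickle(
    "02 Apr 2013 13:25:32 1259079 15679062 "
    "hadcm3n_o5ct_2060_40_008335623_0 673,920 785,692 1.1659"
)
print(row["avg_sec_per_ts"], recomputed_average(row))
```

For the most recent trickle, 785,692 s over 673,920 timesteps reproduces the reported figure. Running the same check on other rows is a quick way to confirm that the column is a running average over the whole task rather than a per-interval rate.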

