Task 13125481

Name hadcm3n_ylkb_1900_40_007360805_0
Workunit 7558235
Created 6 Jul 2011, 15:15:08 UTC
Sent 7 Jul 2011, 16:52:32 UTC
Report deadline 7 Oct 2011, 0:19:43 UTC
Received 7 Aug 2011, 8:23:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 523127
Run time 8 days 23 hours 49 min 38 sec
CPU time 8 days 23 hours 49 min 38 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.20 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
17:19:20 (2156): No heartbeat from core client for 30 sec - exiting
17:19:21 (2156): No heartbeat from core client for 30 sec - exiting
17:19:27 (2156): No heartbeat from core client for 30 sec - exiting
17:19:29 (2156): No heartbeat from core client for 30 sec - exiting
17:19:34 (2156): No heartbeat from core client for 30 sec - exiting
17:19:36 (2156): No heartbeat from core client for 30 sec - exiting
17:19:38 (2156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:44:38 (2188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:16:33 (2768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
22:18:19 (540): No heartbeat from core client for 30 sec - exiting
22:18:20 (540): No heartbeat from core client for 30 sec - exiting
22:18:21 (540): No heartbeat from core client for 30 sec - exiting
22:18:22 (540): No heartbeat from core client for 30 sec - exiting
22:18:23 (540): No heartbeat from core client for 30 sec - exiting
22:18:24 (540): No heartbeat from core client for 30 sec - exiting
22:18:25 (540): No heartbeat from core client for 30 sec - exiting
22:18:26 (540): No heartbeat from core client for 30 sec - exiting
22:18:27 (540): No heartbeat from core client for 30 sec - exiting
22:18:28 (540): No heartbeat from core client for 30 sec - exiting
22:18:30 (540): No heartbeat from core client for 30 sec - exiting
22:18:31 (540): No heartbeat from core client for 30 sec - exiting
22:18:32 (540): No heartbeat from core client for 30 sec - exiting
22:18:33 (540): No heartbeat from core client for 30 sec - exiting
22:18:34 (540): No heartbeat from core client for 30 sec - exiting
22:18:35 (540): No heartbeat from core client for 30 sec - exiting
22:18:36 (540): No heartbeat from core client for 30 sec - exiting
22:18:37 (540): No heartbeat from core client for 30 sec - exiting
22:18:38 (540): No heartbeat from core client for 30 sec - exiting
22:18:39 (540): No heartbeat from core client for 30 sec - exiting
22:18:40 (540): No heartbeat from core client for 30 sec - exiting
22:18:42 (540): No heartbeat from core client for 30 sec - exiting
22:18:43 (540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Restart file copy failed on ylkbka.dab25a0

Model crashed: TEMPHIST: Failed in OPEN of history file tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/jobs/xabnk.ihist after 11 attempts
22:42:39 (448): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/jobs/xabnk.namelists after 11 attempts
22:42:40 (448): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/dataout/atmos_restart.day after 11 attempts
22:42:41 (448): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/dataout/ocean_restart.day after 11 attempts
22:42:41 (448): Can't open init data file - running in standalone mode
Could not launch model process. Last Error=6
Called boinc_finish
22:42:42 (448): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=344, iMonCtr=1
Model crash detected, will try to restart...
01:03:36 (344): No heartbeat from core client for 30 sec - exiting
01:03:37 (344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Jul 2011 08:06:51 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 285,120 745,505 2.6147
25 Jul 2011 22:58:03 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 259,200 677,354 2.6132
25 Jul 2011 22:11:27 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 233,280 608,752 2.6095
25 Jul 2011 20:53:44 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 207,360 543,944 2.6232
25 Jul 2011 16:23:14 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 181,440 478,991 2.6399
25 Jul 2011 15:48:17 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 155,520 411,791 2.6478
25 Jul 2011 15:01:53 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 129,600 342,876 2.6456
25 Jul 2011 15:01:53 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 103,680 275,047 2.6528
10 Jul 2011 17:46:02 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 77,760 207,022 2.6623
09 Jul 2011 09:20:32 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 51,840 139,084 2.6829
08 Jul 2011 14:23:40 523127 13125481 hadcm3n_ylkb_1900_40_007360805_0 25,920 69,768 2.6917
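
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The following is a minimal sketch that checks this against three rows copied from the table; the function names are hypothetical and the rounding rule is an assumption inferred from the figures, not taken from the project's own code.

```python
def parse_int(text: str) -> int:
    """Convert a thousands-separated figure such as '285,120' to an int."""
    return int(text.replace(",", ""))


def average_sec_per_timestep(cpu_time: str, timestep: str) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps.

    Rounding to 4 decimal places is an assumption based on how the
    table displays the value.
    """
    return round(parse_int(cpu_time) / parse_int(timestep), 4)


# (timestep, CPU time in seconds, reported average) copied from the table
rows = [
    ("285,120", "745,505", 2.6147),
    ("181,440", "478,991", 2.6399),
    ("25,920", "69,768", 2.6917),
]

for timestep, cpu_time, reported in rows:
    computed = average_sec_per_timestep(cpu_time, timestep)
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

The slowly falling average (2.6917 down to 2.6147 sec/TS) suggests the host ran at a fairly steady speed until the last trickle on 26 Jul 2011, before the restart failures recorded in the stderr output.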