climateprediction.net home page
Task 13542592

Task 13542592

Name hadcm3n_ydp8_1900_40_007518456_0
Workunit 7715931
Created 28 Oct 2011, 12:58:12 UTC
Sent 20 Nov 2011, 18:58:24 UTC
Report deadline 20 Feb 2012, 2:25:35 UTC
Received 20 Mar 2012, 14:15:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1178522
Run time 3 days 23 hours 19 min 7 sec
CPU time 3 days 14 hours 46 min 39 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 3.28 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
18:39:11 (1928): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4548, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ydp8ko.pja8c10
Error converting file to netcdf: dataout/ydp8ko.pia8c10
Error converting file to netcdf: dataout/ydp8ko.pfa8c10
Error converting file to netcdf: dataout/ydp8ka.pha8c10
Error converting file to netcdf: dataout/ydp8ka.pga8c10
Error converting file to netcdf: dataout/ydp8ka.pea8c10
Error converting file to netcdf: dataout/ydp8ka.pda8c10
19:10:45 (4528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
10:48:12 (2012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ydp8_1900_40_007518456/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Mar 2012 02:47:07 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 259,200 312,447 1.2054
18 Mar 2012 22:39:16 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 233,280 280,317 1.2016
16 Mar 2012 21:06:06 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 207,360 248,046 1.1962
09 Mar 2012 20:20:34 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 181,440 217,834 1.2006
29 Feb 2012 21:15:07 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 155,520 187,398 1.2050
28 Feb 2012 19:12:09 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 129,600 155,961 1.2034
24 Feb 2012 20:56:41 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 103,680 124,585 1.2016
17 Feb 2012 16:54:34 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 77,760 92,610 1.1910
19 Jan 2012 15:08:17 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 51,840 61,673 1.1897
17 Jan 2012 19:30:33 1178522 13542592 hadcm3n_ydp8_1900_40_007518456_0 25,920 30,996 1.1958


©2024 cpdn.org