Task 17659190

Name hadcm3n_xc2b_1940_40_009151929_4
Workunit 9282265
Created 29 Dec 2014, 12:49:30 UTC
Sent 29 Dec 2014, 12:51:01 UTC
Report deadline 30 Mar 2015, 20:18:12 UTC
Received 21 Feb 2015, 17:16:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1241124
Run time 15 days 6 hours 7 min 4 sec
CPU time 14 days 14 hours 26 min 39 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 1.21 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:00:10 (6040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:19:56 (1336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:43:05 (4948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:57:14 (6100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:15:32 (5832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:15:34 (5832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
17:01:34 (4028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:08:25 (4100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
13:10:18 (3404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:42:26 (3252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:42:28 (3252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:20:28 (3544): No heartbeat from core client for 30 sec - exiting
07:20:29 (3544): No heartbeat from core client for 30 sec - exiting
07:20:30 (3544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:20:31 (3544): No heartbeat from core client for 30 sec - exiting
07:20:33 (3544): No heartbeat from core client for 30 sec - exiting
07:20:34 (3544): No heartbeat from core client for 30 sec - exiting
07:20:35 (3544): No heartbeat from core client for 30 sec - exiting
07:20:36 (3544): No heartbeat from core client for 30 sec - exiting
07:20:37 (3544): No heartbeat from core client for 30 sec - exiting
07:20:38 (3544): No heartbeat from core client for 30 sec - exiting
07:20:39 (3544): No heartbeat from core client for 30 sec - exiting
07:20:40 (3544): No heartbeat from core client for 30 sec - exiting
07:20:41 (3544): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:17:09 (5764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9052, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9052, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
06:53:54 (9052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8340, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_xc2b_1940_40_009151929/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43744, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
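The stderr output above is long and repetitive, so a tally of message types can make the failure pattern easier to read. The following is a minimal sketch, not part of the BOINC or CPDN tooling: the labels in `PATTERNS` and the function name `tally_stderr` are hypothetical, and only the matched message text is taken from the log itself.

```python
import re
from collections import Counter

# Labels are our own (hypothetical); the matched text is copied from the log.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
    "restart_file_missing": re.compile(r"cannot open input file .*_restart\.day"),
    "too_many_crashes": re.compile(r"too many model crashes"),
}

def tally_stderr(text: str) -> Counter:
    """Count how many lines match each known message pattern."""
    counts = Counter()
    for line in text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break  # each line is counted under at most one label
    return counts

# A few lines copied from the stderr output, used as sample input.
sample = """\
CPDN Monitor - Quit request from BOINC...
13:00:10 (6040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

for label, count in tally_stderr(sample).items():
    print(f"{label}: {count}")
```

Run against the full stderr text, a tally like this would show how the quit and heartbeat messages give way to repeated restart-file failures near the end of the log.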
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Feb 2015 12:23:08  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   285,120       1,194,679            4.1901
15 Feb 2015 16:30:00  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   259,200       1,084,810            4.1852
09 Feb 2015 14:50:59  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   233,280         976,961            4.1879
08 Feb 2015 06:18:26  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   207,360         862,814            4.1609
05 Feb 2015 07:58:44  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   181,440         750,742            4.1377
02 Feb 2015 07:14:14  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   155,520         643,473            4.1376
30 Jan 2015 09:14:07  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   129,600         538,268            4.1533
25 Jan 2015 02:50:44  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4   103,680         432,231            4.1689
12 Jan 2015 09:02:40  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4    77,760         325,132            4.1812
08 Jan 2015 16:16:49  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4    51,840         215,049            4.1483
03 Jan 2015 22:31:15  1241124  17659190   hadcm3n_xc2b_1940_40_009151929_4    25,920         106,224            4.0981
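
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The function name `avg_sec_per_timestep` and the tolerance are assumptions made for illustration, not anything defined by the project.

```python
# (timestep, cpu_time_sec, reported_average) copied from three table rows.
trickles = [
    (285_120, 1_194_679, 4.1901),
    (259_200, 1_084_810, 4.1852),
    (25_920, 106_224, 4.0981),
]

def avg_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = avg_sec_per_timestep(timestep, cpu_time_sec)
    # The table shows four decimals, so allow a small difference.
    matches = abs(computed - reported) < 1e-4
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported}  match={matches}")
```

For these rows the computed and reported values agree to within the four decimals the table shows, which is consistent with the assumed definition.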