Task 10978107

Name hadsm3dhet2_jm3o_006591638_0
Workunit 6795011
Created 15 Mar 2010, 11:56:26 UTC
Sent 17 Oct 2010, 19:29:13 UTC
Report deadline 30 Sep 2011, 0:49:13 UTC
Received 13 May 2011, 18:06:22 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 725427
Run time 16 days 1 hours 20 min 42 sec
CPU time 14 days 8 hours 13 min 32 sec
Validate state Invalid
Credit 6,351.58
Device peak FLOPS 2.30 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
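The exit status above is given both as a signed integer (-226) and as hex (0xFFFFFF1E). Assuming the status is held as a 32-bit two's-complement value, the two forms are the same number. A minimal sketch of the conversion follows; the helper names are illustrative and not part of BOINC:

```python
def to_unsigned32(status: int) -> int:
    """Reinterpret a signed 32-bit status as its unsigned bit pattern."""
    return status & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit bit pattern as a signed integer."""
    value &= 0xFFFFFFFF
    # If the sign bit is set, subtract 2**32 to get the negative value.
    return value - 0x100000000 if value & 0x80000000 else value


if __name__ == "__main__":
    print(f"0x{to_unsigned32(-226):08X}")
    print(to_signed32(0xFFFFFF1E))
```

Run as a script, this prints the hex form of -226 and then the signed form of 0xFFFFFF1E, which should agree with the two values on the Exit status line.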
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2212, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8164, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	04:37:08 PM	No files match the supplied pattern. (line repeated 56 times)
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3492, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
MainError:	11:23:38 PM	No files match the supplied pattern. (line repeated 56 times)
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2944, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  172,832   1,223,050       1.7691
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  162,030   1,202,798       1.7675
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  151,228   1,183,480       1.7671
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  140,426   1,164,387       1.7671
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  129,624   1,144,421       1.7658
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  118,822   1,125,230       1.7656
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  108,020   1,104,358       1.7627
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  97,218    1,084,129       1.7608
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  86,416    1,064,408       1.7596
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  75,614    1,045,166       1.7592
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  64,812    1,026,233       1.7593
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  54,010    1,007,564       1.7599
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  43,208    988,487         1.7598
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  32,406    968,568         1.7581
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  21,604    948,614         1.7564
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  10,802    929,649         1.7564
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  259,248   910,763         1.7565
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  248,446   891,075         1.7551
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  237,644   871,202         1.7533
13 May 2011 18:08:13  725427   10978107   hadsm3dhet2_jm3o_006591638_0  226,842   851,302         1.7513
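
The Average (sec/TS) column appears consistent with cumulative CPU time divided by cumulative timesteps, if one assumes that the Timestep counter restarts after 259,248 steps (the largest value in the table) and that two such phases had completed before the most recent rows. This is an inference from the numbers shown, not something the page states. A rough check, with hypothetical names:

```python
# Assumed phase length: the largest Timestep value seen in the table.
PHASE_STEPS = 259_248


def average_sec_per_ts(cpu_sec: int, timestep: int, completed_phases: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps.

    completed_phases is the number of full phases assumed to have finished
    before the current Timestep counter started from zero.
    """
    total_steps = completed_phases * PHASE_STEPS + timestep
    return cpu_sec / total_steps


if __name__ == "__main__":
    # Most recent trickle: timestep 172,832 with two phases assumed complete.
    print(round(average_sec_per_ts(1_223_050, 172_832, 2), 4))
    # Trickle at timestep 259,248 with one phase assumed complete before it.
    print(round(average_sec_per_ts(910_763, 259_248, 1), 4))
```

Under these assumptions both results round to the values in the Average column for those rows (1.7691 and 1.7565). A different phase length or phase count would not reproduce them, which supports the reading but does not confirm it.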


©2024 climateprediction.net