climateprediction.net home page
Task 11077665

Task 11077665

Name hadsm3dhet2_jts7_006601593_6
Workunit 6804966
Created 15 Mar 2010, 12:09:23 UTC
Sent 13 Jun 2010, 0:31:11 UTC
Report deadline 26 May 2011, 5:51:11 UTC
Received 21 Jun 2010, 0:09:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 922180
Run time 7 days 7 hours 17 min 29 sec
CPU time 7 days 5 hours 20 min 15 sec
Validate state Invalid
Credit 4,168.22
Device peak FLOPS 2.28 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=3864, selfPID=3864, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5840, selfPID=5840, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=7884, selfPID=7884, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=7588, selfPID=7588, iMonCtr=1
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
MainError:	06:03:31 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5728, selfPID=5728, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5560, selfPID=5560, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=3864, selfPID=3864, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=6944, selfPID=6944, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jun 2010 12:50:05 922180 11077665 hadsm3dhet2_jts7_006601593_6 194,436 613,978 1.3533
20 Jun 2010 08:43:54 922180 11077665 hadsm3dhet2_jts7_006601593_6 183,634 599,586 1.3538
20 Jun 2010 05:40:41 922180 11077665 hadsm3dhet2_jts7_006601593_6 172,832 585,306 1.3546
20 Jun 2010 00:39:45 922180 11077665 hadsm3dhet2_jts7_006601593_6 162,030 570,913 1.3552
19 Jun 2010 20:36:20 922180 11077665 hadsm3dhet2_jts7_006601593_6 151,228 556,707 1.3562
19 Jun 2010 16:08:42 922180 11077665 hadsm3dhet2_jts7_006601593_6 140,426 542,507 1.3574
19 Jun 2010 12:00:22 922180 11077665 hadsm3dhet2_jts7_006601593_6 129,624 528,273 1.3585
19 Jun 2010 07:58:32 922180 11077665 hadsm3dhet2_jts7_006601593_6 118,822 514,167 1.3600
19 Jun 2010 03:56:16 922180 11077665 hadsm3dhet2_jts7_006601593_6 108,020 500,002 1.3614
18 Jun 2010 23:58:57 922180 11077665 hadsm3dhet2_jts7_006601593_6 97,218 485,899 1.3631
18 Jun 2010 19:46:48 922180 11077665 hadsm3dhet2_jts7_006601593_6 86,416 471,800 1.3649
18 Jun 2010 15:46:48 922180 11077665 hadsm3dhet2_jts7_006601593_6 75,614 457,674 1.3668
18 Jun 2010 11:42:41 922180 11077665 hadsm3dhet2_jts7_006601593_6 64,812 443,687 1.3692
18 Jun 2010 07:44:18 922180 11077665 hadsm3dhet2_jts7_006601593_6 54,010 429,668 1.3716
18 Jun 2010 01:39:26 922180 11077665 hadsm3dhet2_jts7_006601593_6 43,208 415,674 1.3743
17 Jun 2010 21:19:43 922180 11077665 hadsm3dhet2_jts7_006601593_6 32,406 401,602 1.3770
17 Jun 2010 17:14:40 922180 11077665 hadsm3dhet2_jts7_006601593_6 21,604 386,565 1.3764
17 Jun 2010 10:13:44 922180 11077665 hadsm3dhet2_jts7_006601593_6 10,802 371,548 1.3758
17 Jun 2010 06:05:40 922180 11077665 hadsm3dhet2_jts7_006601593_6 259,248 356,069 1.3735
17 Jun 2010 01:48:58 922180 11077665 hadsm3dhet2_jts7_006601593_6 248,446 340,732 1.3715


©2024 cpdn.org