climateprediction.net home page
Task 13111812

Task 13111812

Name hadcm3n_ygah_1900_40_007353971_0
Workunit 7551401
Created 6 Jul 2011, 14:30:40 UTC
Sent 15 Jul 2011, 15:05:26 UTC
Report deadline 14 Oct 2011, 22:32:37 UTC
Received 8 Aug 2011, 4:28:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1122800
Run time 8 days 7 hours 34 min 19 sec
CPU time 7 days 23 hours 59 min 3 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 3.11 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7988, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
08:42:12 (5188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:44:56 (5188): No heartbeat from core client for 30 sec - exiting
08:45:04 (5188): No heartbeat from core client for 30 sec - exiting
08:45:05 (5188): No heartbeat from core client for 30 sec - exiting
08:45:06 (5188): No heartbeat from core client for 30 sec - exiting
08:45:07 (5188): No heartbeat from core client for 30 sec - exiting
08:45:08 (5188): No heartbeat from core client for 30 sec - exiting
08:45:09 (5188): No heartbeat from core client for 30 sec - exiting
08:45:10 (5188): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Aug 2011 10:33:44 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 570,240 661,723 1.1604
03 Aug 2011 15:48:35 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 544,320 629,626 1.1567
03 Aug 2011 07:42:04 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 518,400 599,613 1.1567
02 Aug 2011 13:51:18 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 492,480 567,992 1.1533
01 Aug 2011 17:56:28 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 466,560 536,998 1.1510
01 Aug 2011 07:18:11 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 440,640 506,125 1.1486
28 Jul 2011 18:40:39 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 414,720 474,020 1.1430
28 Jul 2011 05:59:20 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 388,800 442,786 1.1389
27 Jul 2011 05:31:13 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 362,880 411,229 1.1332
25 Jul 2011 23:01:11 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 336,960 379,195 1.1253
25 Jul 2011 22:49:19 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 311,040 345,998 1.1124
25 Jul 2011 21:58:49 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 285,120 315,677 1.1072
25 Jul 2011 20:39:01 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 259,200 284,841 1.0989
25 Jul 2011 19:39:26 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 233,280 253,818 1.0880
25 Jul 2011 19:07:52 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 207,360 223,441 1.0776
25 Jul 2011 19:07:41 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 181,440 192,264 1.0597
25 Jul 2011 18:55:33 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 155,520 161,472 1.0383
25 Jul 2011 18:10:40 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 129,600 131,170 1.0121
25 Jul 2011 17:56:51 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 103,680 122,484 1.1814
25 Jul 2011 17:30:26 1122800 13111812 hadcm3n_ygah_1900_40_007353971_0 77,760 92,370 1.1879


©2024 cpdn.org