climateprediction.net home page
Task 15421073

Task 15421073

Name hadcm3n_yak0_1980_40_008243040_2
Workunit 8398164
Created 30 Oct 2012, 20:15:15 UTC
Sent 30 Oct 2012, 20:15:31 UTC
Report deadline 30 Jan 2013, 3:42:42 UTC
Received 7 Dec 2012, 17:04:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1203097
Run time 5 days 12 hours 16 min 21 sec
CPU time 5 days 11 hours 21 min 52 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 2.85 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4476, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
20:12:18 (1140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1
Model crash detected, will try to restart...
20:48:28 (5520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
23:22:56 (5492): No heartbeat from core client for 30 sec - exiting
23:22:57 (5492): No heartbeat from core client for 30 sec - exiting
23:22:58 (5492): No heartbeat from core client for 30 sec - exiting
23:22:59 (5492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
18:36:29 (5660): No heartbeat from core client for 30 sec - exiting
18:36:30 (5660): No heartbeat from core client for 30 sec - exiting
18:36:31 (5660): No heartbeat from core client for 30 sec - exiting
18:36:32 (5660): No heartbeat from core client for 30 sec - exiting
18:36:33 (5660): No heartbeat from core client for 30 sec - exiting
18:36:34 (5660): No heartbeat from core client for 30 sec - exiting
18:36:35 (5660): No heartbeat from core client for 30 sec - exiting
18:36:36 (5660): No heartbeat from core client for 30 sec - exiting
18:36:37 (5660): No heartbeat from core client for 30 sec - exiting
18:36:38 (5660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:24:08 (5860): No heartbeat from core client for 30 sec - exiting
09:24:09 (5860): No heartbeat from core client for 30 sec - exiting
09:24:10 (5860): No heartbeat from core client for 30 sec - exiting
09:24:11 (5860): No heartbeat from core client for 30 sec - exiting
09:24:12 (5860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Dec 2012 09:29:34 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 362,880 464,731 1.2807
02 Dec 2012 20:56:17 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 336,960 431,565 1.2808
02 Dec 2012 11:59:14 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 311,040 398,464 1.2811
01 Dec 2012 16:07:53 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 285,120 365,305 1.2812
30 Nov 2012 09:16:54 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 259,200 332,344 1.2822
28 Nov 2012 19:05:33 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 233,280 298,345 1.2789
25 Nov 2012 19:40:11 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 207,360 265,276 1.2793
22 Nov 2012 19:50:16 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 181,440 232,398 1.2809
18 Nov 2012 11:20:10 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 155,520 199,004 1.2796
17 Nov 2012 13:05:59 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 129,600 166,025 1.2811
04 Nov 2012 20:30:19 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 103,680 132,697 1.2799
04 Nov 2012 11:56:04 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 77,760 99,400 1.2783
02 Nov 2012 23:08:24 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 51,840 66,179 1.2766
01 Nov 2012 18:31:48 1203097 15421073 hadcm3n_yak0_1980_40_008243040_2 25,920 33,068 1.2758


©2024 cpdn.org