climateprediction.net home page
Task 13189533

Task 13189533

Name hadam3p_eu_2jlh_2001_1_007387063_0
Workunit 7584493
Created 1 Aug 2011, 22:11:52 UTC
Sent 1 Aug 2011, 22:18:01 UTC
Report deadline 14 Jul 2012, 3:38:01 UTC
Received 12 Oct 2011, 6:53:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1105901
Run time 5 days 22 hours 31 min
CPU time 4 days 11 hours 40 min 43 sec
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 2.49 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10920, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11496, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8380, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
16:46:34 (8216): No heartbeat from core client for 30 sec - exiting
16:46:35 (8216): No heartbeat from core client for 30 sec - exiting
16:46:36 (8216): No heartbeat from core client for 30 sec - exiting
16:46:37 (8216): No heartbeat from core client for 30 sec - exiting
16:46:38 (8216): No heartbeat from core client for 30 sec - exiting
16:46:39 (8216): No heartbeat from core client for 30 sec - exiting
16:46:40 (8216): No heartbeat from core client for 30 sec - exiting
16:46:41 (8216): No heartbeat from core client for 30 sec - exiting
16:46:42 (8216): No heartbeat from core client for 30 sec - exiting
16:46:43 (8216): No heartbeat from core client for 30 sec - exiting
16:46:44 (8216): No heartbeat from core client for 30 sec - exiting
16:46:45 (8216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=524, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6212, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=6448, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13244, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12324, selfPID=13044, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6448, selfPID=6920, iMonCtr=1
Model crash detected, will try to restart...
20:43:00 (6792): No heartbeat from core client for 30 sec - exiting
20:43:01 (6792): No heartbeat from core client for 30 sec - exiting
20:43:02 (6792): No heartbeat from core client for 30 sec - exiting
20:43:03 (6792): No heartbeat from core client for 30 sec - exiting
20:43:04 (6792): No heartbeat from core client for 30 sec - exiting
20:43:05 (6792): No heartbeat from core client for 30 sec - exiting
20:43:06 (6792): No heartbeat from core client for 30 sec - exiting
20:43:07 (6792): No heartbeat from core client for 30 sec - exiting
20:43:08 (6792): No heartbeat from core client for 30 sec - exiting
20:43:09 (6792): No heartbeat from core client for 30 sec - exiting
20:43:10 (6792): No heartbeat from core client for 30 sec - exiting
20:43:12 (6792): No heartbeat from core client for 30 sec - exiting
20:43:13 (6792): No heartbeat from core client for 30 sec - exiting
20:43:14 (6792): No heartbeat from core client for 30 sec - exiting
20:43:48 (6792): No heartbeat from core client for 30 sec - exiting
20:43:49 (6792): No heartbeat from core client for 30 sec - exiting
20:43:50 (6792): No heartbeat from core client for 30 sec - exiting
20:43:51 (6792): No heartbeat from core client for 30 sec - exiting
20:43:52 (6792): No heartbeat from core client for 30 sec - exiting
20:43:53 (6792): No heartbeat from core client for 30 sec - exiting
20:43:54 (6792): No heartbeat from core client for 30 sec - exiting
20:43:55 (6792): No heartbeat from core client for 30 sec - exiting
20:43:56 (6792): No heartbeat from core client for 30 sec - exiting
20:43:58 (6792): No heartbeat from core client for 30 sec - exiting
20:43:59 (6792): No heartbeat from core client for 30 sec - exiting
20:44:00 (6792): No heartbeat from core client for 30 sec - exiting
20:44:01 (6792): No heartbeat from core client for 30 sec - exiting
20:44:02 (6792): No heartbeat from core client for 30 sec - exiting
20:44:03 (6792): No heartbeat from core client for 30 sec - exiting
20:44:04 (6792): No heartbeat from core client for 30 sec - exiting
20:44:05 (6792): No heartbeat from core client for 30 sec - exiting
20:44:06 (6792): No heartbeat from core client for 30 sec - exiting
20:44:07 (6792): No heartbeat from core client for 30 sec - exiting
20:44:08 (6792): No heartbeat from core client for 30 sec - exiting
20:44:09 (6792): No heartbeat from core client for 30 sec - exiting
20:44:10 (6792): No heartbeat from core client for 30 sec - exiting
20:44:11 (6792): No heartbeat from core client for 30 sec - exiting
20:44:12 (6792): No heartbeat from core client for 30 sec - exiting
20:44:13 (6792): No heartbeat from core client for 30 sec - exiting
20:44:14 (6792): No heartbeat from core client for 30 sec - exiting
20:44:15 (6792): No heartbeat from core client for 30 sec - exiting
20:44:16 (6792): No heartbeat from core client for 30 sec - exiting
20:44:17 (6792): No heartbeat from core client for 30 sec - exiting
20:44:18 (6792): No heartbeat from core client for 30 sec - exiting
20:44:19 (6792): No heartbeat from core client for 30 sec - exiting
20:44:20 (6792): No heartbeat from core client for 30 sec - exiting
20:44:21 (6792): No heartbeat from core client for 30 sec - exiting
20:44:22 (6792): No heartbeat from core client for 30 sec - exiting
20:44:23 (6792): No heartbeat from core client for 30 sec - exiting
20:44:24 (6792): No heartbeat from core client for 30 sec - exiting
20:44:25 (6792): No heartbeat from core client for 30 sec - exiting
20:44:26 (6792): No heartbeat from core client for 30 sec - exiting
20:44:27 (6792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
21:04:36 (4308): No heartbeat from core client for 30 sec - exiting
21:04:37 (4308): No heartbeat from core client for 30 sec - exiting
21:04:38 (4308): No heartbeat from core client for 30 sec - exiting
21:04:39 (4308): No heartbeat from core client for 30 sec - exiting
21:04:40 (4308): No heartbeat from core client for 30 sec - exiting
21:04:41 (4308): No heartbeat from core client for 30 sec - exiting
21:04:42 (4308): No heartbeat from core client for 30 sec - exiting
21:04:43 (4308): No heartbeat from core client for 30 sec - exiting
21:04:44 (4308): No heartbeat from core client for 30 sec - exiting
21:04:45 (4308): No heartbeat from core client for 30 sec - exiting
21:04:46 (4308): No heartbeat from core client for 30 sec - exiting
21:04:47 (4308): No heartbeat from core client for 30 sec - exiting
21:04:48 (4308): No heartbeat from core client for 30 sec - exiting
21:04:49 (4308): No heartbeat from core client for 30 sec - exiting
21:04:50 (4308): No heartbeat from core client for 30 sec - exiting
21:04:51 (4308): No heartbeat from core client for 30 sec - exiting
21:04:52 (4308): No heartbeat from core client for 30 sec - exiting
21:04:53 (4308): No heartbeat from core client for 30 sec - exiting
21:04:54 (4308): No heartbeat from core client for 30 sec - exiting
21:04:55 (4308): No heartbeat from core client for 30 sec - exiting
21:04:56 (4308): No heartbeat from core client for 30 sec - exiting
21:04:57 (4308): No heartbeat from core client for 30 sec - exiting
21:04:58 (4308): No heartbeat from core client for 30 sec - exiting
21:04:59 (4308): No heartbeat from core client for 30 sec - exiting
21:05:00 (4308): No heartbeat from core client for 30 sec - exiting
21:05:01 (4308): No heartbeat from core client for 30 sec - exiting
21:05:02 (4308): No heartbeat from core client for 30 sec - exiting
21:05:03 (4308): No heartbeat from core client for 30 sec - exiting
21:05:04 (4308): No heartbeat from core client for 30 sec - exiting
21:05:05 (4308): No heartbeat from core client for 30 sec - exiting
21:05:06 (4308): No heartbeat from core client for 30 sec - exiting
21:05:07 (4308): No heartbeat from core client for 30 sec - exiting
21:05:08 (4308): No heartbeat from core client for 30 sec - exiting
21:05:09 (4308): No heartbeat from core client for 30 sec - exiting
21:05:10 (4308): No heartbeat from core client for 30 sec - exiting
21:05:11 (4308): No heartbeat from core client for 30 sec - exiting
21:05:12 (4308): No heartbeat from core client for 30 sec - exiting
21:05:13 (4308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:48:04 (3664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7844, selfPID=4904, iMonCtr=1
Model crash detected, will try to restart...
12:25:50 (5272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:28:51 (8804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:30:35 (4632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:32:11 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:13:23 (2496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:13:24 (2496): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7620, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:23:44 (6644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:23:46 (6644): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:15:45 (6800): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
19:15:46 (6800): No heartbeat from core client for 30 sec - exiting
19:15:47 (6800): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7908, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5268, selfPID=8788, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Oct 2011 06:43:31 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 126,816 367,532 2.8982
06 Oct 2011 00:09:58 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 115,296 332,650 2.8852
29 Sep 2011 07:22:00 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 103,776 299,547 2.8865
19 Sep 2011 01:07:53 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 92,256 266,818 2.8921
06 Sep 2011 21:18:27 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 80,736 233,704 2.8947
02 Sep 2011 18:31:37 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 69,216 200,796 2.9010
31 Aug 2011 05:16:05 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 57,696 167,822 2.9087
30 Aug 2011 10:35:13 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 46,176 134,847 2.9203
16 Aug 2011 07:06:11 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 34,656 101,660 2.9334
12 Aug 2011 07:04:13 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 23,136 68,169 2.9464
04 Aug 2011 03:56:20 1105901 13189533 hadam3p_eu_2jlh_2001_1_007387063_0 11,616 34,130 2.9382


©2024 cpdn.org