Task 17554113

Name	hadam3p_anz_f20l_2012_1_009265148_1
Workunit	9358064
Created	6 Dec 2014, 8:48:11 UTC
Sent	6 Dec 2014, 8:52:19 UTC
Report deadline	18 Nov 2015, 14:12:19 UTC
Received	11 Feb 2015, 14:33:07 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1347325
Run time	16 days 13 hours 55 min 53 sec
CPU time	14 days 23 hours 9 min 41 sec
Validate state	Workunit error - check skipped
Credit	5,974.74
Device peak FLOPS	2.02 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr	<core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 10:54:59 (1596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:55:00 (1596): No heartbeat from core client for 30 sec - exiting 10:55:02 (1596): No heartbeat from core client for 30 sec - exiting 11:05:44 (2796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:05:47 (2796): No heartbeat from core client for 30 sec - exiting 11:05:48 (2796): No heartbeat from core client for 30 sec - exiting 11:05:49 (2796): No heartbeat from core client for 30 sec - exiting 11:05:50 (2796): No heartbeat from core client for 30 sec - exiting 11:05:51 (2796): No heartbeat from core client for 30 sec - exiting 11:27:19 (1476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:27:20 (1476): No heartbeat from core client for 30 sec - exiting 11:48:41 (1544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:48:42 (1544): No heartbeat from core client for 30 sec - exiting 11:48:43 (1544): No heartbeat from core client for 30 sec - exiting Glonobal Wller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1712, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... GController:: CPDN procesl is not ronnal Workering, exiting, bRetVal = 1, checkPID=0, selfPID=3184, iMonCtr=2 Model crash detected, will try to restart... 
:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=420, iMonCtr=2 GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=2 Model crash detected, will try to restart... lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=2 Model crash detected, will try to restart... CGntrollerbal CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3924, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2588, selfPID=2588, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1336, iMonCtr=2 Model crash detected, will try to restart... Global WorkerCPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1992, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4080, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3116, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1172, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:51:12 (3756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:14 (3756): No heartbeat from core client for 30 sec - exiting 08:37:56 (2676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:58 (2676): No heartbeat from core client for 30 sec - exiting 08:38:00 (2676): No heartbeat from core client for 30 sec - exiting 08:38:01 (2676): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3804, iMonCtr=2 Model crash detected, will try to restart... 
Leaving CPDN_Main::Monitor... 07:58:25 (3484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:58:26 (3484): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=848, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=396, selfPID=396, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2748, selfPID=2748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3088, selfPID=3088, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2156, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2108, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global WorkerController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2276, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2332, selfPID=2668, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2544, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:27:41 (3692): No heartbeat from core client for 30 sec - exiting 07:27:42 (3692): No heartbeat from core client for 30 sec - exiting 07:27:43 (3692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1008, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1804, selfPID=3880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4052, selfPID=2516, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:31:41 (3776): No heartbeat from core client for 30 sec - exiting 07:31:42 (3776): No heartbeat from core client for 30 sec - exiting 07:31:43 (3776): No heartbeat from core client for 30 sec - exiting 07:31:44 (3776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3236, selfPID=2928, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3900, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:42:43 (3536): No heartbeat from core client for 30 sec - exiting 07:42:44 (3536): No heartbeat from core client for 30 sec - exiting 07:42:45 (3536): No heartbeat from core client for 30 sec - exiting 07:42:46 (3536): No heartbeat from core client for 30 sec - exiting 07:42:48 (3536): No heartbeat from core client for 30 sec - exiting 07:42:49 (3536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:21:49 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:21:50 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=3180, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=264, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:48:39 (4712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2748, selfPID=2748, iMonCtr=2 17:48:47 (4712): No heartbeat from core client for 30 sec - exiting 17:48:48 (4712): No heartbeat from core client for 30 sec - exiting 17:48:49 (4712): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4100, selfPID=3876, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2212, selfPID=3504, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=936, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=472, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3816, iMonCtr=2 Model crash detected, will try to restart... 07:36:11 (2608): No heartbeat from core client for 30 sec - exiting 07:36:12 (2608): No heartbeat from core client for 30 sec - exiting 07:36:13 (2608): No heartbeat from core client for 30 sec - exiting 07:36:15 (2608): No heartbeat from core client for 30 sec - exiting 07:36:16 (2608): No heartbeat from core client for 30 sec - exiting 07:36:17 (2608): No heartbeat from core client for 30 sec - exiting 07:36:19 (2608): No heartbeat from core client for 30 sec - exiting 07:36:20 (2608): No heartbeat from core client for 30 sec - exiting 07:36:21 (2608): No heartbeat from core client for 30 sec - exiting 07:36:22 (2608): No heartbeat from core client for 30 sec - exiting 07:36:23 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:36:24 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
11 Feb 2015 12:52:34	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	138,539	1,292,010	9.3260
04 Feb 2015 11:35:49	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	127,019	1,185,373	9.3322
30 Jan 2015 11:39:39	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	115,499	1,083,319	9.3795
26 Jan 2015 09:27:38	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	103,979	975,081	9.3777
22 Jan 2015 07:33:06	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	92,459	869,505	9.4042
09 Jan 2015 06:27:55	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	80,939	766,132	9.4655
05 Jan 2015 08:02:38	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	69,419	662,555	9.5443
29 Dec 2014 14:09:57	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	57,899	547,887	9.4628
24 Dec 2014 12:52:26	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	46,379	436,159	9.4042
19 Dec 2014 14:51:49	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	34,859	333,815	9.5761
15 Dec 2014 16:05:43	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	23,339	214,989	9.2116
10 Dec 2014 06:19:23	1347325	17554113	hadam3p_anz_f20l_2012_1_009265148_1	11,819	107,912	9.1304
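
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU Time (sec) divided by the Timestep. The sketch below checks that reading against a few rows copied from the table. The helper name `sec_per_timestep` and the `TRICKLES` list are illustrative assumptions, not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average sec/TS)
TRICKLES = [
    (138_539, 1_292_010, 9.3260),
    (127_019, 1_185_373, 9.3322),
    (23_339, 214_989, 9.2116),
    (11_819, 107_912, 9.1304),
]


def sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Show the recomputed value next to the reported one and their difference.
    print(f"{timestep:>8} {computed:.4f} {reported:.4f} {computed - reported:+.5f}")
```

For the rows listed, the recomputed values agree with the reported averages to within rounding, which supports the assumed definition without confirming how the server actually derives the column.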