Task 16401152

Name	hadam3p_anz_n9jh_2012_1_008586053_0
Workunit	8732565
Created	25 Mar 2014, 19:51:32 UTC
Sent	26 Mar 2014, 0:43:23 UTC
Report deadline	8 Mar 2015, 6:03:23 UTC
Received	19 Apr 2014, 15:22:17 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1290798
Run time	13 days 12 hours 18 min 23 sec
CPU time	12 days 19 hours 46 min 46 sec
Validate state	Workunit error - check skipped
Credit	5,974.74
Device peak FLOPS	2.59 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6184, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=2 Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6252, selfPID=5432, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6164, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6256, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3088, selfPID=1744, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6240, selfPID=5856, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... C22:34:05 (6428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:34:06 (6428): No heartbeat from core client for 30 sec - exiting 22:34:07 (6428): No heartbeat from core client for 30 sec - exiting 22:34:08 (6428): No heartbeat from core client for 30 sec - exiting 22:34:09 (6428): No heartbeat from core client for 30 sec - exiting 22:34:10 (6428): No heartbeat from core client for 30 sec - exiting 22:34:11 (6428): No heartbeat from core client for 30 sec - exiting 22:34:12 (6428): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18496, selfPID=9516, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7096, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7156, selfPID=5964, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=5928, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2344, selfPID=5876, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6212, selfPID=5840, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5840, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5500, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6000, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6092, selfPID=2936, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=952, selfPID=5704, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2316, selfPID=2364, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5532, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Apr 2014 13:57:28	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	138,539	1,107,280	7.9926
18 Apr 2014 12:22:38	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	127,019	1,018,304	8.0169
16 Apr 2014 22:12:15	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	115,499	928,240	8.0368
14 Apr 2014 04:19:39	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	103,979	835,977	8.0399
12 Apr 2014 16:01:35	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	92,459	742,198	8.0273
10 Apr 2014 18:49:32	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	80,939	647,663	8.0019
08 Apr 2014 04:15:05	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	69,419	552,855	7.9640
06 Apr 2014 12:19:57	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	57,899	461,373	7.9686
05 Apr 2014 09:25:20	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	46,379	372,206	8.0253
03 Apr 2014 03:09:42	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	34,859	280,054	8.0339
01 Apr 2014 01:54:27	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	23,339	188,350	8.0702
29 Mar 2014 09:36:08	1290798	16401152	hadam3p_anz_n9jh_2012_1_008586053_0	11,819	94,874	8.0272