Task 17534634

Name	hadam3p_anz_m625_2012_1_009270145_0
Workunit	9363061
Created	1 Dec 2014, 16:32:05 UTC
Sent	1 Dec 2014, 18:57:48 UTC
Report deadline	14 Nov 2015, 0:17:48 UTC
Received	19 Dec 2014, 15:57:34 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1295575
Run time	5 days 12 hours 48 min 8 sec
CPU time	5 days 11 hours 4 min 29 sec
Validate state	Workunit error - check skipped
Credit	5,974.74
Device peak FLOPS	3.54 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6252, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8112, selfPID=4724, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6340, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6556, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7996, selfPID=5160, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6296, selfPID=5352, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 06:28:02 (5184): No heartbeat from core client for 30 sec - exiting 06:28:03 (5184): No heartbeat from core client for 30 sec - exiting 06:28:05 (5184): No heartbeat from core client for 30 sec - exiting 06:28:06 (5184): No heartbeat from core client for 30 sec - exiting 06:28:07 (5184): No heartbeat from core client for 30 sec - exiting 06:28:08 (5184): No heartbeat from core client for 30 sec - exiting 06:28:09 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6516, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=4652, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=4608, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6316, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6460, selfPID=4432, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5124, selfPID=5080, iMonCtr=1 Model crash detected, will try to restart... 19:18:39 (2688): No heartbeat from core client for 30 sec - exiting 19:18:40 (2688): No heartbeat from core client for 30 sec - exiting 19:18:41 (2688): No heartbeat from core client for 30 sec - exiting 19:18:42 (2688): No heartbeat from core client for 30 sec - exiting 19:18:43 (2688): No heartbeat from core client for 30 sec - exiting 19:18:44 (2688): No heartbeat from core client for 30 sec - exiting 19:18:46 (2688): No heartbeat from core client for 30 sec - exiting 19:18:47 (2688): No heartbeat from core client for 30 sec - exiting 19:18:48 (2688): No heartbeat from core client for 30 sec - exiting 19:18:49 (2688): No heartbeat from core client for 30 sec - exiting 19:18:50 (2688): No heartbeat from core client for 30 sec - exiting 19:18:51 (2688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6692, selfPID=4548, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7316, selfPID=4536, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6932, selfPID=4676, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7104, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7136, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6924, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6956, selfPID=5588, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6664, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7340, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7364, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6668, selfPID=5380, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5668, selfPID=5576, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... C14:38:18 (4836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6628, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... 14:35:08 (4956): No heartbeat from core client for 30 sec - exiting 14:35:09 (4956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7232, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7288, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... 12:49:47 (5000): No heartbeat from core client for 30 sec - exiting 12:49:48 (5000): No heartbeat from core client for 30 sec - exiting 12:49:49 (5000): No heartbeat from core client for 30 sec - exiting 12:49:50 (5000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Dec 2014 14:41:48	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	138,539	471,540	3.4037
17 Dec 2014 16:07:56	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	127,019	432,161	3.4023
15 Dec 2014 19:46:15	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	115,499	392,472	3.3981
14 Dec 2014 06:09:17	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	103,979	352,897	3.3939
13 Dec 2014 11:18:29	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	92,459	313,252	3.3880
12 Dec 2014 17:36:11	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	80,939	274,061	3.3860
11 Dec 2014 15:53:35	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	69,419	234,175	3.3734
10 Dec 2014 13:53:19	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	57,899	194,621	3.3614
07 Dec 2014 13:58:58	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	46,379	155,338	3.3493
06 Dec 2014 14:02:36	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	34,859	116,723	3.3484
05 Dec 2014 17:52:20	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	23,339	78,365	3.3577
03 Dec 2014 17:40:34	1295575	17534634	hadam3p_anz_m625_2012_1_009270145_0	11,819	39,648	3.3546