Task 13700528

Name	hadam3p_saf_7lu5_2008_1_007585238_0
Workunit	7763368
Created	2 Dec 2011, 17:32:08 UTC
Sent	3 Dec 2011, 17:27:18 UTC
Report deadline	14 Nov 2012, 22:47:18 UTC
Received	7 Jan 2012, 2:57:07 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1071380
Run time	4 days 15 hours 32 min 24 sec
CPU time	4 days 15 hours 32 min 24 sec
Validate state	Workunit error - check skipped
Credit	2,244.09
Device peak FLOPS	2.65 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4752, selfPID=1792, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5200, selfPID=1960, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2 Model crash detected, will try to restart... 09:26:44 (3916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:36:23 (2528): No heartbeat from core client for 30 sec - exiting 21:36:24 (2528): No heartbeat from core client for 30 sec - exiting 21:36:25 (2528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3508, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1388, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2596, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1228, iMonCtr=2 Model crash detected, will try to restart... 09:26:36 (2572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:43:49 (3412): No heartbeat from core client for 30 sec - exiting 03:43:50 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1124, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1212, iMonCtr=2 Model crash detected, will try to restart... 01:57:09 (2908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=2 23:54:05 (3364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=936, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2032, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5864, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2800, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1996, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2004, iMonCtr=2 Model crash detected, will try to restart... 10:32:39 (2448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=140, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3528, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Jan 2012 07:32:46	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	138,336	400,859	2.8977
03 Jan 2012 15:17:48	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	126,816	367,828	2.9005
31 Dec 2011 16:58:16	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	115,296	335,458	2.9095
26 Dec 2011 15:06:21	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	103,783	303,391	2.9233
26 Dec 2011 01:53:51	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	103,776	302,899	2.9188
22 Dec 2011 17:46:22	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	92,256	270,688	2.9341
18 Dec 2011 23:30:31	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	80,744	238,071	2.9485
17 Dec 2011 19:12:49	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	80,736	237,594	2.9429
15 Dec 2011 17:54:20	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	69,216	203,600	2.9415
12 Dec 2011 14:00:00	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	57,701	170,779	2.9597
12 Dec 2011 00:00:59	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	57,696	170,291	2.9515
08 Dec 2011 10:58:47	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	46,176	136,867	2.9640
07 Dec 2011 23:09:50	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	34,659	103,149	2.9761
06 Dec 2011 17:12:41	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	34,656	102,689	2.9631
05 Dec 2011 18:42:13	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	23,136	69,630	3.0096
04 Dec 2011 18:50:03	1071380	13700528	hadam3p_saf_7lu5_2008_1_007585238_0	11,616	35,813	3.0831