Task 16338408

Name	hadam3p_eu_f1t2_2013_1_008548733_0
Workunit	8696245
Created	5 Mar 2014, 16:04:53 UTC
Sent	8 Mar 2014, 12:26:48 UTC
Report deadline	18 Feb 2015, 17:46:48 UTC
Received	20 Mar 2014, 10:38:45 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID	1316256
Run time	2 days 17 hours 7 min 50 sec
CPU time	9 hours 48 min 53 sec
Validate state	Invalid
Credit	1,392.75
Device peak FLOPS	2.07 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr	<core_client_version>6.8.44</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20092, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2068, selfPID=3428, iMonCtr=1
Model crash detected, will try to restart...
01:53:40 (4992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5160, selfPID=4856, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27988, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8164, selfPID=5192, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5572, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Glontrollerrker:: CPDN pess is nos not runnin exiting, bRetVatVal = 1, checkPID, selfPID=5132, iMonCtr=tr= Mo del crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4172, selfPID=4344, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5332, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=86560, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=225580, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=5232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2464, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5180, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2284, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=5344, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Glontrolobal: CPDN :: ocess is not running, exiting, bRetVal = 1, checkPID=0, selfPID=selfPID=5192, iMo MCtr=2 odel crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6108, selfPID=1544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6232, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8972, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1400, selfPID=6628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
</stderr_txt>
]]>
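
The Exit status field above gives the same code twice: as a signed decimal (-226) and as a 32-bit hexadecimal value (0xFFFFFF1E). The two forms are related by 32-bit two's-complement encoding. A minimal Python sketch of that conversion follows; the helper names to_unsigned32 and to_signed32 are hypothetical and not part of BOINC.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed exit code."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value represents a negative number.
    return value - 0x100000000 if value & 0x80000000 else value


exit_status = -226  # decimal form shown in the Exit status field
hex_form = f"0x{to_unsigned32(exit_status):08X}"
round_trip = to_signed32(0xFFFFFF1E)

print(hex_form, round_trip)
```

This only checks that the two representations on the page agree with each other; it says nothing about why the client raised ERR_TOO_MANY_EXITS.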

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 Mar 2014 19:32:13	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	80,736	15,263	0.1890
16 Mar 2014 06:46:45	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	69,216	155,525	2.2470
16 Mar 2014 06:46:45	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	57,696	131,824	2.2848
16 Mar 2014 06:46:45	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	46,176	110,959	2.4030
16 Mar 2014 06:46:45	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	34,656	89,961	2.5958
10 Mar 2014 18:34:43	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	23,136	65,040	2.8112
09 Mar 2014 04:34:34	1316256	16338408	hadam3p_eu_f1t2_2013_1_008548733_0	11,616	33,045	2.8448
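
The Average (sec/TS) column in the table above appears to be the CPU Time column divided by the Timestep column. That relationship is inferred from the listed numbers, not from any project documentation. The Python sketch below recomputes it for the seven rows shown; the names trickles and average_sec_per_ts are hypothetical.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table,
# newest row first.
trickles = [
    (80736, 15263, 0.1890),
    (69216, 155525, 2.2470),
    (57696, 131824, 2.2848),
    (46176, 110959, 2.4030),
    (34656, 89961, 2.5958),
    (23136, 65040, 2.8112),
    (11616, 33045, 2.8448),
]


def average_sec_per_ts(timestep: int, cpu_time_sec: int) -> float:
    """CPU seconds per model timestep, rounded to 4 places like the table."""
    return round(cpu_time_sec / timestep, 4)


recomputed = [average_sec_per_ts(ts, cpu) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, recomputed):
    print(f"{ts:>6} {cpu:>7} reported={reported:.4f} recomputed={value:.4f}")
```

The newest row reports a CPU time (15,263 sec) lower than the rows before it, which is why its average drops to 0.1890; the sketch reproduces the listed value but does not explain the reset.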