Task 16052950

Name	hadcm3n_o0vr_1940_40_008379127_2
Workunit	8529986
Created	2 Oct 2013, 2:12:18 UTC
Sent	2 Oct 2013, 6:17:32 UTC
Report deadline	1 Jan 2014, 13:44:43 UTC
Received	27 Oct 2013, 12:27:55 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1285956
Run time	7 days 10 hours 49 min 46 sec
CPU time	7 days 0 hours 2 min 27 sec
Validate state	Invalid
Credit	2,799.36
Device peak FLOPS	2.06 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:07:53 (1380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:30:12 (1328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:39:25 (3068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Oct 2013 12:43:14	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	233,280	556,620	2.3861
15 Oct 2013 12:43:14	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	207,360	494,686	2.3856
15 Oct 2013 12:43:14	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	181,440	433,746	2.3906
15 Oct 2013 12:43:14	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	155,520	367,839	2.3652
15 Oct 2013 12:43:14	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	129,600	304,349	2.3484
07 Oct 2013 23:33:15	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	103,680	247,250	2.3847
06 Oct 2013 23:28:45	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	77,760	187,976	2.4174
04 Oct 2013 02:17:55	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	51,840	128,317	2.4753
03 Oct 2013 02:18:13	1285956	16052950	hadcm3n_o0vr_1940_40_008379127_2	25,920	67,813	2.6162
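
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an assumption that happens to match every row. A minimal Python sketch that checks it against the values copied from the table (the function name `avg_sec_per_ts` is hypothetical, not part of BOINC or CPDN):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average sec/TS)
rows = [
    (233280, 556620, 2.3861),
    (207360, 494686, 2.3856),
    (181440, 433746, 2.3906),
    (155520, 367839, 2.3652),
    (129600, 304349, 2.3484),
    (103680, 247250, 2.3847),
    (77760, 187976, 2.4174),
    (51840, 128317, 2.4753),
    (25920, 67813, 2.6162),
]


def avg_sec_per_ts(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds per timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


# Print the computed value next to the reported one for each trickle.
for timestep, cpu_seconds, reported in rows:
    computed = avg_sec_per_ts(timestep, cpu_seconds)
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

Under that assumption, the computed and reported values agree to four decimal places for all nine trickles.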