Task 13116650

Name	hadcm3n_yi5n_1900_40_007356389_1
Workunit	7553819
Created	6 Jul 2011, 14:46:18 UTC
Sent	9 Jul 2011, 18:56:51 UTC
Report deadline	9 Oct 2011, 2:24:02 UTC
Received	17 Jul 2011, 15:45:50 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1123224
Run time	5 days 10 hours 45 min 20 sec
CPU time	4 days 21 hours 37 min 2 sec
Validate state	Invalid
Credit	2,799.36
Device peak FLOPS	2.60 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:48:38 (3724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:49:16 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:16:15 (2448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3144, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3144, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Jul 2011 15:42:30	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	233,280	405,944	1.7402
25 Jul 2011 15:42:30	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	207,360	360,351	1.7378
25 Jul 2011 15:42:29	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	181,440	315,885	1.7410
25 Jul 2011 15:42:29	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	155,520	270,304	1.7381
25 Jul 2011 15:42:29	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	129,600	224,571	1.7328
25 Jul 2011 15:42:29	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	103,680	179,609	1.7323
25 Jul 2011 15:42:29	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	77,760	134,559	1.7304
11 Jul 2011 03:16:16	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	51,840	89,791	1.7321
10 Jul 2011 16:14:41	1123224	13116650	hadcm3n_yi5n_1900_40_007356389_1	25,920	45,216	1.7444