Task 13372870

Name	hadcm3n_ylqc_1940_40_007453040_1
Workunit	7650543
Created	10 Sep 2011, 14:48:42 UTC
Sent	10 Sep 2011, 19:12:28 UTC
Report deadline	11 Dec 2011, 2:39:39 UTC
Received	26 Sep 2011, 15:11:26 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1065462
Run time	8 days 17 hours 24 min 9 sec
CPU time	7 days 15 hours 53 min 33 sec
Validate state	Invalid
Credit	3,732.48
Device peak FLOPS	2.32 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=216, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6792, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8312, iMonCtr=1
Model crash detected, will try to restart...
08:30:34 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:30:35 (6776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Sep 2011 18:24:00	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	311,040	624,589	2.0081
24 Sep 2011 17:13:37	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	285,120	573,058	2.0099
23 Sep 2011 13:24:10	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	259,200	522,441	2.0156
21 Sep 2011 19:53:18	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	233,280	468,905	2.0101
20 Sep 2011 01:52:26	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	207,360	414,643	1.9996
18 Sep 2011 22:40:26	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	181,440	363,211	2.0018
17 Sep 2011 20:16:49	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	155,520	311,685	2.0041
16 Sep 2011 17:52:46	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	129,600	259,505	2.0024
15 Sep 2011 14:50:44	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	103,680	208,333	2.0094
14 Sep 2011 01:24:34	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	77,760	155,989	2.0060
12 Sep 2011 22:32:38	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	51,840	102,680	1.9807
11 Sep 2011 20:18:06	1065462	13372870	hadcm3n_ylqc_1940_40_007453040_1	25,920	51,815	1.9990
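
The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below checks that reading against three rows copied from the table. The function name `avg_sec_per_timestep` is made up for illustration and is not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
trickles = [
    (311_040, 624_589, 2.0081),
    (51_840, 102_680, 1.9807),
    (25_920, 51_815, 1.9990),
]


def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = avg_sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown in the table.
    print(f"{timestep:>8}  computed={computed:.4f}  reported={reported:.4f}")
```

For the three rows used here, the recomputed values agree with the reported column to four decimals.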