Task 13681417

Name	hadcm3n_yer2_1900_40_007517819_2
Workunit	7715294
Created	1 Dec 2011, 17:25:06 UTC
Sent	1 Dec 2011, 22:12:29 UTC
Report deadline	2 Mar 2012, 5:39:40 UTC
Received	19 Dec 2011, 8:54:06 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	950229
Run time	10 days 23 hours 33 min 46 sec
CPU time	4 days 21 hours 8 min 16 sec
Validate state	Invalid
Credit	4,354.56
Device peak FLOPS	2.07 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 23:06:13 (5132): Can't acquire lockfile (32) - waiting 35s 23:06:35 (4740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 08:06:10 (4880): No heartbeat from core client for 30 sec - exiting 08:06:11 (4880): No heartbeat from core client for 30 sec - exiting 08:06:12 (4880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:06:13 (4880): No heartbeat from core client for 30 sec - exiting 08:06:14 (4880): No heartbeat from core client for 30 sec - exiting 08:06:15 (4880): No heartbeat from core client for 30 sec - exiting 08:06:16 (4880): No heartbeat from core client for 30 sec - exiting 08:06:17 (4880): No heartbeat from core client for 30 sec - exiting 08:06:18 (4880): No heartbeat from core client for 30 sec - exiting Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Model crashed: INITDUMP: Wrong no of ocean prognostic fields tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Dec 2011 04:46:46	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	362,880	408,361	1.1253
18 Dec 2011 09:47:42	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	336,960	343,836	1.0204
17 Dec 2011 15:15:20	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	311,040	784,096	2.5209
16 Dec 2011 20:29:54	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	285,120	718,642	2.5205
16 Dec 2011 01:44:43	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	259,200	653,151	2.5199
15 Dec 2011 07:03:53	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	233,280	587,595	2.5188
14 Dec 2011 12:33:53	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	207,360	522,279	2.5187
13 Dec 2011 17:38:56	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	181,440	457,086	2.5192
12 Dec 2011 23:07:19	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	155,520	391,547	2.5177
12 Dec 2011 04:37:20	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	129,600	326,089	2.5161
11 Dec 2011 10:42:15	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	103,680	260,805	2.5155
10 Dec 2011 15:47:40	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	77,760	195,626	2.5158
09 Dec 2011 20:57:57	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	51,840	130,722	2.5216
09 Dec 2011 03:31:25	950229	13681417	hadcm3n_yer2_1900_40_007517819_2	25,920	65,273	2.5182