Task 13353159

Name	hadcm3n_o255_1940_40_007444824_1
Workunit	7642327
Created	9 Sep 2011, 11:28:47 UTC
Sent	9 Sep 2011, 11:29:53 UTC
Report deadline	9 Dec 2011, 18:57:04 UTC
Received	25 Oct 2011, 15:28:05 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1166047
Run time	12 days 14 hours 38 min 46 sec
CPU time	11 days 20 hours 16 min 3 sec
Validate state	Invalid
Credit	5,909.76
Device peak FLOPS	2.46 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr	<core_client_version>6.12.35</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:39:57 (1294): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:37:00 (49853): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:48:36 (59542): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:19:14 (11387): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:08:27 (11594): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:08:28 (11594): No heartbeat from core client for 30 sec - exiting 18:08:29 (11594): No heartbeat from core client for 30 sec - exiting 18:10:34 (15285): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:51 (15379): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Oct 2011 08:16:00	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	492,480	1,006,674	2.0441
13 Oct 2011 10:16:31	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	466,560	953,542	2.0438
12 Oct 2011 09:30:52	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	440,640	900,271	2.0431
11 Oct 2011 08:30:58	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	414,720	847,287	2.0430
10 Oct 2011 07:58:35	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	388,800	794,165	2.0426
09 Oct 2011 15:48:14	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	362,880	741,105	2.0423
09 Oct 2011 00:29:14	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	336,960	687,828	2.0413
08 Oct 2011 09:19:15	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	311,040	634,596	2.0402
07 Oct 2011 08:50:51	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	285,120	581,117	2.0381
06 Oct 2011 08:33:04	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	259,200	527,818	2.0363
05 Oct 2011 05:21:26	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	233,280	474,600	2.0345
04 Oct 2011 04:27:38	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	207,360	422,101	2.0356
03 Oct 2011 03:11:47	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	181,440	369,593	2.0370
02 Oct 2011 11:12:46	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	155,520	316,586	2.0357
01 Oct 2011 20:19:58	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	129,600	263,567	2.0337
01 Oct 2011 03:53:51	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	103,680	211,030	2.0354
30 Sep 2011 02:25:03	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	77,760	158,419	2.0373
29 Sep 2011 01:42:14	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	51,840	105,596	2.0370
27 Sep 2011 06:51:40	1166047	13353159	hadcm3n_o255_1940_40_007444824_1	25,920	52,774	2.0360