Task 14283494

Name	hadcm3n_u69a_1980_40_007832165_2
Workunit	7987277
Created	18 Mar 2012, 16:15:30 UTC
Sent	18 Mar 2012, 16:15:43 UTC
Report deadline	17 Jun 2012, 23:42:54 UTC
Received	9 Apr 2012, 1:23:31 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1189010
Run time	10 days 23 hours 8 min 54 sec
CPU time	10 days 16 hours 23 min 47 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.99 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 01:59:55 (9364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:44:54 (9397): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:59:05 (9438): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:03:00 (9475): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:04:14 (9515): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:45 (9551): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:10:57 (9587): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:08:40 (9623): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:09:55 (9661): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:11:28 (9701): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:12:25 (9737): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:13:07 (24713): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:14:20 (24747): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:25:09 (24791): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:26:15 (24848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:16:14 (24900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:17:52 (25056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:19:34 (25098): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:10 (25136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:39 (25172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:57:33 (25210): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:12:55 (25331): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:38 (27332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:49 (27904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:23:10 (27938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:25:07 (27972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:29:59 (28006): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:31:10 (28320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:23 (28354): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:34:34 (28388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:08:01 (2801): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:19:02 (7037): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:27:27 (7072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:30:29 (11533): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:00 (11569): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 15 received, exiting... Called boinc_finish 03:01:17 (2562): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:32 (9644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:22 (9674): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... 02:08:10 (2370): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
09 Apr 2012 01:26:30	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	518,400	923,051	1.7806
08 Apr 2012 11:57:17	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	492,480	876,952	1.7807
07 Apr 2012 22:57:59	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	466,560	830,846	1.7808
07 Apr 2012 09:22:40	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	440,640	784,351	1.7800
06 Apr 2012 20:48:36	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	414,720	737,776	1.7790
06 Apr 2012 06:38:47	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	388,800	691,084	1.7775
05 Apr 2012 17:32:00	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	362,880	644,825	1.7770
05 Apr 2012 04:21:11	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	336,960	598,374	1.7758
04 Apr 2012 15:14:07	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	311,040	551,540	1.7732
04 Apr 2012 03:11:13	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	285,120	508,520	1.7835
03 Apr 2012 14:10:10	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	259,200	462,074	1.7827
03 Apr 2012 01:01:10	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	233,280	415,722	1.7821
02 Apr 2012 11:51:37	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	207,360	369,889	1.7838
01 Apr 2012 22:17:59	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	181,440	324,100	1.7863
01 Apr 2012 09:30:23	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	155,520	277,930	1.7871
31 Mar 2012 19:09:51	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	129,600	231,441	1.7858
31 Mar 2012 06:37:37	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	103,680	185,134	1.7856
30 Mar 2012 16:35:16	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	77,760	139,151	1.7895
30 Mar 2012 03:25:41	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	51,840	92,558	1.7855
29 Mar 2012 13:37:03	1189010	14283494	hadcm3n_u69a_1980_40_007832165_2	25,920	46,362	1.7887