Task 15480533

Name	hadcm3n_zmk9_1880_40_008249571_2
Workunit	8404695
Created	16 Dec 2012, 15:48:53 UTC
Sent	16 Dec 2012, 15:48:54 UTC
Report deadline	17 Mar 2013, 23:16:05 UTC
Received	24 Jan 2013, 19:21:43 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1253381
Run time	13 days 5 hours 39 min 46 sec
CPU time	12 days 19 hours 47 min 45 sec
Validate state	Invalid
Credit	5,909.76
Device peak FLOPS	2.19 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
Stderr	<core_client_version>6.12.43</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish 19:32:38 (3832): Can't acquire lockfile (-154) - waiting 35s 19:33:13 (3832): Can't acquire lockfile (-154) - exiting 19:33:43 (3423): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/vmorgo/projects/climateprediction.net/hadcm3n_zmk9_1880_40_008249571/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
21 Jan 2013 21:27:53	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	492,480	1,051,873	2.1359
19 Jan 2013 23:27:44	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	466,560	996,109	2.1350
18 Jan 2013 20:41:42	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	440,640	940,576	2.1346
15 Jan 2013 00:11:39	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	414,720	885,372	2.1349
13 Jan 2013 03:38:40	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	388,800	830,119	2.1351
12 Jan 2013 01:22:56	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	362,880	775,489	2.1370
10 Jan 2013 14:25:30	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	336,960	718,792	2.1332
06 Jan 2013 20:21:41	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	311,040	663,038	2.1317
05 Jan 2013 21:37:32	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	285,120	606,966	2.1288
03 Jan 2013 21:16:55	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	259,200	551,338	2.1271
02 Jan 2013 16:36:59	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	233,280	496,042	2.1264
30 Dec 2012 20:24:25	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	207,360	440,484	2.1242
29 Dec 2012 15:47:15	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	181,440	385,061	2.1222
27 Dec 2012 00:37:38	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	155,520	330,121	2.1227
25 Dec 2012 20:35:08	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	129,600	274,869	2.1209
24 Dec 2012 15:46:04	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	103,680	219,955	2.1215
23 Dec 2012 02:26:43	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	77,760	164,898	2.1206
21 Dec 2012 02:03:36	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	51,840	110,102	2.1239
18 Dec 2012 05:01:01	1253381	15480533	hadcm3n_zmk9_1880_40_008249571_2	25,920	54,843	2.1159