Task 12925810

Name	hadcm3n_o5pg_1940_40_007266329_1
Workunit	7464569
Created	2 Jun 2011, 17:11:04 UTC
Sent	2 Jun 2011, 17:11:09 UTC
Report deadline	2 Sep 2011, 0:38:20 UTC
Received	9 Aug 2011, 16:14:49 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	953015
Run time	24 days 5 hours 43 min 6 sec
CPU time	24 days 5 hours 43 min 6 sec
Validate state	Invalid
Credit	12,441.60
Device peak FLOPS	2.45 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.4.5</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:22:47 (6632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN proces21:13:39 (5964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6340, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6976, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=292, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5436, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6824, iMonCtr=1 Model crash detected, will try to restart... 19:45:28 (3868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o5pg_1940_40_007266329/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
09 Aug 2011 16:16:34	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	1,036,800	2,094,252	2.0199
09 Aug 2011 16:16:34	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	1,010,880	2,045,547	2.0235
09 Aug 2011 16:16:34	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	984,960	1,995,622	2.0261
09 Aug 2011 16:16:34	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	959,040	1,946,755	2.0299
09 Aug 2011 16:16:33	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	933,120	1,898,083	2.0341
28 Jul 2011 10:26:15	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	907,200	1,849,946	2.0392
26 Jul 2011 17:48:18	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	881,280	1,798,118	2.0403
25 Jul 2011 22:57:57	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	855,360	1,747,269	2.0427
25 Jul 2011 20:57:20	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	829,440	1,697,162	2.0462
25 Jul 2011 20:23:28	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	803,520	1,648,176	2.0512
25 Jul 2011 19:12:45	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	777,600	1,598,432	2.0556
25 Jul 2011 19:00:28	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	751,680	1,547,481	2.0587
25 Jul 2011 17:28:16	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	725,760	1,496,669	2.0622
25 Jul 2011 14:53:16	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	699,840	1,440,883	2.0589
25 Jul 2011 14:29:56	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	673,920	1,388,952	2.0610
25 Jul 2011 14:29:56	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	648,000	1,339,230	2.0667
25 Jul 2011 14:29:56	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	622,080	1,289,476	2.0728
25 Jul 2011 14:29:56	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	596,160	1,235,655	2.0727
09 Jul 2011 18:08:24	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	570,240	1,182,816	2.0742
09 Jul 2011 02:12:01	953015	12925810	hadcm3n_o5pg_1940_40_007266329_1	544,320	1,131,094	2.0780