Task 15444436

Name	hadcm3n_ze5j_1880_40_008247337_1
Workunit	8402461
Created	21 Nov 2012, 7:00:16 UTC
Sent	21 Nov 2012, 7:00:22 UTC
Report deadline	20 Feb 2013, 14:27:33 UTC
Received	25 Jan 2013, 7:06:57 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1183496
Run time	17 days 10 hours 21 min 46 sec
CPU time	17 days 3 hours 55 min 46 sec
Validate state	Valid
Credit	12,441.60
Device peak FLOPS	2.48 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 08:02:31 (4648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 07:21:27 (5312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:09:44 (2680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5316, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1 Model crash detected, will try to restart... 08:08:28 (2528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1 Model crash detected, will try to restart... 11:00:40 (436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=576, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=580, iMonCtr=1 Model crash detected, will try to restart... 07:59:31 (1076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1 Model crash detected, will try to restart... 15:55:54 (5100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=468, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... cpdnmonitor: cannot open input file dataout/ocean_restart.day after 11 attempts cpdnmonitor: cannot open input file dataout/atmos_restart.hold after 11 attempts cpdnmonitor: cannot open input file dataout/ocean_restart.day after 11 attempts OPEN: Unable to Open File dataout/ze5jka.pdb3c10 for Read/Write Model crashed: STWORK : Error opening output PP file on unit 63 tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.namelists after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/ocean_restart.day after 11 attempts 09:17:36 (5032): Can't open init data file - running in standalone mode Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.namelists after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/ocean_restart.day after 11 attempts 09:19:57 (5032): Can't open init data file - running in standalone mode Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1 Model crash detected, will try to restart... 10:52:26 (1148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 Jan 2013 15:35:39	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	1,036,800	1,482,365	1.4298
23 Jan 2013 20:57:28	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	1,010,880	1,445,404	1.4298
23 Jan 2013 07:12:56	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	984,960	1,408,521	1.4300
21 Jan 2013 15:50:51	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	959,040	1,371,828	1.4304
20 Jan 2013 14:15:37	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	933,120	1,335,436	1.4312
17 Jan 2013 14:31:08	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	907,200	1,298,424	1.4312
16 Jan 2013 13:40:29	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	881,280	1,261,517	1.4315
15 Jan 2013 12:25:06	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	855,360	1,223,793	1.4307
12 Jan 2013 13:40:15	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	829,440	1,188,003	1.4323
10 Jan 2013 09:53:50	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	803,520	1,150,851	1.4323
09 Jan 2013 08:20:08	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	777,600	1,114,080	1.4327
08 Jan 2013 07:26:42	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	751,680	1,077,402	1.4333
05 Jan 2013 19:40:40	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	725,760	1,039,694	1.4326
03 Jan 2013 11:06:26	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	699,840	1,002,206	1.4321
02 Jan 2013 09:50:20	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	673,920	964,942	1.4318
31 Dec 2012 10:53:53	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	648,000	927,761	1.4317
30 Dec 2012 10:19:51	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	622,080	890,943	1.4322
29 Dec 2012 12:15:08	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	596,160	854,315	1.4330
25 Dec 2012 20:09:41	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	570,240	816,880	1.4325
19 Dec 2012 16:03:13	1183496	15444436	hadcm3n_ze5j_1880_40_008247337_1	544,320	778,282	1.4298