Task 15486313

Name	hadcm3n_3g13_1940_40_008258732_0
Workunit	8413856
Created	20 Dec 2012, 11:57:31 UTC
Sent	20 Dec 2012, 11:57:51 UTC
Report deadline	21 Mar 2013, 19:25:02 UTC
Received	18 Mar 2013, 8:56:40 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1255589
Run time	16 days 4 hours 18 min 14 sec
CPU time	15 days 7 hours 49 min 19 sec
Validate state	Invalid
Credit	10,575.36
Device peak FLOPS	1.60 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3428, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Mar 2013 13:04:38	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	881,280	1,305,664	1.4816
14 Mar 2013 09:55:10	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	855,360	1,267,130	1.4814
12 Mar 2013 15:15:31	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	829,440	1,228,516	1.4811
11 Mar 2013 13:04:39	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	803,520	1,190,182	1.4812
08 Mar 2013 10:01:50	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	777,600	1,152,129	1.4816
06 Mar 2013 15:00:29	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	751,680	1,113,647	1.4815
05 Mar 2013 12:01:44	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	725,760	1,074,378	1.4803
01 Mar 2013 17:31:53	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	699,840	1,036,001	1.4803
28 Feb 2013 14:50:20	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	673,920	997,696	1.4804
27 Feb 2013 13:00:05	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	648,000	959,079	1.4801
26 Feb 2013 10:00:41	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	622,080	920,640	1.4799
22 Feb 2013 15:20:34	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	596,160	882,202	1.4798
21 Feb 2013 13:00:10	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	570,240	843,622	1.4794
20 Feb 2013 12:03:43	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	544,320	807,460	1.4834
19 Feb 2013 08:53:16	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	518,400	769,377	1.4841
15 Feb 2013 15:30:38	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	492,480	730,827	1.4840
14 Feb 2013 11:56:23	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	466,560	691,187	1.4815
01 Feb 2013 16:01:30	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	440,640	652,805	1.4815
31 Jan 2013 14:00:24	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	414,720	614,517	1.4818
30 Jan 2013 11:53:35	1255589	15486313	hadcm3n_3g13_1940_40_008258732_0	388,800	576,025	1.4815