Task 14033108

Name	hadcm3n_u3g3_1980_40_007743076_1
Workunit	7898184
Created	30 Jan 2012, 17:14:50 UTC
Sent	30 Jan 2012, 17:30:56 UTC
Report deadline	1 May 2012, 0:58:07 UTC
Received	12 Feb 2012, 21:21:29 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1376550
Run time	12 days 2 hours 59 min 36 sec
CPU time	11 days 5 hours 18 min 46 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	1.33 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
01:44:16 (6884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:44:17 (6884): No heartbeat from core client for 30 sec - exiting
01:44:18 (6884): No heartbeat from core client for 30 sec - exiting
01:44:19 (6884): No heartbeat from core client for 30 sec - exiting
01:44:21 (6884): No heartbeat from core client for 30 sec - exiting
01:44:22 (6884): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:11:06 (6184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7604, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>
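The Run time and CPU time fields above are given as day/hour/minute/second strings, which makes them awkward to compare with the CPU Time (sec) column in the trickle table. A minimal sketch of the conversion follows. The helper name `duration_to_seconds` and the `cpu_share` ratio are illustrative only and are not part of BOINC or the CPDN application.

```python
import re

# Seconds per unit, keyed by the singular prefix so "day" and "days" both match.
_UNIT_SECONDS = {"day": 86_400, "hour": 3_600, "min": 60, "sec": 1}


def duration_to_seconds(text):
    """Convert a string such as '12 days 2 hours 59 min 36 sec' to seconds.

    Hypothetical helper: it only understands the four units used on this page.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * _UNIT_SECONDS[unit]
    return total


# Values copied from the Run time and CPU time fields of this task.
run_time = duration_to_seconds("12 days 2 hours 59 min 36 sec")
cpu_time = duration_to_seconds("11 days 5 hours 18 min 46 sec")

# Fraction of wall-clock time during which the task was actually on the CPU.
cpu_share = cpu_time / run_time

print(run_time, cpu_time, round(cpu_share, 3))
```

With these inputs the run time is 1,047,576 seconds and the CPU time is 969,526 seconds, so the task was on the CPU for roughly 93% of its wall-clock run time.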

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
11 Feb 2012 13:29:56	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	259,200	899,768	3.4713
10 Feb 2012 14:44:42	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	233,280	821,227	3.5203
09 Feb 2012 10:53:02	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	207,360	737,047	3.5544
08 Feb 2012 11:51:31	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	181,440	640,379	3.5294
07 Feb 2012 02:24:41	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	155,520	551,344	3.5452
05 Feb 2012 19:49:10	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	129,600	460,494	3.5532
04 Feb 2012 17:50:53	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	103,680	371,350	3.5817
03 Feb 2012 13:49:21	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	77,760	278,502	3.5816
02 Feb 2012 10:13:24	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	51,840	185,362	3.5757
01 Feb 2012 07:18:15	1119324	14033108	hadcm3n_u3g3_1980_40_007743076_1	25,920	94,165	3.6329
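The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That relationship is inferred from the numbers in the table, not from any project documentation. A short sketch that recomputes the column from the rows above (the function name `sec_per_timestep` is hypothetical):

```python
# (timestep, cumulative CPU seconds, reported average) copied from the trickle table.
TRICKLES = [
    (259_200, 899_768, 3.4713),
    (233_280, 821_227, 3.5203),
    (207_360, 737_047, 3.5544),
    (181_440, 640_379, 3.5294),
    (155_520, 551_344, 3.5452),
    (129_600, 460_494, 3.5532),
    (103_680, 371_350, 3.5817),
    (77_760, 278_502, 3.5816),
    (51_840, 185_362, 3.5757),
    (25_920, 94_165, 3.6329),
]


def sec_per_timestep(timestep, cpu_seconds):
    """Average CPU seconds per model timestep, rounded to four decimals (assumed)."""
    return round(cpu_seconds / timestep, 4)


# Compare each recomputed value with the value shown in the table.
for timestep, cpu_seconds, reported in TRICKLES:
    computed = sec_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>7} {cpu_seconds:>7} {computed:.4f} {reported:.4f}")
```

Every recomputed value agrees with the reported column to four decimals, which supports reading the column as a running average over the whole task, not a per-trickle rate.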