Task 14794610

Name	hadcm3n_y9zl_1980_40_007831713_4
Workunit	7986825
Created	17 Jun 2012, 4:49:05 UTC
Sent	17 Jun 2012, 4:49:12 UTC
Report deadline	16 Sep 2012, 12:16:23 UTC
Received	6 Jul 2012, 12:00:53 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID	1220473
Run time	12 days 13 hours 53 min 53 sec
CPU time	12 days 5 hours 24 min 38 sec
Validate state	Invalid
Credit	2,799.36
Device peak FLOPS	2.82 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.25</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> 17:34:34 (9144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:33:34 (3480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8224, iMonCtr=1 Model crash detected, will try to restart... 00:03:26 (6500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=1 Model crash detected, will try to restart... 14:02:39 (6100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:01:38 (2856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... 00:25:56 (6312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5196, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5264, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Jul 2012 14:54:31	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	233,280	997,922	4.2778
03 Jul 2012 05:27:31	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	207,360	885,305	4.2694
02 Jul 2012 13:32:50	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	181,440	771,975	4.2547
30 Jun 2012 09:47:43	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	155,520	656,339	4.2203
29 Jun 2012 00:46:04	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	129,600	543,998	4.1975
27 Jun 2012 17:01:27	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	103,680	433,233	4.1786
26 Jun 2012 10:37:31	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	77,760	324,641	4.1749
19 Jun 2012 20:25:26	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	51,840	215,405	4.1552
18 Jun 2012 13:30:54	1220473	14794610	hadcm3n_y9zl_1980_40_007831713_4	25,920	104,701	4.0394