Task 15882379

Name	hadcm3n_4lvm_1980_40_008390198_2
Workunit	8541057
Created	6 Jul 2013, 4:00:36 UTC
Sent	6 Jul 2013, 6:43:11 UTC
Report deadline	5 Oct 2013, 14:10:22 UTC
Received	15 Aug 2013, 7:42:49 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1279907
Run time	10 days 15 hours 14 min 55 sec
CPU time	8 days 4 hours 2 min 41 sec
Validate state	Invalid
Credit	7,153.92
Device peak FLOPS	2.98 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> Das Gerät erkennt den Befehl nicht. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1 Model crash detected, will try to restart... 21:25:59 (5220): No heartbeat from core client for 30 sec - exiting 21:26:00 (5220): No heartbeat from core client for 30 sec - exiting 21:26:01 (5220): No heartbeat from core client for 30 sec - exiting 21:26:02 (5220): No heartbeat from core client for 30 sec - exiting 21:26:03 (5220): No heartbeat from core client for 30 sec - exiting 21:26:04 (5220): No heartbeat from core client for 30 sec - exiting 21:26:05 (5220): No heartbeat from core client for 30 sec - exiting 21:26:06 (5220): No heartbeat from core client for 30 sec - exiting 21:26:07 (5220): No heartbeat from core client for 30 sec - exiting 21:26:08 (5220): No heartbeat from core client for 30 sec - exiting 21:26:09 (5220): No heartbeat from core client for 30 sec - exiting 21:26:10 (5220): No heartbeat from core client for 30 sec - exiting 21:26:11 (5220): No heartbeat from core client for 30 sec - exiting 21:26:12 (5220): No heartbeat from core client for 30 sec - exiting 21:26:13 (5220): No heartbeat from core client for 30 sec - exiting 21:26:14 (5220): No heartbeat from core client for 30 sec - exiting 21:26:15 (5220): No heartbeat from core client for 30 sec - exiting 21:26:16 (5220): No heartbeat from core client for 30 sec - exiting 21:26:17 (5220): No heartbeat from core client for 30 sec - exiting 21:26:18 (5220): No heartbeat from core client for 30 sec - exiting 21:26:19 (5220): No heartbeat from core client for 30 sec - exiting 21:26:20 (5220): No heartbeat from core client for 30 sec - exiting 21:26:21 (5220): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6324, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8004, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 Jul 2013 14:24:11	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	596,160	735,900	1.2344
23 Jul 2013 14:24:10	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	570,240	703,117	1.2330
23 Jul 2013 14:24:10	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	544,320	671,186	1.2331
23 Jul 2013 14:24:10	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	518,400	639,539	1.2337
23 Jul 2013 14:24:10	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	492,480	607,961	1.2345
23 Jul 2013 14:24:09	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	466,560	576,231	1.2351
23 Jul 2013 14:24:09	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	440,640	544,320	1.2353
23 Jul 2013 14:24:09	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	414,720	512,559	1.2359
23 Jul 2013 14:24:09	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	388,800	480,731	1.2364
23 Jul 2013 14:24:09	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	362,880	447,968	1.2345
23 Jul 2013 14:24:08	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	336,960	416,210	1.2352
23 Jul 2013 14:24:08	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	311,040	384,000	1.2346
23 Jul 2013 14:24:06	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	285,120	351,488	1.2328
23 Jul 2013 14:24:06	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	259,200	319,690	1.2334
23 Jul 2013 14:24:05	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	233,280	286,976	1.2302
12 Jul 2013 01:47:10	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	207,360	255,038	1.2299
11 Jul 2013 16:49:00	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	181,440	223,135	1.2298
11 Jul 2013 07:55:57	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	155,520	191,459	1.2311
10 Jul 2013 23:02:23	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	129,600	159,609	1.2316
10 Jul 2013 13:41:58	1279907	15882379	hadcm3n_4lvm_1980_40_008390198_2	103,680	127,626	1.2310