climateprediction.net home page
Task 10975447

Task 10975447

Name hadsm3dhet2_jlwa_006591372_0
Workunit 6794745
Created 15 Mar 2010, 11:56:10 UTC
Sent 18 Oct 2010, 13:20:10 UTC
Report deadline 30 Sep 2011, 18:40:10 UTC
Received 21 Jan 2011, 11:40:33 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1089500
Run time 12 days 4 hours 40 min 56 sec
CPU time 9 days 14 hours 16 min 38 sec
Validate state Invalid
Credit 6,153.09
Device peak FLOPS 2.05 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5328, selfPID=5328, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5500, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5776, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
MainError:	11:40:31 PM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6112, iMonCtr=1
Model crash detected, will try to restart...
CMainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
MainError:	07:34:01 PM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
CNo heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jan 2011 19:30:34 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 151,228 826,226 1.2337
20 Jan 2011 07:11:32 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 140,426 811,963 1.2323
19 Jan 2011 18:05:59 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 129,624 798,443 1.2319
18 Jan 2011 01:01:51 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 118,822 784,959 1.2317
16 Jan 2011 22:30:55 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 108,020 771,246 1.2310
14 Jan 2011 21:22:35 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 97,218 757,648 1.2305
13 Jan 2011 20:06:53 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 86,416 744,419 1.2306
12 Jan 2011 15:20:35 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 75,614 730,700 1.2299
10 Jan 2011 23:29:58 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 64,812 716,997 1.2292
09 Jan 2011 18:39:18 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 54,010 703,647 1.2291
08 Jan 2011 14:54:15 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 43,208 690,670 1.2296
06 Jan 2011 21:00:06 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 32,406 676,951 1.2288
05 Jan 2011 20:55:22 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 21,604 663,577 1.2286
04 Jan 2011 22:11:54 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 10,802 650,295 1.2286
03 Jan 2011 19:39:29 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 259,248 636,567 1.2277
02 Jan 2011 03:06:49 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 248,446 623,123 1.2274
30 Dec 2010 21:06:49 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 237,644 609,871 1.2274
29 Dec 2010 16:19:35 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 226,842 596,689 1.2275
28 Dec 2010 15:21:17 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 216,040 583,265 1.2272
26 Dec 2010 14:32:08 1089500 10975447 hadsm3dhet2_jlwa_006591372_0 205,238 570,289 1.2278


©2024 climateprediction.net