Task 10997886

Name hadsm3dhet2_jnml_006593615_9
Workunit 6796988
Created 15 Mar 2010, 11:58:59 UTC
Sent 12 Oct 2010, 16:20:07 UTC
Report deadline 24 Sep 2011, 21:40:07 UTC
Received 11 Nov 2010, 18:07:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 888167
Run time
CPU time 6 days 6 hours 25 min 10 sec
Validate state Invalid
Credit 3,969.74
Device peak FLOPS 2.64 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
Stderr
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
MainError:	02:26:27 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7224, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
11 Nov 2010 00:50:14  888167   10997886   hadsm3dhet2_jnml_006593615_9  172,832   538,735         1.2468
10 Nov 2010 00:13:12  888167   10997886   hadsm3dhet2_jnml_006593615_9  162,030   525,349         1.2470
09 Nov 2010 20:05:52  888167   10997886   hadsm3dhet2_jnml_006593615_9  151,228   511,982         1.2473
09 Nov 2010 15:43:36  888167   10997886   hadsm3dhet2_jnml_006593615_9  140,426   498,492         1.2472
09 Nov 2010 13:34:32  888167   10997886   hadsm3dhet2_jnml_006593615_9  129,624   484,860         1.2468
06 Nov 2010 04:51:13  888167   10997886   hadsm3dhet2_jnml_006593615_9  118,822   471,313         1.2466
06 Nov 2010 00:35:38  888167   10997886   hadsm3dhet2_jnml_006593615_9  108,020   457,651         1.2461
05 Nov 2010 20:53:26  888167   10997886   hadsm3dhet2_jnml_006593615_9  97,218    444,123         1.2459
05 Nov 2010 16:31:21  888167   10997886   hadsm3dhet2_jnml_006593615_9  86,416    430,496         1.2454
04 Nov 2010 22:06:58  888167   10997886   hadsm3dhet2_jnml_006593615_9  75,614    416,762         1.2446
04 Nov 2010 17:55:37  888167   10997886   hadsm3dhet2_jnml_006593615_9  64,812    403,215         1.2443
02 Nov 2010 18:18:18  888167   10997886   hadsm3dhet2_jnml_006593615_9  54,010    389,574         1.2436
01 Nov 2010 22:33:14  888167   10997886   hadsm3dhet2_jnml_006593615_9  43,208    375,788         1.2425
01 Nov 2010 20:20:00  888167   10997886   hadsm3dhet2_jnml_006593615_9  32,406    362,304         1.2422
31 Oct 2010 21:17:18  888167   10997886   hadsm3dhet2_jnml_006593615_9  21,604    348,686         1.2415
31 Oct 2010 17:00:42  888167   10997886   hadsm3dhet2_jnml_006593615_9  10,802    335,141         1.2410
30 Oct 2010 02:56:21  888167   10997886   hadsm3dhet2_jnml_006593615_9  259,248   321,623         1.2406
29 Oct 2010 22:06:56  888167   10997886   hadsm3dhet2_jnml_006593615_9  248,446   308,127         1.2402
29 Oct 2010 17:57:59  888167   10997886   hadsm3dhet2_jnml_006593615_9  237,644   294,653         1.2399
27 Oct 2010 22:03:55  888167   10997886   hadsm3dhet2_jnml_006593615_9  226,842   281,140         1.2394
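
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative number of timesteps. The page does not state this; it is inferred from the rows above, where the Timestep counter seems to restart after reaching 259,248. The sketch below checks that reading against three rows of the table. The names `PHASE_LENGTH`, `average_sec_per_timestep` and `completed_phases` are hypothetical and introduced only for illustration.

```python
# Assumption: the Timestep counter restarts after 259,248 steps
# (inferred from the 30 Oct and 31 Oct rows, not stated on the page).
PHASE_LENGTH = 259_248


def average_sec_per_timestep(cpu_time_sec, timestep, completed_phases=0):
    """Return CPU seconds per timestep over all timesteps completed so far."""
    total_steps = completed_phases * PHASE_LENGTH + timestep
    return cpu_time_sec / total_steps


# Row sent 30 Oct 2010 02:56:21 (before the counter restarts).
print(round(average_sec_per_timestep(321_623, 259_248), 4))

# Row sent 31 Oct 2010 17:00:42 (first row after the restart).
print(round(average_sec_per_timestep(335_141, 10_802, completed_phases=1), 4))

# Row sent 11 Nov 2010 00:50:14 (last trickle before the failure).
print(round(average_sec_per_timestep(538_735, 172_832, completed_phases=1), 4))
```

Under this assumption the three computed values agree with the table's 1.2406, 1.2410 and 1.2468 to four decimal places.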

