Name | hadsm3dhet2_jnml_006593615_9 |
Workunit | 6796988 |
Created | 15 Mar 2010, 11:58:59 UTC |
Sent | 12 Oct 2010, 16:20:07 UTC |
Report deadline | 24 Sep 2011, 21:40:07 UTC |
Received | 11 Nov 2010, 18:07:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 888167 |
Run time | |
CPU time | 6 days 6 hours 25 min 10 sec |
Validate state | Invalid |
Credit | 3,969.74 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.2.19</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. MainError: 02:26:27 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7224, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) cpdnmonitor: cannot open input file O:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jnml_006593615/dataout/restart.day Model crashed: (null) Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Nov 2010 00:50:14 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 172,832 | 538,735 | 1.2468 |
10 Nov 2010 00:13:12 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 162,030 | 525,349 | 1.2470 |
09 Nov 2010 20:05:52 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 151,228 | 511,982 | 1.2473 |
09 Nov 2010 15:43:36 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 140,426 | 498,492 | 1.2472 |
09 Nov 2010 13:34:32 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 129,624 | 484,860 | 1.2468 |
06 Nov 2010 04:51:13 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 118,822 | 471,313 | 1.2466 |
06 Nov 2010 00:35:38 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 108,020 | 457,651 | 1.2461 |
05 Nov 2010 20:53:26 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 97,218 | 444,123 | 1.2459 |
05 Nov 2010 16:31:21 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 86,416 | 430,496 | 1.2454 |
04 Nov 2010 22:06:58 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 75,614 | 416,762 | 1.2446 |
04 Nov 2010 17:55:37 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 64,812 | 403,215 | 1.2443 |
02 Nov 2010 18:18:18 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 54,010 | 389,574 | 1.2436 |
01 Nov 2010 22:33:14 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 43,208 | 375,788 | 1.2425 |
01 Nov 2010 20:20:00 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 32,406 | 362,304 | 1.2422 |
31 Oct 2010 21:17:18 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 21,604 | 348,686 | 1.2415 |
31 Oct 2010 17:00:42 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 10,802 | 335,141 | 1.2410 |
30 Oct 2010 02:56:21 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 259,248 | 321,623 | 1.2406 |
29 Oct 2010 22:06:56 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 248,446 | 308,127 | 1.2402 |
29 Oct 2010 17:57:59 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 237,644 | 294,653 | 1.2399 |
27 Oct 2010 22:03:55 | 888167 | 10997886 | hadsm3dhet2_jnml_006593615_9 | 226,842 | 281,140 | 1.2394 |
©2024 cpdn.org