Name | hadsm3dhet2_jjok_006588502_4 |
Workunit | 6791875 |
Created | 15 Mar 2010, 11:50:25 UTC |
Sent | 26 Oct 2010, 16:48:59 UTC |
Report deadline | 8 Oct 2011, 22:08:59 UTC |
Received | 29 Nov 2010, 23:22:01 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 979850 |
Run time | |
CPU time | 9 days 21 hours 6 min 55 sec |
Validate state | Invalid |
Credit | 3,870.49 |
Device peak FLOPS | 1.79 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>5.10.45</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. MainError: 11:57:07 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CNo heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Nov 2010 18:40:04 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 162,030 | 851,910 | 2.0222 |
29 Nov 2010 07:28:50 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 151,228 | 830,798 | 2.0240 |
28 Nov 2010 16:14:51 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 140,426 | 809,500 | 2.0254 |
27 Nov 2010 16:51:12 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 129,624 | 788,773 | 2.0284 |
25 Nov 2010 22:37:57 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 118,822 | 768,132 | 2.0317 |
25 Nov 2010 00:42:20 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 108,020 | 747,317 | 2.0348 |
24 Nov 2010 14:29:25 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 97,218 | 726,006 | 2.0367 |
23 Nov 2010 21:18:36 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 86,416 | 705,695 | 2.0416 |
23 Nov 2010 08:48:09 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 75,614 | 685,251 | 2.0464 |
22 Nov 2010 21:49:21 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 64,812 | 664,586 | 2.0508 |
21 Nov 2010 18:31:20 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 54,010 | 644,439 | 2.0572 |
20 Nov 2010 21:41:54 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 43,208 | 623,363 | 2.0610 |
19 Nov 2010 21:45:34 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 32,406 | 602,318 | 2.0652 |
18 Nov 2010 23:58:36 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 21,604 | 581,127 | 2.0692 |
18 Nov 2010 13:16:33 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 10,802 | 559,123 | 2.0704 |
17 Nov 2010 11:59:30 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 259,248 | 532,033 | 2.0522 |
17 Nov 2010 06:01:53 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 248,446 | 511,370 | 2.0583 |
16 Nov 2010 16:03:51 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 237,644 | 490,905 | 2.0657 |
15 Nov 2010 23:13:47 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 226,842 | 469,892 | 2.0715 |
14 Nov 2010 19:34:13 | 979850 | 10946749 | hadsm3dhet2_jjok_006588502_4 | 216,040 | 448,131 | 2.0743 |
©2024 cpdn.org