Name | hadsm3dhet2_jjbb_006588025_6 |
Workunit | 6791398 |
Created | 15 Mar 2010, 11:49:34 UTC |
Sent | 28 Oct 2010, 8:42:02 UTC |
Report deadline | 10 Oct 2011, 14:02:02 UTC |
Received | 10 Mar 2011, 10:51:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Done |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 861410 |
Run time | |
CPU time | 19 days 1 hours 55 min 17 sec |
Validate state | Invalid |
Credit | 6,847.79 |
Device peak FLOPS | 1.71 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>5.10.45</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5664, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5628, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. MainError: 07:13:23 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. MainError: 11:17:22 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Model crashed: (null) Model crashed: (null) Model crashed: (null) Model crashed: (null) Model crashed: (null) Model crashed: (null) Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Mar 2011 21:19:00 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 226,842 | 1,626,173 | 2.1818 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 216,040 | 1,603,131 | 2.1825 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 205,238 | 1,579,453 | 2.1824 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 194,436 | 1,555,756 | 2.1822 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 183,634 | 1,532,688 | 2.1829 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 172,832 | 1,509,109 | 2.1829 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 162,030 | 1,484,497 | 2.1814 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 151,228 | 1,461,024 | 2.1815 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 140,426 | 1,437,817 | 2.1821 |
08 Mar 2011 11:03:53 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 129,624 | 1,413,532 | 2.1810 |
28 Feb 2011 08:05:30 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 118,822 | 1,389,766 | 2.1806 |
26 Feb 2011 07:54:00 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 108,020 | 1,366,139 | 2.1805 |
25 Feb 2011 22:43:12 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 97,218 | 1,343,701 | 2.1823 |
25 Feb 2011 00:24:09 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 86,416 | 1,319,819 | 2.1818 |
23 Feb 2011 21:40:04 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 75,614 | 1,296,510 | 2.1823 |
22 Feb 2011 08:57:52 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 64,812 | 1,272,391 | 2.1813 |
21 Feb 2011 21:24:04 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 54,010 | 1,249,169 | 2.1819 |
20 Feb 2011 03:05:25 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 43,208 | 1,226,237 | 2.1831 |
19 Feb 2011 21:59:39 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 32,406 | 1,202,968 | 2.1836 |
17 Feb 2011 21:37:39 | 861410 | 10941981 | hadsm3dhet2_jjbb_006588025_6 | 21,604 | 1,178,475 | 2.1820 |
©2024 climateprediction.net