Name | hadam3p_necg_1968_2_006167082_5 |
Workunit | 6433139 |
Created | 24 Jun 2010, 14:32:51 UTC |
Sent | 24 Jun 2010, 15:16:00 UTC |
Report deadline | 6 Jun 2011, 20:36:00 UTC |
Received | 19 Jul 2010, 4:25:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 958277 |
Run time | 4 days 11 hours 18 min 32 sec |
CPU time | 2 days 23 hours 4 min 51 sec |
Validate state | Invalid |
Credit | 997.92 |
Device peak FLOPS | 1.12 GFLOPS |
Application version | UK Met Office HADAM3P v6.14 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 10:14:01 (3636): No heartbeat from core client for 30 sec - exiting 10:14:02 (3636): No heartbeat from core client for 30 sec - exiting 10:14:03 (3636): No heartbeat from core client for 30 sec - exiting 10:14:04 (3636): No heartbeat from core client for 30 sec - exiting 10:14:05 (3636): No heartbeat from core client for 30 sec - exiting 10:14:06 (3636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4688, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 23:23:11 (1140): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Jul 2010 00:54:27 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 34,560 | 244,916 | 7.0867 |
16 Jul 2010 23:03:25 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 31,680 | 225,078 | 7.1047 |
16 Jul 2010 00:30:28 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 28,800 | 206,080 | 7.1556 |
14 Jul 2010 20:29:30 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 25,920 | 185,950 | 7.1740 |
11 Jul 2010 02:14:16 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 23,040 | 163,546 | 7.0984 |
09 Jul 2010 19:24:56 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 20,160 | 144,317 | 7.1586 |
08 Jul 2010 05:13:01 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 17,280 | 125,361 | 7.2547 |
06 Jul 2010 05:22:16 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 14,400 | 105,768 | 7.3450 |
03 Jul 2010 00:29:53 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 11,520 | 81,982 | 7.1165 |
30 Jun 2010 22:36:25 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 8,640 | 60,730 | 7.0289 |
28 Jun 2010 23:19:22 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 5,760 | 40,834 | 7.0892 |
27 Jun 2010 21:49:33 | 958277 | 11591460 | hadam3p_necg_1968_2_006167082_5 | 2,880 | 21,199 | 7.3608 |
©2024 climateprediction.net