Name | hadam3p_anz_m6ox_2013_1_009742682_0 |
Workunit | 9813680 |
Created | 9 Apr 2015, 12:09:00 UTC |
Sent | 13 Apr 2015, 0:04:24 UTC |
Report deadline | 25 Mar 2016, 5:24:24 UTC |
Received | 21 May 2015, 8:46:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1355028 |
Run time | 4 days 8 hours 53 min |
CPU time | 4 days 1 hour 10 min 23 sec |
Validate state | Invalid |
Credit | 2,993.82 |
Device peak FLOPS | 3.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>6.8.44</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=25636, selfPID=25636, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:49:40 (5392): No heartbeat from core client for 30 sec - exiting 14:49:41 (5392): No heartbeat from core client for 30 sec - exiting 14:49:42 (5392): No heartbeat from core client for 30 sec - exiting 14:49:43 (5392): No heartbeat from core client for 30 sec - exiting 14:49:44 (5392): No heartbeat from core client for 30 sec - exiting 14:49:45 (5392): No heartbeat from core client for 30 sec - exiting 14:49:46 (5392): No heartbeat from core client for 30 sec - exiting 14:49:47 (5392): No heartbeat from core client for 30 sec - exiting 14:49:48 (5392): No heartbeat from core client for 30 sec - exiting 14:49:49 (5392): No heartbeat from core client for 30 sec - exiting 14:49:50 (5392): No heartbeat from core client for 30 sec - exiting 14:49:51 (5392): No heartbeat from core client for 30 sec - exiting 14:49:52 (5392): No heartbeat from core client for 30 sec - exiting 14:49:53 (5392): No heartbeat from core client for 30 sec - exiting 14:49:54 (5392): No heartbeat from core client for 30 sec - exiting 14:49:55 (5392): No heartbeat from core client for 30 sec - exiting 14:49:56 (5392): No heartbeat from core client for 30 sec - exiting 14:49:57 (5392): No heartbeat from core client for 30 sec - exiting 14:49:58 (5392): No heartbeat from core client for 30 sec - exiting 14:49:59 (5392): No heartbeat from core client for 30 sec - exiting 14:50:00 (5392): No heartbeat from core client for 30 sec - exiting 14:50:01 (5392): No heartbeat from core 
client for 30 sec - exiting 14:50:02 (5392): No heartbeat from core client for 30 sec - exiting 14:50:03 (5392): No heartbeat from core client for 30 sec - exiting 14:50:04 (5392): No heartbeat from core client for 30 sec - exiting 14:50:05 (5392): No heartbeat from core client for 30 sec - exiting 14:50:06 (5392): No heartbeat from core client for 30 sec - exiting 14:50:07 (5392): No heartbeat from core client for 30 sec - exiting 14:50:08 (5392): No heartbeat from core client for 30 sec - exiting 14:50:09 (5392): No heartbeat from core client for 30 sec - exiting 14:50:10 (5392): No heartbeat from core client for 30 sec - exiting 14:50:11 (5392): No heartbeat from core client for 30 sec - exiting 14:50:12 (5392): No heartbeat from core client for 30 sec - exiting 14:50:13 (5392): No heartbeat from core client for 30 sec - exiting 14:50:14 (5392): No heartbeat from core client for 30 sec - exiting 14:50:15 (5392): No heartbeat from core client for 30 sec - exiting 14:50:16 (5392): No heartbeat from core client for 30 sec - exiting 14:50:17 (5392): No heartbeat from core client for 30 sec - exiting 14:50:18 (5392): No heartbeat from core client for 30 sec - exiting 14:50:19 (5392): No heartbeat from core client for 30 sec - exiting 14:50:20 (5392): No heartbeat from core client for 30 sec - exiting 14:50:21 (5392): No heartbeat from core client for 30 sec - exiting 14:50:22 (5392): No heartbeat from core client for 30 sec - exiting 14:50:23 (5392): No heartbeat from core client for 30 sec - exiting 14:50:24 (5392): No heartbeat from core client for 30 sec - exiting 14:50:25 (5392): No heartbeat from core client for 30 sec - exiting 14:50:26 (5392): No heartbeat from core client for 30 sec - exiting 14:50:27 (5392): No heartbeat from core client for 30 sec - exiting 14:50:28 (5392): No heartbeat from core client for 30 sec - exiting 14:50:29 (5392): No heartbeat from core client for 30 sec - exiting 14:50:30 (5392): No heartbeat from core client for 30 sec - exiting 
14:50:31 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1352, selfPID=1352, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9400, selfPID=9400, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2204, selfPID=2204, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6844, selfPID=6844, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3164, selfPID=3164, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6324, selfPID=6324, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 06:55:01 (3492): No heartbeat from core client for 30 sec - exiting 06:55:02 (3492): No heartbeat from core client for 30 sec - exiting 06:55:03 (3492): No heartbeat from core client for 30 sec - exiting 06:55:04 (3492): No heartbeat from core client for 30 sec - exiting 06:55:05 (3492): No heartbeat from core client for 30 sec - exiting 06:55:06 (3492): No heartbeat from core client for 30 sec - exiting 06:55:07 (3492): No heartbeat from core client for 30 sec - exiting 06:55:08 (3492): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5488, iMonCtr=1 06:55:09 (3492): No heartbeat from core client for 30 sec - exiting 06:55:10 (3492): No heartbeat from core client for 30 sec - exiting 06:55:11 (3492): No heartbeat from core client for 30 sec - exiting 06:55:12 (3492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5284, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1636, selfPID=3528, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m6ox_2013_1_009742682_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 May 2015 07:53:12 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 69,419 | 349,344 | 5.0324 |
08 May 2015 19:22:28 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 57,899 | 292,108 | 5.0451 |
29 Apr 2015 15:06:49 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 46,379 | 231,997 | 5.0022 |
25 Apr 2015 22:54:30 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 34,859 | 174,709 | 5.0119 |
19 Apr 2015 23:10:15 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 23,339 | 118,645 | 5.0836 |
14 Apr 2015 13:34:25 | 1355028 | 18296673 | hadam3p_anz_m6ox_2013_1_009742682_0 | 11,819 | 60,014 | 5.0778 |
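
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count; every row above is consistent with that reading. A minimal Python sketch, assuming that relationship holds (the names `trickles` and `average_sec_per_ts` are illustrative and not part of any CPDN or BOINC tool):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds), newest first.
trickles = [
    (69419, 349344),
    (57899, 292108),
    (46379, 231997),
    (34859, 174709),
    (23339, 118645),
    (11819, 60014),
]


def average_sec_per_ts(timestep, cpu_seconds):
    """Return CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the page computes Average (sec/TS) as CPU time / timestep.
    """
    return round(cpu_seconds / timestep, 4)


averages = [average_sec_per_ts(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>7,} timesteps  {cpu:>8,} s  {avg:.4f} s/TS")
```

The recomputed values match the table, and the cost per timestep stays close to 5 seconds across all six trickles.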
©2024 cpdn.org