Name | hadam3p_anz_f0um_2012_1_009777139_0 |
Workunit | 9833103 |
Created | 24 Apr 2015, 14:33:32 UTC |
Sent | 6 May 2015, 11:47:57 UTC |
Report deadline | 17 Apr 2016, 17:07:57 UTC |
Received | 15 May 2015, 23:26:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1330326 |
Run time | 5 days 12 hours 56 min 56 sec |
CPU time | 4 days 13 hours 4 min 5 sec |
Validate state | Invalid |
Credit | 5,477.92 |
Device peak FLOPS | 4.48 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> 09:39:23 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:10:01 (1128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:10:02 (1128): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3228, selfPID=3228, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:39:18 (6768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:28:09 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4204, selfPID=4204, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:00:56 (5848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:00:57 (5848): No heartbeat from core client for 30 sec - exiting 08:00:58 (5848): No heartbeat from core client for 30 sec - exiting 08:00:59 (5848): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6320, selfPID=6320, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 09:32:32 (7056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:07:34 (6100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1912, selfPID=1912, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:11:38 (3696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6452, selfPID=6452, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5288, selfPID=5288, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4208, selfPID=4208, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:38:47 (6612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:38:48 (6612): No heartbeat from core client for 30 sec - exiting 06:38:49 (6612): No heartbeat from core client for 30 sec - exiting 06:38:51 (6612): No heartbeat from core client for 30 sec - exiting 06:38:52 (6612): No heartbeat from core client for 30 sec - exiting 06:38:53 (6612): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:13:51 (6764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6320, selfPID=6320, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3928, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:18:47 (3184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:21:54 (2988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4580, selfPID=1456, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_f0um_2012_1_009777139_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 May 2015 11:25:24 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 127,019 | 391,245 | 3.0802 |
18 May 2015 11:23:07 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 115,499 | 355,786 | 3.0804 |
14 May 2015 14:22:31 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 103,979 | 319,816 | 3.0758 |
13 May 2015 16:49:26 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 92,459 | 283,948 | 3.0711 |
12 May 2015 21:12:11 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 80,939 | 249,065 | 3.0772 |
12 May 2015 15:00:52 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 69,419 | 213,777 | 3.0795 |
10 May 2015 10:54:30 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 57,899 | 178,339 | 3.0802 |
09 May 2015 11:02:09 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 46,379 | 142,537 | 3.0733 |
08 May 2015 19:28:17 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 34,859 | 105,998 | 3.0408 |
08 May 2015 19:26:22 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 23,339 | 71,460 | 3.0618 |
08 May 2015 19:24:37 | 1330326 | 18342559 | hadam3p_anz_f0um_2012_1_009777139_0 | 11,819 | 36,065 | 3.0514 |
©2024 cpdn.org