Name | hadam3p_anz_a1mr_2007_1_009839947_0 |
Workunit | 9892700 |
Created | 28 May 2015, 15:25:12 UTC |
Sent | 28 May 2015, 15:30:32 UTC |
Report deadline | 9 May 2016, 20:50:32 UTC |
Received | 10 Jun 2015, 11:46:19 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1361123 |
Run time | 2 days 12 hours 58 min 45 sec |
CPU time | 2 days 9 hours 41 min 59 sec |
Validate state | Invalid |
Credit | 4,484.28 |
Device peak FLOPS | 3.96 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 02:00:33 (4672): No heartbeat from core client for 30 sec - exiting 02:00:34 (4672): No heartbeat from core client for 30 sec - exiting 02:00:35 (4672): No heartbeat from core client for 30 sec - exiting 02:00:36 (4672): No heartbeat from core client for 30 sec - exiting 02:00:37 (4672): No heartbeat from core client for 30 sec - exiting 02:00:38 (4672): No heartbeat from core client for 30 sec - exiting 02:00:39 (4672): No heartbeat from core client for 30 sec - exiting 02:00:40 (4672): No heartbeat from core client for 30 sec - exiting 02:00:41 (4672): No heartbeat from core client for 30 sec - exiting 02:00:42 (4672): No heartbeat from core client for 30 sec - exiting 02:00:43 (4672): No heartbeat from core client for 30 sec - exiting 02:00:44 (4672): No heartbeat from core client for 30 sec - exiting 02:00:45 (4672): No heartbeat from core client for 30 sec - exiting 02:00:46 (4672): No heartbeat from core client for 30 sec - exiting 02:00:47 (4672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:48 (4672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6036, selfPID=6036, iMonCtr=2 02:00:35 (5676): No heartbeat from core client for 30 sec - exiting 02:00:36 (5676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6228, selfPID=6228, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4536, selfPID=4536, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 02:00:36 (5196): No heartbeat from core client for 30 sec - exiting 02:00:37 (5196): No heartbeat from core client for 30 sec - exiting 02:00:38 (5196): No heartbeat from core client for 30 sec - exiting 02:00:39 (5196): No heartbeat from core client for 30 sec - exiting 02:00:40 (5196): No heartbeat from core client for 30 sec - exiting 02:00:41 (5196): No heartbeat from core client for 30 sec - exiting 02:00:42 (5196): No heartbeat from core client for 30 sec - exiting 02:00:43 (5196): No heartbeat from core client for 30 sec - exiting 02:00:44 (5196): No heartbeat from core client for 30 sec - exiting 02:00:45 (5196): No heartbeat from core client for 30 sec - exiting 02:00:46 (5196): No heartbeat from core client for 30 sec - exiting 02:00:47 (5196): No heartbeat from core client for 30 sec - exiting 02:00:48 (5196): No heartbeat from core client for 30 sec - exiting 02:00:49 (5196): No heartbeat from core client for 30 sec - exiting 02:00:50 (5196): No heartbeat from core client for 30 sec - exiting 02:00:51 (5196): No heartbeat from core client for 30 sec - exiting 02:00:52 (5196): No heartbeat from core client for 30 sec - exiting 02:00:53 (5196): No heartbeat from core client for 30 sec - exiting 02:00:54 (5196): No heartbeat from core client for 30 sec - exiting 02:00:55 (5196): No heartbeat from core client for 30 sec - exiting 02:00:56 (5196): No heartbeat from core client for 30 sec - exiting 02:00:57 (5196): No heartbeat from core client for 30 sec - exiting 02:00:58 (5196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 02:00:35 (6864): No heartbeat from core client for 30 sec - exiting 02:00:36 (6864): No heartbeat from core client for 30 sec - exiting 02:00:37 (6864): No heartbeat from core client for 30 sec - exiting 02:00:38 (6864): No heartbeat from core client for 30 sec - exiting 02:00:39 (6864): No heartbeat from core client for 30 sec - exiting 02:00:40 (6864): No heartbeat from core client for 30 sec - exiting 02:00:41 (6864): No heartbeat from core client for 30 sec - exiting 02:00:42 (6864): No heartbeat from core client for 30 sec - exiting 02:00:43 (6864): No heartbeat from core client for 30 sec - exiting 02:00:44 (6864): No heartbeat from core client for 30 sec - exiting 02:00:45 (6864): No heartbeat from core client for 30 sec - exiting 02:00:46 (6864): No heartbeat from core client for 30 sec - exiting 02:00:47 (6864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:40:19 (5092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:40:21 (5092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5220, selfPID=5220, iMonCtr=2 02:00:35 (752): No heartbeat from core client for 30 sec - exiting 02:00:36 (752): No heartbeat from core client for 30 sec - exiting 02:00:37 (752): No heartbeat from core client for 30 sec - exiting 02:00:38 (752): No heartbeat from core client for 30 sec - exiting 02:00:39 (752): No heartbeat from core client for 30 sec - exiting 02:00:40 (752): No heartbeat from core client for 30 sec - exiting 02:00:41 (752): No heartbeat from core client for 30 sec - exiting 02:00:42 (752): No heartbeat from core client for 30 sec - exiting 02:00:43 (752): No heartbeat from core client for 30 sec - exiting 02:00:44 (752): No heartbeat from core client for 30 sec - exiting 02:00:45 (752): No heartbeat from core client for 30 sec - exiting 02:00:46 (752): No heartbeat from core client for 30 sec - exiting 02:00:47 (752): No heartbeat from core client for 30 sec - exiting 02:00:48 (752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6452, selfPID=6452, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:00:35 (2756): No heartbeat from core client for 30 sec - exiting 02:00:36 (2756): No heartbeat from core client for 30 sec - exiting 02:00:37 (2756): No heartbeat from core client for 30 sec - exiting 02:00:38 (2756): No heartbeat from core client for 30 sec - exiting 02:00:39 (2756): No heartbeat from core client for 30 sec - exiting 02:00:40 (2756): No heartbeat from core client for 30 sec - exiting 02:00:41 (2756): No heartbeat from core client for 30 sec - exiting 02:00:42 (2756): No heartbeat from core client for 30 sec - exiting 02:00:43 (2756): No heartbeat from core client for 30 sec - exiting 02:00:44 (2756): No heartbeat from core client for 30 sec - exiting 02:00:45 (2756): No heartbeat from core client for 30 sec - exiting 02:00:46 (2756): No heartbeat from core client for 30 sec - exiting 02:00:47 (2756): No heartbeat from core client for 30 sec - exiting 02:00:48 (2756): No heartbeat from core client for 30 sec - exiting 02:00:49 (2756): No heartbeat from core client for 30 sec - exiting 02:00:50 (2756): No heartbeat from core client for 30 sec - exiting 02:00:51 (2756): No heartbeat from core client for 30 sec - exiting 02:00:52 (2756): No heartbeat from core client for 30 sec - exiting 02:00:53 (2756): No heartbeat from core client for 30 sec - exiting 02:00:54 (2756): No heartbeat from core client for 30 sec - exiting 02:00:55 (2756): No heartbeat from core client for 30 sec - exiting 02:00:56 (2756): No heartbeat from core client for 30 sec - exiting 02:00:57 (2756): No heartbeat from core client for 30 sec - exiting 02:00:58 (2756): No heartbeat from core client for 30 sec - exiting 02:00:59 (2756): No heartbeat from core client for 30 sec - exiting 02:01:00 (2756): No heartbeat from core client for 30 sec - exiting 02:01:01 (2756): No heartbeat from core client for 30 sec - exiting 02:01:02 (2756): No heartbeat from core client for 30 sec - exiting 02:01:03 (2756): No heartbeat from core client for 30 sec - exiting 02:01:04 (2756): No heartbeat from core client for 30 sec - exiting 02:01:05 (2756): No heartbeat from core client for 30 sec - exiting 02:01:06 (2756): No heartbeat from core client for 30 sec - exiting 02:01:07 (2756): No heartbeat from core client for 30 sec - exiting 02:01:08 (2756): No heartbeat from core client for 30 sec - exiting 02:01:09 (2756): No heartbeat from core client for 30 sec - exiting 02:01:10 (2756): No heartbeat from core client for 30 sec - exiting 02:01:11 (2756): No heartbeat from core client for 30 sec - exiting 02:01:12 (2756): No heartbeat from core client for 30 sec - exiting 02:01:13 (2756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:01:14 (2756): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:00:36 (4868): No heartbeat from core client for 30 sec - exiting 02:00:37 (4868): No heartbeat from core client for 30 sec - exiting 02:00:38 (4868): No heartbeat from core client for 30 sec - exiting 02:00:39 (4868): No heartbeat from core client for 30 sec - exiting 02:00:40 (4868): No heartbeat from core client for 30 sec - exiting 02:00:41 (4868): No heartbeat from core client for 30 sec - exiting 02:00:42 (4868): No heartbeat from core client for 30 sec - exiting 02:00:43 (4868): No heartbeat from core client for 30 sec - exiting 02:00:44 (4868): No heartbeat from core client for 30 sec - exiting 02:00:45 (4868): No heartbeat from core client for 30 sec - exiting 02:00:46 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6896, selfPID=6896, iMonCtr=2 04:13:33 (6580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7012, selfPID=7012, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6600, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1944, selfPID=268, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_a1mr_2007_1_009839947_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1mr_2007_1_009839947_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1mr_2007_1_009839947_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Jun 2015 12:54:35 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 103,979 | 198,744 | 1.9114 |
08 Jun 2015 09:58:15 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 92,459 | 176,759 | 1.9118 |
06 Jun 2015 13:51:25 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 80,939 | 154,476 | 1.9085 |
05 Jun 2015 12:05:40 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 69,419 | 132,255 | 1.9052 |
03 Jun 2015 12:06:20 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 57,899 | 110,334 | 1.9056 |
03 Jun 2015 11:04:34 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 46,379 | 88,349 | 1.9049 |
01 Jun 2015 11:59:56 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 34,859 | 66,506 | 1.9079 |
31 May 2015 12:22:55 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 23,339 | 44,573 | 1.9098 |
30 May 2015 12:01:01 | 1361123 | 18496455 | hadam3p_anz_a1mr_2007_1_009839947_0 | 11,819 | 22,629 | 1.9146 |
©2024 cpdn.org