Name | hadam3p_anz_rm41_2012_1_008955236_1 |
Workunit | 9099411 |
Created | 2 Sep 2014, 2:40:12 UTC |
Sent | 2 Sep 2014, 2:56:20 UTC |
Report deadline | 15 Aug 2015, 8:16:20 UTC |
Received | 19 Oct 2014, 8:51:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1327695 |
Run time | 5 days 20 hours 54 min 36 sec |
CPU time | 5 days 7 hours 41 min 48 sec |
Validate state | Invalid |
Credit | 2,993.82 |
Device peak FLOPS | 3.10 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5100, selfPID=5100, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:19:05 (8172): No heartbeat from core client for 30 sec - exiting 17:19:06 (8172): No heartbeat from core client for 30 sec - exiting 17:19:07 (8172): No heartbeat from core client for 30 sec - exiting 17:19:08 (8172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 00:36:23 (7612): No heartbeat from core client for 30 sec - exiting 00:36:25 (7612): No heartbeat from core client for 30 sec - exiting 00:36:26 (7612): No heartbeat from core client for 30 sec - exiting 00:36:27 (7612): No heartbeat from core client for 30 sec - exiting 00:36:28 (7612): No heartbeat from core client for 30 sec - exiting 00:36:29 (7612): No heartbeat from core client for 30 sec - exiting 00:36:30 (7612): No heartbeat from core client for 30 sec - exiting 00:36:31 (7612): No heartbeat from core client for 30 sec - exiting 00:36:32 (7612): No heartbeat from core client for 30 sec - exiting 00:36:33 (7612): No heartbeat from core client for 30 sec - exiting 00:36:34 (7612): No heartbeat from core client for 30 sec - exiting 00:36:35 (7612): No heartbeat from core client for 30 sec - exiting 00:36:36 (7612): No heartbeat from core client for 30 sec - exiting 00:36:37 (7612): No heartbeat from core client for 30 sec - exiting 00:36:38 (7612): No heartbeat from core client for 30 sec - exiting 00:36:39 (7612): No heartbeat from core client for 30 sec - exiting 00:36:40 (7612): No heartbeat from core client for 30 sec - exiting 00:36:41 (7612): No heartbeat from core client for 30 sec - exiting 00:36:42 (7612): No heartbeat from core client for 30 sec - exiting 00:36:43 (7612): No heartbeat from core client for 30 sec - exiting 00:36:44 (7612): No heartbeat from core client for 30 sec - exiting 00:36:45 (7612): No heartbeat from core client for 30 sec - exiting 00:36:46 (7612): No heartbeat from core client for 30 sec - exiting 00:36:47 (7612): No heartbeat from core client for 30 sec - exiting 00:36:48 (7612): No heartbeat from core client for 30 sec - exiting 00:36:49 (7612): No heartbeat from core client for 30 sec - exiting 00:36:50 (7612): No heartbeat from core client for 30 sec - exiting 00:36:51 (7612): No heartbeat from core client for 30 sec - exiting 00:36:52 (7612): No heartbeat from core client for 30 sec - exiting 00:36:53 (7612): No heartbeat from core client for 30 sec - exiting 00:36:54 (7612): No heartbeat from core client for 30 sec - exiting 00:36:55 (7612): No heartbeat from core client for 30 sec - exiting 00:36:56 (7612): No heartbeat from core client for 30 sec - exiting 00:36:57 (7612): No heartbeat from core client for 30 sec - exiting 00:36:58 (7612): No heartbeat from core client for 30 sec - exiting 00:36:59 (7612): No heartbeat from core client for 30 sec - exiting 00:37:00 (7612): No heartbeat from core client for 30 sec - exiting 00:37:01 (7612): No heartbeat from core client for 30 sec - exiting 00:37:02 (7612): No heartbeat from core client for 30 sec - exiting 00:37:03 (7612): No heartbeat from core client for 30 sec - exiting 00:37:04 (7612): No heartbeat from core client for 30 sec - exiting 00:37:05 (7612): No heartbeat from core client for 30 sec - exiting 00:37:06 (7612): No heartbeat from core client for 30 sec - exiting 00:37:07 (7612): No heartbeat from core client for 30 sec - exiting 00:37:08 (7612): No heartbeat from core client for 30 sec - exiting 00:37:09 (7612): No heartbeat from core client for 30 sec - exiting 00:37:10 (7612): No heartbeat from core client for 30 sec - exiting 00:37:11 (7612): No heartbeat from core client for 30 sec - exiting 00:37:12 (7612): No heartbeat from core client for 30 sec - exiting 00:37:13 (7612): No heartbeat from core client for 30 sec - exiting 00:37:14 (7612): No heartbeat from core client for 30 sec - exiting 00:37:15 (7612): No heartbeat from core client for 30 sec - exiting 00:37:16 (7612): No heartbeat from core client for 30 sec - exiting 00:37:17 (7612): No heartbeat from core client for 30 sec - exiting 00:37:18 (7612): No heartbeat from core client for 30 sec - exiting 00:37:19 (7612): No heartbeat from core client for 30 sec - exiting 00:37:20 (7612): No heartbeat from core client for 30 sec - exiting 00:37:21 (7612): No heartbeat from core client for 30 sec - exiting 00:37:22 (7612): No heartbeat from core client for 30 sec - exiting 00:37:23 (7612): No heartbeat from core client for 30 sec - exiting 00:37:24 (7612): No heartbeat from core client for 30 sec - exiting 00:37:25 (7612): No heartbeat from core client for 30 sec - exiting 00:37:26 (7612): No heartbeat from core client for 30 sec - exiting 00:37:27 (7612): No heartbeat from core client for 30 sec - exiting 00:37:28 (7612): No heartbeat from core client for 30 sec - exiting 00:37:29 (7612): No heartbeat from core client for 30 sec - exiting 00:37:30 (7612): No heartbeat from core client for 30 sec - exiting 00:37:31 (7612): No heartbeat from core client for 30 sec - exiting 00:37:32 (7612): No heartbeat from core client for 30 sec - exiting 00:37:33 (7612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6300, selfPID=6300, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:40:51 (7312): No heartbeat from core client for 30 sec - exiting 10:40:53 (7312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6148, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7084, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... GCPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 10:38:40 (9424): No heartbeat from core client for 30 sec - exiting 10:38:41 (9424): No heartbeat from core client for 30 sec - exiting 10:38:42 (9424): No heartbeat from core client for 30 sec - exiting 10:38:43 (9424): No heartbeat from core client for 30 sec - exiting 10:38:44 (9424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1732, selfPID=1732, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8160, selfPID=8160, iMonCtr=2 00:20:42 (5028): No heartbeat from core client for 30 sec - exiting 00:20:43 (5028): No heartbeat from core client for 30 sec - exiting 00:20:44 (5028): No heartbeat from core client for 30 sec - exiting 00:20:45 (5028): No heartbeat from core client for 30 sec - exiting 00:20:46 (5028): No heartbeat from core client for 30 sec - exiting 00:20:47 (5028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=8136, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1236, selfPID=9704, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:18:14 (7740): No heartbeat from core client for 30 sec - exiting 14:18:15 (7740): No heartbeat from core client for 30 sec - exiting 14:18:16 (7740): No heartbeat from core client for 30 sec - exiting 14:18:17 (7740): No heartbeat from core client for 30 sec - exiting 14:18:18 (7740): No heartbeat from core client for 30 sec - exiting 14:18:19 (7740): No heartbeat from core client for 30 sec - exiting 14:18:20 (7740): No heartbeat from core client for 30 sec - exiting 14:18:21 (7740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:11:29 (6792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=9528, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8792, selfPID=8224, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rm41_2012_1_008955236_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
19 Oct 2014 08:55:25 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 69,419 | 453,244 | 6.5291 |
30 Sep 2014 23:33:57 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 57,899 | 385,585 | 6.6596 |
22 Sep 2014 08:33:11 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 46,379 | 320,642 | 6.9135 |
16 Sep 2014 07:13:42 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 34,859 | 252,795 | 7.2519 |
16 Sep 2014 07:13:42 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 34,859 | 252,795 | 7.2519 |
09 Sep 2014 17:33:19 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 23,339 | 179,734 | 7.7010 |
07 Sep 2014 11:07:40 | 1327695 | 16969901 | hadam3p_anz_rm41_2012_1_008955236_1 | 11,819 | 88,005 | 7.4461 |
©2024 cpdn.org