Name | hadam3p_eu_aexw_2013_1_008682735_1 |
Workunit | 8817209 |
Created | 23 Apr 2014, 7:53:20 UTC |
Sent | 23 Apr 2014, 8:17:38 UTC |
Report deadline | 5 Apr 2015, 13:37:38 UTC |
Received | 7 May 2014, 16:10:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1299063 |
Run time | 3 days 5 hours 29 min 48 sec |
CPU time | 2 days 23 hours 20 min 13 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 2.24 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:40:23 (11172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:35 (6652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:43:22 (4084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:46:57 (11960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5796, selfPID=5796, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 10:54:38 (11108): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:22:26 (11320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:04:20 (10336): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:08:09 (13588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:09:08 (16148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:09:55 (13220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:10:30 (9056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:40:36 (14840): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 05:31:38 (13652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:34:26 (1804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:58:53 (15896): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:08:58 (10156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:32:49 (5848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:56:44 (16324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:59:17 (10732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:14:12 (8824): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:41:37 (12844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:44:04 (21572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... 22:23:49 (13760): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10456, selfPID=10456, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10456, selfPID=13600, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_aexw_2013_1_008682735_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_aexw_2013_1_008682735_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_aexw_2013_1_008682735_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_aexw_2013_1_008682735_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
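
The Stderr field above is a single unbroken run of text, so the number of suspend requests, lost heartbeats, and failed uploads is hard to read off directly. The following is a minimal sketch of one way to tally them, not a tool used by BOINC or CPDN. The excerpt is stitched together from two places in the field (the start of the log and the first upload failure), and the function name `summarize_stderr` and the regular expressions are assumptions made for illustration.

```python
import re

# Short excerpt copied from the Stderr field above (start of the log plus the
# first <file_xfer_error>); the full text can be passed in the same way.
STDERR_EXCERPT = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "10:40:23 (11172): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "10:42:35 (6652): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "<file_xfer_error> "
    "<file_name>hadam3p_eu_aexw_2013_1_008682735_1_9.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error>"
)

# Hypothetical patterns, written to match the message shapes seen in this log.
HEARTBEAT_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)
SUSPEND_RE = re.compile(r"Suspend request from BOINC")
UPLOAD_ERR_RE = re.compile(
    r"<file_name>(.*?)</file_name>\s*<error_code>(-?\d+)</error_code>"
)


def summarize_stderr(text):
    """Count suspend requests and list heartbeat losses and upload errors."""
    return {
        "suspend_requests": len(SUSPEND_RE.findall(text)),
        # (wall-clock time, process id) for each lost heartbeat
        "heartbeat_losses": [
            (time, int(pid)) for time, pid in HEARTBEAT_RE.findall(text)
        ],
        # (file name, error code) for each failed upload
        "upload_errors": [
            (name, int(code)) for name, code in UPLOAD_ERR_RE.findall(text)
        ],
    }


summary = summarize_stderr(STDERR_EXCERPT)
print(summary)
```

Run against the complete Stderr text, the same function would list every heartbeat loss and all four failed uploads reported in the `<message>` block.
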
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
06 May 2014 23:29:13 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 92,256 | 233,958 | 2.5360 |
06 May 2014 06:20:51 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 80,736 | 204,944 | 2.5384 |
03 May 2014 21:48:29 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 69,216 | 176,060 | 2.5436 |
03 May 2014 01:31:02 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 57,696 | 147,225 | 2.5517 |
02 May 2014 11:14:36 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 46,176 | 117,474 | 2.5440 |
01 May 2014 08:17:24 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 34,656 | 87,821 | 2.5341 |
30 Apr 2014 10:27:42 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 23,136 | 58,783 | 2.5408 |
28 Apr 2014 13:43:03 | 1299063 | 16579975 | hadam3p_eu_aexw_2013_1_008682735_1 | 11,616 | 29,655 | 2.5529 |
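
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the rows above; the values are copied from the table, while the names `TRICKLES` and `seconds_per_timestep` are assumptions made for illustration, not part of any CPDN tool.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table,
# newest first, as listed on the page.
TRICKLES = [
    (92256, 233958, 2.5360),
    (80736, 204944, 2.5384),
    (69216, 176060, 2.5436),
    (57696, 147225, 2.5517),
    (46176, 117474, 2.5440),
    (34656, 87821, 2.5341),
    (23136, 58783, 2.5408),
    (11616, 29655, 2.5529),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


# Recompute the average for every row and compare with the reported value.
for timestep, cpu_time, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time, timestep)
    print(timestep, computed, reported)

# Timestep gap between consecutive trickles (newer minus older).
gaps = [newer[0] - older[0] for newer, older in zip(TRICKLES, TRICKLES[1:])]
print(gaps)
```

Under this reading, every recomputed average agrees with the reported column to four decimal places, and consecutive trickles are a constant 11,520 timesteps apart.
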
©2024 cpdn.org