Name | hadam3p_pnw_c2ps_1976_1_008027196_0 |
Workunit | 8182310 |
Created | 4 Jul 2012, 14:47:05 UTC |
Sent | 4 Jul 2012, 14:47:13 UTC |
Report deadline | 16 Jun 2013, 20:07:13 UTC |
Received | 17 Aug 2012, 10:48:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1109404 |
Run time | 3 days 0 hours 22 min 32 sec |
CPU time | 2 days 13 hours 9 min 34 sec |
Validate state | Invalid |
Credit | 1,003.35 |
Device peak FLOPS | 1.90 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2788, selfPID=2788, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3044, selfPID=3044, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2936, selfPID=2936, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 19:45:34 (1176): No heartbeat from core client for 30 sec - exiting 19:45:35 (1176): No heartbeat from core client for 30 sec - exiting 19:45:36 (1176): No heartbeat from core client for 30 sec - exiting 19:45:37 (1176): No heartbeat from core client for 30 sec - exiting 19:45:38 (1176): No heartbeat from core client for 30 sec - exiting 19:45:39 (1176): No heartbeat from core client for 30 sec - exiting 19:45:40 (1176): No heartbeat from core client for 30 sec - exiting 19:45:41 (1176): No heartbeat from core client for 30 sec - exiting 19:45:42 (1176): No heartbeat from core client for 30 sec - exiting 19:45:43 (1176): No heartbeat from core client for 30 sec - exiting 19:45:44 (1176): No heartbeat from core client for 30 sec - exiting 19:45:45 (1176): No heartbeat from core client for 30 sec - exiting 19:45:46 (1176): No heartbeat from core client for 30 sec - exiting 19:45:47 (1176): No heartbeat from core client for 30 sec - exiting 19:45:48 (1176): No heartbeat from core client for 30 sec - exiting 19:45:49 (1176): No heartbeat from core client for 30 sec - exiting 19:45:50 (1176): No heartbeat from core client for 30 sec - exiting 19:45:51 (1176): No heartbeat from core client for 30 sec - exiting 19:45:52 (1176): No heartbeat from core client for 30 sec - exiting 19:45:53 (1176): No heartbeat from core client for 30 sec - exiting 19:45:54 (1176): No heartbeat from core client for 30 sec - exiting 19:45:55 (1176): No heartbeat from core client for 30 sec - exiting 19:45:56 (1176): No heartbeat from core client for 30 sec - exiting 19:45:57 (1176): No heartbeat from core client for 30 sec - exiting 19:45:58 (1176): No heartbeat from core client for 30 sec - exiting 19:45:59 (1176): No heartbeat from core client for 30 sec - exiting 19:46:00 (1176): No heartbeat from core client for 30 sec - exiting 19:46:01 (1176): No heartbeat from core client for 30 sec - exiting 19:46:02 (1176): No heartbeat from core client for 30 sec - exiting 19:46:03 (1176): No heartbeat from core client for 30 sec - exiting 19:46:04 (1176): No heartbeat from core client for 30 sec - exiting 19:46:05 (1176): No heartbeat from core client for 30 sec - exiting 19:46:06 (1176): No heartbeat from core client for 30 sec - exiting 19:46:07 (1176): No heartbeat from core client for 30 sec - exiting 19:46:08 (1176): No heartbeat from core client for 30 sec - exiting 19:46:09 (1176): No heartbeat from core client for 30 sec - exiting 19:46:10 (1176): No heartbeat from core client for 30 sec - exiting 19:46:11 (1176): No heartbeat from core client for 30 sec - exiting 19:46:12 (1176): No heartbeat from core client for 30 sec - exiting 19:46:13 (1176): No heartbeat from core client for 30 sec - exiting 19:46:14 (1176): No heartbeat from core client for 30 sec - exiting 19:46:15 (1176): No heartbeat from core client for 30 sec - exiting 19:46:16 (1176): No heartbeat from core client for 30 sec - exiting 19:46:17 (1176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2788, selfPID=804, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 3 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1140, selfPID=3828, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_c2ps_1976_1_008027196_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Aug 2012 15:18:02 | 1109404 | 14861309 | hadam3p_pnw_c2ps_1976_1_008027196_0 | 46,176 | 178,948 | 3.8753 |
03 Aug 2012 12:01:56 | 1109404 | 14861309 | hadam3p_pnw_c2ps_1976_1_008027196_0 | 34,661 | 134,293 | 3.8745 |
03 Aug 2012 10:56:46 | 1109404 | 14861309 | hadam3p_pnw_c2ps_1976_1_008027196_0 | 34,656 | 133,789 | 3.8605 |
28 Jul 2012 14:25:10 | 1109404 | 14861309 | hadam3p_pnw_c2ps_1976_1_008027196_0 | 23,136 | 89,723 | 3.8781 |
08 Jul 2012 17:59:46 | 1109404 | 14861309 | hadam3p_pnw_c2ps_1976_1_008027196_0 | 11,616 | 46,135 | 3.9717 |
©2024 cpdn.org