Name | hadam3p_eu_y19k_1967_1_007053504_1 |
Workunit | 7256820 |
Created | 11 May 2012, 9:47:04 UTC |
Sent | 11 May 2012, 9:47:11 UTC |
Report deadline | 23 Apr 2013, 15:07:11 UTC |
Received | 14 Jun 2012, 9:25:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1016016 |
Run time | 5 days 11 hours 16 min 52 sec |
CPU time | 4 days 5 hours 57 min 16 sec |
Validate state | Invalid |
Credit | 1,392.75 |
Device peak FLOPS | 1.46 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5504, selfPID=5504, iMonCtr=2 21:42:20 (3732): No heartbeat from core client for 30 sec - exiting 21:42:21 (3732): No heartbeat from core client for 30 sec - exiting 21:42:23 (3732): No heartbeat from core client for 30 sec - exiting 21:42:24 (3732): No heartbeat from core client for 30 sec - exiting 21:42:25 (3732): No heartbeat from core client for 30 sec - exiting 21:42:26 (3732): No heartbeat from core client for 30 sec - exiting 21:42:27 (3732): No heartbeat from core client for 30 sec - exiting 21:42:28 (3732): No heartbeat from core client for 30 sec - exiting 21:42:29 (3732): No heartbeat from core client for 30 sec - exiting 21:42:30 (3732): No heartbeat from core client for 30 sec - exiting 21:42:31 (3732): No heartbeat from core client for 30 sec - exiting 21:42:32 (3732): No heartbeat from core client for 30 sec - exiting 21:42:33 (3732): No heartbeat from core client for 30 sec - exiting 21:42:35 (3732): No heartbeat from core client for 30 sec - exiting 21:42:36 (3732): No heartbeat from core client for 30 sec - exiting 21:42:37 (3732): No heartbeat from core client for 30 sec - exiting 21:42:38 (3732): No heartbeat from core client for 30 sec - exiting 21:42:39 (3732): No heartbeat from core client for 30 sec - exiting 21:42:40 (3732): No heartbeat from core client for 30 sec - exiting 21:42:41 (3732): No heartbeat from core client for 30 sec - exiting 21:42:42 (3732): No heartbeat from core client for 30 sec - exiting 21:42:43 (3732): No heartbeat from core client for 30 sec - exiting 21:42:44 (3732): No heartbeat from core client for 30 sec - exiting 21:42:45 (3732): No heartbeat from core client for 30 sec - exiting 21:42:47 (3732): No heartbeat from core client for 30 sec - exiting 21:42:48 (3732): No heartbeat from core client for 30 sec - 
exiting 21:42:49 (3732): No heartbeat from core client for 30 sec - exiting 21:42:50 (3732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:41:41 (7564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:41:42 (7564): No heartbeat from core client for 30 sec - exiting 01:41:43 (7564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10784, selfPID=10784, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10208, selfPID=10208, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8800, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7596, selfPID=7596, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
22:35:06 (2492): No heartbeat from core client for 30 sec - exiting 22:35:07 (2492): No heartbeat from core client for 30 sec - exiting 22:35:08 (2492): No heartbeat from core client for 30 sec - exiting 22:35:09 (2492): No heartbeat from core client for 30 sec - exiting 22:35:10 (2492): No heartbeat from core client for 30 sec - exiting 22:35:11 (2492): No heartbeat from core client for 30 sec - exiting 22:35:12 (2492): No heartbeat from core client for 30 sec - exiting 22:35:13 (2492): No heartbeat from core client for 30 sec - exiting 22:35:14 (2492): No heartbeat from core client for 30 sec - exiting 22:35:16 (2492): No heartbeat from core client for 30 sec - exiting 22:35:17 (2492): No heartbeat from core client for 30 sec - exiting 22:35:18 (2492): No heartbeat from core client for 30 sec - exiting 22:35:19 (2492): No heartbeat from core client for 30 sec - exiting 22:35:20 (2492): No heartbeat from core client for 30 sec - exiting 22:35:21 (2492): No heartbeat from core client for 30 sec - exiting 22:35:22 (2492): No heartbeat from core client for 30 sec - exiting 22:35:23 (2492): No heartbeat from core client for 30 sec - exiting 22:35:24 (2492): No heartbeat from core client for 30 sec - exiting 22:35:25 (2492): No heartbeat from core client for 30 sec - exiting 22:35:26 (2492): No heartbeat from core client for 30 sec - exiting 22:35:27 (2492): No heartbeat from core client for 30 sec - exiting 22:35:28 (2492): No heartbeat from core client for 30 sec - exiting 22:35:29 (2492): No heartbeat from core client for 30 sec - exiting 22:35:30 (2492): No heartbeat from core client for 30 sec - exiting 22:35:31 (2492): No heartbeat from core client for 30 sec - exiting 22:35:32 (2492): No heartbeat from core client for 30 sec - exiting 22:35:33 (2492): No heartbeat from core client for 30 sec - exiting 22:35:34 (2492): No heartbeat from core client for 30 sec - exiting 22:35:35 (2492): No heartbeat from core client for 30 sec - exiting 22:35:36 (2492): No 
heartbeat from core client for 30 sec - exiting 22:35:37 (2492): No heartbeat from core client for 30 sec - exiting 22:35:38 (2492): No heartbeat from core client for 30 sec - exiting 22:35:39 (2492): No heartbeat from core client for 30 sec - exiting 22:35:40 (2492): No heartbeat from core client for 30 sec - exiting 22:35:41 (2492): No heartbeat from core client for 30 sec - exiting 22:35:42 (2492): No heartbeat from core client for 30 sec - exiting 22:35:43 (2492): No heartbeat from core client for 30 sec - exiting 22:35:44 (2492): No heartbeat from core client for 30 sec - exiting 22:35:45 (2492): No heartbeat from core client for 30 sec - exiting 22:35:46 (2492): No heartbeat from core client for 30 sec - exiting 22:35:47 (2492): No heartbeat from core client for 30 sec - exiting 22:35:48 (2492): No heartbeat from core client for 30 sec - exiting 22:35:49 (2492): No heartbeat from core client for 30 sec - exiting 22:35:50 (2492): No heartbeat from core client for 30 sec - exiting 22:37:42 (2492): No heartbeat from core client for 30 sec - exiting 22:37:43 (2492): No heartbeat from core client for 30 sec - exiting 22:37:44 (2492): No heartbeat from core client for 30 sec - exiting 22:37:45 (2492): No heartbeat from core client for 30 sec - exiting 22:37:46 (2492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11232, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11252, selfPID=11252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9792, selfPID=9792, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2872, selfPID=2872, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 11:38:08 (3468): No heartbeat from core client for 30 sec - exiting 11:38:09 (3468): No heartbeat from core client for 30 sec - exiting 11:38:10 (3468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:27:33 (2780): No heartbeat from core client for 30 sec - exiting 12:27:34 (2780): No heartbeat from core client for 30 sec - exiting 12:27:35 (2780): No heartbeat from core client for 30 sec - exiting 12:27:36 (2780): No heartbeat from core client for 30 sec - exiting 12:27:37 (2780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9408, selfPID=9408, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11624, selfPID=11624, iMonCtr=2 GCobal Workorn:: ClerN p: ocPDss pirocetss isin ,not runningRe eVait =ng,, bRheckalID= 0, elhePIkP=11140, iMoPICt=7=272 , iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7392, selfPID=7392, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9324, selfPID=9324, iMonCtr=2 21:32:38 (10976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17636, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8244, selfPID=8244, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11088, selfPID=11088, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=11032, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2128, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2128, selfPID=2128, iMonCtr=2 Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_y19k_1967_1_007053504_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_y19k_1967_1_007053504_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_y19k_1967_1_007053504_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_y19k_1967_1_007053504_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_y19k_1967_1_007053504_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Jun 2012 15:06:35 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 80,736 | 336,849 | 4.1722 |
12 Jun 2012 03:16:00 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 69,218 | 288,736 | 4.1714 |
12 Jun 2012 02:14:47 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 69,216 | 288,023 | 4.1612 |
10 Jun 2012 17:18:26 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 57,696 | 240,413 | 4.1669 |
07 Jun 2012 23:23:19 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 46,176 | 192,489 | 4.1686 |
06 Jun 2012 00:41:07 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 34,656 | 145,177 | 4.1891 |
04 Jun 2012 19:42:31 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 23,136 | 96,093 | 4.1534 |
31 May 2012 15:17:01 | 1016016 | 14657734 | hadam3p_eu_y19k_1967_1_007053504_1 | 11,616 | 48,359 | 4.1631 |
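
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached. The sketch below recomputes it from the rows above as a consistency check. The function and variable names are illustrative only and are not part of CPDN or BOINC; the formula itself is an assumption inferred from the table, not taken from project documentation.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep.
trickles = [
    (80736, 336849, 4.1722),
    (69218, 288736, 4.1714),
    (69216, 288023, 4.1612),
    (57696, 240413, 4.1669),
    (46176, 192489, 4.1686),
    (34656, 145177, 4.1891),
    (23136, 96093, 4.1534),
    (11616, 48359, 4.1631),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds spent per model timestep (hypothetical helper)."""
    return cpu_time_sec / timestep


# Pair each reported value with the recomputed one, rounded to 4 decimals.
recomputed = [
    (timestep, reported, round(sec_per_timestep(cpu, timestep), 4))
    for timestep, cpu, reported in trickles
]

for timestep, reported, computed in recomputed:
    print(f"timestep {timestep:>6}: reported {reported:.4f}, recomputed {computed:.4f}")
```

Under that assumption the recomputed values agree with the reported column to four decimal places, so the host held a steady rate of roughly 4.15 to 4.19 CPU seconds per timestep across all eight trickles.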