climateprediction.net home page
Task 14993560

Task 14993560

Name hadam3p_pnw_zl6m_2005_1_006972374_1
Workunit 7175690
Created 25 Jul 2012, 13:50:19 UTC
Sent 25 Jul 2012, 13:50:25 UTC
Report deadline 7 Jul 2013, 19:10:25 UTC
Received 16 Aug 2012, 10:44:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1161000
Run time 7 days 0 hours 27 min 40 sec
CPU time 6 days 8 hours 16 min 29 sec
Validate state Invalid
Credit 2,254.95
Device peak FLOPS 1.87 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
13:51:52 (4076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:41:07 (4136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:33:58 (3444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:06 (3208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:07 (3208): No heartbeat from core client for 30 sec - exiting
21:20:06 (4360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:20:07 (4360): No heartbeat from core client for 30 sec - exiting
21:20:08 (4360): No heartbeat from core client for 30 sec - exiting
21:20:10 (4360): No heartbeat from core client for 30 sec - exiting
21:20:11 (4360): No heartbeat from core client for 30 sec - exiting
21:20:12 (4360): No heartbeat from core client for 30 sec - exiting
21:20:13 (4360): No heartbeat from core client for 30 sec - exiting
21:20:14 (4360): No heartbeat from core client for 30 sec - exiting
21:20:15 (4360): No heartbeat from core client for 30 sec - exiting
21:20:16 (4360): No heartbeat from core client for 30 sec - exiting
22:12:03 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:05:24 (4604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2776, selfPID=2776, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bReCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2840, selfPID=2840, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2792, selfPID=2792, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2732, selfPID=2732, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4060, selfPID=4060, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4060, selfPID=1296, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_zl6m_2005_1_006972374_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zl6m_2005_1_006972374_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zl6m_2005_1_006972374_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Aug 2012 00:34:05 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 103,777 515,264 4.9651
15 Aug 2012 16:53:37 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 103,776 514,628 4.9590
14 Aug 2012 14:57:45 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 92,256 459,421 4.9798
12 Aug 2012 08:45:25 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 80,736 403,655 4.9997
11 Aug 2012 14:17:51 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 69,216 345,621 4.9934
10 Aug 2012 08:56:48 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 57,696 289,113 5.0110
09 Aug 2012 04:55:58 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 46,176 233,035 5.0467
07 Aug 2012 04:58:52 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 34,656 176,578 5.0952
06 Aug 2012 02:22:49 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 23,136 119,311 5.1569
05 Aug 2012 06:26:12 1161000 14993560 hadam3p_pnw_zl6m_2005_1_006972374_1 11,616 60,349 5.1953


©2024 climateprediction.net