climateprediction.net home page
Task 15737092

Name hadam3p_pnw_q7nt_2045_1_008353330_0
Workunit 8504189
Created 19 Apr 2013, 16:25:02 UTC
Sent 19 Apr 2013, 16:28:51 UTC
Report deadline 1 Apr 2014, 21:48:51 UTC
Received 3 Jul 2013, 20:55:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1097161
Run time 3 days 6 hours 41 min 53 sec
CPU time 3 days 1 hours 14 min 15 sec
Validate state Invalid
Credit 1,253.71
Device peak FLOPS 1.65 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5676, selfPID=6652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=5168, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=4388, iMonCtr=1
Model crash detected, will try to restart...
GSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=4700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1972, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=5112, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6300, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3816, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
10:25:22 (5188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3804, selfPID=1408, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2444, selfPID=5084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1748, selfPID=5024, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2644, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1572, selfPID=5268, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q7nt_2045_1_008353330_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
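The log above reports "Regional yearly means requires 12 input files got 5", and the message block lists upload errors for files 6 through 12. The following is a minimal sketch, not part of the original page, of how the two could be cross-checked. The helper name `failed_uploads` is hypothetical, the sample message is rebuilt from the file names shown above, and the link between the seven failed zips and the "got 5" count is an assumption drawn only from the numbers in this log.

```python
import re
import xml.etree.ElementTree as ET

TASK = "hadam3p_pnw_q7nt_2045_1_008353330_0"

# Rebuild the <message> block from the log: zips 6..12, each with error code -161.
entries = "".join(
    f"<file_xfer_error><file_name>{TASK}_{i}.zip</file_name>"
    f"<error_code>-161</error_code></file_xfer_error>"
    for i in range(6, 13)
)
MESSAGE = f"<message>{entries}</message>"


def failed_uploads(message_xml):
    """Return sorted (zip_index, error_code) pairs from a <message> block.

    Hypothetical helper: assumes file names end in _<index>.zip.
    """
    root = ET.fromstring(message_xml)
    pairs = []
    for err in root.iter("file_xfer_error"):
        name = err.findtext("file_name").strip()
        code = int(err.findtext("error_code"))
        match = re.search(r"_(\d+)\.zip$", name)
        pairs.append((int(match.group(1)), code))
    return sorted(pairs)


failed = failed_uploads(MESSAGE)
missing = [index for index, _ in failed]
codes = {code for _, code in failed}

print("failed zip indices:", missing)
print("error codes seen:", codes)
# Assumption: 12 files expected, as stated in the stderr line.
print("files not listed as failed:", 12 - len(missing))
```

Under these assumptions, seven of the twelve expected files are listed as failed, which leaves five and agrees with the count in the stderr line.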
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
02 Jul 2013 13:55:02 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 57,698 | 244,838 | 4.2434
02 Jul 2013 12:05:57 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 57,696 | 244,242 | 4.2333
02 Jul 2013 10:50:38 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 46,176 | 195,491 | 4.2336
26 Jun 2013 19:15:21 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 34,656 | 146,261 | 4.2204
22 Jun 2013 18:52:46 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 23,136 | 97,473 | 4.2130
08 Jun 2013 10:49:29 | 1097161 | 15737092 | hadam3p_pnw_q7nt_2045_1_008353330_0 | 11,616 | 49,538 | 4.2646
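
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an inference from the figures. A short sketch that checks the inference against all six rows, with the row data copied from the table and the function name `sec_per_timestep` chosen only for illustration:

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
ROWS = [
    (57698, 244838, "4.2434"),
    (57696, 244242, "4.2333"),
    (46176, 195491, "4.2336"),
    (34656, 146261, "4.2204"),
    (23136, 97473, "4.2130"),
    (11616, 49538, "4.2646"),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return f"{cpu_time_sec / timestep:.4f}"


results = [
    (timestep, sec_per_timestep(cpu, timestep), reported)
    for timestep, cpu, reported in ROWS
]

for timestep, computed, reported in results:
    status = "match" if computed == reported else "differs"
    print(f"timestep {timestep}: computed {computed}, reported {reported} ({status})")
```

If the assumed formula holds, every row prints "match", which suggests the column is a running average over the whole task rather than a per-trickle rate.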

