Task 19404643

Name hadam3p_afr50_ewnp_201412_12_371_010403859_0
Workunit 10403859
Created 19 Mar 2016, 21:12:59 UTC
Sent 21 Mar 2016, 6:34:54 UTC
Report deadline 3 Mar 2017, 11:54:54 UTC
Received 19 Apr 2016, 5:33:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1172489
Run time 4 days 0 hours 22 min 26 sec
CPU time 3 days 16 hours 19 min 59 sec
Validate state Invalid
Credit 6,287.02
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Africa v7.22 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2896, selfPID=5260, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=840, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4172, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2912, selfPID=4744, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1616, selfPID=1112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3616, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=2
Model crash detected, will try to restart...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5436, iMonCtr=2
ColobntroWlorrk: CPDN proc proess is not rrnuinn, exiting, bRetValtV al, =he1kPID=0, PelfPID=3824,IiMDnC2r=04
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2832, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=4808, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2096, selfPID=1128, iMonCtr=1
Model crash detected, will try to restart...
07:12:35 (3020): No heartbeat from client for 30 sec - exiting
07:12:35 (3020): timer handler: client dead, exiting
07:12:36 (3020): No heartbeat from client for 30 sec - exiting
07:12:36 (3020): timer handler: client dead, exiting
07:12:38 (3020): No heartbeat from client for 30 sec - exiting
07:12:38 (3020): timer handler: client dead, exiting
07:12:39 (3020): No heartbeat from client for 30 sec - exiting
07:12:39 (3020): timer handler: client dead, exiting
07:12:40 (3020): No heartbeat from client for 30 sec - exiting
07:12:40 (3020): timer handler: client dead, exiting
07:12:41 (3020): No heartbeat from client for 30 sec - exiting
07:12:41 (3020): timer handler: client dead, exiting
07:12:42 (3020): No heartbeat from client for 30 sec - exiting
07:12:42 (3020): timer handler: client dead, exiting
07:12:43 (3020): No heartbeat from client for 30 sec - exiting
07:12:43 (3020): timer handler: client dead, exiting
07:12:44 (3020): No heartbeat from client for 30 sec - exiting
07:12:44 (3020): timer handler: client dead, exiting
07:12:45 (3020): No heartbeat from client for 30 sec - exiting
07:12:45 (3020): timer handler: client dead, exiting
07:12:46 (3020): No heartbeat from client for 30 sec - exiting
07:12:46 (3020): timer handler: client dead, exiting
07:12:47 (3020): No heartbeat from client for 30 sec - exiting
07:12:47 (3020): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1568, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=4176, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1692, selfPID=1700, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1356, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2836, selfPID=2216, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:59:13 (2216): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
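The `<message>` section above lists one `<file_xfer_error>` per output file that failed to transfer. A minimal sketch of how those entries could be pulled out for review follows. The helper name `transfer_errors` is hypothetical, and only the first two entries are copied into the sample string. In the BOINC client sources, code -161 appears to correspond to `ERR_NOT_FOUND` (file not found); treat that reading as an assumption and check it against the BOINC error-code list.

```python
import xml.etree.ElementTree as ET

# Fragment copied from the <message> section above (first two entries only).
MESSAGE = """
<message>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_afr50_ewnp_201412_12_371_010403859_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
</message>
"""


def transfer_errors(message_xml):
    """Return (file_name, error_code) pairs from a <message> fragment.

    Hypothetical helper for illustration; not part of BOINC or CPDN.
    """
    root = ET.fromstring(message_xml.strip())
    return [
        (entry.findtext("file_name"), int(entry.findtext("error_code")))
        for entry in root.iter("file_xfer_error")
    ]


errors = transfer_errors(MESSAGE)
for name, code in errors:
    print(f"{name}: {code}")
```

Run against the full message, the same helper would list all six zip files (`_7` through `_12`), each with code -161.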
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                   Timestep  CPU Time (sec)  Average (sec/TS)
15 Apr 2016 12:14:58  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  69,419    299,646         4.3165
11 Apr 2016 05:49:00  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  57,899    252,044         4.3532
05 Apr 2016 14:19:45  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  46,379    202,875         4.3743
01 Apr 2016 08:59:22  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  34,859    152,166         4.3652
30 Mar 2016 06:54:01  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  23,339    101,226         4.3372
23 Mar 2016 12:31:36  1172489  19404643   hadam3p_afr50_ewnp_201412_12_371_010403859_0  11,819    51,326          4.3427
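The "Average (sec/TS)" column in the trickle rows above looks like cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the six reported rows. The variable names are illustrative, and the formula is inferred from the numbers rather than taken from CPDN documentation.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
trickles = [
    (69419, 299646, 4.3165),
    (57899, 252044, 4.3532),
    (46379, 202875, 4.3743),
    (34859, 152166, 4.3652),
    (23339, 101226, 4.3372),
    (11819, 51326, 4.3427),
]

# Assumed relation: average = cumulative CPU seconds / cumulative timesteps.
recomputed = [cpu / ts for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, recomputed):
    print(f"timestep {ts}: recomputed {value:.4f}, reported {reported}")
```

Under that assumption, every recomputed value agrees with the reported one to four decimal places, so the model ran at a steady rate of roughly 4.3 to 4.4 CPU seconds per timestep across all six trickles.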

