climateprediction.net home page
Task 16891687

Task 16891687

Name hadam3p_pnw_f1s8_2010_1_008897421_1
Workunit 9043204
Created 19 Aug 2014, 15:05:26 UTC
Sent 19 Aug 2014, 15:41:23 UTC
Report deadline 1 Aug 2015, 21:01:23 UTC
Received 29 Aug 2014, 12:07:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1229790
Run time 3 days 19 hours 11 min 15 sec
CPU time 1 days 0 hours 6 min 16 sec
Validate state Invalid
Credit 1,007.76
Device peak FLOPS 2.67 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5464, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6564, selfPID=4116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8720, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4380, selfPID=5540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7444, selfPID=8468, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8564, selfPID=8128, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9036, selfPID=7284, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3416, selfPID=8828, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4588, selfPID=8120, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7936, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
15:29:19 (7936): called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7320, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7320, selfPID=7888, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8556, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8868, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=7160, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
SETPOS: Unit 61 to Word Address -198 Failed with Error Code -1

Model crashed: SETPOS: Unit 61 to Word Address -198 Failed with Error Code -1
Leaving CPDN_Main::Monitor...
00:58:26 (6792): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_f1s8_2010_1_008897421_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Aug 2014 18:47:18 1229790 16891687 hadam3p_pnw_f1s8_2010_1_008897421_1 46,379 139,838 3.0151
23 Aug 2014 19:33:21 1229790 16891687 hadam3p_pnw_f1s8_2010_1_008897421_1 34,859 102,374 2.9368
22 Aug 2014 15:26:40 1229790 16891687 hadam3p_pnw_f1s8_2010_1_008897421_1 23,339 69,203 2.9651
20 Aug 2014 19:12:23 1229790 16891687 hadam3p_pnw_f1s8_2010_1_008897421_1 11,819 35,867 3.0347


©2024 cpdn.org