Task 18689291

Name hadam3p_pnw_pmlz_2013_1_009976992_2
Workunit 9983350
Created 8 Jul 2015, 14:29:51 UTC
Sent 8 Jul 2015, 15:34:54 UTC
Report deadline 19 Jun 2016, 20:54:54 UTC
Received 15 Jul 2015, 19:10:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1364900
Run time 1 days 22 hours 13 min 46 sec
CPU time 1 days 21 hours 55 min 37 sec
Validate state Invalid
Credit 3,010.29
Device peak FLOPS 4.41 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 (windows_intelx86)
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=832, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2648, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3388, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2776, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3404, selfPID=3688, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2916, selfPID=3764, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2964, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2632, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2824, selfPID=3760, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3056, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3540, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2900, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
14:42:48 (3660): called boinc_finish(193)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3692, selfPID=3752, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
14:42:55 (3752): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_13.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_14.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_15.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_16.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_17.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pmlz_2013_1_009976992_2_18.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
15 Jul 2015 15:54:18  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  138,539   155,381         1.1216
14 Jul 2015 20:18:08  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  127,019   142,663         1.1232
14 Jul 2015 16:32:08  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  115,499   129,927         1.1249
13 Jul 2015 20:41:10  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  103,979   116,944         1.1247
13 Jul 2015 16:55:10  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  92,459    104,177         1.1267
11 Jul 2015 18:06:04  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  80,939    91,162          1.1263
11 Jul 2015 14:25:10  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  69,419    78,175          1.1261
10 Jul 2015 18:51:56  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  57,899    65,151          1.1253
09 Jul 2015 23:03:12  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  46,379    51,942          1.1199
09 Jul 2015 19:22:14  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  34,859    38,569          1.1064
09 Jul 2015 15:51:14  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  23,339    25,579          1.0960
08 Jul 2015 20:26:09  1364900  18689291   hadam3p_pnw_pmlz_2013_1_009976992_2  11,819    12,757          1.0794
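
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The page does not state this formula, so treat it as an assumption. A minimal Python sketch that checks it against the latest and earliest trickles (the function name is hypothetical):

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the Average (sec/TS) column is derived;
    the page itself does not document the formula.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the latest and earliest trickles
trickles = [(138_539, 155_381), (11_819, 12_757)]

for timestep, cpu_time_sec in trickles:
    avg = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep {timestep:>7,}: {avg:.4f} sec/TS")
```

For these two rows the computed values agree with the table (1.1216 and 1.0794).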

