climateprediction.net home page
Task 18690829

Task 18690829

Name hadam3p_pnw_pm4q_2013_1_009976378_1
Workunit 9982736
Created 9 Jul 2015, 4:02:57 UTC
Sent 10 Jul 2015, 0:31:30 UTC
Report deadline 21 Jun 2016, 5:51:30 UTC
Received 27 Jul 2015, 20:33:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1367198
Run time 7 days 5 hours 11 min 13 sec
CPU time 5 days 19 hours 25 min 30 sec
Validate state Invalid
Credit 3,260.60
Device peak FLOPS 2.73 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.27
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3192, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3628, selfPID=3628, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4828, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2024, selfPID=3408, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3580, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4564, selfPID=4564, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4060, selfPID=4060, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2560, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4968, selfPID=240, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1368, selfPID=1368, iMonCtr=2
Signal 11 received, exiting...
21:32:04 (2072): called boinc_finish(193)
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1880, selfPID=1880, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1880, selfPID=2368, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:32:25 (2368): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_pm4q_2013_1_009976378_1_14.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pm4q_2013_1_009976378_1_15.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pm4q_2013_1_009976378_1_16.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pm4q_2013_1_009976378_1_17.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pm4q_2013_1_009976378_1_18.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Jul 2015 16:59:39 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 150,059 498,973 3.3252
26 Jul 2015 06:24:19 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 138,539 459,472 3.3166
25 Jul 2015 02:16:22 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 127,019 419,006 3.2988
23 Jul 2015 21:35:20 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 115,499 378,811 3.2798
22 Jul 2015 00:43:31 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 103,979 340,430 3.2740
20 Jul 2015 04:09:13 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 92,459 301,338 3.2592
19 Jul 2015 12:17:18 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 80,939 262,150 3.2389
17 Jul 2015 21:45:28 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 69,419 224,668 3.2364
16 Jul 2015 21:33:04 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 57,899 185,990 3.2123
16 Jul 2015 00:01:24 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 46,379 149,694 3.2276
14 Jul 2015 01:45:29 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 34,859 111,079 3.1865
12 Jul 2015 06:36:38 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 23,339 73,537 3.1508
11 Jul 2015 06:07:18 1367198 18690829 hadam3p_pnw_pm4q_2013_1_009976378_1 11,819 36,346 3.0752


©2024 cpdn.org