climateprediction.net home page
Task 21456341

Task 21456341

Name wah2_safr50_a16x_201412_16_779_011705569_1
Workunit 11705569
Created 28 Dec 2018, 1:01:56 UTC
Sent 7 Jan 2019, 19:34:56 UTC
Report deadline 21 Dec 2019, 0:54:56 UTC
Received 17 Mar 2019, 13:03:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1471279
Run time 2 days 3 hours 13 min 14 sec
CPU time 1 days 22 hours 30 min 38 sec
Validate state Invalid
Credit 5,339.28
Device peak FLOPS 5.10 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 256.41 MB
Peak swap size 219.96 MB
Peak disk usage 102.92 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14384, selfPID=12956, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3296, selfPID=3296, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13172, selfPID=13172, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13108, selfPID=11388, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10328, selfPID=10328, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13940, selfPID=13940, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2076, selfPID=2076, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2076, selfPID=15116, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16508, selfPID=16508, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10408, selfPID=10408, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

CPDN Monitor - Quit request from BOINC...
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
14:01:55 (11808): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11808, selfPID=14752, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
14:02:00 (14752): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a16x_201412_16_779_011705569_1_r1536692632_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Mar 2019 11:48:44 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 80,939 153,935 1.9019
03 Mar 2019 12:24:09 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 69,419 131,915 1.9003
22 Feb 2019 23:02:33 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 57,899 109,828 1.8969
16 Feb 2019 15:26:54 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 46,379 86,942 1.8746
15 Feb 2019 22:58:58 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 34,859 64,743 1.8573
15 Feb 2019 16:09:41 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 23,339 42,681 1.8287
05 Feb 2019 20:08:45 1471279 21456341 wah2_safr50_a16x_201412_16_779_011705569_1 11,819 21,083 1.7838


©2024 cpdn.org