climateprediction.net home page
Task 21270598

Task 21270598

Name wah2_eu25_egy1_201112_13_746_011594239_1
Workunit 11594240
Created 15 Aug 2018, 2:18:12 UTC
Sent 15 Aug 2018, 2:29:45 UTC
Report deadline 28 Jul 2019, 7:49:45 UTC
Received 18 Sep 2018, 22:10:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1322660
Run time 3 days 14 hours 58 min 27 sec
CPU time 3 days 0 hours 33 min 54 sec
Validate state Invalid
Credit 3,059.47
Device peak FLOPS 3.09 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 313.83 MB
Peak swap size 277.86 MB
Peak disk usage 128.08 MB
Stderr
<core_client_version>7.12.1</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
10:11:56 (4812): BOINC client no longer exists - exiting
10:11:56 (4812): timer handler: client dead, exiting
10:12:07 (4812): BOINC client no longer exists - exiting
10:12:07 (4812): timer handler: client dead, exiting
10:12:18 (4812): BOINC client no longer exists - exiting
10:12:18 (4812): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:28:02 (9732): Can't acquire lockfile (32) - waiting 35s
16:28:37 (9732): Can't acquire lockfile (32) - exiting
16:28:37 (9732): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:38:41 (14264): Can't acquire lockfile (32) - waiting 35s
16:39:16 (14264): Can't acquire lockfile (32) - exiting
16:39:16 (14264): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:49:20 (15760): Can't acquire lockfile (32) - waiting 35s
16:49:55 (15760): Can't acquire lockfile (32) - exiting
16:49:55 (15760): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:50:58 (5324): Can't acquire lockfile (32) - waiting 35s
16:51:33 (5324): Can't acquire lockfile (32) - exiting
16:51:33 (5324): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:02:19 (1600): Can't acquire lockfile (32) - waiting 35s
17:02:54 (1600): Can't acquire lockfile (32) - exiting
17:02:54 (1600): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:13:00 (5712): Can't acquire lockfile (32) - waiting 35s
17:13:35 (5712): Can't acquire lockfile (32) - exiting
17:13:35 (5712): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:23:39 (8456): Can't acquire lockfile (32) - waiting 35s
17:24:14 (8456): Can't acquire lockfile (32) - exiting
17:24:14 (8456): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:25:10 (5268): Can't acquire lockfile (32) - waiting 35s
17:25:45 (5268): Can't acquire lockfile (32) - exiting
17:25:45 (5268): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:26:35 (13904): Can't acquire lockfile (32) - waiting 35s
17:27:10 (13904): Can't acquire lockfile (32) - exiting
17:27:10 (13904): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:37:51 (15448): Can't acquire lockfile (32) - waiting 35s
17:38:26 (15448): Can't acquire lockfile (32) - exiting
17:38:26 (15448): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:48:34 (14868): Can't acquire lockfile (32) - waiting 35s
17:49:09 (14868): Can't acquire lockfile (32) - exiting
17:49:09 (14868): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:59:24 (14036): Can't acquire lockfile (32) - waiting 35s
17:59:59 (14036): Can't acquire lockfile (32) - exiting
17:59:59 (14036): Error: The process cannot access the file because it is being used by another process.

 (0x20)
18:10:09 (4132): Can't acquire lockfile (32) - waiting 35s
18:10:44 (4132): Can't acquire lockfile (32) - exiting
18:10:44 (4132): Error: The process cannot access the file because it is being used by another process.

 (0x20)
18:20:46 (432): Can't acquire lockfile (32) - waiting 35s
18:21:21 (432): Can't acquire lockfile (32) - exiting
18:21:21 (432): Error: The process cannot access the file because it is being used by another process.

 (0x20)
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=140, selfPID=140, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=140, selfPID=10016, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
11:31:07 (10016): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
14:37:50 (11500): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9092, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11500, selfPID=7972, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
14:37:54 (7972): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_egy1_201112_13_746_011594239_1_r1511157766_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Sep 2018 17:40:01 1322660 21270598 wah2_eu25_egy1_201112_13_746_011594239_1 46,379 237,271 5.1159
18 Sep 2018 17:38:32 1322660 21270598 wah2_eu25_egy1_201112_13_746_011594239_1 34,859 178,663 5.1253
18 Sep 2018 17:31:34 1322660 21270598 wah2_eu25_egy1_201112_13_746_011594239_1 23,339 120,547 5.1650
18 Sep 2018 17:29:24 1322660 21270598 wah2_eu25_egy1_201112_13_746_011594239_1 11,819 61,567 5.2092


©2024 cpdn.org