climateprediction.net home page
Task 12534776

Task 12534776

Name hadam3p_saf_2fo7_1994_1_007149994_0
Workunit 7334774
Created 26 Jan 2011, 21:26:15 UTC
Sent 31 Jan 2011, 13:49:28 UTC
Report deadline 13 Jan 2012, 19:09:28 UTC
Received 8 Mar 2011, 13:55:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 783352
Run time 5 days 10 hours 31 min 58 sec
CPU time 4 days 21 hours 47 min 5 sec
Validate state Invalid
Credit 2,057.34
Device peak FLOPS 2.23 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4944, selfPID=4944, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6832, selfPID=6832, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5216, selfPID=5216, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN proceCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6608, selfPID=6608, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5936, selfPID=5936, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7488, selfPID=7488, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3016, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4628, selfPID=4628, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7820, selfPID=7820, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5632, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6772, selfPID=7348, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2460, selfPID=2460, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:35:02 (2784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5460, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:59:32 (1856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6644, selfPID=6644, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2756, selfPID=2756, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7336, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4332, selfPID=4332, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5896, selfPID=5896, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5896, selfPID=5740, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:58:31 (5740): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_2fo7_1994_1_007149994_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Mar 2011 13:58:27 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 126,824 403,471 3.1813
08 Mar 2011 13:58:26 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 126,816 402,892 3.1770
08 Mar 2011 13:58:26 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 115,296 364,143 3.1583
23 Feb 2011 20:42:28 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 103,776 328,655 3.1670
20 Feb 2011 14:32:37 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 92,256 292,842 3.1742
18 Feb 2011 13:33:16 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 80,736 256,593 3.1782
13 Feb 2011 13:47:05 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 69,216 220,598 3.1871
12 Feb 2011 12:34:24 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 57,696 185,223 3.2103
11 Feb 2011 14:40:57 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 46,176 149,055 3.2280
08 Feb 2011 15:42:59 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 34,656 112,350 3.2419
05 Feb 2011 20:58:20 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 23,136 75,063 3.2444
04 Feb 2011 13:36:12 783352 12534776 hadam3p_saf_2fo7_1994_1_007149994_0 11,616 37,674 3.2433


©2024 climateprediction.net