Task 14365876

Name hadam3p_saf_0r5g_1963_1_006858764_2
Workunit 7062080
Created 5 Apr 2012, 17:54:19 UTC
Sent 5 Apr 2012, 17:54:47 UTC
Report deadline 18 Mar 2013, 23:14:47 UTC
Received 11 May 2012, 17:13:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1098998
Run time 1 day 19 hours 25 min 23 sec
CPU time 1 day 18 hours 49 min 52 sec
Validate state Invalid
Credit 935.95
Device peak FLOPS 2.68 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4828, selfPID=4828, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=3176, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=2040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4940, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
02:47:08 (3856): No heartbeat from core client for 30 sec - exiting
02:47:10 (3856): No heartbeat from core client for 30 sec - exiting
02:47:11 (3856): No heartbeat from core client for 30 sec - exiting
02:47:12 (3856): No heartbeat from core client for 30 sec - exiting
02:47:13 (3856): No heartbeat from core client for 30 sec - exiting
02:47:14 (3856): No heartbeat from core client for 30 sec - exiting
02:47:15 (3856): No heartbeat from core client for 30 sec - exiting
02:47:16 (3856): No heartbeat from core client for 30 sec - exiting
02:47:17 (3856): No heartbeat from core client for 30 sec - exiting
02:47:18 (3856): No heartbeat from core client for 30 sec - exiting
02:47:19 (3856): No heartbeat from core client for 30 sec - exiting
02:47:21 (3856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3548, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4984, selfPID=4424, iMonCtr=1
Model crash detected, will try to restart...
15:19:13 (4816): No heartbeat from core client for 30 sec - exiting
15:19:15 (4816): No heartbeat from core client for 30 sec - exiting
15:19:16 (4816): No heartbeat from core client for 30 sec - exiting
15:19:17 (4816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:19:18 (4816): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2848, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=2
Model crash detected, will try to restart...
23:25:17 (3844): No heartbeat from core client for 30 sec - exiting
23:25:18 (3844): No heartbeat from core client for 30 sec - exiting
23:25:19 (3844): No heartbeat from core client for 30 sec - exiting
23:25:20 (3844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=5048, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1896, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=2
Model crash detected, will try to restart...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4832, selfPID=4832, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4832, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0r5g_1963_1_006858764_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
10 May 2012 16:39:37  1098998  14365876   hadam3p_saf_0r5g_1963_1_006858764_2  57,696    142,738         2.4740
08 May 2012 15:11:06  1098998  14365876   hadam3p_saf_0r5g_1963_1_006858764_2  46,176    114,744         2.4849
06 May 2012 06:40:28  1098998  14365876   hadam3p_saf_0r5g_1963_1_006858764_2  34,656    86,235          2.4883
03 May 2012 18:00:10  1098998  14365876   hadam3p_saf_0r5g_1963_1_006858764_2  23,136    56,608          2.4467
03 May 2012 06:22:35  1098998  14365876   hadam3p_saf_0r5g_1963_1_006858764_2  11,616    28,260          2.4329
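
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption inferred from the figures, not something the page states. A minimal Python sketch that recomputes the column from the five rows above follows; the names TRICKLES, sec_per_timestep and rows_consistent are hypothetical and exist only for this illustration.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (57_696, 142_738, 2.4740),
    (46_176, 114_744, 2.4849),
    (34_656, 86_235, 2.4883),
    (23_136, 56_608, 2.4467),
    (11_616, 28_260, 2.4329),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def rows_consistent(rows, tol: float = 1e-4) -> bool:
    """True if every reported average matches cpu_time / timestep within tol."""
    return all(abs(sec_per_timestep(cpu, ts) - avg) < tol for ts, cpu, avg in rows)


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        computed = sec_per_timestep(cpu, ts)
        print(f"timestep={ts:>6}  computed={computed:.4f}  reported={avg:.4f}")
    print("all rows consistent:", rows_consistent(TRICKLES))
```

Under the same assumption, consecutive trickles in this table are 11,520 timesteps apart, so the spacing between rows can be checked in the same way.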

