climateprediction.net home page
Task 14317261

Task 14317261

Name hadam3p_eu_agy6_1987_1_007841805_0
Workunit 7996917
Created 25 Mar 2012, 10:31:58 UTC
Sent 25 Mar 2012, 10:32:24 UTC
Report deadline 7 Mar 2013, 15:52:24 UTC
Received 8 May 2012, 5:36:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1196548
Run time 3 days 16 hours 25 min 6 sec
CPU time 3 days 1 hours 27 min 35 sec
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 3.06 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
13:40:38 (2028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5460, selfPID=5460, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=2868, iMonCtr=1
Model crash detected, will try to restart...
22:10:21 (5076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2304, selfPID=6104, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
16:48:12 (2704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:38:15 (4720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5528, selfPID=5528, iMonCtCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5296, selfPID=3508, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1380, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
10:07:41 (3444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
G10:27:20 (4692): No heartbeat from core client for 30 sec - exiting
10:27:21 (4692): No heartbeat from core client for 30 sec - exiting
10:27:22 (4692): No heartbeat from core client for 30 sec - exiting
10:27:23 (4692): No heartbeat from core client for 30 sec - exiting
10:27:24 (4692): No heartbeat from core client for 30 sec - exiting
10:27:25 (4692): No heartbeat from core client for 30 sec - exiting
10:27:26 (4692): No heartbeat from core client for 30 sec - exiting
10:27:27 (4692): No heartbeat from core client for 30 sec - exiting
10:27:28 (4692): No heartbeat from core client for 30 sec - exiting
10:27:29 (4692): No heartbeat from core client for 30 sec - exiting
10:27:30 (4692): No heartbeat from core client for 30 sec - exiting
10:27:31 (4692): No heartbeat from core client for 30 sec - exiting
10:27:32 (4692): No heartbeat from core client for 30 sec - exiting
10:27:33 (4692): No heartbeat from core client for 30 sec - exiting
10:27:34 (4692): No heartbeat from core client for 30 sec - exiting
10:27:35 (4692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:27:36 (4692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:39:52 (4736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:39:53 (4736): No heartbeat from core client for 30 sec - exiting
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1980, selfPID=604, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1980, selfPID=1980, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5624, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3696, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
18:20:03 (3528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:53:29 (1696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5464, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
14:42:06 (4972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:40:57 (6100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:40:58 (6100): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5972, selfPID=5972, iMonCtr=2
20:39:32 (5388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3752, selfPID=3720, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5908, selfPID=5908, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5908, selfPID=5916, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2664, selfPID=4524, iMonCtr=1
Model crash detected, will try to restart...
17:37:30 (3676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:37:31 (3676): No heartbeat from core client for 30 sec - exiting
17:37:32 (3676): No heartbeat from core client for 30 sec - exiting
17:37:33 (3676): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2064, selfPID=1696, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_agy6_1987_1_007841805_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 May 2012 09:49:40 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 126,816 249,681 1.9688
03 May 2012 06:27:42 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 115,296 226,943 1.9684
02 May 2012 13:57:27 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 103,776 205,351 1.9788
28 Apr 2012 12:41:25 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 92,256 182,536 1.9786
27 Apr 2012 16:16:54 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 80,736 159,462 1.9751
22 Apr 2012 13:37:26 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 69,216 136,287 1.9690
12 Apr 2012 18:35:18 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 57,696 113,734 1.9713
12 Apr 2012 18:35:18 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 46,176 91,151 1.9740
12 Apr 2012 18:35:18 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 34,656 68,645 1.9808
12 Apr 2012 18:35:18 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 23,136 46,118 1.9933
12 Apr 2012 18:35:18 1196548 14317261 hadam3p_eu_agy6_1987_1_007841805_0 11,616 23,395 2.0140


©2024 cpdn.org