climateprediction.net home page
Task 18518348

Task 18518348

Name hadam3p_anz_n6vi_2007_1_009865498_1
Workunit 9903995
Created 30 May 2015, 23:56:12 UTC
Sent 31 May 2015, 10:52:33 UTC
Report deadline 12 May 2016, 16:12:33 UTC
Received 7 Jul 2015, 1:31:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -187 (0xFFFFFF45) ERR_RESULT_UPLOAD
Computer ID 1310670
Run time 8 days 8 hours 38 min 38 sec
CPU time 7 days 19 hours 4 min 34 sec
Validate state Invalid
Credit 4,981.10
Device peak FLOPS 1.89 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
upload failure
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6372, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=876, selfPID=1936, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5520, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
08:08:34 (10048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6256, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8848, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6672, iMonCtr=2
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8072, selfPID=3764, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:48:14 (3388): No heartbeat from core client for 30 sec - exiting
04:48:15 (3388): No heartbeat from core client for 30 sec - exiting
04:48:16 (3388): No heartbeat from core client for 30 sec - exiting
04:48:17 (3388): No heartbeat from core client for 30 sec - exiting
04:48:18 (3388): No heartbeat from core client for 30 sec - exiting
04:48:19 (3388): No heartbeat from core client for 30 sec - exiting
04:48:20 (3388): No heartbeat from core client for 30 sec - exiting
04:48:22 (3388): No heartbeat from core client for 30 sec - exiting
04:48:23 (3388): No heartbeat from core client for 30 sec - exiting
04:48:24 (3388): No heartbeat from core client for 30 sec - exiting
04:48:25 (3388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:48:26 (3388): No heartbeat from core client for 30 sec - exiting
04:48:27 (3388): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1524, selfPID=1752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1344, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10700, selfPID=11092, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11080, selfPID=6432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6396, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4388, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=2088, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9948, iMonCtr=2
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8388, selfPID=9264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6364, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=10600, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

zip error: Nothing to do! (../_1.zip)
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jul 2015 18:34:03 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 115,499 628,686 5.4432
04 Jul 2015 12:14:13 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 103,979 567,220 5.4551
19 Jun 2015 11:29:05 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 92,459 503,588 5.4466
17 Jun 2015 03:33:50 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 80,939 441,513 5.4549
13 Jun 2015 12:46:35 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 69,419 378,858 5.4576
10 Jun 2015 11:01:44 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 57,899 315,468 5.4486
07 Jun 2015 00:02:32 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 46,379 252,214 5.4381
04 Jun 2015 22:21:29 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 34,859 189,887 5.4473
03 Jun 2015 12:16:37 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 23,339 127,167 5.4487
01 Jun 2015 07:24:15 1310670 18518348 hadam3p_anz_n6vi_2007_1_009865498_1 11,819 64,413 5.4500


©2024 climateprediction.net