Task 16421862

Name hadam3p_anz_na8m_2012_1_008600886_1
Workunit 8747398
Created 27 Mar 2014, 8:54:47 UTC
Sent 27 Mar 2014, 8:57:11 UTC
Report deadline 9 Mar 2015, 14:17:11 UTC
Received 29 May 2014, 5:54:23 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1241124
Run time 23 days 16 hours 22 min 25 sec
CPU time 22 days 18 hours 5 min 6 sec
Validate state Workunit error - check skipped
Credit 5,974.74
Device peak FLOPS 1.31 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=6076, iMonCtr=1
Model crash detected, will try to restart...
19:07:18 (6108): No heartbeat from core client for 30 sec - exiting
19:07:19 (6108): No heartbeat from core client for 30 sec - exiting
19:07:21 (6108): No heartbeat from core client for 30 sec - exiting
19:07:22 (6108): No heartbeat from core client for 30 sec - exiting
19:07:23 (6108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=808, selfPID=2956, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2684, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5636, selfPID=5668, iMonCtr=1
Model crash detected, will try to restart...
06:14:06 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5980, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
07:32:57 (6060): No heartbeat from core client for 30 sec - exiting
07:32:59 (6060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6040, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1380, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14188, selfPID=12460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1344, selfPID=5732, iMonCtr=1
Model crash detected, will try to restart...
06:33:00 (5660): No heartbeat from core client for 30 sec - exiting
06:33:03 (5660): No heartbeat from core client for 30 sec - exiting
06:33:04 (5660): No heartbeat from core client for 30 sec - exiting
06:33:05 (5660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3112, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6512, selfPID=3084, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14356, selfPID=13592, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1048, selfPID=5956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=31556, selfPID=31556, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31876, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7816, selfPID=5532, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=1540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=660, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4232, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5464, selfPID=5720, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=5204, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4336, selfPID=2448, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3536, selfPID=5548, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14668, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4636, selfPID=10948, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5580, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22188, selfPID=22064, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30396, selfPID=13364, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=652, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
08:50:12 (6024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3720, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=7660, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
]]>
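The stderr output above repeats a small number of message types. A minimal sketch of how they could be tallied follows. The category labels and the `tally` helper are hypothetical names chosen for illustration, not part of BOINC or the CPDN application, and the sample holds only a few lines copied from the log.

```python
import re
from collections import Counter

# Hypothetical category labels; the patterns are substrings copied from the log above.
PATTERNS = {
    "heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "restart_attempt": re.compile(r"Model crash detected, will try to restart"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
}


def tally(lines):
    """Count how many lines fall into each category (first matching pattern wins)."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts


# A few lines copied verbatim from the stderr section, not the full log.
sample = [
    "19:07:18 (6108): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "Model crash detected, will try to restart...",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "CPDN Monitor - Quit request from BOINC...",
]
counts = tally(sample)
```

Passing the full stderr text, split into lines, to `tally` would give per-category totals for the whole run. Lines that match no pattern, such as the Controller and Worker status lines, are not counted.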
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
29 May 2014 04:30:34  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  138,539   1,964,588       14.1808
25 May 2014 16:23:15  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  127,019   1,801,960       14.1865
21 May 2014 02:59:40  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  115,499   1,643,495       14.2295
18 May 2014 01:32:56  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  103,979   1,484,683       14.2787
14 May 2014 07:53:21  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  92,459    1,328,067       14.3638
11 May 2014 04:05:24  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  80,939    1,166,119       14.4074
07 May 2014 03:10:11  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  69,419    1,004,605       14.4716
04 May 2014 00:34:07  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  57,899    844,010         14.5773
27 Apr 2014 06:13:13  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  46,379    679,870         14.6590
20 Apr 2014 13:29:55  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  34,859    504,107         14.4613
07 Apr 2014 08:30:47  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  23,339    331,181         14.1900
31 Mar 2014 11:59:15  1241124  16421862   hadam3p_anz_na8m_2012_1_008600886_1  11,819    169,391         14.3321
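
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the newest and oldest rows and converts the reported CPU time field to seconds for comparison. The function names are hypothetical, and rounding to four decimal places is an assumption based on how the values are displayed.

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded as displayed in the table."""
    return round(cpu_seconds / timesteps, 4)


def duration_to_seconds(days: int, hours: int, minutes: int, seconds: int) -> int:
    """Convert a 'D days H hours M min S sec' duration to whole seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# (timestep, CPU time in seconds) copied from the newest and oldest trickle rows.
trickles = [(138_539, 1_964_588), (11_819, 169_391)]
averages = [seconds_per_timestep(cpu, ts) for ts, cpu in trickles]

# Reported "CPU time" field: 22 days 18 hours 5 min 6 sec.
total_cpu_seconds = duration_to_seconds(22, 18, 5, 6)

# CPU seconds accumulated between the last trickle and task completion.
after_last_trickle = total_cpu_seconds - trickles[0][1]
```

Both averages reproduce the table values (14.1808 and 14.3321). The reported CPU time comes to 1,965,906 seconds, which is 1,318 seconds more than the CPU time recorded in the last trickle.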

