Task 12240389

Name: hadam3p_pnw_zdyw_1967_1_006958224_0
Workunit: 7161540
Created: 23 Nov 2010, 9:41:41 UTC
Sent: 18 Feb 2011, 21:47:49 UTC
Report deadline: 1 Feb 2012, 3:07:49 UTC
Received: 8 Apr 2011, 18:15:45 UTC
Server state: Over
Outcome: No reply
Client state: Compute error
Exit status: -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID: 1117239
Run time: 3 days 20 hours 44 min 35 sec
CPU time: 2 days 16 hours 43 min 11 sec
Validate state: Invalid
Credit: 1,003.37
Device peak FLOPS: 1.85 GFLOPS
Application version: UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86
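
The exit status above appears both as a signed decimal (-226) and as hexadecimal (0xFFFFFF1E). The sketch below assumes the hexadecimal form is the 32-bit two's-complement view of the same value. The helper name `to_unsigned32` is hypothetical and is not part of BOINC.

```python
def to_unsigned32(status: int) -> str:
    """Format a signed exit status as 32-bit unsigned hexadecimal.

    Assumption: the page's hex value is the low 32 bits of the signed code.
    """
    # Masking with 0xFFFFFFFF keeps the low 32 bits, which maps a negative
    # value onto its two's-complement representation.
    return f"0x{status & 0xFFFFFFFF:08X}"


print(to_unsigned32(-226))
```

Under that assumption, the result can be compared directly with the hexadecimal value in the Exit status field.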
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5380, selfPID=1744, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3612, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4700, selfPID=4292, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2352, selfPID=4128, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
21:14:23 (4316): No heartbeat from core client for 30 sec - exiting
21:14:24 (4316): No heartbeat from core client for 30 sec - exiting
21:14:25 (4316): No heartbeat from core client for 30 sec - exiting
21:14:26 (4316): No heartbeat from core client for 30 sec - exiting
21:14:27 (4316): No heartbeat from core client for 30 sec - exiting
21:14:28 (4316): No heartbeat from core client for 30 sec - exiting
21:14:30 (4316): No heartbeat from core client for 30 sec - exiting
21:14:31 (4316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2956, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4740, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2300, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1804, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4688, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3804, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4

</stderr_txt>
]]>
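
The stderr log above repeats a small set of messages. The sketch below tallies them, assuming a saved copy of the text is available. The function name `summarize` and the embedded `SAMPLE` excerpt are illustrative only and do not come from the CPDN or BOINC code.

```python
import re
from collections import Counter

# Short excerpt in the same format as the log above (illustrative sample only).
SAMPLE = """\
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Suspended CPDN Monitor - Suspend request from BOINC...
Model crash detected, will try to restart...
Regional yearly means requires 12 input files got 2
"""

PROGRESS = re.compile(r"requires (\d+) input files got (\d+)")


def summarize(stderr_text: str):
    """Count restart and suspend messages and keep the last 'got N' report."""
    counts = Counter()
    last_progress = None  # (files found, files required) from the latest report
    for line in stderr_text.splitlines():
        if line.startswith("Model crash detected"):
            counts["restarts"] += 1
        elif "Suspend request from BOINC" in line:
            counts["suspends"] += 1
        match = PROGRESS.search(line)
        if match:
            last_progress = (int(match.group(2)), int(match.group(1)))
    return counts, last_progress


counts, last_progress = summarize(SAMPLE)
print(dict(counts), last_progress)
```

Running `summarize` on the full stderr text instead of `SAMPLE` gives the same kind of tally for the whole task.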
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
05 Apr 2011 05:12:11 | 1117239 | 12240389 | hadam3p_pnw_zdyw_1967_1_006958224_0 | 46,177 | 193,780 | 4.1965
05 Apr 2011 05:01:54 | 1117239 | 12240389 | hadam3p_pnw_zdyw_1967_1_006958224_0 | 46,176 | 193,318 | 4.1865
03 Apr 2011 06:25:28 | 1117239 | 12240389 | hadam3p_pnw_zdyw_1967_1_006958224_0 | 34,656 | 144,467 | 4.1686
29 Mar 2011 21:03:23 | 1117239 | 12240389 | hadam3p_pnw_zdyw_1967_1_006958224_0 | 23,136 | 98,850 | 4.2726
27 Feb 2011 19:07:59 | 1117239 | 12240389 | hadam3p_pnw_zdyw_1967_1_006958224_0 | 11,616 | 144,467 | 4.0340
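
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is inferred from the numbers in the table rather than stated by the page. The sketch below recomputes it from the listed rows, and the name `seconds_per_timestep` is hypothetical.

```python
# (timestep, cumulative CPU seconds, reported average) copied from the table.
TRICKLES = [
    (46177, 193780, "4.1965"),
    (46176, 193318, "4.1865"),
    (34656, 144467, "4.1686"),
    (23136, 98850, "4.2726"),
    (11616, 46859, "4.0340"),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Average CPU seconds per model timestep (assumed definition)."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = f"{seconds_per_timestep(timestep, cpu_seconds):.4f}"
    print(f"{timestep:>6,}  computed={computed}  reported={reported}")
```

Each computed value can be compared with the reported one to check the assumed definition against the table.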

