climateprediction.net (CPDN) home page
Task 13850498

Task 13850498

Name hadam3p_eu_81z1_2001_1_007629350_0
Workunit 7807669
Created 2 Jan 2012, 14:05:39 UTC
Sent 2 Jan 2012, 18:13:25 UTC
Report deadline 14 Dec 2012, 23:33:25 UTC
Received 22 Jan 2012, 8:40:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1149100
Run time 2 days 9 hours 42 min 28 sec
CPU time 2 days 3 hours 59 min 45 sec
Validate state Invalid
Credit 995.31
Device peak FLOPS 2.40 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=7396, iMonCtr=1
Model crash detected, will try to restart...

zip error: Could not create output file (was replacing the original zip file)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6000, selfPID=5552, iMonCtr=1
Model crash detected, will try to restart...
20:35:00 (5004): No heartbeat from core client for 30 sec - exiting
20:35:01 (5004): No heartbeat from core client for 30 sec - exiting
20:35:02 (5004): No heartbeat from core client for 30 sec - exiting
20:35:03 (5004): No heartbeat from core client for 30 sec - exiting
20:35:04 (5004): No heartbeat from core client for 30 sec - exiting
20:35:05 (5004): No heartbeat from core client for 30 sec - exiting
20:35:06 (5004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5820, selfPID=5600, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6112, selfPID=5540, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5832, selfPID=5196, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4348, selfPID=6616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=244, selfPID=5352, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3772, selfPID=3772, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=164, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1300, selfPID=1300, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5024, selfPID=3252, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2520, selfPID=6080, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6252, selfPID=5728, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5224, selfPID=5224, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5368, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5420, selfPID=2676, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6016, selfPID=6108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2220, selfPID=5780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jan 2012 19:31:38 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 57,697 178,238 3.0892
20 Jan 2012 13:46:00 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 57,696 177,833 3.0822
16 Jan 2012 20:20:06 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 46,176 142,642 3.0891
14 Jan 2012 18:39:57 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 34,656 106,766 3.0807
07 Jan 2012 19:16:00 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 23,140 71,685 3.0979
07 Jan 2012 18:14:35 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 23,137 71,268 3.0803
07 Jan 2012 16:43:55 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 23,136 70,870 3.0632
04 Jan 2012 19:17:40 1149100 13850498 hadam3p_eu_81z1_2001_1_007629350_0 11,616 35,845 3.0858


©2025 cpdn.org