Task 12769336

Name hadcm3n_o0p2_1900_40_007196233_2
Workunit 7394513
Created 2 Apr 2011, 20:38:33 UTC
Sent 2 Apr 2011, 20:47:01 UTC
Report deadline 3 Jul 2011, 4:14:12 UTC
Received 26 Apr 2011, 3:54:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1115875
Run time 4 days 11 hours 3 min 52 sec
CPU time 3 days 22 hours 0 min 25 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 2.42 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
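
Note on the exit status: the page shows the same value twice, as the signed decimal -226 and as the hexadecimal 0xFFFFFF1E. A minimal sketch of how the two relate, assuming the hex form is the 32-bit two's-complement view of the signed code (the helper names below are illustrative, not part of BOINC):

```python
def as_unsigned32(code: int) -> int:
    """Return the 32-bit two's-complement bit pattern of a signed exit code."""
    return code & 0xFFFFFFFF


def as_signed32(value: int) -> int:
    """Interpret a 32-bit pattern as a signed integer."""
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


if __name__ == "__main__":
    # Signed exit status from the task page, printed as 8 hex digits.
    print(f"0x{as_unsigned32(-226):08X}")
    # And the reverse direction, from the hex form back to the signed code.
    print(as_signed32(0xFFFFFF1E))
```

Under that assumption the two representations on the page agree with each other.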
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1
Model crash detected, will try to restart...
05:52:07 (4820): No heartbeat from core client for 30 sec - exiting
05:52:08 (4820): No heartbeat from core client for 30 sec - exiting
05:52:09 (4820): No heartbeat from core client for 30 sec - exiting
05:52:10 (4820): No heartbeat from core client for 30 sec - exiting
05:52:11 (4820): No heartbeat from core client for 30 sec - exiting
05:52:12 (4820): No heartbeat from core client for 30 sec - exiting
05:52:13 (4820): No heartbeat from core client for 30 sec - exiting
05:52:14 (4820): No heartbeat from core client for 30 sec - exiting
05:52:15 (4820): No heartbeat from core client for 30 sec - exiting
05:52:16 (4820): No heartbeat from core client for 30 sec - exiting
05:52:17 (4820): No heartbeat from core client for 30 sec - exiting
05:52:18 (4820): No heartbeat from core client for 30 sec - exiting
05:52:19 (4820): No heartbeat from core client for 30 sec - exiting
05:52:20 (4820): No heartbeat from core client for 30 sec - exiting
05:52:21 (4820): No heartbeat from core client for 30 sec - exiting
05:52:22 (4820): No heartbeat from core client for 30 sec - exiting
05:52:23 (4820): No heartbeat from core client for 30 sec - exiting
05:52:24 (4820): No heartbeat from core client for 30 sec - exiting
05:52:25 (4820): No heartbeat from core client for 30 sec - exiting
05:52:26 (4820): No heartbeat from core client for 30 sec - exiting
05:52:27 (4820): No heartbeat from core client for 30 sec - exiting
05:52:28 (4820): No heartbeat from core client for 30 sec - exiting
05:52:29 (4820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3268, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
25 Apr 2011 08:44:57  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  181,440   303,016         1.6701
24 Apr 2011 17:43:53  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  155,520   259,707         1.6699
23 Apr 2011 12:10:35  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  129,600   216,992         1.6743
22 Apr 2011 15:09:58  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  103,680   174,405         1.6821
21 Apr 2011 04:32:39  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  77,760    131,418         1.6900
10 Apr 2011 05:12:06  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  51,840    87,325          1.6845
09 Apr 2011 06:47:08  1115875  12769336   hadcm3n_o0p2_1900_40_007196233_2  25,920    42,599          1.6435
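
Note on the trickle table: the Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below checks that reading against the rows above; the formula is inferred from the numbers rather than taken from CPDN documentation, and the function names are illustrative.

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU seconds, reported average in sec/TS).
TRICKLES = [
    (181_440, 303_016, 1.6701),
    (155_520, 259_707, 1.6699),
    (129_600, 216_992, 1.6743),
    (103_680, 174_405, 1.6821),
    (77_760, 131_418, 1.6900),
    (51_840, 87_325, 1.6845),
    (25_920, 42_599, 1.6435),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_seconds / timestep


def matches_reported(rows, tolerance: float = 1e-4) -> list[bool]:
    """For each row, check the recomputed average against the reported one."""
    return [
        abs(seconds_per_timestep(ts, cpu) - avg) <= tolerance
        for ts, cpu, avg in rows
    ]


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(f"{ts:>8,}  {seconds_per_timestep(ts, cpu):.4f}  reported {avg:.4f}")
```

Every row agrees with the reported value to within rounding, and successive trickles are 25,920 timesteps apart.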

