climateprediction.net home page
Task 14859853

Task 14859853

Name hadam3p_saf_1utq_1973_1_007006182_1
Workunit 7209498
Created 3 Jul 2012, 21:39:06 UTC
Sent 3 Jul 2012, 21:39:22 UTC
Report deadline 16 Jun 2013, 2:59:22 UTC
Received 10 Jul 2012, 4:40:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1013631
Run time 2 days 1 hours 25 min 56 sec
CPU time 1 days 16 hours 22 min 12 sec
Validate state Invalid
Credit 749.07
Device peak FLOPS 2.15 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7036, selfPID=6476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4748, selfPID=5868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6184, selfPID=2760, iMonCtr=1
Model crash detected, will try to restart...
12:41:55 (2796): No heartbeat from core client for 30 sec - exiting
12:41:56 (2796): No heartbeat from core client for 30 sec - exiting
12:41:57 (2796): No heartbeat from core client for 30 sec - exiting
12:41:58 (2796): No heartbeat from core client for 30 sec - exiting
12:41:59 (2796): No heartbeat from core client for 30 sec - exiting
12:42:00 (2796): No heartbeat from core client for 30 sec - exiting
12:42:01 (2796): No heartbeat from core client for 30 sec - exiting
12:42:02 (2796): No heartbeat from core client for 30 sec - exiting
12:42:03 (2796): No heartbeat from core client for 30 sec - exiting
12:42:04 (2796): No heartbeat from core client for 30 sec - exiting
12:42:05 (2796): No heartbeat from core client for 30 sec - exiting
12:42:06 (2796): No heartbeat from core client for 30 sec - exiting
12:42:07 (2796): No heartbeat from core client for 30 sec - exiting
12:42:08 (2796): No heartbeat from core client for 30 sec - exiting
12:42:09 (2796): No heartbeat from core client for 30 sec - exiting
12:42:10 (2796): No heartbeat from core client for 30 sec - exiting
12:42:11 (2796): No heartbeat from core client for 30 sec - exiting
12:42:12 (2796): No heartbeat from core client for 30 sec - exiting
12:42:13 (2796): No heartbeat from core client for 30 sec - exiting
12:42:14 (2796): No heartbeat from core client for 30 sec - exiting
12:42:15 (2796): No heartbeat from core client for 30 sec - exiting
12:42:16 (2796): No heartbeat from core client for 30 sec - exiting
12:42:17 (2796): No heartbeat from core client for 30 sec - exiting
12:42:18 (2796): No heartbeat from core client for 30 sec - exiting
12:42:19 (2796): No heartbeat from core client for 30 sec - exiting
12:42:20 (2796): No heartbeat from core client for 30 sec - exiting
12:42:21 (2796): No heartbeat from core client for 30 sec - exiting
12:42:22 (2796): No heartbeat from core client for 30 sec - exiting
12:42:23 (2796): No heartbeat from core client for 30 sec - exiting
12:42:24 (2796): No heartbeat from core client for 30 sec - exiting
12:42:25 (2796): No heartbeat from core client for 30 sec - exiting
12:42:26 (2796): No heartbeat from core client for 30 sec - exiting
12:42:27 (2796): No heartbeat from core client for 30 sec - exiting
12:42:28 (2796): No heartbeat from core client for 30 sec - exiting
12:42:29 (2796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4320, selfPID=5848, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=6616, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2884, selfPID=2884, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8412, selfPID=6344, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Jul 2012 15:16:11 1013631 14859853 hadam3p_saf_1utq_1973_1_007006182_1 46,176 123,578 2.6762
07 Jul 2012 19:31:04 1013631 14859853 hadam3p_saf_1utq_1973_1_007006182_1 34,656 93,015 2.6840
07 Jul 2012 09:15:22 1013631 14859853 hadam3p_saf_1utq_1973_1_007006182_1 23,136 62,573 2.7046
05 Jul 2012 19:51:21 1013631 14859853 hadam3p_saf_1utq_1973_1_007006182_1 11,616 31,409 2.7039


©2024 climateprediction.net