Task 13136926

Name hadcm3n_t39g_1940_40_007315054_2
Workunit 7512484
Created 10 Jul 2011, 15:29:33 UTC
Sent 10 Jul 2011, 15:33:31 UTC
Report deadline 9 Oct 2011, 23:00:42 UTC
Received 30 Jul 2011, 13:02:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 942022
Run time 5 days 17 hours 22 min 55 sec
CPU time 3 days 20 hours 13 min 38 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 2.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:26:05 (842): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:26:06 (842): No heartbeat from core client for 30 sec - exiting
22:31:02 (3475): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:31:04 (3475): No heartbeat from core client for 30 sec - exiting
22:31:05 (3475): No heartbeat from core client for 30 sec - exiting
22:31:06 (3475): No heartbeat from core client for 30 sec - exiting
22:31:07 (3475): No heartbeat from core client for 30 sec - exiting
22:31:08 (3475): No heartbeat from core client for 30 sec - exiting
22:31:09 (3475): No heartbeat from core client for 30 sec - exiting
22:31:10 (3475): No heartbeat from core client for 30 sec - exiting
22:31:11 (3475): No heartbeat from core client for 30 sec - exiting
22:31:12 (3475): No heartbeat from core client for 30 sec - exiting
22:31:13 (3475): No heartbeat from core client for 30 sec - exiting
22:31:14 (3475): No heartbeat from core client for 30 sec - exiting
22:31:15 (3475): No heartbeat from core client for 30 sec - exiting
22:31:16 (3475): No heartbeat from core client for 30 sec - exiting
22:31:17 (3475): No heartbeat from core client for 30 sec - exiting
22:31:18 (3475): No heartbeat from core client for 30 sec - exiting
22:31:19 (3475): No heartbeat from core client for 30 sec - exiting
22:31:20 (3475): No heartbeat from core client for 30 sec - exiting
22:31:21 (3475): No heartbeat from core client for 30 sec - exiting
22:31:22 (3475): No heartbeat from core client for 30 sec - exiting
22:31:23 (3475): No heartbeat from core client for 30 sec - exiting
22:31:24 (3475): No heartbeat from core client for 30 sec - exiting
22:31:25 (3475): No heartbeat from core client for 30 sec - exiting
22:31:26 (3475): No heartbeat from core client for 30 sec - exiting
22:31:27 (3475): No heartbeat from core client for 30 sec - exiting
22:31:28 (3475): No heartbeat from core client for 30 sec - exiting
22:31:29 (3475): No heartbeat from core client for 30 sec - exiting
22:31:30 (3475): No heartbeat from core client for 30 sec - exiting
22:31:31 (3475): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
20:07:46 (3586): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
SIGSEGV: segmentation violation
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:33:18 (8420): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:00:24 (855): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 133150) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2515, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
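The stderr output above is long and repetitive, so a tally by message type can make the sequence easier to read. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the category names and the `tally_events` helper are hypothetical, and the matching relies only on substrings that appear verbatim in the log. The sample lines are copied from the log above.

```python
from collections import Counter

# Substring -> category label. The labels are illustrative names chosen
# for this sketch; they are not defined by BOINC or CPDN.
PATTERNS = [
    ("Suspend request from BOINC", "suspend_request"),
    ("Quit request from BOINC", "quit_request"),
    ("No heartbeat from core client", "heartbeat_timeout"),
    ("No 'heartbeat' from BOINC", "monitor_heartbeat_lost"),
    ("SIGSEGV", "sigsegv"),
    ("Model crash detected", "model_crash"),
]


def classify(line):
    """Return the category for one stderr line, or 'other' if none matches."""
    if line.startswith("execl(") and line.endswith("failed!"):
        return "execl_failed"
    for needle, label in PATTERNS:
        if needle in line:
            return label
    return "other"


def tally_events(text):
    """Count non-empty stderr lines per category."""
    lines = (raw.strip() for raw in text.splitlines())
    return Counter(classify(line) for line in lines if line)


# A few lines copied verbatim from the stderr section above.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:26:05 (842): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGSEGV: segmentation violation
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
"""

counts = tally_events(sample)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

Pasting the full contents of the stderr section into `tally_events` in place of `sample` would give per-category counts for the whole run.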
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
27 Jul 2011 20:44:20  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  181,440   302,192         1.6655
26 Jul 2011 05:50:07  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  155,520   259,417         1.6681
25 Jul 2011 23:10:57  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  129,600   216,757         1.6725
25 Jul 2011 22:48:50  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  103,680   171,180         1.6510
25 Jul 2011 18:21:35  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  77,760    129,307         1.6629
25 Jul 2011 15:38:10  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  51,840    85,071          1.6410
25 Jul 2011 15:38:10  942022   13136926   hadcm3n_t39g_1940_40_007315054_2  25,920    42,271          1.6308
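The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. A small sketch that checks this against the listed rows follows. The helper name `average_sec_per_timestep` is made up for illustration, and the formula is inferred from the numbers in the table rather than taken from CPDN documentation.

```python
# (timestep, cumulative CPU seconds) pairs copied from the trickle table,
# newest first, in the same order as the rows above.
trickles = [
    (181_440, 302_192),
    (155_520, 259_417),
    (129_600, 216_757),
    (103_680, 171_180),
    (77_760, 129_307),
    (51_840, 85_071),
    (25_920, 42_271),
]


def average_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


averages = [average_sec_per_timestep(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>7} timesteps  {cpu:>7} s  {avg:.4f} s/TS")
```

Under that assumption the computed values match the Average column for all seven rows. The table also shows each trickle arriving 25,920 timesteps after the previous one.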

