climateprediction.net home page
Task 13559400

Task 13559400

Name hadcm3n_y8m6_1900_40_007526814_1
Workunit 7724289
Created 28 Oct 2011, 13:47:53 UTC
Sent 29 Oct 2011, 7:12:45 UTC
Report deadline 28 Jan 2012, 14:39:56 UTC
Received 3 Dec 2011, 8:51:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1294024
Run time 24 days 20 hours 52 min 48 sec
CPU time 20 days 2 hours 11 min 6 sec
Validate state Invalid
Credit 10,264.32
Device peak FLOPS 2.37 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:02:06 (2329): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:02:07 (2329): No heartbeat from core client for 30 sec - exiting
15:02:08 (2329): No heartbeat from core client for 30 sec - exiting
15:02:10 (2329): No heartbeat from core client for 30 sec - exiting
15:02:12 (2329): No heartbeat from core client for 30 sec - exiting
15:02:13 (2329): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:05:44 (2327): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:05:46 (2327): No heartbeat from core client for 30 sec - exiting
01:05:50 (2327): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=424, iMonCtr=1
Model crash detected, will try to restart...
06:26:16 (424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:23 (424): No heartbeat from core client for 30 sec - exiting
06:26:24 (424): No heartbeat from core client for 30 sec - exiting
06:26:25 (424): No heartbeat from core client for 30 sec - exiting
06:26:26 (424): No heartbeat from core client for 30 sec - exiting
08:29:24 (15270): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:41:32 (2597): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32079, iMonCtr=1
Model crash detected, will try to restart...
04:25:34 (32079): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:25:36 (32079): No heartbeat from core client for 30 sec - exiting
04:25:37 (32079): No heartbeat from core client for 30 sec - exiting
04:25:38 (32079): No heartbeat from core client for 30 sec - exiting
04:25:39 (32079): No heartbeat from core client for 30 sec - exiting
04:25:40 (32079): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25289, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
SIGSEGV: segmentation violation
Stack trace (10 frames):
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df]
[0xf77bf400]
/lib32/libc.so.6(getenv+0x64)[0xf7526be4]
/lib32/libc.so.6(+0x8b3d0)[0xf75833d0]
/lib32/libc.so.6(+0x8b581)[0xf7583581]
/lib32/libc.so.6(localtime_r+0x2c)[0xf7581cbc]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x80b0d9c]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x80b2dc4]
/lib32/libpthread.so.0(+0x596e)[0xf779796e]
/lib32/libc.so.6(clone+0x5e)[0xf75c7b5e]

Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Dec 2011 21:33:09 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 855,360 1,956,652 2.2875
02 Dec 2011 03:10:26 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 829,440 1,896,361 2.2863
01 Dec 2011 09:00:55 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 803,520 1,836,804 2.2859
30 Nov 2011 15:13:56 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 777,600 1,776,793 2.2850
29 Nov 2011 20:47:33 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 751,680 1,716,736 2.2839
29 Nov 2011 02:19:00 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 725,760 1,656,126 2.2819
28 Nov 2011 08:25:27 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 699,840 1,595,827 2.2803
27 Nov 2011 13:31:58 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 673,920 1,535,410 2.2783
26 Nov 2011 19:20:51 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 648,000 1,475,032 2.2763
26 Nov 2011 01:27:45 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 622,080 1,414,521 2.2739
25 Nov 2011 07:30:41 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 596,160 1,354,301 2.2717
24 Nov 2011 12:32:38 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 570,240 1,309,439 2.2963
23 Nov 2011 18:50:54 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 544,320 1,248,803 2.2942
23 Nov 2011 00:45:37 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 518,400 1,188,048 2.2918
22 Nov 2011 05:38:00 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 492,480 1,127,659 2.2898
21 Nov 2011 11:34:15 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 466,560 1,067,145 2.2873
20 Nov 2011 18:15:13 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 440,640 1,006,718 2.2847
20 Nov 2011 00:20:54 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 414,720 945,766 2.2805
19 Nov 2011 05:17:45 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 388,800 885,241 2.2769
18 Nov 2011 09:54:28 982003 13559400 hadcm3n_y8m6_1900_40_007526814_1 362,880 826,322 2.2771


©2024 cpdn.org