climateprediction.net home page
Task 13533026

Task 13533026

Name hadcm3n_ym9a_1900_40_007513682_1
Workunit 7711157
Created 28 Oct 2011, 12:32:43 UTC
Sent 26 Nov 2011, 0:38:35 UTC
Report deadline 25 Feb 2012, 8:05:46 UTC
Received 20 Dec 2011, 2:12:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 763269
Run time 21 days 20 hours 6 min 35 sec
CPU time 16 days 7 hours 16 min 52 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.61 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:48:40 (16621): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:36:51 (27833): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:36:53 (27833): No heartbeat from core client for 30 sec - exiting
03:36:54 (27833): No heartbeat from core client for 30 sec - exiting
03:42:23 (3716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:37:24 (29727): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:37:26 (29727): No heartbeat from core client for 30 sec - exiting
01:38:58 (26673): No heartbeat from core client for 30 sec - exiting
01:38:59 (26673): No heartbeat from core client for 30 sec - exiting
01:39:00 (26673): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:45:44 (29668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:21:37 (30282): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:49:32 (4806): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:49:33 (4806): No heartbeat from core client for 30 sec - exiting
02:49:34 (4806): No heartbeat from core client for 30 sec - exiting
02:49:35 (4806): No heartbeat from core client for 30 sec - exiting
02:49:36 (4806): No heartbeat from core client for 30 sec - exiting
02:52:00 (16590): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:52:01 (16590): No heartbeat from core client for 30 sec - exiting
02:52:02 (16590): No heartbeat from core client for 30 sec - exiting
02:52:03 (16590): No heartbeat from core client for 30 sec - exiting
02:52:04 (16590): No heartbeat from core client for 30 sec - exiting
02:52:05 (16590): No heartbeat from core client for 30 sec - exiting
02:52:06 (16590): No heartbeat from core client for 30 sec - exiting
03:06:54 (18539): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:06:55 (18539): No heartbeat from core client for 30 sec - exiting
03:06:56 (18539): No heartbeat from core client for 30 sec - exiting
03:06:57 (18539): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf779d400]
[0xf779d430]
/lib32/libc.so.6(gsignal+0x51)[0xf7610421]
/lib32/libc.so.6(abort+0x17e)[0xf7611bfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75fc296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7736400]
[0xf7736430]
/lib32/libc.so.6(gsignal+0x51)[0xf75a9421]
/lib32/libc.so.6(abort+0x17e)[0xf75aabfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7595296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf776e400]
[0xf776e430]
/lib32/libc.so.6(gsignal+0x51)[0xf75e1421]
/lib32/libc.so.6(abort+0x17e)[0xf75e2bfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75cd296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7784400]
[0xf7784430]
/lib32/libc.so.6(gsignal+0x51)[0xf75f7421]
/lib32/libc.so.6(abort+0x17e)[0xf75f8bfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75e3296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77bb400]
[0xf77bb430]
/lib32/libc.so.6(gsignal+0x51)[0xf762e421]
/lib32/libc.so.6(abort+0x17e)[0xf762fbfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf761a296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7754400]
[0xf7754430]
/lib32/libc.so.6(gsignal+0x51)[0xf75c7421]
/lib32/libc.so.6(abort+0x17e)[0xf75c8bfe]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75b3296]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25251, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Dec 2011 14:24:01 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 777,600 1,396,574 1.7960
18 Dec 2011 21:09:43 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 751,680 1,349,465 1.7953
18 Dec 2011 03:56:37 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 725,760 1,303,013 1.7954
17 Dec 2011 11:09:25 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 699,840 1,256,465 1.7954
16 Dec 2011 18:10:00 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 673,920 1,209,814 1.7952
16 Dec 2011 00:59:30 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 648,000 1,163,317 1.7952
15 Dec 2011 08:34:28 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 622,080 1,116,789 1.7952
14 Dec 2011 16:00:07 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 596,160 1,070,492 1.7956
13 Dec 2011 23:20:12 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 570,240 1,024,165 1.7960
13 Dec 2011 06:40:12 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 544,320 977,795 1.7964
12 Dec 2011 15:25:29 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 518,400 931,451 1.7968
11 Dec 2011 20:29:32 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 492,480 886,100 1.7993
11 Dec 2011 00:49:39 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 466,560 840,304 1.8011
10 Dec 2011 07:00:42 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 440,640 794,622 1.8033
09 Dec 2011 14:29:01 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 414,720 749,330 1.8068
08 Dec 2011 20:29:19 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 388,800 703,128 1.8085
08 Dec 2011 03:41:35 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 362,880 657,133 1.8109
07 Dec 2011 04:09:23 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 336,960 610,491 1.8118
06 Dec 2011 10:45:30 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 311,040 563,879 1.8129
05 Dec 2011 17:39:05 763269 13533026 hadcm3n_ym9a_1900_40_007513682_1 285,120 517,096 1.8136


©2024 cpdn.org