climateprediction.net home page
Task 16062773

Task 16062773

Name hadcm3n_o8c1_1900_40_008465924_2
Workunit 8616763
Created 9 Oct 2013, 10:04:58 UTC
Sent 9 Oct 2013, 10:15:22 UTC
Report deadline 8 Jan 2014, 17:42:33 UTC
Received 26 Nov 2013, 23:18:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1069378
Run time 15 days 3 hours 10 min 52 sec
CPU time 12 days 8 hours 12 min 4 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 1.41 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:41:25 (5195): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:41:32 (5195): No heartbeat from core client for 30 sec - exiting
22:41:33 (5195): No heartbeat from core client for 30 sec - exiting
22:41:34 (5195): No heartbeat from core client for 30 sec - exiting
22:41:35 (5195): No heartbeat from core client for 30 sec - exiting
Signal 1 received, exiting...
Called boinc_finish
Signal 1 received, exiting...
Called boinc_finish
18:34:39 (9623): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:34:43 (9623): No heartbeat from core client for 30 sec - exiting
18:34:44 (9623): No heartbeat from core client for 30 sec - exiting
19:05:24 (6478): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:24:24 (6801): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:27:37 (6946): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:27:38 (6946): No heartbeat from core client for 30 sec - exiting
19:27:39 (6946): No heartbeat from core client for 30 sec - exiting
19:27:40 (6946): No heartbeat from core client for 30 sec - exiting
19:27:41 (6946): No heartbeat from core client for 30 sec - exiting
19:27:42 (6946): No heartbeat from core client for 30 sec - exiting
19:27:43 (6946): No heartbeat from core client for 30 sec - exiting
19:38:15 (6981): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:38:48 (6981): No heartbeat from core client for 30 sec - exiting
20:37:42 (7276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:39:38 (7276): No heartbeat from core client for 30 sec - exiting
20:39:39 (7276): No heartbeat from core client for 30 sec - exiting
20:39:40 (7276): No heartbeat from core client for 30 sec - exiting
20:39:41 (7276): No heartbeat from core client for 30 sec - exiting
20:39:42 (7276): No heartbeat from core client for 30 sec - exiting
20:39:43 (7276): No heartbeat from core client for 30 sec - exiting
20:39:44 (7276): No heartbeat from core client for 30 sec - exiting
20:39:45 (7276): No heartbeat from core client for 30 sec - exiting
20:39:46 (7276): No heartbeat from core client for 30 sec - exiting
20:39:47 (7276): No heartbeat from core client for 30 sec - exiting
20:39:48 (7276): No heartbeat from core client for 30 sec - exiting
20:39:49 (7276): No heartbeat from core client for 30 sec - exiting
20:39:50 (7276): No heartbeat from core client for 30 sec - exiting
20:39:51 (7276): No heartbeat from core client for 30 sec - exiting
20:39:52 (7276): No heartbeat from core client for 30 sec - exiting
20:39:53 (7276): No heartbeat from core client for 30 sec - exiting
20:39:54 (7276): No heartbeat from core client for 30 sec - exiting
20:39:55 (7276): No heartbeat from core client for 30 sec - exiting
20:39:56 (7276): No heartbeat from core client for 30 sec - exiting
20:39:57 (7276): No heartbeat from core client for 30 sec - exiting
20:39:58 (7276): No heartbeat from core client for 30 sec - exiting
20:39:59 (7276): No heartbeat from core client for 30 sec - exiting
20:40:00 (7276): No heartbeat from core client for 30 sec - exiting
20:40:01 (7276): No heartbeat from core client for 30 sec - exiting
20:40:02 (7276): No heartbeat from core client for 30 sec - exiting
21:08:25 (7990): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:11 (8188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:52 (8188): No heartbeat from core client for 30 sec - exiting
21:15:53 (8188): No heartbeat from core client for 30 sec - exiting
21:15:54 (8188): No heartbeat from core client for 30 sec - exiting
21:15:55 (8188): No heartbeat from core client for 30 sec - exiting
21:15:56 (8188): No heartbeat from core client for 30 sec - exiting
17:52:26 (8275): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:03:56 (16849): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:04:17 (16849): No heartbeat from core client for 30 sec - exiting
18:05:41 (16960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:10:57 (16993): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:15:13 (17037): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:15:48 (17037): No heartbeat from core client for 30 sec - exiting
18:15:49 (17037): No heartbeat from core client for 30 sec - exiting
18:15:50 (17037): No heartbeat from core client for 30 sec - exiting
18:15:51 (17037): No heartbeat from core client for 30 sec - exiting
18:21:31 (17105): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:27:13 (17161): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:27:14 (17161): No heartbeat from core client for 30 sec - exiting
18:27:15 (17161): No heartbeat from core client for 30 sec - exiting
18:27:16 (17161): No heartbeat from core client for 30 sec - exiting
18:27:17 (17161): No heartbeat from core client for 30 sec - exiting
18:27:18 (17161): No heartbeat from core client for 30 sec - exiting
18:27:19 (17161): No heartbeat from core client for 30 sec - exiting
18:27:20 (17161): No heartbeat from core client for 30 sec - exiting
18:27:21 (17161): No heartbeat from core client for 30 sec - exiting
18:27:22 (17161): No heartbeat from core client for 30 sec - exiting
18:27:23 (17161): No heartbeat from core client for 30 sec - exiting
18:27:24 (17161): No heartbeat from core client for 30 sec - exiting
18:27:25 (17161): No heartbeat from core client for 30 sec - exiting
18:27:26 (17161): No heartbeat from core client for 30 sec - exiting
18:27:27 (17161): No heartbeat from core client for 30 sec - exiting
18:27:28 (17161): No heartbeat from core client for 30 sec - exiting
18:27:29 (17161): No heartbeat from core client for 30 sec - exiting
18:27:30 (17161): No heartbeat from core client for 30 sec - exiting
18:27:31 (17161): No heartbeat from core client for 30 sec - exiting
18:27:32 (17161): No heartbeat from core client for 30 sec - exiting
18:27:33 (17161): No heartbeat from core client for 30 sec - exiting
18:27:34 (17161): No heartbeat from core client for 30 sec - exiting
18:27:35 (17161): No heartbeat from core client for 30 sec - exiting
18:27:36 (17161): No heartbeat from core client for 30 sec - exiting
18:27:37 (17161): No heartbeat from core client for 30 sec - exiting
18:27:38 (17161): No heartbeat from core client for 30 sec - exiting
18:27:39 (17161): No heartbeat from core client for 30 sec - exiting
18:27:40 (17161): No heartbeat from core client for 30 sec - exiting
18:27:41 (17161): No heartbeat from core client for 30 sec - exiting
18:27:42 (17161): No heartbeat from core client for 30 sec - exiting
18:27:43 (17161): No heartbeat from core client for 30 sec - exiting
18:27:44 (17161): No heartbeat from core client for 30 sec - exiting
18:27:45 (17161): No heartbeat from core client for 30 sec - exiting
18:27:46 (17161): No heartbeat from core client for 30 sec - exiting
18:27:47 (17161): No heartbeat from core client for 30 sec - exiting
18:27:48 (17161): No heartbeat from core client for 30 sec - exiting
18:27:49 (17161): No heartbeat from core client for 30 sec - exiting
18:27:50 (17161): No heartbeat from core client for 30 sec - exiting
18:27:51 (17161): No heartbeat from core client for 30 sec - exiting
18:27:52 (17161): No heartbeat from core client for 30 sec - exiting
18:27:53 (17161): No heartbeat from core client for 30 sec - exiting
18:27:54 (17161): No heartbeat from core client for 30 sec - exiting
18:27:55 (17161): No heartbeat from core client for 30 sec - exiting
18:27:56 (17161): No heartbeat from core client for 30 sec - exiting
18:27:57 (17161): No heartbeat from core client for 30 sec - exiting
18:27:58 (17161): No heartbeat from core client for 30 sec - exiting
18:27:59 (17161): No heartbeat from core client for 30 sec - exiting
18:28:00 (17161): No heartbeat from core client for 30 sec - exiting
18:28:01 (17161): No heartbeat from core client for 30 sec - exiting
18:28:02 (17161): No heartbeat from core client for 30 sec - exiting
18:28:03 (17161): No heartbeat from core client for 30 sec - exiting
18:28:04 (17161): No heartbeat from core client for 30 sec - exiting
18:28:05 (17161): No heartbeat from core client for 30 sec - exiting
18:28:06 (17161): No heartbeat from core client for 30 sec - exiting
18:28:07 (17161): No heartbeat from core client for 30 sec - exiting
18:28:08 (17161): No heartbeat from core client for 30 sec - exiting
18:28:09 (17161): No heartbeat from core client for 30 sec - exiting
18:28:10 (17161): No heartbeat from core client for 30 sec - exiting
18:28:11 (17161): No heartbeat from core client for 30 sec - exiting
18:28:12 (17161): No heartbeat from core client for 30 sec - exiting
18:28:13 (17161): No heartbeat from core client for 30 sec - exiting
18:28:14 (17161): No heartbeat from core client for 30 sec - exiting
18:28:15 (17161): No heartbeat from core client for 30 sec - exiting
18:28:16 (17161): No heartbeat from core client for 30 sec - exiting
18:28:17 (17161): No heartbeat from core client for 30 sec - exiting
18:28:18 (17161): No heartbeat from core client for 30 sec - exiting
18:28:19 (17161): No heartbeat from core client for 30 sec - exiting
18:28:20 (17161): No heartbeat from core client for 30 sec - exiting
19:27:09 (17218): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:32:32 (17629): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:32:33 (17629): No heartbeat from core client for 30 sec - exiting
19:32:34 (17629): No heartbeat from core client for 30 sec - exiting
19:32:35 (17629): No heartbeat from core client for 30 sec - exiting
19:32:36 (17629): No heartbeat from core client for 30 sec - exiting
19:32:37 (17629): No heartbeat from core client for 30 sec - exiting
19:32:38 (17629): No heartbeat from core client for 30 sec - exiting
19:32:39 (17629): No heartbeat from core client for 30 sec - exiting
19:32:40 (17629): No heartbeat from core client for 30 sec - exiting
19:32:41 (17629): No heartbeat from core client for 30 sec - exiting
19:32:42 (17629): No heartbeat from core client for 30 sec - exiting
19:37:08 (17697): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:37:10 (17697): No heartbeat from core client for 30 sec - exiting
19:37:11 (17697): No heartbeat from core client for 30 sec - exiting
19:37:12 (17697): No heartbeat from core client for 30 sec - exiting
19:37:40 (17697): No heartbeat from core client for 30 sec - exiting
19:46:01 (17744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7707400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7707430]
/lib/libc.so.6(gsignal+0x4f)[0xf751b8cf]
/lib/libc.so.6(abort+0x143)[0xf751d1b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf7506825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf775a400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf775a430]
/lib/libc.so.6(gsignal+0x4f)[0xf756e8cf]
/lib/libc.so.6(abort+0x143)[0xf75701b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf7559825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77ae400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77ae430]
/lib/libc.so.6(gsignal+0x4f)[0xf75c28cf]
/lib/libc.so.6(abort+0x143)[0xf75c41b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf75ad825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf76f0400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf76f0430]
/lib/libc.so.6(gsignal+0x4f)[0xf75048cf]
/lib/libc.so.6(abort+0x143)[0xf75061b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf74ef825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77c0400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77c0430]
/lib/libc.so.6(gsignal+0x4f)[0xf75d48cf]
/lib/libc.so.6(abort+0x143)[0xf75d61b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf75bf825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7750400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7750430]
/lib/libc.so.6(gsignal+0x4f)[0xf75648cf]
/lib/libc.so.6(abort+0x143)[0xf75661b3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/libc.so.6(__libc_start_main+0xf5)[0xf754f825]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Nov 2013 03:44:01 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 388,800 1,025,039 2.6364
25 Nov 2013 03:34:47 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 362,880 955,581 2.6333
24 Nov 2013 04:00:37 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 336,960 886,603 2.6312
23 Nov 2013 06:31:47 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 311,040 817,046 2.6268
10 Nov 2013 11:03:28 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 285,120 748,798 2.6263
19 Oct 2013 13:22:34 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 259,200 681,907 2.6308
18 Oct 2013 13:02:22 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 233,280 613,569 2.6302
17 Oct 2013 12:51:03 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 207,360 545,398 2.6302
16 Oct 2013 12:11:05 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 181,440 476,071 2.6238
15 Oct 2013 11:52:35 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 155,520 406,957 2.6168
14 Oct 2013 11:53:38 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 129,600 338,516 2.6120
13 Oct 2013 11:56:10 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 103,680 270,915 2.6130
12 Oct 2013 11:13:16 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 77,760 203,344 2.6150
11 Oct 2013 11:18:43 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 51,840 135,802 2.6196
10 Oct 2013 11:22:41 1069378 16062773 hadcm3n_o8c1_1900_40_008465924_2 25,920 67,849 2.6176


©2024 cpdn.org