Name | hadcm3n_o8c1_1900_40_008465924_2 |
Workunit | 8616763 |
Created | 9 Oct 2013, 10:04:58 UTC |
Sent | 9 Oct 2013, 10:15:22 UTC |
Report deadline | 8 Jan 2014, 17:42:33 UTC |
Received | 26 Nov 2013, 23:18:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1069378 |
Run time | 15 days 3 hours 10 min 52 sec |
CPU time | 12 days 8 hours 12 min 4 sec |
Validate state | Invalid |
Credit | 4,665.60 |
Device peak FLOPS | 1.41 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:41:25 (5195): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:41:32 (5195): No heartbeat from core client for 30 sec - exiting 22:41:33 (5195): No heartbeat from core client for 30 sec - exiting 22:41:34 (5195): No heartbeat from core client for 30 sec - exiting 22:41:35 (5195): No heartbeat from core client for 30 sec - exiting Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish 18:34:39 (9623): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:34:43 (9623): No heartbeat from core client for 30 sec - exiting 18:34:44 (9623): No heartbeat from core client for 30 sec - exiting 19:05:24 (6478): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:24:24 (6801): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:27:37 (6946): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:27:38 (6946): No heartbeat from core client for 30 sec - exiting 19:27:39 (6946): No heartbeat from core client for 30 sec - exiting 19:27:40 (6946): No heartbeat from core client for 30 sec - exiting 19:27:41 (6946): No heartbeat from core client for 30 sec - exiting 19:27:42 (6946): No heartbeat from core client for 30 sec - exiting 19:27:43 (6946): No heartbeat from core client for 30 sec - exiting 19:38:15 (6981): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:38:48 (6981): No heartbeat from core client for 30 sec - exiting 20:37:42 (7276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:39:38 (7276): No heartbeat from core client for 30 sec - exiting 20:39:39 (7276): No heartbeat from core client for 30 sec - exiting 20:39:40 (7276): No heartbeat from core client for 30 sec - exiting 20:39:41 (7276): No heartbeat from core client for 30 sec - exiting 20:39:42 (7276): No heartbeat from core client for 30 sec - exiting 20:39:43 (7276): No heartbeat from core client for 30 sec - exiting 20:39:44 (7276): No heartbeat from core client for 30 sec - exiting 20:39:45 (7276): No heartbeat from core client for 30 sec - exiting 20:39:46 (7276): No heartbeat from core client for 30 sec - exiting 20:39:47 (7276): No heartbeat from core client for 30 sec - exiting 20:39:48 (7276): No heartbeat from core client for 30 sec - exiting 20:39:49 (7276): No heartbeat from core client for 30 sec - exiting 20:39:50 (7276): No heartbeat from core client for 30 sec - exiting 20:39:51 (7276): No heartbeat from core client for 30 sec - exiting 20:39:52 (7276): No heartbeat from core client for 30 sec - exiting 20:39:53 (7276): No heartbeat from core client for 30 sec - exiting 20:39:54 (7276): No heartbeat from core client for 30 sec - exiting 20:39:55 (7276): No heartbeat from core client for 30 sec - exiting 20:39:56 (7276): No heartbeat from core client for 30 sec - exiting 20:39:57 (7276): No heartbeat from core client for 30 sec - exiting 20:39:58 (7276): No heartbeat from core client for 30 sec - exiting 20:39:59 (7276): No heartbeat from core client for 30 sec - exiting 20:40:00 (7276): No heartbeat from core client for 30 sec - exiting 20:40:01 (7276): No heartbeat from core client for 30 sec - exiting 20:40:02 (7276): No heartbeat from core client for 30 sec - exiting 21:08:25 (7990): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:11 (8188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:52 (8188): No heartbeat from core client for 30 sec - exiting 21:15:53 (8188): No heartbeat from core client for 30 sec - exiting 21:15:54 (8188): No heartbeat from core client for 30 sec - exiting 21:15:55 (8188): No heartbeat from core client for 30 sec - exiting 21:15:56 (8188): No heartbeat from core client for 30 sec - exiting 17:52:26 (8275): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:03:56 (16849): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:17 (16849): No heartbeat from core client for 30 sec - exiting 18:05:41 (16960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:10:57 (16993): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:15:13 (17037): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:15:48 (17037): No heartbeat from core client for 30 sec - exiting 18:15:49 (17037): No heartbeat from core client for 30 sec - exiting 18:15:50 (17037): No heartbeat from core client for 30 sec - exiting 18:15:51 (17037): No heartbeat from core client for 30 sec - exiting 18:21:31 (17105): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:27:13 (17161): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:27:14 (17161): No heartbeat from core client for 30 sec - exiting 18:27:15 (17161): No heartbeat from core client for 30 sec - exiting 18:27:16 (17161): No heartbeat from core client for 30 sec - exiting 18:27:17 (17161): No heartbeat from core client for 30 sec - exiting 18:27:18 (17161): No heartbeat from core client for 30 sec - exiting 18:27:19 (17161): No heartbeat from core client for 30 sec - exiting 18:27:20 (17161): No heartbeat from core client for 30 sec - exiting 18:27:21 (17161): No heartbeat from core client for 30 sec - exiting 18:27:22 (17161): No heartbeat from core client for 30 sec - exiting 18:27:23 (17161): No heartbeat from core client for 30 sec - exiting 18:27:24 (17161): No heartbeat from core client for 30 sec - exiting 18:27:25 (17161): No heartbeat from core client for 30 sec - exiting 18:27:26 (17161): No heartbeat from core client for 30 sec - exiting 18:27:27 (17161): No heartbeat from core client for 30 sec - exiting 18:27:28 (17161): No heartbeat from core client for 30 sec - exiting 18:27:29 (17161): No heartbeat from core client for 30 sec - exiting 18:27:30 (17161): No heartbeat from core client for 30 sec - exiting 18:27:31 (17161): No heartbeat from core client for 30 sec - exiting 18:27:32 (17161): No heartbeat from core client for 30 sec - exiting 18:27:33 (17161): No heartbeat from core client for 30 sec - exiting 18:27:34 (17161): No heartbeat from core client for 30 sec - exiting 18:27:35 (17161): No heartbeat from core client for 30 sec - exiting 18:27:36 (17161): No heartbeat from core client for 30 sec - exiting 18:27:37 (17161): No heartbeat from core client for 30 sec - exiting 18:27:38 (17161): No heartbeat from core client for 30 sec - exiting 18:27:39 (17161): No heartbeat from core client for 30 sec - exiting 18:27:40 (17161): No heartbeat from core client for 30 sec - exiting 18:27:41 (17161): No heartbeat from core client for 30 sec - exiting 18:27:42 (17161): No heartbeat from core client for 30 sec - exiting 18:27:43 (17161): No heartbeat from core client for 30 sec - exiting 18:27:44 (17161): No heartbeat from core client for 30 sec - exiting 18:27:45 (17161): No heartbeat from core client for 30 sec - exiting 18:27:46 (17161): No heartbeat from core client for 30 sec - exiting 18:27:47 (17161): No heartbeat from core client for 30 sec - exiting 18:27:48 (17161): No heartbeat from core client for 30 sec - exiting 18:27:49 (17161): No heartbeat from core client for 30 sec - exiting 18:27:50 (17161): No heartbeat from core client for 30 sec - exiting 18:27:51 (17161): No heartbeat from core client for 30 sec - exiting 18:27:52 (17161): No heartbeat from core client for 30 sec - exiting 18:27:53 (17161): No heartbeat from core client for 30 sec - exiting 18:27:54 (17161): No heartbeat from core client for 30 sec - exiting 18:27:55 (17161): No heartbeat from core client for 30 sec - exiting 18:27:56 (17161): No heartbeat from core client for 30 sec - exiting 18:27:57 (17161): No heartbeat from core client for 30 sec - exiting 18:27:58 (17161): No heartbeat from core client for 30 sec - exiting 18:27:59 (17161): No heartbeat from core client for 30 sec - exiting 18:28:00 (17161): No heartbeat from core client for 30 sec - exiting 18:28:01 (17161): No heartbeat from core client for 30 sec - exiting 18:28:02 (17161): No heartbeat from core client for 30 sec - exiting 18:28:03 (17161): No heartbeat from core client for 30 sec - exiting 18:28:04 (17161): No heartbeat from core client for 30 sec - exiting 18:28:05 (17161): No heartbeat from core client for 30 sec - exiting 18:28:06 (17161): No heartbeat from core client for 30 sec - exiting 18:28:07 (17161): No heartbeat from core client for 30 sec - exiting 18:28:08 (17161): No heartbeat from core client for 30 sec - exiting 18:28:09 (17161): No heartbeat from core client for 30 sec - exiting 18:28:10 (17161): No heartbeat from core client for 30 sec - exiting 18:28:11 (17161): No heartbeat from core client for 30 sec - exiting 18:28:12 (17161): No heartbeat from core client for 30 sec - exiting 18:28:13 (17161): No heartbeat from core client for 30 sec - exiting 18:28:14 (17161): No heartbeat from core client for 30 sec - exiting 18:28:15 (17161): No heartbeat from core client for 30 sec - exiting 18:28:16 (17161): No heartbeat from core client for 30 sec - exiting 18:28:17 (17161): No heartbeat from core client for 30 sec - exiting 18:28:18 (17161): No heartbeat from core client for 30 sec - exiting 18:28:19 (17161): No heartbeat from core client for 30 sec - exiting 18:28:20 (17161): No heartbeat from core client for 30 sec - exiting 19:27:09 (17218): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:32 (17629): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:33 (17629): No heartbeat from core client for 30 sec - exiting 19:32:34 (17629): No heartbeat from core client for 30 sec - exiting 19:32:35 (17629): No heartbeat from core client for 30 sec - exiting 19:32:36 (17629): No heartbeat from core client for 30 sec - exiting 19:32:37 (17629): No heartbeat from core client for 30 sec - exiting 19:32:38 (17629): No heartbeat from core client for 30 sec - exiting 19:32:39 (17629): No heartbeat from core client for 30 sec - exiting 19:32:40 (17629): No heartbeat from core client for 30 sec - exiting 19:32:41 (17629): No heartbeat from core client for 30 sec - exiting 19:32:42 (17629): No heartbeat from core client for 30 sec - exiting 19:37:08 (17697): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:37:10 (17697): No heartbeat from core client for 30 sec - exiting 19:37:11 (17697): No heartbeat from core client for 30 sec - exiting 19:37:12 (17697): No heartbeat from core client for 30 sec - exiting 19:37:40 (17697): No heartbeat from core client for 30 sec - exiting 19:46:01 (17744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7707400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7707430] /lib/libc.so.6(gsignal+0x4f)[0xf751b8cf] /lib/libc.so.6(abort+0x143)[0xf751d1b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf7506825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf775a400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf775a430] /lib/libc.so.6(gsignal+0x4f)[0xf756e8cf] /lib/libc.so.6(abort+0x143)[0xf75701b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf7559825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77ae400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77ae430] /lib/libc.so.6(gsignal+0x4f)[0xf75c28cf] /lib/libc.so.6(abort+0x143)[0xf75c41b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75ad825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf76f0400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf76f0430] /lib/libc.so.6(gsignal+0x4f)[0xf75048cf] /lib/libc.so.6(abort+0x143)[0xf75061b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf74ef825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77c0400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77c0430] /lib/libc.so.6(gsignal+0x4f)[0xf75d48cf] /lib/libc.so.6(abort+0x143)[0xf75d61b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75bf825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7750400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7750430] /lib/libc.so.6(gsignal+0x4f)[0xf75648cf] /lib/libc.so.6(abort+0x143)[0xf75661b3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/alexst/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf754f825] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17836, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Nov 2013 03:44:01 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 388,800 | 1,025,039 | 2.6364 |
25 Nov 2013 03:34:47 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 362,880 | 955,581 | 2.6333 |
24 Nov 2013 04:00:37 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 336,960 | 886,603 | 2.6312 |
23 Nov 2013 06:31:47 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 311,040 | 817,046 | 2.6268 |
10 Nov 2013 11:03:28 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 285,120 | 748,798 | 2.6263 |
19 Oct 2013 13:22:34 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 259,200 | 681,907 | 2.6308 |
18 Oct 2013 13:02:22 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 233,280 | 613,569 | 2.6302 |
17 Oct 2013 12:51:03 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 207,360 | 545,398 | 2.6302 |
16 Oct 2013 12:11:05 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 181,440 | 476,071 | 2.6238 |
15 Oct 2013 11:52:35 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 155,520 | 406,957 | 2.6168 |
14 Oct 2013 11:53:38 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 129,600 | 338,516 | 2.6120 |
13 Oct 2013 11:56:10 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 103,680 | 270,915 | 2.6130 |
12 Oct 2013 11:13:16 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 77,760 | 203,344 | 2.6150 |
11 Oct 2013 11:18:43 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 51,840 | 135,802 | 2.6196 |
10 Oct 2013 11:22:41 | 1069378 | 16062773 | hadcm3n_o8c1_1900_40_008465924_2 | 25,920 | 67,849 | 2.6176 |
©2024 cpdn.org