Name | hadcm3n_z8bc_1880_40_008201503_1 |
Workunit | 8356627 |
Created | 13 Sep 2012, 13:39:28 UTC |
Sent | 13 Sep 2012, 13:42:22 UTC |
Report deadline | 13 Dec 2012, 21:09:33 UTC |
Received | 16 Oct 2012, 1:52:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1291504 |
Run time | 26 days 22 hours 5 min 58 sec |
CPU time | 24 days 15 hours 6 min 46 sec |
Validate state | Invalid |
Credit | 11,508.48 |
Device peak FLOPS | 2.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish 01:02:10 (26869): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:26 (27898): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:29 (27898): No heartbeat from core client for 30 sec - exiting 01:03:30 (27898): No heartbeat from core client for 30 sec - exiting 01:03:31 (27898): No heartbeat from core client for 30 sec - exiting 01:06:49 (27934): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:06:50 (27934): No heartbeat from core client for 30 sec - exiting 01:06:51 (27934): No heartbeat from core client for 30 sec - exiting 01:06:52 (27934): No heartbeat from core client for 30 sec - exiting 01:06:53 (27934): No heartbeat from core client for 30 sec - exiting 01:06:54 (27934): No heartbeat from core client for 30 sec - exiting 01:06:55 (27934): No heartbeat from core client for 30 sec - exiting 01:40:31 (27969): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=28370, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 
01:05:03 (28370): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:05:05 (28370): No heartbeat from core client for 30 sec - exiting 01:05:06 (28370): No heartbeat from core client for 30 sec - exiting 01:05:07 (28370): No heartbeat from core client for 30 sec - exiting 01:05:08 (28370): No heartbeat from core client for 30 sec - exiting 01:05:09 (28370): No heartbeat from core client for 30 sec - exiting 01:05:10 (28370): No heartbeat from core client for 30 sec - exiting 01:05:11 (28370): No heartbeat from core client for 30 sec - exiting 01:05:12 (28370): No heartbeat from core client for 30 sec - exiting Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:22:13 (5573): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:22:14 (5573): No heartbeat from core client for 30 sec - exiting 14:22:15 (5573): No heartbeat from core client for 30 sec - exiting 14:22:16 (5573): No heartbeat from core client for 30 sec - exiting 14:22:17 (5573): No heartbeat from core client for 30 sec - exiting 14:22:18 (5573): No heartbeat from core client for 30 sec - exiting Signal 1 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:04:09 (13594): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
01:04:12 (13594): No heartbeat from core client for 30 sec - exiting 12:42:22 (1780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:42:23 (1780): No heartbeat from core client for 30 sec - exiting 12:42:24 (1780): No heartbeat from core client for 30 sec - exiting 12:42:25 (1780): No heartbeat from core client for 30 sec - exiting 12:42:26 (1780): No heartbeat from core client for 30 sec - exiting 12:42:27 (1780): No heartbeat from core client for 30 sec - exiting 12:42:28 (1780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7755400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7755430] /lib/libc.so.6(gsignal+0x4f)[0xf758f31f] /lib/libc.so.6(abort+0x143)[0xf7590c03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf757a3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7723400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7723430] /lib/libc.so.6(gsignal+0x4f)[0xf755d31f] /lib/libc.so.6(abort+0x143)[0xf755ec03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75483d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7772400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7772430] /lib/libc.so.6(gsignal+0x4f)[0xf75ac31f] /lib/libc.so.6(abort+0x143)[0xf75adc03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75973d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7725400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7725430] /lib/libc.so.6(gsignal+0x4f)[0xf755f31f] /lib/libc.so.6(abort+0x143)[0xf7560c03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf754a3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7701400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7701430] /lib/libc.so.6(gsignal+0x4f)[0xf753b31f] /lib/libc.so.6(abort+0x143)[0xf753cc03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75263d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7756400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7756430] /lib/libc.so.6(gsignal+0x4f)[0xf759031f] /lib/libc.so.6(abort+0x143)[0xf7591c03] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/va3rcc/Downloads/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf757b3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4907, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
15 Oct 2012 18:01:13 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 959,040 | 2,100,145 | 2.1898 |
15 Oct 2012 02:34:42 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 933,120 | 2,046,619 | 2.1933 |
14 Oct 2012 12:11:49 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 907,200 | 1,993,596 | 2.1975 |
13 Oct 2012 21:14:20 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 881,280 | 1,941,413 | 2.2029 |
13 Oct 2012 06:42:07 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 855,360 | 1,889,272 | 2.2087 |
12 Oct 2012 16:06:37 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 829,440 | 1,837,248 | 2.2150 |
12 Oct 2012 00:23:40 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 803,520 | 1,780,897 | 2.2164 |
11 Oct 2012 07:45:30 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 777,600 | 1,721,232 | 2.2135 |
10 Oct 2012 16:22:53 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 751,680 | 1,663,131 | 2.2126 |
09 Oct 2012 23:54:25 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 725,760 | 1,605,466 | 2.2121 |
09 Oct 2012 07:40:18 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 699,840 | 1,548,609 | 2.2128 |
08 Oct 2012 14:12:09 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 673,920 | 1,491,917 | 2.2138 |
07 Oct 2012 22:13:18 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 648,000 | 1,433,968 | 2.2129 |
07 Oct 2012 04:53:51 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 622,080 | 1,374,408 | 2.2094 |
06 Oct 2012 12:18:25 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 596,160 | 1,314,786 | 2.2054 |
05 Oct 2012 19:22:23 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 570,240 | 1,254,878 | 2.2006 |
05 Oct 2012 03:06:18 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 544,320 | 1,194,639 | 2.1947 |
04 Oct 2012 09:43:17 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 518,400 | 1,134,526 | 2.1885 |
03 Oct 2012 16:58:40 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 492,480 | 1,074,735 | 2.1823 |
03 Oct 2012 00:44:52 | 1217251 | 15282303 | hadcm3n_z8bc_1880_40_008201503_1 | 466,560 | 1,016,784 | 2.1793 |
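
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle. The following is a minimal Python sketch under that assumption, using three rows copied from the table above. The function name `avg_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN site.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds).
trickles = [
    (959_040, 2_100_145),
    (933_120, 2_046_619),
    (466_560, 1_016_784),
]


def avg_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds divided by timesteps completed, rounded to 4 places.

    Assumption: this is how the site derives its "Average (sec/TS)" column.
    """
    return round(cpu_seconds / timestep, 4)


averages = [avg_sec_per_timestep(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>9,} timesteps  {cpu:>11,} s  {avg:.4f} sec/TS")
```

For these three rows the computed values match the table's figures of 2.1898, 2.1933, and 2.1793, which is consistent with the assumed formula but does not confirm how the site calculates the column.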