Name | hadcm3n_4bjw_1940_40_008308841_0 |
Workunit | 8459976 |
Created | 7 Feb 2013, 18:54:12 UTC |
Sent | 7 Feb 2013, 19:45:43 UTC |
Report deadline | 10 May 2013, 3:12:54 UTC |
Received | 19 Feb 2013, 0:35:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1219011 |
Run time | 5 days 4 hours 12 min 57 sec |
CPU time | 4 days 22 hours 16 min 58 sec |
Validate state | Invalid |
Credit | 4,043.52 |
Device peak FLOPS | 4.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:53:40 (3483): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 00:57:33 (14684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:57:34 (14684): No heartbeat from core client for 30 sec - exiting 00:57:35 (14684): No heartbeat from core client for 30 sec - exiting 00:57:36 (14684): No heartbeat from core client for 30 sec - exiting 00:57:37 (14684): No heartbeat from core client for 30 sec - exiting 00:57:38 (14684): No heartbeat from core client for 30 sec - exiting 00:57:39 (14684): No heartbeat from core client for 30 sec - exiting 00:57:40 (14684): No heartbeat from core client for 30 sec - exiting 00:57:41 (14684): No heartbeat from core client for 30 sec - exiting 00:57:42 (14684): No heartbeat from core client for 30 sec - exiting 00:57:43 (14684): No heartbeat from core client for 30 sec - exiting 00:57:44 (14684): No heartbeat from core client for 30 sec - exiting 00:57:45 (14684): No heartbeat from core client for 30 sec - exiting 00:57:46 (14684): No heartbeat from core client for 30 sec - exiting 00:57:47 (14684): No heartbeat from core client for 30 sec - exiting 00:57:48 (14684): No heartbeat from core client for 30 sec - exiting 00:57:49 (14684): No heartbeat from core client for 30 sec - exiting 00:57:50 (14684): No heartbeat from core client for 30 sec - exiting 00:57:51 (14684): No heartbeat from core client for 30 sec - exiting 00:57:52 (14684): No heartbeat from core client for 30 sec - exiting 00:57:53 (14684): No heartbeat from core client for 30 sec - exiting 00:57:54 (14684): No heartbeat from core client for 30 sec - exiting 00:57:55 (14684): No heartbeat from core client for 30 sec - exiting 00:57:56 (14684): No heartbeat from core client for 30 sec - exiting 00:57:57 (14684): No heartbeat from core client for 30 sec - exiting 00:57:58 (14684): No heartbeat from core client for 30 sec - exiting 00:57:59 (14684): No heartbeat from core client for 30 sec - exiting 00:58:00 (14684): No heartbeat from core client for 30 sec - exiting 00:58:01 (14684): No heartbeat from core client for 30 sec - exiting 00:58:02 (14684): No heartbeat from core client for 30 sec - exiting 00:58:03 (14684): No heartbeat from core client for 30 sec - exiting 00:58:04 (14684): No heartbeat from core client for 30 sec - exiting 00:58:05 (14684): No heartbeat from core client for 30 sec - exiting 00:58:06 (14684): No heartbeat from core client for 30 sec - exiting 00:58:07 (14684): No heartbeat from core client for 30 sec - exiting 00:58:08 (14684): No heartbeat from core client for 30 sec - exiting 00:58:09 (14684): No heartbeat from core client for 30 sec - exiting 00:58:10 (14684): No heartbeat from core client for 30 sec - exiting 00:58:11 (14684): No heartbeat from core client for 30 sec - exiting 00:58:12 (14684): No heartbeat from core client for 30 sec - exiting 00:58:13 (14684): No heartbeat from core client for 30 sec - exiting 00:58:14 (14684): No heartbeat from core client for 30 sec - exiting 00:58:15 (14684): No heartbeat from core client for 30 sec - exiting 00:58:16 (14684): No heartbeat from core client for 30 sec - exiting 00:58:17 (14684): No heartbeat from core client for 30 sec - exiting 00:58:18 (14684): No heartbeat from core client for 30 sec - exiting 00:58:19 (14684): No heartbeat from core client for 30 sec - exiting 00:58:20 (14684): No heartbeat from core client for 30 sec - exiting 00:58:21 (14684): No heartbeat from core client for 30 sec - exiting 01:30:57 (14928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:58 (14928): No heartbeat from core client for 30 sec - exiting 01:30:59 (14928): No heartbeat from core client for 30 sec - exiting 01:31:00 (14928): No heartbeat from core client for 30 sec - exiting 01:31:01 (14928): No heartbeat from core client for 30 sec - exiting 01:31:02 (14928): No heartbeat from core client for 30 sec - exiting 01:31:03 (14928): No heartbeat from core client for 30 sec - exiting 01:31:04 (14928): No heartbeat from core client for 30 sec - exiting 01:31:05 (14928): No heartbeat from core client for 30 sec - exiting 01:31:06 (14928): No heartbeat from core client for 30 sec - exiting 01:31:07 (14928): No heartbeat from core client for 30 sec - exiting 01:31:08 (14928): No heartbeat from core client for 30 sec - exiting 01:31:09 (14928): No heartbeat from core client for 30 sec - exiting 01:31:10 (14928): No heartbeat from core client for 30 sec - exiting 01:31:11 (14928): No heartbeat from core client for 30 sec - exiting 01:31:12 (14928): No heartbeat from core client for 30 sec - exiting 01:31:13 (14928): No heartbeat from core client for 30 sec - exiting 01:31:14 (14928): No heartbeat from core client for 30 sec - exiting 01:31:15 (14928): No heartbeat from core client for 30 sec - exiting 01:31:16 (14928): No heartbeat from core client for 30 sec - exiting 01:31:17 (14928): No heartbeat from core client for 30 sec - exiting 01:31:18 (14928): No heartbeat from core client for 30 sec - exiting 01:31:19 (14928): No heartbeat from core client for 30 sec - exiting 01:31:20 (14928): No heartbeat from core client for 30 sec - exiting 01:31:21 (14928): No heartbeat from core client for 30 sec - exiting 01:31:22 (14928): No heartbeat from core client for 30 sec - exiting 01:31:23 (14928): No heartbeat from core client for 30 sec - exiting 01:31:24 (14928): No heartbeat from core client for 30 sec - exiting 01:31:25 (14928): No heartbeat from core client for 30 sec - exiting 01:31:26 (14928): No heartbeat from core client for 30 sec - exiting 01:31:27 (14928): No heartbeat from core client for 30 sec - exiting 01:31:28 (14928): No heartbeat from core client for 30 sec - exiting 01:31:29 (14928): No heartbeat from core client for 30 sec - exiting 01:31:30 (14928): No heartbeat from core client for 30 sec - exiting 01:31:31 (14928): No heartbeat from core client for 30 sec - exiting 01:31:32 (14928): No heartbeat from core client for 30 sec - exiting 01:31:33 (14928): No heartbeat from core client for 30 sec - exiting 01:31:34 (14928): No heartbeat from core client for 30 sec - exiting 01:31:35 (14928): No heartbeat from core client for 30 sec - exiting 01:31:36 (14928): No heartbeat from core client for 30 sec - exiting 01:31:37 (14928): No heartbeat from core client for 30 sec - exiting 01:31:38 (14928): No heartbeat from core client for 30 sec - exiting 01:31:39 (14928): No heartbeat from core client for 30 sec - exiting 01:31:40 (14928): No heartbeat from core client for 30 sec - exiting 01:31:41 (14928): No heartbeat from core client for 30 sec - exiting 01:31:42 (14928): No heartbeat from core client for 30 sec - exiting 01:31:43 (14928): No heartbeat from core client for 30 sec - exiting 01:31:44 (14928): No heartbeat from core client for 30 sec - exiting 01:31:45 (14928): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf773a400] [0xf773a430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf754c1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf754f825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75374d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7715400] [0xf7715430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75271df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752a825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75124d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76dd400] [0xf76dd430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74ef1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74f2825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74da4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77c7400] [0xf77c7430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d91df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75dc825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c44d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf772e400] [0xf772e430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75401df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7543825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf752b4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7733400] [0xf7733430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75451df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7548825] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /media/iscsi/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75304d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16529, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Feb 2013 03:02:00 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 336,960 | 402,949 | 1.1958 |
17 Feb 2013 18:11:37 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 311,040 | 372,511 | 1.1976 |
14 Feb 2013 01:54:12 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 285,120 | 343,643 | 1.2053 |
13 Feb 2013 13:20:13 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 259,200 | 311,736 | 1.2027 |
13 Feb 2013 03:35:15 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 233,280 | 278,186 | 1.1925 |
12 Feb 2013 05:43:58 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 207,360 | 244,546 | 1.1793 |
11 Feb 2013 04:00:10 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 181,440 | 214,374 | 1.1815 |
10 Feb 2013 18:17:08 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 155,520 | 182,387 | 1.1728 |
09 Feb 2013 16:45:57 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 129,600 | 149,017 | 1.1498 |
09 Feb 2013 07:06:25 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 103,680 | 118,798 | 1.1458 |
08 Feb 2013 22:29:02 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 77,760 | 89,075 | 1.1455 |
08 Feb 2013 13:55:25 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 51,840 | 59,184 | 1.1417 |
08 Feb 2013 05:20:50 | 1219011 | 15595753 | hadcm3n_4bjw_1940_40_008308841_0 | 25,920 | 29,293 | 1.1301 |
©2024 cpdn.org