Name | hadcm3n_yal5_1940_40_007683068_0 |
Workunit | 7838155 |
Created | 16 Jan 2012, 3:27:09 UTC |
Sent | 16 Jan 2012, 16:56:30 UTC |
Report deadline | 17 Apr 2012, 0:23:41 UTC |
Received | 13 Feb 2012, 11:52:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1185641 |
Run time | 8 days 0 hours 59 min 43 sec |
CPU time | 7 days 11 hours 29 min 25 sec |
Validate state | Invalid |
Credit | 4,976.64 |
Device peak FLOPS | 3.57 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:40:56 (6973): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:50:15 (5038): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:50:16 (5038): No heartbeat from core client for 30 sec - exiting 11:50:17 (5038): No heartbeat from core client for 30 sec - exiting 11:50:18 (5038): No heartbeat from core client for 30 sec - exiting 11:50:19 (5038): No heartbeat from core client for 30 sec - exiting 11:50:20 (5038): No heartbeat from core client for 30 sec - exiting 11:50:21 (5038): No heartbeat from core client for 30 sec - exiting 11:50:22 (5038): No heartbeat from core client for 30 sec - exiting 11:50:23 (5038): No heartbeat from core client for 30 sec - exiting 11:50:24 (5038): No heartbeat from core client for 30 sec - exiting 11:50:25 (5038): No heartbeat from core client for 30 sec - exiting 11:50:26 (5038): No heartbeat from core client for 30 sec - exiting 11:50:27 (5038): No heartbeat from core client for 30 sec - exiting 11:50:28 (5038): No heartbeat from core client for 30 sec - exiting 11:50:29 (5038): No heartbeat from core client for 30 sec - exiting 11:50:30 (5038): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7743400] [0xf7743430] /lib32/libc.so.6(gsignal+0x4f)[0xf7587c4f] /lib32/libc.so.6(abort+0x175)[0xf758b175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75730f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77ca400] [0xf77ca430] /lib32/libc.so.6(gsignal+0x4f)[0xf760ec4f] /lib32/libc.so.6(abort+0x175)[0xf7612175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75fa0f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7793400] [0xf7793430] /lib32/libc.so.6(gsignal+0x4f)[0xf75d7c4f] /lib32/libc.so.6(abort+0x175)[0xf75db175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c30f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e1400] [0xf76e1430] /lib32/libc.so.6(gsignal+0x4f)[0xf7525c4f] /lib32/libc.so.6(abort+0x175)[0xf7529175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75110f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76f3400] [0xf76f3430] /lib32/libc.so.6(gsignal+0x4f)[0xf7537c4f] /lib32/libc.so.6(abort+0x175)[0xf753b175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75230f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77b4400] [0xf77b4430] /lib32/libc.so.6(gsignal+0x4f)[0xf75f8c4f] /lib32/libc.so.6(abort+0x175)[0xf75fc175] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xf3)[0xf75e40f3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5825, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Feb 2012 05:32:22 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 414,720 | 626,039 | 1.5095 |
12 Feb 2012 16:11:35 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 388,800 | 586,633 | 1.5088 |
12 Feb 2012 02:31:35 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 362,880 | 547,180 | 1.5079 |
11 Feb 2012 13:50:22 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 336,960 | 507,643 | 1.5065 |
11 Feb 2012 01:15:59 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 311,040 | 468,064 | 1.5048 |
10 Feb 2012 10:45:17 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 285,120 | 428,576 | 1.5031 |
09 Feb 2012 22:22:12 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 259,200 | 388,449 | 1.4986 |
09 Feb 2012 10:48:00 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 233,280 | 348,330 | 1.4932 |
08 Feb 2012 19:18:59 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 207,360 | 308,451 | 1.4875 |
07 Feb 2012 14:27:23 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 181,440 | 268,360 | 1.4791 |
04 Feb 2012 02:29:49 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 155,520 | 232,344 | 1.4940 |
03 Feb 2012 11:06:23 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 129,600 | 197,879 | 1.5268 |
01 Feb 2012 19:36:37 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 103,680 | 160,873 | 1.5516 |
01 Feb 2012 06:22:16 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 77,760 | 120,820 | 1.5538 |
31 Jan 2012 17:37:30 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 51,840 | 81,065 | 1.5638 |
31 Jan 2012 00:07:03 | 1185641 | 13925755 | hadcm3n_yal5_1940_40_007683068_0 | 25,920 | 40,754 | 1.5723 |
©2024 cpdn.org