Name | hadcm3n_ygju_1900_40_007354308_0 |
Workunit | 7551738 |
Created | 6 Jul 2011, 14:32:49 UTC |
Sent | 15 Jul 2011, 14:15:30 UTC |
Report deadline | 14 Oct 2011, 21:42:41 UTC |
Received | 18 Jul 2011, 21:43:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1147202 |
Run time | 3 days 7 hours 6 min 23 sec |
CPU time | 2 days 20 hours 24 min 30 sec |
Validate state | Invalid |
Credit | 1,244.16 |
Device peak FLOPS | 2.47 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.59</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 23:29:24 (2533): No heartbeat from core client for 30 sec - exiting 23:29:27 (2533): No heartbeat from core client for 30 sec - exiting 23:29:28 (2533): No heartbeat from core client for 30 sec - exiting 23:29:30 (2533): No heartbeat from core client for 30 sec - exiting 23:29:32 (2533): No heartbeat from core client for 30 sec - exiting 23:29:34 (2533): No heartbeat from core client for 30 sec - exiting 23:29:35 (2533): No heartbeat from core client for 30 sec - exiting 23:29:36 (2533): No heartbeat from core client for 30 sec - exiting 23:29:37 (2533): No heartbeat from core client for 30 sec - exiting 23:29:38 (2533): No heartbeat from core client for 30 sec - exiting 23:29:39 (2533): No heartbeat from core client for 30 sec - exiting 23:29:40 (2533): No heartbeat from core client for 30 sec - exiting 23:29:41 (2533): No heartbeat from core client for 30 sec - exiting 23:29:42 (2533): No heartbeat from core client for 30 sec - exiting 23:29:43 (2533): No heartbeat from core client for 30 sec - exiting 23:29:44 (2533): No heartbeat from core client for 30 sec - exiting 23:29:45 (2533): No heartbeat from core client for 30 sec - exiting 23:29:47 (2533): No heartbeat from core client for 30 sec - exiting 23:29:48 (2533): No heartbeat from core client for 30 sec - exiting 23:29:49 (2533): No heartbeat from core client for 30 sec - exiting 23:29:50 (2533): No heartbeat from core client for 30 sec - exiting 23:29:51 (2533): No heartbeat from core client for 30 sec - exiting 23:29:52 (2533): No heartbeat from core client for 30 sec - exiting 23:29:53 (2533): No heartbeat from core client for 30 sec - exiting 23:29:54 (2533): No heartbeat from core client for 30 sec - exiting 23:29:55 (2533): No heartbeat from core client for 30 sec - exiting 23:29:56 (2533): No heartbeat from core client for 30 sec - exiting 23:29:57 (2533): No heartbeat from core client for 30 sec - exiting 23:29:58 (2533): No heartbeat from core client for 30 sec - exiting 23:29:59 (2533): No heartbeat from core client for 30 sec - exiting 23:30:00 (2533): No heartbeat from core client for 30 sec - exiting 23:30:01 (2533): No heartbeat from core client for 30 sec - exiting 23:30:02 (2533): No heartbeat from core client for 30 sec - exiting 23:30:03 (2533): No heartbeat from core client for 30 sec - exiting 23:30:04 (2533): No heartbeat from core client for 30 sec - exiting 23:30:05 (2533): No heartbeat from core client for 30 sec - exiting 23:30:06 (2533): No heartbeat from core client for 30 sec - exiting 23:30:07 (2533): No heartbeat from core client for 30 sec - exiting 23:30:08 (2533): No heartbeat from core client for 30 sec - exiting 23:30:09 (2533): No heartbeat from core client for 30 sec - exiting 23:30:10 (2533): No heartbeat from core client for 30 sec - exiting 23:30:11 (2533): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:10 (3947): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:11 (3947): No heartbeat from core client for 30 sec - exiting 16:31:12 (3947): No heartbeat from core client for 30 sec - exiting 16:31:13 (3947): No heartbeat from core client for 30 sec - exiting 16:31:14 (3947): No heartbeat from core client for 30 sec - exiting 16:31:15 (3947): No heartbeat from core client for 30 sec - exiting 16:31:16 (3947): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf771b400] [0xf771b430] /lib32/libc.so.6(gsignal+0x51)[0xf7569ea1] /lib32/libc.so.6(abort+0x17e)[0xf756d2ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7555e37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7738400] [0xf7738430] /lib32/libc.so.6(gsignal+0x51)[0xf7586ea1] /lib32/libc.so.6(abort+0x17e)[0xf758a2ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7572e37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7790400] [0xf7790430] /lib32/libc.so.6(gsignal+0x51)[0xf75deea1] /lib32/libc.so.6(abort+0x17e)[0xf75e22ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf75cae37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77cc400] [0xf77cc430] /lib32/libc.so.6(gsignal+0x51)[0xf761aea1] /lib32/libc.so.6(abort+0x17e)[0xf761e2ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7606e37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76ff400] [0xf76ff430] /lib32/libc.so.6(gsignal+0x51)[0xf754dea1] /lib32/libc.so.6(abort+0x17e)[0xf75512ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7539e37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7750400] [0xf7750430] /lib32/libc.so.6(gsignal+0x51)[0xf759eea1] /lib32/libc.so.6(abort+0x17e)[0xf75a22ce] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf758ae37] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12980, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Jul 2011 17:31:57 | 1147202 | 13112486 | hadcm3n_ygju_1900_40_007354308_0 | 103,680 | 231,969 | 2.2374 |
25 Jul 2011 16:34:37 | 1147202 | 13112486 | hadcm3n_ygju_1900_40_007354308_0 | 77,760 | 174,185 | 2.2400 |
25 Jul 2011 15:56:21 | 1147202 | 13112486 | hadcm3n_ygju_1900_40_007354308_0 | 51,840 | 115,772 | 2.2333 |
25 Jul 2011 15:30:27 | 1147202 | 13112486 | hadcm3n_ygju_1900_40_007354308_0 | 25,920 | 57,591 | 2.2219 |
©2024 climateprediction.net