Name | hadcm3n_4mbw_1940_40_008312091_3 |
Workunit | 8463226 |
Created | 12 May 2013, 7:41:07 UTC |
Sent | 12 May 2013, 7:41:18 UTC |
Report deadline | 11 Aug 2013, 15:08:29 UTC |
Received | 4 Sep 2013, 12:09:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1291528 |
Run time | 1 day 8 hours 5 min 6 sec |
CPU time | 1 day 6 hours 36 min |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 18:09:58 (15751): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:13:39 (16072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:13:40 (16072): No heartbeat from core client for 30 sec - exiting 18:13:41 (16072): No heartbeat from core client for 30 sec - exiting 18:13:42 (16072): No heartbeat from core client for 30 sec - exiting 18:13:43 (16072): No heartbeat from core client for 30 sec - exiting 18:13:44 (16072): No heartbeat from core client for 30 sec - exiting 18:13:45 (16072): No heartbeat from core client for 30 sec - exiting 18:13:46 (16072): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 18:17:25 (16152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:17:26 (16152): No heartbeat from core client for 30 sec - exiting 18:17:27 (16152): No heartbeat from core client for 30 sec - exiting 18:17:28 (16152): No heartbeat from core client for 30 sec - exiting 18:17:29 (16152): No heartbeat from core client for 30 sec - exiting 18:17:30 (16152): No heartbeat from core client for 30 sec - exiting 18:17:31 (16152): No heartbeat from core client for 30 sec - exiting 18:17:32 (16152): No heartbeat from core client for 30 sec - exiting 18:17:33 (16152): No heartbeat from core client for 30 sec - exiting 18:17:34 (16152): No heartbeat from core client for 30 sec - exiting 18:17:35 (16152): No heartbeat from core client for 30 sec - exiting 18:17:36 (16152): No heartbeat from core client for 30 sec - exiting 18:17:37 (16152): No heartbeat from core client for 30 sec - exiting 18:17:38 (16152): No heartbeat from core client for 30 sec - exiting 18:17:39 (16152): No heartbeat from core client for 30 sec - exiting 18:17:40 (16152): No heartbeat from core client for 30 sec - exiting 18:17:41 (16152): No heartbeat from core client for 30 sec - exiting 18:17:42 (16152): No heartbeat from core client for 30 sec - exiting 18:21:14 (16235): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:25:12 (16315): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:25:13 (16315): No heartbeat from core client for 30 sec - exiting 18:25:14 (16315): No heartbeat from core client for 30 sec - exiting 18:25:15 (16315): No heartbeat from core client for 30 sec - exiting 18:25:16 (16315): No heartbeat from core client for 30 sec - exiting 18:25:17 (16315): No heartbeat from core client for 30 sec - exiting 18:25:18 (16315): No heartbeat from core client for 30 sec - exiting 18:25:19 (16315): No heartbeat from core client for 30 sec - exiting 18:25:20 (16315): No heartbeat from core client for 30 sec - exiting 18:25:21 (16315): No heartbeat from core client for 30 sec - exiting 18:25:22 (16315): No heartbeat from core client for 30 sec - exiting 18:25:23 (16315): No heartbeat from core client for 30 sec - exiting 18:25:24 (16315): No heartbeat from core client for 30 sec - exiting 18:25:25 (16315): No heartbeat from core client for 30 sec - exiting 18:25:26 (16315): No heartbeat from core client for 30 sec - exiting 18:25:27 (16315): No heartbeat from core client for 30 sec - exiting 18:25:28 (16315): No heartbeat from core client for 30 sec - exiting 18:25:29 (16315): No heartbeat from core client for 30 sec - exiting 18:25:30 (16315): No heartbeat from core client for 30 sec - exiting 18:25:31 (16315): No heartbeat from core client for 30 sec - exiting 18:25:32 (16315): No heartbeat from core client for 30 sec - exiting 18:25:33 (16315): No heartbeat from core client for 30 sec - exiting 18:25:34 (16315): No heartbeat from core client for 30 sec - exiting 18:29:06 (16395): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:29:07 (16395): No heartbeat from core client for 30 sec - exiting 18:29:08 (16395): No heartbeat from core client for 30 sec - exiting 18:29:09 (16395): No heartbeat from core client for 30 sec - exiting 18:29:10 (16395): No heartbeat from core client for 30 sec - exiting 18:29:11 (16395): No heartbeat from core client for 30 sec - exiting 18:29:12 (16395): No heartbeat from core client for 30 sec - exiting 18:32:55 (16475): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:32:56 (16475): No heartbeat from core client for 30 sec - exiting 18:32:57 (16475): No heartbeat from core client for 30 sec - exiting 18:32:58 (16475): No heartbeat from core client for 30 sec - exiting 18:32:59 (16475): No heartbeat from core client for 30 sec - exiting 18:33:00 (16475): No heartbeat from core client for 30 sec - exiting 18:33:01 (16475): No heartbeat from core client for 30 sec - exiting 18:33:02 (16475): No heartbeat from core client for 30 sec - exiting 18:33:03 (16475): No heartbeat from core client for 30 sec - exiting 18:33:04 (16475): No heartbeat from core client for 30 sec - exiting 18:33:05 (16475): No heartbeat from core client for 30 sec - exiting 18:33:06 (16475): No heartbeat from core client for 30 sec - exiting 18:33:07 (16475): No heartbeat from core client for 30 sec - exiting 18:33:08 (16475): No heartbeat from core client for 30 sec - exiting 18:33:09 (16475): No heartbeat from core client for 30 sec - exiting 18:33:10 (16475): No heartbeat from core client for 30 sec - exiting 18:33:11 (16475): No heartbeat from core client for 30 sec - exiting 18:33:12 (16475): No heartbeat from core client for 30 sec - exiting 18:33:13 (16475): No heartbeat from core client for 30 sec - exiting 18:36:45 (16563): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:36:46 (16563): No heartbeat from core client for 30 sec - exiting 18:36:47 (16563): No heartbeat from core client for 30 sec - exiting 18:36:48 (16563): No heartbeat from core client for 30 sec - exiting 18:36:49 (16563): No heartbeat from core client for 30 sec - exiting 18:36:50 (16563): No heartbeat from core client for 30 sec - exiting 18:36:51 (16563): No heartbeat from core client for 30 sec - exiting 22:19:56 (16645): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:19:57 (16645): No heartbeat from core client for 30 sec - exiting 22:19:58 (16645): No heartbeat from core client for 30 sec - exiting 22:19:59 (16645): No heartbeat from core client for 30 sec - exiting 22:20:00 (16645): No heartbeat from core client for 30 sec - exiting 22:23:21 (16832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:23:22 (16832): No heartbeat from core client for 30 sec - exiting 22:23:23 (16832): No heartbeat from core client for 30 sec - exiting 22:23:24 (16832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:48:56 (3919): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:08:34 (4322): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:08:35 (4322): No heartbeat from core client for 30 sec - exiting 16:08:36 (4322): No heartbeat from core client for 30 sec - exiting 16:08:37 (4322): No heartbeat from core client for 30 sec - exiting 16:08:38 (4322): No heartbeat from core client for 30 sec - exiting 16:23:11 (4529): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:23:12 (4529): No heartbeat from core client for 30 sec - exiting 16:23:13 (4529): No heartbeat from core client for 30 sec - exiting 16:23:14 (4529): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 16:39:50 (4853): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:51 (4853): No heartbeat from core client for 30 sec - exiting 16:42:10 (4957): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:42:11 (4957): No heartbeat from core client for 30 sec - exiting 16:42:12 (4957): No heartbeat from core client for 30 sec - exiting 16:42:13 (4957): No heartbeat from core client for 30 sec - exiting 16:42:14 (4957): No heartbeat from core client for 30 sec - exiting 16:42:15 (4957): No heartbeat from core client for 30 sec - exiting 16:42:16 (4957): No heartbeat from core client for 30 sec - exiting 16:42:17 (4957): No heartbeat from core client for 30 sec - exiting 16:42:18 (4957): No heartbeat from core client for 30 sec - exiting 16:42:19 (4957): No heartbeat from core client for 30 sec - exiting 16:42:20 (4957): No heartbeat from core client for 30 sec - exiting 16:42:21 (4957): No heartbeat from core client for 30 sec - exiting 16:42:22 (4957): No heartbeat from core client for 30 sec - exiting 16:42:23 (4957): No heartbeat from core client for 30 sec - exiting 16:42:24 (4957): No heartbeat from core client for 30 sec - exiting 16:42:25 (4957): No heartbeat from core client for 30 sec - exiting 16:42:26 (4957): No heartbeat from core client for 30 sec - exiting 16:44:56 (5069): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:44:57 (5069): No heartbeat from core client for 30 sec - exiting 16:50:59 (5138): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:58:08 (5252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:58:09 (5252): No heartbeat from core client for 30 sec - exiting 16:58:10 (5252): No heartbeat from core client for 30 sec - exiting 16:58:11 (5252): No heartbeat from core client for 30 sec - exiting 16:58:12 (5252): No heartbeat from core client for 30 sec - exiting 16:58:13 (5252): No heartbeat from core client for 30 sec - exiting 16:58:14 (5252): No heartbeat from core client for 30 sec - exiting 16:58:15 (5252): No heartbeat from core client for 30 sec - exiting 16:58:16 (5252): No heartbeat from core client for 30 sec - exiting 17:00:46 (5345): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:05:18 (5646): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:05:19 (5646): No heartbeat from core client for 30 sec - exiting 17:05:20 (5646): No heartbeat from core client for 30 sec - exiting 17:05:21 (5646): No heartbeat from core client for 30 sec - exiting 17:05:22 (5646): No heartbeat from core client for 30 sec - exiting 17:26:48 (5709): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:26:49 (5709): No heartbeat from core client for 30 sec - exiting 17:26:50 (5709): No heartbeat from core client for 30 sec - exiting 17:26:51 (5709): No heartbeat from core client for 30 sec - exiting 17:26:52 (5709): No heartbeat from core client for 30 sec - exiting 17:26:53 (5709): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76fd400] [0xf76fd430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf751a1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf751d825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75054d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76f7400] [0xf76f7430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75141df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7517825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74ff4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf778b400] [0xf778b430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75ab825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75934d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf777b400] [0xf777b430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75981df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf759b825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75834d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf779d400] [0xf779d430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75ba1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75bd825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a54d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf770c400] [0xf770c430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75291df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752c825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75144d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23826, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
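
The exit status above appears in three forms: decimal 22, hexadecimal 0x16, and -234. The following is a minimal sketch of how those three values relate arithmetically, assuming the negative figure is simply the 8-bit exit code offset by 256. That reading matches the numbers in the log but is not confirmed by the page itself, and the function name `exit_code_forms` is hypothetical.

```python
def exit_code_forms(code: int) -> tuple[int, str, int]:
    """Return (decimal, hex string, code - 256) for an 8-bit process exit code.

    Assumption: the negative value printed in the log is code - 256.
    This is an illustrative reading, not documented client behaviour.
    """
    if not 0 <= code <= 255:
        raise ValueError("exit code must fit in 8 bits")
    return code, hex(code), code - 256


decimal, hexadecimal, offset = exit_code_forms(22)
print(decimal, hexadecimal, offset)
```

For the value reported here, the three forms are 22, 0x16, and -234, matching the `<message>` line in the stderr output.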
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 May 2013 02:15:02 | 1281428 | 15778875 | hadcm3n_4mbw_1940_40_008312091_3 | 25,920 | 59,063 | 2.2787 |
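
The Average (sec/TS) figure in the trickle row is consistent with CPU time divided by timestep count. The sketch below checks that arithmetic using the two values from the row. The function name is hypothetical, and the assumption that the column is computed exactly this way (including rounding to four decimals) is inferred from the numbers rather than stated on the page.

```python
def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimal places.

    Assumption: the 'Average (sec/TS)' column is CPU time / timestep.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# Values copied from the trickle row: 59,063 CPU seconds over 25,920 timesteps.
print(average_sec_per_timestep(59_063, 25_920))
```

With the row's values this gives 2.2787, the same figure shown in the table.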