Name | hadcm3n_3ea6_1940_40_008258363_4 |
Workunit | 8413487 |
Created | 5 Jun 2013, 2:49:35 UTC |
Sent | 10 Jun 2013, 5:58:56 UTC |
Report deadline | 9 Sep 2013, 13:26:07 UTC |
Received | 12 Jun 2013, 4:58:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 1 days 19 hours 47 min 1 sec |
CPU time | 1 days 18 hours 39 min 40 sec |
Validate state | Invalid |
Credit | 622.08 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 12:49:13 (45923): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:28 (49162): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:08 (49313): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:38:25 (49450): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:42:19 (53914): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:58:25 (54081): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:06 (54330): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:06:19 (54489): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:10:59 (54660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:11:00 (54660): No heartbeat from core client for 30 sec - exiting 22:11:01 (54660): No heartbeat from core client for 30 sec - exiting 22:11:02 (54660): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 00:32:53 (54843): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:36:30 (56188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:55:17 (56344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:55:18 (56344): No heartbeat from core client for 30 sec - exiting 00:59:17 (56634): No heartbeat from core client for 30 sec - exiting 00:59:18 (56634): No heartbeat from core client for 30 sec - exiting 00:59:19 (56634): No heartbeat from core client for 30 sec - exiting 00:59:20 (56634): No heartbeat from core client for 30 sec - exiting 00:59:21 (56634): No heartbeat from core client for 30 sec - exiting 00:59:22 (56634): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:23 (56634): No heartbeat from core client for 30 sec - exiting 01:03:52 (56799): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:53 (56799): No heartbeat from core client for 30 sec - exiting 01:03:54 (56799): No heartbeat from core client for 30 sec - exiting 01:03:55 (56799): No heartbeat from core client for 30 sec - exiting 01:03:56 (56799): No heartbeat from core client for 30 sec - exiting 01:11:39 (56921): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:33 (57107): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:34 (57107): No heartbeat from core client for 30 sec - exiting 01:19:27 (57266): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:19:28 (57266): No heartbeat from core client for 30 sec - exiting 01:19:29 (57266): No heartbeat from core client for 30 sec - exiting 04:28:21 (57420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:40:35 (59139): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:22 (59338): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:23 (59338): No heartbeat from core client for 30 sec - exiting 04:44:24 (59338): No heartbeat from core client for 30 sec - exiting 04:44:25 (59338): No heartbeat from core client for 30 sec - exiting 04:44:26 (59338): No heartbeat from core client for 30 sec - exiting 04:44:27 (59338): No heartbeat from core client for 30 sec - exiting 04:44:28 (59338): No heartbeat from core client for 30 sec - exiting 04:44:29 (59338): No heartbeat from core client for 30 sec - exiting 04:44:30 (59338): No heartbeat from core client for 30 sec - exiting 04:44:31 (59338): No heartbeat from core client for 30 sec - exiting 04:44:32 (59338): No heartbeat from core client for 30 sec - exiting 04:44:33 (59338): No heartbeat from core client for 30 sec - exiting 04:44:34 (59338): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 05:01:47 (59493): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:27:07 (59749): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:27:08 (59749): No heartbeat from core client for 30 sec - exiting 10:27:09 (59749): No heartbeat from core client for 30 sec - exiting 10:27:10 (59749): No heartbeat from core client for 30 sec - exiting 10:27:11 (59749): No heartbeat from core client for 30 sec - exiting 10:27:12 (59749): No heartbeat from core client for 30 sec - exiting 10:27:13 (59749): No heartbeat from core client for 30 sec - exiting 10:27:14 (59749): No heartbeat from core client for 30 sec - exiting 10:27:15 (59749): No heartbeat from core client for 30 sec - exiting 10:27:16 (59749): No heartbeat from core client for 30 sec - exiting 10:27:17 (59749): No heartbeat from core client for 30 sec - exiting 10:27:18 (59749): No heartbeat from core client for 30 sec - exiting 10:27:19 (59749): No heartbeat from core client for 30 sec - exiting 10:27:20 (59749): No heartbeat from core client for 30 sec - exiting 10:27:21 (59749): No heartbeat from core client for 30 sec - exiting 10:27:22 (59749): No heartbeat from core client for 30 sec - exiting 10:27:23 (59749): No heartbeat from core client for 30 sec - exiting 14:13:24 (62994): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:17:17 (65121): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:06:02 (65240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:58:40 (2924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:02:24 (4060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:02:25 (4060): No heartbeat from core client for 30 sec - exiting 21:02:26 (4060): No heartbeat from core client for 30 sec - exiting 21:02:27 (4060): No heartbeat from core client for 30 sec - exiting 21:02:28 (4060): No heartbeat from core client for 30 sec - exiting 21:02:29 (4060): No heartbeat from core client for 30 sec - exiting 21:02:30 (4060): No heartbeat from core client for 30 sec - exiting 21:02:31 (4060): No heartbeat from core client for 30 sec - exiting 21:02:32 (4060): No heartbeat from core client for 30 sec - exiting 21:02:33 (4060): No heartbeat from core client for 30 sec - exiting 21:02:34 (4060): No heartbeat from core client for 30 sec - exiting 05:41:04 (4252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:45:01 (8179): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:45:02 (8179): No heartbeat from core client for 30 sec - exiting 05:45:03 (8179): No heartbeat from core client for 30 sec - exiting 05:45:04 (8179): No heartbeat from core client for 30 sec - exiting 05:45:05 (8179): No heartbeat from core client for 30 sec - exiting 05:45:06 (8179): No heartbeat from core client for 30 sec - exiting 05:45:07 (8179): No heartbeat from core client for 30 sec - exiting 05:45:08 (8179): No heartbeat from core client for 30 sec - exiting 05:45:09 (8179): No heartbeat from core client for 30 sec - exiting 05:45:10 (8179): No heartbeat from core client for 30 sec - exiting 05:45:11 (8179): No heartbeat from core client for 30 sec - exiting 05:48:54 (8292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:48:55 (8292): No heartbeat from core client for 30 sec - exiting 05:48:56 (8292): No heartbeat from core client for 30 sec - exiting 05:48:57 (8292): No heartbeat from core client for 30 sec - exiting 05:48:58 (8292): No heartbeat from core client for 30 sec - exiting 05:48:59 (8292): No heartbeat from core client for 30 sec - exiting 05:49:00 (8292): No heartbeat from core client for 30 sec - exiting 05:49:01 (8292): No heartbeat from core client for 30 sec - exiting 05:49:02 (8292): No heartbeat from core client for 30 sec - exiting 05:49:03 (8292): No heartbeat from core client for 30 sec - exiting 05:49:04 (8292): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7796400] [0xf7796425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7700400] [0xf7700425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf751d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7520825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75084d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7766400] [0xf7766425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75831df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7586825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf756e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7774400] [0xf7774425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75911df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7594825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf757c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf774f400] [0xf774f425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf756c1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf756f825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75574d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7781400] [0xf7781425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759e1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a1825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75894d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8385, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Jun 2013 21:29:45 | 1282401 | 15830152 | hadcm3n_3ea6_1940_40_008258363_4 | 51,840 | 128,815 | 2.4849 |
11 Jun 2013 01:45:21 | 1282401 | 15830152 | hadcm3n_3ea6_1940_40_008258363_4 | 25,920 | 62,928 | 2.4278 |
©2024 cpdn.org