Name | hadcm3n_zg54_1920_40_008349674_1 |
Workunit | 8500535 |
Created | 14 May 2013, 8:30:19 UTC |
Sent | 14 May 2013, 8:30:26 UTC |
Report deadline | 13 Aug 2013, 15:57:37 UTC |
Received | 30 May 2013, 10:12:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 15 days 20 hours 2 min 32 sec |
CPU time | 15 days 11 hours 53 min 8 sec |
Validate state | Invalid |
Credit | 5,909.76 |
Device peak FLOPS | 2.01 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 15:42:45 (22091): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:25:14 (2327): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:25:15 (2327): No heartbeat from core client for 30 sec - exiting 17:25:16 (2327): No heartbeat from core client for 30 sec - exiting 09:35:20 (2955): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:35:21 (2955): No heartbeat from core client for 30 sec - exiting 09:35:22 (2955): No heartbeat from core client for 30 sec - exiting 09:35:23 (2955): No heartbeat from core client for 30 sec - exiting 09:35:24 (2955): No heartbeat from core client for 30 sec - exiting 09:35:25 (2955): No heartbeat from core client for 30 sec - exiting 09:35:26 (2955): No heartbeat from core client for 30 sec - exiting 09:35:27 (2955): No heartbeat from core client for 30 sec - exiting 09:35:28 (2955): No heartbeat from core client for 30 sec - exiting 09:35:29 (2955): No heartbeat from core client for 30 sec - exiting 09:35:30 (2955): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 06:40:35 (13115): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:45:34 (38869): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:36:02 (39080): No heartbeat from core client for 30 sec - exiting 10:36:03 (39080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:28 (41713): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:29 (41713): No heartbeat from core client for 30 sec - exiting 10:40:30 (41713): No heartbeat from core client for 30 sec - exiting 10:40:31 (41713): No heartbeat from core client for 30 sec - exiting 10:40:32 (41713): No heartbeat from core client for 30 sec - exiting 10:40:33 (41713): No heartbeat from core client for 30 sec - exiting 10:40:34 (41713): No heartbeat from core client for 30 sec - exiting 10:40:35 (41713): No heartbeat from core client for 30 sec - exiting 10:40:36 (41713): No heartbeat from core client for 30 sec - exiting 10:40:37 (41713): No heartbeat from core client for 30 sec - exiting 10:40:38 (41713): No heartbeat from core client for 30 sec - exiting 10:40:39 (41713): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77db400] [0xf77db425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75fb825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e5400] [0xf76e5425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75021df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7505825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74ed4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf776d400] [0xf776d425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf758a1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758d825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75754d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77d6400] [0xf77d6425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75f6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75de4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77b9400] [0xf77b9425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d61df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75d9825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c14d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7796400] [0xf7796425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41926, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 May 2013 02:20:54 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 492,480 | 1,314,564 | 2.6693 |
29 May 2013 05:51:53 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 466,560 | 1,244,200 | 2.6668 |
28 May 2013 09:54:04 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 440,640 | 1,174,770 | 2.6661 |
27 May 2013 14:25:56 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 414,720 | 1,107,302 | 2.6700 |
26 May 2013 18:45:49 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 388,800 | 1,040,654 | 2.6766 |
25 May 2013 23:42:18 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 362,880 | 972,568 | 2.6801 |
25 May 2013 04:20:42 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 336,960 | 906,024 | 2.6888 |
24 May 2013 08:52:56 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 311,040 | 839,097 | 2.6977 |
23 May 2013 13:49:47 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 285,120 | 771,356 | 2.7054 |
22 May 2013 18:34:12 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 259,200 | 703,417 | 2.7138 |
21 May 2013 21:58:45 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 233,280 | 634,997 | 2.7220 |
21 May 2013 01:40:24 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 207,360 | 564,300 | 2.7214 |
20 May 2013 05:54:55 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 181,440 | 493,968 | 2.7225 |
19 May 2013 10:17:41 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 155,520 | 422,857 | 2.7190 |
18 May 2013 14:05:38 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 129,600 | 352,907 | 2.7230 |
17 May 2013 18:22:35 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 103,680 | 284,472 | 2.7438 |
16 May 2013 22:30:57 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 77,760 | 216,100 | 2.7791 |
16 May 2013 02:19:54 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 51,840 | 144,445 | 2.7864 |
15 May 2013 05:09:16 | 1282401 | 15782513 | hadcm3n_zg54_1920_40_008349674_1 | 25,920 | 70,962 | 2.7377 |
©2024 cpdn.org