Name | hadcm3n_z9ma_1880_40_008245316_2 |
Workunit | 8400440 |
Created | 20 Nov 2012, 19:20:06 UTC |
Sent | 20 Nov 2012, 19:20:42 UTC |
Report deadline | 20 Feb 2013, 2:47:53 UTC |
Received | 3 Dec 2012, 9:37:23 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1212089 |
Run time | 12 days 8 hours 28 min 19 sec |
CPU time | 8 days 22 hours 2 min 49 sec |
Validate state | Invalid |
Credit | 6,531.84 |
Device peak FLOPS | 1.69 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.29</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
09:33:07 (27055): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:33:08 (27055): No heartbeat from core client for 30 sec - exiting
22:50:31 (30574): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:50:32 (30574): No heartbeat from core client for 30 sec - exiting
22:50:33 (30574): No heartbeat from core client for 30 sec - exiting
22:50:34 (30574): No heartbeat from core client for 30 sec - exiting
22:50:35 (30574): No heartbeat from core client for 30 sec - exiting
22:50:36 (30574): No heartbeat from core client for 30 sec - exiting
22:50:39 (30574): No heartbeat from core client for 30 sec - exiting
22:50:40 (30574): No heartbeat from core client for 30 sec - exiting
22:50:41 (30574): No heartbeat from core client for 30 sec - exiting
22:50:42 (30574): No heartbeat from core client for 30 sec - exiting
22:50:43 (30574): No heartbeat from core client for 30 sec - exiting
03:34:50 (27926): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:42:07 (24594): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:42:08 (24594): No heartbeat from core client for 30 sec - exiting
07:10:10 (4906): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:26:32 (5210): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:26:33 (5210): No heartbeat from core client for 30 sec - exiting
10:26:34 (5210): No heartbeat from core client for 30 sec - exiting
10:26:35 (5210): No heartbeat from core client for 30 sec - exiting
10:26:36 (5210): No heartbeat from core client for 30 sec - exiting
10:26:37 (5210): No heartbeat from core client for 30 sec - exiting
10:26:38 (5210): No heartbeat from core client for 30 sec - exiting
10:26:39 (5210): No heartbeat from core client for 30 sec - exiting
10:26:40 (5210): No heartbeat from core client for 30 sec - exiting
10:26:41 (5210): No heartbeat from core client for 30 sec - exiting
19:10:03 (6006): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:10:04 (6006): No heartbeat from core client for 30 sec - exiting
19:10:05 (6006): No heartbeat from core client for 30 sec - exiting
19:10:07 (6006): No heartbeat from core client for 30 sec - exiting
19:10:08 (6006): No heartbeat from core client for 30 sec - exiting
19:10:09 (6006): No heartbeat from core client for 30 sec - exiting
19:10:10 (6006): No heartbeat from core client for 30 sec - exiting
19:10:11 (6006): No heartbeat from core client for 30 sec - exiting
19:10:12 (6006): No heartbeat from core client for 30 sec - exiting
06:38:22 (8331): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:38:23 (8331): No heartbeat from core client for 30 sec - exiting
04:30:42 (9985): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:30:43 (9985): No heartbeat from core client for 30 sec - exiting
04:30:44 (9985): No heartbeat from core client for 30 sec - exiting
07:30:21 (5558): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:30:25 (5558): No heartbeat from core client for 30 sec - exiting
07:00:56 (6612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77ca400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77ca430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75dbc3f]
/lib32/libc.so.6(abort+0x175)[0xf75dd505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c6ba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7704400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7704430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7515c3f]
/lib32/libc.so.6(abort+0x175)[0xf7517505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7500ba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7738400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7738430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7549c3f]
/lib32/libc.so.6(abort+0x175)[0xf754b505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7534ba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7782400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7782430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7593c3f]
/lib32/libc.so.6(abort+0x175)[0xf7595505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf757eba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf772d400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf772d430]
/lib32/libc.so.6(gsignal+0x4f)[0xf753ec3f]
/lib32/libc.so.6(abort+0x175)[0xf7540505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7529ba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77c4400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77c4430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75d5c3f]
/lib32/libc.so.6(abort+0x175)[0xf75d7505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c0ba3]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Dec 2012 17:50:37 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 544,320 | 739,424 | 1.3584 |
02 Dec 2012 04:12:47 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 518,400 | 704,365 | 1.3587 |
01 Dec 2012 14:27:33 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 492,480 | 669,320 | 1.3591 |
01 Dec 2012 00:44:27 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 466,560 | 633,788 | 1.3584 |
30 Nov 2012 11:02:23 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 440,640 | 598,565 | 1.3584 |
29 Nov 2012 21:19:06 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 414,720 | 563,289 | 1.3582 |
29 Nov 2012 08:02:19 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 388,800 | 527,809 | 1.3575 |
28 Nov 2012 18:30:19 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 362,880 | 492,073 | 1.3560 |
28 Nov 2012 05:21:00 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 336,960 | 456,904 | 1.3560 |
27 Nov 2012 14:54:56 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 311,040 | 421,646 | 1.3556 |
27 Nov 2012 01:17:06 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 285,120 | 386,582 | 1.3559 |
26 Nov 2012 11:51:12 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 259,200 | 351,380 | 1.3556 |
25 Nov 2012 22:27:51 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 233,280 | 316,445 | 1.3565 |
25 Nov 2012 08:55:27 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 207,360 | 281,420 | 1.3572 |
24 Nov 2012 19:07:58 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 181,440 | 246,295 | 1.3574 |
24 Nov 2012 05:17:40 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 155,520 | 211,192 | 1.3580 |
23 Nov 2012 15:34:38 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 129,600 | 175,872 | 1.3570 |
23 Nov 2012 01:49:10 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 103,680 | 140,777 | 1.3578 |
22 Nov 2012 12:18:08 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 77,760 | 105,648 | 1.3586 |
21 Nov 2012 22:35:24 | 1212089 | 15440282 | hadcm3n_z9ma_1880_40_008245316_2 | 51,840 | 70,224 | 1.3546 |
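
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below recomputes it for three rows copied from the table above. The helper names `parse_int` and `seconds_per_timestep` are hypothetical, chosen for illustration, and the rounding rule is an assumption inferred from the displayed values rather than taken from the project's server code.

```python
def parse_int(field: str) -> int:
    """Convert a table field such as '544,320' to an integer."""
    return int(field.replace(",", ""))


def seconds_per_timestep(cpu_time_sec: str, timestep: str) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps, to 4 decimals.

    Assumption: the table's 'Average (sec/TS)' column is computed this way.
    """
    return round(parse_int(cpu_time_sec) / parse_int(timestep), 4)


# (Timestep, CPU Time (sec)) pairs copied from the trickle table.
rows = [
    ("544,320", "739,424"),  # 02 Dec 2012 17:50:37
    ("518,400", "704,365"),  # 02 Dec 2012 04:12:47
    ("51,840", "70,224"),    # 21 Nov 2012 22:35:24
]

for timestep, cpu_time in rows:
    print(timestep, cpu_time, seconds_per_timestep(cpu_time, timestep))
```

For these three rows the recomputed values match the listed averages of 1.3584, 1.3587 and 1.3546.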