Name | hadcm3n_t4qb_1940_40_007958823_2 |
Workunit | 8113935 |
Created | 10 May 2012, 20:58:05 UTC |
Sent | 10 May 2012, 21:06:07 UTC |
Report deadline | 10 Aug 2012, 4:33:18 UTC |
Received | 24 Jun 2012, 13:41:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1179592 |
Run time | 37 days 20 hours 19 min 41 sec |
CPU time | 37 days 16 hours 50 min 23 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 1.37 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:34:52 (19768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 13:04:10 (23742): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:24:50 (24249): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:24:57 (24249): No heartbeat from core client for 30 sec - exiting 17:27:19 (26472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:20:19 (26692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:05 (26934): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:22:33 (27211): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:27:54 (27422): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:33:58 (27628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:32 (27840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:38 (28245): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:42 (28245): No heartbeat from core client for 30 sec - exiting 16:39:43 (28245): No heartbeat from core client for 30 sec - exiting 16:39:44 (28245): No heartbeat from core client for 30 sec - exiting 17:47:17 (28469): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:06:00 (28681): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:29 (29064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:18 (29435): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:56 (29640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:45:18 (29844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:32:12 (30143): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:48:22 (30361): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:49:03 (31410): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:03:07 (33370): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:17 (33622): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:47:09 (34092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:39:54 (34339): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:43:48 (34557): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:50:01 (34797): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:55:39 (35144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:57:48 (35419): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:44:05 (35637): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:56:07 (35850): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:57:23 (36095): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:17:07 (36317): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:05:56 (36951): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:06:02 (36951): No heartbeat from core client for 30 sec - exiting 13:06:03 (36951): No heartbeat from core client for 30 sec - exiting 14:07:59 (37743): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:16:26 (37987): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:20:59 (38205): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:23:15 (38420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:50:01 (41574): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:55:34 (41860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:26:46 (42091): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... *** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x08dbace0 *** ======= Backtrace: ========= /lib32/libc.so.6(+0x6bff1)[0xf752cff1] /lib32/libc.so.6(+0x6d880)[0xf752e880] /lib32/libc.so.6(cfree+0x6d)[0xf753192d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77087b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf770880d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf74d7ce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 08d65000-08dca000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f7043000-f74be000 rw-s 00000000 08:02 3784874 /var/lib/boinc-client/slots/12/137845 f74be000-f74c1000 rw-p 00000000 00:00 0 f74c1000-f7615000 r-xp 00000000 08:02 9879703 /lib32/libc-2.12.1.so f7615000-f7616000 ---p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f7616000-f7618000 r--p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f7618000-f7619000 rw-p 00156000 08:02 9879703 /lib32/libc-2.12.1.so f7619000-f761c000 rw-p 00000000 00:00 0 f761c000-f7636000 r-xp 00000000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7636000-f7637000 r--p 00019000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7637000-f7638000 rw-p 0001a000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7638000-f765c000 r-xp 00000000 08:02 9879707 /lib32/libm-2.12.1.so f765c000-f765d000 r--p 00023000 08:02 9879707 /lib32/libm-2.12.1.so f765d000-f765e000 rw-p 00024000 08:02 9879707 /lib32/libm-2.12.1.so f765e000-f773d000 r-xp 00000000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f773d000-f7741000 r--p 000de000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f7741000-f7742000 rw-p 000e2000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f7742000-f7749000 rw-p 00000000 00:00 0 f7749000-f774b000 r-xp 00000000 08:02 9879706 /lib32/libdl-2.12.1.so f774b000-f774c000 r--p 00001000 08:02 9879706 /lib32/libdl-2.12.1.so f774c000-f774d000 rw-p 00002000 08:02 9879706 /lib32/libdl-2.12.1.so f774d000-f774e000 rw-p 00000000 00:00 0 f774e000-f7763000 r-xp 00000000 08:02 9879717 /lib32/libpthread-2.12.1.so f7763000-f7764000 r--p 00014000 08:02 9879717 /lib32/libpthread-2.12.1.so f7764000-f7765000 rw-p 00015000 08:02 9879717 /lib32/libpthread-2.12.1.so f7765000-f7767000 rw-p 00000000 00:00 0 f7778000-f7779000 rw-p 00000000 00:00 0 f7779000-f777a000 ---p 00000000 00:00 0 f777a000-f777d000 rw-p 00000000 00:00 0 f777d000-f777f000 rw-s 00000000 08:02 3784871 /var/lib/boinc-client/slots/12/boinc_mmap_file f777f000-f7781000 rw-p 00000000 00:00 0 f7781000-f7782000 r-xp 00000000 00:00 0 [vdso] f7782000-f779e000 r-xp 00000000 08:02 9879700 /lib32/ld-2.12.1.so f779e000-f779f000 r--p 0001b000 08:02 9879700 /lib32/ld-2.12.1.so f779f000-f77a0000 rw-p 0001c000 08:02 9879700 /lib32/ld-2.12.1.so ffe72000-ffee3000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (19 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf7781400] [0xf7781425] /lib32/libc.so.6(gsignal+0x51)[0xf74eba01] /lib32/libc.so.6(abort+0x182)[0xf74eee42] /lib32/libc.so.6(+0x61f15)[0xf7522f15] /lib32/libc.so.6(+0x6bff1)[0xf752cff1] /lib32/libc.so.6(+0x6d880)[0xf752e880] /lib32/libc.so.6(cfree+0x6d)[0xf753192d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77087b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf770880d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf74d7ce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Jun 2012 13:45:00 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 1,036,800 | 3,257,398 | 3.1418 |
23 Jun 2012 14:24:36 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 1,010,880 | 3,177,252 | 3.1431 |
22 Jun 2012 17:08:43 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 984,960 | 3,100,598 | 3.1479 |
21 Jun 2012 19:28:39 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 959,040 | 3,024,413 | 3.1536 |
20 Jun 2012 22:04:13 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 933,120 | 2,947,824 | 3.1591 |
20 Jun 2012 00:50:22 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 907,200 | 2,871,190 | 3.1649 |
19 Jun 2012 03:51:18 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 881,280 | 2,794,870 | 3.1714 |
18 Jun 2012 05:01:31 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 855,360 | 2,714,854 | 3.1739 |
17 Jun 2012 05:11:30 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 829,440 | 2,629,229 | 3.1699 |
16 Jun 2012 05:51:57 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 803,520 | 2,546,137 | 3.1687 |
15 Jun 2012 05:25:24 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 777,600 | 2,461,758 | 3.1658 |
14 Jun 2012 05:44:00 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 751,680 | 2,377,849 | 3.1634 |
13 Jun 2012 07:24:29 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 725,760 | 2,298,666 | 3.1673 |
12 Jun 2012 09:02:10 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 699,840 | 2,219,312 | 3.1712 |
11 Jun 2012 11:02:28 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 673,920 | 2,139,163 | 3.1742 |
10 Jun 2012 12:42:21 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 648,000 | 2,059,042 | 3.1775 |
09 Jun 2012 14:53:34 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 622,080 | 1,979,768 | 3.1825 |
08 Jun 2012 13:45:49 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 596,160 | 1,891,469 | 3.1728 |
07 Jun 2012 12:59:46 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 570,240 | 1,802,576 | 3.1611 |
06 Jun 2012 12:46:21 | 1179592 | 14655919 | hadcm3n_t4qb_1940_40_007958823_2 | 544,320 | 1,719,559 | 3.1591 |
©2024 cpdn.org