Name | hadcm3n_o4x4_2100_40_007999577_4 |
Workunit | 8154691 |
Created | 3 Sep 2012, 22:05:27 UTC |
Sent | 3 Sep 2012, 22:05:39 UTC |
Report deadline | 4 Dec 2012, 5:32:50 UTC |
Received | 8 Oct 2012, 22:13:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1179592 |
Run time | 26 days 17 hours 28 min 12 sec |
CPU time | 26 days 17 hours 28 min 12 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 1.93 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 04:51:28 (16622): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:44:24 (19096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:59:37 (20648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:04:47 (22349): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:18:53 (22515): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:50:39 (23168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:00:28 (23773): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:52:06 (24064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:41 (24423): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:24:04 (24704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:42:57 (25020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:36:19 (25948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:40:20 (26809): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:35 (26998): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
23:58:38 (27280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:02:34 (27529): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:48:48 (27781): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/o4x4ko.pjw5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ko.pjw5c10 Error: Input file: dataout/o4x4ko.piw5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ko.piw5c10 Error: Input file: dataout/o4x4ko.pfw5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ko.pfw5c10 Error: Input file: dataout/o4x4ka.phw5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ka.phw5c10 Error: Input file: dataout/o4x4ka.pgw5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ka.pgw5c10 Error: Input file: dataout/o4x4ka.pew5c10 is not a valid UM file. Error converting file to netcdf: dataout/o4x4ka.pew5c10 Error: Input file: dataout/o4x4ka.pdw5c10 is not a valid UM file. 
Error converting file to netcdf: dataout/o4x4ka.pdw5c10 04:19:32 (28061): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:51:40 (28318): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:39:05 (28640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:01:27 (29085): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:49:21 (29446): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:30:55 (29728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:13:26 (29989): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:18:16 (30668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:25:40 (30892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:33:59 (31187): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:43:34 (31508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:31:52 (31771): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:42:22 (32040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:35:13 (32289): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:39:11 (32830): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:15:36 (33064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
*** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x08f2b928 *** ======= Backtrace: ========= /lib32/libc.so.6(+0x6bff1)[0xf7534ff1] /lib32/libc.so.6(+0x6d880)[0xf7536880] /lib32/libc.so.6(cfree+0x6d)[0xf753992d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77107b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf771080d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf74dfce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 08ed6000-08f3b000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f704b000-f74c6000 rw-s 00000000 08:02 8847385 /var/lib/boinc-client/slots/15/136175 f74c6000-f74c9000 rw-p 00000000 00:00 0 f74c9000-f761d000 r-xp 00000000 08:02 9879703 /lib32/libc-2.12.1.so f761d000-f761e000 ---p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f761e000-f7620000 r--p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f7620000-f7621000 rw-p 00156000 08:02 9879703 /lib32/libc-2.12.1.so f7621000-f7624000 rw-p 00000000 00:00 0 f7624000-f763e000 r-xp 00000000 08:02 13436533 /usr/lib32/libgcc_s.so.1 
f763e000-f763f000 r--p 00019000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f763f000-f7640000 rw-p 0001a000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7640000-f7664000 r-xp 00000000 08:02 9879707 /lib32/libm-2.12.1.so f7664000-f7665000 r--p 00023000 08:02 9879707 /lib32/libm-2.12.1.so f7665000-f7666000 rw-p 00024000 08:02 9879707 /lib32/libm-2.12.1.so f7666000-f7745000 r-xp 00000000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f7745000-f7749000 r--p 000de000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f7749000-f774a000 rw-p 000e2000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f774a000-f7751000 rw-p 00000000 00:00 0 f7751000-f7753000 r-xp 00000000 08:02 9879706 /lib32/libdl-2.12.1.so f7753000-f7754000 r--p 00001000 08:02 9879706 /lib32/libdl-2.12.1.so f7754000-f7755000 rw-p 00002000 08:02 9879706 /lib32/libdl-2.12.1.so f7755000-f7756000 rw-p 00000000 00:00 0 f7756000-f776b000 r-xp 00000000 08:02 9879717 /lib32/libpthread-2.12.1.so f776b000-f776c000 r--p 00014000 08:02 9879717 /lib32/libpthread-2.12.1.so f776c000-f776d000 rw-p 00015000 08:02 9879717 /lib32/libpthread-2.12.1.so f776d000-f776f000 rw-p 00000000 00:00 0 f7780000-f7781000 rw-p 00000000 00:00 0 f7781000-f7782000 ---p 00000000 00:00 0 f7782000-f7785000 rw-p 00000000 00:00 0 f7785000-f7787000 rw-s 00000000 08:02 8847382 /var/lib/boinc-client/slots/15/boinc_mmap_file f7787000-f7789000 rw-p 00000000 00:00 0 f7789000-f778a000 r-xp 00000000 00:00 0 [vdso] f778a000-f77a6000 r-xp 00000000 08:02 9879700 /lib32/ld-2.12.1.so f77a6000-f77a7000 r--p 0001b000 08:02 9879700 /lib32/ld-2.12.1.so f77a7000-f77a8000 rw-p 0001c000 08:02 9879700 /lib32/ld-2.12.1.so ff95e000-ff9ce000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (19 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf7789400] [0xf7789425] /lib32/libc.so.6(gsignal+0x51)[0xf74f3a01] /lib32/libc.so.6(abort+0x182)[0xf74f6e42] /lib32/libc.so.6(+0x61f15)[0xf752af15] 
/lib32/libc.so.6(+0x6bff1)[0xf7534ff1] /lib32/libc.so.6(+0x6d880)[0xf7536880] /lib32/libc.so.6(cfree+0x6d)[0xf753992d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77107b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf771080d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf74dfce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Oct 2012 22:18:00 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 777,600 | 2,309,280 | 2.9698 |
08 Oct 2012 00:19:04 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 751,680 | 2,236,038 | 2.9747 |
07 Oct 2012 01:17:57 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 725,760 | 2,155,772 | 2.9704 |
06 Oct 2012 02:20:59 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 699,840 | 2,076,884 | 2.9677 |
05 Oct 2012 01:15:54 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 673,920 | 1,984,439 | 2.9446 |
04 Oct 2012 03:21:55 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 648,000 | 1,908,671 | 2.9455 |
03 Oct 2012 04:20:42 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 622,080 | 1,827,273 | 2.9374 |
02 Oct 2012 05:21:12 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 596,160 | 1,745,530 | 2.9280 |
01 Oct 2012 22:14:09 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 570,240 | 1,674,382 | 2.9363 |
30 Sep 2012 22:13:05 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 544,320 | 1,603,844 | 2.9465 |
29 Sep 2012 22:14:08 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 518,400 | 1,533,432 | 2.9580 |
28 Sep 2012 22:13:29 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 492,480 | 1,462,638 | 2.9699 |
28 Sep 2012 01:14:33 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 466,560 | 1,387,213 | 2.9733 |
27 Sep 2012 02:17:39 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 440,640 | 1,304,815 | 2.9612 |
26 Sep 2012 22:14:57 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 414,720 | 1,231,999 | 2.9707 |
25 Sep 2012 22:14:51 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 388,800 | 1,159,730 | 2.9828 |
24 Sep 2012 22:15:26 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 362,880 | 1,089,276 | 3.0018 |
23 Sep 2012 22:12:12 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 336,960 | 1,020,446 | 3.0284 |
23 Sep 2012 00:14:04 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 311,040 | 953,895 | 3.0668 |
22 Sep 2012 05:17:15 | 1179592 | 15230315 | hadcm3n_o4x4_2100_40_007999577_4 | 285,120 | 885,917 | 3.1072 |
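
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table; the function name `seconds_per_timestep` is hypothetical and is not part of BOINC or the CPDN site.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the site's "Average (sec/TS)" column is
    cumulative CPU time / cumulative timestep count.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the trickle table above.
rows = [
    (777_600, 2_309_280),  # 08 Oct 2012 22:18:00
    (751_680, 2_236_038),  # 08 Oct 2012 00:19:04
    (285_120, 885_917),    # 22 Sep 2012 05:17:15
]

for timestep, cpu_time in rows:
    print(f"{timestep:>9,} {cpu_time:>11,} {seconds_per_timestep(cpu_time, timestep):.4f}")
```

For these three rows the computed values match the listed averages (2.9698, 2.9747 and 3.1072), which is consistent with the assumed definition but does not confirm how the site actually computes the column.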