Name | hadcm3n_yblf_1900_40_007523360_0 |
Workunit | 7720835 |
Created | 28 Oct 2011, 13:22:14 UTC |
Sent | 31 Oct 2011, 14:19:37 UTC |
Report deadline | 30 Jan 2012, 21:46:48 UTC |
Received | 12 Dec 2011, 23:03:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1174556 |
Run time | 17 days 6 hours 28 min 42 sec |
CPU time | 8 days 12 hours 22 min 47 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.54 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.2</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 14:43:12 (3772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:12:37 (6924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 06:01:21 (7829): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 12:27:34 (7870): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:36:14 (10709): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:44 (10726): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:30:52 (10768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:34:44 (10828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:55:11 (10941): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:33:01 (11142): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:21:29 (14437): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 08:40:28 (16313): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:12:10 (30432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... 09:33:32 (32050): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:56:03 (2080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 05:38:44 (5478): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... *** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x088f1aa8 *** ======= Backtrace: ========= /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x6aac1)[0xb7553ac1] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x6c328)[0xb7555328] /lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xb75583dd] /usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xb7733abf] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb74ffe46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:01 3711203 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 08:01 3711203 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 0889c000-08902000 rw-p 00000000 00:00 0 [heap] b6f00000-b6f21000 rw-p 00000000 00:00 0 b6f21000-b7000000 ---p 00000000 00:00 0 b706b000-b74e6000 rw-s 00000000 08:01 3727584 /var/lib/boinc-client/slots/1/137730 b74e6000-b74e9000 rw-p 00000000 00:00 0 b74e9000-b763c000 r-xp 00000000 08:01 401589 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so b763c000-b763d000 ---p 00153000 08:01 401589 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so b763d000-b763f000 r--p 00153000 08:01 401589 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so b763f000-b7640000 rw-p 00155000 08:01 401589 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so b7640000-b7643000 rw-p 00000000 00:00 0 b7643000-b765f000 r-xp 00000000 08:01 393369 /lib/i386-linux-gnu/libgcc_s.so.1 b765f000-b7660000 rw-p 0001b000 08:01 393369 /lib/i386-linux-gnu/libgcc_s.so.1 b7660000-b7684000 r-xp 00000000 08:01 401584 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so b7684000-b7685000 r--p 00023000 08:01 401584 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so b7685000-b7686000 rw-p 00024000 08:01 401584 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so b7686000-b7762000 r-xp 00000000 08:01 3981356 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16 b7762000-b7763000 ---p 000dc000 08:01 3981356 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16 b7763000-b7767000 r--p 000dc000 08:01 3981356 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16 b7767000-b7768000 rw-p 000e0000 08:01 3981356 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16 b7768000-b7770000 rw-p 00000000 00:00 0 b7770000-b7772000 r-xp 00000000 08:01 401576 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so b7772000-b7773000 r--p 00001000 08:01 401576 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so b7773000-b7774000 rw-p 00002000 08:01 401576 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so b7774000-b7789000 r-xp 00000000 08:01 401573 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so b7789000-b778a000 r--p 00014000 08:01 401573 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so b778a000-b778b000 rw-p 00015000 08:01 401573 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so b778b000-b778d000 rw-p 00000000 00:00 0 b7794000-b7795000 rw-p 00000000 00:00 0 b7795000-b7796000 ---p 00000000 00:00 0 b7796000-b7799000 rw-p 00000000 00:00 0 b7799000-b779b000 rw-s 00000000 08:01 3727389 /var/lib/boinc-client/slots/1/boinc_mmap_file b779b000-b779d000 rw-p 00000000 00:00 0 b779d000-b779e000 r-xp 00000000 00:00 0 [vdso] b779e000-b77b9000 r-xp 00000000 08:01 393265 /lib/i386-linux-gnu/ld-2.13.so b77b9000-b77ba000 r--p 0001b000 08:01 393265 /lib/i386-linux-gnu/ld-2.13.so b77ba000-b77bb000 rw-p 0001c000 08:01 393265 /lib/i386-linux-gnu/ld-2.13.so bfb2c000-bfb9b000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (17 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xb779d400] [0xb779d424] /lib/i386-linux-gnu/i686/cmov/libc.so.6(gsignal+0x51)[0xb7513911] /lib/i386-linux-gnu/i686/cmov/libc.so.6(abort+0x182)[0xb7516d42] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x609d5)[0xb75499d5] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x6aac1)[0xb7553ac1] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x6c328)[0xb7555328] /lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xb75583dd] /usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xb7733abf] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb74ffe46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Dec 2011 06:52:54 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 259,200 | 735,760 | 2.8386 |
26 Nov 2011 01:37:46 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 233,280 | 658,971 | 2.8248 |
25 Nov 2011 01:54:12 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 207,360 | 581,749 | 2.8055 |
24 Nov 2011 02:24:55 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 181,440 | 504,644 | 2.7813 |
22 Nov 2011 14:50:26 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 155,520 | 430,367 | 2.7673 |
20 Nov 2011 15:44:24 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 129,600 | 358,659 | 2.7674 |
20 Nov 2011 15:44:24 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 103,680 | 286,910 | 2.7673 |
20 Nov 2011 15:44:24 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 77,760 | 215,433 | 2.7705 |
20 Nov 2011 15:44:24 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 51,840 | 143,426 | 2.7667 |
20 Nov 2011 15:44:24 | 1174556 | 13552414 | hadcm3n_yblf_1900_40_007523360_0 | 25,920 | 71,802 | 2.7701 |
©2024 cpdn.org