|
Name | famous_vlan_1899_200_006713365_3 |
Workunit | 6916618 |
Created | 26 Aug 2010, 17:36:39 UTC |
Sent | 2 Nov 2010, 17:41:13 UTC |
Report deadline | 2 Feb 2011, 1:08:24 UTC |
Received | 17 Nov 2010, 14:24:24 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1099887 |
Run time | 2 days 8 hours 18 min 27 sec |
CPU time | 2 days 6 hours 50 min 46 sec |
Validate state | Invalid |
Credit | 2,902.96 |
Device peak FLOPS | 3.00 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-apple-darwin |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> (42486): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (47186): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (49937): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50117): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (50211): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50329): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50339): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50361): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50374): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (50401): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (51812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (51910): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (52467): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 MainError: 12:56:06 AM No files match the supplied pattern. MainError: 12:56:06 AM No files match the supplied pattern. (52565): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (55165): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56235): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (56308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56370): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56387): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56430): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56481): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (56851): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (56925): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (58101): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (58200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (58689): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (58762): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (59192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (59262): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (59358): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (60218): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60271): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60309): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60330): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60385): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60406): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60437): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60473): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (60502): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... SIGSEGV: segmentation violation Crashed executable name: famous_um_6.11_i686-apple-darwin built using BOINC library version 6.11.1 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.5 build 10H574 Mon Nov 15 18:26:12 2010 0 0x0048fdcf PrintBacktrace (in famous_um_6.11_i686-apple-darwin) + 1221 1 0x0049054a boinc_catch_signal (in famous_um_6.11_i686-apple-darwin) + 474 2 0x92b3f46b 0x92b3f46b 3 0xffffffff 0xffffffff 4 0x0134e6b3 0x134e6b3 5 0x0134d44f 0x134d44f 6 0x002eeac6 readcntl (in famous_um_6.11_i686-apple-darwin) + 2598 7 0x003f55ea um_setup (in famous_um_6.11_i686-apple-darwin) + 458 8 0x00452abd um_shell (in famous_um_6.11_i686-apple-darwin) + 11429 9 0x004698c0 main (in famous_um_6.11_i686-apple-darwin) + 1112 10 0x000026b6 _start (in famous_um_6.11_i686-apple-darwin) + 216 11 0x000025dd start (in famous_um_6.11_i686-apple-darwin) + 41 Thread 0 crashed with X86 Thread State (32-bit): eax: 0xffffffe1 ebx: 0x00000003 ecx: 0xbfffc3ac edx: 0x92ad90fa edi: 0x00000000 esi: 0x00000000 ebp: 0xbfffc3e8 esp: 0xbfffc3ac ss: 0x0000001f efl: 0x00000206 eip: 0x92ad90fa cs: 0x00000007 ds: 0x0000001f es: 0x0000001f fs: 0x00000000 gs: 0x00000037 Binary Images Description: 0x1000 - 0x517fff /Library/Application Support/BOINC Data/projects/climateprediction.net/famous_um_6.11_i686-apple-darwin 0x12a9000 - 0x12c4fff /Library/Application Support/BOINC Data/projects/climateprediction.net/famous_vlan_1899_200_006713365/lib/libsvml.dylib 0x131c000 - 0x13b3fff /Library/Application Support/BOINC Data/projects/climateprediction.net/famous_vlan_1899_200_006713365/lib/libifcoremt.dylib 0x13fc000 - 0x1556fff /Library/Application Support/BOINC Data/projects/climateprediction.net/famous_vlan_1899_200_006713365/lib/libimf.dylib 0x1669000 - 0x1697fff /Library/Application Support/BOINC Data/projects/climateprediction.net/famous_vlan_1899_200_006713365/lib/libirc.dylib 0x90e49000 - 0x90e4cfff /usr/lib/system/libmathCommon.A.dylib 0x9286c000 - 0x9287afff /usr/lib/libz.1.dylib 0x92ad8000 - 0x92c7ffff /usr/lib/libSystem.B.dylib Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62828, iMonCtr=1 Model crash detected, will try to restart... (62828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (62918): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (62936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (63192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (63362): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (66111): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (67378): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (67389): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (67411): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( (67424): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Nov 2010 13:45:24 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 879,866 | 196,226 | 0.2230 |
17 Nov 2010 13:08:24 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 870,506 | 194,123 | 0.2230 |
17 Nov 2010 12:34:22 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 861,146 | 192,015 | 0.2230 |
17 Nov 2010 11:53:46 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 851,786 | 189,901 | 0.2229 |
17 Nov 2010 11:17:23 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 842,426 | 187,778 | 0.2229 |
17 Nov 2010 10:40:59 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 833,066 | 185,653 | 0.2229 |
17 Nov 2010 10:04:45 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 823,706 | 183,519 | 0.2228 |
17 Nov 2010 09:28:04 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 814,346 | 181,386 | 0.2227 |
17 Nov 2010 08:49:11 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 804,986 | 179,253 | 0.2227 |
17 Nov 2010 08:24:55 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 795,626 | 177,119 | 0.2226 |
17 Nov 2010 07:38:17 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 786,266 | 174,993 | 0.2226 |
17 Nov 2010 07:01:22 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 776,906 | 172,876 | 0.2225 |
17 Nov 2010 06:24:53 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 767,546 | 170,769 | 0.2225 |
17 Nov 2010 05:48:36 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 758,186 | 168,675 | 0.2225 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 748,826 | 166,586 | 0.2225 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 739,466 | 164,494 | 0.2224 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 730,106 | 162,402 | 0.2224 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 720,746 | 160,314 | 0.2224 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 711,386 | 158,240 | 0.2224 |
17 Nov 2010 05:37:21 | 1099887 | 11826790 | famous_vlan_1899_200_006713365_3 | 702,026 | 156,268 | 0.2226 |
©2024 climateprediction.net