Name | famous_wrmx_1499_200_007122576_1 |
Workunit | 7320936 |
Created | 18 Jan 2011, 17:30:02 UTC |
Sent | 18 Jan 2011, 18:01:48 UTC |
Report deadline | 20 Apr 2011, 1:28:59 UTC |
Received | 23 Jan 2011, 10:52:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1051329 |
Run time | |
CPU time | 1 days 17 hours 5 min 30 sec |
Validate state | Invalid |
Credit | 555.96 |
Device peak FLOPS | 1.70 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.4.5</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... (10564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (10564): No heartbeat from core client for 30 sec - exiting (10564): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (11814): No heartbeat from core client for 30 sec - exiting (13689): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (13723): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (15721): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (16194): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16194): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... (17230): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17230): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (18831): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (20730): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (20730): No heartbeat from core client for 30 sec - exiting (20730): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting (20845): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (23121): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... (24617): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24647): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24690): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24954): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24954): No heartbeat from core client for 30 sec - exiting (24954): No heartbeat from core client for 30 sec - exiting (24954): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... (25757): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (26430): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (27097): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (27691): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (27691): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (1438): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1438): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting (1673): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... (2579): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (2701): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3072): No heartbeat from core client for 30 sec - exiting (3072): No heartbeat from core client for 30 sec - exiting (3072): No heartbeat from core client for 30 sec - exiting (3227): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3227): No heartbeat from core client for 30 sec - exiting (3331): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3331): No heartbeat from core client for 30 sec - exiting (3331): No heartbeat from core client for 30 sec - exiting (3331): No heartbeat from core client for 30 sec - exiting (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3419): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3419): No heartbeat from core client for 30 sec - exiting (3419): No heartbeat from core client for 30 sec - exiting (3535): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3535): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... (7567): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (7567): No heartbeat from core client for 30 sec - exiting (7567): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (9625): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9671): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (9893): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... (12028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (12028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... (13202): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (13211): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (13211): No heartbeat from core client for 30 sec - exiting (13241): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Sorry, too many model crashes! :-( (14019): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Jan 2011 10:57:00 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 168,506 | 143,647 | 0.8525 |
22 Jan 2011 22:12:01 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 159,146 | 135,640 | 0.8523 |
22 Jan 2011 18:34:21 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 149,786 | 127,795 | 0.8532 |
22 Jan 2011 06:22:00 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 140,426 | 119,915 | 0.8539 |
22 Jan 2011 03:28:16 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 131,066 | 111,951 | 0.8542 |
21 Jan 2011 19:11:26 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 121,706 | 103,931 | 0.8540 |
21 Jan 2011 13:37:36 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 112,346 | 95,930 | 0.8539 |
21 Jan 2011 13:37:36 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 102,986 | 87,880 | 0.8533 |
21 Jan 2011 11:56:36 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 93,626 | 79,777 | 0.8521 |
20 Jan 2011 20:10:26 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 84,266 | 71,734 | 0.8513 |
20 Jan 2011 14:31:29 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 74,906 | 63,824 | 0.8521 |
20 Jan 2011 09:15:57 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 65,546 | 55,964 | 0.8538 |
20 Jan 2011 03:39:28 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 56,186 | 47,914 | 0.8528 |
19 Jan 2011 23:53:07 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 46,826 | 39,887 | 0.8518 |
19 Jan 2011 20:08:08 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 37,466 | 31,773 | 0.8480 |
19 Jan 2011 16:44:54 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 28,106 | 24,144 | 0.8590 |
19 Jan 2011 10:52:38 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 18,746 | 16,202 | 0.8643 |
19 Jan 2011 10:52:38 | 1051329 | 12503627 | famous_wrmx_1499_200_007122576_1 | 9,386 | 8,065 | 0.8593 |
©2024 climateprediction.net