Name | famous_u740_1799_200_006719768_1 |
Workunit | 6923021 |
Created | 10 Dec 2010, 9:30:14 UTC |
Sent | 10 Dec 2010, 9:35:12 UTC |
Report deadline | 11 Mar 2011, 17:02:23 UTC |
Received | 9 Jan 2011, 22:04:57 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 954460 |
Run time | 15 days 12 hours 35 min 26 sec |
CPU time | 8 days 5 hours 41 min 39 sec |
Validate state | Invalid |
Credit | 4,724.98 |
Device peak FLOPS | 1.75 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... (15456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... (3220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (4667): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5159): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(5214): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5881): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6359): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6547): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6990): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (7650): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (8069): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (8306): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (8500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (8776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9517): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9571): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9937): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(10421): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (10674): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11327): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11517): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11599): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (12119): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (13809): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (14229): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (14496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (14839): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (15505): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (15779): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16102): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16670): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16923): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(17410): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... (21122): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (21263): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (21724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (21807): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (22200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (22564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (22916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (23393): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (23432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (23992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24230): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. 
tmp/pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/famous_u740_1799_200_006719768/dataout/ocean_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( (1879): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
31 Dec 2010 01:17:33 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,432,106 | 709,793 | 0.4956 |
30 Dec 2010 22:44:01 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,422,746 | 705,364 | 0.4958 |
30 Dec 2010 20:13:51 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,413,386 | 700,696 | 0.4958 |
30 Dec 2010 18:22:54 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,404,026 | 696,027 | 0.4957 |
30 Dec 2010 14:48:49 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,394,666 | 691,356 | 0.4957 |
30 Dec 2010 12:15:35 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,385,306 | 686,696 | 0.4957 |
30 Dec 2010 09:36:45 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,375,946 | 682,228 | 0.4958 |
30 Dec 2010 06:52:28 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,366,586 | 677,555 | 0.4958 |
29 Dec 2010 21:27:47 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,357,226 | 672,891 | 0.4958 |
29 Dec 2010 21:27:47 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,347,866 | 668,225 | 0.4958 |
29 Dec 2010 21:27:47 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,338,506 | 663,573 | 0.4958 |
29 Dec 2010 21:27:47 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,329,146 | 658,916 | 0.4957 |
29 Dec 2010 10:13:10 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,319,786 | 654,261 | 0.4957 |
29 Dec 2010 07:33:28 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,310,426 | 649,613 | 0.4957 |
29 Dec 2010 04:56:58 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,301,066 | 644,967 | 0.4957 |
29 Dec 2010 02:27:40 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,291,706 | 640,503 | 0.4959 |
28 Dec 2010 23:48:45 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,282,346 | 635,857 | 0.4959 |
28 Dec 2010 21:16:44 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,272,986 | 631,208 | 0.4958 |
28 Dec 2010 18:42:16 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,263,626 | 626,554 | 0.4958 |
28 Dec 2010 16:07:51 | 954460 | 12375826 | famous_u740_1799_200_006719768_1 | 1,254,266 | 622,110 | 0.4960 |
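
The "Average (sec/TS)" column appears consistent with the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. This is an inference from the figures in the table, not a documented definition. The short sketch below recomputes the value for three rows copied from the table; the function name `average_sec_per_timestep` is hypothetical and used only for illustration.

```python
def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps.

    Assumption: this is how the 'Average (sec/TS)' column is derived;
    the page itself does not state the formula.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# (timestep, cpu_time_sec, reported average) copied from three table rows
rows = [
    (1_432_106, 709_793, 0.4956),
    (1_422_746, 705_364, 0.4958),
    (1_254_266, 622_110, 0.4960),
]

for timestep, cpu_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_sec, timestep)
    # Print the recomputed value next to the value shown on the page
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

For the three rows checked, the recomputed value matches the reported one to four decimal places.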