Name | famous_uml0_1199_200_006662111_0 |
Workunit | 6865483 |
Created | 10 Jun 2010, 15:16:27 UTC |
Sent | 7 Jul 2010, 20:44:15 UTC |
Report deadline | 7 Oct 2010, 4:11:26 UTC |
Received | 12 Aug 2010, 11:41:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1061563 |
Run time | 11 days 1 hours 38 min 31 sec |
CPU time | 8 days 20 hours 3 min 52 sec |
Validate state | Invalid |
Credit | 5,095.56 |
Device peak FLOPS | 2.26 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4552, iMonCtr=1
Model crash detected, will try to restart...
08:04:45 (5496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:07:51 (5308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:16:13 (4636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:33:40 (5232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:48:31 (4360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:03:32 (4664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:05:53 (5580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:35:41 (2356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:50:09 (4376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:39:53 (4820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:34:33 (4516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:43:20 (5604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:02:04 (5076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:35:29 (4464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:59:19 (2040): No heartbeat from core client for 30 sec - exiting
12:59:21 (2040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:23:01 (524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:38:41 (4976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:47:52 (4352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
11:46:33 (4348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:28:02 (5488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:43:09 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
09:55:54 (4320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:55:55 (4320): No heartbeat from core client for 30 sec - exiting
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
10:53:41 (760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:19:47 (1864): No heartbeat from core client for 30 sec - exiting
23:19:48 (1864): No heartbeat from core client for 30 sec - exiting
23:19:49 (1864): No heartbeat from core client for 30 sec - exiting
23:19:50 (1864): No heartbeat from core client for 30 sec - exiting
23:19:51 (1864): No heartbeat from core client for 30 sec - exiting
23:19:52 (1864): No heartbeat from core client for 30 sec - exiting
23:19:53 (1864): No heartbeat from core client for 30 sec - exiting
23:19:54 (1864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:25:00 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_uml0_1199_200_006662111/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy
Sorry, too many model crashes! :-(
12:41:21 (4924): called boinc_finish
</stderr_txt>
]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 Aug 2010 10:46:42 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,544,426 | 761,008 | 0.4927 |
11 Aug 2010 09:26:34 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,535,066 | 756,448 | 0.4928 |
10 Aug 2010 21:32:03 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,525,706 | 751,088 | 0.4923 |
10 Aug 2010 19:50:50 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,516,346 | 746,448 | 0.4923 |
10 Aug 2010 18:20:31 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,506,986 | 741,889 | 0.4923 |
10 Aug 2010 17:27:15 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,497,626 | 737,346 | 0.4923 |
10 Aug 2010 14:22:45 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,488,266 | 732,484 | 0.4922 |
10 Aug 2010 12:42:17 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,478,906 | 727,817 | 0.4921 |
10 Aug 2010 11:21:40 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,469,546 | 723,206 | 0.4921 |
10 Aug 2010 09:51:07 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,460,186 | 718,567 | 0.4921 |
10 Aug 2010 09:02:12 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,450,826 | 713,809 | 0.4920 |
09 Aug 2010 20:42:24 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,441,466 | 709,013 | 0.4919 |
09 Aug 2010 18:52:53 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,432,106 | 704,385 | 0.4919 |
09 Aug 2010 17:16:49 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,422,746 | 699,294 | 0.4915 |
09 Aug 2010 15:49:04 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,413,386 | 694,422 | 0.4913 |
08 Aug 2010 18:16:48 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,404,026 | 689,810 | 0.4913 |
07 Aug 2010 19:04:06 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,394,666 | 685,305 | 0.4914 |
07 Aug 2010 17:45:02 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,385,306 | 680,854 | 0.4915 |
07 Aug 2010 16:31:07 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,375,946 | 676,434 | 0.4916 |
07 Aug 2010 14:23:50 | 1061563 | 11561410 | famous_uml0_1199_200_006662111_0 | 1,366,586 | 671,933 | 0.4917 |
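
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The following is a minimal Python sketch of that arithmetic, assuming this interpretation is correct; the function name `average_sec_per_timestep` is hypothetical and not part of any CPDN or BOINC tooling. The sample values are copied from the first, second, and last rows of the table.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the table's "Average (sec/TS)" is simply CPU Time / Timestep.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs taken from rows of the trickle table
sample_trickles = [
    (1_544_426, 761_008),  # 11 Aug 2010 10:46:42
    (1_535_066, 756_448),  # 11 Aug 2010 09:26:34
    (1_366_586, 671_933),  # 07 Aug 2010 14:23:50
]

for timestep, cpu_time_sec in sample_trickles:
    avg = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep:>9,}  cpu={cpu_time_sec:>7,} s  avg={avg:.4f} sec/TS")
```

Under this assumption the three computed averages agree with the values listed in the table (0.4927, 0.4928, and 0.4917). Consecutive rows also differ by 9,360 timesteps, which suggests one trickle per fixed block of model timesteps.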
©2024 cpdn.org