Name | famous_w3cm_599_200_006751030_0 |
Workunit | 6954346 |
Created | 12 Nov 2010, 12:28:14 UTC |
Sent | 15 Nov 2010, 22:24:27 UTC |
Report deadline | 15 Feb 2011, 5:51:38 UTC |
Received | 9 Dec 2010, 21:45:00 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 835150 |
Run time | 11 days 16 hours 48 min 27 sec |
CPU time | 11 days 0 hours 50 min 21 sec |
Validate state | Invalid |
Credit | 5,558.78 |
Device peak FLOPS | 2.07 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:35:27 (11832): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:27:17 (273556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:28:48 (705768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:30:19 (705212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:31:50 (706412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:33:21 (702424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:34:52 (706304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:39:25 (705716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:40:56 (705360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:42:27 (705252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:43:58 (705468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:45:29 (706096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:47:00 (702876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:48:31 (705088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:02 (706268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:54:35 (706436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:56:06 (704892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:00:39 (632356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:05:12 (706012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:09:45 (707192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:17:13 (706096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Atmos Restart file copy failed on w3cmla#da0000007807g+ Atmos Hold Restart file rename failed on atmos_restart.hold 02:43:16 (1038188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/famous_w3cm_599_200_006751030/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 10:27:42 (5912): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Dec 2010 18:29:04 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,684,826 | 950,404 | 0.5641 |
09 Dec 2010 18:29:04 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,675,466 | 945,169 | 0.5641 |
09 Dec 2010 18:29:04 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,666,106 | 939,913 | 0.5641 |
09 Dec 2010 18:29:04 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,656,746 | 934,677 | 0.5642 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,647,386 | 929,414 | 0.5642 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,638,026 | 924,182 | 0.5642 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,628,666 | 918,960 | 0.5642 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,619,306 | 913,762 | 0.5643 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,609,946 | 908,492 | 0.5643 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,600,586 | 903,224 | 0.5643 |
09 Dec 2010 18:29:03 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,591,226 | 897,927 | 0.5643 |
08 Dec 2010 15:42:15 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,581,866 | 892,509 | 0.5642 |
08 Dec 2010 14:11:53 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,572,506 | 887,285 | 0.5642 |
08 Dec 2010 10:40:00 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,563,146 | 881,991 | 0.5642 |
08 Dec 2010 09:12:23 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,553,786 | 876,700 | 0.5642 |
08 Dec 2010 09:12:23 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,544,426 | 871,405 | 0.5642 |
08 Dec 2010 09:12:23 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,535,066 | 866,095 | 0.5642 |
08 Dec 2010 09:12:23 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,525,706 | 860,816 | 0.5642 |
08 Dec 2010 00:39:36 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,516,346 | 855,514 | 0.5642 |
07 Dec 2010 23:10:05 | 835150 | 11998703 | famous_w3cm_599_200_006751030_0 | 1,506,986 | 850,196 | 0.5642 |
©2024 cpdn.org