Name | famous_xa62_1599_200_007075009_0 |
Workunit | 7278309 |
Created | 18 Dec 2010, 14:10:13 UTC |
Sent | 3 Jan 2011, 2:25:04 UTC |
Report deadline | 4 Apr 2011, 9:52:15 UTC |
Received | 1 Feb 2011, 19:34:19 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1100225 |
Run time | 10 days 10 hours 34 min 25 sec |
CPU time | 10 days 3 hours 36 min 47 sec |
Validate state | Invalid |
Credit | 6,083.77 |
Device peak FLOPS | 3.04 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3096, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3096, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:10:57 (3896): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 12:11:00 (3896): No heartbeat from core client for 30 sec - exiting 12:11:01 (3896): No heartbeat from core client for 30 sec - exiting 12:11:02 (3896): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Model crashed: STASH : Unsupported MPP submodel tmp/pipe_dummy Sorry, too many model crashes! :-( 20:51:04 (4036): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Feb 2011 19:38:10 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,843,946 | 873,492 | 0.4737 |
01 Feb 2011 19:38:10 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,834,586 | 869,275 | 0.4738 |
01 Feb 2011 00:04:58 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,825,226 | 865,030 | 0.4739 |
31 Jan 2011 22:56:10 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,815,866 | 860,754 | 0.4740 |
31 Jan 2011 21:47:45 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,806,506 | 856,599 | 0.4742 |
31 Jan 2011 19:42:50 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,797,146 | 852,325 | 0.4743 |
31 Jan 2011 19:42:50 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,787,786 | 847,863 | 0.4743 |
31 Jan 2011 19:42:50 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,778,426 | 843,338 | 0.4742 |
31 Jan 2011 00:16:38 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,769,066 | 838,793 | 0.4741 |
30 Jan 2011 22:59:40 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,759,706 | 834,153 | 0.4740 |
30 Jan 2011 21:41:11 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,750,346 | 829,603 | 0.4740 |
30 Jan 2011 20:22:09 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,740,986 | 825,078 | 0.4739 |
30 Jan 2011 18:47:09 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,731,626 | 820,320 | 0.4737 |
30 Jan 2011 18:25:27 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,722,266 | 815,421 | 0.4735 |
30 Jan 2011 03:56:47 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,712,906 | 810,484 | 0.4732 |
30 Jan 2011 02:29:51 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,703,546 | 805,565 | 0.4729 |
30 Jan 2011 01:02:31 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,694,186 | 800,631 | 0.4726 |
29 Jan 2011 23:40:21 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,684,826 | 795,714 | 0.4723 |
29 Jan 2011 22:18:12 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,675,466 | 790,810 | 0.4720 |
29 Jan 2011 20:55:10 | 1100225 | 12406499 | famous_xa62_1599_200_007075009_0 | 1,666,106 | 785,848 | 0.4717 |
©2024 cpdn.org