Name | famous_vlsb_1999_200_006714001_3 |
Workunit | 6917254 |
Created | 26 Aug 2010, 17:42:25 UTC |
Sent | 1 Nov 2010, 15:24:17 UTC |
Report deadline | 31 Jan 2011, 22:51:28 UTC |
Received | 9 Nov 2010, 7:43:50 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 922180 |
Run time | 3 days 17 hours 48 min 38 sec |
CPU time | 3 days 18 hours 49 min 27 sec |
Validate state | Invalid |
Credit | 2,316.21 |
Device peak FLOPS | 2.28 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5056, selfPID=5056, iMonCtr=1 21:55:14 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:55:15 (3128): No heartbeat from core client for 30 sec - exiting 21:55:16 (3128): No heartbeat from core client for 30 sec - exiting 21:55:17 (3128): No heartbeat from core client for 30 sec - exiting 21:55:18 (3128): No heartbeat from core client for 30 sec - exiting 21:55:19 (3128): No heartbeat from core client for 30 sec - exiting 21:55:20 (3128): No heartbeat from core client for 30 sec - exiting 21:55:21 (3128): No heartbeat from core client for 30 sec - exiting 21:55:22 (3128): No heartbeat from core client for 30 sec - exiting 21:55:23 (3128): No heartbeat from core client for 30 sec - exiting 21:55:24 (3128): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1992, selfPID=1992, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
08:57:55 (2992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:57:56 (2992): No heartbeat from core client for 30 sec - exiting 08:57:57 (2992): No heartbeat from core client for 30 sec - exiting 08:57:58 (2992): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5592, selfPID=5592, iMonCtr=1 20:13:38 (496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:13:39 (496): No heartbeat from core client for 30 sec - exiting 20:13:40 (496): No heartbeat from core client for 30 sec - exiting 20:13:41 (496): No heartbeat from core client for 30 sec - exiting 20:13:42 (496): No heartbeat from core client for 30 sec - exiting 20:13:43 (496): No heartbeat from core client for 30 sec - exiting 20:13:44 (496): No heartbeat from core client for 30 sec - exiting 20:13:45 (496): No heartbeat from core client for 30 sec - exiting 20:13:46 (496): No heartbeat from core client for 30 sec - exiting 20:13:47 (496): No heartbeat from core client for 30 sec - exiting 20:13:48 (496): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7272, selfPID=7272, iMonCtr=1 01:18:44 (2812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
01:18:45 (2812): No heartbeat from core client for 30 sec - exiting 01:18:46 (2812): No heartbeat from core client for 30 sec - exiting 01:18:47 (2812): No heartbeat from core client for 30 sec - exiting 01:18:48 (2812): No heartbeat from core client for 30 sec - exiting 01:18:49 (2812): No heartbeat from core client for 30 sec - exiting 01:18:50 (2812): No heartbeat from core client for 30 sec - exiting 01:18:51 (2812): No heartbeat from core client for 30 sec - exiting 01:18:52 (2812): No heartbeat from core client for 30 sec - exiting 01:18:53 (2812): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7704, selfPID=7704, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:43:19 (8244): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 16:43:20 (8244): No heartbeat from core client for 30 sec - exiting 16:43:21 (8244): No heartbeat from core client for 30 sec - exiting 16:43:22 (8244): No heartbeat from core client for 30 sec - exiting 16:43:23 (8244): No heartbeat from core client for 30 sec - exiting 16:43:24 (8244): No heartbeat from core client for 30 sec - exiting 16:43:25 (8244): No heartbeat from core client for 30 sec - exiting 16:43:26 (8244): No heartbeat from core client for 30 sec - exiting 16:43:27 (8244): No heartbeat from core client for 30 sec - exiting 16:43:28 (8244): No heartbeat from core client for 30 sec - exiting 16:43:29 (8244): No heartbeat from core client for 30 sec - exiting forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... 
forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_vlsb_1999_200_006714001\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9104, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 23:01:54 (9104): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Nov 2010 22:04:10 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 702,026 | 324,170 | 0.4618 |
08 Nov 2010 20:51:48 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 692,666 | 319,699 | 0.4615 |
08 Nov 2010 19:41:47 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 683,306 | 315,223 | 0.4613 |
08 Nov 2010 18:29:24 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 673,946 | 310,709 | 0.4610 |
08 Nov 2010 17:16:56 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 664,586 | 306,210 | 0.4608 |
08 Nov 2010 16:03:43 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 655,226 | 301,658 | 0.4604 |
08 Nov 2010 14:53:46 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 645,866 | 297,118 | 0.4600 |
08 Nov 2010 13:41:29 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 636,506 | 292,610 | 0.4597 |
08 Nov 2010 12:32:57 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 627,146 | 288,124 | 0.4594 |
08 Nov 2010 11:20:37 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 617,786 | 283,647 | 0.4591 |
08 Nov 2010 10:07:44 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 608,426 | 279,173 | 0.4588 |
08 Nov 2010 09:01:14 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 599,066 | 274,705 | 0.4586 |
08 Nov 2010 07:44:10 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 589,706 | 270,230 | 0.4582 |
08 Nov 2010 06:31:35 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 580,346 | 265,782 | 0.4580 |
08 Nov 2010 05:39:51 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 570,986 | 261,389 | 0.4578 |
08 Nov 2010 05:39:51 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 561,626 | 257,024 | 0.4576 |
08 Nov 2010 05:39:51 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 552,266 | 252,588 | 0.4574 |
08 Nov 2010 05:39:51 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 542,906 | 248,225 | 0.4572 |
08 Nov 2010 00:26:54 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 533,546 | 243,948 | 0.4572 |
07 Nov 2010 23:17:51 | 922180 | 11829980 | famous_vlsb_1999_200_006714001_3 | 524,186 | 239,665 | 0.4572 |
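
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the rows shown, not something the page states. A minimal Python sketch under that assumption follows; the function name `average_sec_per_timestep` is hypothetical, and the two sample rows are copied from the first and last trickles in the table.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: this is how the 'Average (sec/TS)' column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) taken from the trickle table
rows = [
    (702_026, 324_170),  # 08 Nov 2010 22:04:10
    (524_186, 239_665),  # 07 Nov 2010 23:17:51
]

for timestep, cpu_time in rows:
    avg = average_sec_per_timestep(cpu_time, timestep)
    print(f"{timestep:>9,} timesteps  {cpu_time:>9,} s  {avg:.4f} sec/TS")
```

Both computed values agree with the table (0.4618 and 0.4572), which is consistent with the assumed formula.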
©2024 cpdn.org