Name | famous_v0ei_599_200_006686296_4 |
Workunit | 6889549 |
Created | 26 Aug 2010, 15:39:54 UTC |
Sent | 30 Aug 2010, 4:39:19 UTC |
Report deadline | 29 Nov 2010, 12:06:30 UTC |
Received | 19 Sep 2010, 18:48:36 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1086791 |
Run time | 12 days 6 hours 44 min 26 sec |
CPU time | 11 days 15 hours 58 min 2 sec |
Validate state | Invalid |
Credit | 4,416.16 |
Device peak FLOPS | 2.60 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> Het station kan een bepaald gebied of spoor op de schijf niet vinden. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3812, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2560, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 00:43:36 (4240): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1 Model crash detected, will try to restart... 00:54:00 (2716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:54:01 (2716): No heartbeat from core client for 30 sec - exiting 00:54:02 (2716): No heartbeat from core client for 30 sec - exiting 00:54:03 (2716): No heartbeat from core client for 30 sec - exiting 00:54:04 (2716): No heartbeat from core client for 30 sec - exiting 00:54:05 (2716): No heartbeat from core client for 30 sec - exiting 00:54:06 (2716): No heartbeat from core client for 30 sec - exiting 00:54:07 (2716): No heartbeat from core client for 30 sec - exiting 00:54:08 (2716): No heartbeat from core client for 30 sec - exiting 00:54:10 (2716): No heartbeat from core client for 30 sec - exiting 00:54:11 (2716): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=612, selfPID=612, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=416, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... 20:47:30 (4272): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Sep 2010 21:52:57 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,338,506 | 619,266 | 0.4627 |
11 Sep 2010 20:38:01 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,329,146 | 614,798 | 0.4626 |
11 Sep 2010 19:16:49 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,319,786 | 610,342 | 0.4625 |
11 Sep 2010 17:55:32 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,310,426 | 605,908 | 0.4624 |
11 Sep 2010 16:41:06 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,301,066 | 601,478 | 0.4623 |
11 Sep 2010 15:22:09 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,291,706 | 596,855 | 0.4621 |
11 Sep 2010 14:02:09 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,282,346 | 592,319 | 0.4619 |
11 Sep 2010 12:41:35 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,272,986 | 587,875 | 0.4618 |
11 Sep 2010 11:15:45 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,263,626 | 583,457 | 0.4617 |
11 Sep 2010 01:57:53 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,254,266 | 579,007 | 0.4616 |
11 Sep 2010 00:36:21 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,244,906 | 574,585 | 0.4615 |
11 Sep 2010 00:07:04 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,235,546 | 570,162 | 0.4615 |
10 Sep 2010 21:37:32 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,226,186 | 565,697 | 0.4613 |
10 Sep 2010 20:23:21 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,216,826 | 561,246 | 0.4612 |
10 Sep 2010 19:01:48 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,207,466 | 556,767 | 0.4611 |
10 Sep 2010 17:39:56 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,198,106 | 552,245 | 0.4609 |
10 Sep 2010 16:03:22 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,188,746 | 547,824 | 0.4608 |
10 Sep 2010 13:56:53 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,179,386 | 543,405 | 0.4608 |
10 Sep 2010 12:39:50 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,170,026 | 538,949 | 0.4606 |
10 Sep 2010 11:18:29 | 1086791 | 11691253 | famous_v0ei_599_200_006686296_4 | 1,160,666 | 534,386 | 0.4604 |
©2025 cpdn.org