Name | famous_s3am_999_200_006727857_0 |
Workunit | 6931198 |
Created | 9 Oct 2010, 21:49:27 UTC |
Sent | 30 Oct 2010, 8:46:10 UTC |
Report deadline | 29 Jan 2011, 16:13:21 UTC |
Received | 13 Nov 2010, 17:42:28 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1075728 |
Run time | 2 days 16 hours 54 min 21 sec |
CPU time | 1 days 18 hours 47 min 1 sec |
Validate state | Invalid |
Credit | 1,111.83 |
Device peak FLOPS | 2.58 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4572, selfPID=4572, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6556, selfPID=6556, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6728, selfPID=6728, iMonCtr=1 No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4980, selfPID=4980, iMonCtr=1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6732, selfPID=6732, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4376, selfPID=4376, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=6088, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3288, selfPID=3288, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6300, selfPID=6300, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6724, selfPID=6724, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1208, selfPID=1208, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5216, selfPID=5216, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6612, selfPID=6612, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=6140, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4308, selfPID=4308, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=5864, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3564, selfPID=3564, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=5680, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2196, selfPID=2196, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5188, selfPID=5188, iMonCtr=1 19:01:23 (4716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:01:24 (4716): No heartbeat from core client for 30 sec - exiting 19:01:25 (4716): No heartbeat from core client for 30 sec - exiting 19:01:26 (4716): No heartbeat from core client for 30 sec - exiting 19:01:27 (4716): No heartbeat from core client for 30 sec - exiting 19:01:28 (4716): No heartbeat from core client for 30 sec - exiting 19:01:29 (4716): No heartbeat from core client for 30 sec - exiting 19:01:30 (4716): No heartbeat from core client for 30 sec - exiting 19:01:31 (4716): No heartbeat from core client for 30 sec - exiting 19:01:32 (4716): No heartbeat from core client for 30 sec - exiting 19:01:33 (4716): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2260, selfPID=2260, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2236, selfPID=2236, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4732, selfPID=4732, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2108, selfPID=2108, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2740, selfPID=2740, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5096, selfPID=5096, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=5556, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1372, selfPID=1372, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1532, selfPID=1532, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=4324, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2092, selfPID=2092, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5264, selfPID=5264, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1308, selfPID=1308, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1484, selfPID=1484, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2552, selfPID=2552, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=5352, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5708, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2516, selfPID=2516, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=4316, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5244, selfPID=5244, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4788, selfPID=4788, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2336, selfPID=2336, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4800, selfPID=4800, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=4892, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3280, selfPID=3280, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=5984, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=7032, iMonCtr=1 No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6720, selfPID=6720, iMonCtr=1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5908, iMonCtr=1 Model crash detected, will try to restart... 16:26:06 (4580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:26:08 (4580): No heartbeat from core client for 30 sec - exiting 16:26:09 (4580): No heartbeat from core client for 30 sec - exiting 16:26:10 (4580): No heartbeat from core client for 30 sec - exiting 16:26:11 (4580): No heartbeat from core client for 30 sec - exiting 16:26:12 (4580): No heartbeat from core client for 30 sec - exiting 16:26:13 (4580): No heartbeat from core client for 30 sec - exiting 16:26:14 (4580): No heartbeat from core client for 30 sec - exiting 16:26:15 (4580): No heartbeat from core client for 30 sec - exiting 16:26:16 (4580): No heartbeat from core client for 30 sec - exiting 16:26:17 (4580): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1128, selfPID=1128, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1800, selfPID=1800, iMonCtr=1 16:44:04 (3636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:44:05 (3636): No heartbeat from core client for 30 sec - exiting 16:44:07 (3636): No heartbeat from core client for 30 sec - exiting 16:44:08 (3636): No heartbeat from core client for 30 sec - exiting 16:44:09 (3636): No heartbeat from core client for 30 sec - exiting 16:44:10 (3636): No heartbeat from core client for 30 sec - exiting 16:44:11 (3636): No heartbeat from core client for 30 sec - exiting 16:44:12 (3636): No heartbeat from core client for 30 sec - exiting 16:44:13 (3636): No heartbeat from core client for 30 sec - exiting 16:44:14 (3636): No heartbeat from core client for 30 sec - exiting 16:44:15 (3636): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4144, selfPID=4144, iMonCtr=1 17:22:50 (3720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:22:52 (3720): No heartbeat from core client for 30 sec - exiting 17:22:53 (3720): No heartbeat from core client for 30 sec - exiting 17:22:54 (3720): No heartbeat from core client for 30 sec - exiting 17:22:55 (3720): No heartbeat from core client for 30 sec - exiting 17:22:56 (3720): No heartbeat from core client for 30 sec - exiting 17:22:57 (3720): No heartbeat from core client for 30 sec - exiting 17:22:58 (3720): No heartbeat from core client for 30 sec - exiting 17:22:59 (3720): No heartbeat from core client for 30 sec - exiting 17:23:01 (3720): No heartbeat from core client for 30 sec - exiting 17:23:02 (3720): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4268, selfPID=4268, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2920, selfPID=2920, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5356, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4304, selfPID=4304, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=496, selfPID=496, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5516, selfPID=5516, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Nov 2010 01:03:05 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 336,986 | 150,476 | 0.4465 |
12 Nov 2010 22:32:42 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 327,626 | 146,618 | 0.4475 |
12 Nov 2010 19:16:27 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 318,266 | 142,513 | 0.4478 |
12 Nov 2010 00:13:20 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 308,906 | 138,278 | 0.4476 |
11 Nov 2010 20:16:48 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 299,546 | 134,162 | 0.4479 |
11 Nov 2010 00:09:00 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 290,186 | 129,725 | 0.4470 |
10 Nov 2010 19:49:10 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 280,826 | 125,528 | 0.4470 |
09 Nov 2010 23:36:47 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 271,466 | 121,367 | 0.4471 |
09 Nov 2010 19:19:38 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 262,106 | 116,740 | 0.4454 |
09 Nov 2010 00:01:38 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 252,746 | 112,736 | 0.4460 |
08 Nov 2010 19:35:09 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 243,386 | 108,580 | 0.4461 |
08 Nov 2010 17:11:51 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 234,026 | 104,188 | 0.4452 |
07 Nov 2010 22:29:36 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 224,666 | 100,279 | 0.4463 |
07 Nov 2010 17:48:09 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 215,306 | 95,932 | 0.4456 |
07 Nov 2010 16:38:39 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 205,946 | 91,690 | 0.4452 |
07 Nov 2010 16:38:39 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 196,586 | 87,295 | 0.4441 |
06 Nov 2010 17:41:47 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 187,226 | 83,281 | 0.4448 |
06 Nov 2010 14:40:24 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 177,866 | 79,091 | 0.4447 |
06 Nov 2010 11:46:11 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 168,506 | 74,905 | 0.4445 |
06 Nov 2010 11:41:03 | 1075728 | 11924573 | famous_s3am_999_200_006727857_0 | 159,146 | 70,466 | 0.4428 |
©2024 cpdn.org