Name | famous_wsem_1999_200_007123573_0 |
Workunit | 7321933 |
Created | 16 Jan 2011, 16:58:19 UTC |
Sent | 16 Jan 2011, 20:27:38 UTC |
Report deadline | 18 Apr 2011, 3:54:49 UTC |
Received | 3 Feb 2011, 16:47:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1066945 |
Run time | 5 days 22 hours 17 min 18 sec |
CPU time | 5 days 6 hours 0 min 43 sec |
Validate state | Invalid |
Credit | 2,655.91 |
Device peak FLOPS | 2.28 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=1 Model crash detected, will try to restart... 11:23:17 (1528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:23:18 (1528): No heartbeat from core client for 30 sec - exiting 11:23:19 (1528): No heartbeat from core client for 30 sec - exiting 11:23:20 (1528): No heartbeat from core client for 30 sec - exiting 11:23:21 (1528): No heartbeat from core client for 30 sec - exiting 11:23:22 (1528): No heartbeat from core client for 30 sec - exiting 11:23:23 (1528): No heartbeat from core client for 30 sec - exiting 11:23:24 (1528): No heartbeat from core client for 30 sec - exiting 11:23:25 (1528): No heartbeat from core client for 30 sec - exiting 11:23:26 (1528): No heartbeat from core client for 30 sec - exiting 11:23:27 (1528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 19:19:36 (4048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:20:21 (3436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:21:40 (4676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3492, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4064, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1 Model crash detected, will try to restart... 09:25:06 (3656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:25:48 (4060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:26:26 (2492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1 Model crash detected, will try to restart... 09:05:01 (3472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:06:23 (2016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:08:43 (3568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:15:21 (2444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:16:12 (3320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:28:01 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:28:52 (1872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:38 (1100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:02:29 (3632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Feb 2011 17:09:15 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 804,986 | 453,257 | 0.5631 |
01 Feb 2011 14:30:12 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 795,626 | 448,560 | 0.5638 |
01 Feb 2011 12:54:26 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 786,266 | 443,387 | 0.5639 |
01 Feb 2011 00:10:12 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 776,906 | 438,122 | 0.5639 |
31 Jan 2011 22:25:53 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 767,546 | 432,838 | 0.5639 |
31 Jan 2011 17:15:01 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 758,186 | 427,512 | 0.5639 |
31 Jan 2011 12:51:05 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 748,826 | 422,212 | 0.5638 |
30 Jan 2011 18:31:38 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 739,466 | 416,923 | 0.5638 |
30 Jan 2011 16:10:56 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 730,106 | 411,460 | 0.5636 |
30 Jan 2011 14:34:53 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 720,746 | 406,192 | 0.5636 |
30 Jan 2011 12:51:12 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 711,386 | 400,803 | 0.5634 |
30 Jan 2011 11:03:55 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 702,026 | 395,422 | 0.5633 |
29 Jan 2011 16:17:53 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 692,666 | 390,010 | 0.5631 |
29 Jan 2011 14:41:49 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 683,306 | 384,843 | 0.5632 |
29 Jan 2011 13:04:13 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 673,946 | 379,642 | 0.5633 |
28 Jan 2011 22:25:41 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 664,586 | 374,365 | 0.5633 |
28 Jan 2011 20:38:24 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 655,226 | 369,012 | 0.5632 |
28 Jan 2011 18:59:56 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 645,866 | 363,779 | 0.5632 |
28 Jan 2011 18:47:44 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 636,506 | 358,493 | 0.5632 |
28 Jan 2011 15:22:05 | 1066945 | 12498245 | famous_wsem_1999_200_007123573_0 | 627,146 | 353,055 | 0.5630 |
©2024 cpdn.org