Name | famous_ugec_799_200_006654095_6 |
Workunit | 6857467 |
Created | 23 Aug 2010, 8:17:55 UTC |
Sent | 23 Aug 2010, 8:33:07 UTC |
Report deadline | 22 Nov 2010, 16:00:18 UTC |
Received | 28 Aug 2010, 16:51:00 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 11 (0x0000000B) Unknown error code |
Computer ID | 1095731 |
Run time | 4 days 6 hours 25 min 34 sec |
CPU time | 3 days 22 hours 28 min 9 sec |
Validate state | Invalid |
Credit | 1,328.00 |
Device peak FLOPS | 1.11 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process got signal 11
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1152, iMonCtr=1
Model crash detected, will try to restart...
(1152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11861): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11865): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11869): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11874): No heartbeat from core client for 30 sec - exiting
(11874): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11897, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
(11897): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(11897): No heartbeat from core client for 30 sec - exiting
(11897): No heartbeat from core client for 30 sec - exiting
(11953): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11957, iMonCtr=1
Model crash detected, will try to restart...
(11957): No heartbeat from core client for 30 sec - exiting
(11957): No heartbeat from core client for 30 sec - exiting
(11957): No heartbeat from core client for 30 sec - exiting
(11957): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11990, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11990, iMonCtr=1
Model crash detected, will try to restart...
(11990): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(12031): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12039, iMonCtr=1
Model crash detected, will try to restart...
(12039): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
(12039): No heartbeat from core client for 30 sec - exiting
</stderr_txt>
]]> |
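The exit status reported above (11, shown as 0x0000000B) agrees with the "process got signal 11" message in the stderr output. On Linux, the platform this application build targets, signal 11 is SIGSEGV (segmentation fault). Reading the exit status as a signal number is an inference from those two fields matching, not something the page states. A minimal Python sketch of the lookup, assuming it runs on a POSIX host where the standard `signal` module uses the same numbering:

```python
import signal

exit_status = 11  # value from the "Exit status" row

# Hex form, zero-padded to 8 digits as on the result page
hex_form = f"0x{exit_status:08X}"

# Name of the signal with that number on the current platform
# (assumption: POSIX numbering, as on the i686-pc-linux-gnu build)
signal_name = signal.Signals(exit_status).name

print(exit_status, hex_form, signal_name)
```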
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
---|---|---|---|---|---|---
28 Aug 2010 03:33:11 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 402,506 | 333,567 | 0.8287 |
28 Aug 2010 01:55:01 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 393,146 | 325,744 | 0.8286 |
27 Aug 2010 22:38:13 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 383,786 | 317,921 | 0.8284 |
27 Aug 2010 20:07:50 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 374,426 | 310,091 | 0.8282 |
27 Aug 2010 17:43:11 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 365,066 | 302,268 | 0.8280 |
27 Aug 2010 15:11:42 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 355,706 | 294,445 | 0.8278 |
27 Aug 2010 12:44:49 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 346,346 | 286,610 | 0.8275 |
27 Aug 2010 10:14:10 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 336,986 | 278,760 | 0.8272 |
27 Aug 2010 07:28:38 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 327,626 | 270,715 | 0.8263 |
27 Aug 2010 05:16:15 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 318,266 | 263,004 | 0.8264 |
27 Aug 2010 03:05:45 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 308,906 | 255,304 | 0.8265 |
27 Aug 2010 02:14:14 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 299,546 | 247,609 | 0.8266 |
26 Aug 2010 22:43:42 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 290,186 | 239,908 | 0.8267 |
26 Aug 2010 20:35:38 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 280,826 | 232,205 | 0.8269 |
26 Aug 2010 18:24:54 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 271,466 | 224,507 | 0.8270 |
26 Aug 2010 16:10:35 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 262,106 | 216,807 | 0.8272 |
26 Aug 2010 16:05:13 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 252,746 | 209,109 | 0.8273 |
26 Aug 2010 11:54:32 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 243,386 | 201,422 | 0.8276 |
26 Aug 2010 09:43:01 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 234,026 | 193,730 | 0.8278 |
26 Aug 2010 07:29:50 | 1095731 | 11671173 | famous_ugec_799_200_006654095_6 | 224,666 | 186,027 | 0.8280 |
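
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That reading is inferred from the numbers in the table rather than documented on the page. A small Python sketch checking it against the three most recent trickles, with the values copied from the rows above and the helper name `sec_per_timestep` chosen here for illustration:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (402506, 333567, 0.8287),
    (393146, 325744, 0.8286),
    (383786, 317921, 0.8284),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded as on the page."""
    return round(cpu_time_sec / timestep, 4)

# Compare the computed ratio with the value the page reports
for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(timestep, computed, reported)

# Timesteps elapsed between consecutive trickles (newest first)
steps_between = [newer[0] - older[0] for newer, older in zip(trickles, trickles[1:])]
print(steps_between)
```

For these rows the computed ratios agree with the reported averages to four decimal places, and consecutive trickles are 9,360 timesteps apart.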