Name | famous_wtpq_1999_200_007113013_0 |
Workunit | 7311705 |
Created | 16 Jan 2011, 14:45:53 UTC |
Sent | 20 Jan 2011, 10:03:20 UTC |
Report deadline | 21 Apr 2011, 17:30:31 UTC |
Received | 8 Mar 2011, 10:09:15 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1072684 |
Run time | 13 days 5 hours 38 min 48 sec |
CPU time | 12 days 22 hours 47 min 56 sec |
Validate state | Workunit error - check skipped |
Credit | 6,176.41 |
Device peak FLOPS | 2.34 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:25:10 (2536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
20:15:06 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:56:47 (5300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:59:40 (2260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
20:11:46 (3092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
22:41:51 (5888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2128, iMonCtr=1
Model crash detected, will try to restart...
19:58:49 (3676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
19:01:35 (3472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:00:23 (2836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
18:24:38 (3424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:23:29 (5424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4356, iMonCtr=1
Model crash detected, will try to restart...
22:20:10 (4080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=1
Model crash detected, will try to restart...
00:59:53 (3232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3708, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
19:36:21 (3968): called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,872,026 | 1,118,862 | 0.5977 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,862,666 | 1,113,323 | 0.5977 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,853,306 | 1,107,752 | 0.5977 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,843,946 | 1,102,226 | 0.5978 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,834,586 | 1,096,675 | 0.5978 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,825,226 | 1,091,127 | 0.5978 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,815,866 | 1,085,580 | 0.5978 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,806,506 | 1,080,066 | 0.5979 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,797,146 | 1,074,535 | 0.5979 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,787,786 | 1,068,956 | 0.5979 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,778,426 | 1,063,465 | 0.5980 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,769,066 | 1,057,875 | 0.5980 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,759,706 | 1,052,288 | 0.5980 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,750,346 | 1,046,735 | 0.5980 |
08 Mar 2011 10:15:05 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,740,986 | 1,041,088 | 0.5980 |
08 Mar 2011 10:15:04 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,731,626 | 1,035,532 | 0.5980 |
08 Mar 2011 10:15:04 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,722,266 | 1,029,947 | 0.5980 |
08 Mar 2011 10:15:04 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,712,906 | 1,024,351 | 0.5980 |
08 Mar 2011 10:15:04 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,703,546 | 1,018,775 | 0.5980 |
08 Mar 2011 10:15:04 | 1072684 | 12487770 | famous_wtpq_1999_200_007113013_0 | 1,694,186 | 1,013,247 | 0.5981 |
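
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is inferred from the figures in the table, not from any project documentation. A minimal Python sketch of that arithmetic, using the first and last rows above (the helper names `parse_int` and `sec_per_timestep` are hypothetical, introduced only for illustration):

```python
def parse_int(text: str) -> int:
    """Convert a comma-grouped figure such as '1,872,026' to an int."""
    return int(text.replace(",", ""))


def sec_per_timestep(cpu_time: str, timestep: str) -> float:
    """Cumulative CPU seconds divided by timesteps completed (assumed definition)."""
    return parse_int(cpu_time) / parse_int(timestep)


# (Timestep, CPU Time (sec)) taken from the first and last trickle rows.
rows = [
    ("1,872,026", "1,118,862"),
    ("1,694,186", "1,013,247"),
]

for timestep, cpu_time in rows:
    # Round to four decimals to match the precision shown in the table.
    print(timestep, round(sec_per_timestep(cpu_time, timestep), 4))
```

Under that assumption the two rows reproduce the tabulated averages of 0.5977 and 0.5981.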