Name | famous_v82d_1999_200_006696219_2 |
Workunit | 6899472 |
Created | 26 Aug 2010, 16:26:05 UTC |
Sent | 10 Dec 2010, 11:45:56 UTC |
Report deadline | 11 Mar 2011, 19:13:07 UTC |
Received | 24 Jul 2011, 17:57:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1055955 |
Run time | 14 days 20 hours 21 min 45 sec |
CPU time | 13 days 17 hours 24 min 5 sec |
Validate state | Invalid |
Credit | 4,385.28 |
Device peak FLOPS | 1.07 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 14:37:47 (3220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2176, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2804, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3244, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2488, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1012, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3096, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 09:32:34 (2308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=1 Model crash detected, will try to restart... 14:12:34 (2208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=1 Model crash detected, will try to restart... 12:08:48 (4016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:30:55 (3200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1132, iMonCtr=1 Model crash detected, will try to restart... 08:35:33 (2848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Jul 2011 16:51:44 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,329,146 | 1,180,644 | 0.8883 |
26 Jul 2011 16:51:43 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,319,786 | 1,172,220 | 0.8882 |
26 Jul 2011 16:51:43 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,310,426 | 1,163,988 | 0.8883 |
26 Jul 2011 16:51:43 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,301,066 | 1,155,807 | 0.8884 |
25 Jul 2011 19:41:25 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,291,706 | 1,147,512 | 0.8884 |
25 Jul 2011 18:01:45 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,282,346 | 1,139,167 | 0.8883 |
25 Jul 2011 17:15:48 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,272,986 | 1,130,796 | 0.8883 |
25 Jul 2011 17:15:48 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,263,626 | 1,122,497 | 0.8883 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,254,266 | 1,114,243 | 0.8884 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,244,906 | 1,106,080 | 0.8885 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,235,546 | 1,097,894 | 0.8886 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,226,186 | 1,089,626 | 0.8886 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,216,826 | 1,081,353 | 0.8887 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,207,466 | 1,073,108 | 0.8887 |
25 Jul 2011 17:15:47 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,198,106 | 1,064,836 | 0.8888 |
08 Jul 2011 14:29:19 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,188,746 | 1,056,517 | 0.8888 |
08 Jul 2011 05:17:25 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,179,386 | 1,048,247 | 0.8888 |
08 Jul 2011 05:17:25 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,170,026 | 1,040,090 | 0.8889 |
08 Jul 2011 05:17:25 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,160,666 | 1,031,807 | 0.8890 |
30 Jun 2011 17:37:54 | 1055955 | 11740866 | famous_v82d_1999_200_006696219_2 | 1,151,306 | 1,023,553 | 0.8890 |
©2024 cpdn.org