Name | famous_ubmv_599_200_006647922_1 |
Workunit | 6851294 |
Created | 10 Jun 2010, 13:12:04 UTC |
Sent | 12 Aug 2010, 4:30:06 UTC |
Report deadline | 11 Nov 2010, 11:57:17 UTC |
Received | 5 Sep 2010, 1:59:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 132 (0x00000084) Unknown error code |
Computer ID | 1085551 |
Run time | |
CPU time | 6 days 23 hours 54 min 1 sec |
Validate state | Invalid |
Credit | 3,335.30 |
Device peak FLOPS | 1.85 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.4.5</core_client_version> <![CDATA[ <message> process got signal 4 </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... (2998): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Signal 15 received, exiting... (75419): called boinc_finish Signal 15 received, exiting... (41496): called boinc_finish (41496): called boinc_finish Signal 15 received, exiting... (48887): called boinc_finish Signal 15 received, exiting... (82135): called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... (73568): called boinc_finish Signal 15 received, exiting... (2226): called boinc_finish SIGSEGV: segmentation violation Signal 15 received, exiting... Signal 15 received, exiting... Signal 15 received, exiting... (2224): called boinc_finish (34206): called boinc_finish (31491):SIGSEGV: segmentation violation Signal 15 received, exiting... (34594): called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... (62384): called boinc_finish (44758): called boinc_finish famous_um_6.11_i686-pc-linux-gnu: vfprintf.c:1611: _IO_vfprintf_internal: Assertion `(size_t) done <= (size_t) 2147483647' failed. (34276): called boinc_finish SIGSEGV: segmentation violation Signal 15 received, exiting... Signal 15 received, exiting... Signal 15 received, exiting... (12350): called boinc_finish (8012): called boinc_finish (8012): called boinc_finish Signal 15 received, exiting... (47711): called boinc_finish (30830): called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... (37876): called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... (38102): called boinc_finish SIGSEGV: segmentation violation Stack trace (2 frames): /var/db/boinc/projects/climateprediction.net/famous_um_6.11_i686-pc-linux-gnu(boinc_catch_signal+0x58)[0x83aa400] [0xffffefb7] Exiting... (25172): called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Signal 15 received, exiting... Signal 15 received, exiting... (4773): called boinc_finish (3000): called boinc_finish (6049): called boinc_finish (4771): called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... (20785): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (21335): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Sep 2010 12:13:47 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 1,010,906 | 634,118 | 0.6273 |
04 Sep 2010 01:36:02 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 1,001,546 | 619,053 | 0.6181 |
04 Sep 2010 01:36:02 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 992,186 | 610,706 | 0.6155 |
04 Sep 2010 01:36:02 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 982,826 | 604,289 | 0.6148 |
04 Sep 2010 01:36:02 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 973,466 | 597,958 | 0.6143 |
04 Sep 2010 01:36:02 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 964,106 | 582,597 | 0.6043 |
04 Sep 2010 01:36:01 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 954,746 | 575,921 | 0.6032 |
04 Sep 2010 01:36:01 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 945,386 | 604,180 | 0.6391 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 936,026 | 597,676 | 0.6385 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 926,666 | 591,118 | 0.6379 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 917,306 | 584,450 | 0.6371 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 907,946 | 577,724 | 0.6363 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 898,586 | 571,081 | 0.6355 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 889,226 | 556,374 | 0.6257 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 879,866 | 549,833 | 0.6249 |
01 Sep 2010 11:54:20 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 870,506 | 543,385 | 0.6242 |
01 Sep 2010 10:23:00 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 861,146 | 536,239 | 0.6227 |
31 Aug 2010 05:58:44 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 851,786 | 514,420 | 0.6039 |
31 Aug 2010 04:07:54 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 842,426 | 507,653 | 0.6026 |
31 Aug 2010 02:11:25 | 1085551 | 11490442 | famous_ubmv_599_200_006647922_1 | 833,066 | 500,908 | 0.6013 |
©2024 cpdn.org