Name | famous_ubtv_1199_200_006719220_5 |
Workunit | 6922473 |
Created | 12 Sep 2010, 8:43:45 UTC |
Sent | 12 Sep 2010, 10:59:00 UTC |
Report deadline | 12 Dec 2010, 18:26:11 UTC |
Received | 13 Sep 2010, 5:37:57 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 904604 |
Run time | |
CPU time | 3 hours 12 min 7 sec |
Validate state | Invalid |
Credit | 216.26 |
Device peak FLOPS | 2.16 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-apple-darwin |
Stderr | <core_client_version>6.2.18</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> (16434): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16434): No heartbeat from core client for 30 sec - exiting (16434): No heartbeat from core client for 30 sec - exiting (16434): No heartbeat from core client for 30 sec - exiting (16434): No heartbeat from core client for 30 sec - exiting (16434): No heartbeat from core client for 30 sec - exiting (16585): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... (16816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16816): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16896): No heartbeat from core client for 30 sec - exiting (16918): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16934): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16934): No heartbeat from core client for 30 sec - exiting (16957): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16985): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16985): No heartbeat from core client for 30 sec - exiting (16985): No heartbeat from core client for 30 sec - exiting (16985): No heartbeat from core client for 30 sec - exiting (17120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17389): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17389): No heartbeat from core client for 30 sec - exiting (17399): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17399): No heartbeat from core client for 30 sec - exiting (17427): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (17464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17464): No heartbeat from core client for 30 sec - exiting (17490): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17511): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17538): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17538): No heartbeat from core client for 30 sec - exiting (17538): No heartbeat from core client for 30 sec - exiting (17538): No heartbeat from core client for 30 sec - exiting (17538): No heartbeat from core client for 30 sec - exiting (17761): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17812): No heartbeat from core client for 30 sec - exiting (17812): No heartbeat from core client for 30 sec - exiting (17812): No heartbeat from core client for 30 sec - exiting (17812): No heartbeat from core client for 30 sec - exiting (17992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17992): No heartbeat from core client for 30 sec - exiting (18074): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (18074): No heartbeat from core client for 30 sec - exiting (18074): No heartbeat from core client for 30 sec - exiting (18074): No heartbeat from core client for 30 sec - exiting (18082): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (23030): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy Sorry, too many model crashes! :-( (23155): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Sep 2010 05:32:57 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 65,546 | 11,189 | 0.1707 |
12 Sep 2010 23:54:25 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 56,186 | 9,501 | 0.1691 |
12 Sep 2010 23:54:25 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 46,826 | 7,879 | 0.1683 |
12 Sep 2010 15:05:26 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 37,466 | 6,389 | 0.1705 |
12 Sep 2010 14:06:48 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 28,106 | 4,671 | 0.1662 |
12 Sep 2010 13:12:58 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 18,746 | 3,115 | 0.1662 |
12 Sep 2010 12:14:36 | 904604 | 11882426 | famous_ubtv_1199_200_006719220_5 | 9,386 | 1,875 | 0.1998 |
©2024 cpdn.org