Name | famous_unv0_1399_200_006663767_2 |
Workunit | 6867139 |
Created | 10 Jun 2010, 15:31:03 UTC |
Sent | 3 Jul 2010, 18:51:01 UTC |
Report deadline | 3 Oct 2010, 2:18:12 UTC |
Received | 17 Jul 2010, 20:03:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1085414 |
Run time | 2 days 16 hours 47 min 44 sec |
CPU time | 2 days 12 hours 31 min 2 sec |
Validate state | Invalid |
Credit | 525.07 |
Device peak FLOPS | 0.74 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8260, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Signal 3 received, exiting...
(15435): called boinc_finish
BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1
BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1
BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1
BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8044, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17274, selfPID=17274, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19840, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19874, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19874, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
(19874): called boinc_finish
</stderr_txt>
]]> |
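
The stderr text above is long and repetitive, so it can help to reduce it to counts per message type. The following is a minimal sketch, not a CPDN or BOINC tool: the helper name `tally_messages` is hypothetical, and `EXCERPT` holds only the final few messages copied from the log rather than the full text.

```python
import re
from collections import Counter

# Tail of the stderr text above, copied verbatim (assumption: a real run
# would load the complete stderr text instead of this short excerpt).
EXCERPT = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=19840, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=19874, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=19874, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( (19874): called boinc_finish"
)


def tally_messages(log: str) -> dict:
    """Count suspend requests, crash notices, and controller exits per PID."""
    # Capture the selfPID that follows each "Controller::" exit message.
    controller_pids = re.findall(
        r"Controller:: CPDN process is not running.*?selfPID=(\d+)", log
    )
    return {
        "suspends": log.count("Suspend request from BOINC"),
        "crashes": log.count("Model crash detected"),
        "controller_exits_by_pid": dict(Counter(controller_pids)),
        "gave_up": "too many model crashes" in log,
    }


if __name__ == "__main__":
    print(tally_messages(EXCERPT))
```

Run against the full log, the same function would show how many restart attempts each controller process made before the task gave up.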
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
16 Jul 2010 17:38:32 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 159,146 | 209,958 | 1.3193 |
16 Jul 2010 11:53:13 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 149,786 | 196,930 | 1.3147 |
15 Jul 2010 20:42:49 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 140,426 | 184,736 | 1.3155 |
15 Jul 2010 17:14:05 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 131,066 | 172,977 | 1.3198 |
15 Jul 2010 13:42:56 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 121,706 | 161,046 | 1.3232 |
15 Jul 2010 10:17:42 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 112,346 | 149,320 | 1.3291 |
15 Jul 2010 06:48:34 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 102,986 | 137,561 | 1.3357 |
14 Jul 2010 22:16:48 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 93,626 | 125,021 | 1.3353 |
14 Jul 2010 18:04:44 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 84,266 | 111,499 | 1.3232 |
14 Jul 2010 10:16:59 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 74,906 | 100,022 | 1.3353 |
14 Jul 2010 00:47:10 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 65,546 | 88,877 | 1.3559 |
12 Jul 2010 18:06:51 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 56,186 | 77,651 | 1.3820 |
12 Jul 2010 03:25:16 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 46,826 | 64,294 | 1.3730 |
11 Jul 2010 09:52:02 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 37,466 | 50,537 | 1.3489 |
05 Jul 2010 22:20:28 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 28,106 | 37,622 | 1.3386 |
05 Jul 2010 18:24:36 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 18,746 | 24,450 | 1.3043 |
05 Jul 2010 14:49:20 | 1085414 | 11569695 | famous_unv0_1399_200_006663767_2 | 9,386 | 12,234 | 1.3034 |
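
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. That relationship is inferred from the numbers in the table, not stated by the project. A minimal sketch that checks it against three rows copied from the table (the function name `seconds_per_timestep` is hypothetical):

```python
# (timestep, cpu_seconds, reported_average) copied from rows of the trickle table.
ROWS = [
    (159_146, 209_958, 1.3193),
    (149_786, 196_930, 1.3147),
    (9_386, 12_234, 1.3034),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded like the table."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in ROWS:
        computed = seconds_per_timestep(cpu_seconds, timestep)
        print(f"timestep={timestep} computed={computed} reported={reported}")
```

Because the figure is cumulative, a slow stretch early in the run keeps raising the average for later trickles even after the per-step cost has recovered.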