Name | famous_unv2_1799_200_006663769_4 |
Workunit | 6867141 |
Created | 10 Jun 2010, 15:31:03 UTC |
Sent | 3 Jul 2010, 18:43:18 UTC |
Report deadline | 3 Oct 2010, 2:10:29 UTC |
Received | 10 Jul 2010, 12:18:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1085414 |
Run time | 6 days 0 hours 29 min 4 sec |
CPU time | 5 days 15 hours 1 min 29 sec |
Validate state | Invalid |
Credit | 1,142.71 |
Device peak FLOPS | 1.05 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 3 received, exiting... (1400): called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2279, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( (2279): called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
10 Jul 2010 05:34:43 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 346,346 | 472,526 | 1.3643 |
10 Jul 2010 02:22:13 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 336,986 | 459,732 | 1.3642 |
09 Jul 2010 21:56:31 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 327,626 | 446,756 | 1.3636 |
09 Jul 2010 17:48:39 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 318,266 | 433,132 | 1.3609 |
09 Jul 2010 13:32:26 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 308,906 | 419,318 | 1.3574 |
09 Jul 2010 09:36:19 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 299,546 | 405,946 | 1.3552 |
09 Jul 2010 05:37:24 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 290,186 | 392,874 | 1.3539 |
09 Jul 2010 02:29:49 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 280,826 | 380,083 | 1.3534 |
08 Jul 2010 21:54:28 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 271,466 | 366,936 | 1.3517 |
08 Jul 2010 17:39:01 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 262,106 | 352,923 | 1.3465 |
08 Jul 2010 13:05:37 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 252,746 | 339,009 | 1.3413 |
08 Jul 2010 09:01:43 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 243,386 | 325,718 | 1.3383 |
08 Jul 2010 02:21:58 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 234,026 | 312,718 | 1.3363 |
07 Jul 2010 18:22:47 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 224,666 | 299,319 | 1.3323 |
07 Jul 2010 09:29:54 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 215,306 | 286,202 | 1.3293 |
07 Jul 2010 02:51:57 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 205,946 | 273,342 | 1.3273 |
06 Jul 2010 22:52:11 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 196,586 | 260,020 | 1.3227 |
06 Jul 2010 19:01:44 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 187,226 | 247,387 | 1.3213 |
06 Jul 2010 15:19:56 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 177,866 | 234,798 | 1.3201 |
06 Jul 2010 11:39:38 | 1085414 | 11569707 | famous_unv2_1799_200_006663769_4 | 168,506 | 222,271 | 1.3191 |
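
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The sketch below checks that assumption against three rows of the table; the function name `sec_per_timestep` is hypothetical and not part of any CPDN or BOINC tooling.

```python
def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals.

    Assumes the table's Average column is simply CPU time / timestep.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table
rows = [
    (346_346, 472_526, 1.3643),
    (336_986, 459_732, 1.3642),
    (168_506, 222_271, 1.3191),
]

for timestep, cpu_time, reported in rows:
    computed = sec_per_timestep(cpu_time, timestep)
    print(f"{timestep:>7} TS: computed {computed:.4f}, reported {reported:.4f}")
```

If the assumption holds, the computed and reported values agree for each row, and the slowly rising average indicates that later timesteps cost slightly more CPU time than earlier ones.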
©2024 cpdn.org