Name | famous_wtfd_1499_200_007115056_1 |
Workunit | 7312145 |
Created | 21 Apr 2011, 14:17:46 UTC |
Sent | 21 Apr 2011, 14:17:52 UTC |
Report deadline | 21 Jul 2011, 21:45:03 UTC |
Received | 25 May 2011, 6:22:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 912695 |
Run time | 4 days 13 hours 31 min 25 sec |
CPU time | 4 days 5 hours 4 min 19 sec |
Validate state | Invalid |
Credit | 2,100.04 |
Device peak FLOPS | 2.08 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:47:53 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:47:54 (3700): No heartbeat from core client for 30 sec - exiting 11:47:55 (3700): No heartbeat from core client for 30 sec - exiting 11:47:56 (3700): No heartbeat from core client for 30 sec - exiting 11:47:57 (3700): No heartbeat from core client for 30 sec - exiting 11:47:58 (3700): No heartbeat from core client for 30 sec - exiting 11:47:59 (3700): No heartbeat from core client for 30 sec - exiting 11:48:00 (3700): No heartbeat from core client for 30 sec - exiting 11:48:01 (3700): No heartbeat from core client for 30 sec - exiting 11:48:02 (3700): No heartbeat from core client for 30 sec - exiting 11:48:03 (3700): No heartbeat from core client for 30 sec - exiting 11:48:04 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2940, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:24:26 (5572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4628, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:07:49 (5848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:08:22 (3088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2696, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 09:11:29 (4540): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 May 2011 14:13:32 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 636,506 | 360,602 | 0.5665 |
20 May 2011 12:37:13 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 627,146 | 355,235 | 0.5664 |
20 May 2011 10:25:27 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 617,786 | 349,900 | 0.5664 |
20 May 2011 08:49:08 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 608,426 | 344,575 | 0.5663 |
20 May 2011 07:12:29 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 599,066 | 339,214 | 0.5662 |
19 May 2011 13:57:48 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 589,706 | 333,854 | 0.5661 |
19 May 2011 12:21:40 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 580,346 | 328,498 | 0.5660 |
19 May 2011 10:30:05 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 570,986 | 323,020 | 0.5657 |
19 May 2011 08:58:56 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 561,626 | 317,678 | 0.5656 |
19 May 2011 07:12:22 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 552,266 | 312,464 | 0.5658 |
18 May 2011 14:22:52 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 542,906 | 307,253 | 0.5659 |
18 May 2011 12:46:42 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 533,546 | 301,955 | 0.5659 |
18 May 2011 11:00:42 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 524,186 | 296,640 | 0.5659 |
17 May 2011 13:58:08 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 514,826 | 291,319 | 0.5659 |
17 May 2011 12:27:02 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 505,466 | 286,119 | 0.5660 |
17 May 2011 10:45:56 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 496,106 | 280,792 | 0.5660 |
17 May 2011 09:14:54 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 486,746 | 275,534 | 0.5661 |
17 May 2011 07:44:00 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 477,386 | 270,257 | 0.5661 |
16 May 2011 14:26:03 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 468,026 | 264,837 | 0.5659 |
16 May 2011 12:54:55 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 458,666 | 259,460 | 0.5657 |
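
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against rows copied from the table. The function names `parse_trickle_row` and `avg_sec_per_timestep` are hypothetical helpers written for this illustration and are not part of BOINC or the CPDN site.

```python
def parse_trickle_row(row: str) -> dict:
    """Split one pipe-delimited trickle row into named fields.

    Assumes the seven-column layout shown in the table above.
    """
    fields = [f.strip() for f in row.split("|") if f.strip()]
    return {
        "time_sent": fields[0],
        "host_id": int(fields[1]),
        "result_id": int(fields[2]),
        "result_name": fields[3],
        # Thousands separators are removed before conversion.
        "timestep": int(fields[4].replace(",", "")),
        "cpu_time_sec": int(fields[5].replace(",", "")),
        "average": float(fields[6]),
    }


def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


rows = [
    "20 May 2011 14:13:32 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 636,506 | 360,602 | 0.5665 |",
    "20 May 2011 12:37:13 | 912695 | 12805579 | famous_wtfd_1499_200_007115056_1 | 627,146 | 355,235 | 0.5664 |",
]

for row in rows:
    t = parse_trickle_row(row)
    computed = avg_sec_per_timestep(t["cpu_time_sec"], t["timestep"])
    print(t["time_sent"], computed, "reported:", t["average"])
```

In the rows listed, consecutive trickles differ by 9,360 timesteps, so the same parser can also be used to confirm that no trickle is missing from the sequence.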
©2024 climateprediction.net