Name | hadcm3n_t4rg_1980_40_007449767_3 |
Workunit | 7647270 |
Created | 29 Sep 2011, 15:10:31 UTC |
Sent | 29 Sep 2011, 15:10:41 UTC |
Report deadline | 29 Dec 2011, 22:37:52 UTC |
Received | 25 Oct 2011, 14:56:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 550320 |
Run time | 11 days 10 hours 43 min 15 sec |
CPU time | 11 days 5 hours 54 min 16 sec |
Validate state | Invalid |
Credit | 10,886.40 |
Device peak FLOPS | 3.58 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:50:23 (8124): Can't acquire lockfile (32) - waiting 35s 22:50:42 (6472): Can't acquire lockfile (32) - waiting 35s 22:50:46 (9396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6048, selfPID=6048, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 907,200 | 962,553 | 1.0610 |
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 881,280 | 935,199 | 1.0612 |
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 855,360 | 907,846 | 1.0614 |
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 829,440 | 880,751 | 1.0619 |
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 803,520 | 853,360 | 1.0620 |
31 Oct 2011 13:12:32 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 777,600 | 825,304 | 1.0613 |
31 Oct 2011 13:12:31 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 751,680 | 797,128 | 1.0605 |
31 Oct 2011 13:12:31 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 725,760 | 768,919 | 1.0595 |
31 Oct 2011 13:12:31 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 699,840 | 740,806 | 1.0585 |
31 Oct 2011 13:12:30 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 673,920 | 713,299 | 1.0584 |
31 Oct 2011 13:12:29 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 648,000 | 686,051 | 1.0587 |
31 Oct 2011 13:12:29 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 622,080 | 658,930 | 1.0592 |
31 Oct 2011 13:12:29 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 596,160 | 631,567 | 1.0594 |
31 Oct 2011 13:12:29 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 570,240 | 604,159 | 1.0595 |
31 Oct 2011 13:12:29 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 544,320 | 576,519 | 1.0592 |
31 Oct 2011 13:12:28 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 518,400 | 548,891 | 1.0588 |
19 Oct 2011 03:50:19 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 492,480 | 521,303 | 1.0585 |
18 Oct 2011 18:56:19 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 466,560 | 493,804 | 1.0584 |
18 Oct 2011 11:15:02 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 440,640 | 466,417 | 1.0585 |
18 Oct 2011 02:35:31 | 550320 | 13449562 | hadcm3n_t4rg_1980_40_007449767_3 | 414,720 | 438,934 | 1.0584 |
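
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the three most recent rows of the table. The function names `avg_sec_per_ts` and `incremental_sec_per_ts` are illustrative and not part of any CPDN or BOINC tool, and the formula is an assumption inferred from the figures rather than documented server behaviour.

```python
# (timestep, cumulative CPU seconds) copied from the three most recent trickle rows.
rows = [
    (907_200, 962_553),
    (881_280, 935_199),
    (855_360, 907_846),
]


def avg_sec_per_ts(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds per timestep (assumed meaning of 'Average')."""
    return round(cpu_seconds / timestep, 4)


def incremental_sec_per_ts(newer: tuple, older: tuple) -> float:
    """CPU seconds per timestep between two consecutive trickles."""
    d_ts = newer[0] - older[0]
    d_cpu = newer[1] - older[1]
    return round(d_cpu / d_ts, 4)


averages = [avg_sec_per_ts(ts, cpu) for ts, cpu in rows]
latest_interval = incremental_sec_per_ts(rows[0], rows[1])

print(averages)         # cumulative averages, to compare against the table column
print(latest_interval)  # rate over the final 25,920-timestep interval
```

Under that assumption the computed values agree with the table to four decimal places. The incremental figure is only a rough per-interval rate, since trickles report cumulative totals.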