Name | hadcm3n_001l_1900_40_007817204_1 |
Workunit | 7972313 |
Created | 28 Feb 2012, 15:41:58 UTC |
Sent | 28 Feb 2012, 18:35:24 UTC |
Report deadline | 30 May 2012, 2:02:35 UTC |
Received | 2 Apr 2012, 20:16:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 459222 |
Run time | 6 days 21 hours 14 min 24 sec |
CPU time | 6 days 19 hours 29 min 59 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 3.26 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:48:32 (2912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7396, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7396, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7396, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:35:18 (5900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Apr 2012 17:02:53 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 751,680 | 577,450 | 0.7682 |
01 Apr 2012 15:57:01 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 725,760 | 556,667 | 0.7670 |
01 Apr 2012 10:16:09 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 699,840 | 536,470 | 0.7666 |
31 Mar 2012 16:50:18 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 673,920 | 516,224 | 0.7660 |
31 Mar 2012 11:17:27 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 648,000 | 496,352 | 0.7660 |
30 Mar 2012 20:04:31 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 622,080 | 476,632 | 0.7662 |
26 Mar 2012 18:17:02 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 596,160 | 456,939 | 0.7665 |
25 Mar 2012 19:45:51 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 570,240 | 436,978 | 0.7663 |
25 Mar 2012 13:15:10 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 544,320 | 416,545 | 0.7653 |
24 Mar 2012 21:37:01 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 518,400 | 396,740 | 0.7653 |
23 Mar 2012 20:47:31 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 492,480 | 376,520 | 0.7645 |
22 Mar 2012 21:22:41 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 466,560 | 357,161 | 0.7655 |
22 Mar 2012 15:59:18 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 440,640 | 337,816 | 0.7666 |
20 Mar 2012 21:05:15 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 414,720 | 318,390 | 0.7677 |
18 Mar 2012 17:45:34 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 388,800 | 298,856 | 0.7687 |
15 Mar 2012 17:51:25 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 362,880 | 278,400 | 0.7672 |
13 Mar 2012 19:58:11 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 336,960 | 257,800 | 0.7651 |
12 Mar 2012 19:18:05 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 311,040 | 237,636 | 0.7640 |
11 Mar 2012 19:32:41 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 285,120 | 217,912 | 0.7643 |
11 Mar 2012 11:07:34 | 459222 | 14201921 | hadcm3n_001l_1900_40_007817204_1 | 259,200 | 198,374 | 0.7653 |
©2024 climateprediction.net