Name | hadcm3n_o1nw_2060_40_008242109_0 |
Workunit | 8397233 |
Created | 29 Oct 2012, 17:50:47 UTC |
Sent | 29 Oct 2012, 17:50:51 UTC |
Report deadline | 29 Jan 2013, 1:18:02 UTC |
Received | 10 Nov 2012, 13:01:28 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1164880 |
Run time | 11 days 18 hours 14 min 5 sec |
CPU time | 9 days 13 hours 54 min 56 sec |
Validate state | Invalid |
Credit | 4,354.56 |
Device peak FLOPS | 2.29 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3068, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Nov 2012 11:31:16 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 362,880 | 824,170 | 2.2712 |
09 Nov 2012 15:09:56 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 336,960 | 765,608 | 2.2721 |
08 Nov 2012 18:45:23 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 311,040 | 707,050 | 2.2732 |
07 Nov 2012 22:58:05 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 285,120 | 647,849 | 2.2722 |
07 Nov 2012 03:01:56 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 259,200 | 587,622 | 2.2671 |
06 Nov 2012 07:40:31 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 233,280 | 529,261 | 2.2688 |
05 Nov 2012 12:18:40 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 207,360 | 470,602 | 2.2695 |
04 Nov 2012 16:46:42 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 181,440 | 412,375 | 2.2728 |
03 Nov 2012 20:02:25 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 155,520 | 353,251 | 2.2714 |
02 Nov 2012 23:23:27 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 129,600 | 294,451 | 2.2720 |
02 Nov 2012 02:49:08 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 103,680 | 236,477 | 2.2808 |
01 Nov 2012 06:46:12 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 77,760 | 178,542 | 2.2961 |
31 Oct 2012 11:24:23 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 51,840 | 120,345 | 2.3215 |
30 Oct 2012 16:07:23 | 1164880 | 15417203 | hadcm3n_o1nw_2060_40_008242109_0 | 25,920 | 60,541 | 2.3357 |
©2024 cpdn.org