Name | hadcm3n_zff5_1960_40_008335834_0 |
Workunit | 8486695 |
Created | 23 Mar 2013, 2:59:15 UTC |
Sent | 23 Mar 2013, 3:05:14 UTC |
Report deadline | 22 Jun 2013, 10:32:25 UTC |
Received | 10 Apr 2013, 0:46:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1045292 |
Run time | 13 days 16 hours 25 min 56 sec |
CPU time | 13 days 11 hours 59 min 22 sec |
Validate state | Invalid |
Credit | 8,398.08 |
Device peak FLOPS | 2.46 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 03:21:23 (44840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Apr 2013 05:33:52 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 699,840 | 1,151,758 | 1.6457 |
08 Apr 2013 18:15:56 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 673,920 | 1,109,251 | 1.6460 |
08 Apr 2013 05:40:57 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 648,000 | 1,067,182 | 1.6469 |
07 Apr 2013 18:17:55 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 622,080 | 1,025,062 | 1.6478 |
07 Apr 2013 04:56:39 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 596,160 | 981,371 | 1.6462 |
06 Apr 2013 16:46:23 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 570,240 | 938,139 | 1.6452 |
06 Apr 2013 04:31:56 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 544,320 | 894,814 | 1.6439 |
05 Apr 2013 16:01:34 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 518,400 | 850,987 | 1.6416 |
05 Apr 2013 03:37:41 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 492,480 | 807,316 | 1.6393 |
04 Apr 2013 15:25:40 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 466,560 | 764,185 | 1.6379 |
04 Apr 2013 03:28:39 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 440,640 | 721,402 | 1.6372 |
03 Apr 2013 15:15:40 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 414,720 | 678,308 | 1.6356 |
03 Apr 2013 03:18:22 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 388,800 | 635,667 | 1.6349 |
02 Apr 2013 15:05:42 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 362,880 | 592,089 | 1.6316 |
02 Apr 2013 02:51:25 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 336,960 | 548,356 | 1.6274 |
01 Apr 2013 15:15:47 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 311,040 | 505,255 | 1.6244 |
01 Apr 2013 02:52:23 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 285,120 | 462,476 | 1.6220 |
31 Mar 2013 14:55:33 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 259,200 | 419,734 | 1.6193 |
31 Mar 2013 02:42:33 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 233,280 | 377,234 | 1.6171 |
30 Mar 2013 15:00:31 | 1045292 | 15679336 | hadcm3n_zff5_1960_40_008335834_0 | 207,360 | 335,356 | 1.6173 |
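
The page does not say how the Average (sec/TS) column is derived. The values are consistent with cumulative CPU time divided by the timestep reached, rounded to four decimals. The sketch below checks that assumption against three rows copied from the table; the function name `sec_per_timestep` is hypothetical and not part of BOINC or CPDN.

```python
# Assumption: Average (sec/TS) = cumulative CPU time (sec) / timestep,
# rounded to 4 decimals. This is inferred from the table, not documented.

def sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Return average CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_seconds / timestep, 4)

# (timestep, CPU time in sec, reported average) copied from the trickle table
rows = [
    (699_840, 1_151_758, 1.6457),  # 09 Apr 2013 05:33:52
    (673_920, 1_109_251, 1.6460),  # 08 Apr 2013 18:15:56
    (207_360, 335_356, 1.6173),    # 30 Mar 2013 15:00:31
]

for timestep, cpu_seconds, reported in rows:
    computed = sec_per_timestep(cpu_seconds, timestep)
    print(f"timestep={timestep:>7,}  computed={computed:.4f}  reported={reported:.4f}")
```

Consecutive rows also differ by 25,920 timesteps (for example 699,840 minus 673,920), so each trickle in this table covers a fixed block of model steps.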
©2024 climateprediction.net