Name | hadcm3n_xb9d_1980_40_009466045_0 |
Workunit | 9548279 |
Created | 15 Jan 2015, 16:41:04 UTC |
Sent | 18 Jan 2015, 0:31:00 UTC |
Report deadline | 19 Apr 2015, 7:58:11 UTC |
Received | 29 Jan 2015, 8:52:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1292645 |
Run time | 10 days 11 hours 22 min 15 sec |
CPU time | 9 days 1 hours 13 min 56 sec |
Validate state | Invalid |
Credit | 6,842.88 |
Device peak FLOPS | 2.90 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.4.36</core_client_version> <![CDATA[ <message> Za - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:51:22 (5372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:52:30 (5720): No heartbeat from core client for 30 sec - exiting 10:52:31 (5720): No heartbeat from core client for 30 sec - exiting 10:52:32 (5720): No heartbeat from core client for 30 sec - exiting 10:52:33 (5720): No heartbeat from core client for 30 sec - exiting 10:52:34 (5720): No heartbeat from core client for 30 sec - exiting 10:52:35 (5720): No heartbeat from core client for 30 sec - exiting 10:52:36 (5720): No heartbeat from core client for 30 sec - exiting 10:52:37 (5720): No heartbeat from core client for 30 sec - exiting 10:52:38 (5720): No heartbeat from core client for 30 sec - exiting 10:52:39 (5720): No heartbeat from core client for 30 sec - exiting 10:52:40 (5720): No heartbeat from core client for 30 sec - exiting 10:52:41 (5720): No heartbeat from core client for 30 sec - exiting 10:52:42 (5720): No heartbeat from core client for 30 sec - exiting 10:52:43 (5720): No heartbeat from core client for 30 sec - exiting 10:52:44 (5720): No heartbeat from core client for 30 sec - exiting 10:52:45 (5720): No heartbeat from core client for 30 sec - exiting 10:52:46 (5720): No heartbeat from core client for 30 sec - exiting 10:52:47 (5720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:42 (1128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:43 (1128): No heartbeat from core client for 30 sec - exiting 11:00:06 (4580): No heartbeat from core client for 30 sec - exiting 11:00:07 (4580): No heartbeat from core client for 30 sec - exiting 11:00:08 (4580): No heartbeat from core client for 30 sec - exiting 11:00:09 (4580): No heartbeat from core client for 30 sec - exiting 11:00:10 (4580): No heartbeat from core client for 30 sec - exiting 11:00:11 (4580): No heartbeat from core client for 30 sec - exiting 11:00:12 (4580): No heartbeat from core client for 30 sec - exiting 11:00:13 (4580): No heartbeat from core client for 30 sec - exiting 11:00:14 (4580): No heartbeat from core client for 30 sec - exiting 11:00:15 (4580): No heartbeat from core client for 30 sec - exiting 11:00:16 (4580): No heartbeat from core client for 30 sec - exiting 11:00:17 (4580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:40 (244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:42 (244): No heartbeat from core client for 30 sec - exiting 14:13:30 (5596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:32 (5596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 10:56:21 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:56:22 (5960): No heartbeat from core client for 30 sec - exiting 10:56:23 (5960): No heartbeat from core client for 30 sec - exiting 10:56:24 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:45:46 (2000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:45:47 (2000): No heartbeat from core client for 30 sec - exiting 12:15:56 (4848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:15:57 (4848): No heartbeat from core client for 30 sec - exiting 12:17:18 (1696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:17:19 (1696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:04:37 (1956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:04:38 (1956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:19:33 (2400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:34 (2400): No heartbeat from core client for 30 sec - exiting 10:19:35 (2400): No heartbeat from core client for 30 sec - exiting 11:12:59 (1776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:13:43 (4136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:38:08 (3388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:38:10 (3388): No heartbeat from core client for 30 sec - exiting 12:39:14 (5464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:53:33 (5624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:53:35 (5624): No heartbeat from core client for 30 sec - exiting 08:55:52 (5616): No heartbeat from core client for 30 sec - exiting 08:55:53 (5616): No heartbeat from core client for 30 sec - exiting 08:55:54 (5616): No heartbeat from core client for 30 sec - exiting 08:55:55 (5616): No heartbeat from core client for 30 sec - exiting 08:55:56 (5616): No heartbeat from core client for 30 sec - exiting 08:55:57 (5616): No heartbeat from core client for 30 sec - exiting 08:55:58 (5616): No heartbeat from core client for 30 sec - exiting 08:55:59 (5616): No heartbeat from core client for 30 sec - exiting 08:56:00 (5616): No heartbeat from core client for 30 sec - exiting 08:56:01 (5616): No heartbeat from core client for 30 sec - exiting 08:56:02 (5616): No heartbeat from core client for 30 sec - exiting 08:56:03 (5616): No heartbeat from core client for 30 sec - exiting 08:56:04 (5616): No heartbeat from core client for 30 sec - exiting 08:56:05 (5616): No heartbeat from core client for 30 sec - exiting 08:56:06 (5616): No heartbeat from core client for 30 sec - exiting 08:56:07 (5616): No heartbeat from core client for 30 sec - exiting 08:56:08 (5616): No heartbeat from core client for 30 sec - exiting 08:56:09 (5616): No heartbeat from core client for 30 sec - exiting 08:56:10 (5616): No heartbeat from core client for 30 sec - exiting 08:56:11 (5616): No heartbeat from core client for 30 sec - exiting 08:56:12 (5616): No heartbeat from core client for 30 sec - exiting 08:56:13 (5616): No heartbeat from core client for 30 sec - exiting 08:56:14 (5616): No heartbeat from core client for 30 sec - exiting 08:56:15 (5616): No heartbeat from core client for 30 sec - exiting 08:56:16 (5616): No heartbeat from core client for 30 sec - exiting 08:56:17 (5616): No heartbeat from core client for 30 sec - exiting 08:56:18 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Jan 2015 05:46:20 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 570,240 | 752,537 | 1.3197 |
27 Jan 2015 09:01:22 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 544,320 | 718,224 | 1.3195 |
26 Jan 2015 23:16:01 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 518,400 | 684,896 | 1.3212 |
26 Jan 2015 13:08:40 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 492,480 | 650,682 | 1.3212 |
26 Jan 2015 01:19:35 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 466,560 | 616,587 | 1.3216 |
25 Jan 2015 15:37:00 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 440,640 | 583,057 | 1.3232 |
25 Jan 2015 06:08:42 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 414,720 | 549,668 | 1.3254 |
24 Jan 2015 20:32:27 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 388,800 | 515,797 | 1.3266 |
24 Jan 2015 10:59:25 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 362,880 | 482,469 | 1.3296 |
24 Jan 2015 01:37:06 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 336,960 | 449,373 | 1.3336 |
23 Jan 2015 15:49:10 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 311,040 | 415,254 | 1.3351 |
23 Jan 2015 02:59:05 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 285,120 | 380,524 | 1.3346 |
22 Jan 2015 17:22:04 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 259,200 | 347,055 | 1.3389 |
22 Jan 2015 04:52:33 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 233,280 | 309,985 | 1.3288 |
21 Jan 2015 13:26:26 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 207,360 | 274,151 | 1.3221 |
21 Jan 2015 03:08:29 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 181,440 | 240,614 | 1.3261 |
20 Jan 2015 17:33:37 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 155,520 | 207,244 | 1.3326 |
20 Jan 2015 06:35:40 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 129,600 | 171,982 | 1.3270 |
19 Jan 2015 21:01:52 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 103,680 | 138,411 | 1.3350 |
19 Jan 2015 09:48:54 | 1292645 | 17796227 | hadcm3n_xb9d_1980_40_009466045_0 | 77,760 | 103,092 | 1.3258 |
©2024 cpdn.org