Name | hadcm3n_oeg7_1900_40_008473850_0 |
Workunit | 8624689 |
Created | 27 Sep 2013, 10:24:34 UTC |
Sent | 28 Sep 2013, 15:28:57 UTC |
Report deadline | 28 Dec 2013, 22:56:08 UTC |
Received | 2 Nov 2013, 15:03:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1273479 |
Run time | 12 days 12 hours 23 min 18 sec |
CPU time | 10 days 7 hours 2 min 29 sec |
Validate state | Invalid |
Credit | 7,153.92 |
Device peak FLOPS | 3.14 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:06:20 (13164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:05:11 (14320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:03:05 (13940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:01:50 (12884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:53:21 (19604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:52:09 (19972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:49:44 (19852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:46:20 (12632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:45:10 (12232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:43:54 (19520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:42:42 (13960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 12:41:29 (14104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:40:34 (12848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:39:15 (19744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:35:47 (19292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:33:29 (19620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:32:23 (12160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:31:11 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:30:02 (20180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:59 (20220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:27:48 (19116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:25:27 (19120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:24:18 (20332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:23:07 (17340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:21:57 (13204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:20:56 (19928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:19:45 (20364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:18:33 (20360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:17:17 (12744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:16:03 (20428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:12:33 (20104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:11:22 (20116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:10:29 (28328): No heartbeat from core client for 30 sec - exiting 09:10:30 (28328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:10:32 (28328): No heartbeat from core client for 30 sec - exiting 09:10:33 (28328): No heartbeat from core client for 30 sec - exiting 09:10:34 (28328): No heartbeat from core client for 30 sec - exiting 12:09:13 (27912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:02 (27636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:05:39 (26960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:04:28 (28352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:03:19 (26752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:09 (26896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:00:58 (26916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:59:49 (28340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:15:26 (10284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:02:43 (9464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:53:15 (5008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:47:12 (5772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:45:59 (6032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:44:50 (3812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:43:45 (5940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:42:31 (3800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:35:44 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:30:57 (6076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:26:34 (5632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7364, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Oct 2013 18:55:32 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 596,160 | 868,860 | 1.4574 |
22 Oct 2013 09:22:58 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 570,240 | 831,038 | 1.4573 |
21 Oct 2013 15:44:22 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 544,320 | 793,152 | 1.4571 |
20 Oct 2013 20:43:55 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 518,400 | 755,415 | 1.4572 |
17 Oct 2013 23:46:19 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 492,480 | 717,305 | 1.4565 |
16 Oct 2013 21:52:52 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 466,560 | 679,757 | 1.4570 |
09 Oct 2013 15:34:02 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 440,640 | 642,083 | 1.4572 |
09 Oct 2013 02:40:06 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 414,720 | 603,761 | 1.4558 |
08 Oct 2013 13:35:17 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 388,800 | 565,783 | 1.4552 |
07 Oct 2013 23:08:00 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 362,880 | 527,577 | 1.4539 |
07 Oct 2013 06:33:37 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 336,960 | 489,661 | 1.4532 |
06 Oct 2013 15:41:21 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 311,040 | 451,586 | 1.4519 |
06 Oct 2013 02:26:11 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 285,120 | 413,649 | 1.4508 |
05 Oct 2013 14:21:09 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 259,200 | 376,000 | 1.4506 |
05 Oct 2013 01:05:35 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 233,280 | 337,899 | 1.4485 |
04 Oct 2013 11:53:12 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 207,360 | 300,022 | 1.4469 |
03 Oct 2013 23:31:29 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 181,440 | 262,502 | 1.4468 |
03 Oct 2013 11:37:33 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 155,520 | 224,896 | 1.4461 |
02 Oct 2013 23:21:38 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 129,600 | 187,310 | 1.4453 |
02 Oct 2013 08:29:17 | 1273479 | 16044555 | hadcm3n_oeg7_1900_40_008473850_0 | 103,680 | 149,872 | 1.4455 |
©2024 cpdn.org