Name | hadcm3n_7zwc_1980_40_008457807_2 |
Workunit | 8608663 |
Created | 21 Oct 2013, 0:59:51 UTC |
Sent | 21 Oct 2013, 0:59:57 UTC |
Report deadline | 20 Jan 2014, 8:27:08 UTC |
Received | 3 Nov 2013, 10:33:28 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1277629 |
Run time | 10 days 22 hours 9 min 3 sec |
CPU time | 10 days 5 hours 21 min 59 sec |
Validate state | Invalid |
Credit | 2,799.36 |
Device peak FLOPS | 1.17 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.26</core_client_version> <![CDATA[ <message> Enheten gjenkjenner ikke kommandoen. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 13:32:20 (916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:05 (804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:21 (2288): No heartbeat from core client for 30 sec - exiting 13:36:23 (2288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:38:01 (5236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:33:44 (5692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:33:45 (5692): No heartbeat from core client for 30 sec - exiting 10:43:53 (5144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:43:54 (5144): No heartbeat from core client for 30 sec - exiting 10:43:56 (5144): No heartbeat from core client for 30 sec - exiting 11:02:56 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:04:28 (6208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:28:34 (768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:28:35 (768): No heartbeat from core client for 30 sec - exiting 11:28:36 (768): No heartbeat from core client for 30 sec - exiting 11:28:37 (768): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:59:51 (4452): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 21:59:53 (4452): No heartbeat from core client for 30 sec - exiting 21:59:54 (4452): No heartbeat from core client for 30 sec - exiting 21:59:55 (4452): No heartbeat from core client for 30 sec - exiting 21:59:56 (4452): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 06:45:09 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:37:38 (7460): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 22:37:39 (7460): No heartbeat from core client for 30 sec - exiting 22:37:40 (7460): No heartbeat from core client for 30 sec - exiting 22:37:41 (7460): No heartbeat from core client for 30 sec - exiting 22:37:42 (7460): No heartbeat from core client for 30 sec - exiting 22:37:43 (7460): No heartbeat from core client for 30 sec - exiting 22:37:44 (7460): No heartbeat from core client for 30 sec - exiting 22:37:45 (7460): No heartbeat from core client for 30 sec - exiting 22:37:46 (7460): No heartbeat from core client for 30 sec - exiting 22:37:47 (7460): No heartbeat from core client for 30 sec - exiting 22:37:48 (7460): No heartbeat from core client for 30 sec - exiting 22:37:49 (7460): No heartbeat from core client for 30 sec - exiting 22:37:50 (7460): No heartbeat from core client for 30 sec - exiting 22:37:51 (7460): No heartbeat from core client for 30 sec - exiting 22:37:52 (7460): No heartbeat from core client for 30 sec - exiting 22:37:53 (7460): No heartbeat from core client for 30 sec - exiting 22:37:54 (7460): No heartbeat from core client for 30 sec - exiting 22:37:55 (7460): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 21:15:43 (5340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:32:19 (3676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:32:20 (3676): No heartbeat from core client for 30 sec - exiting 06:32:21 (3676): No heartbeat from core client for 30 sec - exiting 06:32:22 (3676): No heartbeat from core client for 30 sec - exiting 06:32:23 (3676): No heartbeat from core client for 30 sec - exiting 06:32:24 (3676): No heartbeat from core client for 30 sec - exiting 06:32:25 (3676): No heartbeat from core client for 30 sec - exiting 06:32:26 (3676): No heartbeat from core client for 30 sec - exiting 06:32:27 (3676): No heartbeat from core client for 30 sec - exiting 06:32:28 (3676): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8516, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8516, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Nov 2013 15:57:49 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 233,280 | 804,600 | 3.4491 |
31 Oct 2013 12:50:10 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 207,360 | 716,967 | 3.4576 |
30 Oct 2013 12:25:39 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 181,440 | 631,711 | 3.4817 |
29 Oct 2013 10:43:57 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 155,520 | 543,916 | 3.4974 |
28 Oct 2013 11:12:13 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 129,600 | 461,531 | 3.5612 |
27 Oct 2013 08:43:18 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 103,680 | 371,685 | 3.5849 |
25 Oct 2013 00:49:19 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 77,760 | 280,002 | 3.6008 |
23 Oct 2013 04:25:18 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 51,840 | 160,921 | 3.1042 |
21 Oct 2013 20:58:07 | 1277629 | 16071123 | hadcm3n_7zwc_1980_40_008457807_2 | 25,920 | 71,585 | 2.7618 |
©2024 cpdn.org