Name | hadcm3n_7ylv_1980_40_008456134_1 |
Workunit | 8606990 |
Created | 12 Dec 2013, 19:54:30 UTC |
Sent | 12 Dec 2013, 19:55:06 UTC |
Report deadline | 14 Mar 2014, 3:22:17 UTC |
Received | 31 Dec 2013, 15:15:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1083083 |
Run time | 5 days 2 hours 54 min 43 sec |
CPU time | 4 days 19 hours 20 min 27 sec |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.28</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 16:27:24 (4256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:35:45 (5668): No heartbeat from core client for 30 sec - exiting 18:35:46 (5668): No heartbeat from core client for 30 sec - exiting 18:35:47 (5668): No heartbeat from core client for 30 sec - exiting 18:35:48 (5668): No heartbeat from core client for 30 sec - exiting 18:35:49 (5668): No heartbeat from core client for 30 sec - exiting 18:35:50 (5668): No heartbeat from core client for 30 sec - exiting 18:35:51 (5668): No heartbeat from core client for 30 sec - exiting 18:35:52 (5668): No heartbeat from core client for 30 sec - exiting 18:35:53 (5668): No heartbeat from core client for 30 sec - exiting 18:35:54 (5668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:42:00 (5180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:40:05 (5876): No heartbeat from core client for 30 sec - exiting 13:40:06 (5876): No heartbeat from core client for 30 sec - exiting 13:40:07 (5876): No heartbeat from core client for 30 sec - exiting 13:40:08 (5876): No heartbeat from core client for 30 sec - exiting 13:40:09 (5876): No heartbeat from core client for 30 sec - exiting 13:40:10 (5876): No heartbeat from core client for 30 sec - exiting 13:40:11 (5876): No heartbeat from core client for 30 sec - exiting 13:40:12 (5876): No heartbeat from core client for 30 sec - exiting 13:40:13 (5876): No heartbeat from core client for 30 sec - exiting 13:40:14 (5876): No heartbeat from core client for 30 sec - exiting 13:40:15 (5876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:40:31 (5136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:40:32 (5136): No heartbeat from core client for 30 sec - exiting 16:40:33 (5136): No heartbeat from core client for 30 sec - exiting 16:40:34 (5136): No heartbeat from core client for 30 sec - exiting 16:40:35 (5136): No heartbeat from core client for 30 sec - exiting 16:40:36 (5136): No heartbeat from core client for 30 sec - exiting 16:40:37 (5136): No heartbeat from core client for 30 sec - exiting 16:40:38 (5136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:51:46 (5348): No heartbeat from core client for 30 sec - exiting 18:51:47 (5348): No heartbeat from core client for 30 sec - exiting 18:51:48 (5348): No heartbeat from core client for 30 sec - exiting 18:51:49 (5348): No heartbeat from core client for 30 sec - exiting 18:51:50 (5348): No heartbeat from core client for 30 sec - exiting 18:51:51 (5348): No heartbeat from core client for 30 sec - exiting 18:51:52 (5348): No heartbeat from core client for 30 sec - exiting 18:51:53 (5348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:51:51 (5348): No heartbeat from core client for 30 sec - exiting 19:06:46 (4860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:06:48 (4860): No heartbeat from core client for 30 sec - exiting 19:07:38 (6688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:57:39 (4584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 21:53:30 (1288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:18:50 (5932): No heartbeat from core client for 30 sec - exiting 08:18:51 (5932): No heartbeat from core client for 30 sec - exiting 08:18:52 (5932): No heartbeat from core client for 30 sec - exiting 08:18:53 (5932): No heartbeat from core client for 30 sec - exiting 08:18:54 (5932): No heartbeat from core client for 30 sec - exiting 08:18:55 (5932): No heartbeat from core client for 30 sec - exiting 08:18:56 (5932): No heartbeat from core client for 30 sec - exiting 08:18:57 (5932): No heartbeat from core client for 30 sec - exiting 08:18:58 (5932): No heartbeat from core client for 30 sec - exiting 08:18:59 (5932): No heartbeat from core client for 30 sec - exiting 08:19:00 (5932): No heartbeat from core client for 30 sec - exiting 08:19:01 (5932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:41:46 (4588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:10:54 (5724): No heartbeat from core client for 30 sec - exiting 09:10:55 (5724): No heartbeat from core client for 30 sec - exiting 09:10:56 (5724): No heartbeat from core client for 30 sec - exiting 09:10:57 (5724): No heartbeat from core client for 30 sec - exiting 09:10:58 (5724): No heartbeat from core client for 30 sec - exiting 09:10:59 (5724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... 09:38:35 (6024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:34:39 (5532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:24:21 (2812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 Dec 2013 09:17:05 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 285,120 | 411,856 | 1.4445 |
29 Dec 2013 17:32:51 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 259,200 | 375,124 | 1.4472 |
28 Dec 2013 08:45:35 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 233,280 | 337,642 | 1.4474 |
27 Dec 2013 18:37:02 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 207,360 | 300,610 | 1.4497 |
23 Dec 2013 23:16:16 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 181,440 | 262,161 | 1.4449 |
19 Dec 2013 22:56:10 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 155,520 | 224,280 | 1.4421 |
16 Dec 2013 17:45:42 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 129,600 | 186,713 | 1.4407 |
16 Dec 2013 06:08:08 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 103,680 | 150,258 | 1.4492 |
15 Dec 2013 16:07:29 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 77,760 | 112,582 | 1.4478 |
13 Dec 2013 20:16:32 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 51,840 | 74,673 | 1.4405 |
13 Dec 2013 06:32:43 | 1083083 | 16142721 | hadcm3n_7ylv_1980_40_008456134_1 | 25,920 | 37,393 | 1.4426 |
©2024 climateprediction.net