Name | hadcm3n_ylkb_1900_40_007360805_0 |
Workunit | 7558235 |
Created | 6 Jul 2011, 15:15:08 UTC |
Sent | 7 Jul 2011, 16:52:32 UTC |
Report deadline | 7 Oct 2011, 0:19:43 UTC |
Received | 7 Aug 2011, 8:23:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 523127 |
Run time | 8 days 23 hours 49 min 38 sec |
CPU time | 8 days 23 hours 49 min 38 sec |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 2.20 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.2.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:19:20 (2156): No heartbeat from core client for 30 sec - exiting 17:19:21 (2156): No heartbeat from core client for 30 sec - exiting 17:19:27 (2156): No heartbeat from core client for 30 sec - exiting 17:19:29 (2156): No heartbeat from core client for 30 sec - exiting 17:19:34 (2156): No heartbeat from core client for 30 sec - exiting 17:19:36 (2156): No heartbeat from core client for 30 sec - exiting 17:19:38 (2156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:44:38 (2188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:16:33 (2768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:18:19 (540): No heartbeat from core client for 30 sec - exiting 22:18:20 (540): No heartbeat from core client for 30 sec - exiting 22:18:21 (540): No heartbeat from core client for 30 sec - exiting 22:18:22 (540): No heartbeat from core client for 30 sec - exiting 22:18:23 (540): No heartbeat from core client for 30 sec - exiting 22:18:24 (540): No heartbeat from core client for 30 sec - exiting 22:18:25 (540): No heartbeat from core client for 30 sec - exiting 22:18:26 (540): No heartbeat from core client for 30 sec - exiting 22:18:27 (540): No heartbeat from core client for 30 sec - exiting 22:18:28 (540): No heartbeat from core client for 30 sec - exiting 22:18:30 (540): No heartbeat from core client for 30 sec - exiting 22:18:31 (540): No heartbeat from core client for 30 sec - exiting 22:18:32 (540): No heartbeat from core client for 30 sec - exiting 22:18:33 (540): No heartbeat from core client for 30 sec - exiting 22:18:34 (540): No heartbeat from core client for 30 sec - exiting 22:18:35 (540): No heartbeat from core client for 30 sec - exiting 22:18:36 (540): No heartbeat from core client for 30 sec - exiting 22:18:37 (540): No heartbeat from core client for 30 sec - exiting 22:18:38 (540): No heartbeat from core client for 30 sec - exiting 22:18:39 (540): No heartbeat from core client for 30 sec - exiting 22:18:40 (540): No heartbeat from core client for 30 sec - exiting 22:18:42 (540): No heartbeat from core client for 30 sec - exiting 22:18:43 (540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Restart file copy failed on ylkbka.dab25a0 Model crashed: TEMPHIST: Failed in OPEN of history file tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/jobs/xabnk.ihist after 11 attempts 22:42:39 (448): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/jobs/xabnk.namelists after 11 attempts 22:42:40 (448): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/dataout/atmos_restart.day after 11 attempts 22:42:41 (448): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file C:\Documents and Settings\All Users\Application Data\BOINC/projects/climateprediction.net/hadcm3n_ylkb_1900_40_007360805/dataout/ocean_restart.day after 11 attempts 22:42:41 (448): Can't open init data file - running in standalone mode Could not launch model process. Last Error=6 Called boinc_finish 22:42:42 (448): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=344, iMonCtr=1 Model crash detected, will try to restart... 01:03:36 (344): No heartbeat from core client for 30 sec - exiting 01:03:37 (344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Jul 2011 08:06:51 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 285,120 | 745,505 | 2.6147 |
25 Jul 2011 22:58:03 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 259,200 | 677,354 | 2.6132 |
25 Jul 2011 22:11:27 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 233,280 | 608,752 | 2.6095 |
25 Jul 2011 20:53:44 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 207,360 | 543,944 | 2.6232 |
25 Jul 2011 16:23:14 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 181,440 | 478,991 | 2.6399 |
25 Jul 2011 15:48:17 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 155,520 | 411,791 | 2.6478 |
25 Jul 2011 15:01:53 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 129,600 | 342,876 | 2.6456 |
25 Jul 2011 15:01:53 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 103,680 | 275,047 | 2.6528 |
10 Jul 2011 17:46:02 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 77,760 | 207,022 | 2.6623 |
09 Jul 2011 09:20:32 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 51,840 | 139,084 | 2.6829 |
08 Jul 2011 14:23:40 | 523127 | 13125481 | hadcm3n_ylkb_1900_40_007360805_0 | 25,920 | 69,768 | 2.6917 |
©2025 cpdn.org