Name | hadcm3n_yjso_1900_40_007358514_1 |
Workunit | 7555944 |
Created | 6 Jul 2011, 15:00:16 UTC |
Sent | 8 Jul 2011, 8:00:12 UTC |
Report deadline | 7 Oct 2011, 15:27:23 UTC |
Received | 28 Jul 2011, 8:53:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1110972 |
Run time | 13 days 3 hours 14 min 22 sec |
CPU time | 11 days 16 hours 18 min 7 sec |
Validate state | Invalid |
Credit | 5,287.68 |
Device peak FLOPS | 2.53 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> 06:06:54 (4336): No heartbeat from core client for 30 sec - exiting 06:06:55 (4336): No heartbeat from core client for 30 sec - exiting 06:06:56 (4336): No heartbeat from core client for 30 sec - exiting 06:06:57 (4336): No heartbeat from core client for 30 sec - exiting 06:06:58 (4336): No heartbeat from core client for 30 sec - exiting 06:06:59 (4336): No heartbeat from core client for 30 sec - exiting 06:07:00 (4336): No heartbeat from core client for 30 sec - exiting 06:07:01 (4336): No heartbeat from core client for 30 sec - exiting 06:07:03 (4336): No heartbeat from core client for 30 sec - exiting 06:07:04 (4336): No heartbeat from core client for 30 sec - exiting 06:07:05 (4336): No heartbeat from core client for 30 sec - exiting 06:07:06 (4336): No heartbeat from core client for 30 sec - exiting 06:07:07 (4336): No heartbeat from core client for 30 sec - exiting 06:07:08 (4336): No heartbeat from core client for 30 sec - exiting 06:07:09 (4336): No heartbeat from core client for 30 sec - exiting 06:07:10 (4336): No heartbeat from core client for 30 sec - exiting 06:07:11 (4336): No heartbeat from core client for 30 sec - exiting 06:07:12 (4336): No heartbeat from core client for 30 sec - exiting 06:07:13 (4336): No heartbeat from core client for 30 sec - exiting 06:07:15 (4336): No heartbeat from core client for 30 sec - exiting 06:07:16 (4336): No heartbeat from core client for 30 sec - exiting 06:07:17 (4336): No heartbeat from core client for 30 sec - exiting 06:07:18 (4336): No heartbeat from core client for 30 sec - exiting 06:07:19 (4336): No heartbeat from core client for 30 sec - exiting 06:07:20 (4336): No heartbeat from core client for 30 sec - exiting 06:07:21 (4336): No heartbeat from core client for 30 sec - exiting 06:07:22 (4336): No heartbeat from 
core client for 30 sec - exiting 06:07:23 (4336): No heartbeat from core client for 30 sec - exiting 06:07:24 (4336): No heartbeat from core client for 30 sec - exiting 06:07:26 (4336): No heartbeat from core client for 30 sec - exiting 06:07:27 (4336): No heartbeat from core client for 30 sec - exiting 06:07:28 (4336): No heartbeat from core client for 30 sec - exiting 06:07:29 (4336): No heartbeat from core client for 30 sec - exiting 06:07:30 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5708, iMonCtr=1 Model crash detected, will try to restart... 13:57:44 (4496): No heartbeat from core client for 30 sec - exiting 13:57:45 (4496): No heartbeat from core client for 30 sec - exiting 13:57:46 (4496): No heartbeat from core client for 30 sec - exiting 13:57:47 (4496): No heartbeat from core client for 30 sec - exiting 13:57:48 (4496): No heartbeat from core client for 30 sec - exiting 13:57:49 (4496): No heartbeat from core client for 30 sec - exiting 13:57:50 (4496): No heartbeat from core client for 30 sec - exiting 13:57:51 (4496): No heartbeat from core client for 30 sec - exiting 13:57:52 (4496): No heartbeat from core client for 30 sec - exiting 13:57:53 (4496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2560, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 06:31:38 (4920): No heartbeat from core client for 30 sec - exiting 06:31:39 (4920): No heartbeat from core client for 30 sec - exiting 06:31:40 (4920): No heartbeat from core client for 30 sec - exiting 06:31:41 (4920): No heartbeat from core client for 30 sec - exiting 06:31:42 (4920): No heartbeat from core client for 30 sec - exiting 06:31:44 (4920): No heartbeat from core client for 30 sec - exiting 06:31:45 (4920): No heartbeat from core client for 30 sec - exiting 06:31:46 (4920): No heartbeat from core client for 30 sec - exiting 06:31:47 (4920): No heartbeat from core client for 30 sec - exiting 06:31:48 (4920): No heartbeat from core client for 30 sec - exiting 06:31:49 (4920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=1 Model crash detected, will try to restart... 
17:32:45 (5392): No heartbeat from core client for 30 sec - exiting 17:32:46 (5392): No heartbeat from core client for 30 sec - exiting 17:32:47 (5392): No heartbeat from core client for 30 sec - exiting 17:32:48 (5392): No heartbeat from core client for 30 sec - exiting 17:32:50 (5392): No heartbeat from core client for 30 sec - exiting 17:32:51 (5392): No heartbeat from core client for 30 sec - exiting 17:32:52 (5392): No heartbeat from core client for 30 sec - exiting 17:32:53 (5392): No heartbeat from core client for 30 sec - exiting 17:32:54 (5392): No heartbeat from core client for 30 sec - exiting 17:32:55 (5392): No heartbeat from core client for 30 sec - exiting 17:32:56 (5392): No heartbeat from core client for 30 sec - exiting 17:32:57 (5392): No heartbeat from core client for 30 sec - exiting 17:32:58 (5392): No heartbeat from core client for 30 sec - exiting 17:32:59 (5392): No heartbeat from core client for 30 sec - exiting 17:33:00 (5392): No heartbeat from core client for 30 sec - exiting 17:33:02 (5392): No heartbeat from core client for 30 sec - exiting 17:33:03 (5392): No heartbeat from core client for 30 sec - exiting 17:33:04 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:33:05 (5392): No heartbeat from core client for 30 sec - exiting 05:40:49 (4496): No heartbeat from core client for 30 sec - exiting 05:40:50 (4496): No heartbeat from core client for 30 sec - exiting 05:40:52 (4496): No heartbeat from core client for 30 sec - exiting 05:40:53 (4496): No heartbeat from core client for 30 sec - exiting 05:40:54 (4496): No heartbeat from core client for 30 sec - exiting 05:40:55 (4496): No heartbeat from core client for 30 sec - exiting 05:40:56 (4496): No heartbeat from core client for 30 sec - exiting 05:40:57 (4496): No heartbeat from core client for 30 sec - exiting 05:40:58 (4496): No heartbeat from core client for 30 sec - exiting 05:40:59 (4496): No heartbeat from core client for 30 sec - exiting 05:41:00 (4496): No heartbeat from core client for 30 sec - exiting 05:41:01 (4496): No heartbeat from core client for 30 sec - exiting 05:41:02 (4496): No heartbeat from core client for 30 sec - exiting 05:41:04 (4496): No heartbeat from core client for 30 sec - exiting 05:41:05 (4496): No heartbeat from core client for 30 sec - exiting 05:41:06 (4496): No heartbeat from core client for 30 sec - exiting 05:41:07 (4496): No heartbeat from core client for 30 sec - exiting 05:41:08 (4496): No heartbeat from core client for 30 sec - exiting 05:41:09 (4496): No heartbeat from core client for 30 sec - exiting 05:41:10 (4496): No heartbeat from core client for 30 sec - exiting 05:41:11 (4496): No heartbeat from core client for 30 sec - exiting 05:41:12 (4496): No heartbeat from core client for 30 sec - exiting 05:41:13 (4496): No heartbeat from core client for 30 sec - exiting 05:41:15 (4496): No heartbeat from core client for 30 sec - exiting 05:41:16 (4496): No heartbeat from core client for 30 sec - exiting 05:41:17 (4496): No heartbeat from core client for 30 sec - exiting 05:41:18 (4496): No heartbeat from core client for 30 sec - exiting 05:41:19 (4496): No heartbeat from core client for 30 sec - exiting 05:41:20 (4496): No 
heartbeat from core client for 30 sec - exiting 05:41:21 (4496): No heartbeat from core client for 30 sec - exiting 05:41:22 (4496): No heartbeat from core client for 30 sec - exiting 05:41:23 (4496): No heartbeat from core client for 30 sec - exiting 05:41:24 (4496): No heartbeat from core client for 30 sec - exiting 05:41:25 (4496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:41:27 (4496): No heartbeat from core client for 30 sec - exiting 06:07:51 (4304): No heartbeat from core client for 30 sec - exiting 06:07:52 (4304): No heartbeat from core client for 30 sec - exiting 06:07:54 (4304): No heartbeat from core client for 30 sec - exiting 06:07:55 (4304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
27 Jul 2011 15:13:46 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 440,640 | 980,379 | 2.2249 |
26 Jul 2011 15:30:23 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 414,720 | 917,335 | 2.2119 |
25 Jul 2011 23:01:16 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 388,800 | 853,700 | 2.1957 |
25 Jul 2011 21:58:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 362,880 | 792,153 | 2.1830 |
25 Jul 2011 20:37:45 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 336,960 | 728,701 | 2.1626 |
25 Jul 2011 19:24:47 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 311,040 | 661,040 | 2.1253 |
25 Jul 2011 19:24:47 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 285,120 | 594,415 | 2.0848 |
25 Jul 2011 18:10:43 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 259,200 | 531,207 | 2.0494 |
25 Jul 2011 17:34:11 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 233,280 | 476,480 | 2.0425 |
25 Jul 2011 15:32:09 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 207,360 | 424,428 | 2.0468 |
25 Jul 2011 13:06:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 181,440 | 366,677 | 2.0209 |
25 Jul 2011 13:06:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 155,520 | 317,015 | 2.0384 |
25 Jul 2011 13:06:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 129,600 | 260,913 | 2.0132 |
25 Jul 2011 13:06:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 103,680 | 211,869 | 2.0435 |
25 Jul 2011 13:06:06 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 77,760 | 156,408 | 2.0114 |
10 Jul 2011 10:47:23 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 51,840 | 101,740 | 1.9626 |
09 Jul 2011 11:56:24 | 1110972 | 13120900 | hadcm3n_yjso_1900_40_007358514_1 | 25,920 | 50,822 | 1.9607 |
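
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The Python sketch below checks that reading against three rows copied from the table; the function name `seconds_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table: (timestep, cpu_seconds, reported_average).
trickles = [
    (25_920, 50_822, 1.9607),
    (233_280, 476_480, 2.0425),
    (440_640, 980_379, 2.2249),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimal places.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>8,} TS  {cpu_seconds:>8,} s  "
          f"computed {computed:.4f}  reported {reported:.4f}")
```

Under this assumption the computed values match the reported ones for these rows, and the rise from about 1.96 to about 2.22 sec/TS shows the average cost per timestep increasing over the run.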
©2024 climateprediction.net