Name | hadcm3n_39rr_1940_40_008263800_3 |
Workunit | 8418924 |
Created | 14 May 2013, 19:14:07 UTC |
Sent | 14 May 2013, 19:14:09 UTC |
Report deadline | 14 Aug 2013, 2:41:20 UTC |
Received | 12 Jun 2013, 1:02:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1113142 |
Run time | 20 days 15 hours 30 min 57 sec |
CPU time | 15 days 0 hours 18 min 53 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 2.58 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=84540, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=55852, iMonCtr=1 Model crash detected, will try to restart... 14:55:42 (10636): No heartbeat from core client for 30 sec - exiting 14:55:43 (10636): No heartbeat from core client for 30 sec - exiting 14:55:44 (10636): No heartbeat from core client for 30 sec - exiting 14:55:45 (10636): No heartbeat from core client for 30 sec - exiting 14:55:46 (10636): No heartbeat from core client for 30 sec - exiting 14:55:47 (10636): No heartbeat from core client for 30 sec - exiting 14:55:48 (10636): No heartbeat from core client for 30 sec - exiting 14:55:49 (10636): No heartbeat from core client for 30 sec - exiting 14:55:50 (10636): No heartbeat from core client for 30 sec - exiting 14:55:52 (10636): No heartbeat from core client for 30 sec - exiting 14:55:53 (10636): No heartbeat from core client for 30 sec - exiting 14:55:54 (10636): No heartbeat from core client for 30 sec - exiting 14:55:55 (10636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:06:17 (9004): No heartbeat from core client for 30 sec - exiting 09:06:18 (9004): No heartbeat from core client for 30 sec - exiting 09:06:19 (9004): No heartbeat from core client for 30 sec - exiting 09:06:20 (9004): No heartbeat from core client for 30 sec - exiting 09:06:21 (9004): No heartbeat from core client for 30 sec - exiting 09:06:23 (9004): No heartbeat from core client for 30 sec - exiting 09:06:24 (9004): No heartbeat from core client for 30 sec - exiting 09:06:25 (9004): No heartbeat from core client for 30 sec - exiting 09:06:26 (9004): No heartbeat from core client for 30 sec - exiting 09:06:27 (9004): No heartbeat from core client for 30 sec - exiting 09:06:28 (9004): No heartbeat from core client for 30 sec - exiting 09:06:29 (9004): No heartbeat from core client for 30 sec - exiting 09:06:30 (9004): No heartbeat from core client for 30 sec - exiting 09:06:31 (9004): No heartbeat from core client for 30 sec - exiting 09:06:32 (9004): No heartbeat from core client for 30 sec - exiting 09:06:33 (9004): No heartbeat from core client for 30 sec - exiting 09:06:35 (9004): No heartbeat from core client for 30 sec - exiting 09:06:36 (9004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Jun 2013 14:35:43 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 751,680 | 1,288,134 | 1.7137 |
10 Jun 2013 23:08:47 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 725,760 | 1,244,469 | 1.7147 |
09 Jun 2013 23:48:44 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 699,840 | 1,199,976 | 1.7146 |
09 Jun 2013 10:10:11 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 673,920 | 1,154,486 | 1.7131 |
08 Jun 2013 20:59:20 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 648,000 | 1,108,792 | 1.7111 |
08 Jun 2013 01:05:13 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 622,080 | 1,064,081 | 1.7105 |
07 Jun 2013 07:19:48 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 596,160 | 1,019,641 | 1.7103 |
06 Jun 2013 12:13:59 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 570,240 | 974,964 | 1.7097 |
05 Jun 2013 16:54:19 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 544,320 | 930,324 | 1.7091 |
04 Jun 2013 18:46:02 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 518,400 | 885,910 | 1.7089 |
03 Jun 2013 22:58:54 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 492,480 | 841,299 | 1.7083 |
03 Jun 2013 04:45:19 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 466,560 | 796,750 | 1.7077 |
02 Jun 2013 10:34:21 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 440,640 | 752,406 | 1.7075 |
01 Jun 2013 18:36:06 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 414,720 | 708,255 | 1.7078 |
01 Jun 2013 01:43:06 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 388,800 | 663,783 | 1.7073 |
31 May 2013 08:26:08 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 362,880 | 618,489 | 1.7044 |
30 May 2013 16:44:04 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 336,960 | 573,939 | 1.7033 |
30 May 2013 02:46:19 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 311,040 | 529,119 | 1.7011 |
29 May 2013 09:48:16 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 285,120 | 484,686 | 1.6999 |
28 May 2013 12:04:56 | 1113142 | 15784196 | hadcm3n_39rr_1940_40_008263800_3 | 259,200 | 439,905 | 1.6972 |
©2024 cpdn.org