Name | hadcm3n_zkpq_1960_40_008280624_2 |
Workunit | 8431759 |
Created | 18 Apr 2013, 11:00:14 UTC |
Sent | 18 Apr 2013, 11:00:39 UTC |
Report deadline | 18 Jul 2013, 18:27:50 UTC |
Received | 15 May 2013, 13:11:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1158176 |
Run time | 26 days 7 hours 30 min 51 sec |
CPU time | 22 days 5 hours 6 min 27 sec |
Validate state | Invalid |
Credit | 10,886.40 |
Device peak FLOPS | 2.96 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=46416, iMonCtr=1 Model crash detected, will try to restart... 06:44:52 (11872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:02:37 (6688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:59:34 (33380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:01:01 (37484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:26:01 (31464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:50:40 (34756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=44356, iMonCtr=1 Model crash detected, will try to restart... 19:26:23 (5732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:27:03 (1236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:28:16 (5572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:32:05 (3464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:06:34 (352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
23:08:18 (6828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:46:38 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:51:38 (2236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:01:38 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:06:39 (552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:16:40 (7012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:26:40 (6764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:36:40 (6684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:41:52 (4344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:45:16 (5804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:45:17 (5804): No heartbeat from core client for 30 sec - exiting 00:45:18 (5804): No heartbeat from core client for 30 sec - exiting 00:45:19 (5804): No heartbeat from core client for 30 sec - exiting 00:45:20 (5804): No heartbeat from core client for 30 sec - exiting 00:45:21 (5804): No heartbeat from core client for 30 sec - exiting 00:45:22 (5804): No heartbeat from core client for 30 sec - exiting 00:45:23 (5804): No heartbeat from core client for 30 sec - exiting 00:45:24 (5804): No heartbeat from core client for 30 sec - exiting 00:45:25 (5804): No heartbeat from core client for 30 sec - exiting 00:45:26 (5804): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
03:10:48 (5916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:35:51 (10124): No heartbeat from core client for 30 sec - exiting 03:35:52 (10124): No heartbeat from core client for 30 sec - exiting 03:35:53 (10124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/zkpqko.pji6c10 Error converting file to netcdf: dataout/zkpqko.pii6c10 Error converting file to netcdf: dataout/zkpqko.pfi6c10 Error converting file to netcdf: dataout/zkpqka.phi6c10 Error converting file to netcdf: dataout/zkpqka.pgi6c10 Error converting file to netcdf: dataout/zkpqka.pei6c10 Error converting file to netcdf: dataout/zkpqka.pdi6c10 03:50:51 (9632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:50:52 (9632): No heartbeat from core client for 30 sec - exiting 11:51:41 (4536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:35:43 (11272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:37:04 (15572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:39:34 (13712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:39:35 (13712): No heartbeat from core client for 30 sec - exiting 17:39:36 (13712): No heartbeat from core client for 30 sec - exiting 17:39:37 (13712): No heartbeat from core client for 30 sec - exiting 17:39:38 (13712): No heartbeat from core client for 30 sec - exiting 17:39:39 (13712): No heartbeat from core client for 30 sec - exiting 17:39:40 (13712): No heartbeat from core client for 30 sec - exiting 17:39:41 (13712): No heartbeat from core client for 30 sec - exiting 17:39:43 (13712): No heartbeat from core client for 30 sec - exiting 17:39:44 (13712): No heartbeat from core client for 30 sec - exiting 17:39:45 (13712): No heartbeat from core client for 30 sec - exiting 21:29:31 (15652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:32:32 (12940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:36:30 (17920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:36:31 (17920): No heartbeat from core client for 30 sec - exiting 21:36:32 (17920): No heartbeat from core client for 30 sec - exiting 21:36:33 (17920): No heartbeat from core client for 30 sec - exiting 21:36:34 (17920): No heartbeat from core client for 30 sec - exiting 21:36:35 (17920): No heartbeat from core client for 30 sec - exiting 21:36:36 (17920): No heartbeat from core client for 30 sec - exiting 21:36:37 (17920): No heartbeat from core client for 30 sec - exiting 21:36:38 (17920): No heartbeat from core client for 30 sec - exiting 21:36:39 (17920): No heartbeat from core client for 30 sec - exiting 21:36:40 (17920): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 06:06:09 (13852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
06:06:11 (13852): No heartbeat from core client for 30 sec - exiting 06:06:12 (13852): No heartbeat from core client for 30 sec - exiting 06:06:13 (13852): No heartbeat from core client for 30 sec - exiting 06:06:14 (13852): No heartbeat from core client for 30 sec - exiting 06:06:15 (13852): No heartbeat from core client for 30 sec - exiting 06:06:16 (13852): No heartbeat from core client for 30 sec - exiting 06:06:17 (13852): No heartbeat from core client for 30 sec - exiting 06:06:18 (13852): No heartbeat from core client for 30 sec - exiting 06:06:19 (13852): No heartbeat from core client for 30 sec - exiting 06:06:20 (13852): No heartbeat from core client for 30 sec - exiting 06:07:49 (15012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Called boinc_finish </stderr_txt> ]]> |
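
The Stderr field above is dominated by repeated heartbeat-loss messages, each tagged with a timestamp and a process ID. A minimal sketch of how such a log could be tallied per process follows. It assumes the log is available as a single string; the helper name `heartbeat_losses_by_pid` and the regular expression are illustrative, not part of BOINC or CPDN. The excerpt is copied from the log above and is not the full output.

```python
import re
from collections import Counter

# Short excerpt copied verbatim from the Stderr field (not the full log).
excerpt = (
    "03:35:51 (10124): No heartbeat from core client for 30 sec - exiting "
    "03:35:52 (10124): No heartbeat from core client for 30 sec - exiting "
    "03:35:53 (10124): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 "
    "03:50:51 (9632): No heartbeat from core client for 30 sec - exiting"
)

# Hypothetical pattern: "HH:MM:SS (PID): No heartbeat from core client".
HEARTBEAT = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def heartbeat_losses_by_pid(text):
    """Count heartbeat-loss messages per process ID (PIDs kept as strings)."""
    return Counter(pid for _time, pid in HEARTBEAT.findall(text))


counts = heartbeat_losses_by_pid(excerpt)
for pid, n in counts.items():
    print(f"PID {pid}: {n} heartbeat-loss message(s)")

# The exit status is reported both in decimal and in hex; they are the same value.
exit_status = int("0x19", 16)
print("exit status:", exit_status)
```

Grouping by process ID separates a single process that logged the message many times in a row from separate restarts that each logged it once.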
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
14 May 2013 17:12:23 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 907,200 | 1,867,840 | 2.0589 |
13 May 2013 21:33:36 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 881,280 | 1,806,962 | 2.0504 |
13 May 2013 02:40:11 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 855,360 | 1,744,047 | 2.0390 |
12 May 2013 07:10:30 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 829,440 | 1,678,423 | 2.0236 |
11 May 2013 12:44:38 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 803,520 | 1,616,659 | 2.0120 |
10 May 2013 21:51:49 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 777,600 | 1,565,857 | 2.0137 |
10 May 2013 06:35:39 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 751,680 | 1,515,034 | 2.0155 |
09 May 2013 16:06:29 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 725,760 | 1,467,959 | 2.0227 |
09 May 2013 00:48:48 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 699,840 | 1,418,740 | 2.0272 |
08 May 2013 09:39:22 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 673,920 | 1,368,264 | 2.0303 |
07 May 2013 19:02:20 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 648,000 | 1,317,841 | 2.0337 |
07 May 2013 02:19:55 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 622,080 | 1,263,747 | 2.0315 |
06 May 2013 10:49:03 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 596,160 | 1,210,970 | 2.0313 |
05 May 2013 10:55:29 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 570,240 | 1,159,523 | 2.0334 |
04 May 2013 10:26:30 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 544,320 | 1,109,922 | 2.0391 |
03 May 2013 11:54:34 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 518,400 | 1,060,214 | 2.0452 |
02 May 2013 16:11:08 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 492,480 | 1,011,180 | 2.0532 |
01 May 2013 21:52:09 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 466,560 | 961,161 | 2.0601 |
01 May 2013 05:27:56 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 440,640 | 911,164 | 2.0678 |
30 Apr 2013 12:36:48 | 1158176 | 15733345 | hadcm3n_zkpq_1960_40_008280624_2 | 414,720 | 860,763 | 2.0755 |
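
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks this against three rows copied from the table above. The helper name `parse_int` and the row selection are illustrative assumptions, not part of the CPDN site.

```python
def parse_int(field):
    """Convert a table field such as '1,867,840' to an integer."""
    return int(field.replace(",", ""))


# (Timestep, CPU Time (sec), reported Average (sec/TS)) copied from the table.
rows = [
    ("907,200", "1,867,840", 2.0589),
    ("881,280", "1,806,962", 2.0504),
    ("414,720", "860,763", 2.0755),
]

# Recompute the average for each row and keep it next to the reported value.
averages = []
for timestep, cpu_time, reported in rows:
    avg = parse_int(cpu_time) / parse_int(timestep)
    averages.append((avg, reported))
    print(f"{timestep:>9} timesteps: computed {avg:.4f}, reported {reported:.4f}")
```

Each computed value agrees with the reported one to four decimal places, which is consistent with the column being a running average over the whole task rather than a per-trickle rate.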