climateprediction.net home page
Task 15733345

Task 15733345

Name hadcm3n_zkpq_1960_40_008280624_2
Workunit 8431759
Created 18 Apr 2013, 11:00:14 UTC
Sent 18 Apr 2013, 11:00:39 UTC
Report deadline 18 Jul 2013, 18:27:50 UTC
Received 15 May 2013, 13:11:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1158176
Run time 26 days 7 hours 30 min 51 sec
CPU time 22 days 5 hours 6 min 27 sec
Validate state Invalid
Credit 10,886.40
Device peak FLOPS 2.96 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=46416, iMonCtr=1
Model crash detected, will try to restart...
06:44:52 (11872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:02:37 (6688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:59:34 (33380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:01:01 (37484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:01 (31464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:50:40 (34756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=44356, iMonCtr=1
Model crash detected, will try to restart...
19:26:23 (5732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:27:03 (1236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:28:16 (5572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:32:05 (3464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:06:34 (352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:08:18 (6828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:46:38 (3128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:51:38 (2236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:01:38 (5840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:06:39 (552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:16:40 (7012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:26:40 (6764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:36:40 (6684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:41:52 (4344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:45:16 (5804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:45:17 (5804): No heartbeat from core client for 30 sec - exiting
00:45:18 (5804): No heartbeat from core client for 30 sec - exiting
00:45:19 (5804): No heartbeat from core client for 30 sec - exiting
00:45:20 (5804): No heartbeat from core client for 30 sec - exiting
00:45:21 (5804): No heartbeat from core client for 30 sec - exiting
00:45:22 (5804): No heartbeat from core client for 30 sec - exiting
00:45:23 (5804): No heartbeat from core client for 30 sec - exiting
00:45:24 (5804): No heartbeat from core client for 30 sec - exiting
00:45:25 (5804): No heartbeat from core client for 30 sec - exiting
00:45:26 (5804): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:10:48 (5916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:35:51 (10124): No heartbeat from core client for 30 sec - exiting
03:35:52 (10124): No heartbeat from core client for 30 sec - exiting
03:35:53 (10124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zkpqko.pji6c10
Error converting file to netcdf: dataout/zkpqko.pii6c10
Error converting file to netcdf: dataout/zkpqko.pfi6c10
Error converting file to netcdf: dataout/zkpqka.phi6c10
Error converting file to netcdf: dataout/zkpqka.pgi6c10
Error converting file to netcdf: dataout/zkpqka.pei6c10
Error converting file to netcdf: dataout/zkpqka.pdi6c10
03:50:51 (9632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:50:52 (9632): No heartbeat from core client for 30 sec - exiting
11:51:41 (4536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:35:43 (11272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:37:04 (15572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:39:34 (13712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:39:35 (13712): No heartbeat from core client for 30 sec - exiting
17:39:36 (13712): No heartbeat from core client for 30 sec - exiting
17:39:37 (13712): No heartbeat from core client for 30 sec - exiting
17:39:38 (13712): No heartbeat from core client for 30 sec - exiting
17:39:39 (13712): No heartbeat from core client for 30 sec - exiting
17:39:40 (13712): No heartbeat from core client for 30 sec - exiting
17:39:41 (13712): No heartbeat from core client for 30 sec - exiting
17:39:43 (13712): No heartbeat from core client for 30 sec - exiting
17:39:44 (13712): No heartbeat from core client for 30 sec - exiting
17:39:45 (13712): No heartbeat from core client for 30 sec - exiting
21:29:31 (15652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:32:32 (12940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:36:30 (17920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:36:31 (17920): No heartbeat from core client for 30 sec - exiting
21:36:32 (17920): No heartbeat from core client for 30 sec - exiting
21:36:33 (17920): No heartbeat from core client for 30 sec - exiting
21:36:34 (17920): No heartbeat from core client for 30 sec - exiting
21:36:35 (17920): No heartbeat from core client for 30 sec - exiting
21:36:36 (17920): No heartbeat from core client for 30 sec - exiting
21:36:37 (17920): No heartbeat from core client for 30 sec - exiting
21:36:38 (17920): No heartbeat from core client for 30 sec - exiting
21:36:39 (17920): No heartbeat from core client for 30 sec - exiting
21:36:40 (17920): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
06:06:09 (13852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:06:11 (13852): No heartbeat from core client for 30 sec - exiting
06:06:12 (13852): No heartbeat from core client for 30 sec - exiting
06:06:13 (13852): No heartbeat from core client for 30 sec - exiting
06:06:14 (13852): No heartbeat from core client for 30 sec - exiting
06:06:15 (13852): No heartbeat from core client for 30 sec - exiting
06:06:16 (13852): No heartbeat from core client for 30 sec - exiting
06:06:17 (13852): No heartbeat from core client for 30 sec - exiting
06:06:18 (13852): No heartbeat from core client for 30 sec - exiting
06:06:19 (13852): No heartbeat from core client for 30 sec - exiting
06:06:20 (13852): No heartbeat from core client for 30 sec - exiting
06:07:49 (15012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 May 2013 17:12:23 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 907,200 1,867,840 2.0589
13 May 2013 21:33:36 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 881,280 1,806,962 2.0504
13 May 2013 02:40:11 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 855,360 1,744,047 2.0390
12 May 2013 07:10:30 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 829,440 1,678,423 2.0236
11 May 2013 12:44:38 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 803,520 1,616,659 2.0120
10 May 2013 21:51:49 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 777,600 1,565,857 2.0137
10 May 2013 06:35:39 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 751,680 1,515,034 2.0155
09 May 2013 16:06:29 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 725,760 1,467,959 2.0227
09 May 2013 00:48:48 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 699,840 1,418,740 2.0272
08 May 2013 09:39:22 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 673,920 1,368,264 2.0303
07 May 2013 19:02:20 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 648,000 1,317,841 2.0337
07 May 2013 02:19:55 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 622,080 1,263,747 2.0315
06 May 2013 10:49:03 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 596,160 1,210,970 2.0313
05 May 2013 10:55:29 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 570,240 1,159,523 2.0334
04 May 2013 10:26:30 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 544,320 1,109,922 2.0391
03 May 2013 11:54:34 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 518,400 1,060,214 2.0452
02 May 2013 16:11:08 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 492,480 1,011,180 2.0532
01 May 2013 21:52:09 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 466,560 961,161 2.0601
01 May 2013 05:27:56 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 440,640 911,164 2.0678
30 Apr 2013 12:36:48 1158176 15733345 hadcm3n_zkpq_1960_40_008280624_2 414,720 860,763 2.0755


©2024 cpdn.org