climateprediction.net home page
Task 15457376

Task 15457376

Name hadcm3n_zhuq_1880_40_008252546_0
Workunit 8407670
Created 23 Nov 2012, 14:19:49 UTC
Sent 23 Nov 2012, 14:20:11 UTC
Report deadline 22 Feb 2013, 21:47:22 UTC
Received 22 Dec 2012, 5:27:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1242229
Run time 19 days 19 hours 12 min 17 sec
CPU time 18 days 10 hours 3 min 32 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 2.86 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
17:53:48 (8736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:05:18 (13564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:06:18 (13564): No heartbeat from core client for 30 sec - exiting
08:06:19 (13564): No heartbeat from core client for 30 sec - exiting
08:06:20 (13564): No heartbeat from core client for 30 sec - exiting
08:06:21 (13564): No heartbeat from core client for 30 sec - exiting
08:06:22 (13564): No heartbeat from core client for 30 sec - exiting
08:06:23 (13564): No heartbeat from core client for 30 sec - exiting
09:49:00 (15072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:49:02 (15072): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1
Model crash detected, will try to restart...
10:45:54 (6432): No heartbeat from core client for 30 sec - exiting
10:45:55 (6432): No heartbeat from core client for 30 sec - exiting
10:45:56 (6432): No heartbeat from core client for 30 sec - exiting
10:45:57 (6432): No heartbeat from core client for 30 sec - exiting
10:45:59 (6432): No heartbeat from core client for 30 sec - exiting
10:46:00 (6432): No heartbeat from core client for 30 sec - exiting
10:46:01 (6432): No heartbeat from core client for 30 sec - exiting
10:46:02 (6432): No heartbeat from core client for 30 sec - exiting
10:46:03 (6432): No heartbeat from core client for 30 sec - exiting
10:46:04 (6432): No heartbeat from core client for 30 sec - exiting
10:46:05 (6432): No heartbeat from core client for 30 sec - exiting
10:46:06 (6432): No heartbeat from core client for 30 sec - exiting
10:46:07 (6432): No heartbeat from core client for 30 sec - exiting
10:46:08 (6432): No heartbeat from core client for 30 sec - exiting
10:46:09 (6432): No heartbeat from core client for 30 sec - exiting
10:46:11 (6432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:14:52 (8164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=204, iMonCtr=1
Model crash detected, will try to restart...
14:57:34 (2672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:32:31 (3800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:26:28 (5884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
17:21:32 (6172): No heartbeat from core client for 30 sec - exiting
17:21:34 (6172): No heartbeat from core client for 30 sec - exiting
17:21:35 (6172): No heartbeat from core client for 30 sec - exiting
17:21:36 (6172): No heartbeat from core client for 30 sec - exiting
17:21:37 (6172): No heartbeat from core client for 30 sec - exiting
17:21:38 (6172): No heartbeat from core client for 30 sec - exiting
17:21:39 (6172): No heartbeat from core client for 30 sec - exiting
17:21:40 (6172): No heartbeat from core client for 30 sec - exiting
17:21:41 (6172): No heartbeat from core client for 30 sec - exiting
17:21:42 (6172): No heartbeat from core client for 30 sec - exiting
17:21:43 (6172): No heartbeat from core client for 30 sec - exiting
17:21:44 (6172): No heartbeat from core client for 30 sec - exiting
17:21:45 (6172): No heartbeat from core client for 30 sec - exiting
17:21:46 (6172): No heartbeat from core client for 30 sec - exiting
17:21:47 (6172): No heartbeat from core client for 30 sec - exiting
17:21:48 (6172): No heartbeat from core client for 30 sec - exiting
17:21:49 (6172): No heartbeat from core client for 30 sec - exiting
17:21:50 (6172): No heartbeat from core client for 30 sec - exiting
17:21:51 (6172): No heartbeat from core client for 30 sec - exiting
17:21:52 (6172): No heartbeat from core client for 30 sec - exiting
17:21:53 (6172): No heartbeat from core client for 30 sec - exiting
17:21:54 (6172): No heartbeat from core client for 30 sec - exiting
17:21:55 (6172): No heartbeat from core client for 30 sec - exiting
17:21:56 (6172): No heartbeat from core client for 30 sec - exiting
17:21:57 (6172): No heartbeat from core client for 30 sec - exiting
17:21:58 (6172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:21:59 (6172): No heartbeat from core client for 30 sec - exiting
17:57:45 (9340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:57:46 (9340): No heartbeat from core client for 30 sec - exiting
17:57:47 (9340): No heartbeat from core client for 30 sec - exiting
17:57:48 (9340): No heartbeat from core client for 30 sec - exiting
17:57:49 (9340): No heartbeat from core client for 30 sec - exiting
17:57:50 (9340): No heartbeat from core client for 30 sec - exiting
17:57:51 (9340): No heartbeat from core client for 30 sec - exiting
17:57:52 (9340): No heartbeat from core client for 30 sec - exiting
17:57:53 (9340): No heartbeat from core client for 30 sec - exiting
17:57:54 (9340): No heartbeat from core client for 30 sec - exiting
17:57:55 (9340): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
09:48:32 (6760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:48:36 (6760): No heartbeat from core client for 30 sec - exiting
09:48:37 (6760): No heartbeat from core client for 30 sec - exiting
09:48:38 (6760): No heartbeat from core client for 30 sec - exiting
09:48:39 (6760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Dec 2012 09:16:58 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 881,280 1,541,339 1.7490
19 Dec 2012 15:28:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 855,360 1,485,141 1.7363
18 Dec 2012 22:19:39 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 829,440 1,429,198 1.7231
18 Dec 2012 05:31:06 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 803,520 1,373,450 1.7093
17 Dec 2012 12:55:14 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 777,600 1,318,409 1.6955
16 Dec 2012 21:01:09 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 751,680 1,265,098 1.6830
16 Dec 2012 05:33:56 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 725,760 1,212,941 1.6713
15 Dec 2012 12:20:10 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 699,840 1,159,322 1.6566
14 Dec 2012 17:27:50 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 673,920 1,104,567 1.6390
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 648,000 1,049,722 1.6199
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 622,080 995,077 1.5996
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 596,160 940,612 1.5778
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 570,240 886,578 1.5547
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 544,320 831,135 1.5269
14 Dec 2012 04:21:07 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 518,400 776,211 1.4973
05 Dec 2012 21:10:29 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 492,480 737,958 1.4985
05 Dec 2012 08:39:24 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 466,560 698,782 1.4977
04 Dec 2012 21:51:50 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 440,640 659,629 1.4970
04 Dec 2012 09:42:26 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 414,720 623,518 1.5035
03 Dec 2012 23:44:52 1242229 15457376 hadcm3n_zhuq_1880_40_008252546_0 388,800 588,391 1.5134


©2024 cpdn.org