climateprediction.net home page
Task 12734315

Task 12734315

Name hadcm3n_o0yd_1900_40_007196568_0
Workunit 7394848
Created 28 Mar 2011, 13:58:41 UTC
Sent 2 Apr 2011, 10:21:58 UTC
Report deadline 2 Jul 2011, 17:49:09 UTC
Received 28 Apr 2011, 7:26:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1135600
Run time 9 days 12 hours 9 min 54 sec
CPU time 8 days 21 hours 15 min 17 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 3.29 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
10:57:32 (1728): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
10:57:39 (1728): No heartbeat from core client for 30 sec - exiting
10:57:40 (1728): No heartbeat from core client for 30 sec - exiting
10:57:41 (1728): No heartbeat from core client for 30 sec - exiting
10:57:42 (1728): No heartbeat from core client for 30 sec - exiting
10:57:43 (1728): No heartbeat from core client for 30 sec - exiting
10:57:44 (1728): No heartbeat from core client for 30 sec - exiting
10:57:45 (1728): No heartbeat from core client for 30 sec - exiting
10:57:46 (1728): No heartbeat from core client for 30 sec - exiting
10:57:47 (1728): No heartbeat from core client for 30 sec - exiting
10:57:49 (1728): No heartbeat from core client for 30 sec - exiting
10:57:50 (1728): No heartbeat from core client for 30 sec - exiting
10:57:51 (1728): No heartbeat from core client for 30 sec - exiting
10:57:52 (1728): No heartbeat from core client for 30 sec - exiting
10:57:53 (1728): No heartbeat from core client for 30 sec - exiting
10:57:54 (1728): No heartbeat from core client for 30 sec - exiting
10:57:55 (1728): No heartbeat from core client for 30 sec - exiting
10:57:56 (1728): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:38:23 (1800): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:00:31 (1696): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
01:00:34 (1696): No heartbeat from core client for 30 sec - exiting
01:00:35 (1696): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:44:07 (1860): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
19:44:09 (1860): No heartbeat from core client for 30 sec - exiting
19:44:10 (1860): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:18:33 (1072): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
22:18:35 (1072): No heartbeat from core client for 30 sec - exiting
22:18:36 (1072): No heartbeat from core client for 30 sec - exiting
22:18:37 (1072): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:16:33 (3332): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
05:16:34 (3332): No heartbeat from core client for 30 sec - exiting
05:16:35 (3332): No heartbeat from core client for 30 sec - exiting
05:16:36 (3332): No heartbeat from core client for 30 sec - exiting
05:16:37 (3332): No heartbeat from core client for 30 sec - exiting
05:16:38 (3332): No heartbeat from core client for 30 sec - exiting
05:16:39 (3332): No heartbeat from core client for 30 sec - exiting
05:16:40 (3332): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:40:23 (2680): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:25:18 (1160): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Apr 2011 01:12:58 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 466,560 747,820 1.6028
27 Apr 2011 14:52:58 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 440,640 712,751 1.6175
27 Apr 2011 03:55:01 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 414,720 677,649 1.6340
26 Apr 2011 17:31:41 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 388,800 642,952 1.6537
26 Apr 2011 06:21:24 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 362,880 608,604 1.6771
25 Apr 2011 20:19:23 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 336,960 574,648 1.7054
25 Apr 2011 06:23:40 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 311,040 539,928 1.7359
24 Apr 2011 20:30:35 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 285,120 506,018 1.7748
24 Apr 2011 10:34:30 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 259,200 473,759 1.8278
23 Apr 2011 23:35:46 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 233,280 439,794 1.8853
23 Apr 2011 13:21:37 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 207,360 405,603 1.9560
23 Apr 2011 03:09:04 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 181,440 371,697 2.0486
22 Apr 2011 17:29:19 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 155,520 337,717 2.1715
22 Apr 2011 06:37:41 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 129,600 302,968 2.3377
21 Apr 2011 20:33:41 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 103,680 269,063 2.5951
20 Apr 2011 15:42:07 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 77,760 103,250 1.3278
20 Apr 2011 15:42:07 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 51,840 69,081 1.3326
20 Apr 2011 15:42:07 1135600 12734315 hadcm3n_o0yd_1900_40_007196568_0 25,920 34,895 1.3463


©2024 climateprediction.net