climateprediction.net home page
Task 13119456

Name hadcm3n_yj8m_1900_40_007357792_1
Workunit 7555222
Created 6 Jul 2011, 14:55:22 UTC
Sent 8 Jul 2011, 21:23:50 UTC
Report deadline 8 Oct 2011, 4:51:01 UTC
Received 16 Sep 2011, 7:40:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1157438
Run time 26 days 18 hours 22 min 29 sec
CPU time 25 days 19 hours 31 min 35 sec
Validate state Invalid
Credit 9,020.16
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4732, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
01:07:16 (2896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:51:51 (2888): No heartbeat from core client for 30 sec - exiting
12:51:52 (2888): No heartbeat from core client for 30 sec - exiting
12:51:53 (2888): No heartbeat from core client for 30 sec - exiting
12:51:54 (2888): No heartbeat from core client for 30 sec - exiting
12:51:55 (2888): No heartbeat from core client for 30 sec - exiting
12:51:56 (2888): No heartbeat from core client for 30 sec - exiting
12:51:57 (2888): No heartbeat from core client for 30 sec - exiting
12:51:58 (2888): No heartbeat from core client for 30 sec - exiting
12:51:59 (2888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:28:42 (3708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:44:09 (2024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
05:05:17 (3208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2796, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:37:56 (5596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2692, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4020, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=1
Model crash detected, will try to restart...
23:48:30 (3368): No heartbeat from core client for 30 sec - exiting
23:48:31 (3368): No heartbeat from core client for 30 sec - exiting
23:48:33 (3368): No heartbeat from core client for 30 sec - exiting
23:48:34 (3368): No heartbeat from core client for 30 sec - exiting
23:48:35 (3368): No heartbeat from core client for 30 sec - exiting
23:48:36 (3368): No heartbeat from core client for 30 sec - exiting
23:48:37 (3368): No heartbeat from core client for 30 sec - exiting
23:48:38 (3368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1916, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3708, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2572, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
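The stderr output above is long and repetitive, so a tally by message type may make it easier to read. The following is a minimal sketch, not part of the original page: the category labels and the `tally` helper are hypothetical names, and the patterns are simply substrings taken from the messages shown in the log.

```python
import re
from collections import Counter

# Hypothetical category labels; each pattern is a substring copied from the log above.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
}

def tally(stderr_text):
    """Count log lines per category; lines matching no pattern are ignored."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break  # a line is counted under at most one category
    return counts

# A short excerpt of the log above, used only to exercise the function.
sample = """\
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4732, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
01:07:16 (2896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
"""

for label, count in sorted(tally(sample).items()):
    print(label, count)
```

Pasting the full contents of the stderr_txt block into `tally` would give the per-category totals for this task. Each "Model crash detected" line corresponds to one restart attempt by the controller.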
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
08 Sep 2011 23:27:35 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 751,680 | 2,160,820 | 2.8747
01 Sep 2011 07:59:20 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 725,760 | 2,048,529 | 2.8226
31 Aug 2011 03:13:22 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 699,840 | 1,947,450 | 2.7827
29 Aug 2011 20:56:28 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 673,920 | 1,843,920 | 2.7361
28 Aug 2011 16:53:47 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 648,000 | 1,742,981 | 2.6898
27 Aug 2011 00:11:48 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 622,080 | 1,665,379 | 2.6771
23 Aug 2011 13:46:59 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 596,160 | 1,574,345 | 2.6408
20 Aug 2011 22:36:09 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 570,240 | 1,500,411 | 2.6312
17 Aug 2011 20:48:41 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 544,320 | 1,417,646 | 2.6044
09 Aug 2011 18:28:26 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 518,400 | 1,334,198 | 2.5737
09 Aug 2011 04:57:29 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 492,480 | 1,254,030 | 2.5464
09 Aug 2011 04:57:29 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 466,560 | 1,174,614 | 2.5176
07 Aug 2011 00:59:49 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 440,640 | 1,099,941 | 2.4962
04 Aug 2011 11:26:06 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 414,720 | 1,027,538 | 2.4777
03 Aug 2011 09:03:32 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 388,800 | 956,635 | 2.4605
27 Jul 2011 16:20:12 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 362,880 | 886,671 | 2.4434
26 Jul 2011 23:26:41 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 336,960 | 828,939 | 2.4601
26 Jul 2011 08:22:17 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 311,040 | 774,264 | 2.4893
25 Jul 2011 23:00:19 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 285,120 | 716,344 | 2.5124
25 Jul 2011 21:58:02 | 1157438 | 13119456 | hadcm3n_yj8m_1900_40_007357792_1 | 259,200 | 658,413 | 2.5402
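
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against the first and last rows of the table. The function name `seconds_per_timestep` is hypothetical, and the formula is inferred from the figures, not confirmed by the project.

```python
def seconds_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)

# (timestep, CPU time in seconds, reported average) copied from the table above.
rows = [
    (751_680, 2_160_820, 2.8747),  # 08 Sep 2011 trickle
    (259_200, 658_413, 2.5402),    # 25 Jul 2011 trickle
]

for timestep, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    print(timestep, computed, reported)
```

Under this reading, the rise from about 2.54 to about 2.87 sec/TS means the host spent more CPU time per timestep as the run went on. Consecutive rows in the table also differ by 25,920 timesteps, which suggests one trickle per fixed block of timesteps.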


©2024 cpdn.org