climateprediction.net home page
Task 16044658

Name hadcm3n_oej1_1900_40_008473952_0
Workunit 8624791
Created 27 Sep 2013, 10:25:25 UTC
Sent 28 Sep 2013, 12:44:02 UTC
Report deadline 28 Dec 2013, 20:11:13 UTC
Received 29 Oct 2013, 16:52:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1295275
Run time 9 days 3 hours 53 min 21 sec
CPU time 8 days 1 hours 54 min 59 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 3.38 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
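
The run time, CPU time, and exit status above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original task page: the helper name `duration_to_seconds` and the unit table are illustrative assumptions. It converts the two durations to seconds, estimates the share of wall-clock time spent on the CPU, and confirms that exit status 25 is 0x19 in hexadecimal.

```python
import re

# Seconds per unit, keyed by the unit words used on the task page.
UNIT_SECONDS = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a string like '9 days 3 hours 53 min 21 sec' to seconds.

    Hypothetical helper for illustration only.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


run_time = duration_to_seconds("9 days 3 hours 53 min 21 sec")
cpu_time = duration_to_seconds("8 days 1 hours 54 min 59 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time

# The exit status is reported both in decimal (25) and in hex (0x19).
exit_status = 25

print(run_time, cpu_time, round(cpu_fraction, 3), hex(exit_status))
```

On these figures the task spent roughly 88% of its run time on the CPU, which is consistent with a job that was repeatedly suspended and restarted.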
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk.
 (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:00:42 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:21 (3804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:15:03 (2116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:19:04 (6092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:26:27 (4424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:38:39 (6020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:53:53 (1280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:08:44 (2872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4364, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5820, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3340, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3324, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3324, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3324, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2880, iMonCtr=1
Model crash detected, will try to restart...
08:04:05 (3420): No heartbeat from core client for 30 sec - exiting
08:04:06 (3420): No heartbeat from core client for 30 sec - exiting
08:04:07 (3420): No heartbeat from core client for 30 sec - exiting
08:04:08 (3420): No heartbeat from core client for 30 sec - exiting
08:04:09 (3420): No heartbeat from core client for 30 sec - exiting
08:04:10 (3420): No heartbeat from core client for 30 sec - exiting
08:04:11 (3420): No heartbeat from core client for 30 sec - exiting
08:04:12 (3420): No heartbeat from core client for 30 sec - exiting
08:04:13 (3420): No heartbeat from core client for 30 sec - exiting
08:04:15 (3420): No heartbeat from core client for 30 sec - exiting
08:04:16 (3420): No heartbeat from core client for 30 sec - exiting
08:04:17 (3420): No heartbeat from core client for 30 sec - exiting
08:04:18 (3420): No heartbeat from core client for 30 sec - exiting
08:04:19 (3420): No heartbeat from core client for 30 sec - exiting
08:04:20 (3420): No heartbeat from core client for 30 sec - exiting
08:04:21 (3420): No heartbeat from core client for 30 sec - exiting
08:04:22 (3420): No heartbeat from core client for 30 sec - exiting
08:04:23 (3420): No heartbeat from core client for 30 sec - exiting
08:04:24 (3420): No heartbeat from core client for 30 sec - exiting
08:04:25 (3420): No heartbeat from core client for 30 sec - exiting
08:04:27 (3420): No heartbeat from core client for 30 sec - exiting
08:04:28 (3420): No heartbeat from core client for 30 sec - exiting
08:04:29 (3420): No heartbeat from core client for 30 sec - exiting
08:04:30 (3420): No heartbeat from core client for 30 sec - exiting
08:04:31 (3420): No heartbeat from core client for 30 sec - exiting
08:04:32 (3420): No heartbeat from core client for 30 sec - exiting
08:04:33 (3420): No heartbeat from core client for 30 sec - exiting
08:04:34 (3420): No heartbeat from core client for 30 sec - exiting
08:04:35 (3420): No heartbeat from core client for 30 sec - exiting
08:04:36 (3420): No heartbeat from core client for 30 sec - exiting
08:04:37 (3420): No heartbeat from core client for 30 sec - exiting
08:04:39 (3420): No heartbeat from core client for 30 sec - exiting
08:04:40 (3420): No heartbeat from core client for 30 sec - exiting
08:04:41 (3420): No heartbeat from core client for 30 sec - exiting
08:04:42 (3420): No heartbeat from core client for 30 sec - exiting
08:04:43 (3420): No heartbeat from core client for 30 sec - exiting
08:04:44 (3420): No heartbeat from core client for 30 sec - exiting
08:04:45 (3420): No heartbeat from core client for 30 sec - exiting
08:04:46 (3420): No heartbeat from core client for 30 sec - exiting
08:04:47 (3420): No heartbeat from core client for 30 sec - exiting
08:04:48 (3420): No heartbeat from core client for 30 sec - exiting
08:04:50 (3420): No heartbeat from core client for 30 sec - exiting
08:04:51 (3420): No heartbeat from core client for 30 sec - exiting
08:04:52 (3420): No heartbeat from core client for 30 sec - exiting
08:04:53 (3420): No heartbeat from core client for 30 sec - exiting
08:04:54 (3420): No heartbeat from core client for 30 sec - exiting
08:04:55 (3420): No heartbeat from core client for 30 sec - exiting
08:04:56 (3420): No heartbeat from core client for 30 sec - exiting
08:04:57 (3420): No heartbeat from core client for 30 sec - exiting
08:04:58 (3420): No heartbeat from core client for 30 sec - exiting
08:04:59 (3420): No heartbeat from core client for 30 sec - exiting
08:05:00 (3420): No heartbeat from core client for 30 sec - exiting
08:05:02 (3420): No heartbeat from core client for 30 sec - exiting
08:05:03 (3420): No heartbeat from core client for 30 sec - exiting
08:05:04 (3420): No heartbeat from core client for 30 sec - exiting
08:05:05 (3420): No heartbeat from core client for 30 sec - exiting
08:05:06 (3420): No heartbeat from core client for 30 sec - exiting
08:05:07 (3420): No heartbeat from core client for 30 sec - exiting
08:05:08 (3420): No heartbeat from core client for 30 sec - exiting
08:05:09 (3420): No heartbeat from core client for 30 sec - exiting
08:05:10 (3420): No heartbeat from core client for 30 sec - exiting
08:05:11 (3420): No heartbeat from core client for 30 sec - exiting
08:05:12 (3420): No heartbeat from core client for 30 sec - exiting
08:05:13 (3420): No heartbeat from core client for 30 sec - exiting
08:05:14 (3420): No heartbeat from core client for 30 sec - exiting
08:05:16 (3420): No heartbeat from core client for 30 sec - exiting
08:05:17 (3420): No heartbeat from core client for 30 sec - exiting
08:05:18 (3420): No heartbeat from core client for 30 sec - exiting
08:05:19 (3420): No heartbeat from core client for 30 sec - exiting
08:05:20 (3420): No heartbeat from core client for 30 sec - exiting
08:05:21 (3420): No heartbeat from core client for 30 sec - exiting
08:05:22 (3420): No heartbeat from core client for 30 sec - exiting
08:05:23 (3420): No heartbeat from core client for 30 sec - exiting
08:05:24 (3420): No heartbeat from core client for 30 sec - exiting
08:05:25 (3420): No heartbeat from core client for 30 sec - exiting
08:05:26 (3420): No heartbeat from core client for 30 sec - exiting
08:05:27 (3420): No heartbeat from core client for 30 sec - exiting
08:05:28 (3420): No heartbeat from core client for 30 sec - exiting
08:05:29 (3420): No heartbeat from core client for 30 sec - exiting
08:05:30 (3420): No heartbeat from core client for 30 sec - exiting
08:05:31 (3420): No heartbeat from core client for 30 sec - exiting
08:05:32 (3420): No heartbeat from core client for 30 sec - exiting
08:05:33 (3420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
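
A long stderr block like the one above is easier to summarize by counting each kind of event. The sketch below is an assumption about how one might do that and is not a BOINC or CPDN tool: the function `tally_events` and the `PATTERNS` table are hypothetical names, and the sample text is a short excerpt of lines copied from the log rather than the full output.

```python
from collections import Counter

# Substrings that identify each event type in the stderr text.
PATTERNS = {
    "suspend": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how many lines match each pattern (illustrative helper)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, needle in PATTERNS.items():
            if needle in line:
                counts[name] += 1
    return counts


# Short excerpt of lines taken from the log above, not the whole log.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
03:00:42 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
"""

counts = tally_events(sample)
print(dict(counts))
```

Pasting the full contents of the stderr section into `sample` would give totals for the whole task.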
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
27 Oct 2013 16:14:04 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 414,720 | 685,722 | 1.6535
27 Oct 2013 03:06:11 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 388,800 | 641,904 | 1.6510
26 Oct 2013 14:49:00 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 362,880 | 599,336 | 1.6516
25 Oct 2013 09:13:36 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 336,960 | 555,938 | 1.6499
21 Oct 2013 14:22:20 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 311,040 | 513,220 | 1.6500
19 Oct 2013 11:46:16 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 285,120 | 471,597 | 1.6540
18 Oct 2013 10:25:46 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 259,200 | 429,365 | 1.6565
17 Oct 2013 12:00:21 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 233,280 | 387,535 | 1.6612
14 Oct 2013 09:37:20 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 207,360 | 344,739 | 1.6625
13 Oct 2013 05:18:04 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 181,440 | 302,358 | 1.6664
12 Oct 2013 16:21:21 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 155,520 | 259,619 | 1.6694
12 Oct 2013 08:04:29 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 129,600 | 215,985 | 1.6666
07 Oct 2013 14:26:11 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 103,680 | 173,422 | 1.6727
05 Oct 2013 14:15:09 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 77,760 | 129,679 | 1.6677
04 Oct 2013 06:09:58 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 51,840 | 85,817 | 1.6554
02 Oct 2013 15:55:54 | 1295275 | 16044658 | hadcm3n_oej1_1900_40_008473952_0 | 25,920 | 42,186 | 1.6275
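
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The variable names are illustrative, and the interval of 25,920 timesteps between trickles is inferred from the table rather than stated by the project.

```python
# (timestep, cpu_time_sec, reported_average) for three rows of the table.
trickles = [
    (414_720, 685_722, 1.6535),
    (388_800, 641_904, 1.6510),
    (25_920, 42_186, 1.6275),
]

# Assumed trickle interval, inferred from the spacing of the Timestep column.
TRICKLE_INTERVAL = 25_920

computed = []
for timestep, cpu_time_sec, reported in trickles:
    average = cpu_time_sec / timestep  # seconds of CPU time per timestep
    computed.append(average)
    # Each timestep in the table is a whole multiple of the interval.
    index = timestep // TRICKLE_INTERVAL
    print(f"trickle {index:2d}: computed {average:.4f}, reported {reported:.4f}")
```

For each of these rows the computed value agrees with the reported average to four decimal places.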

