climateprediction.net home page
Task 15608680

Task 15608680

Name hadcm3n_n3gl_1880_40_008286309_4
Workunit 8437444
Created 14 Feb 2013, 23:33:59 UTC
Sent 14 Feb 2013, 23:34:05 UTC
Report deadline 17 May 2013, 7:01:16 UTC
Received 22 Mar 2013, 13:26:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1192595
Run time 27 days 19 hours 19 min 40 sec
CPU time 19 days 0 hours 55 min 8 sec
Validate state Invalid
Credit 9,020.16
Device peak FLOPS 2.09 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:03:02 (2724): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
03:03:05 (2724): No heartbeat from core client for 30 sec - exiting
03:03:07 (2724): No heartbeat from core client for 30 sec - exiting
03:03:08 (2724): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
15:30:01 (1640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:30:02 (1640): No heartbeat from core client for 30 sec - exiting
15:30:03 (1640): No heartbeat from core client for 30 sec - exiting
17:50:03 (3636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:20:09 (1872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:47:52 (4240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:47:53 (4240): No heartbeat from core client for 30 sec - exiting
06:47:54 (4240): No heartbeat from core client for 30 sec - exiting
06:47:55 (4240): No heartbeat from core client for 30 sec - exiting
06:47:56 (4240): No heartbeat from core client for 30 sec - exiting
07:02:53 (5356): No heartbeat from core client for 30 sec - exiting
07:02:54 (5356): No heartbeat from core client for 30 sec - exiting
07:02:55 (5356): No heartbeat from core client for 30 sec - exiting
07:02:56 (5356): No heartbeat from core client for 30 sec - exiting
07:02:57 (5356): No heartbeat from core client for 30 sec - exiting
07:02:58 (5356): No heartbeat from core client for 30 sec - exiting
07:02:59 (5356): No heartbeat from core client for 30 sec - exiting
07:03:00 (5356): No heartbeat from core client for 30 sec - exiting
07:03:01 (5356): No heartbeat from core client for 30 sec - exiting
07:03:02 (5356): No heartbeat from core client for 30 sec - exiting
07:03:03 (5356): No heartbeat from core client for 30 sec - exiting
07:03:04 (5356): No heartbeat from core client for 30 sec - exiting
07:03:05 (5356): No heartbeat from core client for 30 sec - exiting
07:03:06 (5356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:07:56 (10076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:07:58 (10076): No heartbeat from core client for 30 sec - exiting
07:07:59 (10076): No heartbeat from core client for 30 sec - exiting
07:09:28 (8352): No heartbeat from core client for 30 sec - exiting
07:09:29 (8352): No heartbeat from core client for 30 sec - exiting
07:09:30 (8352): No heartbeat from core client for 30 sec - exiting
07:09:31 (8352): No heartbeat from core client for 30 sec - exiting
07:09:32 (8352): No heartbeat from core client for 30 sec - exiting
07:09:33 (8352): No heartbeat from core client for 30 sec - exiting
07:09:34 (8352): No heartbeat from core client for 30 sec - exiting
07:09:35 (8352): No heartbeat from core client for 30 sec - exiting
07:09:37 (8352): No heartbeat from core client for 30 sec - exiting
07:09:38 (8352): No heartbeat from core client for 30 sec - exiting
07:09:39 (8352): No heartbeat from core client for 30 sec - exiting
07:09:40 (8352): No heartbeat from core client for 30 sec - exiting
07:09:41 (8352): No heartbeat from core client for 30 sec - exiting
07:09:42 (8352): No heartbeat from core client for 30 sec - exiting
07:09:43 (8352): No heartbeat from core client for 30 sec - exiting
07:09:44 (8352): No heartbeat from core client for 30 sec - exiting
07:09:45 (8352): No heartbeat from core client for 30 sec - exiting
07:09:46 (8352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:18:47 (3292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:18:50 (3292): No heartbeat from core client for 30 sec - exiting
07:18:51 (3292): No heartbeat from core client for 30 sec - exiting
07:18:52 (3292): No heartbeat from core client for 30 sec - exiting
07:18:54 (3292): No heartbeat from core client for 30 sec - exiting
07:22:05 (3692): No heartbeat from core client for 30 sec - exiting
07:22:06 (3692): No heartbeat from core client for 30 sec - exiting
07:22:07 (3692): No heartbeat from core client for 30 sec - exiting
07:22:09 (3692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:22:10 (3692): No heartbeat from core client for 30 sec - exiting
07:23:48 (376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:28:54 (3640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:08:57 (4952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:08:58 (4952): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:35:39 (3608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:40:25 (2772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:09:04 (6332): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2128, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadcm3n_n3gl_1880_40_008286309_4_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadcm3n_n3gl_1880_40_008286309_4_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Mar 2013 14:20:16 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 751,680 1,633,721 2.1734
19 Mar 2013 12:25:08 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 725,760 1,577,707 2.1739
18 Mar 2013 08:38:40 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 699,840 1,521,552 2.1741
17 Mar 2013 04:23:00 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 673,920 1,464,136 2.1726
16 Mar 2013 00:35:23 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 648,000 1,405,052 2.1683
12 Mar 2013 15:50:39 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 596,160 1,286,648 2.1582
11 Mar 2013 12:09:19 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 570,240 1,228,652 2.1546
08 Mar 2013 21:33:55 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 544,320 1,170,925 2.1512
06 Mar 2013 15:55:39 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 518,400 1,112,965 2.1469
05 Mar 2013 18:50:46 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 492,480 1,056,318 2.1449
04 Mar 2013 21:06:25 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 466,560 1,000,426 2.1443
03 Mar 2013 23:25:36 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 440,640 942,878 2.1398
03 Mar 2013 02:09:17 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 414,720 887,062 2.1389
02 Mar 2013 02:50:40 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 388,800 829,828 2.1343
01 Mar 2013 02:15:55 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 362,880 772,707 2.1294
28 Feb 2013 08:02:34 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 336,960 717,107 2.1282
26 Feb 2013 00:57:38 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 311,040 662,301 2.1293
25 Feb 2013 00:36:33 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 285,120 605,732 2.1245
23 Feb 2013 22:15:52 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 259,200 548,806 2.1173
22 Feb 2013 16:55:52 1192595 15608680 hadcm3n_n3gl_1880_40_008286309_4 233,280 492,492 2.1112


©2024 climateprediction.net