climateprediction.net home page
Task 15775856

Task 15775856

Name hadcm3n_4guo_2020_40_008365793_0
Workunit 8516652
Created 11 May 2013, 2:38:25 UTC
Sent 11 May 2013, 2:48:20 UTC
Report deadline 10 Aug 2013, 10:15:31 UTC
Received 26 May 2013, 17:32:22 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1267713
Run time 14 days 19 hours 58 min
CPU time 12 days 20 hours 15 min 48 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 2.97 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
12:37:37 (12336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:37:38 (12336): No heartbeat from core client for 30 sec - exiting
23:10:41 (42836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:10:42 (42836): No heartbeat from core client for 30 sec - exiting
23:10:43 (42836): No heartbeat from core client for 30 sec - exiting
23:10:44 (42836): No heartbeat from core client for 30 sec - exiting
23:10:45 (42836): No heartbeat from core client for 30 sec - exiting
23:10:46 (42836): No heartbeat from core client for 30 sec - exiting
23:10:47 (42836): No heartbeat from core client for 30 sec - exiting
23:11:40 (63716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:15:41 (43872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:15:43 (43872): No heartbeat from core client for 30 sec - exiting
23:23:21 (64436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:13:05 (42432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:13:07 (42432): No heartbeat from core client for 30 sec - exiting
02:13:08 (42432): No heartbeat from core client for 30 sec - exiting
02:13:09 (42432): No heartbeat from core client for 30 sec - exiting
02:14:06 (66208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:18:04 (42872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:20:49 (69264): No heartbeat from core client for 30 sec - exiting
02:20:50 (69264): No heartbeat from core client for 30 sec - exiting
02:20:51 (69264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:20:52 (69264): No heartbeat from core client for 30 sec - exiting
02:20:53 (69264): No heartbeat from core client for 30 sec - exiting
02:24:17 (69524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:24:21 (69524): No heartbeat from core client for 30 sec - exiting
02:24:22 (69524): No heartbeat from core client for 30 sec - exiting
02:24:23 (69524): No heartbeat from core client for 30 sec - exiting
02:24:24 (69524): No heartbeat from core client for 30 sec - exiting
03:05:02 (67788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:14:13 (5736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:14:15 (5736): No heartbeat from core client for 30 sec - exiting
00:14:16 (5736): No heartbeat from core client for 30 sec - exiting
00:15:06 (41992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:19:14 (43724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:19:16 (43724): No heartbeat from core client for 30 sec - exiting
00:22:23 (43020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:33:47 (43048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:45:52 (44380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:45:53 (44380): No heartbeat from core client for 30 sec - exiting
01:45:54 (44380): No heartbeat from core client for 30 sec - exiting
01:45:55 (44380): No heartbeat from core client for 30 sec - exiting
01:45:56 (44380): No heartbeat from core client for 30 sec - exiting
01:45:57 (44380): No heartbeat from core client for 30 sec - exiting
01:45:58 (44380): No heartbeat from core client for 30 sec - exiting
01:46:58 (60696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:54 (65272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:55 (65272): No heartbeat from core client for 30 sec - exiting
02:00:56 (65272): No heartbeat from core client for 30 sec - exiting
02:01:48 (66812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:47:41 (5208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:01:33 (5860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:18:28 (37892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:18:29 (37892): No heartbeat from core client for 30 sec - exiting
15:00:58 (66544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:00:59 (66544): No heartbeat from core client for 30 sec - exiting
15:01:00 (66544): No heartbeat from core client for 30 sec - exiting
15:01:01 (66544): No heartbeat from core client for 30 sec - exiting
15:01:02 (66544): No heartbeat from core client for 30 sec - exiting
15:01:41 (20472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:22:08 (70600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:49:50 (60388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:43:25 (13376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:43:26 (13376): No heartbeat from core client for 30 sec - exiting
17:56:53 (1044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:13:25 (5376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:13:26 (5376): No heartbeat from core client for 30 sec - exiting
22:14:54 (12868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:14:56 (12868): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 May 2013 10:20:37 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 673,920 1,087,932 1.6143
25 May 2013 22:21:56 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 648,000 1,046,587 1.6151
25 May 2013 09:22:20 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 622,080 1,004,830 1.6153
24 May 2013 20:41:59 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 596,160 962,997 1.6153
24 May 2013 08:02:40 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 570,240 919,555 1.6126
23 May 2013 11:54:08 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 544,320 876,956 1.6111
22 May 2013 22:13:49 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 518,400 835,141 1.6110
22 May 2013 09:13:31 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 492,480 793,331 1.6109
21 May 2013 18:44:05 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 466,560 751,190 1.6101
21 May 2013 06:17:06 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 440,640 710,171 1.6117
20 May 2013 17:41:41 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 414,720 668,638 1.6123
20 May 2013 04:29:28 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 388,800 627,913 1.6150
19 May 2013 15:34:28 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 362,880 585,671 1.6140
18 May 2013 22:53:25 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 336,960 543,072 1.6117
18 May 2013 02:41:19 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 311,040 502,646 1.6160
17 May 2013 14:27:33 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 285,120 460,994 1.6168
16 May 2013 23:41:44 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 259,200 419,352 1.6179
16 May 2013 09:57:43 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 233,280 377,316 1.6174
15 May 2013 21:01:58 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 207,360 336,212 1.6214
15 May 2013 09:20:44 1267713 15775856 hadcm3n_4guo_2020_40_008365793_0 181,440 295,560 1.6290


©2024 cpdn.org