climateprediction.net home page
Task 15841638

Task 15841638

Name hadcm3n_3jbj_1940_40_008259434_2
Workunit 8414558
Created 13 Jun 2013, 15:37:45 UTC
Sent 13 Jun 2013, 17:25:49 UTC
Report deadline 13 Sep 2013, 0:53:00 UTC
Received 29 Jun 2013, 8:51:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1230144
Run time 14 days 8 hours 37 min 49 sec
CPU time 13 days 1 hours 48 min 35 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 2.20 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5052, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
00:57:43 (2268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:48:22 (4596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:02:50 (5084): No heartbeat from core client for 30 sec - exiting
10:02:51 (5084): No heartbeat from core client for 30 sec - exiting
10:02:52 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:35:46 (4176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:44:25 (4828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:05:04 (4504): No heartbeat from core client for 30 sec - exiting
05:05:05 (4504): No heartbeat from core client for 30 sec - exiting
05:05:06 (4504): No heartbeat from core client for 30 sec - exiting
05:05:07 (4504): No heartbeat from core client for 30 sec - exiting
05:05:08 (4504): No heartbeat from core client for 30 sec - exiting
05:05:09 (4504): No heartbeat from core client for 30 sec - exiting
05:05:10 (4504): No heartbeat from core client for 30 sec - exiting
05:05:11 (4504): No heartbeat from core client for 30 sec - exiting
05:05:12 (4504): No heartbeat from core client for 30 sec - exiting
05:05:13 (4504): No heartbeat from core client for 30 sec - exiting
05:05:15 (4504): No heartbeat from core client for 30 sec - exiting
05:05:16 (4504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3752, iMonCtr=1
Model crash detected, will try to restart...
05:07:38 (3428): No heartbeat from core client for 30 sec - exiting
05:07:40 (3428): No heartbeat from core client for 30 sec - exiting
05:07:41 (3428): No heartbeat from core client for 30 sec - exiting
05:07:42 (3428): No heartbeat from core client for 30 sec - exiting
05:07:43 (3428): No heartbeat from core client for 30 sec - exiting
05:07:44 (3428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:12:42 (4472): No heartbeat from core client for 30 sec - exiting
05:12:44 (4472): No heartbeat from core client for 30 sec - exiting
05:12:45 (4472): No heartbeat from core client for 30 sec - exiting
05:12:46 (4472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:04:58 (4372): No heartbeat from core client for 30 sec - exiting
05:04:59 (4372): No heartbeat from core client for 30 sec - exiting
05:05:00 (4372): No heartbeat from core client for 30 sec - exiting
05:05:01 (4372): No heartbeat from core client for 30 sec - exiting
05:05:02 (4372): No heartbeat from core client for 30 sec - exiting
05:05:03 (4372): No heartbeat from core client for 30 sec - exiting
05:05:05 (4372): No heartbeat from core client for 30 sec - exiting
05:05:06 (4372): No heartbeat from core client for 30 sec - exiting
05:05:07 (4372): No heartbeat from core client for 30 sec - exiting
05:05:08 (4372): No heartbeat from core client for 30 sec - exiting
05:05:09 (4372): No heartbeat from core client for 30 sec - exiting
05:05:10 (4372): No heartbeat from core client for 30 sec - exiting
05:05:11 (4372): No heartbeat from core client for 30 sec - exiting
05:05:12 (4372): No heartbeat from core client for 30 sec - exiting
05:05:13 (4372): No heartbeat from core client for 30 sec - exiting
05:05:14 (4372): No heartbeat from core client for 30 sec - exiting
05:05:15 (4372): No heartbeat from core client for 30 sec - exiting
05:05:16 (4372): No heartbeat from core client for 30 sec - exiting
05:05:17 (4372): No heartbeat from core client for 30 sec - exiting
05:05:18 (4372): No heartbeat from core client for 30 sec - exiting
05:05:19 (4372): No heartbeat from core client for 30 sec - exiting
05:05:20 (4372): No heartbeat from core client for 30 sec - exiting
05:05:21 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:29:44 (5240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:21:32 (3272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:27:01 (3200): No heartbeat from core client for 30 sec - exiting
07:27:02 (3200): No heartbeat from core client for 30 sec - exiting
07:27:03 (3200): No heartbeat from core client for 30 sec - exiting
07:27:05 (3200): No heartbeat from core client for 30 sec - exiting
07:27:06 (3200): No heartbeat from core client for 30 sec - exiting
07:27:07 (3200): No heartbeat from core client for 30 sec - exiting
07:27:08 (3200): No heartbeat from core client for 30 sec - exiting
07:27:09 (3200): No heartbeat from core client for 30 sec - exiting
07:27:10 (3200): No heartbeat from core client for 30 sec - exiting
07:27:11 (3200): No heartbeat from core client for 30 sec - exiting
07:27:12 (3200): No heartbeat from core client for 30 sec - exiting
07:27:13 (3200): No heartbeat from core client for 30 sec - exiting
07:27:14 (3200): No heartbeat from core client for 30 sec - exiting
07:27:16 (3200): No heartbeat from core client for 30 sec - exiting
07:27:17 (3200): No heartbeat from core client for 30 sec - exiting
07:27:18 (3200): No heartbeat from core client for 30 sec - exiting
07:27:19 (3200): No heartbeat from core client for 30 sec - exiting
07:27:20 (3200): No heartbeat from core client for 30 sec - exiting
07:27:21 (3200): No heartbeat from core client for 30 sec - exiting
07:27:22 (3200): No heartbeat from core client for 30 sec - exiting
07:27:23 (3200): No heartbeat from core client for 30 sec - exiting
07:27:24 (3200): No heartbeat from core client for 30 sec - exiting
07:27:25 (3200): No heartbeat from core client for 30 sec - exiting
07:27:26 (3200): No heartbeat from core client for 30 sec - exiting
07:27:28 (3200): No heartbeat from core client for 30 sec - exiting
07:27:29 (3200): No heartbeat from core client for 30 sec - exiting
07:27:30 (3200): No heartbeat from core client for 30 sec - exiting
07:27:31 (3200): No heartbeat from core client for 30 sec - exiting
07:27:32 (3200): No heartbeat from core client for 30 sec - exiting
07:27:33 (3200): No heartbeat from core client for 30 sec - exiting
07:27:34 (3200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:19:31 (4348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:09:52 (4120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:32:39 (2696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:35:48 (7784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:47:02 (7276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:16:58 (4512): No heartbeat from core client for 30 sec - exiting
05:16:59 (4512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:17:34 (3852): No heartbeat from core client for 30 sec - exiting
05:17:35 (3852): No heartbeat from core client for 30 sec - exiting
05:17:36 (3852): No heartbeat from core client for 30 sec - exiting
05:17:38 (3852): No heartbeat from core client for 30 sec - exiting
05:17:39 (3852): No heartbeat from core client for 30 sec - exiting
05:17:40 (3852): No heartbeat from core client for 30 sec - exiting
05:17:41 (3852): No heartbeat from core client for 30 sec - exiting
05:17:42 (3852): No heartbeat from core client for 30 sec - exiting
05:17:43 (3852): No heartbeat from core client for 30 sec - exiting
05:17:44 (3852): No heartbeat from core client for 30 sec - exiting
05:17:45 (3852): No heartbeat from core client for 30 sec - exiting
05:17:46 (3852): No heartbeat from core client for 30 sec - exiting
05:17:47 (3852): No heartbeat from core client for 30 sec - exiting
05:17:48 (3852): No heartbeat from core client for 30 sec - exiting
05:17:50 (3852): No heartbeat from core client for 30 sec - exiting
05:17:51 (3852): No heartbeat from core client for 30 sec - exiting
05:17:52 (3852): No heartbeat from core client for 30 sec - exiting
05:17:53 (3852): No heartbeat from core client for 30 sec - exiting
05:17:54 (3852): No heartbeat from core client for 30 sec - exiting
05:17:55 (3852): No heartbeat from core client for 30 sec - exiting
05:17:56 (3852): No heartbeat from core client for 30 sec - exiting
05:17:57 (3852): No heartbeat from core client for 30 sec - exiting
05:17:58 (3852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:37:23 (388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:37:24 (388): No heartbeat from core client for 30 sec - exiting
00:37:25 (388): No heartbeat from core client for 30 sec - exiting
00:37:26 (388): No heartbeat from core client for 30 sec - exiting
00:37:27 (388): No heartbeat from core client for 30 sec - exiting
00:38:07 (1984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:22:41 (4428): No heartbeat from core client for 30 sec - exiting
05:22:42 (4428): No heartbeat from core client for 30 sec - exiting
05:22:43 (4428): No heartbeat from core client for 30 sec - exiting
05:22:44 (4428): No heartbeat from core client for 30 sec - exiting
05:22:45 (4428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
04:58:54 (4820): No heartbeat from core client for 30 sec - exiting
04:58:55 (4820): No heartbeat from core client for 30 sec - exiting
04:58:56 (4820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jul 2013 09:51:17 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 673,920 1,141,575 1.6939
28 Jun 2013 08:53:37 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 648,000 1,097,160 1.6931
27 Jun 2013 20:04:56 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 622,080 1,052,458 1.6918
27 Jun 2013 05:32:36 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 596,160 1,008,771 1.6921
26 Jun 2013 16:23:40 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 570,240 965,374 1.6929
26 Jun 2013 01:30:40 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 544,320 921,136 1.6923
25 Jun 2013 11:37:28 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 518,400 877,532 1.6928
24 Jun 2013 21:34:59 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 492,480 832,841 1.6911
24 Jun 2013 06:53:59 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 466,560 788,512 1.6901
23 Jun 2013 19:05:20 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 440,640 744,663 1.6900
23 Jun 2013 03:54:38 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 414,720 700,223 1.6884
22 Jun 2013 14:58:56 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 388,800 656,314 1.6881
21 Jun 2013 23:39:38 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 362,880 612,060 1.6867
21 Jun 2013 08:23:12 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 336,960 567,692 1.6847
20 Jun 2013 19:34:13 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 311,040 524,328 1.6857
20 Jun 2013 05:16:27 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 285,120 480,202 1.6842
19 Jun 2013 14:59:10 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 259,200 436,214 1.6829
18 Jun 2013 23:47:02 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 233,280 391,810 1.6796
18 Jun 2013 09:34:25 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 207,360 347,905 1.6778
17 Jun 2013 20:25:50 1230144 15841638 hadcm3n_3jbj_1940_40_008259434_2 181,440 303,337 1.6718


©2024 climateprediction.net