Task 15605551

Name hadcm3n_zdj0_1880_40_008250336_4
Workunit 8405460
Created 12 Feb 2013, 2:01:56 UTC
Sent 12 Feb 2013, 2:02:19 UTC
Report deadline 14 May 2013, 9:29:30 UTC
Received 29 Mar 2013, 22:02:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1228594
Run time 10 days 12 hours 5 min 17 sec
CPU time 10 days 1 hours 5 min 50 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 3.67 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
14:35:02 (2808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2984, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:36:15 (3932): No heartbeat from core client for 30 sec - exiting
03:36:16 (3932): No heartbeat from core client for 30 sec - exiting
03:36:18 (3932): No heartbeat from core client for 30 sec - exiting
03:36:19 (3932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:36:20 (3932): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:20:27 (2088): No heartbeat from core client for 30 sec - exiting
03:20:28 (2088): No heartbeat from core client for 30 sec - exiting
03:20:29 (2088): No heartbeat from core client for 30 sec - exiting
03:20:30 (2088): No heartbeat from core client for 30 sec - exiting
03:20:31 (2088): No heartbeat from core client for 30 sec - exiting
03:20:32 (2088): No heartbeat from core client for 30 sec - exiting
03:20:33 (2088): No heartbeat from core client for 30 sec - exiting
03:20:35 (2088): No heartbeat from core client for 30 sec - exiting
03:20:36 (2088): No heartbeat from core client for 30 sec - exiting
03:20:37 (2088): No heartbeat from core client for 30 sec - exiting
03:20:38 (2088): No heartbeat from core client for 30 sec - exiting
03:20:39 (2088): No heartbeat from core client for 30 sec - exiting
03:20:40 (2088): No heartbeat from core client for 30 sec - exiting
03:20:41 (2088): No heartbeat from core client for 30 sec - exiting
03:20:42 (2088): No heartbeat from core client for 30 sec - exiting
03:20:43 (2088): No heartbeat from core client for 30 sec - exiting
03:20:44 (2088): No heartbeat from core client for 30 sec - exiting
03:20:45 (2088): No heartbeat from core client for 30 sec - exiting
03:20:47 (2088): No heartbeat from core client for 30 sec - exiting
03:20:48 (2088): No heartbeat from core client for 30 sec - exiting
03:20:49 (2088): No heartbeat from core client for 30 sec - exiting
03:20:50 (2088): No heartbeat from core client for 30 sec - exiting
03:20:51 (2088): No heartbeat from core client for 30 sec - exiting
03:20:52 (2088): No heartbeat from core client for 30 sec - exiting
03:20:53 (2088): No heartbeat from core client for 30 sec - exiting
03:20:54 (2088): No heartbeat from core client for 30 sec - exiting
03:20:55 (2088): No heartbeat from core client for 30 sec - exiting
03:20:56 (2088): No heartbeat from core client for 30 sec - exiting
03:20:58 (2088): No heartbeat from core client for 30 sec - exiting
03:20:59 (2088): No heartbeat from core client for 30 sec - exiting
03:21:00 (2088): No heartbeat from core client for 30 sec - exiting
03:21:01 (2088): No heartbeat from core client for 30 sec - exiting
03:21:02 (2088): No heartbeat from core client for 30 sec - exiting
03:21:03 (2088): No heartbeat from core client for 30 sec - exiting
03:21:04 (2088): No heartbeat from core client for 30 sec - exiting
03:21:05 (2088): No heartbeat from core client for 30 sec - exiting
03:21:06 (2088): No heartbeat from core client for 30 sec - exiting
03:21:07 (2088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:03:45 (2564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:03:46 (2564): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:30:11 (2360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:31:00 (1116): No heartbeat from core client for 30 sec - exiting
06:31:01 (1116): No heartbeat from core client for 30 sec - exiting
06:31:02 (1116): No heartbeat from core client for 30 sec - exiting
06:31:03 (1116): No heartbeat from core client for 30 sec - exiting
06:31:04 (1116): No heartbeat from core client for 30 sec - exiting
06:31:05 (1116): No heartbeat from core client for 30 sec - exiting
06:31:07 (1116): No heartbeat from core client for 30 sec - exiting
06:31:08 (1116): No heartbeat from core client for 30 sec - exiting
06:31:09 (1116): No heartbeat from core client for 30 sec - exiting
06:31:10 (1116): No heartbeat from core client for 30 sec - exiting
06:31:11 (1116): No heartbeat from core client for 30 sec - exiting
06:31:12 (1116): No heartbeat from core client for 30 sec - exiting
06:31:13 (1116): No heartbeat from core client for 30 sec - exiting
06:31:14 (1116): No heartbeat from core client for 30 sec - exiting
06:31:15 (1116): No heartbeat from core client for 30 sec - exiting
06:31:16 (1116): No heartbeat from core client for 30 sec - exiting
06:31:17 (1116): No heartbeat from core client for 30 sec - exiting
06:31:19 (1116): No heartbeat from core client for 30 sec - exiting
06:31:20 (1116): No heartbeat from core client for 30 sec - exiting
06:31:21 (1116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8152, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Mar 2013 23:46:08 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 518,400 831,078 1.6032
28 Mar 2013 11:14:43 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 492,480 789,695 1.6035
27 Mar 2013 22:24:46 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 466,560 747,992 1.6032
27 Mar 2013 10:02:47 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 440,640 706,634 1.6037
26 Mar 2013 21:30:39 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 414,720 663,641 1.6002
26 Mar 2013 09:36:27 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 388,800 621,857 1.5994
25 Mar 2013 21:36:05 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 362,880 579,257 1.5963
25 Mar 2013 09:42:28 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 336,960 537,300 1.5946
24 Mar 2013 21:38:43 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 311,040 494,880 1.5910
24 Mar 2013 09:22:16 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 285,120 452,527 1.5871
23 Mar 2013 21:12:22 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 259,200 410,204 1.5826
23 Mar 2013 08:38:12 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 233,280 369,162 1.5825
22 Mar 2013 20:20:05 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 207,360 327,439 1.5791
22 Mar 2013 08:33:53 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 181,440 285,928 1.5759
21 Mar 2013 20:43:50 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 155,520 243,886 1.5682
21 Mar 2013 08:54:12 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 129,600 202,037 1.5589
20 Mar 2013 21:00:11 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 103,680 160,337 1.5465
20 Mar 2013 09:07:07 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 77,760 118,075 1.5185
19 Mar 2013 20:57:26 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 51,840 76,114 1.4682
12 Feb 2013 12:45:11 1228594 15605551 hadcm3n_zdj0_1880_40_008250336_4 25,920 36,665 1.4145
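
The "Average (sec/TS)" column above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The following is a minimal sketch that checks this reading against three rows copied from the table. The function name seconds_per_timestep is made up for illustration and is not part of BOINC or the CPDN software.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded as in the table."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average reported by the page),
# copied from the first, second, and last rows of the trickle table.
rows = [
    (518_400, 831_078, 1.6032),
    (492_480, 789_695, 1.6035),
    (25_920, 36_665, 1.4145),
]

for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

For these three rows the computed value agrees with the reported average, which is consistent with the column being a running average over the whole task and not a per-trickle rate.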

