climateprediction.net home page
Task 15445116

Task 15445116

Name hadcm3n_z8t2_1880_40_008247630_0
Workunit 8402754
Created 21 Nov 2012, 8:51:50 UTC
Sent 21 Nov 2012, 8:52:03 UTC
Report deadline 20 Feb 2013, 16:19:14 UTC
Received 4 Jan 2013, 17:20:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1206582
Run time 13 days 12 hours 28 min 42 sec
CPU time 12 days 13 hours 57 min 50 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 1.48 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:15:00 (4432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:15:01 (4432): No heartbeat from core client for 30 sec - exiting
06:15:02 (4432): No heartbeat from core client for 30 sec - exiting
06:15:03 (4432): No heartbeat from core client for 30 sec - exiting
06:15:04 (4432): No heartbeat from core client for 30 sec - exiting
06:15:05 (4432): No heartbeat from core client for 30 sec - exiting
06:15:06 (4432): No heartbeat from core client for 30 sec - exiting
06:15:07 (4432): No heartbeat from core client for 30 sec - exiting
06:15:08 (4432): No heartbeat from core client for 30 sec - exiting
06:15:09 (4432): No heartbeat from core client for 30 sec - exiting
06:15:10 (4432): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jan 2013 01:13:27 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 570,240 1,050,189 1.8417
31 Dec 2012 13:30:23 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 544,320 1,002,781 1.8423
30 Dec 2012 20:39:38 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 518,400 954,678 1.8416
29 Dec 2012 07:40:08 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 492,480 905,783 1.8392
25 Dec 2012 18:06:49 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 466,560 858,474 1.8400
24 Dec 2012 06:52:24 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 440,640 810,642 1.8397
19 Dec 2012 18:14:34 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 414,720 761,755 1.8368
17 Dec 2012 05:49:08 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 388,800 714,020 1.8365
16 Dec 2012 09:09:39 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 362,880 666,788 1.8375
15 Dec 2012 19:45:13 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 336,960 620,002 1.8400
13 Dec 2012 23:18:02 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 311,040 571,529 1.8375
13 Dec 2012 17:56:38 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 285,120 524,421 1.8393
13 Dec 2012 17:56:38 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 259,200 476,572 1.8386
13 Dec 2012 17:56:38 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 233,280 428,039 1.8349
06 Dec 2012 19:43:06 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 207,360 380,366 1.8343
04 Dec 2012 18:17:37 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 181,440 332,624 1.8332
02 Dec 2012 03:52:43 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 155,520 284,341 1.8283
01 Dec 2012 08:46:14 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 129,600 237,229 1.8305
29 Nov 2012 02:26:09 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 103,680 189,395 1.8267
28 Nov 2012 07:16:23 1206582 15445116 hadcm3n_z8t2_1880_40_008247630_0 77,760 141,542 1.8202


©2024 cpdn.org