Task 16294624

Name hadcm3n_862m_1980_40_008514373_0
Workunit 8661885
Created 26 Feb 2014, 16:01:08 UTC
Sent 26 Feb 2014, 16:03:38 UTC
Report deadline 28 May 2014, 23:30:49 UTC
Received 23 Jun 2014, 13:43:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1294522
Run time 12 days 9 hours 1 min 7 sec
CPU time 11 days 8 hours 12 min 41 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.83 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5876, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5756, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5876, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5068, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3548, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6852, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:59:31 (5916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5272, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
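A stderr log this repetitive is easier to read as a tally. The sketch below counts the three kinds of event that appear in it: model crashes, suspend requests, and heartbeat timeouts. The match strings are copied from the log lines above; the function name `summarize_stderr` and the counter labels are hypothetical, chosen only for this illustration, and the sample is a short excerpt rather than the full log.

```python
from collections import Counter


def summarize_stderr(text: str) -> Counter:
    """Tally crash, suspend, and heartbeat-timeout lines in a CPDN stderr dump."""
    counts = Counter()
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("Model crash detected"):
            counts["crashes"] += 1
        elif "Suspend request from BOINC" in line:
            counts["suspends"] += 1
        elif "No heartbeat from core client" in line:
            counts["heartbeat_timeouts"] += 1
    return counts


# Short excerpt of the log above, used only to exercise the function.
sample = """\
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
07:59:31 (5916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

if __name__ == "__main__":
    print(dict(summarize_stderr(sample)))
```

Pasting the full contents of the stderr_txt element into `summarize_stderr` gives the totals for the whole task.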
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jun 2014 17:38:09 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 648,000 961,878 1.4844
19 Jun 2014 14:12:35 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 622,080 923,095 1.4839
17 Jun 2014 21:02:34 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 596,160 884,615 1.4839
16 Jun 2014 17:44:15 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 570,240 846,574 1.4846
10 Jun 2014 20:20:46 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 544,320 808,581 1.4855
04 Apr 2014 20:32:33 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 518,400 770,070 1.4855
03 Apr 2014 18:36:51 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 492,480 731,419 1.4852
02 Apr 2014 15:39:25 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 466,560 692,771 1.4848
01 Apr 2014 01:54:24 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 440,640 653,886 1.4839
28 Mar 2014 18:48:04 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 414,720 615,325 1.4837
27 Mar 2014 16:10:25 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 388,800 575,592 1.4804
25 Mar 2014 22:18:20 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 362,880 536,591 1.4787
24 Mar 2014 19:19:19 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 336,960 497,771 1.4772
21 Mar 2014 16:31:56 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 311,040 459,285 1.4766
19 Mar 2014 21:18:40 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 285,120 421,282 1.4776
18 Mar 2014 18:24:41 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 259,200 382,951 1.4774
17 Mar 2014 13:51:09 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 233,280 344,803 1.4781
13 Mar 2014 19:46:16 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 207,360 305,123 1.4715
12 Mar 2014 16:41:50 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 181,440 266,500 1.4688
11 Mar 2014 14:51:24 1294522 16294624 hadcm3n_862m_1980_40_008514373_0 155,520 231,415 1.4880
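The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. A minimal sketch of that arithmetic follows, using three rows copied from the table above; the function name `seconds_per_timestep` is hypothetical, and the rounding rule is an assumption inferred from the displayed values.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to four decimal places."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from three rows of the trickle table.
rows = [
    (648_000, 961_878),  # 20 Jun 2014, table shows 1.4844
    (622_080, 923_095),  # 19 Jun 2014, table shows 1.4839
    (155_520, 231_415),  # 11 Mar 2014, table shows 1.4880
]

if __name__ == "__main__":
    for timestep, cpu_time in rows:
        print(timestep, cpu_time, seconds_per_timestep(cpu_time, timestep))
```

Consecutive rows also differ by a constant 25,920 timesteps (for example 648,000 minus 622,080), so each trickle marks the same amount of model progress.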

