climateprediction.net home page
Task 13370221

Task 13370221

Name hadcm3n_t1qq_1940_40_007451846_1
Workunit 7649349
Created 10 Sep 2011, 12:05:08 UTC
Sent 11 Sep 2011, 22:26:16 UTC
Report deadline 12 Dec 2011, 5:53:27 UTC
Received 10 Oct 2011, 16:10:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1128330
Run time 17 days 16 hours 17 min 49 sec
CPU time 17 days 6 hours 3 min 41 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 2.20 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:22:29 (3364): No heartbeat from core client for 30 sec - exiting
11:22:30 (3364): No heartbeat from core client for 30 sec - exiting
11:22:31 (3364): No heartbeat from core client for 30 sec - exiting
11:22:32 (3364): No heartbeat from core client for 30 sec - exiting
11:22:33 (3364): No heartbeat from core client for 30 sec - exiting
11:22:34 (3364): No heartbeat from core client for 30 sec - exiting
11:22:35 (3364): No heartbeat from core client for 30 sec - exiting
11:22:36 (3364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:22:38 (3364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:08:46 (4712): No heartbeat from core client for 30 sec - exiting
01:08:48 (4712): No heartbeat from core client for 30 sec - exiting
01:08:49 (4712): No heartbeat from core client for 30 sec - exiting
01:08:50 (4712): No heartbeat from core client for 30 sec - exiting
01:08:51 (4712): No heartbeat from core client for 30 sec - exiting
01:08:52 (4712): No heartbeat from core client for 30 sec - exiting
01:08:53 (4712): No heartbeat from core client for 30 sec - exiting
01:08:54 (4712): No heartbeat from core client for 30 sec - exiting
01:08:55 (4712): No heartbeat from core client for 30 sec - exiting
01:08:56 (4712): No heartbeat from core client for 30 sec - exiting
01:08:57 (4712): No heartbeat from core client for 30 sec - exiting
01:08:58 (4712): No heartbeat from core client for 30 sec - exiting
01:08:59 (4712): No heartbeat from core client for 30 sec - exiting
01:09:00 (4712): No heartbeat from core client for 30 sec - exiting
01:09:01 (4712): No heartbeat from core client for 30 sec - exiting
01:09:02 (4712): No heartbeat from core client for 30 sec - exiting
01:09:03 (4712): No heartbeat from core client for 30 sec - exiting
01:09:04 (4712): No heartbeat from core client for 30 sec - exiting
01:09:05 (4712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:27:23 (5280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:27:24 (5280): No heartbeat from core client for 30 sec - exiting
05:27:25 (5280): No heartbeat from core client for 30 sec - exiting
05:27:26 (5280): No heartbeat from core client for 30 sec - exiting
05:27:27 (5280): No heartbeat from core client for 30 sec - exiting
05:27:28 (5280): No heartbeat from core client for 30 sec - exiting
05:27:29 (5280): No heartbeat from core client for 30 sec - exiting
05:27:30 (5280): No heartbeat from core client for 30 sec - exiting
05:27:31 (5280): No heartbeat from core client for 30 sec - exiting
05:27:32 (5280): No heartbeat from core client for 30 sec - exiting
05:27:33 (5280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
05:11:42 (5672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:11:43 (5672): No heartbeat from core client for 30 sec - exiting
05:11:44 (5672): No heartbeat from core client for 30 sec - exiting
05:11:45 (5672): No heartbeat from core client for 30 sec - exiting
05:11:46 (5672): No heartbeat from core client for 30 sec - exiting
05:11:47 (5672): No heartbeat from core client for 30 sec - exiting
05:11:48 (5672): No heartbeat from core client for 30 sec - exiting
05:11:49 (5672): No heartbeat from core client for 30 sec - exiting
05:11:50 (5672): No heartbeat from core client for 30 sec - exiting
05:11:51 (5672): No heartbeat from core client for 30 sec - exiting
05:11:52 (5672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
10:39:06 (5536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Oct 2011 17:52:11 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 881,280 1,459,178 1.6557
06 Oct 2011 02:47:16 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 855,360 1,415,416 1.6548
05 Oct 2011 15:01:32 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 829,440 1,373,087 1.6554
05 Oct 2011 02:38:49 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 803,520 1,330,396 1.6557
04 Oct 2011 13:26:06 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 777,600 1,287,994 1.6564
03 Oct 2011 22:11:47 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 751,680 1,245,508 1.6570
30 Sep 2011 22:43:46 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 725,760 1,203,427 1.6582
30 Sep 2011 06:23:55 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 699,840 1,161,723 1.6600
29 Sep 2011 14:46:08 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 673,920 1,120,140 1.6621
29 Sep 2011 02:42:58 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 648,000 1,077,130 1.6622
28 Sep 2011 14:35:05 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 622,080 1,033,648 1.6616
27 Sep 2011 22:21:34 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 596,160 990,465 1.6614
26 Sep 2011 09:30:15 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 570,240 947,561 1.6617
25 Sep 2011 20:46:18 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 544,320 903,911 1.6606
25 Sep 2011 08:10:15 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 518,400 861,587 1.6620
24 Sep 2011 20:25:10 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 492,480 819,517 1.6641
24 Sep 2011 08:51:29 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 466,560 777,140 1.6657
23 Sep 2011 20:32:26 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 440,640 735,329 1.6688
23 Sep 2011 08:28:00 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 414,720 692,907 1.6708
22 Sep 2011 17:47:54 1128330 13370221 hadcm3n_t1qq_1940_40_007451846_1 388,800 648,053 1.6668


©2024 climateprediction.net