Task 15744495


Name hadcm3n_n5b2_1920_40_008321509_1
Workunit 8472644
Created 21 Apr 2013, 5:38:57 UTC
Sent 21 Apr 2013, 5:39:52 UTC
Report deadline 21 Jul 2013, 13:07:03 UTC
Received 1 May 2013, 20:28:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1172912
Run time 5 days 4 hours 27 min 12 sec
CPU time 4 days 22 hours 4 min 38 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:05:56 (5184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=912, iMonCtr=1
Model crash detected, will try to restart...
11:56:40 (3960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:56:41 (3960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:00:21 (4888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:53:57 (1624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:55:21 (5988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
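A stderr dump like the one above is easier to read once the repeated messages are tallied by kind. The following is a minimal sketch only: the function name `tally_events` and the category labels are hypothetical, the substrings are copied from the log lines above, and the sample input is a short excerpt rather than the full log.

```python
from collections import Counter


def tally_events(stderr_text: str) -> Counter:
    """Count selected message kinds in a CPDN stderr dump (labels are illustrative)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        if "Suspend request from BOINC" in line:
            counts["suspend"] += 1
        elif "Quit request from BOINC" in line:
            counts["quit"] += 1
        elif "No heartbeat from core client" in line:
            counts["no_heartbeat"] += 1
        elif "Model crash detected" in line:
            counts["model_crash"] += 1
    return counts


# Short excerpt assembled from lines that appear in the log above.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:05:56 (5184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
"""

for kind, n in sorted(tally_events(sample).items()):
    print(f"{kind}: {n}")
```

The monitor's own `No 'heartbeat' from BOINC...` line is deliberately not matched, so each heartbeat timeout is counted once rather than twice.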
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
01 May 2013 01:46:57  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  362,880   412,377         1.1364
30 Apr 2013 09:30:39  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  336,960   382,855         1.1362
29 Apr 2013 22:01:57  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  311,040   355,750         1.1437
29 Apr 2013 03:51:14  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  285,120   326,454         1.1450
28 Apr 2013 06:35:31  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  259,200   296,604         1.1443
27 Apr 2013 10:18:25  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  233,280   267,353         1.1461
26 Apr 2013 07:22:26  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  207,360   237,327         1.1445
25 Apr 2013 06:26:15  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  181,440   206,816         1.1399
24 Apr 2013 21:32:49  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  155,520   175,697         1.1297
24 Apr 2013 20:31:54  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  129,600   149,599         1.1543
23 Apr 2013 22:21:20  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  103,680   119,617         1.1537
23 Apr 2013 01:31:43  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  77,760    89,579          1.1520
22 Apr 2013 09:04:02  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  51,840    59,428          1.1464
22 Apr 2013 00:00:48  1172912  15744495   hadcm3n_n5b2_1920_40_008321509_1  25,920    29,799          1.1497
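
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The sketch below checks that assumption against three rows copied from the table; the function name `avg_sec_per_timestep` is hypothetical and not part of any CPDN or BOINC tooling.

```python
def avg_sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table above.
rows = [
    (25_920, 29_799, 1.1497),
    (129_600, 149_599, 1.1543),
    (362_880, 412_377, 1.1364),
]

for timestep, cpu_seconds, reported in rows:
    computed = avg_sec_per_timestep(cpu_seconds, timestep)
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

For these rows the computed and reported values agree, which is consistent with the average being cumulative rather than measured per trickle interval.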

