Task 15475609

Name hadcm3n_ydxv_1980_40_008244283_3
Workunit 8399407
Created 13 Dec 2012, 22:19:33 UTC
Sent 13 Dec 2012, 22:20:20 UTC
Report deadline 15 Mar 2013, 5:47:31 UTC
Received 23 Dec 2012, 0:07:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1254994
Run time 2 days 10 hours 19 min 7 sec
CPU time 2 days 8 hours 22 min 7 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 3.75 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=164, iMonCtr=1
Model crash detected, will try to restart...
12:27:42 (4700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:55:55 (3668): No heartbeat from core client for 30 sec - exiting
09:55:57 (3668): No heartbeat from core client for 30 sec - exiting
09:55:58 (3668): No heartbeat from core client for 30 sec - exiting
09:55:59 (3668): No heartbeat from core client for 30 sec - exiting
09:56:00 (3668): No heartbeat from core client for 30 sec - exiting
09:56:01 (3668): No heartbeat from core client for 30 sec - exiting
09:56:02 (3668): No heartbeat from core client for 30 sec - exiting
09:56:03 (3668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:04 (3668): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4140, iMonCtr=1
Model crash detected, will try to restart...
10:16:11 (4488): No heartbeat from core client for 30 sec - exiting
10:16:12 (4488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=1
Model crash detected, will try to restart...
16:54:26 (216): No heartbeat from core client for 30 sec - exiting
16:54:27 (216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:25:15 (1264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:35:04 (4388): No heartbeat from core client for 30 sec - exiting
09:35:05 (4388): No heartbeat from core client for 30 sec - exiting
09:35:06 (4388): No heartbeat from core client for 30 sec - exiting
09:35:07 (4388): No heartbeat from core client for 30 sec - exiting
09:35:08 (4388): No heartbeat from core client for 30 sec - exiting
09:35:09 (4388): No heartbeat from core client for 30 sec - exiting
09:35:10 (4388): No heartbeat from core client for 30 sec - exiting
09:35:11 (4388): No heartbeat from core client for 30 sec - exiting
09:35:12 (4388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:40:05 (4040): No heartbeat from core client for 30 sec - exiting
18:40:07 (4040): No heartbeat from core client for 30 sec - exiting
18:40:08 (4040): No heartbeat from core client for 30 sec - exiting
18:40:09 (4040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:49:29 (3732): No heartbeat from core client for 30 sec - exiting
09:49:30 (3732): No heartbeat from core client for 30 sec - exiting
09:49:31 (3732): No heartbeat from core client for 30 sec - exiting
09:49:32 (3732): No heartbeat from core client for 30 sec - exiting
09:49:33 (3732): No heartbeat from core client for 30 sec - exiting
09:49:34 (3732): No heartbeat from core client for 30 sec - exiting
09:49:35 (3732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:50:57 (924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:55:01 (5036): No heartbeat from core client for 30 sec - exiting
15:55:02 (5036): No heartbeat from core client for 30 sec - exiting
15:55:03 (5036): No heartbeat from core client for 30 sec - exiting
15:55:04 (5036): No heartbeat from core client for 30 sec - exiting
15:55:05 (5036): No heartbeat from core client for 30 sec - exiting
15:55:06 (5036): No heartbeat from core client for 30 sec - exiting
15:55:07 (5036): No heartbeat from core client for 30 sec - exiting
15:55:08 (5036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:55:09 (5036): No heartbeat from core client for 30 sec - exiting
09:50:03 (4896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:01:08 (4708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:15:25 (4860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:02:25 (3220): No heartbeat from core client for 30 sec - exiting
16:02:26 (3220): No heartbeat from core client for 30 sec - exiting
16:02:27 (3220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
16:49:07 (4812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:49:08 (4812): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1
Model crash detected, will try to restart...
19:05:44 (4512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
22 Dec 2012 14:50:39  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  155,520   186,980         1.2023
19 Dec 2012 15:03:03  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  129,600   155,892         1.2029
19 Dec 2012 05:51:26  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  103,680   124,701         1.2027
18 Dec 2012 20:42:50  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  77,760    93,694          1.2049
17 Dec 2012 15:00:38  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  51,840    62,989          1.2151
16 Dec 2012 00:37:55  1254994  15475609   hadcm3n_ydxv_1980_40_008244283_3  25,920    31,249          1.2056
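
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the six rows shown. The helper name `average_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software; the figures are copied from the table.

```python
# Rows copied from the "Latest Trickles Received" table:
# (timestep, CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (155_520, 186_980, 1.2023),
    (129_600, 155_892, 1.2029),
    (103_680, 124_701, 1.2027),
    (77_760, 93_694, 1.2049),
    (51_840, 62_989, 1.2151),
    (25_920, 31_249, 1.2056),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Hypothetical helper: cumulative CPU seconds per cumulative timestep.

    Assumes the page rounds the quotient to four decimal places.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        print(f"{timestep:>7} TS  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption every row reproduces the reported value, which suggests the model ran at a steady rate of roughly 1.2 CPU seconds per timestep until the final crash.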

