Task 15766437

Name hadcm3n_4dcb_1940_40_008302274_1
Workunit 8453409
Created 9 May 2013, 1:54:16 UTC
Sent 9 May 2013, 1:54:28 UTC
Report deadline 8 Aug 2013, 9:21:39 UTC
Received 19 May 2013, 7:25:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1185835
Run time 3 days 15 hours 46 min 2 sec
CPU time 3 days 14 hours 51 min 7 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 1.72 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:29:00 (4860): No heartbeat from core client for 30 sec - exiting
07:29:01 (4860): No heartbeat from core client for 30 sec - exiting
07:29:02 (4860): No heartbeat from core client for 30 sec - exiting
07:29:03 (4860): No heartbeat from core client for 30 sec - exiting
07:29:04 (4860): No heartbeat from core client for 30 sec - exiting
07:29:05 (4860): No heartbeat from core client for 30 sec - exiting
07:29:06 (4860): No heartbeat from core client for 30 sec - exiting
07:29:07 (4860): No heartbeat from core client for 30 sec - exiting
07:29:08 (4860): No heartbeat from core client for 30 sec - exiting
07:29:09 (4860): No heartbeat from core client for 30 sec - exiting
07:29:10 (4860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
09:20:40 (3008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:10:47 (4404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:44:19 (5096): No heartbeat from core client for 30 sec - exiting
18:44:20 (5096): No heartbeat from core client for 30 sec - exiting
18:44:21 (5096): No heartbeat from core client for 30 sec - exiting
18:44:22 (5096): No heartbeat from core client for 30 sec - exiting
18:44:23 (5096): No heartbeat from core client for 30 sec - exiting
18:44:24 (5096): No heartbeat from core client for 30 sec - exiting
18:44:25 (5096): No heartbeat from core client for 30 sec - exiting
18:44:26 (5096): No heartbeat from core client for 30 sec - exiting
18:44:27 (5096): No heartbeat from core client for 30 sec - exiting
18:44:28 (5096): No heartbeat from core client for 30 sec - exiting
18:44:29 (5096): No heartbeat from core client for 30 sec - exiting
18:44:30 (5096): No heartbeat from core client for 30 sec - exiting
18:44:31 (5096): No heartbeat from core client for 30 sec - exiting
18:44:32 (5096): No heartbeat from core client for 30 sec - exiting
18:44:33 (5096): No heartbeat from core client for 30 sec - exiting
18:44:34 (5096): No heartbeat from core client for 30 sec - exiting
18:44:35 (5096): No heartbeat from core client for 30 sec - exiting
18:44:36 (5096): No heartbeat from core client for 30 sec - exiting
18:44:37 (5096): No heartbeat from core client for 30 sec - exiting
18:44:38 (5096): No heartbeat from core client for 30 sec - exiting
18:44:39 (5096): No heartbeat from core client for 30 sec - exiting
18:44:40 (5096): No heartbeat from core client for 30 sec - exiting
18:44:41 (5096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
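The stderr output above is long and repetitive, so a tally by message type can make the sequence of events easier to read. The following is a minimal sketch only: the category labels and the `tally` helper are illustrative names rather than BOINC or CPDN terminology, and the sample lines are copied from the log above.

```python
from collections import Counter

# Substrings taken from the stderr log above; the labels on the left are
# illustrative names chosen for this sketch, not official terminology.
MARKERS = {
    "suspend request": "Suspend request from BOINC",
    "quit request": "Quit request from BOINC",
    "client heartbeat lost": "No heartbeat from core client",
    "model crash": "Model crash detected",
}


def tally(stderr_text):
    """Count how many log lines contain each marker substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, marker in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


# A few lines copied verbatim from the log, used as sample input.
sample = """Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:29:00 (4860): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

for label, count in tally(sample).items():
    print(f"{label}: {count}")
```

Passing the full contents of the `<stderr_txt>` element instead of `sample` would give the per-type totals for this task.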
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 May 2013 06:47:53  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  259,200   311,574         1.2021
17 May 2013 20:06:38  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  233,280   280,550         1.2026
16 May 2013 19:16:55  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  207,360   248,868         1.2002
15 May 2013 19:35:32  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  181,440   217,294         1.1976
15 May 2013 00:27:37  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  155,520   186,233         1.1975
14 May 2013 03:36:43  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  129,600   155,178         1.1974
13 May 2013 05:21:40  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  103,680   124,327         1.1991
12 May 2013 20:21:55  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  77,760    92,215          1.1859
10 May 2013 07:56:00  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  51,840    59,739          1.1524
09 May 2013 21:22:31  1185835  15766437   hadcm3n_4dcb_1940_40_008302274_1  25,920    29,528          1.1392
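
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the rows of the table; the `sec_per_timestep` helper and the tuple layout are assumptions made for illustration, not part of the CPDN site.

```python
# Rows copied from the trickle table, oldest first:
# (timestep, cpu_time_sec, average_reported_by_the_page)
trickles = [
    (25_920, 29_528, 1.1392),
    (51_840, 59_739, 1.1524),
    (77_760, 92_215, 1.1859),
    (103_680, 124_327, 1.1991),
    (129_600, 155_178, 1.1974),
    (155_520, 186_233, 1.1975),
    (181_440, 217_294, 1.1976),
    (207_360, 248_868, 1.2002),
    (233_280, 280_550, 1.2026),
    (259_200, 311_574, 1.2021),
]


def sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = sec_per_timestep(timestep, cpu_seconds)
    # Compare at the four decimal places the page displays.
    status = "matches" if abs(computed - reported) < 5e-5 else "differs"
    print(f"{timestep:>7} TS  {computed:.4f} sec/TS  ({status} reported {reported})")
```

Each trickle in the table also advances the timestep count by 25,920, so the rows are evenly spaced in model time even though the wall-clock gaps between them vary.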

