Task 15588186

Name hadcm3n_4fr0_1940_40_008302743_0
Workunit 8453878
Created 6 Feb 2013, 20:13:43 UTC
Sent 6 Feb 2013, 20:14:10 UTC
Report deadline 9 May 2013, 3:41:21 UTC
Received 12 Feb 2013, 15:30:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1230144
Run time 5 days 14 hours 17 min 51 sec
CPU time 5 days 7 hours 32 min 57 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.22 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
00:16:30 (5424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:18:17 (5728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:18:58 (5620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:22:38 (1488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:57 (1452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:27:22 (5016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:28:08 (4232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:17:55 (5264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:17:57 (5264): No heartbeat from core client for 30 sec - exiting
05:55:05 (4136): No heartbeat from core client for 30 sec - exiting
05:55:06 (4136): No heartbeat from core client for 30 sec - exiting
05:55:07 (4136): No heartbeat from core client for 30 sec - exiting
05:55:08 (4136): No heartbeat from core client for 30 sec - exiting
05:55:09 (4136): No heartbeat from core client for 30 sec - exiting
05:55:11 (4136): No heartbeat from core client for 30 sec - exiting
05:55:12 (4136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:11:38 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:12:18 (4844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:16:42 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:06:27 (6808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:06:29 (6808): No heartbeat from core client for 30 sec - exiting
00:06:30 (6808): No heartbeat from core client for 30 sec - exiting
00:06:31 (6808): No heartbeat from core client for 30 sec - exiting
00:07:33 (2904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:34:27 (3856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:22 (4468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:40:46 (232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
05:49:39 (3732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
12 Feb 2013 02:08:05  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  259,200   425,038         1.6398
11 Feb 2013 12:16:57  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  233,280   382,471         1.6395
10 Feb 2013 23:59:10  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  207,360   340,504         1.6421
10 Feb 2013 11:27:47  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  181,440   297,938         1.6421
09 Feb 2013 23:49:46  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  155,520   255,308         1.6416
09 Feb 2013 10:42:16  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  129,600   212,994         1.6435
08 Feb 2013 22:13:55  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  103,680   170,774         1.6471
08 Feb 2013 09:42:23  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  77,760    128,023         1.6464
07 Feb 2013 21:26:41  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  51,840    85,495          1.6492
07 Feb 2013 09:08:01  1230144  15588186   hadcm3n_4fr0_1940_40_008302743_0  25,920    42,834          1.6525
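
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is an assumption inferred from the figures in the trickle table, not something the page states. A minimal Python sketch that recomputes the ratio for each row; the names `TRICKLES` and `sec_per_timestep` are illustrative only and not part of any BOINC or CPDN tool:

```python
# Rows copied from the trickle table: (timestep, CPU time in seconds, reported average).
TRICKLES = [
    (259_200, 425_038, 1.6398),
    (233_280, 382_471, 1.6395),
    (207_360, 340_504, 1.6421),
    (181_440, 297_938, 1.6421),
    (155_520, 255_308, 1.6416),
    (129_600, 212_994, 1.6435),
    (103_680, 170_774, 1.6471),
    (77_760, 128_023, 1.6464),
    (51_840, 85_495, 1.6492),
    (25_920, 42_834, 1.6525),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition of the Average column: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


# Print the recomputed ratio next to the value reported on the page.
for timestep, cpu_time_sec, reported in TRICKLES:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>8,d}  computed {computed:.4f}  reported {reported:.4f}")
```

If the assumption holds, each computed value should agree with the reported one to four decimal places.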


©2024 climateprediction.net