Task 15777462

Name hadcm3n_4h3h_1940_40_008310536_1
Workunit 8461671
Created 11 May 2013, 10:28:28 UTC
Sent 11 May 2013, 11:35:28 UTC
Report deadline 10 Aug 2013, 19:02:39 UTC
Received 22 May 2013, 9:39:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158176
Run time 10 days 4 hours 22 min 32 sec
CPU time 9 days 12 hours 18 min 4 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 2.83 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:06:09 (14008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:07:49 (15028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:00:56 (4268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:01:57 (2344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:08:22 (5572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:24:11 (5392): No heartbeat from core client for 30 sec - exiting
06:24:12 (5392): No heartbeat from core client for 30 sec - exiting
06:24:13 (5392): No heartbeat from core client for 30 sec - exiting
06:24:14 (5392): No heartbeat from core client for 30 sec - exiting
06:24:15 (5392): No heartbeat from core client for 30 sec - exiting
06:24:16 (5392): No heartbeat from core client for 30 sec - exiting
06:24:17 (5392): No heartbeat from core client for 30 sec - exiting
06:24:18 (5392): No heartbeat from core client for 30 sec - exiting
06:24:19 (5392): No heartbeat from core client for 30 sec - exiting
06:24:20 (5392): No heartbeat from core client for 30 sec - exiting
06:24:21 (5392): No heartbeat from core client for 30 sec - exiting
06:24:22 (5392): No heartbeat from core client for 30 sec - exiting
06:24:23 (5392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:25:24 (7544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:36 (6356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:28:19 (5572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:31 (3552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:32 (3552): No heartbeat from core client for 30 sec - exiting
11:47:42 (19608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:09:00 (20920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:16:42 (5268): No heartbeat from core client for 30 sec - exiting
22:16:44 (5268): No heartbeat from core client for 30 sec - exiting
22:16:45 (5268): No heartbeat from core client for 30 sec - exiting
22:16:46 (5268): No heartbeat from core client for 30 sec - exiting
22:16:47 (5268): No heartbeat from core client for 30 sec - exiting
22:16:48 (5268): No heartbeat from core client for 30 sec - exiting
22:16:49 (5268): No heartbeat from core client for 30 sec - exiting
22:16:50 (5268): No heartbeat from core client for 30 sec - exiting
22:16:51 (5268): No heartbeat from core client for 30 sec - exiting
22:16:52 (5268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=1
Model crash detected, will try to restart...
22:19:36 (6080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1
Model crash detected, will try to restart...
22:23:22 (5260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4364, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4364, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
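To make the failure pattern in the stderr log above easier to see, the following is a minimal sketch that tallies the recurring message types. The function name `summarize`, the category labels, and the short `sample` string are illustrative assumptions, not part of BOINC or CPDN; the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Matches lines such as "22:16:52 (5268): No heartbeat from core client ..."
# (timestamp, then a process ID in parentheses).
HEARTBEAT = re.compile(r"^\d{2}:\d{2}:\d{2} \(\d+\): No heartbeat from core client")


def summarize(stderr_text: str) -> Counter:
    """Count the recurring message types in a stderr dump (hypothetical helper)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        line = line.strip()
        if HEARTBEAT.match(line):
            counts["heartbeat_lost"] += 1
        elif line.startswith("Signal 22 received"):
            counts["signal_22"] += 1
        elif line.startswith("Model crash detected"):
            counts["model_crash"] += 1
        elif line.startswith("Suspended CPDN Monitor"):
            counts["suspended"] += 1
    return counts


# A few lines copied from the log above, used only as sample input.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
22:16:52 (5268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

for kind, n in sorted(summarize(sample).items()):
    print(kind, n)
```

Pasting the full contents of the `<stderr_txt>` element in place of `sample` would give totals for the whole task.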
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
21 May 2013 13:09:36 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 362,880 | 782,576 | 2.1566
20 May 2013 21:12:58 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 336,960 | 729,951 | 2.1663
20 May 2013 05:54:54 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 311,040 | 678,049 | 2.1799
19 May 2013 15:29:26 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 285,120 | 627,414 | 2.2005
19 May 2013 02:09:48 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 259,200 | 580,325 | 2.2389
18 May 2013 12:25:00 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 233,280 | 532,697 | 2.2835
17 May 2013 20:16:46 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 207,360 | 477,995 | 2.3051
17 May 2013 04:28:46 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 181,440 | 422,906 | 2.3308
16 May 2013 12:28:41 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 155,520 | 367,302 | 2.3618
15 May 2013 17:51:38 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 129,600 | 312,976 | 2.4149
14 May 2013 18:47:28 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 103,680 | 251,802 | 2.4286
13 May 2013 23:09:44 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 77,760 | 189,781 | 2.4406
13 May 2013 03:40:41 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 51,840 | 126,178 | 2.4340
12 May 2013 06:10:09 | 1158176 | 15777462 | hadcm3n_4h3h_1940_40_008310536_1 | 25,920 | 61,588 | 2.3761
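
The Average (sec/TS) column in the trickle rows above appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the newest and oldest rows; the function name `average_sec_per_timestep` is a hypothetical helper, and rounding to four decimal places is an assumption based on how the values are displayed.

```python
# (timestep, CPU time in seconds, reported average) copied from the
# 21 May 2013 and 12 May 2013 trickle rows above.
trickles = [
    (362_880, 782_576, 2.1566),
    (25_920, 61_588, 2.3761),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_sec, reported in trickles:
    computed = average_sec_per_timestep(timestep, cpu_sec)
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

Both rows reproduce the reported value under this assumption, which also shows the per-timestep cost falling from about 2.38 s early in the run to about 2.16 s by the last trickle.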

