climateprediction.net home page
Task 15476398

Task 15476398

Name hadcm3n_zlj3_1960_40_008255668_1
Workunit 8410792
Created 14 Dec 2012, 3:08:37 UTC
Sent 14 Dec 2012, 3:15:22 UTC
Report deadline 15 Mar 2013, 10:42:33 UTC
Received 28 Dec 2012, 1:41:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1129647
Run time 7 days 9 hours 35 min 53 sec
CPU time 7 days 0 hours 37 min 3 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 2.89 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
05:26:57 (4296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:13:07 (5072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
05:30:37 (3144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:39:13 (376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:26:57 (8900): No heartbeat from core client for 30 sec - exiting
05:26:58 (8900): No heartbeat from core client for 30 sec - exiting
05:26:59 (8900): No heartbeat from core client for 30 sec - exiting
05:27:00 (8900): No heartbeat from core client for 30 sec - exiting
05:27:01 (8900): No heartbeat from core client for 30 sec - exiting
05:27:02 (8900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:08:08 (1456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:19:51 (1184): No heartbeat from core client for 30 sec - exiting
18:19:52 (1184): No heartbeat from core client for 30 sec - exiting
18:19:53 (1184): No heartbeat from core client for 30 sec - exiting
18:19:54 (1184): No heartbeat from core client for 30 sec - exiting
18:19:55 (1184): No heartbeat from core client for 30 sec - exiting
18:19:56 (1184): No heartbeat from core client for 30 sec - exiting
18:19:57 (1184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Dec 2012 20:50:47 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 336,960 598,046 1.7748
26 Dec 2012 02:42:39 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 311,040 552,389 1.7759
25 Dec 2012 14:20:38 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 285,120 504,729 1.7702
24 Dec 2012 10:43:20 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 259,200 456,982 1.7630
23 Dec 2012 15:04:30 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 233,280 409,945 1.7573
22 Dec 2012 16:30:56 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 207,360 364,490 1.7578
21 Dec 2012 18:32:54 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 181,440 318,614 1.7560
21 Dec 2012 01:13:25 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 155,520 272,868 1.7546
19 Dec 2012 23:39:17 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 129,600 227,238 1.7534
18 Dec 2012 00:20:12 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 103,680 181,242 1.7481
16 Dec 2012 18:40:30 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 77,760 136,209 1.7517
15 Dec 2012 20:25:59 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 51,840 89,994 1.7360
14 Dec 2012 23:46:37 1129647 15476398 hadcm3n_zlj3_1960_40_008255668_1 25,920 44,855 1.7305


©2024 cpdn.org