climateprediction.net home page
Task 16052473

Task 16052473

Name hadcm3n_o1q9_2020_40_008408309_2
Workunit 8559165
Created 1 Oct 2013, 18:10:33 UTC
Sent 1 Oct 2013, 19:05:30 UTC
Report deadline 1 Jan 2014, 2:32:41 UTC
Received 29 Oct 2013, 17:52:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1295275
Run time 5 days 0 hours 53 min 5 sec
CPU time 4 days 4 hours 33 min 7 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 3.38 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
Het apparaat herkent de opdracht niet.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5904, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4268, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4456, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4740, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4740, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
08:04:05 (920): No heartbeat from core client for 30 sec - exiting
08:04:06 (920): No heartbeat from core client for 30 sec - exiting
08:04:07 (920): No heartbeat from core client for 30 sec - exiting
08:04:08 (920): No heartbeat from core client for 30 sec - exiting
08:04:09 (920): No heartbeat from core client for 30 sec - exiting
08:04:10 (920): No heartbeat from core client for 30 sec - exiting
08:04:11 (920): No heartbeat from core client for 30 sec - exiting
08:04:13 (920): No heartbeat from core client for 30 sec - exiting
08:04:14 (920): No heartbeat from core client for 30 sec - exiting
08:04:15 (920): No heartbeat from core client for 30 sec - exiting
08:04:16 (920): No heartbeat from core client for 30 sec - exiting
08:04:17 (920): No heartbeat from core client for 30 sec - exiting
08:04:18 (920): No heartbeat from core client for 30 sec - exiting
08:04:19 (920): No heartbeat from core client for 30 sec - exiting
08:04:20 (920): No heartbeat from core client for 30 sec - exiting
08:04:21 (920): No heartbeat from core client for 30 sec - exiting
08:04:22 (920): No heartbeat from core client for 30 sec - exiting
08:04:23 (920): No heartbeat from core client for 30 sec - exiting
08:04:25 (920): No heartbeat from core client for 30 sec - exiting
08:04:26 (920): No heartbeat from core client for 30 sec - exiting
08:04:27 (920): No heartbeat from core client for 30 sec - exiting
08:04:28 (920): No heartbeat from core client for 30 sec - exiting
08:04:29 (920): No heartbeat from core client for 30 sec - exiting
08:04:30 (920): No heartbeat from core client for 30 sec - exiting
08:04:31 (920): No heartbeat from core client for 30 sec - exiting
08:04:32 (920): No heartbeat from core client for 30 sec - exiting
08:04:33 (920): No heartbeat from core client for 30 sec - exiting
08:04:34 (920): No heartbeat from core client for 30 sec - exiting
08:04:35 (920): No heartbeat from core client for 30 sec - exiting
08:04:37 (920): No heartbeat from core client for 30 sec - exiting
08:04:38 (920): No heartbeat from core client for 30 sec - exiting
08:04:39 (920): No heartbeat from core client for 30 sec - exiting
08:04:40 (920): No heartbeat from core client for 30 sec - exiting
08:04:41 (920): No heartbeat from core client for 30 sec - exiting
08:04:42 (920): No heartbeat from core client for 30 sec - exiting
08:04:43 (920): No heartbeat from core client for 30 sec - exiting
08:04:44 (920): No heartbeat from core client for 30 sec - exiting
08:04:45 (920): No heartbeat from core client for 30 sec - exiting
08:04:46 (920): No heartbeat from core client for 30 sec - exiting
08:04:47 (920): No heartbeat from core client for 30 sec - exiting
08:04:49 (920): No heartbeat from core client for 30 sec - exiting
08:04:50 (920): No heartbeat from core client for 30 sec - exiting
08:04:51 (920): No heartbeat from core client for 30 sec - exiting
08:04:52 (920): No heartbeat from core client for 30 sec - exiting
08:04:53 (920): No heartbeat from core client for 30 sec - exiting
08:04:54 (920): No heartbeat from core client for 30 sec - exiting
08:04:55 (920): No heartbeat from core client for 30 sec - exiting
08:04:56 (920): No heartbeat from core client for 30 sec - exiting
08:04:57 (920): No heartbeat from core client for 30 sec - exiting
08:04:58 (920): No heartbeat from core client for 30 sec - exiting
08:05:00 (920): No heartbeat from core client for 30 sec - exiting
08:05:01 (920): No heartbeat from core client for 30 sec - exiting
08:05:02 (920): No heartbeat from core client for 30 sec - exiting
08:05:03 (920): No heartbeat from core client for 30 sec - exiting
08:05:04 (920): No heartbeat from core client for 30 sec - exiting
08:05:05 (920): No heartbeat from core client for 30 sec - exiting
08:05:06 (920): No heartbeat from core client for 30 sec - exiting
08:05:07 (920): No heartbeat from core client for 30 sec - exiting
08:05:08 (920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Oct 2013 12:04:59 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 207,360 338,921 1.6345
26 Oct 2013 22:19:12 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 181,440 296,123 1.6321
26 Oct 2013 09:22:22 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 155,520 252,712 1.6249
24 Oct 2013 17:29:35 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 129,600 210,115 1.6213
20 Oct 2013 13:52:31 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 103,680 168,576 1.6259
18 Oct 2013 15:41:49 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 77,760 126,276 1.6239
17 Oct 2013 16:01:39 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 51,840 83,884 1.6181
14 Oct 2013 14:10:56 1295275 16052473 hadcm3n_o1q9_2020_40_008408309_2 25,920 41,626 1.6059


©2024 cpdn.org