Task 15698468

Name hadcm3n_4n4v_1940_40_008306555_4
Workunit 8457690
Created 1 Apr 2013, 9:17:50 UTC
Sent 1 Apr 2013, 9:18:02 UTC
Report deadline 1 Jul 2013, 16:45:13 UTC
Received 21 Apr 2013, 3:34:38 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1242391
Run time 9 days 2 hours 2 min 21 sec
CPU time 7 days 21 hours 32 min 44 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 1.59 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
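The run time and CPU time above are easier to compare once both are in seconds. The following is a minimal sketch, not part of the task page: the helper name `duration_to_seconds` and the unit table are assumptions made for illustration, and only the two duration strings come from the fields above.

```python
import re

# Seconds per unit, keyed by the singular unit words used on the task page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string such as '9 days 2 hours 2 min 21 sec' to seconds."""
    total = 0
    for amount, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(amount) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("9 days 2 hours 2 min 21 sec")    # "Run time" field
cpu_time = duration_to_seconds("7 days 21 hours 32 min 44 sec")  # "CPU time" field

# Fraction of the wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time
print(run_time, cpu_time, round(cpu_fraction, 3))
```

Under these assumptions the task spent roughly 87% of its wall-clock run time on the CPU.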
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7748, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
17:21:15 (5768): No heartbeat from core client for 30 sec - exiting
17:21:16 (5768): No heartbeat from core client for 30 sec - exiting
17:21:17 (5768): No heartbeat from core client for 30 sec - exiting
17:21:18 (5768): No heartbeat from core client for 30 sec - exiting
17:21:19 (5768): No heartbeat from core client for 30 sec - exiting
17:21:20 (5768): No heartbeat from core client for 30 sec - exiting
17:21:21 (5768): No heartbeat from core client for 30 sec - exiting
17:21:23 (5768): No heartbeat from core client for 30 sec - exiting
17:21:24 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:21:25 (5768): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:20:08 (5680): No heartbeat from core client for 30 sec - exiting
16:20:09 (5680): No heartbeat from core client for 30 sec - exiting
16:20:10 (5680): No heartbeat from core client for 30 sec - exiting
16:20:12 (5680): No heartbeat from core client for 30 sec - exiting
16:20:13 (5680): No heartbeat from core client for 30 sec - exiting
16:20:14 (5680): No heartbeat from core client for 30 sec - exiting
16:20:15 (5680): No heartbeat from core client for 30 sec - exiting
16:20:16 (5680): No heartbeat from core client for 30 sec - exiting
16:20:17 (5680): No heartbeat from core client for 30 sec - exiting
16:20:18 (5680): No heartbeat from core client for 30 sec - exiting
16:20:19 (5680): No heartbeat from core client for 30 sec - exiting
16:20:20 (5680): No heartbeat from core client for 30 sec - exiting
16:20:21 (5680): No heartbeat from core client for 30 sec - exiting
16:20:22 (5680): No heartbeat from core client for 30 sec - exiting
16:20:24 (5680): No heartbeat from core client for 30 sec - exiting
16:20:25 (5680): No heartbeat from core client for 30 sec - exiting
16:20:26 (5680): No heartbeat from core client for 30 sec - exiting
16:20:27 (5680): No heartbeat from core client for 30 sec - exiting
16:20:28 (5680): No heartbeat from core client for 30 sec - exiting
16:20:29 (5680): No heartbeat from core client for 30 sec - exiting
16:20:30 (5680): No heartbeat from core client for 30 sec - exiting
16:20:31 (5680): No heartbeat from core client for 30 sec - exiting
16:20:32 (5680): No heartbeat from core client for 30 sec - exiting
16:20:33 (5680): No heartbeat from core client for 30 sec - exiting
16:20:34 (5680): No heartbeat from core client for 30 sec - exiting
16:20:36 (5680): No heartbeat from core client for 30 sec - exiting
16:20:37 (5680): No heartbeat from core client for 30 sec - exiting
16:20:38 (5680): No heartbeat from core client for 30 sec - exiting
16:20:39 (5680): No heartbeat from core client for 30 sec - exiting
16:20:40 (5680): No heartbeat from core client for 30 sec - exiting
16:20:41 (5680): No heartbeat from core client for 30 sec - exiting
16:20:42 (5680): No heartbeat from core client for 30 sec - exiting
16:20:43 (5680): No heartbeat from core client for 30 sec - exiting
16:20:44 (5680): No heartbeat from core client for 30 sec - exiting
16:20:45 (5680): No heartbeat from core client for 30 sec - exiting
16:20:47 (5680): No heartbeat from core client for 30 sec - exiting
16:20:48 (5680): No heartbeat from core client for 30 sec - exiting
16:20:49 (5680): No heartbeat from core client for 30 sec - exiting
16:20:50 (5680): No heartbeat from core client for 30 sec - exiting
16:20:51 (5680): No heartbeat from core client for 30 sec - exiting
16:20:52 (5680): No heartbeat from core client for 30 sec - exiting
16:20:53 (5680): No heartbeat from core client for 30 sec - exiting
16:20:54 (5680): No heartbeat from core client for 30 sec - exiting
16:20:55 (5680): No heartbeat from core client for 30 sec - exiting
16:20:56 (5680): No heartbeat from core client for 30 sec - exiting
16:20:57 (5680): No heartbeat from core client for 30 sec - exiting
16:20:59 (5680): No heartbeat from core client for 30 sec - exiting
16:21:00 (5680): No heartbeat from core client for 30 sec - exiting
16:21:01 (5680): No heartbeat from core client for 30 sec - exiting
16:21:02 (5680): No heartbeat from core client for 30 sec - exiting
16:21:03 (5680): No heartbeat from core client for 30 sec - exiting
16:21:04 (5680): No heartbeat from core client for 30 sec - exiting
16:21:05 (5680): No heartbeat from core client for 30 sec - exiting
16:21:06 (5680): No heartbeat from core client for 30 sec - exiting
16:21:07 (5680): No heartbeat from core client for 30 sec - exiting
16:21:08 (5680): No heartbeat from core client for 30 sec - exiting
16:21:09 (5680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:21:11 (5680): No heartbeat from core client for 30 sec - exiting
18:04:14 (3764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:40:08 (3172): No heartbeat from core client for 30 sec - exiting
06:40:09 (3172): No heartbeat from core client for 30 sec - exiting
06:40:10 (3172): No heartbeat from core client for 30 sec - exiting
06:40:11 (3172): No heartbeat from core client for 30 sec - exiting
06:40:12 (3172): No heartbeat from core client for 30 sec - exiting
06:40:13 (3172): No heartbeat from core client for 30 sec - exiting
06:40:14 (3172): No heartbeat from core client for 30 sec - exiting
06:40:16 (3172): No heartbeat from core client for 30 sec - exiting
06:40:17 (3172): No heartbeat from core client for 30 sec - exiting
06:40:18 (3172): No heartbeat from core client for 30 sec - exiting
06:40:19 (3172): No heartbeat from core client for 30 sec - exiting
06:40:20 (3172): No heartbeat from core client for 30 sec - exiting
06:40:21 (3172): No heartbeat from core client for 30 sec - exiting
06:40:22 (3172): No heartbeat from core client for 30 sec - exiting
06:40:23 (3172): No heartbeat from core client for 30 sec - exiting
06:40:24 (3172): No heartbeat from core client for 30 sec - exiting
06:40:25 (3172): No heartbeat from core client for 30 sec - exiting
06:40:26 (3172): No heartbeat from core client for 30 sec - exiting
06:40:28 (3172): No heartbeat from core client for 30 sec - exiting
06:40:29 (3172): No heartbeat from core client for 30 sec - exiting
06:40:30 (3172): No heartbeat from core client for 30 sec - exiting
06:40:31 (3172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6656, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=1
Model crash detected, will try to restart...
16:18:27 (4700): No heartbeat from core client for 30 sec - exiting
16:18:28 (4700): No heartbeat from core client for 30 sec - exiting
16:18:30 (4700): No heartbeat from core client for 30 sec - exiting
16:18:31 (4700): No heartbeat from core client for 30 sec - exiting
16:18:32 (4700): No heartbeat from core client for 30 sec - exiting
16:18:33 (4700): No heartbeat from core client for 30 sec - exiting
16:18:34 (4700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:19:18 (4960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:19:20 (4960): No heartbeat from core client for 30 sec - exiting
06:19:21 (4960): No heartbeat from core client for 30 sec - exiting
06:19:22 (4960): No heartbeat from core client for 30 sec - exiting
06:19:23 (4960): No heartbeat from core client for 30 sec - exiting
06:19:24 (4960): No heartbeat from core client for 30 sec - exiting
06:19:25 (4960): No heartbeat from core client for 30 sec - exiting
06:19:26 (4960): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
CPDN Monitor - Quit request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
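The stderr output above is long and repetitive, so a tally per message type can make the sequence of events easier to read. The following is a rough sketch of such a tally. The regular expressions are taken from message texts that appear in the log, but the category names, the `summarize` function, and the short `sample` excerpt are assumptions for illustration. This is not a tool provided by BOINC or climateprediction.net.

```python
import re
from collections import Counter

# Message texts come from the stderr output; category names are hypothetical.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "restart_attempt": re.compile(r"Model crash detected, will try to restart"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "invalid_theta": re.compile(r"Model crashed: ATM_DYN : INVALID THETA DETECTED"),
}

def summarize(stderr_text):
    """Count how many lines fall into each message category."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
                break  # a line is counted in at most one category
    return counts

# A short excerpt in the style of the log, used only to exercise the function.
sample = """\
Model crash detected, will try to restart...
17:21:15 (5768): No heartbeat from core client for 30 sec - exiting
17:21:16 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Model crashed: ATM_DYN : INVALID THETA DETECTED.
Sorry, too many model crashes! :-(
"""

counts = summarize(sample)
print(dict(counts))
```

Lines that match none of the patterns, such as the final "too many model crashes" message, are simply not counted. In the full log the INVALID THETA message appears six times before the task gives up and calls `boinc_finish`.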
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
19 Apr 2013 20:13:31  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  233,280   676,124         2.8983
18 Apr 2013 19:47:04  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  207,360   599,530         2.8913
14 Apr 2013 13:35:05  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  181,440   524,525         2.8909
13 Apr 2013 17:23:43  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  155,520   450,356         2.8958
12 Apr 2013 12:35:41  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  129,600   374,468         2.8894
10 Apr 2013 17:01:16  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  103,680   299,271         2.8865
06 Apr 2013 22:14:13  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  77,760    225,687         2.9024
05 Apr 2013 17:48:03  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  51,840    148,557         2.8657
04 Apr 2013 19:17:19  1242391  15698468   hadcm3n_4n4v_1940_40_008306555_4  25,920    74,806          2.8860
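The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the trickle rows above. The numbers are copied from the table, while the variable names, the `average_sec_per_timestep` helper, and the tolerance are assumptions for illustration.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table,
# newest row first.
trickles = [
    (233280, 676124, 2.8983),
    (207360, 599530, 2.8913),
    (181440, 524525, 2.8909),
    (155520, 450356, 2.8958),
    (129600, 374468, 2.8894),
    (103680, 299271, 2.8865),
    (77760, 225687, 2.9024),
    (51840, 148557, 2.8657),
    (25920, 74806, 2.8860),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep

# Largest gap between the recomputed and the reported average.
max_error = max(
    abs(average_sec_per_timestep(cpu, ts) - reported)
    for ts, cpu, reported in trickles
)

# Difference in timestep count between consecutive trickles.
steps_between = {a[0] - b[0] for a, b in zip(trickles, trickles[1:])}
print(max_error < 1e-4, steps_between)
```

Every row agrees with the reported average to within rounding, and consecutive trickles are 25,920 timesteps apart.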


©2024 climateprediction.net