Task 15072219

Name hadcm3n_o4ep_1980_40_008025893_3
Workunit 8181007
Created 7 Aug 2012, 15:00:53 UTC
Sent 7 Aug 2012, 15:01:03 UTC
Report deadline 6 Nov 2012, 22:28:14 UTC
Received 14 Aug 2012, 22:03:22 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1165586
Run time 5 days 6 hours 4 min 55 sec
CPU time 3 days 6 hours 32 min 13 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 1.42 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
17:36:21 (4564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:18:38 (5668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:34:04 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:17:37 (212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:49:16 (1700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:49:18 (1700): No heartbeat from core client for 30 sec - exiting
00:12:37 (4304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:12:38 (4304): No heartbeat from core client for 30 sec - exiting
05:44:53 (7648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:08:05 (9208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:01:48 (3440): No heartbeat from core client for 30 sec - exiting
16:01:49 (3440): No heartbeat from core client for 30 sec - exiting
16:01:50 (3440): No heartbeat from core client for 30 sec - exiting
16:01:52 (3440): No heartbeat from core client for 30 sec - exiting
16:01:53 (3440): No heartbeat from core client for 30 sec - exiting
16:01:54 (3440): No heartbeat from core client for 30 sec - exiting
16:01:55 (3440): No heartbeat from core client for 30 sec - exiting
16:01:56 (3440): No heartbeat from core client for 30 sec - exiting
16:01:57 (3440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:08:14 (5136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
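A log like the stderr text above is easier to read once its recurring messages are counted. The following is a minimal Python sketch, not part of BOINC or CPDN tooling: the function name `tally_messages`, the category labels, and the choice of patterns are all assumptions made for illustration, and the sample is a handful of lines copied from the log rather than the full text.

```python
import re
from collections import Counter

# Hypothetical category labels mapped to message fragments seen in the log above.
PATTERNS = {
    "heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "model_crash": re.compile(r"Model crash detected"),
}

def tally_messages(stderr_text: str) -> Counter:
    """Count how many lines match each message category."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts

# A few lines copied from the stderr text above, used only as sample input.
sample = """\
17:36:21 (4564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Model crash detected, will try to restart...
"""

counts = tally_messages(sample)
print(dict(counts))
```

The monitor's own line (`No 'heartbeat' from BOINC...`) is deliberately not counted as a separate heartbeat loss here, since it appears to echo the timestamped message that precedes it.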
Latest Trickles Received
Time Sent (UTC)      | Host ID | Result ID | Result Name                      | Timestep | CPU Time (sec) | Average (sec/TS)
12 Aug 2012 18:59:02 | 1165586 | 15072219  | hadcm3n_o4ep_1980_40_008025893_3 | 51,840   | 219,586        | 4.2358
09 Aug 2012 12:57:08 | 1165586 | 15072219  | hadcm3n_o4ep_1980_40_008025893_3 | 25,920   | 110,191        | 4.2512
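
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. A minimal Python sketch that checks this against the two trickle rows above follows; the function name `avg_sec_per_timestep` is hypothetical, and the rounding to four decimals is an assumption based on how the values are displayed.

```python
def avg_sec_per_timestep(cpu_time_sec: int, timesteps: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return round(cpu_time_sec / timesteps, 4)

# (timestep, CPU time in seconds) copied from the trickle rows above.
trickles = [(51_840, 219_586), (25_920, 110_191)]

averages = [avg_sec_per_timestep(cpu, ts) for ts, cpu in trickles]
print(averages)
```

Both computed values agree with the reported averages of 4.2358 and 4.2512, which supports reading the column as a simple ratio.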

