climateprediction.net home page
Task 13094357

Task 13094357

Name hadcm3n_y9k4_1900_40_007345246_0
Workunit 7542676
Created 6 Jul 2011, 13:30:00 UTC
Sent 20 Jul 2011, 13:47:22 UTC
Report deadline 19 Oct 2011, 21:14:33 UTC
Received 21 Aug 2011, 15:17:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1067051
Run time 6 days 9 hours 6 min 31 sec
CPU time 5 days 19 hours 6 min 39 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.47 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2780, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
08:43:00 (5644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:09:04 (4040): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
10:09:06 (4040): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=1
Model crash detected, will try to restart...
08:50:14 (5344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=404, iMonCtr=1
Model crash detected, will try to restart...
09:06:15 (5812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:42:00 (5584): No heartbeat from core client for 30 sec - exiting
08:42:01 (5584): No heartbeat from core client for 30 sec - exiting
08:42:02 (5584): No heartbeat from core client for 30 sec - exiting
08:42:03 (5584): No heartbeat from core client for 30 sec - exiting
08:42:04 (5584): No heartbeat from core client for 30 sec - exiting
08:42:05 (5584): No heartbeat from core client for 30 sec - exiting
08:42:06 (5584): No heartbeat from core client for 30 sec - exiting
08:42:08 (5584): No heartbeat from core client for 30 sec - exiting
08:42:09 (5584): No heartbeat from core client for 30 sec - exiting
08:42:10 (5584): No heartbeat from core client for 30 sec - exiting
08:42:11 (5584): No heartbeat from core client for 30 sec - exiting
08:42:12 (5584): No heartbeat from core client for 30 sec - exiting
08:42:13 (5584): No heartbeat from core client for 30 sec - exiting
08:42:14 (5584): No heartbeat from core client for 30 sec - exiting
08:42:15 (5584): No heartbeat from core client for 30 sec - exiting
08:42:16 (5584): No heartbeat from core client for 30 sec - exiting
08:42:17 (5584): No heartbeat from core client for 30 sec - exiting
08:42:18 (5584): No heartbeat from core client for 30 sec - exiting
08:42:20 (5584): No heartbeat from core client for 30 sec - exiting
08:42:21 (5584): No heartbeat from core client for 30 sec - exiting
08:42:22 (5584): No heartbeat from core client for 30 sec - exiting
08:42:23 (5584): No heartbeat from core client for 30 sec - exiting
08:42:24 (5584): No heartbeat from core client for 30 sec - exiting
08:42:25 (5584): No heartbeat from core client for 30 sec - exiting
08:42:26 (5584): No heartbeat from core client for 30 sec - exiting
08:42:27 (5584): No heartbeat from core client for 30 sec - exiting
08:42:28 (5584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:42:29 (5584): No heartbeat from core client for 30 sec - exiting
08:38:10 (6132): No heartbeat from core client for 30 sec - exiting
08:38:11 (6132): No heartbeat from core client for 30 sec - exiting
08:38:13 (6132): No heartbeat from core client for 30 sec - exiting
08:38:14 (6132): No heartbeat from core client for 30 sec - exiting
08:38:15 (6132): No heartbeat from core client for 30 sec - exiting
08:38:16 (6132): No heartbeat from core client for 30 sec - exiting
08:38:17 (6132): No heartbeat from core client for 30 sec - exiting
08:38:18 (6132): No heartbeat from core client for 30 sec - exiting
08:38:19 (6132): No heartbeat from core client for 30 sec - exiting
08:38:20 (6132): No heartbeat from core client for 30 sec - exiting
08:38:21 (6132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5168, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
17:29:44 (5936): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Aug 2011 15:16:49 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 285,120 476,726 1.6720
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 259,200 432,821 1.6698
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 233,280 389,298 1.6688
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 207,360 346,018 1.6687
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 181,440 303,277 1.6715
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 155,520 260,275 1.6736
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 129,600 216,667 1.6718
14 Aug 2011 19:58:27 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 103,680 173,389 1.6723
31 Jul 2011 18:19:21 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 77,760 130,581 1.6793
27 Jul 2011 16:41:59 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 51,840 87,047 1.6791
27 Jul 2011 12:43:29 1067051 13094357 hadcm3n_y9k4_1900_40_007345246_0 25,920 43,327 1.6716


©2024 climateprediction.net