Task 15516812

Name hadcm3n_z99u_1920_40_008280655_0
Workunit 8431790
Created 29 Dec 2012, 15:09:54 UTC
Sent 29 Dec 2012, 16:26:15 UTC
Report deadline 30 Mar 2013, 23:53:26 UTC
Received 13 Jan 2013, 15:56:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1210909
Run time 12 days 17 hours 18 min 40 sec
CPU time 12 days 13 hours 33 min 52 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:43:03 (3336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:13:56 (3880): No heartbeat from core client for 30 sec - exiting
11:13:57 (3880): No heartbeat from core client for 30 sec - exiting
11:13:58 (3880): No heartbeat from core client for 30 sec - exiting
11:14:00 (3880): No heartbeat from core client for 30 sec - exiting
11:14:01 (3880): No heartbeat from core client for 30 sec - exiting
11:14:02 (3880): No heartbeat from core client for 30 sec - exiting
11:14:03 (3880): No heartbeat from core client for 30 sec - exiting
11:14:04 (3880): No heartbeat from core client for 30 sec - exiting
11:14:05 (3880): No heartbeat from core client for 30 sec - exiting
11:14:06 (3880): No heartbeat from core client for 30 sec - exiting
11:14:07 (3880): No heartbeat from core client for 30 sec - exiting
11:14:08 (3880): No heartbeat from core client for 30 sec - exiting
11:14:09 (3880): No heartbeat from core client for 30 sec - exiting
11:14:10 (3880): No heartbeat from core client for 30 sec - exiting
11:14:12 (3880): No heartbeat from core client for 30 sec - exiting
11:14:13 (3880): No heartbeat from core client for 30 sec - exiting
11:14:14 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:49:09 (3264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:49:10 (3264): No heartbeat from core client for 30 sec - exiting
12:49:11 (3264): No heartbeat from core client for 30 sec - exiting
12:49:12 (3264): No heartbeat from core client for 30 sec - exiting
12:49:13 (3264): No heartbeat from core client for 30 sec - exiting
12:49:14 (3264): No heartbeat from core client for 30 sec - exiting
12:49:15 (3264): No heartbeat from core client for 30 sec - exiting
12:49:16 (3264): No heartbeat from core client for 30 sec - exiting
12:49:17 (3264): No heartbeat from core client for 30 sec - exiting
12:49:18 (3264): No heartbeat from core client for 30 sec - exiting
12:49:19 (3264): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
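The stderr output above repeats the same "No heartbeat" message many times under a few process IDs. A minimal sketch of how such lines could be tallied per process follows. It assumes the line shape `HH:MM:SS (PID): message` seen in this log; the names `LINE` and `count_by_pid` are hypothetical and not part of BOINC or CPDN.

```python
import re
from collections import Counter

# A few lines copied verbatim from the stderr output above.
sample = """\
08:43:03 (3336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:13:56 (3880): No heartbeat from core client for 30 sec - exiting
11:13:57 (3880): No heartbeat from core client for 30 sec - exiting
12:49:09 (3264): No heartbeat from core client for 30 sec - exiting
"""

# Assumed line shape (taken from this log only): HH:MM:SS (PID): message
LINE = re.compile(r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): (.*)$")


def count_by_pid(text):
    """Count 'No heartbeat' messages per process ID (hypothetical helper)."""
    counts = Counter()
    for line in text.splitlines():
        match = LINE.match(line)
        # Lines without a timestamp and PID (the CPDN Monitor lines) are skipped.
        if match and match.group(3).startswith("No heartbeat"):
            counts[match.group(2)] += 1
    return counts


counts = count_by_pid(sample)
for pid, n in counts.items():
    print(pid, n)
```

Run against the full log, the same tally would show how many heartbeat timeouts each process reported before the model gave up restarting.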
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
13 Jan 2013 03:33:39  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  725,760   1,050,285       1.4472
12 Jan 2013 17:11:04  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  699,840   1,011,314       1.4451
12 Jan 2013 06:18:52  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  673,920   971,845         1.4421
11 Jan 2013 18:38:23  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  648,000   932,410         1.4389
11 Jan 2013 07:14:01  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  622,080   892,948         1.4354
10 Jan 2013 21:06:27  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  596,160   853,564         1.4318
10 Jan 2013 09:13:44  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  570,240   814,357         1.4281
09 Jan 2013 22:16:40  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  544,320   775,315         1.4244
09 Jan 2013 10:40:31  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  518,400   735,859         1.4195
08 Jan 2013 23:33:03  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  492,480   696,257         1.4138
08 Jan 2013 12:30:08  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  466,560   656,833         1.4078
08 Jan 2013 01:25:25  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  440,640   617,423         1.4012
07 Jan 2013 14:21:19  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  414,720   578,011         1.3937
07 Jan 2013 02:49:43  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  388,800   538,737         1.3856
06 Jan 2013 16:47:51  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  362,880   500,242         1.3785
06 Jan 2013 05:55:34  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  336,960   462,095         1.3714
05 Jan 2013 19:09:24  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  311,040   425,363         1.3676
05 Jan 2013 09:05:53  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  285,120   389,160         1.3649
04 Jan 2013 23:06:56  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  259,200   353,162         1.3625
03 Jan 2013 22:47:48  1210909  15516812   hadcm3n_z99u_1920_40_008280655_0  233,280   317,246         1.3599
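
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical, and the formula is inferred from the numbers, not taken from CPDN documentation.

```python
# Values copied from the first three trickle rows above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
rows = [
    (725_760, 1_050_285, 1.4472),
    (699_840, 1_011_314, 1.4451),
    (673_920, 971_845, 1.4421),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds per completed timestep."""
    return cpu_seconds / timestep


computed = [average_sec_per_timestep(cpu, ts) for ts, cpu, _ in rows]

for (ts, cpu, reported), value in zip(rows, computed):
    # Print the computed value next to the reported one for comparison.
    print(f"{ts:>7}  computed {value:.4f}  reported {reported:.4f}")
```

Under this assumption the computed values agree with the reported column to four decimal places for these rows, and the slowly rising average suggests later timesteps cost slightly more CPU time than earlier ones.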