climateprediction.net home page
Task 14767491

Task 14767491

Name hadcm3n_o4st_1980_40_007834271_4
Workunit 7989383
Created 2 Jun 2012, 17:03:23 UTC
Sent 2 Jun 2012, 17:03:41 UTC
Report deadline 2 Sep 2012, 0:30:52 UTC
Received 29 Jul 2012, 18:55:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1184169
Run time 9 days 1 hours 15 min 29 sec
CPU time 8 days 16 hours 50 min 15 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 3.71 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:44:44 (7140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:56:00 (3356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:32:16 (7588): No heartbeat from core client for 30 sec - exiting
20:32:17 (7588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:33:42 (8176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o4stko.pji5c10
Error converting file to netcdf: dataout/o4stko.pii5c10
Error converting file to netcdf: dataout/o4stko.pfi5c10
Error converting file to netcdf: dataout/o4stka.phi5c10
Error converting file to netcdf: dataout/o4stka.pgi5c10
Error converting file to netcdf: dataout/o4stka.pei5c10
Error converting file to netcdf: dataout/o4stka.pdi5c10
CPDN Monitor - Quit request from BOINC...
19:46:34 (788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:55:00 (1888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:37:19 (2944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:33:23 (8392): No heartbeat from core client for 30 sec - exiting
21:33:24 (8392): No heartbeat from core client for 30 sec - exiting
21:33:26 (8392): No heartbeat from core client for 30 sec - exiting
21:33:27 (8392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:36:24 (7840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3756, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Jul 2012 20:21:07 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 725,760 734,970 1.0127
26 Jul 2012 18:34:29 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 699,840 709,027 1.0131
25 Jul 2012 15:13:34 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 673,920 682,867 1.0133
25 Jul 2012 06:56:51 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 648,000 656,813 1.0136
22 Jul 2012 18:57:27 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 622,080 630,593 1.0137
22 Jul 2012 10:07:08 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 596,160 604,556 1.0141
21 Jul 2012 17:49:37 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 570,240 578,189 1.0139
20 Jul 2012 20:18:13 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 544,320 552,019 1.0141
16 Jul 2012 18:34:51 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 518,400 525,678 1.0140
14 Jul 2012 18:56:56 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 492,480 499,479 1.0142
14 Jul 2012 11:35:31 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 466,560 473,044 1.0139
11 Jul 2012 21:13:43 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 440,640 447,045 1.0145
10 Jul 2012 16:22:24 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 414,720 420,979 1.0151
10 Jul 2012 08:05:15 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 388,800 394,945 1.0158
08 Jul 2012 17:25:49 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 362,880 368,472 1.0154
30 Jun 2012 09:17:19 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 336,960 342,206 1.0156
27 Jun 2012 21:37:51 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 311,040 315,857 1.0155
24 Jun 2012 15:57:35 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 285,120 289,513 1.0154
23 Jun 2012 21:22:31 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 259,200 263,402 1.0162
23 Jun 2012 13:54:06 1184169 14767491 hadcm3n_o4st_1980_40_007834271_4 233,280 237,303 1.0172


©2024 cpdn.org