Task 14360033

Name hadcm3n_o7pp_2020_40_007857140_0
Workunit 8012252
Created 4 Apr 2012, 21:13:53 UTC
Sent 4 Apr 2012, 23:25:08 UTC
Report deadline 5 Jul 2012, 6:52:19 UTC
Received 11 Apr 2012, 9:18:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1168700
Run time 4 days 17 hours 15 min 14 sec
CPU time 4 days 12 hours 53 min 31 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 2.36 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
07:55:43 (24928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:55:45 (24928): No heartbeat from core client for 30 sec - exiting
07:55:46 (24928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:34:45 (37448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:25:50 (42336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:01:15 (48972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=49496, selfPID=49496, iMonCtr=1
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o7ppko.pjm5c10
Error converting file to netcdf: dataout/o7ppko.pim5c10
Error converting file to netcdf: dataout/o7ppko.pfm5c10
Error converting file to netcdf: dataout/o7ppka.phm5c10
Error converting file to netcdf: dataout/o7ppka.pgm5c10
Error converting file to netcdf: dataout/o7ppka.pem5c10
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o7ppko.pjm5c10
Error converting file to netcdf: dataout/o7ppko.pim5c10
Error converting file to netcdf: dataout/o7ppko.pfm5c10
Error converting file to netcdf: dataout/o7ppka.phm5c10
Error converting file to netcdf: dataout/o7ppka.pgm5c10
Error converting file to netcdf: dataout/o7ppka.pem5c10
Error converting file to netcdf: dataout/o7ppka.pdm5c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:10:12 (48996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:13:32 (61008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:07:20 (65520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
09:21:28 (7932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
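The stderr log above repeats a small number of message types many times. One rough way to tally them is sketched below. This is only an illustration: the `PATTERNS` table and the `tally` function are hypothetical names, not part of BOINC or CPDN, and the sample lines are a short excerpt copied from the log rather than the full text.

```python
import re
from collections import Counter

# Hypothetical labels mapped to patterns taken from messages seen in the log.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "buffin_io_error": re.compile(r"^BUFFIN: C I/O Error"),
    "netcdf_error": re.compile(r"^Error converting file to netcdf"),
    "model_crash": re.compile(r"^Model crash detected"),
}

def tally(lines):
    """Count how many lines match each pattern (first match wins)."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# A short excerpt copied from the stderr text above.
sample = [
    "CPDN Monitor - Quit request from BOINC...",
    "07:55:43 (24928): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "BUFFIN: C I/O Error feof - Unit 64 - Return code = 16",
    "Error converting file to netcdf: dataout/o7ppko.pjm5c10",
    "Model crash detected, will try to restart...",
    "Called boinc_finish",
]

for label, count in tally(sample).items():
    print(label, count)
```

Lines that match none of the patterns, such as "Called boinc_finish", are simply left uncounted.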
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
10 Apr 2012 04:26:26  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  207,360   363,408         1.7525
09 Apr 2012 14:25:52  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  181,440   318,419         1.7550
08 Apr 2012 23:54:50  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  155,520   270,995         1.7425
08 Apr 2012 10:20:34  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  129,600   223,140         1.7218
07 Apr 2012 07:40:47  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  103,680   179,000         1.7265
06 Apr 2012 17:21:30  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  77,760    136,339         1.7533
06 Apr 2012 03:55:54  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  51,840    92,466          1.7837
05 Apr 2012 13:51:43  1168700  14360033   hadcm3n_o7pp_2020_40_007857140_0  25,920    47,062          1.8157
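
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that assumed relationship against the rows above. The `trickles` list and the `average_sec_per_ts` helper are illustrative names, not part of BOINC or CPDN, and the figures are copied from the table.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
trickles = [
    (207360, 363408, 1.7525),
    (181440, 318419, 1.7550),
    (155520, 270995, 1.7425),
    (129600, 223140, 1.7218),
    (103680, 179000, 1.7265),
    (77760, 136339, 1.7533),
    (51840, 92466, 1.7837),
    (25920, 47062, 1.8157),
]

def average_sec_per_ts(cpu_time_sec, timestep):
    """Assumed relationship: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_ts(cpu_time_sec, timestep)
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

Under that assumption the computed values agree with the reported column to within rounding. Each trickle in this table arrives 25,920 timesteps after the previous one.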

