climateprediction.net home page
Task 12849427

Task 12849427

Name hadcm3n_p2a5_1900_40_007220541_2
Workunit 7418781
Created 2 May 2011, 20:01:09 UTC
Sent 2 May 2011, 20:05:39 UTC
Report deadline 2 Aug 2011, 3:32:50 UTC
Received 24 May 2011, 17:42:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 255313
Run time
CPU time 13 days 6 hours 8 min 53 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 1.27 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>5.2.7</core_client_version>
<message>The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Ocean Restart file copy failed on p2a5ko.daa1450
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:51:30 (3044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
01:05:36 (5972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/p2a5ko.pja3c10
Error converting file to netcdf: dataout/p2a5ko.pia3c10
Error converting file to netcdf: dataout/p2a5ko.pfa3c10
Error converting file to netcdf: dataout/p2a5ka.pha3c10
Error converting file to netcdf: dataout/p2a5ka.pga3c10
Error converting file to netcdf: dataout/p2a5ka.pea3c10
Error converting file to netcdf: dataout/p2a5ka.pda3c10
15:03:13 (8672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
03:19:51 (2944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
18:24:56 (2588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:15:34 (496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
04:12:56 (4380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:17:03 (4168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1
Model crash detected, will try to restart...
10:31:04 (5332): No heartbeat from core client for 30 sec - exiting
10:31:05 (5332): No heartbeat from core client for 30 sec - exiting
10:31:06 (5332): No heartbeat from core client for 30 sec - exiting
10:31:07 (5332): No heartbeat from core client for 30 sec - exiting
10:31:08 (5332): No heartbeat from core client for 30 sec - exiting
10:31:09 (5332): No heartbeat from core client for 30 sec - exiting
10:31:10 (5332): No heartbeat from core client for 30 sec - exiting
10:31:11 (5332): No heartbeat from core client for 30 sec - exiting
10:31:13 (5332): No heartbeat from core client for 30 sec - exiting
10:31:14 (5332): No heartbeat from core client for 30 sec - exiting
10:31:15 (5332): No heartbeat from core client for 30 sec - exiting
10:31:16 (5332): No heartbeat from core client for 30 sec - exiting
10:31:17 (5332): No heartbeat from core client for 30 sec - exiting
10:31:18 (5332): No heartbeat from core client for 30 sec - exiting
10:31:19 (5332): No heartbeat from core client for 30 sec - exiting
10:31:21 (5332): No heartbeat from core client for 30 sec - exiting
10:31:22 (5332): No heartbeat from core client for 30 sec - exiting
10:31:23 (5332): No heartbeat from core client for 30 sec - exiting
10:31:24 (5332): No heartbeat from core client for 30 sec - exiting
10:31:26 (5332): No heartbeat from core client for 30 sec - exiting
10:31:27 (5332): No heartbeat from core client for 30 sec - exiting
10:31:28 (5332): No heartbeat from core client for 30 sec - exiting
10:31:29 (5332): No heartbeat from core client for 30 sec - exiting
10:31:30 (5332): No heartbeat from core client for 30 sec - exiting
10:31:31 (5332): No heartbeat from core client for 30 sec - exiting
10:31:33 (5332): No heartbeat from core client for 30 sec - exiting
10:31:34 (5332): No heartbeat from core client for 30 sec - exiting
10:31:35 (5332): No heartbeat from core client for 30 sec - exiting
10:31:36 (5332): No heartbeat from core client for 30 sec - exiting
10:31:37 (5332): No heartbeat from core client for 30 sec - exiting
10:31:38 (5332): No heartbeat from core client for 30 sec - exiting
10:31:39 (5332): No heartbeat from core client for 30 sec - exiting
10:31:40 (5332): No heartbeat from core client for 30 sec - exiting
10:31:41 (5332): No heartbeat from core client for 30 sec - exiting
10:31:42 (5332): No heartbeat from core client for 30 sec - exiting
10:31:43 (5332): No heartbeat from core client for 30 sec - exiting
10:31:44 (5332): No heartbeat from core client for 30 sec - exiting
10:31:45 (5332): No heartbeat from core client for 30 sec - exiting
10:31:46 (5332): No heartbeat from core client for 30 sec - exiting
10:31:47 (5332): No heartbeat from core client for 30 sec - exiting
10:31:48 (5332): No heartbeat from core client for 30 sec - exiting
10:31:50 (5332): No heartbeat from core client for 30 sec - exiting
10:31:51 (5332): No heartbeat from core client for 30 sec - exiting
10:31:52 (5332): No heartbeat from core client for 30 sec - exiting
10:31:53 (5332): No heartbeat from core client for 30 sec - exiting
10:31:54 (5332): No heartbeat from core client for 30 sec - exiting
10:31:55 (5332): No heartbeat from core client for 30 sec - exiting
10:31:56 (5332): No heartbeat from core client for 30 sec - exiting
10:31:57 (5332): No heartbeat from core client for 30 sec - exiting
10:31:58 (5332): No heartbeat from core client for 30 sec - exiting
10:31:59 (5332): No heartbeat from core client for 30 sec - exiting
10:32:00 (5332): No heartbeat from core client for 30 sec - exiting
10:32:01 (5332): No heartbeat from core client for 30 sec - exiting
10:32:03 (5332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6940, iMonCtr=1
Model crash detected, will try to restart...
21:57:24 (6940): No heartbeat from core client for 30 sec - exiting
21:57:25 (6940): No heartbeat from core client for 30 sec - exiting
21:57:26 (6940): No heartbeat from core client for 30 sec - exiting
21:57:27 (6940): No heartbeat from core client for 30 sec - exiting
21:57:28 (6940): No heartbeat from core client for 30 sec - exiting
21:57:29 (6940): No heartbeat from core client for 30 sec - exiting
21:57:30 (6940): No heartbeat from core client for 30 sec - exiting
21:57:31 (6940): No heartbeat from core client for 30 sec - exiting
21:57:32 (6940): No heartbeat from core client for 30 sec - exiting
21:57:33 (6940): No heartbeat from core client for 30 sec - exiting
21:57:34 (6940): No heartbeat from core client for 30 sec - exiting
21:57:35 (6940): No heartbeat from core client for 30 sec - exiting
21:57:36 (6940): No heartbeat from core client for 30 sec - exiting
21:57:37 (6940): No heartbeat from core client for 30 sec - exiting
21:57:38 (6940): No heartbeat from core client for 30 sec - exiting
21:57:39 (6940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9772, iMonCtr=1
Model crash detected, will try to restart...
09:36:18 (9772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7296, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7296, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7296, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 May 2011 06:21:15 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 233,280 1,115,131 4.7802
19 May 2011 09:14:08 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 207,360 989,765 4.7732
17 May 2011 19:19:37 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 181,440 864,539 4.7649
14 May 2011 23:26:05 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 155,520 741,275 4.7664
13 May 2011 14:12:45 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 129,600 618,064 4.7690
11 May 2011 20:05:54 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 103,680 495,135 4.7756
10 May 2011 06:09:13 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 77,760 371,833 4.7818
08 May 2011 17:07:23 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 51,840 247,844 4.7809
04 May 2011 09:01:43 255313 12849427 hadcm3n_p2a5_1900_40_007220541_2 25,920 122,883 4.7409


©2024 climateprediction.net