climateprediction.net (CPDN) home page
Task 15896897

Task 15896897

Name hadcm3n_o4hy_2140_40_008269154_3
Workunit 8424278
Created 19 Jul 2013, 0:46:04 UTC
Sent 19 Jul 2013, 0:47:00 UTC
Report deadline 18 Oct 2013, 8:14:11 UTC
Received 14 Aug 2013, 21:43:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1286770
Run time 10 days 14 hours 53 min 7 sec
CPU time 9 days 0 hours 17 min 14 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 4.03 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=1
Model crash detected, will try to restart...
00:36:41 (3560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:36:42 (3560): No heartbeat from core client for 30 sec - exiting
00:36:43 (3560): No heartbeat from core client for 30 sec - exiting
00:36:44 (3560): No heartbeat from core client for 30 sec - exiting
00:36:45 (3560): No heartbeat from core client for 30 sec - exiting
00:36:46 (3560): No heartbeat from core client for 30 sec - exiting
00:36:47 (3560): No heartbeat from core client for 30 sec - exiting
00:36:48 (3560): No heartbeat from core client for 30 sec - exiting
00:36:49 (3560): No heartbeat from core client for 30 sec - exiting
00:36:50 (3560): No heartbeat from core client for 30 sec - exiting
00:36:51 (3560): No heartbeat from core client for 30 sec - exiting
MainError:	04:19:10 PM	No files match the supplied pattern.
MainError:	04:19:10 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	12:18:24 AM	No files match the supplied pattern.
MainError:	12:18:24 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
MainError:	09:47:02 AM	No files match the supplied pattern.
MainError:	09:47:02 AM	No files match the supplied pattern.
MainError:	04:01:28 AM	No files match the supplied pattern.
MainError:	04:01:28 AM	No files match the supplied pattern.
MainError:	10:15:13 PM	No files match the supplied pattern.
MainError:	10:15:13 PM	No files match the supplied pattern.
MainError:	06:43:57 AM	No files match the supplied pattern.
MainError:	06:43:57 AM	No files match the supplied pattern.
MainError:	05:00:32 AM	No files match the supplied pattern.
MainError:	05:00:32 AM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1
Model crash detected, will try to restart...
MainError:	10:42:57 PM	No files match the supplied pattern.
MainError:	10:42:57 PM	No files match the supplied pattern.
MainError:	05:43:15 AM	No files match the supplied pattern.
MainError:	05:43:15 AM	No files match the supplied pattern.
MainError:	03:33:16 AM	No files match the supplied pattern.
MainError:	03:33:16 AM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o4hyka.ph11c10
Error converting file to netcdf: dataout/o4hyka.pg11c10
Error converting file to netcdf: dataout/o4hyka.pe11c10
MainError:	11:52:06 AM	No files match the supplied pattern.
MainError:	11:52:06 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 777,600 815,936 1.0493
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 751,680 789,718 1.0506
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 725,760 764,372 1.0532
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 699,840 739,932 1.0573
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 673,920 712,969 1.0579
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 648,000 685,952 1.0586
14 Aug 2013 21:46:03 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 622,080 656,745 1.0557
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 596,160 631,214 1.0588
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 570,240 605,366 1.0616
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 544,320 579,023 1.0638
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 518,400 552,709 1.0662
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 492,480 526,137 1.0683
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 466,560 499,401 1.0704
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 440,640 472,682 1.0727
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 414,720 446,341 1.0762
30 Jul 2013 09:47:17 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 388,800 420,098 1.0805
26 Jul 2013 02:34:59 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 362,880 393,994 1.0857
25 Jul 2013 07:29:27 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 336,960 367,470 1.0905
24 Jul 2013 23:25:26 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 311,040 337,729 1.0858
24 Jul 2013 03:01:15 1286770 15896897 hadcm3n_o4hy_2140_40_008269154_3 285,120 310,369 1.0886


©2024 cpdn.org