climateprediction.net home page
Task 15544795

Task 15544795

Name hadcm3n_o7fm_2140_40_008269875_1
Workunit 8424999
Created 15 Jan 2013, 15:20:51 UTC
Sent 15 Jan 2013, 15:20:58 UTC
Report deadline 16 Apr 2013, 22:48:09 UTC
Received 2 Feb 2013, 18:36:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1003311
Run time 17 days 21 hours 38 min 35 sec
CPU time 16 days 3 hours 17 min 25 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.34 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
20:53:26 (4116): No heartbeat from core client for 30 sec - exiting
20:53:27 (4116): No heartbeat from core client for 30 sec - exiting
20:53:28 (4116): No heartbeat from core client for 30 sec - exiting
20:53:29 (4116): No heartbeat from core client for 30 sec - exiting
20:53:30 (4116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3348, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1
Model crash detected, will try to restart...
MainError:	12:11:20 AM	No files match the supplied pattern.
MainError:	12:11:20 AM	No files match the supplied pattern.
MainError:	02:31:48 AM	No files match the supplied pattern.
MainError:	02:31:48 AM	No files match the supplied pattern.
MainError:	04:13:07 PM	No files match the supplied pattern.
MainError:	04:13:07 PM	No files match the supplied pattern.
MainError:	06:25:56 AM	No files match the supplied pattern.
MainError:	06:25:56 AM	No files match the supplied pattern.
MainError:	07:58:20 PM	No files match the supplied pattern.
MainError:	07:58:20 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	09:22:21 AM	No files match the supplied pattern.
MainError:	09:22:21 AM	No files match the supplied pattern.
MainError:	12:31:13 AM	No files match the supplied pattern.
MainError:	12:31:13 AM	No files match the supplied pattern.
MainError:	02:43:53 PM	No files match the supplied pattern.
MainError:	02:43:53 PM	No files match the supplied pattern.
MainError:	05:41:58 AM	No files match the supplied pattern.
MainError:	05:41:58 AM	No files match the supplied pattern.
MainError:	09:35:18 PM	No files match the supplied pattern.
MainError:	09:35:18 PM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=1
Model crash detected, will try to restart...
18:49:31 (3376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Error converting file to netcdf: dataout/o7fmka.ph11c10
Error converting file to netcdf: dataout/o7fmka.pg11c10
Error converting file to netcdf: dataout/o7fmka.pe11c10
MainError:	12:53:58 AM	No files match the supplied pattern.
MainError:	12:53:58 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Feb 2013 12:55:15 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 777,600 1,403,744 1.8052
01 Feb 2013 21:52:42 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 751,680 1,355,777 1.8037
01 Feb 2013 06:16:20 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 725,760 1,304,973 1.7981
31 Jan 2013 14:45:32 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 699,840 1,255,095 1.7934
31 Jan 2013 00:34:14 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 673,920 1,206,582 1.7904
30 Jan 2013 09:23:14 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 648,000 1,157,733 1.7866
29 Jan 2013 20:31:26 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 622,080 1,112,501 1.7884
29 Jan 2013 06:29:10 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 596,160 1,068,356 1.7921
28 Jan 2013 16:15:42 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 570,240 1,022,042 1.7923
28 Jan 2013 02:33:31 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 544,320 977,399 1.7956
27 Jan 2013 12:11:41 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 518,400 931,399 1.7967
26 Jan 2013 20:57:49 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 492,480 884,127 1.7953
26 Jan 2013 06:19:31 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 466,560 836,986 1.7940
25 Jan 2013 16:25:47 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 440,640 788,667 1.7898
25 Jan 2013 01:34:03 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 414,720 742,483 1.7903
24 Jan 2013 12:01:46 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 388,800 697,893 1.7950
23 Jan 2013 22:29:07 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 362,880 653,168 1.8000
23 Jan 2013 08:33:11 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 336,960 607,784 1.8037
22 Jan 2013 18:44:46 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 311,040 563,135 1.8105
22 Jan 2013 04:49:45 1003311 15544795 hadcm3n_o7fm_2140_40_008269875_1 285,120 518,631 1.8190


©2024 cpdn.org