climateprediction.net home page
Task 13164029

Task 13164029

Name hadcm3n_yi1l_1900_40_007356243_2
Workunit 7553673
Created 28 Jul 2011, 11:22:40 UTC
Sent 28 Jul 2011, 11:23:23 UTC
Report deadline 27 Oct 2011, 18:50:34 UTC
Received 26 Aug 2011, 1:36:27 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1160098
Run time 5 days 8 hours 57 min 31 sec
CPU time 5 days 1 hours 56 min 28 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 3.25 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 3 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17591, selfPID=17591, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=77389, selfPID=77389, iMonCtr=1
18:22:24 (82782): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/yi1lko.pja9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lko.pja9c10
Error: Input file: dataout/yi1lko.pia9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lko.pia9c10
Error: Input file: dataout/yi1lko.pfa9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lko.pfa9c10
Error: Input file: dataout/yi1lka.pha9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lka.pha9c10
Error: Input file: dataout/yi1lka.pga9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lka.pga9c10
Error: Input file: dataout/yi1lka.pea9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lka.pea9c10
Error: Input file: dataout/yi1lka.pda9c10 is not a valid UM file.
Error converting file to netcdf: dataout/yi1lka.pda9c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:26:08 (50276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 135970) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62515, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Aug 2011 15:16:56 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 492,480 429,113 0.8713
10 Aug 2011 04:01:59 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 466,560 404,893 0.8678
09 Aug 2011 07:07:49 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 440,640 380,689 0.8639
08 Aug 2011 10:36:41 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 414,720 356,449 0.8595
08 Aug 2011 00:01:20 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 388,800 332,168 0.8543
07 Aug 2011 06:55:11 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 362,880 307,958 0.8486
06 Aug 2011 04:04:06 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 336,960 283,716 0.8420
05 Aug 2011 02:49:07 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 311,040 259,667 0.8348
04 Aug 2011 05:43:34 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 285,120 235,350 0.8254
03 Aug 2011 11:02:51 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 259,200 211,037 0.8142
03 Aug 2011 01:31:44 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 233,280 186,973 0.8015
02 Aug 2011 02:02:43 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 207,360 162,962 0.7859
01 Aug 2011 09:23:55 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 181,440 151,922 0.8373
01 Aug 2011 00:30:56 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 155,520 127,655 0.8208
31 Jul 2011 11:56:52 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 129,600 103,590 0.7993
31 Jul 2011 02:25:53 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 103,680 79,605 0.7678
30 Jul 2011 19:22:48 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 77,760 55,642 0.7156
30 Jul 2011 17:48:56 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 51,840 47,912 0.9242
30 Jul 2011 06:37:46 1160098 13164029 hadcm3n_yi1l_1900_40_007356243_2 25,920 24,009 0.9263


©2024 cpdn.org