Task 13098717

Name hadcm3n_yb8n_1900_40_007347425_1
Workunit 7544855
Created 6 Jul 2011, 13:44:08 UTC
Sent 19 Jul 2011, 0:43:28 UTC
Report deadline 18 Oct 2011, 8:10:39 UTC
Received 7 Aug 2011, 9:15:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1152409
Run time 6 days 8 hours 20 min 16 sec
CPU time 4 days 22 hours 19 min 19 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.76 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-apple-darwin)
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 62 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/yb8nko.pjb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pjb0c10
Error: Input file: dataout/yb8nko.pib0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pib0c10
Error: Input file: dataout/yb8nko.pfb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pfb0c10
Error: Input file: dataout/yb8nko.pcb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pcb0c10
Error: Input file: dataout/yb8nko.pbb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pbb0c10
Error: Input file: dataout/yb8nko.pab0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nko.pab0c10
Error: Input file: dataout/yb8nka.phb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nka.phb0c10
Error: Input file: dataout/yb8nka.pgb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nka.pgb0c10
Error: Input file: dataout/yb8nka.peb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nka.peb0c10
Error: Input file: dataout/yb8nka.pdb0c10 is not a valid UM file.
Error converting file to netcdf: dataout/yb8nka.pdb0c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:46:46 (9094): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:14:05 (10948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:30:52 (22126): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:35:30 (22130): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:47:10 (22169): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:04:52 (22246): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:29:33 (22388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:05:26 (22562): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:08:28 (23227): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:08:29 (23227): No heartbeat from core client for 30 sec - exiting
01:08:30 (23227): No heartbeat from core client for 30 sec - exiting
01:52:05 (24059): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
03:12:37 (24359): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:39:57 (24900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:05:35 (25571): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:47 (25780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:08:54 (25785): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:09:34 (25797): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 136400) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25995, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
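The stderr text above repeats a handful of message types many times, so a tally can make the sequence easier to read. The following is a minimal sketch, not a project tool: the label names in `PATTERNS` are hypothetical, the substrings are copied from the log, and `sample` holds only a few of its lines.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that appear in the stderr text.
PATTERNS = {
    "quit_request": "Quit request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "buffin_read_failed": "BUFFIN: Read Failed",
    "invalid_um_file": "is not a valid UM file",
    "execl_failed": "execl(",
}


def tally(stderr_text: str) -> Counter:
    """Count how many lines contain each known substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts


# A few lines copied from the log above, for illustration only.
sample = """CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1
Error: Input file: dataout/yb8nko.pjb0c10 is not a valid UM file.
01:46:46 (9094): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
"""

if __name__ == "__main__":
    for label, count in tally(sample).items():
        print(label, count)
```

Running the same function over the full stderr text would give the totals for this task.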
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
07 Aug 2011 05:53:38 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 285,120 | 417,441 | 1.4641
26 Jul 2011 15:26:12 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 259,200 | 380,261 | 1.4671
25 Jul 2011 23:05:22 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 233,280 | 342,176 | 1.4668
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 207,360 | 304,124 | 1.4666
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 181,440 | 266,063 | 1.4664
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 155,520 | 228,329 | 1.4682
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 129,600 | 190,527 | 1.4701
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 103,680 | 152,512 | 1.4710
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 77,760 | 114,298 | 1.4699
25 Jul 2011 23:05:21 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 51,840 | 76,255 | 1.4710
25 Jul 2011 23:04:24 | 1152409 | 13098717 | hadcm3n_yb8n_1900_40_007347425_1 | 25,920 | 38,152 | 1.4719
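
The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table; the function name `seconds_per_timestep` is hypothetical, and rounding to four decimal places is an assumption based on how the column is displayed.

```python
# (timestep, cumulative CPU time in seconds), copied from the trickle table.
TRICKLES = [
    (285_120, 417_441),
    (259_200, 380_261),
    (25_920, 38_152),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimal places."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds in TRICKLES:
        print(timestep, seconds_per_timestep(timestep, cpu_seconds))
```

For these rows the computed values match the table (1.4641, 1.4671, and 1.4719). Successive trickles are 25,920 timesteps apart.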

