climateprediction.net home page
Task 14366529

Task 14366529

Name hadcm3n_ygy2_1980_40_007858454_0
Workunit 8013566
Created 5 Apr 2012, 18:59:28 UTC
Sent 5 Apr 2012, 19:06:18 UTC
Report deadline 6 Jul 2012, 2:33:29 UTC
Received 7 Jul 2012, 17:16:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 970965
Run time 34 days 16 hours 0 min 46 sec
CPU time 25 days 13 hours 17 min 5 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 1.44 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:09:43 (2516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:38:42 (3031): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:41:22 (5039): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:58:31 (9306): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:02:49 (9330): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:49:24 (9341): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/ygy2ko.pji3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ko.pji3c10
Error: Input file: dataout/ygy2ko.pii3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ko.pii3c10
Error: Input file: dataout/ygy2ko.pfi3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ko.pfi3c10
Error: Input file: dataout/ygy2ka.phi3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ka.phi3c10
Error: Input file: dataout/ygy2ka.pgi3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ka.pgi3c10
Error: Input file: dataout/ygy2ka.pei3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ka.pei3c10
Error: Input file: dataout/ygy2ka.pdi3c10 is not a valid UM file.
Error converting file to netcdf: dataout/ygy2ka.pdi3c10
18:13:33 (9606): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:09:09 (2471): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:50:29 (8693): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:04:11 (8960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:25:31 (9086): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:26:46 (9551): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
14:07:44 (7435): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:09:37 (7550): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:55:33 (30886): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31222, iMonCtr=1
23:31:02 (31222): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:30:49 (3651): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15074, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
05:12:36 (15620): No heartbeat from core client for 30 sec - exiting
05:12:42 (15620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:12:44 (15620): No heartbeat from core client for 30 sec - exiting
05:12:45 (15620): No heartbeat from core client for 30 sec - exiting
05:12:46 (15620): No heartbeat from core client for 30 sec - exiting
05:12:47 (15620):10:17:22 (1660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:21:10 (31649): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:27:54 (31663): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:28:43 (2442): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:29:02 (2971): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:31:52 (3949): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:31:54 (3949): No heartbeat from core client for 30 sec - exiting
21:31:55 (3949): No heartbeat from core client for 30 sec - exiting
23:31:04 (1488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9573, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
09:15:06 (4211): No heartbeat from core client for 30 sec - exiting
09:15:09 (4211): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:15:10 (4211): No heartbeat from core client for 30 sec - exiting
09:15:11 (4211): No heartbeat from core client for 30 sec - exiting
15:50:39 (7090): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:08:10 (8550): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:05:09 (17240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:05:10 (17240): No heartbeat from core client for 30 sec - exiting
08:05:11 (17240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
Called boinc_finish
21:00:20 (1607): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:00:38 (1607): No heartbeat from core client for 30 sec - exiting
Signal 15 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
SIGSEGV: segmentation violation
Stack trace (7 frames):
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df]
[0xf0f400]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804cedf]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050a03]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0x126bd6]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51]

Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Jun 2012 17:57:25 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 648,000 2,204,850 3.4025
27 Jun 2012 18:15:17 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 622,080 2,115,257 3.4003
26 Jun 2012 11:58:55 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 596,160 2,025,628 3.3978
19 Jun 2012 06:34:18 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 570,240 1,938,140 3.3988
18 Jun 2012 05:52:09 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 544,320 1,853,678 3.4055
17 Jun 2012 18:36:28 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 518,400 1,770,206 3.4147
14 Jun 2012 21:10:55 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 492,480 1,685,864 3.4232
10 Jun 2012 13:28:03 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 466,560 1,599,443 3.4282
08 Jun 2012 10:44:05 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 440,640 1,511,010 3.4291
04 Jun 2012 22:58:37 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 414,720 1,421,960 3.4287
03 Jun 2012 15:41:34 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 388,800 1,332,484 3.4272
02 Jun 2012 10:09:49 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 362,880 1,243,233 3.4260
01 Jun 2012 03:09:43 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 336,960 1,153,722 3.4239
25 May 2012 02:40:06 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 311,040 1,064,756 3.4232
23 May 2012 18:33:44 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 285,120 976,138 3.4236
15 May 2012 06:09:48 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 259,200 887,008 3.4221
09 May 2012 19:11:02 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 233,280 798,191 3.4216
08 May 2012 04:22:32 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 207,360 709,298 3.4206
20 Apr 2012 08:02:30 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 181,440 620,451 3.4196
18 Apr 2012 18:12:43 970965 14366529 hadcm3n_ygy2_1980_40_007858454_0 155,520 532,227 3.4222


©2024 cpdn.org