Task 13540291

Name hadcm3n_yfif_1900_40_007517307_0
Workunit 7714782
Created 28 Oct 2011, 12:51:44 UTC
Sent 22 Nov 2011, 0:52:50 UTC
Report deadline 21 Feb 2012, 8:20:01 UTC
Received 16 Dec 2011, 1:39:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1176343
Run time 22 days 5 hours 1 min 52 sec
CPU time 21 days 11 hours 41 min 16 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:53:08 (1904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:03:19 (12714): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:12:03 (13204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:41:51 (13242): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:43:19 (13971): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:46:14 (13988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:48:12 (14005): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:50:10 (14021): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:52:08 (14037): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:54:05 (14053): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:03 (14069): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:58:01 (14085): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:59:59 (14101): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:01:57 (14119): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:03:54 (14144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:05:52 (14161): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:07:50 (14177): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:23:18 (1977): No heartbeat from core client for 30 sec - exiting
17:23:20 (1977): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_yfif_1900_40_007517307/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 3 received, exiting...
SIGSEGV: segmentation violation
Called boinc_finish
Stack trace (7 frames):
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df]
[0xf7712400]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804ceb0]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050a03]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7474c76]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51]

Exiting...

</stderr_txt>
]]>
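The exit status above is reported in three forms: 193, 0xc1, and -63. These look like the same byte shown as unsigned decimal, hexadecimal, and a signed 8-bit value. The sketch below checks that arithmetic under that assumption; the helper name `decode_exit_status` is hypothetical and is not part of BOINC or the CPDN client.

```python
def decode_exit_status(status: int) -> dict:
    """Render one exit status as unsigned decimal, hex, and signed 8-bit.

    Assumption: the negative figure in the log is the low byte of the
    status reinterpreted as a two's-complement 8-bit integer.
    """
    byte = status & 0xFF                             # keep only the low 8 bits
    signed = byte - 256 if byte >= 128 else byte     # two's-complement view
    return {"unsigned": byte, "hex": f"0x{byte:02x}", "signed": signed}


if __name__ == "__main__":
    print(decode_exit_status(193))
```

Under this reading, all three figures in the `<message>` line describe a single value, not three different error codes.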
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
16 Dec 2011 01:44:43  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  777,600   1,770,343       2.2767
14 Dec 2011 05:16:39  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  751,680   1,711,407       2.2768
13 Dec 2011 12:22:29  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  725,760   1,652,615       2.2771
12 Dec 2011 19:19:07  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  699,840   1,593,745       2.2773
12 Dec 2011 02:21:44  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  673,920   1,534,856       2.2775
11 Dec 2011 09:16:54  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  648,000   1,475,886       2.2776
10 Dec 2011 16:12:45  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  622,080   1,416,773       2.2775
09 Dec 2011 23:23:43  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  596,160   1,358,082       2.2780
09 Dec 2011 06:17:00  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  570,240   1,298,488       2.2771
08 Dec 2011 13:29:23  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  544,320   1,239,126       2.2765
07 Dec 2011 20:58:46  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  518,400   1,180,276       2.2768
07 Dec 2011 04:14:25  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  492,480   1,121,442       2.2771
06 Dec 2011 11:05:36  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  466,560   1,062,128       2.2765
05 Dec 2011 18:26:31  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  440,640   1,003,337       2.2770
05 Dec 2011 06:10:19  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  414,720   943,956         2.2761
04 Dec 2011 08:14:48  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  388,800   884,788         2.2757
03 Dec 2011 15:32:57  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  362,880   825,974         2.2762
02 Dec 2011 22:43:36  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  336,960   767,083         2.2765
02 Dec 2011 06:06:08  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  311,040   708,172         2.2768
30 Nov 2011 21:11:08  1176343  13540291   hadcm3n_yfif_1900_40_007517307_0  285,120   648,238         2.2736
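
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below recomputes it for three rows copied from the table. The function name `seconds_per_timestep` is hypothetical, and the rounding rule is an assumption inferred from the figures; it is not documented behaviour of the CPDN server.

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds, average reported on the page)
rows = [
    (777_600, 1_770_343, 2.2767),  # 16 Dec 2011 01:44:43
    (751_680, 1_711_407, 2.2768),  # 14 Dec 2011 05:16:39
    (285_120, 648_238, 2.2736),    # 30 Nov 2011 21:11:08
]

if __name__ == "__main__":
    for timestep, cpu_seconds, reported in rows:
        computed = seconds_per_timestep(cpu_seconds, timestep)
        print(timestep, cpu_seconds, computed, reported)
```

For these three rows the recomputed value agrees with the reported one, which supports reading the column as a running average and not a per-interval rate.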

