Task 14292472

Name hadcm3n_y9n6_1940_40_007834472_1
Workunit 7989584
Created 19 Mar 2012, 16:29:25 UTC
Sent 19 Mar 2012, 19:48:25 UTC
Report deadline 19 Jun 2012, 3:15:36 UTC
Received 17 Apr 2012, 8:42:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1366213
Run time 18 days 21 hours 53 min 15 sec
CPU time 18 days 21 hours 53 min 15 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 2.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
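The exit status appears on this page in three forms: decimal 22 and hexadecimal 0x00000016 in the summary above, and -234 in the "process exited with code" message in the stderr output. The short sketch below only checks that these are arithmetically the same value. The relation -234 = 22 - 256 is an observation from the numbers themselves; how the client derives the negative form is not stated on this page.

```python
# Exit status as reported in the task summary.
exit_code = 22

# Hexadecimal form, padded to 8 digits as in "Exit status 22 (0x00000016)".
as_hex = f"0x{exit_code:08x}"

# The negative value printed in the stderr message matches exit_code - 256.
# (Assumption: this is just the arithmetic relation, not a documented formula.)
negative_form = exit_code - 256

print(exit_code, as_hex, negative_form)
```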
Stderr
<core_client_version>6.2.15</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
00:01:07 (11794): No heartbeat from core client for 30 sec - exiting
00:01:08 (11794): No heartbeat from core client for 30 sec - exiting
00:01:09 (11794): No heartbeat from core client for 30 sec - exiting
00:01:10 (11794): No heartbeat from core client for 30 sec - exiting
00:01:11 (11794): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:01:12 (11794): No heartbeat from core client for 30 sec - exiting
00:01:13 (11794): No heartbeat from core client for 30 sec - exiting
00:01:14 (11794): No heartbeat from core client for 30 sec - exiting
02:42:51 (12121): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:42:53 (12121): No heartbeat from core client for 30 sec - exiting
02:45:15 (9354): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:33:47 (9408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:33:49 (9408): No heartbeat from core client for 30 sec - exiting
03:33:50 (9408): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
04:05:15 (4538): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:05:17 (4538): No heartbeat from core client for 30 sec - exiting
04:07:00 (26008): No heartbeat from core client for 30 sec - exiting
04:07:01 (26008): No heartbeat from core client for 30 sec - exiting
04:07:02 (26008): No heartbeat from core client for 30 sec - exiting
04:07:03 (26008): No heartbeat from core client for 30 sec - exiting
04:07:04 (26008): No heartbeat from core client for 30 sec - exiting
04:07:05 (26008): No heartbeat from core client for 30 sec - exiting
04:07:06 (26008): No heartbeat from core client for 30 sec - exiting
04:07:07 (26008): No heartbeat from core client for 30 sec - exiting
04:07:38 (26008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:23:48 (26051): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:23:49 (26051): No heartbeat from core client for 30 sec - exiting
04:19:19 (20631): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:23:55 (14841): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:01:38 (25565): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:51 (19502): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:02 (19528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:04 (19528): No heartbeat from core client for 30 sec - exiting
00:02:05 (19528): No heartbeat from core client for 30 sec - exiting
00:02:06 (19528): No heartbeat from core client for 30 sec - exiting
00:02:07 (19528): No heartbeat from core client for 30 sec - exiting
00:02:08 (19528): No heartbeat from core client for 30 sec - exiting
00:02:09 (19528): No heartbeat from core client for 30 sec - exiting
00:02:10 (19528): No heartbeat from core client for 30 sec - exiting
00:02:11 (19528): No heartbeat from core client for 30 sec - exiting
00:02:12 (19528): No heartbeat from core client for 30 sec - exiting
00:02:13 (19528): No heartbeat from core client for 30 sec - exiting
00:02:14 (19528): No heartbeat from core client for 30 sec - exiting
00:02:15 (19528): No heartbeat from core client for 30 sec - exiting
00:02:16 (19528): No heartbeat from core client for 30 sec - exiting
00:03:48 (8087): No heartbeat from core client for 30 sec - exiting
00:04:22 (8087): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:03:21 (8137): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:03:24 (8137): No heartbeat from core client for 30 sec - exiting
00:03:25 (8137): No heartbeat from core client for 30 sec - exiting
00:03:26 (8137): No heartbeat from core client for 30 sec - exiting
00:03:27 (8137): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
00:01:38 (2364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:17:33 (28889): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:38:17 (22088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:38:19 (22088): No heartbeat from core client for 30 sec - exiting
04:38:20 (22088): No heartbeat from core client for 30 sec - exiting
04:38:21 (22088): No heartbeat from core client for 30 sec - exiting
04:38:22 (22088): No heartbeat from core client for 30 sec - exiting
04:38:23 (22088):04:40:03 (11054): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:40:04 (11054): No heartbeat from core client for 30 sec - exiting
04:40:05 (11054): No heartbeat from core client for 30 sec - exiting
04:40:06 (11054): No heartbeat from core client for 30 sec - exiting
04:40:07 (11054): No heartbeat from core client for 30 sec - exiting
04:40:08 (11054): No heartbeat from core client for 30 sec - exiting
04:40:09 (11054): No heartbeat from core client for 30 sec - exiting
04:40:10 (11054): No heartbeat from core client for 30 sec - exiting
04:40:11 (11054): No heartbeat from core client for 30 sec - exiting
04:40:12 (11054): No heartbeat from core client for 30 sec - exiting
04:40:13 (11054): No heartbeat from core client for 30 sec - exiting
04:40:14 (11054): No heartbeat from core client for 30 sec - exiting
04:40:15 (11054): No heartbeat from core client for 30 sec - exiting
04:40:16 (11054): No heartbeat from core client for 30 sec - exiting
04:40:17 (11054): No heartbeat from core client for 30 sec - exiting
04:40:18 (11054): No heartbeat from core client for 30 sec - exiting
04:40:19 (11054): No heartbeat from core client for 30 sec - exiting
04:40:20 (11054): No heartbeat from core client for 30 sec - exiting
04:40:21 (11054): No heartbeat from core client for 30 sec - exiting
04:40:22 (11054): No heartbeat from core client for 30 sec - exiting
04:40:23 (11054): No heartbeat from core client for 30 sec - exiting
04:40:24 (11054): No heartbeat from core client for 30 sec - exiting
04:40:25 (11054): No heartbeat from core client for 30 sec - exiting
04:40:26 (11054): No heartbeat from core client for 30 sec - exiting
04:40:27 (11054): No heartbeat from core client for 30 sec - exiting
04:40:28 (11054): No heartbeat from core client for 30 sec - exiting
04:40:29 (11054): No heartbeat from core client for 30 sec - exiting
04:40:30 (11054): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb7780400]
[0xb7780422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb760b651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb760ea82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb75f7bd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb773e400]
[0xb773e422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb75c9651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb75cca82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb75b5bd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb770d400]
[0xb770d422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb7598651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb759ba82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb7584bd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb7784400]
[0xb7784422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb760f651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb7612a82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb75fbbd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb7767400]
[0xb7767422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb75f2651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb75f5a82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb75debd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xb773d400]
[0xb773d422]
/lib/tls/i686/cmov/libc.so.6(gsignal+0x51)[0xb75c8651]
/lib/tls/i686/cmov/libc.so.6(abort+0x182)[0xb75cba82]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xb75b4bd6]
/home/boinc/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11085, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
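The stderr output above is long and repetitive, so a tally can be easier to read than the raw text. The following is a minimal sketch that counts heartbeat-loss messages per process ID and counts SIGABRT lines. The function name `summarize` and the `SAMPLE` excerpt are hypothetical, and the line patterns are assumed from the lines shown on this page rather than from any documented log format.

```python
import re
from collections import Counter

# A few lines copied from the stderr above; the full text would be handled the same way.
SAMPLE = """\
00:01:07 (11794): No heartbeat from core client for 30 sec - exiting
00:01:08 (11794): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:42:51 (12121): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Matches "HH:MM:SS (PID): No heartbeat from core client ..." and captures the PID.
HEARTBEAT = re.compile(r"^\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client")


def summarize(stderr_text):
    """Return (heartbeat messages per PID, SIGABRT line count, gave-up flag)."""
    per_pid = Counter()
    aborts = 0
    gave_up = False
    for line in stderr_text.splitlines():
        match = HEARTBEAT.match(line)
        if match:
            per_pid[int(match.group(1))] += 1
        elif line.startswith("SIGABRT"):
            aborts += 1
        elif "too many model crashes" in line:
            gave_up = True
    return per_pid, aborts, gave_up


per_pid, aborts, gave_up = summarize(SAMPLE)
print(dict(per_pid), aborts, gave_up)
```

Run against the complete stderr text, the same function would report one SIGABRT count per "SIGABRT: abort called" line and set the flag once the "too many model crashes" line is reached.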
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
17 Apr 2012 05:35:29 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 699,840 | 1,624,079 | 2.3206
16 Apr 2012 12:33:34 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 673,920 | 1,563,287 | 2.3197
15 Apr 2012 19:11:33 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 648,000 | 1,502,427 | 2.3186
15 Apr 2012 01:47:01 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 622,080 | 1,441,879 | 2.3178
14 Apr 2012 09:01:34 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 596,160 | 1,381,754 | 2.3178
13 Apr 2012 15:34:12 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 570,240 | 1,321,710 | 2.3178
12 Apr 2012 22:00:24 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 544,320 | 1,261,495 | 2.3176
12 Apr 2012 04:44:52 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 518,400 | 1,201,309 | 2.3173
11 Apr 2012 11:57:24 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 492,480 | 1,141,301 | 2.3175
10 Apr 2012 18:37:11 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 466,560 | 1,081,285 | 2.3176
10 Apr 2012 01:25:58 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 440,640 | 1,021,277 | 2.3177
09 Apr 2012 08:39:14 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 414,720 | 961,263 | 2.3179
08 Apr 2012 15:36:15 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 388,800 | 901,241 | 2.3180
07 Apr 2012 22:17:10 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 362,880 | 841,058 | 2.3177
07 Apr 2012 05:03:00 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 336,960 | 780,771 | 2.3171
06 Apr 2012 11:39:09 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 311,040 | 720,546 | 2.3166
05 Apr 2012 18:28:55 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 285,120 | 660,263 | 2.3157
05 Apr 2012 01:17:53 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 259,200 | 599,857 | 2.3143
04 Apr 2012 08:21:40 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 233,280 | 539,728 | 2.3136
03 Apr 2012 15:21:20 | 805269 | 14292472 | hadcm3n_y9n6_1940_40_007834472_1 | 207,360 | 479,600 | 2.3129
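The Average (sec/TS) column in the trickle table is consistent with cumulative CPU time divided by the timestep count, rounded to four decimals. The sketch below recomputes it for three rows copied from the table. That formula is an assumption inferred from the figures, not something the page states, and the names `ROWS` and `sec_per_timestep` are hypothetical.

```python
# (timestep, cpu_time_sec, reported_average) copied from three rows of the table above.
ROWS = [
    (699_840, 1_624_079, 2.3206),
    (673_920, 1_563_287, 2.3197),
    (207_360, 479_600, 2.3129),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded as in the table."""
    return round(cpu_time_sec / timestep, 4)


recomputed = [sec_per_timestep(cpu, ts) for ts, cpu, _ in ROWS]
reported = [avg for _, _, avg in ROWS]

# Timesteps between the two most recent trickles.
steps_between_trickles = ROWS[0][0] - ROWS[1][0]

print(recomputed, reported, steps_between_trickles)
```

For these rows the recomputed values agree with the reported column, and the two most recent trickles are 25,920 timesteps apart.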

