climateprediction.net home page
Task 12820307

Task 12820307

Name hadcm3n_p20g_1900_40_007220192_1
Workunit 7418432
Created 26 Apr 2011, 15:19:35 UTC
Sent 3 May 2011, 2:45:46 UTC
Report deadline 2 Aug 2011, 10:12:57 UTC
Received 15 May 2011, 14:44:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1147202
Run time 10 days 23 hours 31 min 20 sec
CPU time 8 days 9 hours 6 min 13 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.51 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.59</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:00:23 (2437): No heartbeat from core client for 30 sec - exiting
13:00:25 (2437): No heartbeat from core client for 30 sec - exiting
13:00:27 (2437): No heartbeat from core client for 30 sec - exiting
13:00:28 (2437): No heartbeat from core client for 30 sec - exiting
13:00:29 (2437): No heartbeat from core client for 30 sec - exiting
13:00:30 (2437): No heartbeat from core client for 30 sec - exiting
13:00:31 (2437): No heartbeat from core client for 30 sec - exiting
13:00:32 (2437): No heartbeat from core client for 30 sec - exiting
13:00:33 (2437): No heartbeat from core client for 30 sec - exiting
13:00:34 (2437): No heartbeat from core client for 30 sec - exiting
13:00:35 (2437): No heartbeat from core client for 30 sec - exiting
13:00:36 (2437): No heartbeat from core client for 30 sec - exiting
13:00:37 (2437): No heartbeat from core client for 30 sec - exiting
13:00:38 (2437): No heartbeat from core client for 30 sec - exiting
13:00:39 (2437): No heartbeat from core client for 30 sec - exiting
13:00:40 (2437): No heartbeat from core client for 30 sec - exiting
13:00:41 (2437): No heartbeat from core client for 30 sec - exiting
13:00:42 (2437): No heartbeat from core client for 30 sec - exiting
13:00:43 (2437): No heartbeat from core client for 30 sec - exiting
13:00:44 (2437): No heartbeat from core client for 30 sec - exiting
13:00:45 (2437): No heartbeat from core client for 30 sec - exiting
13:00:46 (2437): No heartbeat from core client for 30 sec - exiting
13:00:47 (2437): No heartbeat from core client for 30 sec - exiting
13:00:48 (2437): No heartbeat from core client for 30 sec - exiting
13:00:49 (2437): No heartbeat from core client for 30 sec - exiting
13:00:50 (2437): No heartbeat from core client for 30 sec - exiting
13:00:51 (2437): No heartbeat from core client for 30 sec - exiting
13:00:52 (2437): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:06:34 (3624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:06:35 (3624): No heartbeat from core client for 30 sec - exiting
17:06:36 (3624): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
14:04:03 (2948): No heartbeat from core client for 30 sec - exiting
14:04:04 (2948): No heartbeat from core client for 30 sec - exiting
14:04:05 (2948): No heartbeat from core client for 30 sec - exiting
14:04:06 (2948): No heartbeat from core client for 30 sec - exiting
14:04:07 (2948): No heartbeat from core client for 30 sec - exiting
14:04:08 (2948): No heartbeat from core client for 30 sec - exiting
14:04:09 (2948): No heartbeat from core client for 30 sec - exiting
14:04:10 (2948): No heartbeat from core client for 30 sec - exiting
14:04:11 (2948): No heartbeat from core client for 30 sec - exiting
14:04:12 (2948): No heartbeat from core client for 30 sec - exiting
14:04:13 (2948): No heartbeat from core client for 30 sec - exiting
14:04:14 (2948): No heartbeat from core client for 30 sec - exiting
14:04:15 (2948): No heartbeat from core client for 30 sec - exiting
14:04:16 (2948): No heartbeat from core client for 30 sec - exiting
14:04:17 (2948): No heartbeat from core client for 30 sec - exiting
14:04:18 (2948): No heartbeat from core client for 30 sec - exiting
14:04:19 (2948): No heartbeat from core client for 30 sec - exiting
14:04:20 (2948): No heartbeat from core client for 30 sec - exiting
14:04:21 (2948): No heartbeat from core client for 30 sec - exiting
14:04:22 (2948): No heartbeat from core client for 30 sec - exiting
14:04:23 (2948): No heartbeat from core client for 30 sec - exiting
14:04:24 (2948): No heartbeat from core client for 30 sec - exiting
14:04:25 (2948): No heartbeat from core client for 30 sec - exiting
14:04:26 (2948): No heartbeat from core client for 30 sec - exiting
14:04:27 (2948): No heartbeat from core client for 30 sec - exiting
14:04:28 (2948): No heartbeat from core client for 30 sec - exiting
14:04:29 (2948): No heartbeat from core client for 30 sec - exiting
14:04:30 (2948): No heartbeat from core client for 30 sec - exiting
14:04:31 (2948): No heartbeat from core client for 30 sec - exiting
14:04:32 (2948): No heartbeat from core client for 30 sec - exiting
14:04:33 (2948): No heartbeat from core client for 30 sec - exiting
14:04:34 (2948): No heartbeat from core client for 30 sec - exiting
14:04:35 (2948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:55:52 (2960): No heartbeat from core client for 30 sec - exiting
12:55:53 (2960): No heartbeat from core client for 30 sec - exiting
12:55:54 (2960): No heartbeat from core client for 30 sec - exiting
12:55:55 (2960): No heartbeat from core client for 30 sec - exiting
12:55:56 (2960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:47:16 (3272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
22:40:41 (3235): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:52:49 (22412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:52:50 (22412): No heartbeat from core client for 30 sec - exiting
20:52:51 (22412): No heartbeat from core client for 30 sec - exiting
20:52:52 (22412): No heartbeat from core client for 30 sec - exiting
20:52:53 (22412): No heartbeat from core client for 30 sec - exiting
20:52:54 (22412): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77bd400]
[0xf77bd430]
/lib32/libc.so.6(gsignal+0x51)[0xf760eea1]
/lib32/libc.so.6(abort+0x17e)[0xf76122ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf75fae37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76fa400]
[0xf76fa430]
/lib32/libc.so.6(gsignal+0x51)[0xf754bea1]
/lib32/libc.so.6(abort+0x17e)[0xf754f2ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf7537e37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf778f400]
[0xf778f430]
/lib32/libc.so.6(gsignal+0x51)[0xf75e0ea1]
/lib32/libc.so.6(abort+0x17e)[0xf75e42ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf75cce37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77d1400]
[0xf77d1430]
/lib32/libc.so.6(gsignal+0x51)[0xf7622ea1]
/lib32/libc.so.6(abort+0x17e)[0xf76262ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf760ee37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7764400]
[0xf7764430]
/lib32/libc.so.6(gsignal+0x51)[0xf75b5ea1]
/lib32/libc.so.6(abort+0x17e)[0xf75b92ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf75a1e37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7704400]
[0xf7704430]
/lib32/libc.so.6(gsignal+0x51)[0xf7555ea1]
/lib32/libc.so.6(abort+0x17e)[0xf75592ce]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe7)[0xf7541e37]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2458, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 May 2011 14:50:12 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 311,040 686,894 2.2084
13 May 2011 20:19:52 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 285,120 629,209 2.2068
13 May 2011 02:34:01 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 259,200 570,919 2.2026
12 May 2011 09:18:03 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 233,280 513,280 2.2003
11 May 2011 00:28:26 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 207,360 455,671 2.1975
09 May 2011 21:53:36 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 181,440 399,067 2.1994
08 May 2011 22:43:45 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 155,520 341,935 2.1987
08 May 2011 01:19:36 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 129,600 284,997 2.1991
07 May 2011 02:38:18 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 103,680 227,977 2.1989
06 May 2011 04:52:46 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 77,760 171,170 2.2013
04 May 2011 22:39:37 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 51,840 114,122 2.2014
03 May 2011 23:20:50 1147202 12820307 hadcm3n_p20g_1900_40_007220192_1 25,920 57,191 2.2064


©2024 cpdn.org