climateprediction.net home page
Task 13554670

Task 13554670

Name hadcm3n_ygls_1900_40_007524482_1
Workunit 7721957
Created 28 Oct 2011, 13:29:58 UTC
Sent 30 Oct 2011, 22:15:27 UTC
Report deadline 30 Jan 2012, 5:42:38 UTC
Received 8 Nov 2011, 3:30:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1066269
Run time 3 days 17 hours 26 min 17 sec
CPU time 2 days 6 hours 27 min 37 sec
Validate state Invalid
Credit 1,244.16
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.12.42</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
17:46:10 (16552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:46:11 (16552): No heartbeat from core client for 30 sec - exiting
20:49:35 (18522): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:46:01 (19230): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:46:02 (19230): No heartbeat from core client for 30 sec - exiting
21:46:03 (19230): No heartbeat from core client for 30 sec - exiting
21:46:04 (19230): No heartbeat from core client for 30 sec - exiting
21:46:05 (19230): No heartbeat from core client for 30 sec - exiting
21:46:06 (19230): No heartbeat from core client for 30 sec - exiting
21:46:07 (19230): No heartbeat from core client for 30 sec - exiting
21:46:08 (19230): No heartbeat from core client for 30 sec - exiting
21:46:09 (19230): No heartbeat from core client for 30 sec - exiting
21:46:10 (19230): No heartbeat from core client for 30 sec - exiting
21:46:17 (19230): No heartbeat from core client for 30 sec - exiting
21:46:18 (19230): No heartbeat from core client for 30 sec - exiting
21:46:19 (19230): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
18:51:39 (20591): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:51:40 (20591): No heartbeat from core client for 30 sec - exiting
18:51:41 (20591): No heartbeat from core client for 30 sec - exiting
18:51:42 (20591): No heartbeat from core client for 30 sec - exiting
18:51:43 (20591): No heartbeat from core client for 30 sec - exiting
18:51:50 (20591): No heartbeat from core client for 30 sec - exiting
18:51:51 (20591): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:51:25 (29010): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:51:26 (29010): No heartbeat from core client for 30 sec - exiting
08:51:27 (29010): No heartbeat from core client for 30 sec - exiting
08:51:28 (29010): No heartbeat from core client for 30 sec - exiting
08:51:29 (29010): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:57:12 (20223): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:03:11 (24764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:03:12 (24764): No heartbeat from core client for 30 sec - exiting
02:03:13 (24764): No heartbeat from core client for 30 sec - exiting
02:03:14 (24764): No heartbeat from core client for 30 sec - exiting
02:03:15 (24764): No heartbeat from core client for 30 sec - exiting
02:03:16 (24764): No heartbeat from core client for 30 sec - exiting
02:03:17 (24764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:03:41 (6667): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:05:38 (9706): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:05:39 (9706): No heartbeat from core client for 30 sec - exiting
22:05:40 (9706): No heartbeat from core client for 30 sec - exiting
22:05:41 (9706): No heartbeat from core client for 30 sec - exiting
23:32:33 (9724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:32:34 (9724): No heartbeat from core client for 30 sec - exiting
23:32:42 (9724): No heartbeat from core client for 30 sec - exiting
23:32:43 (9724): No heartbeat from core client for 30 sec - exiting
23:32:44 (9724): No heartbeat from core client for 30 sec - exiting
23:32:45 (9724): No heartbeat from core client for 30 sec - exiting
23:32:46 (9724): No heartbeat from core client for 30 sec - exiting
23:32:54 (9724): No heartbeat from core client for 30 sec - exiting
23:32:55 (9724): No heartbeat from core client for 30 sec - exiting
23:32:58 (9724): No heartbeat from core client for 30 sec - exiting
23:32:59 (9724): No heartbeat from core client for 30 sec - exiting
23:33:00 (9724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
07:37:31 (21731): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:37:33 (21731): No heartbeat from core client for 30 sec - exiting
07:37:34 (21731): No heartbeat from core client for 30 sec - exiting
07:37:35 (21731): No heartbeat from core client for 30 sec - exiting
07:37:36 (21731): No heartbeat from core client for 30 sec - exiting
07:37:37 (21731): No heartbeat from core client for 30 sec - exiting
07:37:38 (21731): No heartbeat from core client for 30 sec - exiting
07:41:02 (21979): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:41:03 (21979): No heartbeat from core client for 30 sec - exiting
07:41:04 (21979): No heartbeat from core client for 30 sec - exiting
07:41:05 (21979): No heartbeat from core client for 30 sec - exiting
07:41:06 (21979): No heartbeat from core client for 30 sec - exiting
07:41:07 (21979): No heartbeat from core client for 30 sec - exiting
07:41:08 (21979): No heartbeat from core client for 30 sec - exiting
07:41:09 (21979): No heartbeat from core client for 30 sec - exiting
07:41:10 (21979): No heartbeat from core client for 30 sec - exiting
07:41:11 (21979): No heartbeat from core client for 30 sec - exiting
07:41:18 (21979): No heartbeat from core client for 30 sec - exiting
07:41:19 (21979): No heartbeat from core client for 30 sec - exiting
07:41:20 (21979): No heartbeat from core client for 30 sec - exiting
07:41:21 (21979): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
08:09:43 (22021): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:09:44 (22021): No heartbeat from core client for 30 sec - exiting
08:09:45 (22021): No heartbeat from core client for 30 sec - exiting
08:09:46 (22021): No heartbeat from core client for 30 sec - exiting
08:09:47 (22021): No heartbeat from core client for 30 sec - exiting
08:09:48 (22021): No heartbeat from core client for 30 sec - exiting
09:50:07 (22130): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:50:08 (22130): No heartbeat from core client for 30 sec - exiting
09:50:09 (22130): No heartbeat from core client for 30 sec - exiting
09:50:10 (22130): No heartbeat from core client for 30 sec - exiting
09:50:11 (22130): No heartbeat from core client for 30 sec - exiting
09:50:12 (22130): No heartbeat from core client for 30 sec - exiting
09:50:13 (22130): No heartbeat from core client for 30 sec - exiting
09:50:14 (22130): No heartbeat from core client for 30 sec - exiting
09:50:15 (22130): No heartbeat from core client for 30 sec - exiting
09:50:16 (22130): No heartbeat from core client for 30 sec - exiting
09:50:25 (22130): No heartbeat from core client for 30 sec - exiting
09:50:26 (22130): No heartbeat from core client for 30 sec - exiting
09:50:27 (22130): No heartbeat from core client for 30 sec - exiting
09:50:28 (22130): No heartbeat from core client for 30 sec - exiting
09:50:29 (22130): No heartbeat from core client for 30 sec - exiting
09:50:32 (22130): No heartbeat from core client for 30 sec - exiting
09:50:33 (22130): No heartbeat from core client for 30 sec - exiting
09:50:34 (22130): No heartbeat from core client for 30 sec - exiting
09:50:35 (22130): No heartbeat from core client for 30 sec - exiting
09:50:36 (22130): No heartbeat from core client for 30 sec - exiting
09:50:37 (22130): No heartbeat from core client for 30 sec - exiting
09:50:38 (22130): No heartbeat from core client for 30 sec - exiting
09:50:39 (22130): No heartbeat from core client for 30 sec - exiting
21:50:43 (22523): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:50:44 (22523): No heartbeat from core client for 30 sec - exiting
21:50:45 (22523): No heartbeat from core client for 30 sec - exiting
02:47:23 (21060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:47:24 (21060): No heartbeat from core client for 30 sec - exiting
02:47:25 (21060): No heartbeat from core client for 30 sec - exiting
02:47:26 (21060): No heartbeat from core client for 30 sec - exiting
02:47:27 (21060): No heartbeat from core client for 30 sec - exiting
02:47:34 (21060): No heartbeat from core client for 30 sec - exiting
02:47:35 (21060): No heartbeat from core client for 30 sec - exiting
02:47:36 (21060): No heartbeat from core client for 30 sec - exiting
02:47:37 (21060): No heartbeat from core client for 30 sec - exiting
02:47:38 (21060): No heartbeat from core client for 30 sec - exiting
02:47:39 (21060): No heartbeat from core client for 30 sec - exiting
02:47:44 (21060): No heartbeat from core client for 30 sec - exiting
02:47:45 (21060): No heartbeat from core client for 30 sec - exiting
02:47:46 (21060): No heartbeat from core client for 30 sec - exiting
02:47:47 (21060): No heartbeat from core client for 30 sec - exiting
02:47:55 (21060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:24:46 (8037): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:24:47 (8037): No heartbeat from core client for 30 sec - exiting
20:24:48 (8037): No heartbeat from core client for 30 sec - exiting
20:24:49 (8037): No heartbeat from core client for 30 sec - exiting
20:24:50 (8037): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76f1400]
[0xf76f1430]
/lib32/libc.so.6(gsignal+0x50)[0xf7517a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75002cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7759400]
[0xf7759430]
/lib32/libc.so.6(gsignal+0x50)[0xf757fa60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75682cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77ae400]
[0xf77ae430]
/lib32/libc.so.6(gsignal+0x50)[0xf75d4a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75bd2cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76e4400]
[0xf76e4430]
/lib32/libc.so.6(gsignal+0x50)[0xf750aa60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf74f32cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7701400]
[0xf7701430]
/lib32/libc.so.6(gsignal+0x50)[0xf7527a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75102cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7749400]
[0xf7749430]
/lib32/libc.so.6(gsignal+0x50)[0xf756fa60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75582cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24799, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Nov 2011 20:54:10 1066269 13554670 hadcm3n_ygls_1900_40_007524482_1 103,680 160,010 1.5433
05 Nov 2011 16:58:48 1066269 13554670 hadcm3n_ygls_1900_40_007524482_1 77,760 120,108 1.5446
01 Nov 2011 23:44:46 1066269 13554670 hadcm3n_ygls_1900_40_007524482_1 51,840 79,896 1.5412
01 Nov 2011 04:53:12 1066269 13554670 hadcm3n_ygls_1900_40_007524482_1 25,920 39,867 1.5381


©2024 cpdn.org