Task 21912442

Name hadam4h_a1uj_201311_4_843_011910001_1
Workunit 11910001
Created 6 Feb 2020, 0:10:14 UTC
Sent 5 Mar 2020, 1:20:11 UTC
Report deadline 15 Feb 2021, 6:40:11 UTC
Received 21 Apr 2020, 3:30:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1337904
Run time 13 days 9 hours 2 min 31 sec
CPU time 11 days 4 hours 57 min 6 sec
Validate state Invalid
Credit 20,375.94
Device peak FLOPS 3.97 GFLOPS
Application version UK Met Office HadAM4 at N216 resolution v8.52 i686-pc-linux-gnu
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17621, iMonCtr=1
Model crash detected, will try to restart...
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
21:11:36 (6174): called boinc_finish(193)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
00:43:30 (3809): called boinc_finish(193)
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
SIGSEGV: segmentation violation
Stack trace (19 frames):
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x80d4cf7]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf778c0f0]
/lib/libc.so.6(getenv+0x72)[0xf73c3e72]
/lib/libc.so.6(+0xaa85e)[0xf743d85e]
/lib/libc.so.6(+0xaabd6)[0xf743dbd6]
/lib/libc.so.6(localtime_r+0x2b)[0xf743c18b]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d01b2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0900]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d09f1]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf778c0f0]
linux-gate.so.1(__kernel_vsyscall+0x9)[0xf778c119]
/lib/libc.so.6(nanosleep+0x46)[0xf744ba56]
/lib/libc.so.6(usleep+0x3d)[0xf747bfcd]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80e78a5]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80503e8]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051b13]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051d8b]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf73ab703]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804cd21]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3861, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3861, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
SIGSEGV: segmentation violation
Stack trace (19 frames):
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x80d4cf7]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77540f0]
/lib/libc.so.6(getenv+0x72)[0xf738be72]
/lib/libc.so.6(+0xaa85e)[0xf740585e]
/lib/libc.so.6(+0xaabd6)[0xf7405bd6]
/lib/libc.so.6(localtime_r+0x2b)[0xf740418b]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d01b2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0900]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d09f1]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77540f0]
linux-gate.so.1(__kernel_vsyscall+0x9)[0xf7754119]
/lib/libc.so.6(nanosleep+0x46)[0xf7413a56]
/lib/libc.so.6(usleep+0x3d)[0xf7443fcd]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80e78a5]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80503e8]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051b13]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051d8b]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf7373703]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804cd21]

Exiting...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
SIGSEGV: segmentation violation
Stack trace (19 frames):
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x80d4cf7]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf778e0f0]
/lib/libc.so.6(getenv+0x72)[0xf73c5e72]
/lib/libc.so.6(+0xaa85e)[0xf743f85e]
/lib/libc.so.6(+0xaabd6)[0xf743fbd6]
/lib/libc.so.6(localtime_r+0x2b)[0xf743e18b]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d01b2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0900]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d09f1]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf778e0f0]
linux-gate.so.1(__kernel_vsyscall+0x9)[0xf778e119]
/lib/libc.so.6(nanosleep+0x46)[0xf744da56]
/lib/libc.so.6(usleep+0x3d)[0xf747dfcd]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80e78a5]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80503e8]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051b13]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051d8b]
/lib/libc.so.6(__libc_start_main+0xf3)[0xf73ad703]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804cd21]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2729, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2729, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2729, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17436, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17436, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17436, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20583, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
04:52:42 (20583): called boinc_finish(22)

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
23 Mar 2020 06:27:22 | 1337904 | 21912442 | hadam4h_a1uj_201311_4_843_011910001_1 | 26,123 | 789,035 | 30.2046
19 Mar 2020 07:41:34 | 1337904 | 21912442 | hadam4h_a1uj_201311_4_843_011910001_1 | 17,483 | 547,743 | 31.3300
15 Mar 2020 07:14:03 | 1337904 | 21912442 | hadam4h_a1uj_201311_4_843_011910001_1 | 8,843 | 279,040 | 31.5549
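
The Average (sec/TS) column in the trickle rows above appears to be the cumulative CPU time divided by the timestep count. The page does not state this, so treat it as an assumption. A minimal Python sketch that checks it against the three rows (the function name `average_sec_per_timestep` is hypothetical, chosen for illustration):

```python
# Values copied from the trickle rows: (timestep, cpu_time_sec, reported_average).
trickles = [
    (26123, 789035, 30.2046),
    (17483, 547743, 31.3300),
    (8843, 279040, 31.5549),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Show the recomputed value next to the value reported on the page.
    print(f"timestep {timestep:>6}: computed {computed:.4f}, reported {reported:.4f}")
```

If the assumption holds, each computed value agrees with the reported one to four decimal places.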

