climateprediction.net home page
Task 21861436

Task 21861436

Name hadam4h_a21m_201411_4_842_011907106_2
Workunit 11907106
Created 22 Dec 2019, 22:37:53 UTC
Sent 23 Dec 2019, 2:00:13 UTC
Report deadline 4 Dec 2020, 7:20:13 UTC
Received 15 Jan 2020, 2:52:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 12 (0x0000000C) Unknown error code
Computer ID 1492959
Run time 12 days 3 hours 1 min 8 sec
CPU time 12 days 3 hours 1 min 8 sec
Validate state Invalid
Credit 13,636.74
Device peak FLOPS 3.22 GFLOPS
Application version UK Met Office HadAM4 at N216 resolution v8.52
i686-pc-linux-gnu
Peak working set size 1,364.45 MB
Peak swap size 1,385.77 MB
Peak disk usage 12.91 MB
Stderr
<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process exited with code 12 (0xc, -244)</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
15:19:42 (16433): called boinc_finish(193)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/xnnuj.pipe_dummy                                                            

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: STWORK  : Error in PP_FILE                                                                                                                                                                                                                                      tmp/xnnuj.pipe_dummy                                                            

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/xnnuj.pipe_dummy                                                            
Signal 15 received: Software termination signal from kill 
Signal 15 received: Abnormal termination triggered by abort call
Signal 15 received, exiting...
SIGSEGV: segmentation violation
Stack trace (26 frames):
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x80d4cf7]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090]
/lib32/libc.so.6(getenv+0x99)[0xf7b015f9]
/lib32/libc.so.6(+0xae498)[0xf7b80498]
/lib32/libc.so.6(+0xae865)[0xf7b80865]
/lib32/libc.so.6(localtime_r+0x12)[0xf7b7edc2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d01b2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0900]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d09f1]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0580]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090]
linux-gate.so.1(__kernel_vsyscall+0x9)[0xf7f5b079]
/lib32/libc.so.6(__read+0x5b)[0xf7bb767b]
/lib32/libc.so.6(+0x7186e)[0xf7b4386e]
/lib32/libc.so.6(fread+0x38)[0xf7b37f78]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804d57f]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804d712]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804e447]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804e83d]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8053dff]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80504d2]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051b0e]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051d8b]
/lib32/libc.so.6(__libc_start_main+0xf1)[0xf7aeae81]
../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804cd21]

Exiting...
OPEN:  File Creation Failed: No such file or directory
OPEN:  Unable to Open File dataout/a21mga.pbl5jan for Read/Write

Model crashed: STWORK  : Error opening output PP file on unit 61                                                                                                                                                                                                               tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts
forrtl: Bad file descriptor
forrtl: severe (30): open failure, unit 6, file /proc/7227/fd/
Image              PC        Routine            Line        Source             
hadam4_um_8.52_i6  083F6605  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0843E0A0  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  081DF6C9  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0836E63F  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts
forrtl: Bad file descriptor
forrtl: severe (30): open failure, unit 6, file /proc/7237/fd/
Image              PC        Routine            Line        Source             
hadam4_um_8.52_i6  083F6605  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0843E0A0  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  081DF6C9  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0836E63F  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts
forrtl: Bad file descriptor
forrtl: severe (30): open failure, unit 6, file /proc/7245/fd/
Image              PC        Routine            Line        Source             
hadam4_um_8.52_i6  083F6605  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0843E0A0  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  081DF6C9  Unknown               Unknown  Unknown
hadam4_um_8.52_i6  0836E63F  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Jan 2020 17:30:34 1492959 21861436 hadam4h_a21m_201411_4_842_011907106_2 17,483 776,632 44.4221
06 Jan 2020 22:46:13 1492959 21861436 hadam4h_a21m_201411_4_842_011907106_2 8,843 411,629 46.5486


©2024 cpdn.org