Name | hadam4h_a21m_201411_4_842_011907106_2 |
Workunit | 11907106 |
Created | 22 Dec 2019, 22:37:53 UTC |
Sent | 23 Dec 2019, 2:00:13 UTC |
Report deadline | 4 Dec 2020, 7:20:13 UTC |
Received | 15 Jan 2020, 2:52:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 12 (0x0000000C) Unknown error code |
Computer ID | 1492959 |
Run time | 12 days 3 hours 1 min 8 sec |
CPU time | 12 days 3 hours 1 min 8 sec |
Validate state | Invalid |
Credit | 13,636.74 |
Device peak FLOPS | 3.22 GFLOPS |
Application version | UK Met Office HadAM4 at N216 resolution v8.52 i686-pc-linux-gnu |
Peak working set size | 1,364.45 MB |
Peak swap size | 1,385.77 MB |
Peak disk usage | 12.91 MB |
Stderr | <core_client_version>7.9.3</core_client_version> <![CDATA[ <message> process exited with code 12 (0xc, -244)</message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received: Software termination signal from kill Signal 15 received: Abnormal termination triggered by abort call Signal 15 received, exiting... 15:19:42 (16433): called boinc_finish(193) Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/xnnuj.pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: STWORK : Error in PP_FILE tmp/xnnuj.pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/xnnuj.pipe_dummy Signal 15 received: Software termination signal from kill Signal 15 received: Abnormal termination triggered by abort call Signal 15 received, exiting... SIGSEGV: segmentation violation Stack trace (26 frames): ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x80d4cf7] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090] /lib32/libc.so.6(getenv+0x99)[0xf7b015f9] /lib32/libc.so.6(+0xae498)[0xf7b80498] /lib32/libc.so.6(+0xae865)[0xf7b80865] /lib32/libc.so.6(localtime_r+0x12)[0xf7b7edc2] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d01b2] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0900] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d09f1] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80d0580] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7f5b090] linux-gate.so.1(__kernel_vsyscall+0x9)[0xf7f5b079] /lib32/libc.so.6(__read+0x5b)[0xf7bb767b] /lib32/libc.so.6(+0x7186e)[0xf7b4386e] /lib32/libc.so.6(fread+0x38)[0xf7b37f78] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804d57f] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804d712] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804e447] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804e83d] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8053dff] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x80504d2] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051b0e] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x8051d8b] /lib32/libc.so.6(__libc_start_main+0xf1)[0xf7aeae81] ../../projects/climateprediction.net/hadam4_8.52_i686-pc-linux-gnu[0x804cd21] Exiting... OPEN: File Creation Failed: No such file or directory OPEN: Unable to Open File dataout/a21mga.pbl5jan for Read/Write Model crashed: STWORK : Error opening output PP file on unit 61 tmp/xnnuj.pipe_dummy cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts forrtl: Bad file descriptor forrtl: severe (30): open failure, unit 6, file /proc/7227/fd/ Image PC Routine Line Source hadam4_um_8.52_i6 083F6605 Unknown Unknown Unknown hadam4_um_8.52_i6 0843E0A0 Unknown Unknown Unknown hadam4_um_8.52_i6 081DF6C9 Unknown Unknown Unknown hadam4_um_8.52_i6 0836E63F Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts forrtl: Bad file descriptor forrtl: severe (30): open failure, unit 6, file /proc/7237/fd/ Image PC Routine Line Source hadam4_um_8.52_i6 083F6605 Unknown Unknown Unknown hadam4_um_8.52_i6 0843E0A0 Unknown Unknown Unknown hadam4_um_8.52_i6 081DF6C9 Unknown Unknown Unknown hadam4_um_8.52_i6 0836E63F Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.ihist after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/jobs/xnnuj.namelists after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadam4h_a21m_201411_4_842_011907106/dataout/atmos_restart.day after 11 attempts forrtl: Bad file descriptor forrtl: severe (30): open failure, unit 6, file /proc/7245/fd/ Image PC Routine Line Source hadam4_um_8.52_i6 083F6605 Unknown Unknown Unknown hadam4_um_8.52_i6 0843E0A0 Unknown Unknown Unknown hadam4_um_8.52_i6 081DF6C9 Unknown Unknown Unknown hadam4_um_8.52_i6 0836E63F Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2237, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Jan 2020 17:30:34 | 1492959 | 21861436 | hadam4h_a21m_201411_4_842_011907106_2 | 17,483 | 776,632 | 44.4221 |
06 Jan 2020 22:46:13 | 1492959 | 21861436 | hadam4h_a21m_201411_4_842_011907106_2 | 8,843 | 411,629 | 46.5486 |
©2024 cpdn.org