Name | oifs_43r3_bl_a239_2016092300_20_1018_12291036_0 |
Workunit | 12291036 |
Created | 12 Jun 2024, 23:57:51 UTC |
Sent | 14 Jun 2024, 9:44:24 UTC |
Report deadline | 13 Aug 2024, 9:44:24 UTC |
Received | 14 Jun 2024, 10:45:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1549254 |
Run time | 10 min 30 sec |
CPU time | 9 min 20 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | OpenIFS 43r3 Baroclinic Lifecycle v1.13 x86_64-pc-linux-gnu |
Peak working set size | 4.88 GB |
Peak swap size | 5.84 GB |
Peak disk usage | 248.91 MB |
Stderr | <core_client_version>7.20.2</core_client_version> <![CDATA[ <message> process exited with code 1 (0x1, -255)</message> <stderr_txt> (argv0) ../../projects/climateprediction.net/oifs_43r3_bl_1.13_x86_64-pc-linux-gnu (argv1) start_date: 2016092300 (argv2) exptid: h7zg (argv3) unique_member_id: a239 (argv4) batchid: 1018 (argv5) wuid: 12291036 (argv6) fclen: 20 (argv7) app_name: oifs_43r3_bl (argv8) nthreads: 1 Working directory is: /public/home/liupei/slots/5 Project directory is: /public/home/liupei/projects/climateprediction.net/ app name: oifs_43r3_bl version: 1.13 Location of temp folder: /public/home/liupei/projects/climateprediction.net/oifs_43r3_bl_12291036 Copying: /public/home/liupei/projects/climateprediction.net/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip to: /public/home/liupei/slots/5/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip Unzipping the app zip file: /public/home/liupei/slots/5/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip Copying the namelist files from: ../../projects/climateprediction.net/jf_937ae3edf025b4c5d97948ff2de40583 to: /public/home/liupei/slots/5/oifs_43r3_bl_a239_2016092300_20_1018_12291036.zip Unzipping the namelist zip file: /public/home/liupei/slots/5/oifs_43r3_bl_a239_2016092300_20_1018_12291036.zip ic_ancil_file: ic_ancil_12291036 ifsdata_file: ifsdata_12291036 climate_data_file: clim_data_12291036 horiz_resolution: 159 vert_resolution: 91 grid_type: l_2 upload_interval: 96 utstep: 900 nfrres: restart dump frequency (steps) 48 Copying IC ancils from: ../../projects/climateprediction.net/jf_f7ecfd27aa1e68bbe7e8d19800e25c73 to: /public/home/liupei/slots/5/ic_ancil_12291036.zip Unzipping the IC ancils zip file: /public/home/liupei/slots/5/ic_ancil_12291036.zip Copying the ifsdata_file from: ../../projects/climateprediction.net/jf_470211db9cba2dad38fda10731bcca4f to: /public/home/liupei/slots/5/ifsdata/ifsdata_12291036.zip Unzipping the ifsdata_zip file: /public/home/liupei/slots/5/ifsdata/ifsdata_12291036.zip Copying the climate data file from: ../../projects/climateprediction.net/jf_f32c668cf7829426ffd94699b48b9a2e to: /public/home/liupei/slots/5/159l_2/clim_data_12291036.zip Unzipping the climate data zip file: /public/home/liupei/slots/5/159l_2/clim_data_12291036.zip Checking for progress XML file: /public/home/liupei/slots/5/progress_file_12291036.xml Creating progress file: /public/home/liupei/slots/5/progress_file_12291036.xml last_cpu_time: 0 upload_file_number: 0 last_iter: 0 last_upload: 0 model_completed: 0 total_length_of_simulation: 1728000 result_base_name: oifs_43r3_bl_a239_2016092300_20_1018_12291036_0_r888513243 The child process has been launched with process id: 64507 Executing the command: /public/home/liupei/slots/5/oifs_43r3_model.exe [EC_DRHOOK:hostname:myproc:omptid:pid:unixtid] [YYYYMMDD:HHMMSS:epoch:walltime] [function@file:lineno] -- Max OpenMP threads = 1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2091] fp = 0x314f440 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2098] DR_HOOK_ALLOW_COREDUMP=-1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2104] Hardlimit for core file is now 0 (0x0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2122] DR_HOOK_PROFILE_PROC=-1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2128] DR_HOOK_PROFILE_LIMIT=-10.000 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2194] DR_HOOK_RANDOM_MEMSTAT=0 (RAND_MAX=2147483647) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2205] DR_HOOK_HASHBITS=16 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2213] DR_HOOK_NCALLSTACK=0 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2221] DR_HOOK_HARAKIRI_TIMEOUT=500 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2228] DR_HOOK_TRAPFPE=1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2235] DR_HOOK_TRAPFPE_INVALID=1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2242] DR_HOOK_TRAPFPE_DIVBYZERO=1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2249] DR_HOOK_TRAPFPE_OVERFLOW=1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.000] [ignore_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1227] DR_HOOK_IGNORE_SIGNALS=<undef> [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [restore_default_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1178] DR_HOOK_RESTORE_DEFAULT_SIGNALS=<undef> [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1937] New signal handler 'signal_drhook' for signal#6 (SIGABRT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1938] New signal handler 'signal_drhook' for signal#7 (SIGBUS) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1939] New signal handler 'signal_drhook' for signal#11 (SIGSEGV) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1944] New signal handler 'signal_drhook' for signal#16 (SIGSTKFLT) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [trapfpe_treatment@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1149] DR_HOOK enables SIGFPE-related floating point trapping since DRHOOK_TRAPFPE=1 [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1948] New signal handler 'signal_drhook' for signal#8 (SIGFPE) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1949] New signal handler 'signal_drhook' for signal#4 (SIGILL) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1951] New signal handler 'signal_drhook' for signal#5 (SIGTRAP) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1952] New signal handler 'signal_drhook' for signal#2 (SIGINT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1960] New signal handler 'signal_drhook' for signal#3 (SIGQUIT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1961] New signal handler 'signal_drhook' for signal#15 (SIGTERM) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1966] New signal handler 'signal_drhook' for signal#24 (SIGXCPU) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1968] New signal handler 'signal_drhook' for signal#25 (SIGXFSZ) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1973] New signal handler 'signal_drhook' for signal#31 (SIGSYS) at 0x820cf0 (old at (nil)) [EC_DRHOOK:admin:1:1:64507:64507] [20240614:174746:1718358466:0.001] [catch_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1111] DR_HOOK_CATCH_SIGNALS=<undef> 17:48:50 STEP 0 H= 0:00 +CPU= 52.926 Moving to projects directory: /public/home/liupei/slots/5/ICMGGh7zg+000000 Moving to projects directory: /public/home/liupei/slots/5/ICMSHh7zg+000000 Moving to projects directory: /public/home/liupei/slots/5/ICMUAh7zg+000000 17:49:34 STEP 1 H= 0:15 +CPU= 43.674 17:50:18 STEP 2 H= 0:30 +CPU= 43.432 17:51:00 STEP 3 H= 0:45 +CPU= 42.503 17:51:45 STEP 4 H= 1:00 +CPU= 43.793 17:52:28 STEP 5 H= 1:15 +CPU= 42.956 17:53:12 STEP 6 H= 1:30 +CPU= 44.196 17:53:58 STEP 7 H= 1:45 +CPU= 44.896 17:54:43 STEP 8 H= 2:00 +CPU= 45.033 17:55:27 STEP 9 H= 2:15 +CPU= 43.559 17:56:13 STEP 10 H= 2:30 +CPU= 45.405 17:56:57 STEP 11 H= 2:45 +CPU= 44.100 forrtl: severe (41): insufficient virtual memory Image PC Routine Line Source oifs_43r3_model.e 0000000001D395D4 Unknown Unknown Unknown oifs_43r3_model.e 00000000012C4E48 gp_model_heap_ 72 gp_model_heap.F90 oifs_43r3_model.e 0000000000BACBAE scan2m_ 535 scan2m.F90 oifs_43r3_model.e 0000000000688153 stepo_ 327 stepo.F90 oifs_43r3_model.e 00000000004CD1DC cnt4_ 1133 cnt4.F90 oifs_43r3_model.e 00000000004B5D85 cnt3_ 267 cnt3.F90 oifs_43r3_model.e 00000000004B4F52 cnt2_ 88 cnt2.F90 oifs_43r3_model.e 000000000049D638 cnt1_ 92 cnt1.F90 oifs_43r3_model.e 0000000000406FBD cnt0_ 146 cnt0.F90 oifs_43r3_model.e 0000000000401E7F MAIN__ 96 master.F90 oifs_43r3_model.e 0000000000401E12 Unknown Unknown Unknown oifs_43r3_model.e 0000000001DCC34A Unknown Unknown Unknown oifs_43r3_model.e 0000000001DCDBE7 Unknown Unknown Unknown oifs_43r3_model.e 0000000000401CD5 Unknown Unknown Unknown ..The child process terminated with status: 41 >>> Printing last 70 lines from file: NODE.001_01 17:56:13 STEP 10 H= 2:30 +CPU= 45.405 NSTEP = 11 STEPO 0AAA00AAA 17:56:57 STEP 11 H= 2:45 +CPU= 44.100 DATE= 2016 9 23 ISTEP= 12: Reset RSOLINC= 1361.4482 RIP0= 1361.4482 SU_GHGCLIM, RCARDI,RCH4,RN2O,RCFC11,RCFC12,RNO2: 6.175516699499999E-004 1.087291259320000E-006 4.989444016862500E-007 3.778881179906250E-009 2.099153872040000E-009 7.940283000000001E-011 suecaec NAERMACC = 1 UPDT RAESHDU = 2000.0 SUECAEC: GLOBAL AVERAGE VOLCANIC SULPHATE FOR (KINDAT,KMINUT) 20160923 270 9.999999999999998E-005 GPNORM HUMIDITY AVERAGE MINIMUM MAXIMUM AVE 0.174506683621949E-02 0.999999999999995E-08 0.152062680928081E-01 GPNORM LIQUID WATER AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM ICE WATER AVERAGE MINIMUM MAXIMUM AVE 0.171470708449327E-08 0.000000000000000E+00 0.236820451115556E-06 GPNORM SNOW AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM RAIN AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CLOUD FRACTION AVERAGE MINIMUM MAXIMUM AVE 0.123582501006338E-01 0.000000000000000E+00 0.100000000000000E+01 GPNORM CLW AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CIW AVERAGE MINIMUM MAXIMUM AVE 0.152915915117804E-08 0.000000000000000E+00 0.236800889561465E-06 GPNORM CC AVERAGE MINIMUM MAXIMUM AVE 0.104788871359060E-01 0.000000000000000E+00 0.100000000000000E+01 GPNORM RFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM SFL AVERAGE MINIMUM MAXIMUM AVE 0.345159118827507E-11 0.000000000000000E+00 0.198737264152240E-09 GPNORM CRFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CSFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM PFRC AVERAGE MINIMUM MAXIMUM AVE 0.132212659471247E-01 0.000000000000000E+00 0.100000000000000E+01 NSTEP = 12 STEPO A00000000 MAXGPFV : MAX. VALUE = 0.000000000000000E+000 MAXGPFV : MAX. VALUE = 0.000000000000000E+000 NSTEP = 12 SCAN2M_HPOS P IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ICMGGh7zg+000012 MODE=w IO-STREAM CLOSED - ICMGGh7zg+000012 MPI-TASK: 1 - 310443604 BYTES IN 6 RECORDS TRANSFERRED IN 0.0004 SECONDS776109.0 Mbytes/s, TOTAL TIME= 0.0146( 2.7%) NSTEP = 12 STEPO 0AA000000 IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMSHh7zg+000012 MODE=w IO-STREAM CLOSED - ./ICMSHh7zg+000012 MPI-TASK: 1 - 707399250 BYTES IN 202 RECORDS TRANSFERRED IN 0.0132 SECONDS 53590.9 Mbytes/s, TOTAL TIME= 0.3345( 3.9%) IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000012 MODE=w IO-STREAM CLOSED - ./ICMUAh7zg+000012 MPI-TASK: 1 - 4967097664 BYTES IN 125 RECORDS TRANSFERRED IN 0.0041 SECONDS******** Mbytes/s, TOTAL TIME= 0.1458( 2.8%) STEPO_FPOS WAS : B00000F00E IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000012 MODE=a IO-STREAM CLOSED - ./ICMUAh7zg+000012 MPI-TASK: 1 - 10 BYTES IN 2 RECORDS TRANSFERRED IN 0.0003 SECONDS 0.0 Mbytes/s, TOTAL TIME= 0.0085( 3.5%) STEPO_FPOS WAS : V00000000Y IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000012 MODE=a IO-STREAM CLOSED - ./ICMUAh7zg+000012 MPI-TASK: 1 - 301863875 BYTES IN 4 RECORDS TRANSFERRED IN 0.0003 SECONDS******** Mbytes/s, TOTAL TIME= 0.0087( 3.4%) STEPO_FPOS WAS : T00000000M NSTEP = 12 STEPO 0AAA00AAA ------------------------------------------------ CNT0 not found; string returned was: 'STEPO' >>> Printing last 8 lines from file: ifs.stat 17:56:13 0AAA00AAA STEPO 11 45.405 45.405 45.786 8:22 8:26 0.55515332309566E-07 2GB 0MB 17:56:58 A00000000 STEPO 12 44.611 44.611 44.890 9:07 9:11 0.55515332309566E-07 2GB 0MB 17:56:58 0AA000000 STEPO 12 0.064 0.064 0.064 9:07 9:11 0.55515332309566E-07 2GB 0MB 17:57:00 FULLPOS-B DYNFPOS 12 2.522 2.522 2.543 9:10 9:14 0.55515332309566E-07 2GB 0MB 17:57:06 FULLPOS-V DYNFPOS 12 5.342 5.342 5.367 9:15 9:19 0.55515332309566E-07 2GB 0MB 17:57:06 FULLPOS-T DYNFPOS 12 0.763 0.763 0.764 9:16 9:20 0.55515332309566E-07 2GB 0MB 17:57:07 0AAA00AAA STEPO 12 0.682 0.682 0.683 9:16 9:20 0.55515332309566E-07 2GB 0MB ------------------------------------------------ >>> Printing last 8 lines from file: /public/home/liupei/slots/5/progress_file_12291036.xml <running_values> <last_cpu_time>555.990000</last_cpu_time> <upload_file_number>0</upload_file_number> <last_iter>12</last_iter> <last_upload>0</last_upload> <model_completed>0</model_completed> </running_values> ------------------------------------------------ ..Failed, model did not complete successfully </stderr_txt> ]]> |
No trickles! |
---|
©2024 cpdn.org