Name | oifs_43r3_bl_a0y8_2016092300_20_1018_12289559_0 |
Workunit | 12289559 |
Created | 12 Jun 2024, 21:45:05 UTC |
Sent | 14 Jun 2024, 0:00:49 UTC |
Report deadline | 13 Aug 2024, 0:00:49 UTC |
Received | 14 Jun 2024, 1:52:10 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1550568 |
Run time | 18 min 10 sec |
CPU time | 18 min 10 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 9.29 GFLOPS |
Application version | OpenIFS 43r3 Baroclinic Lifecycle v1.13 x86_64-pc-linux-gnu |
Peak working set size | 4,958.27 MB |
Peak swap size | 5,981.07 MB |
Peak disk usage | 262.51 MB |
Stderr | <core_client_version>7.22.1</core_client_version> <![CDATA[ <message> process exited with code 1 (0x1, -255)</message> <stderr_txt> (argv0) ../../projects/climateprediction.net/oifs_43r3_bl_1.13_x86_64-pc-linux-gnu (argv1) start_date: 2016092300 (argv2) exptid: h7zg (argv3) unique_member_id: a0y8 (argv4) batchid: 1018 (argv5) wuid: 12289559 (argv6) fclen: 20 (argv7) app_name: oifs_43r3_bl (argv8) nthreads: 1 Working directory is: /home/worker/slots/7 Project directory is: /home/worker/projects/climateprediction.net/ app name: oifs_43r3_bl version: 1.13 Location of temp folder: /home/worker/projects/climateprediction.net/oifs_43r3_bl_12289559 Copying: /home/worker/projects/climateprediction.net/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip to: /home/worker/slots/7/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip Unzipping the app zip file: /home/worker/slots/7/oifs_43r3_bl_app_1.13_x86_64-pc-linux-gnu.zip Copying the namelist files from: ../../projects/climateprediction.net/jf_ec1538c5b7ce7c05be2e43bd16f9d0b7 to: /home/worker/slots/7/oifs_43r3_bl_a0y8_2016092300_20_1018_12289559.zip Unzipping the namelist zip file: /home/worker/slots/7/oifs_43r3_bl_a0y8_2016092300_20_1018_12289559.zip ic_ancil_file: ic_ancil_12289559 ifsdata_file: ifsdata_12289559 climate_data_file: clim_data_12289559 horiz_resolution: 159 vert_resolution: 91 grid_type: l_2 upload_interval: 96 utstep: 900 nfrres: restart dump frequency (steps) 48 Copying IC ancils from: ../../projects/climateprediction.net/jf_f7ecfd27aa1e68bbe7e8d19800e25c73 to: /home/worker/slots/7/ic_ancil_12289559.zip Unzipping the IC ancils zip file: /home/worker/slots/7/ic_ancil_12289559.zip Copying the ifsdata_file from: ../../projects/climateprediction.net/jf_7d03b27d15900d2ad307ba4174bbcd20 to: /home/worker/slots/7/ifsdata/ifsdata_12289559.zip Unzipping the ifsdata_zip file: /home/worker/slots/7/ifsdata/ifsdata_12289559.zip Copying the climate data file from: ../../projects/climateprediction.net/jf_f32c668cf7829426ffd94699b48b9a2e to: /home/worker/slots/7/159l_2/clim_data_12289559.zip Unzipping the climate data zip file: /home/worker/slots/7/159l_2/clim_data_12289559.zip Checking for progress XML file: /home/worker/slots/7/progress_file_12289559.xml Creating progress file: /home/worker/slots/7/progress_file_12289559.xml last_cpu_time: 0 upload_file_number: 0 last_iter: 0 last_upload: 0 model_completed: 0 total_length_of_simulation: 1728000 result_base_name: oifs_43r3_bl_a0y8_2016092300_20_1018_12289559_0_r1371548700 The child process has been launched with process id: 2368 Executing the command: /home/worker/slots/7/oifs_43r3_model.exe [EC_DRHOOK:hostname:myproc:omptid:pid:unixtid] [YYYYMMDD:HHMMSS:epoch:walltime] [function@file:lineno] -- Max OpenMP threads = 1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2091] fp = 0x314f440 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2098] DR_HOOK_ALLOW_COREDUMP=-1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2104] Hardlimit for core file is now 0 (0x0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2122] DR_HOOK_PROFILE_PROC=-1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2128] DR_HOOK_PROFILE_LIMIT=-10.000 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2194] DR_HOOK_RANDOM_MEMSTAT=0 (RAND_MAX=2147483647) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2205] DR_HOOK_HASHBITS=16 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2213] DR_HOOK_NCALLSTACK=0 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2221] DR_HOOK_HARAKIRI_TIMEOUT=500 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2228] DR_HOOK_TRAPFPE=1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2235] DR_HOOK_TRAPFPE_INVALID=1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2242] DR_HOOK_TRAPFPE_DIVBYZERO=1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [process_options@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:2249] DR_HOOK_TRAPFPE_OVERFLOW=1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [ignore_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1227] DR_HOOK_IGNORE_SIGNALS=<undef> [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [restore_default_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1178] DR_HOOK_RESTORE_DEFAULT_SIGNALS=<undef> [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1937] New signal handler 'signal_drhook' for signal#6 (SIGABRT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1938] New signal handler 'signal_drhook' for signal#7 (SIGBUS) at 0x820cf0 (old at (nil)) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1939] New signal handler 'signal_drhook' for signal#11 (SIGSEGV) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1944] New signal handler 'signal_drhook' for signal#16 (SIGSTKFLT) at 0x820cf0 (old at (nil)) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [trapfpe_treatment@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1149] DR_HOOK enables SIGFPE-related floating point trapping since DRHOOK_TRAPFPE=1 [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1948] New signal handler 'signal_drhook' for signal#8 (SIGFPE) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1949] New signal handler 'signal_drhook' for signal#4 (SIGILL) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1951] New signal handler 'signal_drhook' for signal#5 (SIGTRAP) at 0x820cf0 (old at (nil)) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1952] New signal handler 'signal_drhook' for signal#2 (SIGINT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1960] New signal handler 'signal_drhook' for signal#3 (SIGQUIT) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1961] New signal handler 'signal_drhook' for signal#15 (SIGTERM) at 0x820cf0 (old at 0x1cf8da0) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1966] New signal handler 'signal_drhook' for signal#24 (SIGXCPU) at 0x820cf0 (old at (nil)) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1968] New signal handler 'signal_drhook' for signal#25 (SIGXFSZ) at 0x820cf0 (old at 0x1) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [signal_drhook_init@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1973] New signal handler 'signal_drhook' for signal#31 (SIGSYS) at 0x820cf0 (old at (nil)) [EC_DRHOOK:onto:1:1:2368:2368] [20240614:030103:1718323263:0.000] [catch_signals@/home/abowery/Working_folder/OpenIFS/oifs_43r3_bl/gc_oifs43r3_2/src/ifsaux/support/drhook.c:1111] DR_HOOK_CATCH_SIGNALS=<undef> 04:00:48 STEP 0 H= 0:00 +CPU=522.848 Moving to projects directory: /home/worker/slots/7/ICMGGh7zg+000000 Moving to projects directory: /home/worker/slots/7/ICMSHh7zg+000000 Moving to projects directory: /home/worker/slots/7/ICMUAh7zg+000000 04:01:07 STEP 1 H= 0:15 +CPU= 18.495 04:01:26 STEP 2 H= 0:30 +CPU= 18.709 04:01:45 STEP 3 H= 0:45 +CPU= 18.995 04:02:05 STEP 4 H= 1:00 +CPU= 19.790 04:02:24 STEP 5 H= 1:15 +CPU= 18.116 04:30:32 STEP 6 H= 1:30 +CPU=305.299 ..The child process has been killed with signal: 9 >>> Printing last 70 lines from file: NODE.001_01 AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CLW AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CIW AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CC AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM RFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM SFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CRFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM CSFL AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 GPNORM PFRC AVERAGE MINIMUM MAXIMUM AVE 0.000000000000000E+00 0.000000000000000E+00 0.000000000000000E+00 NSTEP = 0 STEPO A00000000 MAXGPFV : MAX. VALUE = 0.000000000000000E+000 MAXGPFV : MAX. VALUE = 0.000000000000000E+000 NSTEP = 0 SCAN2M_HPOS P IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ICMGGh7zg+000000 MODE=w IO-STREAM CLOSED - ICMGGh7zg+000000 MPI-TASK: 1 - 377620004 BYTES IN 6 RECORDS TRANSFERRED IN 0.0001 SECONDS******** Mbytes/s, TOTAL TIME= 0.0314( 0.3%) NSTEP = 0 STEPO 0AA000000 IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMSHh7zg+000000 MODE=w IO-STREAM CLOSED - ./ICMSHh7zg+000000 MPI-TASK: 1 - 707399250 BYTES IN 202 RECORDS TRANSFERRED IN 0.0070 SECONDS101057.0 Mbytes/s, TOTAL TIME= 0.1394( 5.0%) IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000000 MODE=w IO-STREAM CLOSED - ./ICMUAh7zg+000000 MPI-TASK: 1 - 5214613568 BYTES IN 125 RECORDS TRANSFERRED IN 0.0030 SECONDS******** Mbytes/s, TOTAL TIME= 0.0834( 3.6%) STEPO_FPOS WAS : B00000F00E IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000000 MODE=a IO-STREAM CLOSED - ./ICMUAh7zg+000000 MPI-TASK: 1 - 10 BYTES IN 2 RECORDS TRANSFERRED IN 0.0002 SECONDS 0.1 Mbytes/s, TOTAL TIME= 0.0050( 4.0%) STEPO_FPOS WAS : V00000000Y IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ./ICMUAh7zg+000000 MODE=a IO-STREAM CLOSED - ./ICMUAh7zg+000000 MPI-TASK: 1 - 317333619 BYTES IN 4 RECORDS TRANSFERRED IN 0.0002 SECONDS******** Mbytes/s, TOTAL TIME= 0.0048( 4.2%) STEPO_FPOS WAS : T00000000M NSTEP = 0 STEPO 0AAA00AAA MAXGPFV : MAX. VALUE = 0.000000000000000E+000 MAXGPFV : MAX. VALUE = 0.000000000000000E+000 NSTEP = 0 SCAN2M_HPOS P IO-STREAM SETUP - IOTYPE = 2 NUMIOPROCS = 1 CPATH = ICMGGh7zg+000000 MODE=a IO-STREAM CLOSED - ICMGGh7zg+000000 MPI-TASK: 1 - 325913348 BYTES IN 7 RECORDS TRANSFERRED IN 0.0009 SECONDS362125.9 Mbytes/s, TOTAL TIME= 0.0125( 7.2%) 04:00:48 STEP 0 H= 0:00 +CPU=522.848 NSTEP = 1 STEPO 0AAA00AAA 04:01:07 STEP 1 H= 0:15 +CPU= 18.495 NSTEP = 2 STEPO 0AAA00AAA 04:01:26 STEP 2 H= 0:30 +CPU= 18.709 NSTEP = 3 STEPO 0AAA00AAA 04:01:45 STEP 3 H= 0:45 +CPU= 18.995 NSTEP = 4 STEPO 0AAA00AAA 04:02:05 STEP 4 H= 1:00 +CPU= 19.790 ISTEP= 5 RSOLINC= 1361.4482 RI0= 1365.0000 ZI0= 1361.4482 RIP0= 1361.4482 ISTEP= 5 ZI0= 1361.4482 ZSEASON= 1.000000000 REA=149597870000.00 RDEASO=149597870000.00 NSTEP = 5 STEPO 0AAA00AAA 04:02:24 STEP 5 H= 1:15 +CPU= 18.116 NSTEP = 6 STEPO 0AAA00AAA 04:30:32 STEP 6 H= 1:30 +CPU=305.299 NSTEP = 7 STEPO 0AAA00AAA ------------------------------------------------ CNT0 not found; string returned was: 'STEPO' >>> Printing last 8 lines from file: ifs.stat 04:00:48 0AAA00AAA STEPO 1 518.813 518.813 3576.688 8:46 59:44 0.55515332309566E-07 2GB 0MB 04:01:07 0AAA00AAA STEPO 2 18.494 18.494 18.755 9:05 60:03 0.55515332309566E-07 2GB 0MB 04:01:26 0AAA00AAA STEPO 3 18.709 18.709 18.917 9:24 60:22 0.55515332309566E-07 2GB 0MB 04:01:45 0AAA00AAA STEPO 4 18.996 18.996 19.255 9:43 60:41 0.55515332309566E-07 2GB 0MB 04:02:05 0AAA00AAA STEPO 5 19.790 19.790 20.270 10:02 61:02 0.55515332309566E-07 2GB 0MB 04:02:24 0AAA00AAA STEPO 6 18.116 18.116 18.439 10:21 61:20 0.55515332309566E-07 2GB 0MB 04:30:32 0AAA00AAA STEPO 7 305.301 305.301 1688.049 15:26 89:28 0.55515332309566E-07 2GB 0MB ------------------------------------------------ >>> Printing last 8 lines from file: /home/worker/slots/7/progress_file_12289559.xml <running_values> <last_cpu_time>1089.850000</last_cpu_time> <upload_file_number>0</upload_file_number> <last_iter>7</last_iter> <last_upload>0</last_upload> <model_completed>0</model_completed> </running_values> ------------------------------------------------ ..Failed, model did not complete successfully </stderr_txt> ]]> |
No trickles! |
---|
©2024 cpdn.org