climateprediction.net home page
Task 22461019

Task 22461019

Name wah2_eas25_a1va_199712_24_1020_012303549_0
Workunit 12303549
Created 22 Jul 2024, 12:49:52 UTC
Sent 22 Jul 2024, 13:42:50 UTC
Report deadline 30 Oct 2024, 13:42:50 UTC
Received 28 Jul 2024, 13:47:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1550966
Run time 2 days 19 hours 32 min 39 sec
CPU time 2 days 19 hours 32 min 39 sec
Validate state Invalid
Credit 2,506.49
Device peak FLOPS 4.86 GFLOPS
Application version Weather At Home 2 (wah2) (region independent) v8.32
windows_intelx86
Peak working set size 338.00 MB
Peak swap size 307.29 MB
Peak disk usage 94.98 MB
Stderr
<core_client_version>7.24.1</core_client_version>
<![CDATA[
<stderr_txt>
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
modelGetExecutables: unzipping control files : strInput & strTmp 
wah2_eas25_a1va_199712_24_1020_012303549.zip
wah2_eas25_a1va_199712_24_1020_012303549/jobs
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
executeModelProcess: MonID=10196, GCM_PID=4304, RCM_PID=10372
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=13360, GCM_PID=14172, RCM_PID=10436
Queuing intermediate upload for CPDN/BOINC: cpdnout1.zip
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
executeModelProcess: MonID=2676, GCM_PID=1872, RCM_PID=6908
Queuing intermediate upload for CPDN/BOINC: cpdnout2.zip
Global Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 1872, selfPID = 1872, iMonCtr = 1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 1872, selfPID = 6908, iMonCtr = 1
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=13640, GCM_PID=10744, RCM_PID=13460
Queuing intermediate upload for CPDN/BOINC: cpdnout3.zip
06:10:06 (13640): BOINC client no longer exists - exiting
06:10:06 (13640): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=11352, GCM_PID=7100, RCM_PID=8144
BUFFOUT: C I/O Error - Return code = 1

Model crashed: 
Leaving CPDN_Main::Monitor...
monitor:finished called ... tidying up.
monitor:finished: Uploading out files...
modelGetExecutables: check control files, strTemp0 & 1 : 
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_a1va_199712_24_1020_012303549/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549 generic_phase1_spinup_eas25_global_aabaka_f ic19610912_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_hist_N96_1989_2000v2 oxi.addfa ozone_hist_N96_1989_2000v2
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a1va_199712_24_1020_012303549
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=11096, GCM_PID=7596, RCM_PID=16844
Global Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 16844, selfPID = 7596, iMonCtr = 2
Controller:: CPDN process is not running, exiting, bRetVal = T, checkPID = 7596, selfPID = 11096, iMonCtr = 1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
monitor:finished called ... tidying up.
monitor:finished: Uploading out files...
Queuing intermediate upload for CPDN/BOINC: cpdnout_out.zip
Detaching shared memory... Done.
monitor:finished: Closed output file : stdout_<>.txt
modelResultFiles : Removing : wah2_eas25_a1va_199712_24_1020_012303549 in C:\ProgramData\BOINC/projects/climateprediction.net
monitor:finished: handing over to boinc_finish(RetVal=0)
22:04:28 (11096): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_4.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a1va_199712_24_1020_012303549_0_r1768486010_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Jul 2024 12:03:26 1550966 22461019 wah2_eas25_a1va_199712_24_1020_012303549_0 34,859 208,725 5.9877
24 Jul 2024 09:57:50 1550966 22461019 wah2_eas25_a1va_199712_24_1020_012303549_0 23,339 124,205 5.3218
23 Jul 2024 15:16:55 1550966 22461019 wah2_eas25_a1va_199712_24_1020_012303549_0 11,819 61,699 5.2203


©2024 cpdn.org