climateprediction.net home page
Task 22462163

Task 22462163

Name wah2_eas25_a2r2_200212_24_1020_012304693_0
Workunit 12304693
Created 22 Jul 2024, 13:00:11 UTC
Sent 22 Jul 2024, 16:47:05 UTC
Report deadline 30 Oct 2024, 16:47:05 UTC
Received 3 Aug 2024, 5:09:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1430197
Run time 4 days 13 hours 56 min 5 sec
CPU time 3 days 15 hours 14 min 24 sec
Validate state Invalid
Credit 3,334.82
Device peak FLOPS 3.90 GFLOPS
Application version Weather At Home 2 (wah2) (region independent) v8.32
windows_intelx86
Peak working set size 345.88 MB
Peak swap size 307.32 MB
Peak disk usage 95.90 MB
Stderr
<core_client_version>7.22.2</core_client_version>
<![CDATA[
<stderr_txt>
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
modelGetExecutables: unzipping control files : strInput & strTmp 
wah2_eas25_a2r2_200212_24_1020_012304693.zip
wah2_eas25_a2r2_200212_24_1020_012304693/jobs
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=17512, GCM_PID=16024, RCM_PID=20460
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:28:44 (17512): BOINC client no longer exists - exiting
13:28:44 (17512): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=24472, GCM_PID=23772, RCM_PID=11020
Queuing intermediate upload for CPDN/BOINC: cpdnout1.zip
Queuing intermediate upload for CPDN/BOINC: cpdnout2.zip
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=23936, GCM_PID=27080, RCM_PID=13692
20:02:31 (23936): BOINC client no longer exists - exiting
20:02:31 (23936): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=12716, GCM_PID=10160, RCM_PID=22720
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Detaching shared memory... Done.
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=9944, GCM_PID=13896, RCM_PID=21708
Queuing intermediate upload for CPDN/BOINC: cpdnout3.zip
Queuing intermediate upload for CPDN/BOINC: cpdnout4.zip
Suspended CPDN Monitor - Suspend request from BOINC...
modelGetExecutables: check control files, strTemp0 & 1 : 
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists
S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f
global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2
regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=3348, GCM_PID=15144, RCM_PID=8748
Controller:: CPDN process is not running, exiting, bRetVal = T, checkPID = 8748, selfPID = 3348, iMonCtr = 2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
monitor:finished called ... tidying up.
monitor:finished: Uploading out files...
Queuing intermediate upload for CPDN/BOINC: cpdnout_out.zip
Detaching shared memory... Done.
monitor:finished: Closed output file : stdout_<>.txt
modelResultFiles : Removing : wah2_eas25_a2r2_200212_24_1020_012304693 in S:\BOINCdata/projects/climateprediction.net
monitor:finished: handing over to boinc_finish(RetVal=0)
06:56:07 (3348): called boinc_finish(0)

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Aug 2024 15:04:08 1430197 22462163 wah2_eas25_a2r2_200212_24_1020_012304693_0 46,379 284,476 6.1337
01 Aug 2024 12:42:50 1430197 22462163 wah2_eas25_a2r2_200212_24_1020_012304693_0 34,859 214,758 6.1608
30 Jul 2024 19:09:08 1430197 22462163 wah2_eas25_a2r2_200212_24_1020_012304693_0 23,339 143,471 6.1473
29 Jul 2024 16:30:12 1430197 22462163 wah2_eas25_a2r2_200212_24_1020_012304693_0 11,819 68,649 5.8084


©2024 cpdn.org