Name | wah2_eas25_a2r2_200212_24_1020_012304693_0 |
Workunit | 12304693 |
Created | 22 Jul 2024, 13:00:11 UTC |
Sent | 22 Jul 2024, 16:47:05 UTC |
Report deadline | 30 Oct 2024, 16:47:05 UTC |
Received | 3 Aug 2024, 5:09:40 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1430197 |
Run time | 4 days 13 hours 56 min 5 sec |
CPU time | 3 days 15 hours 14 min 24 sec |
Validate state | Invalid |
Credit | 3,334.82 |
Device peak FLOPS | 3.90 GFLOPS |
Application version | Weather At Home 2 (wah2) (region independent) v8.32 windows_intelx86 |
Peak working set size | 345.88 MB |
Peak swap size | 307.32 MB |
Peak disk usage | 95.90 MB |
Stderr | <core_client_version>7.22.2</core_client_version> <![CDATA[ <stderr_txt> modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists modelGetExecutables: unzipping control files : strInput & strTmp wah2_eas25_a2r2_200212_24_1020_012304693.zip wah2_eas25_a2r2_200212_24_1020_012304693/jobs gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=17512, GCM_PID=16024, RCM_PID=20460 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:28:44 (17512): BOINC client no longer exists - exiting 13:28:44 (17512): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Detaching shared memory... Done. modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=24472, GCM_PID=23772, RCM_PID=11020 Queuing intermediate upload for CPDN/BOINC: cpdnout1.zip Queuing intermediate upload for CPDN/BOINC: cpdnout2.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Detaching shared memory... Done. modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=23936, GCM_PID=27080, RCM_PID=13692 20:02:31 (23936): BOINC client no longer exists - exiting 20:02:31 (23936): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Detaching shared memory... Done. modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=12716, GCM_PID=10160, RCM_PID=22720 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Detaching shared memory... Done. modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=9944, GCM_PID=13896, RCM_PID=21708 Queuing intermediate upload for CPDN/BOINC: cpdnout3.zip Queuing intermediate upload for CPDN/BOINC: cpdnout4.zip Suspended CPDN Monitor - Suspend request from BOINC... modelGetExecutables: check control files, strTemp0 & 1 : S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xadae.namelists S:\BOINCdata/projects/climateprediction.net/wah2_eas25_a2r2_200212_24_1020_012304693/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 generic_phase1_spinup_eas25_global_aabaka_f ic19610706_16_N96 ALLclim_ancil_134months_OSTIA_sst_1994-12-01_2006-01-30 ALLclim_ancil_134months_OSTIA_ice_1994-12-01_2006-01-30 so2dms_rcp45_N96_1999_2010 oxi.addfa ozone_rcp45_N96_1999_2010v2 regional model: command string: "S:\BOINCdata/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_a2r2_200212_24_1020_012304693 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=3348, GCM_PID=15144, RCM_PID=8748 Controller:: CPDN process is not running, exiting, bRetVal = T, checkPID = 8748, selfPID = 3348, iMonCtr = 2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... monitor:finished called ... tidying up. monitor:finished: Uploading out files... Queuing intermediate upload for CPDN/BOINC: cpdnout_out.zip Detaching shared memory... Done. monitor:finished: Closed output file : stdout_<>.txt modelResultFiles : Removing : wah2_eas25_a2r2_200212_24_1020_012304693 in S:\BOINCdata/projects/climateprediction.net monitor:finished: handing over to boinc_finish(RetVal=0) 06:56:07 (3348): called boinc_finish(0) </stderr_txt><message> upload failure: <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_5.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_6.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_7.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_8.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a2r2_200212_24_1020_012304693_0_r325651615_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Aug 2024 15:04:08 | 1430197 | 22462163 | wah2_eas25_a2r2_200212_24_1020_012304693_0 | 46,379 | 284,476 | 6.1337 |
01 Aug 2024 12:42:50 | 1430197 | 22462163 | wah2_eas25_a2r2_200212_24_1020_012304693_0 | 34,859 | 214,758 | 6.1608 |
30 Jul 2024 19:09:08 | 1430197 | 22462163 | wah2_eas25_a2r2_200212_24_1020_012304693_0 | 23,339 | 143,471 | 6.1473 |
29 Jul 2024 16:30:12 | 1430197 | 22462163 | wah2_eas25_a2r2_200212_24_1020_012304693_0 | 11,819 | 68,649 | 5.8084 |
©2024 cpdn.org