Name | wah2_eu25_n3y5_201712_13_801_011787043_0 |
Workunit | 11787043 |
Created | 14 Mar 2019, 10:18:16 UTC |
Sent | 17 Mar 2019, 8:31:33 UTC |
Report deadline | 27 Feb 2020, 13:51:33 UTC |
Received | 22 Mar 2019, 9:53:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1273646 |
Run time | 2 days 14 hours 57 min 32 sec |
CPU time | 2 days 12 hours 46 min 25 sec |
Validate state | Invalid |
Credit | 3,059.47 |
Device peak FLOPS | 4.09 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 351.80 MB |
Peak swap size | 279.38 MB |
Peak disk usage | 0.02 MB |
Stderr | <core_client_version>7.14.2</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6420, selfPID=6420, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8208, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 21:40:27 (816): BOINC client no longer exists - exiting 21:40:27 (816): timer handler: client dead, exiting 21:40:37 (816): BOINC client no longer exists - exiting 21:40:37 (816): timer handler: client dead, exiting 21:40:47 (816): BOINC client no longer exists - exiting 21:40:47 (816): timer handler: client dead, exiting 21:40:57 (816): BOINC client no longer exists - exiting 21:40:57 (816): timer handler: client dead, exiting 21:41:07 (816): BOINC client no longer exists - exiting 21:41:07 (816): timer handler: client dead, exiting 21:41:17 (816): BOINC client no longer exists - exiting 21:41:17 (816): timer handler: client dead, exiting 21:41:27 (816): BOINC client no longer exists - exiting 21:41:27 (816): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:41:37 (816)CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7636, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8744, selfPID=7532, iMonCtr=1 Model crash detected, will try to restart... 17:10:52 (7188): BOINC client no longer exists - exiting 17:10:52 (7188): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:10:52 (11088): Can't acquire lockfile (32) - waiting 35s 17:11:02 (7188): BOINC client no longer exists - exiting 17:11:17 (7188): timer handler: client dead, exiting 17:11:27 (7188)::11:27 (11088): Can't acquire lockfile (32) - exiting 17:11:28 (11088): Error: The process cannot access the file because it is being used by another process. (0x20) Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7952, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7604, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9492, iMonCtr=2 Model crash detected, will try to restart... GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9868, selfPID=4636, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8516, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9068, iMonCtr=2 Leaving CPDN_ain::Monitor... cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_eu25_n3y5_201712_13_801_011787043/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_eu25_n3y5_201712_13_801_011787043/dataout/region_restart.day after 11 attempts Model crash : Model crashed: f file in READ from history file for namelist NLIHISTO tmp/xadae.pipe_dummy 2048 Leaving CPDN_ain::Monitor... 08:52:45 (10220): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_5.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_6.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_7.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_8.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Mar 2019 20:14:33 | 1273646 | 21563288 | wah2_eu25_n3y5_201712_13_801_011787043_0 | 46,379 | 206,067 | 4.4431 |
20 Mar 2019 23:00:19 | 1273646 | 21563288 | wah2_eu25_n3y5_201712_13_801_011787043_0 | 34,859 | 157,180 | 4.5090 |
20 Mar 2019 08:54:32 | 1273646 | 21563288 | wah2_eu25_n3y5_201712_13_801_011787043_0 | 23,339 | 108,882 | 4.6652 |
19 Mar 2019 09:09:18 | 1273646 | 21563288 | wah2_eu25_n3y5_201712_13_801_011787043_0 | 11,819 | 57,901 | 4.8990 |
©2024 cpdn.org