Name | wah2_sam50_a05t_201312_25_881_012034217_0 |
Workunit | 12034217 |
Created | 2 Nov 2020, 12:08:34 UTC |
Sent | 2 Nov 2020, 12:22:35 UTC |
Report deadline | 15 Oct 2021, 17:42:35 UTC |
Received | 24 Nov 2020, 19:13:36 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1510585 |
Run time | 4 days 9 hours 14 min 36 sec |
CPU time | 4 days 4 hours 28 min 51 sec |
Validate state | Invalid |
Credit | 9,138.96 |
Device peak FLOPS | 4.36 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 226.84 MB |
Peak swap size | 188.07 MB |
Peak disk usage | 153.80 MB |
Stderr | <core_client_version>7.16.5</core_client_version> <![CDATA[ <stderr_txt> 06:28:26 (21760): start_timer_thread(): CreateThread() failed, errno 0 06:28:28 (18824): start_timer_thread(): CreateThread() failed, errno 0 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:06:10 (21212): Can't acquire lockfile (32) - waiting 35s 20:06:45 (21212): Can't acquire lockfile (32) - exiting 20:06:45 (21212): Error: The process cannot access the file because it is being used by another process. (0x20) 20:16:51 (20792): Can't acquire lockfile (32) - waiting 35s 20:17:26 (20792): Can't acquire lockfile (32) - exiting 20:17:26 (20792): Error: The process cannot access the file because it is being used by another process. (0x20) 20:27:39 (16448): Can't acquire lockfile (32) - waiting 35s 20:28:14 (16448): Can't acquire lockfile (32) - exiting 20:28:14 (16448): Error: The process cannot access the file because it is being used by another process. (0x20) 20:38:31 (11552): Can't acquire lockfile (32) - waiting 35s 20:39:06 (11552): Can't acquire lockfile (32) - exiting 20:39:06 (11552): Error: The process cannot access the file because it is being used by another process. (0x20) 20:49:14 (20428): Can't acquire lockfile (32) - waiting 35s 20:49:49 (20428): Can't acquire lockfile (32) - exiting 20:49:49 (20428): Error: The process cannot access the file because it is being used by another process. (0x20) 21:00:00 (12316): Can't acquire lockfile (32) - waiting 35s 21:00:35 (12316): Can't acquire lockfile (32) - exiting 21:00:35 (12316): Error: The process cannot access the file because it is being used by another process. (0x20) 21:10:57 (18796): Can't acquire lockfile (32) - waiting 35s 21:11:32 (18796): Can't acquire lockfile (32) - exiting 21:11:32 (18796): Error: The process cannot access the file because it is being used by another process. (0x20) 21:21:37 (22464): Can't acquire lockfile (32) - waiting 35s 21:22:12 (22464): Can't acquire lockfile (32) - exiting 21:22:12 (22464): Error: The process cannot access the file because it is being used by another process. (0x20) 22:42:20 (16176): Can't acquire lockfile (32) - waiting 35s 22:42:55 (16176): Can't acquire lockfile (32) - exiting 22:42:55 (16176): Error: The process cannot access the file because it is being used by another process. (0x20) 00:01:54 (24780): Can't acquire lockfile (32) - waiting 35s 00:02:29 (24780): Can't acquire lockfile (32) - exiting 00:02:29 (24780): Error: The process cannot access the file because it is being used by another process. (0x20) 00:17:16 (23120): Can't acquire lockfile (32) - waiting 35s 00:17:51 (23120): Can't acquire lockfile (32) - exiting 00:17:51 (23120): Error: The process cannot access the file because it is being used by another process. (0x20) 00:27:53 (11928): Can't acquire lockfile (32) - waiting 35s 00:28:28 (11928): Can't acquire lockfile (32) - exiting 00:28:28 (11928): Error: The process cannot access the file because it is being used by another process. (0x20) 00:54:25 (20384): Can't acquire lockfile (32) - waiting 35s 00:55:00 (20384): Can't acquire lockfile (32) - exiting 00:55:00 (20384): Error: The process cannot access the file because it is being used by another process. (0x20) 06:49:42 (19032): Can't acquire lockfile (32) - waiting 35s 06:50:17 (19032): Can't acquire lockfile (32) - exiting 06:50:17 (19032): Error: The process cannot access the file because it is being used by another process. (0x20) 07:04:26 (25684): Can't acquire lockfile (32) - waiting 35s 07:05:01 (25684): Can't acquire lockfile (32) - exiting 07:05:01 (25684): Error: The process cannot access the file because it is being used by another process. (0x20) 07:46:23 (10704): Can't acquire lockfile (32) - waiting 35s 07:46:58 (10704): Can't acquire lockfile (32) - exiting 07:46:58 (10704): Error: The process cannot access the file because it is being used by another process. (0x20) 08:34:15 (24724): Can't acquire lockfile (32) - waiting 35s 08:34:50 (24724): Can't acquire lockfile (32) - exiting 08:34:50 (24724): Error: The process cannot access the file because it is being used by another process. (0x20) 08:54:45 (16936): Can't acquire lockfile (32) - waiting 35s 08:55:20 (16936): Can't acquire lockfile (32) - exiting 08:55:20 (16936): Error: The process cannot access the file because it is being used by another process. (0x20) 11:28:09 (26172): Can't acquire lockfile (32) - waiting 35s 11:28:44 (26172): Can't acquire lockfile (32) - exiting 11:28:44 (26172): Error: The process cannot access the file because it is being used by another process. (0x20) 12:02:33 (19332): Can't acquire lockfile (32) - waiting 35s 12:03:08 (19332): Can't acquire lockfile (32) - exiting 12:03:08 (19332): Error: The process cannot access the file because it is being used by another process. (0x20) 13:06:03 (12536): Can't acquire lockfile (32) - waiting 35s 13:06:38 (12536): Can't acquire lockfile (32) - exiting 13:06:38 (12536): Error: The process cannot access the file because it is being used by another process. (0x20) 13:32:18 (19040): Can't acquire lockfile (32) - waiting 35s 13:32:53 (19040): Can't acquire lockfile (32) - exiting 13:32:53 (19040): Error: The process cannot access the file because it is being used by another process. (0x20) CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3808, selfPID=20340, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a05t_201312_25_881_012034217/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a05t_201312_25_881_012034217/dataout/region_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=18240, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout10.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout11.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout12.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout2.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout3.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout4.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout5.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout6.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout7.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout8.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout9.zip 12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout_out.zip 12:14:01 (18240): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_25.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Nov 2020 18:59:33 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 138,539 | 333,061 | 2.4041 |
20 Nov 2020 10:28:57 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 127,019 | 306,056 | 2.4095 |
16 Nov 2020 10:24:34 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 115,499 | 273,703 | 2.3697 |
15 Nov 2020 12:55:19 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 103,979 | 245,686 | 2.3628 |
14 Nov 2020 12:15:04 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 92,459 | 215,367 | 2.3293 |
12 Nov 2020 13:58:25 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 57,899 | 134,996 | 2.3316 |
12 Nov 2020 04:55:36 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 46,379 | 106,175 | 2.2893 |
12 Nov 2020 02:53:09 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 34,859 | 80,284 | 2.3031 |
12 Nov 2020 02:53:09 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 23,339 | 58,618 | 2.5116 |
11 Nov 2020 08:24:37 | 1510585 | 21959746 | wah2_sam50_a05t_201312_25_881_012034217_0 | 11,819 | 33,132 | 2.8033 |
©2024 cpdn.org