Name | wah2_safr50_n1t8_201612_16_789_011745024_2 |
Workunit | 11745024 |
Created | 5 Mar 2019, 17:51:12 UTC |
Sent | 5 Mar 2019, 18:25:14 UTC |
Report deadline | 15 Feb 2020, 23:45:14 UTC |
Received | 1 Apr 2019, 14:24:12 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1307851 |
Run time | 6 days 11 hours 35 min 35 sec |
CPU time | 5 days 17 hours 52 min 58 sec |
Validate state | Invalid |
Credit | 6,099.22 |
Device peak FLOPS | 2.34 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 255.07 MB |
Peak swap size | 221.05 MB |
Peak disk usage | 275.41 MB |
Stderr | <core_client_version>7.8.3</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4552, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=600, selfPID=5908, iMonCtr=1 CGntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=2 lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3016, selfPID=2808, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2000, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1844, selfPID=2700, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5280, selfPID=3216, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2720, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2480, selfPID=884, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3748, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4036, selfPID=3348, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=2908, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2848, selfPID=2336, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2452, selfPID=2760, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=892, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2860, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3272, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3296, selfPID=2756, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr= Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2728, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2752, selfPID=2516, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/wah2_safr50_n1t8_201612_16_789_011745024/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/wah2_safr50_n1t8_201612_16_789_011745024/dataout/region_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xadae.pipe_dummy Leaving CPDN_ain::Monitor... 08:23:03 (2924): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_safr50_n1t8_201612_16_789_011745024_2_r972876472_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Mar 2019 14:38:09 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 92,459 | 476,426 | 5.1528 |
26 Mar 2019 19:08:30 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 80,939 | 411,288 | 5.0815 |
21 Mar 2019 15:16:13 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 69,419 | 350,580 | 5.0502 |
19 Mar 2019 13:41:35 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 57,899 | 293,238 | 5.0646 |
14 Mar 2019 20:24:28 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 46,379 | 234,940 | 5.0657 |
14 Mar 2019 02:33:49 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 34,859 | 177,930 | 5.1043 |
12 Mar 2019 16:44:35 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 23,339 | 119,524 | 5.1212 |
08 Mar 2019 15:45:25 | 1307851 | 21519875 | wah2_safr50_n1t8_201612_16_789_011745024_2 | 11,819 | 60,682 | 5.1343 |
©2024 cpdn.org