Name | hadam3p_pnw_xah7_2007_1_010141235_0 |
Workunit | 10102635 |
Created | 21 Aug 2015, 17:02:25 UTC |
Sent | 23 Aug 2015, 10:03:32 UTC |
Report deadline | 4 Aug 2016, 15:23:32 UTC |
Received | 11 Oct 2015, 7:19:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1207471 |
Run time | 1 days 12 hours 33 min 26 sec |
CPU time | 1 days 7 hours 13 min 18 sec |
Validate state | Invalid |
Credit | 757.44 |
Device peak FLOPS | 3.48 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>7.6.9</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3356, selfPID=3056, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5924, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=168, selfPID=4248, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2008, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4260, selfPID=4328, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4932, selfPID=4244, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2632, selfPID=2636, iMonCtr=1 Model crash detected, will try to restart... GGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3472, selfPID=4216, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1884, selfPID=4208, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=4136, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5796, selfPID=4768, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2092, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=600, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5068, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5096, selfPID=3244, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=4432, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:34:19 (4432): called boinc_finish(0) </stderr_txt><message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_xah7_2007_1_010141235_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Sep 2015 12:11:37 | 1207471 | 18850704 | hadam3p_pnw_xah7_2007_1_010141235_0 | 34,859 | 91,264 | 2.6181 |
26 Sep 2015 08:01:00 | 1207471 | 18850704 | hadam3p_pnw_xah7_2007_1_010141235_0 | 23,339 | 59,411 | 2.5456 |
12 Sep 2015 06:23:34 | 1207471 | 18850704 | hadam3p_pnw_xah7_2007_1_010141235_0 | 11,819 | 27,107 | 2.2935 |
©2024 cpdn.org