Name | hadam3p_eu_2kcv_1980_1_007303116_1 |
Workunit | 7500540 |
Created | 28 Jun 2011, 17:47:55 UTC |
Sent | 28 Jun 2011, 17:48:48 UTC |
Report deadline | 9 Jun 2012, 23:08:48 UTC |
Received | 24 Jul 2011, 16:59:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1004102 |
Run time | 3 days 22 hours 44 min 39 sec |
CPU time | 3 days 13 hours 36 min 11 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 2.34 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> 15:27:16 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4280, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2128, selfPID=664, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1644, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6028, selfPID=5228, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=4948, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2864, selfPID=2864, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=444, selfPID=5240, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=4960, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3160, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3412, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3224, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3864, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3360, selfPID=2896, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3224, selfPID=4888, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr= 2 del crash detected, will try to restart... Leaving CPDN_Main::Monitor... forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2kcv_1980_1_007303116\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 013AA39A Unknown Unknown Unknown hadam3p_eu_um_6.0 01352CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 01351E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 01332819 Unknown Unknown Unknown hadam3p_eu_um_6.0 01232287 Unknown Unknown Unknown hadam3p_eu_um_6.0 012CE7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 012CF2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 01049BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0138E638 Unknown Unknown Unknown kernel32.dll 76E4ED6C Unknown Unknown Unknown ntdll.dll 774337F5 Unknown Unknown Unknown ntdll.dll 774337C8 Unknown Unknown Unknown forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2kcv_1980_1_007303116\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 00C6C52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00C14460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00C1362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00BF2469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00AF66EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 00B92AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00B935AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 00939860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00C50893 Unknown Unknown Unknown kernel32.dll 76E4ED6C Unknown Unknown Unknown ntdll.dll 774337F5 Unknown Unknown Unknown ntdll.dll 774337C8 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Jul 2011 22:04:49 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 103,776 | 308,590 | 2.9736 |
25 Jul 2011 20:55:41 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 92,256 | 275,870 | 2.9903 |
25 Jul 2011 19:40:35 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 80,736 | 243,783 | 3.0195 |
25 Jul 2011 19:22:44 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 69,216 | 212,861 | 3.0753 |
25 Jul 2011 19:22:44 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 57,696 | 181,423 | 3.1445 |
25 Jul 2011 18:11:38 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 46,176 | 149,811 | 3.2443 |
25 Jul 2011 17:20:18 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 34,656 | 114,750 | 3.3111 |
25 Jul 2011 14:41:27 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 23,136 | 79,437 | 3.4335 |
25 Jul 2011 14:06:15 | 1004102 | 13022234 | hadam3p_eu_2kcv_1980_1_007303116_1 | 11,616 | 40,438 | 3.4812 |
©2024 cpdn.org