Name | hadam3p_eu_2rvn_1985_1_007293985_0 |
Workunit | 7491259 |
Created | 15 Jun 2011, 3:11:14 UTC |
Sent | 15 Jun 2011, 3:11:26 UTC |
Report deadline | 27 May 2012, 8:31:26 UTC |
Received | 3 Aug 2011, 19:51:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1105901 |
Run time | 5 days 5 hours 33 min 30 sec |
CPU time | 4 days 12 hours 51 min 14 sec |
Validate state | Invalid |
Credit | 2,187.67 |
Device peak FLOPS | 2.33 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6388, selfPID=9704, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7072, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4064, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3476, selfPID=6968, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7652, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4600, selfPID=748, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:03:09 (1872): No heartbeat from core client for 30 sec - exiting 10:03:10 (1872): No heartbeat from core client for 30 sec - exiting 10:03:11 (1872): No heartbeat from core client for 30 sec - exiting 10:03:12 (1872): No heartbeat from core client for 30 sec - exiting 10:03:13 (1872): No heartbeat from core client for 30 sec - exiting 10:03:14 (1872): No heartbeat from core client for 30 sec - exiting 10:03:15 (1872): No heartbeat from core client for 30 sec - exiting 10:03:16 (1872): No heartbeat from core client for 30 sec - exiting 10:03:17 (1872): No heartbeat from core client for 30 sec - exiting 10:03:18 (1872): No heartbeat from core client for 30 sec - exiting 10:03:19 (1872): No heartbeat from core client for 30 sec - exiting 10:03:20 (1872): No heartbeat from core client for 30 sec - exiting 10:03:21 (1872): No heartbeat from core client for 30 sec - exiting 10:03:22 (1872): No heartbeat from core client for 30 sec - exiting 10:03:23 (1872): No heartbeat from core client for 30 sec - exiting 10:03:24 (1872): No heartbeat from core client for 30 sec - exiting 10:03:25 (1872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:31:29 (4928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4636, selfPID=4636, iMonCtr=2 23:31:31 (4928): No heartbeat from core client for 30 sec - exiting 23:32:06 (6740): No heartbeat from core client for 30 sec - exiting 23:32:07 (6740): No heartbeat from core client for 30 sec - exiting 23:32:08 (6740): No heartbeat from core client for 30 sec - exiting 23:32:09 (6740): No heartbeat from core client for 30 sec - exiting 23:32:10 (6740): No heartbeat from core client for 30 sec - exiting 23:32:11 (6740): No heartbeat from core client for 30 sec - exiting 23:32:12 (6740): No heartbeat from core client for 30 sec - exiting 23:32:13 (6740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5376, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7508, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8660, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5320, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6444, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2rvn_1985_1_007293985\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 011DC52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01184460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0118362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01162469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 010666EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 01102AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 011035AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 00EA9860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 011C0893 Unknown Unknown Unknown kernel32.dll 75943677 Unknown Unknown Unknown ntdll.dll 774C9F02 Unknown Unknown Unknown ntdll.dll 774C9ED5 Unknown Unknown Unknown forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2rvn_1985_1_007293985\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 0043A39A Unknown Unknown Unknown hadam3p_eu_um_6.0 003E2CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 003E1E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 003C2819 Unknown Unknown Unknown hadam3p_eu_um_6.0 002C2287 Unknown Unknown Unknown hadam3p_eu_um_6.0 0035E7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0035F2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 000D9BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0041E638 Unknown Unknown Unknown kernel32.dll 75943677 Unknown Unknown Unknown ntdll.dll 774C9F02 Unknown Unknown Unknown ntdll.dll 774C9ED5 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3428, selfPID=6240, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_2rvn_1985_1_007293985_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Jul 2011 05:32:40 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 126,816 | 363,732 | 2.8682 |
25 Jul 2011 22:29:17 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 115,296 | 329,345 | 2.8565 |
25 Jul 2011 17:58:14 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 103,776 | 296,879 | 2.8608 |
25 Jul 2011 15:02:06 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 92,256 | 263,290 | 2.8539 |
25 Jul 2011 14:04:26 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 80,736 | 230,550 | 2.8556 |
25 Jul 2011 14:04:25 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 69,216 | 197,417 | 2.8522 |
25 Jul 2011 14:04:25 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 57,696 | 164,020 | 2.8428 |
05 Jul 2011 20:21:12 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 46,176 | 132,079 | 2.8603 |
22 Jun 2011 04:08:40 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 34,656 | 98,493 | 2.8420 |
20 Jun 2011 21:32:51 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 23,136 | 64,546 | 2.7899 |
17 Jun 2011 06:54:49 | 1105901 | 12978500 | hadam3p_eu_2rvn_1985_1_007293985_0 | 11,616 | 33,112 | 2.8506 |
©2024 cpdn.org