Name | hadam3p_saf_7d24_2005_1_007573861_0 |
Workunit | 7751991 |
Created | 2 Dec 2011, 15:48:38 UTC |
Sent | 16 Dec 2011, 19:31:25 UTC |
Report deadline | 28 Nov 2012, 0:51:25 UTC |
Received | 13 Feb 2012, 22:00:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1180029 |
Run time | 7 days 10 hours 46 min 46 sec |
CPU time | 7 days 10 hours 46 min 46 sec |
Validate state | Invalid |
Credit | 2,057.55 |
Device peak FLOPS | 1.52 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>5.3.19</core_client_version> <stderr_txt> 22:35:24 (5364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:35:25 (5364): No heartbeat from core client for 30 sec - exiting No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6000, selfPID=3220, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=260, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3440, selfPID=3440, iMonCtr=1 00:12:12 (1152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:12:14 (1152): No heartbeat from core client for 30 sec - exiting 00:12:15 (1152): No heartbeat from core client for 30 sec - exiting 00:12:16 (1152): No heartbeat from core client for 30 sec - exiting 00:12:17 (1152): No heartbeat from core client for 30 sec - exiting 00:37:04 (5532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:37:06 (5532): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1456, selfPID=1908, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1456, selfPID=1456, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3188, selfPID=404, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4024, selfPID=4024, iMonCtr=1 No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4024, selfPID=2268, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3816, selfPID=3816, iMonCtr=1 No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5156, selfPID=3832, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2836, selfPID=2836, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:29:03 (1900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=964, selfPID=256, iMonCtr=1 No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2124, selfPID=6112, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2476, selfPID=2476, iMonCtr=1 20:29:07 (5756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2924, selfPID=1884, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 10:50:11 (2192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:50:13 (2192): No heartbeat from core client for 30 sec - exiting 10:50:14 (2192): No heartbeat from core client for 30 sec - exiting 10:50:15 (2192): No heartbeat from core client for 30 sec - exiting 10:50:16 (2192): No heartbeat from core client for 30 sec - exiting 17:50:42 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:50:43 (1120): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5388, selfPID=5388, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=2764, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5420, selfPID=5420, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3096, selfPID=3096, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3096, selfPID=3348, iMonCtr=1 14:21:34 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:21:36 (3520): No heartbeat from core client for 30 sec - exiting 14:21:37 (3520): No heartbeat from core client for 30 sec - exiting 14:21:38 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:42:45 (1552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:15:02 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3504, selfPID=3504, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3408, selfPID=3408, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = No Process Handle 1, checkPID=4148, selfPID=4148, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=3464, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3672, selfPID=2784, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3672, selfPID=3672, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1800, selfPID=1800, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:52:25 (1216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:52:26 (1216): No heartbeat from core client for 30 sec - exiting 22:44:03 (1616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:44:05 (1616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:56:44 (540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:54:24 (4020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... forrtl: The requested operation cannot be performed on a file with a user-mapped section open. forrtl: severe (38): error during write, unit 6, file C:\Program Files\Climate Change Experiment\projects\bbc.cpdn.org\hadam3p_saf_7d24_2005_1_007573861\dataout\xaakg.out Image PC Routine Line Source hadrm3p_saf_um_6. 0073C52A Unknown Unknown Unknown hadrm3p_saf_um_6. 00739858 Unknown Unknown Unknown hadrm3p_saf_um_6. 006E4460 Unknown Unknown Unknown hadrm3p_saf_um_6. 006E362A Unknown Unknown Unknown hadrm3p_saf_um_6. 006B59F1 Unknown Unknown Unknown hadrm3p_saf_um_6. 006B28F5 Unknown Unknown Unknown hadrm3p_saf_um_6. 00662ACE Unknown Unknown Unknown hadrm3p_saf_um_6. 006635AF Unknown Unknown Unknown hadrm3p_saf_um_6. 00409860 Unknown Unknown Unknown hadrm3p_saf_um_6. 00720893 Unknown Unknown Unknown kernel32.dll 7C816FE7 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=732, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message><file_xfer_error> <file_name>hadam3p_saf_7d24_2005_1_007573861_0_12.zip</file_name> <error_code>-161</error_code> <error_message></error_message> </file_xfer_error> </message> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Feb 2012 21:00:16 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 126,837 | 609,291 | 4.8037 |
08 Feb 2012 23:05:19 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 126,816 | 608,255 | 4.7964 |
04 Feb 2012 20:31:25 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 115,296 | 554,091 | 4.8058 |
25 Jan 2012 18:51:05 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 103,776 | 499,634 | 4.8145 |
24 Jan 2012 13:38:38 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 92,256 | 446,038 | 4.8348 |
20 Jan 2012 23:08:10 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 80,736 | 388,447 | 4.8113 |
19 Jan 2012 09:41:43 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 69,216 | 331,035 | 4.7826 |
15 Jan 2012 15:21:48 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 57,696 | 276,842 | 4.7983 |
09 Jan 2012 14:49:52 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 46,176 | 222,261 | 4.8133 |
03 Jan 2012 23:58:52 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 34,656 | 167,957 | 4.8464 |
29 Dec 2011 21:31:13 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 23,136 | 112,745 | 4.8731 |
19 Dec 2011 23:20:29 | 1180029 | 13688822 | hadam3p_saf_7d24_2005_1_007573861_0 | 11,616 | 57,548 | 4.9542 |
©2024 cpdn.org