Name | hadam3p_pnw_zs6b_1994_1_007015035_0 |
Workunit | 7218351 |
Created | 24 Nov 2010, 14:54:24 UTC |
Sent | 15 Jan 2011, 20:34:57 UTC |
Report deadline | 29 Dec 2011, 1:54:57 UTC |
Received | 3 Feb 2011, 17:26:09 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1117997 |
Run time | 4 days 1 hours 31 min 57 sec |
CPU time | 3 days 13 hours 4 min 9 sec |
Validate state | Invalid |
Credit | 2,755.56 |
Device peak FLOPS | 3.08 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6004, selfPID=5544, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 09:39:02 (5736): No heartbeat from core client for 30 sec - exiting 09:39:03 (5736): No heartbeat from core client for 30 sec - exiting 09:39:04 (5736): No heartbeat from core client for 30 sec - exiting 09:39:05 (5736): No heartbeat from core client for 30 sec - exiting 09:39:06 (5736): No heartbeat from core client for 30 sec - exiting 09:39:07 (5736): No heartbeat from core client for 30 sec - exiting 09:39:08 (5736): No heartbeat from core client for 30 sec - exiting 09:39:09 (5736): No heartbeat from core client for 30 sec - exiting 09:39:10 (5736): No heartbeat from core client for 30 sec - exiting 09:39:11 (5736): No heartbeat from core client for 30 sec - exiting 09:39:12 (5736): No heartbeat from core client for 30 sec - exiting 09:39:13 (5736): No heartbeat from core client for 30 sec - exiting 09:39:14 (5736): No heartbeat from core client for 30 sec - exiting 09:39:15 (5736): No heartbeat from core client for 30 sec - exiting 09:39:16 (5736): No heartbeat from core client for 30 sec - exiting 09:39:17 (5736): No heartbeat from core client for 30 sec - exiting 09:39:18 (5736): No heartbeat from core client for 30 sec - exiting 09:39:19 (5736): No heartbeat from core client for 30 sec - exiting 09:39:20 (5736): No heartbeat from core client for 30 sec - exiting 09:39:21 (5736): No heartbeat from core client for 30 sec - exiting 09:39:22 (5736): No heartbeat from core client for 30 sec - exiting 09:39:23 (5736): No heartbeat from core client for 30 sec - exiting 09:39:24 (5736): No heartbeat from core client for 30 sec - exiting 09:39:25 (5736): No heartbeat from core client for 30 sec - exiting 09:39:26 (5736): No heartbeat from core client for 30 sec - exiting 09:39:27 (5736): No heartbeat from core client for 30 sec - exiting 09:39:28 (5736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:39:29 (5736): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3472, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=2 09:32:59 (5948): No heartbeat from core client for 30 sec - exiting 09:33:00 (5948): No heartbeat from core client for 30 sec - exiting 09:33:01 (5948): No heartbeat from core client for 30 sec - exiting 09:33:02 (5948): No heartbeat from core client for 30 sec - exiting 09:33:03 (5948): No heartbeat from core client for 30 sec - exiting 09:33:04 (5948): No heartbeat from core client for 30 sec - exiting 09:33:05 (5948): No heartbeat from core client for 30 sec - exiting 09:33:06 (5948): No heartbeat from core client for 30 sec - exiting 09:33:07 (5948): No heartbeat from core client for 30 sec - exiting 09:33:08 (5948): No heartbeat from core client for 30 sec - exiting 09:33:09 (5948): No heartbeat from core client for 30 sec - exiting 09:33:10 (5948): No heartbeat from core client for 30 sec - exiting 09:33:11 (5948): No heartbeat from core client for 30 sec - exiting 09:33:12 (5948): No heartbeat from core client for 30 sec - exiting 09:33:13 (5948): No heartbeat from core client for 30 sec - exiting 09:33:14 (5948): No heartbeat from core client for 30 sec - exiting 09:33:15 (5948): No heartbeat from core client for 30 sec - exiting 09:33:16 (5948): No heartbeat from core client for 30 sec - exiting 09:33:17 (5948): No heartbeat from core client for 30 sec - exiting 09:33:18 (5948): No heartbeat from core client for 30 sec - exiting 09:33:19 (5948): No heartbeat from core client for 30 sec - exiting 09:33:20 (5948): No heartbeat from core client for 30 sec - exiting 09:33:21 (5948): No heartbeat from core client for 30 sec - exiting 09:33:22 (5948): No heartbeat from core client for 30 sec - exiting 09:33:23 (5948): No heartbeat from core client for 30 sec - exiting 09:33:24 (5948): No heartbeat from core client for 30 sec - exiting 09:33:25 (5948): No heartbeat from core client for 30 sec - exiting 09:33:26 (5948): No heartbeat from core client for 30 sec - exiting 09:33:27 (5948): No heartbeat from core client for 30 sec - exiting 09:33:28 (5948): No heartbeat from core client for 30 sec - exiting 09:33:29 (5948): No heartbeat from core client for 30 sec - exiting 09:33:30 (5948): No heartbeat from core client for 30 sec - exiting 09:33:31 (5948): No heartbeat from core client for 30 sec - exiting 09:33:32 (5948): No heartbeat from core client for 30 sec - exiting 09:33:33 (5948): No heartbeat from core client for 30 sec - exiting 09:33:34 (5948): No heartbeat from core client for 30 sec - exiting 09:33:35 (5948): No heartbeat from core client for 30 sec - exiting 09:33:36 (5948): No heartbeat from core client for 30 sec - exiting 09:33:37 (5948): No heartbeat from core client for 30 sec - exiting 09:33:38 (5948): No heartbeat from core client for 30 sec - exiting 09:33:39 (5948): No heartbeat from core client for 30 sec - exiting 09:33:40 (5948): No heartbeat from core client for 30 sec - exiting 09:33:41 (5948): No heartbeat from core client for 30 sec - exiting 09:33:42 (5948): No heartbeat from core client for 30 sec - exiting 09:33:43 (5948): No heartbeat from core client for 30 sec - exiting 09:33:44 (5948): No heartbeat from core client for 30 sec - exiting 09:33:45 (5948): No heartbeat from core client for 30 sec - exiting 09:33:46 (5948): No heartbeat from core client for 30 sec - exiting 09:33:47 (5948): No heartbeat from core client for 30 sec - exiting 09:33:48 (5948): No heartbeat from core client for 30 sec - exiting 09:33:49 (5948): No heartbeat from core client for 30 sec - exiting 09:33:50 (5948): No heartbeat from core client for 30 sec - exiting 09:33:51 (5948): No heartbeat from core client for 30 sec - exiting 09:33:52 (5948): No heartbeat from core client for 30 sec - exiting 09:33:53 (5948): No heartbeat from core client for 30 sec - exiting 09:33:54 (5948): No heartbeat from core client for 30 sec - exiting 09:33:55 (5948): No heartbeat from core client for 30 sec - exiting 09:33:56 (5948): No heartbeat from core client for 30 sec - exiting 09:33:57 (5948): No heartbeat from core client for 30 sec - exiting 09:33:58 (5948): No heartbeat from core client for 30 sec - exiting 09:33:59 (5948): No heartbeat from core client for 30 sec - exiting 09:34:00 (5948): No heartbeat from core client for 30 sec - exiting 09:34:01 (5948): No heartbeat from core client for 30 sec - exiting 09:34:03 (5948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2364, selfPID=5592, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 8 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5448, iMonCtr=1 Model crash detected, will try to restart... 17:35:11 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=360, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=2 17:11:26 (5564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3624, selfPID=3624, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4260, selfPID=5128, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1456, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 cpdnmonitor: cannot open input file C:\ProgramData/projects/climateprediction.net/hadam3p_pnw_zs6b_1994_1_007015035/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData/projects/climateprediction.net/hadam3p_pnw_zs6b_1994_1_007015035/dataout/region_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 17:24:50 (5648): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_zs6b_1994_1_007015035_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Feb 2011 18:47:41 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 126,816 | 287,333 | 2.2657 |
31 Jan 2011 23:35:41 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 115,296 | 260,303 | 2.2577 |
24 Jan 2011 17:08:48 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 103,776 | 233,977 | 2.2546 |
24 Jan 2011 09:29:50 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 92,264 | 208,409 | 2.2588 |
24 Jan 2011 00:04:30 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 92,256 | 208,098 | 2.2557 |
23 Jan 2011 16:16:18 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 80,736 | 182,269 | 2.2576 |
22 Jan 2011 23:13:34 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 69,216 | 156,515 | 2.2613 |
22 Jan 2011 15:07:57 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 57,696 | 130,255 | 2.2576 |
18 Jan 2011 17:03:23 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 46,176 | 104,122 | 2.2549 |
17 Jan 2011 22:36:24 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 34,656 | 78,375 | 2.2615 |
17 Jan 2011 14:50:41 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 23,136 | 52,821 | 2.2831 |
16 Jan 2011 21:28:53 | 1117997 | 12300233 | hadam3p_pnw_zs6b_1994_1_007015035_0 | 11,616 | 27,202 | 2.3418 |
©2024 cpdn.org