Name | hadam3p_eu_2syg_1997_1_007401767_0 |
Workunit | 7599197 |
Created | 14 Aug 2011, 12:48:51 UTC |
Sent | 14 Aug 2011, 12:51:49 UTC |
Report deadline | 26 Jul 2012, 18:11:49 UTC |
Received | 5 Oct 2011, 19:23:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1163438 |
Run time | 2 days 15 hours 22 min 59 sec |
CPU time | 2 days 5 hours 33 min 57 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 3.45 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:28:56 (6052): No heartbeat from core client for 30 sec - exiting 14:28:57 (6052): No heartbeat from core client for 30 sec - exiting 14:28:58 (6052): No heartbeat from core client for 30 sec - exiting 14:28:59 (6052): No heartbeat from core client for 30 sec - exiting 14:29:00 (6052): No heartbeat from core client for 30 sec - exiting 14:29:01 (6052): No heartbeat from core client for 30 sec - exiting 14:29:02 (6052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7736, selfPID=7736, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 21:33:26 (4152): No heartbeat from core client for 30 sec - exiting 21:33:27 (4152): No heartbeat from core client for 30 sec - exiting 21:33:28 (4152): No heartbeat from core client for 30 sec - exiting 21:33:29 (4152): No heartbeat from core client for 30 sec - exiting 21:33:30 (4152): No heartbeat from core client for 30 sec - exiting 21:33:31 (4152): No heartbeat from core client for 30 sec - exiting 21:33:32 (4152): No heartbeat from core client for 30 sec - exiting 21:33:33 (4152): No heartbeat from core client for 30 sec - exiting 21:33:34 (4152): No heartbeat from core client for 30 sec - exiting 21:33:35 (4152): No heartbeat from core client for 30 sec - exiting 21:33:36 (4152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1324, selfPID=1324, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 20:02:22 (1036): No heartbeat from core client for 30 sec - exiting 20:02:23 (1036): No heartbeat from core client for 30 sec - exiting 20:02:24 (1036): No heartbeat from core client for 30 sec - exiting 20:02:25 (1036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:06:40 (8952): No heartbeat from core client for 30 sec - exiting 20:06:42 (8952): No heartbeat from core client for 30 sec - exiting 20:06:43 (8952): No heartbeat from core client for 30 sec - exiting 20:06:44 (8952): No heartbeat from core client for 30 sec - exiting 20:06:45 (8952): No heartbeat from core client for 30 sec - exiting 20:06:46 (8952): No heartbeat from core client for 30 sec - exiting 20:06:47 (8952): No heartbeat from core client for 30 sec - exiting 20:06:48 (8952): No heartbeat from core client for 30 sec - exiting 20:06:49 (8952): No heartbeat from core client for 30 sec - exiting 20:06:50 (8952): No heartbeat from core client for 30 sec - exiting 20:06:51 (8952): No heartbeat from core client for 30 sec - exiting 20:06:52 (8952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8096, selfPID=8096, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7468, selfPID=7468, iMonCtr=2 20:13:54 (6372): No heartbeat from core client for 30 sec - exiting 20:13:55 (6372): No heartbeat from core client for 30 sec - exiting 20:13:56 (6372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4736, selfPID=4736, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9488, selfPID=9488, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:23:00 (6968): No heartbeat from core client for 30 sec - exiting 19:23:01 (6968): No heartbeat from core client for 30 sec - exiting 19:23:02 (6968): No heartbeat from core client for 30 sec - exiting 19:23:03 (6968): No heartbeat from core client for 30 sec - exiting 19:23:04 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:04:35 (9036): No heartbeat from core client for 30 sec - exiting 20:04:36 (9036): No heartbeat from core client for 30 sec - exiting 20:04:37 (9036): No heartbeat from core client for 30 sec - exiting 20:04:38 (9036): No heartbeat from core client for 30 sec - exiting 20:04:39 (9036): No heartbeat from core client for 30 sec - exiting 20:04:40 (9036): No heartbeat from core client for 30 sec - exiting 20:04:41 (9036): No heartbeat from core client for 30 sec - exiting 20:04:42 (9036): No heartbeat from core client for 30 sec - exiting 20:04:43 (9036): No heartbeat from core client for 30 sec - exiting 20:04:44 (9036): No heartbeat from core client for 30 sec - exiting 20:04:45 (9036): No heartbeat from core client for 30 sec - exiting 20:04:46 (9036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1380, selfPID=1380, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2756, selfPID=2756, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:04:28 (3672): No heartbeat from core client for 30 sec - exiting 18:04:29 (3672): No heartbeat from core client for 30 sec - exiting 18:04:30 (3672): No heartbeat from core client for 30 sec - exiting 18:04:31 (3672): No heartbeat from core client for 30 sec - exiting 18:04:32 (3672): No heartbeat from core client for 30 sec - exiting 18:04:33 (3672): No heartbeat from core client for 30 sec - exiting 18:04:34 (3672): No heartbeat from core client for 30 sec - exiting 18:04:35 (3672): No heartbeat from core client for 30 sec - exiting 18:04:36 (3672): No heartbeat from core client for 30 sec - exiting 18:04:37 (3672): No heartbeat from core client for 30 sec - exiting 18:04:38 (3672): No heartbeat from core client for 30 sec - exiting 18:04:39 (3672): No heartbeat from core client for 30 sec - exiting 18:04:40 (3672): No heartbeat from core client for 30 sec - exiting 18:04:41 (3672): No heartbeat from core client for 30 sec - exiting 18:04:42 (3672): No heartbeat from core client for 30 sec - exiting 18:04:43 (3672): No heartbeat from core client for 30 sec - exiting 18:04:44 (3672): No heartbeat from core client for 30 sec - exiting 18:04:45 (3672): No heartbeat from core client for 30 sec - exiting 18:04:46 (3672): No heartbeat from core client for 30 sec - exiting 18:04:47 (3672): No heartbeat from core client for 30 sec - exiting 18:04:48 (3672): No heartbeat from core client for 30 sec - exiting 18:04:49 (3672): No heartbeat from core client for 30 sec - exiting 18:04:50 (3672): No heartbeat from core client for 30 sec - exiting 18:04:51 (3672): No heartbeat from core client for 30 sec - exiting 18:04:52 (3672): No heartbeat from core client for 30 sec - exiting 18:04:53 (3672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=792, selfPID=792, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8376, selfPID=8376, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7512, selfPID=7512, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:27:18 (9136): No heartbeat from core client for 30 sec - exiting 19:27:19 (9136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4880, selfPID=4880, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:03:09 (9468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6300, selfPID=6300, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 11:59:14 (7072): No heartbeat from core client for 30 sec - exiting 11:59:15 (7072): No heartbeat from core client for 30 sec - exiting 11:59:16 (7072): No heartbeat from core client for 30 sec - exiting 11:59:17 (7072): No heartbeat from core client for 30 sec - exiting 11:59:18 (7072): No heartbeat from core client for 30 sec - exiting 11:59:19 (7072): No heartbeat from core client for 30 sec - exiting 11:59:20 (7072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11344, selfPID=11344, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=5680, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8780, selfPID=8780, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:05:20 (9200): No heartbeat from core client for 30 sec - exiting 22:05:21 (9200): No heartbeat from core client for 30 sec - exiting 22:05:22 (9200): No heartbeat from core client for 30 sec - exiting 22:05:23 (9200): No heartbeat from core client for 30 sec - exiting 22:05:24 (9200): No heartbeat from core client for 30 sec - exiting 22:05:25 (9200): No heartbeat from core client for 30 sec - exiting 22:05:26 (9200): No heartbeat from core client for 30 sec - exiting 22:05:27 (9200): No heartbeat from core client for 30 sec - exiting 22:05:28 (9200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... zip error: Could not create output file (was replacing the original zip file) Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_2syg_1997_1_007401767_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2syg_1997_1_007401767_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Sep 2011 14:03:19 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 103,776 | 173,570 | 1.6725 |
25 Sep 2011 09:51:20 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 92,257 | 153,637 | 1.6653 |
23 Sep 2011 18:30:17 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 92,256 | 153,392 | 1.6627 |
17 Sep 2011 15:03:27 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 80,736 | 133,802 | 1.6573 |
14 Sep 2011 18:41:56 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 69,216 | 114,017 | 1.6473 |
10 Sep 2011 18:29:24 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 57,696 | 94,952 | 1.6457 |
06 Sep 2011 19:08:08 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 46,176 | 76,158 | 1.6493 |
03 Sep 2011 19:55:13 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 34,656 | 57,075 | 1.6469 |
31 Aug 2011 20:16:03 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 23,136 | 38,164 | 1.6496 |
28 Aug 2011 18:08:50 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 11,617 | 19,400 | 1.6700 |
28 Aug 2011 16:21:00 | 1163438 | 13246970 | hadam3p_eu_2syg_1997_1_007401767_0 | 11,616 | 19,168 | 1.6501 |
©2024 cpdn.org