Name | hadam3p_saf_1ern_1994_1_006946971_0 |
Workunit | 7150287 |
Created | 22 Nov 2010, 16:23:26 UTC |
Sent | 9 Mar 2011, 7:16:37 UTC |
Report deadline | 19 Feb 2012, 12:36:37 UTC |
Received | 26 Apr 2011, 6:22:08 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | -1 (0xFFFFFFFF) Unknown error code |
Computer ID | 1114214 |
Run time | 7 days 5 hours 32 min 55 sec |
CPU time | 2 days 16 hours 47 min 38 sec |
Validate state | Invalid |
Credit | 1,683.45 |
Device peak FLOPS | 2.45 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=500, selfPID=500, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3228, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4396, selfPID=4088, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=2 Model crash detected, will try to restart... 08:17:04 (4472): No heartbeat from core client for 30 sec - exiting 08:17:05 (4472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5132, selfPID=5132, iMonCtr=2 15:27:37 (4216): No heartbeat from core client for 30 sec - exiting 15:27:38 (4216): No heartbeat from core client for 30 sec - exiting 15:27:39 (4216): No heartbeat from core client for 30 sec - exiting 15:27:40 (4216): No heartbeat from core client for 30 sec - exiting 15:27:41 (4216): No heartbeat from core client for 30 sec - exiting 15:27:42 (4216): No heartbeat from core client for 30 sec - exiting 15:27:43 (4216): No heartbeat from core client for 30 sec - exiting 15:27:44 (4216): No heartbeat from core client for 30 sec - exiting 15:27:46 (4216): No heartbeat from core client for 30 sec - exiting 15:27:47 (4216): No heartbeat from core client for 30 sec - exiting 15:27:48 (4216): No heartbeat from core client for 30 sec - exiting 15:27:49 (4216): No heartbeat from core client for 30 sec - exiting 15:27:50 (4216): No heartbeat from core client for 30 sec - exiting 15:27:51 (4216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3240, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:57:13 (3240): called boinc_finish 13:43:10 (4512): No heartbeat from core client for 30 sec - exiting 13:43:11 (4512): No heartbeat from core client for 30 sec - exiting 13:43:12 (4512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:43:14 (4512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6668, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2020, selfPID=4768, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6904, selfPID=6904, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=848, selfPID=848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=4744, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6632, selfPID=6632, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3068, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5072, selfPID=5072, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5028, selfPID=800, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=4152, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 06:35:09 (3000): No heartbeat from core client for 30 sec - exiting 06:35:10 (3000): No heartbeat from core client for 30 sec - exiting 06:35:11 (3000): No heartbeat from core client for 30 sec - exiting 06:35:12 (3000): No heartbeat from core client for 30 sec - exiting 06:35:13 (3000): No heartbeat from core client for 30 sec - exiting 06:35:15 (3000): No heartbeat from core client for 30 sec - exiting 06:35:16 (3000): No heartbeat from core client for 30 sec - exiting 06:35:17 (3000): No heartbeat from core client for 30 sec - exiting 06:35:18 (3000): No heartbeat from core client for 30 sec - exiting 06:35:19 (3000): No heartbeat from core client for 30 sec - exiting 06:35:20 (3000): No heartbeat from core client for 30 sec - exiting 06:35:21 (3000): No heartbeat from core client for 30 sec - exiting 06:35:22 (3000): No heartbeat from core client for 30 sec - exiting 06:35:23 (3000): No heartbeat from core client for 30 sec - exiting 06:35:24 (3000): No heartbeat from core client for 30 sec - exiting 06:35:25 (3000): No heartbeat from core client for 30 sec - exiting 06:35:27 (3000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 18:34:51 (3872): called boinc_finish Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=4528, iMonCtr=1 Model crash detected, will try to restart... 06:15:09 (4560): No heartbeat from core client for 30 sec - exiting 06:15:10 (4560): No heartbeat from core client for 30 sec - exiting 06:15:11 (4560): No heartbeat from core client for 30 sec - exiting 06:15:12 (4560): No heartbeat from core client for 30 sec - exiting 06:15:13 (4560): No heartbeat from core client for 30 sec - exiting 06:15:14 (4560): No heartbeat from core client for 30 sec - exiting 06:15:16 (4560): No heartbeat from core client for 30 sec - exiting 06:15:17 (4560): No heartbeat from core client for 30 sec - exiting 06:15:18 (4560): No heartbeat from core client for 30 sec - exiting 06:15:19 (4560): No heartbeat from core client for 30 sec - exiting 06:15:20 (4560): No heartbeat from core client for 30 sec - exiting 06:15:21 (4560): No heartbeat from core client for 30 sec - exiting 06:15:22 (4560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:04:16 (3744): No heartbeat from core client for 30 sec - exiting 13:04:18 (3744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:04:19 (3744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=5748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3872, selfPID=3872, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5836, selfPID=4856, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:48:08 (4552): No heartbeat from core client for 30 sec - exiting 07:48:09 (4552): No heartbeat from core client for 30 sec - exiting 07:48:10 (4552): No heartbeat from core client for 30 sec - exiting 07:48:12 (4552): No heartbeat from core client for 30 sec - exiting 07:48:13 (4552): No heartbeat from core client for 30 sec - exiting 07:48:14 (4552): No heartbeat from core client for 30 sec - exiting 07:48:15 (4552): No heartbeat from core client for 30 sec - exiting 07:48:16 (4552): No heartbeat from core client for 30 sec - exiting 07:48:17 (4552): No heartbeat from core client for 30 sec - exiting 07:48:18 (4552): No heartbeat from core client for 30 sec - exiting 07:48:19 (4552): No heartbeat from core client for 30 sec - exiting 07:48:20 (4552): No heartbeat from core client for 30 sec - exiting 07:48:21 (4552): No heartbeat from core client for 30 sec - exiting 07:48:22 (4552): No heartbeat from core client for 30 sec - exiting 07:48:24 (4552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6256, selfPID=6256, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:48:28 (4248): No heartbeat from core client for 30 sec - exiting 07:48:29 (4248): No heartbeat from core client for 30 sec - exiting 07:48:30 (4248): No heartbeat from core client for 30 sec - exiting 07:48:31 (4248): No heartbeat from core client for 30 sec - exiting 07:48:32 (4248): No heartbeat from core client for 30 sec - exiting 07:48:34 (4248): No heartbeat from core client for 30 sec - exiting 07:48:35 (4248): No heartbeat from core client for 30 sec - exiting 07:48:36 (4248): No heartbeat from core client for 30 sec - exiting 07:48:37 (4248): No heartbeat from core client for 30 sec - exiting 07:48:38 (4248): No heartbeat from core client for 30 sec - exiting 07:48:39 (4248): No heartbeat from core client for 30 sec - exiting 07:48:40 (4248): No heartbeat from core client for 30 sec - exiting 07:48:41 (4248): No heartbeat from core client for 30 sec - exiting 07:48:42 (4248): No heartbeat from core client for 30 sec - exiting 07:48:43 (4248): No heartbeat from core client for 30 sec - exiting 07:48:44 (4248): No heartbeat from core client for 30 sec - exiting 07:48:46 (4248): No heartbeat from core client for 30 sec - exiting 07:48:47 (4248): No heartbeat from core client for 30 sec - exiting 07:48:48 (4248): No heartbeat from core client for 30 sec - exiting 07:48:49 (4248): No heartbeat from core client for 30 sec - exiting 07:48:50 (4248): No heartbeat from core client for 30 sec - exiting 07:48:51 (4248): No heartbeat from core client for 30 sec - exiting 07:48:52 (4248): No heartbeat from core client for 30 sec - exiting 07:48:53 (4248): No heartbeat from core client for 30 sec - exiting 07:48:54 (4248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:57:02 (3136): No heartbeat from core client for 30 sec - exiting 09:57:03 (3136): No heartbeat from core client for 30 sec - exiting 09:57:04 (3136): No heartbeat from core client for 30 sec - exiting 09:57:05 (3136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3764, selfPID=2872, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Apr 2011 18:52:46 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 103,776 | 220,562 | 2.1254 |
21 Apr 2011 18:52:46 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 92,256 | 196,189 | 2.1266 |
01 Apr 2011 11:56:56 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 80,736 | 173,890 | 2.1538 |
20 Mar 2011 17:05:25 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 69,216 | 150,128 | 2.1690 |
17 Mar 2011 16:03:25 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 57,696 | 125,813 | 2.1806 |
16 Mar 2011 11:16:07 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 46,176 | 100,732 | 2.1815 |
15 Mar 2011 14:58:44 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 34,656 | 75,319 | 2.1733 |
14 Mar 2011 08:53:41 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 23,136 | 50,618 | 2.1878 |
12 Mar 2011 15:38:42 | 1114214 | 12227510 | hadam3p_saf_1ern_1994_1_006946971_0 | 11,616 | 25,916 | 2.2311 |
©2024 cpdn.org