Name | hadam3p_pnw_z9sx_1992_1_006943225_1 |
Workunit | 7146541 |
Created | 21 Mar 2011, 14:09:13 UTC |
Sent | 21 Mar 2011, 17:52:48 UTC |
Report deadline | 2 Mar 2012, 23:12:48 UTC |
Received | 1 Apr 2011, 22:05:30 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1086163 |
Run time | |
CPU time | 2 days 4 hours 21 min 25 sec |
Validate state | Invalid |
Credit | 1,003.35 |
Device peak FLOPS | 2.81 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 05:58:18 (6604): No heartbeat from core client for 30 sec - exiting 05:58:19 (6604): No heartbeat from core client for 30 sec - exiting 05:58:20 (6604): No heartbeat from core client for 30 sec - exiting 05:58:21 (6604): No heartbeat from core client for 30 sec - exiting 05:58:22 (6604): No heartbeat from core client for 30 sec - exiting 05:58:23 (6604): No heartbeat from core client for 30 sec - exiting 05:58:24 (6604): No heartbeat from core client for 30 sec - exiting 05:58:25 (6604): No heartbeat from core client for 30 sec - exiting 05:58:26 (6604): No heartbeat from core client for 30 sec - exiting 05:58:27 (6604): No heartbeat from core client for 30 sec - exiting 05:58:28 (6604): No heartbeat from core client for 30 sec - exiting 05:58:29 (6604): No heartbeat from core client for 30 sec - exiting 05:58:30 (6604): No heartbeat from core client for 30 sec - exiting 05:59:02 (6604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:21:33 (6692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:21:35 (6692): No heartbeat from core client for 30 sec - exiting 09:44:07 (3364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:44:08 (3364): No heartbeat from core client for 30 sec - exiting 09:44:09 (3364): No heartbeat from core client for 30 sec - exiting 09:57:35 (16492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:32 (13848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:22:33 (13848): No heartbeat from core client for 30 sec - exiting 10:45:36 (16820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:45:38 (16820): No heartbeat from core client for 30 sec - exiting 11:36:08 (17788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:46:46 (22120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:01:58 (10356): No heartbeat from core client for 30 sec - exiting 02:01:59 (10356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:03 (4352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:05 (4352): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 
14:35:19 (16880): No heartbeat from core client for 30 sec - exiting 14:35:20 (16880): No heartbeat from core client for 30 sec - exiting 14:35:21 (16880): No heartbeat from core client for 30 sec - exiting 14:35:22 (16880): No heartbeat from core client for 30 sec - exiting 14:35:23 (16880): No heartbeat from core client for 30 sec - exiting 14:35:24 (16880): No heartbeat from core client for 30 sec - exiting 14:35:25 (16880): No heartbeat from core client for 30 sec - exiting 14:35:26 (16880): No heartbeat from core client for 30 sec - exiting 14:35:27 (16880): No heartbeat from core client for 30 sec - exiting 14:35:28 (16880): No heartbeat from core client for 30 sec - exiting 14:35:29 (16880): No heartbeat from core client for 30 sec - exiting 14:35:30 (16880): No heartbeat from core client for 30 sec - exiting 14:35:31 (16880): No heartbeat from core client for 30 sec - exiting 14:35:32 (16880): No heartbeat from core client for 30 sec - exiting 14:35:33 (16880): No heartbeat from core client for 30 sec - exiting 14:35:34 (16880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17712, selfPID=17712, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 01:42:50 (13216): No heartbeat from core client for 30 sec - exiting 01:42:51 (13216): No heartbeat from core client for 30 sec - exiting 01:42:52 (13216): No heartbeat from core client for 30 sec - exiting 01:42:53 (13216): No heartbeat from core client for 30 sec - exiting 01:42:55 (13216): No heartbeat from core client for 30 sec - exiting 01:42:56 (13216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:06:42 (12964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13276, selfPID=13276, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32604, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=31932, selfPID=32956, iMonCtr=1 Model crash detected, will try to restart... 06:56:39 (14592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16212, selfPID=16212, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:03:59 (17216): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 19:04:02 (17216): No heartbeat from core client for 30 sec - exiting 19:04:03 (17216): No heartbeat from core client for 30 sec - exiting 19:04:04 (17216): No heartbeat from core client for 30 sec - exiting 19:04:05 (17216): No heartbeat from core client for 30 sec - exiting 19:04:06 (17216): No heartbeat from core client for 30 sec - exiting 19:04:07 (17216): No heartbeat from core client for 30 sec - exiting 19:04:08 (17216): No heartbeat from core client for 30 sec - exiting 19:04:09 (17216): No heartbeat from core client for 30 sec - exiting 02:09:23 (95984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:09:24 (95984): No heartbeat from core client for 30 sec - exiting 07:34:57 (12216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:34:58 (12216): No heartbeat from core client for 30 sec - exiting 07:34:59 (12216): No heartbeat from core client for 30 sec - exiting 07:35:00 (12216): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 11:51:39 (63624): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 11:51:42 (63624): No heartbeat from core client for 30 sec - exiting 11:51:43 (63624): No heartbeat from core client for 30 sec - exiting 11:51:44 (63624): No heartbeat from core client for 30 sec - exiting 11:51:45 (63624): No heartbeat from core client for 30 sec - exiting 11:51:46 (63624): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:36:54 (13952): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 17:36:58 (13952): No heartbeat from core client for 30 sec - exiting 17:36:59 (13952): No heartbeat from core client for 30 sec - exiting 17:37:00 (13952): No heartbeat from core client for 30 sec - exiting 17:37:01 (13952): No heartbeat from core client for 30 sec - exiting 17:37:02 (13952): No heartbeat from core client for 30 sec - exiting </stderr_txt> ]]> |
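
The exit status above is shown both as a signed decimal (-226) and as a 32-bit hexadecimal value (0xFFFFFF1E). As a minimal sketch of how the two forms relate, assuming the hex value is a plain 32-bit two's-complement encoding, the conversion can be checked like this (the helper name `to_signed32` is hypothetical, not part of BOINC):

```python
def to_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a two's-complement signed integer."""
    value &= 0xFFFFFFFF  # keep only the low 32 bits
    # If the sign bit is set, subtract 2**32 to get the negative value.
    return value - 0x100000000 if value & 0x80000000 else value


# Hex form of the exit status reported for this task.
exit_status = to_signed32(0xFFFFFF1E)
print(exit_status)
```

The same helper leaves small positive codes unchanged, so it can be applied to any exit status reported in hex.
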
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 Mar 2011 23:57:41 | 1086163 | 12681173 | hadam3p_pnw_z9sx_1992_1_006943225_1 | 46,176 | 158,367 | 3.4296 |
28 Mar 2011 11:47:42 | 1086163 | 12681173 | hadam3p_pnw_z9sx_1992_1_006943225_1 | 34,656 | 119,583 | 3.4506 |
26 Mar 2011 21:53:28 | 1086163 | 12681173 | hadam3p_pnw_z9sx_1992_1_006943225_1 | 23,136 | 78,906 | 3.4105 |
25 Mar 2011 05:25:09 | 1086163 | 12681173 | hadam3p_pnw_z9sx_1992_1_006943225_1 | 11,616 | 39,419 | 3.3935 |
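
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The sketch below recomputes it from the four trickle rows above under that assumption; the function and variable names are illustrative only.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
trickles = [
    (46176, 158367, 3.4296),
    (34656, 119583, 3.4506),
    (23136, 78906, 3.4105),
    (11616, 39419, 3.3935),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(timestep, computed, reported, computed == reported)

# Timestep gaps between consecutive trickles (newest first in the table).
gaps = [a[0] - b[0] for a, b in zip(trickles, trickles[1:])]
print(gaps)
```

Under this assumption the recomputed values match the reported column, and the rows are spaced evenly in timesteps.
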
©2024 climateprediction.net