Name | hadam3p_eu_8gna_2000_1_007750335_2 |
Workunit | 7905444 |
Created | 25 Feb 2012, 10:52:32 UTC |
Sent | 25 Feb 2012, 10:54:58 UTC |
Report deadline | 6 Feb 2013, 16:14:58 UTC |
Received | 23 Mar 2012, 13:03:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1184441 |
Run time | 4 days 3 hours 49 min |
CPU time | 3 days 10 hours 43 min 36 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 2.01 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2820, iMonCtr=2 Model crash detected, will try to restart... 11:16:42 (5360): No heartbeat from core client for 30 sec - exiting 11:16:43 (5360): No heartbeat from core client for 30 sec - exiting 11:16:45 (5360): No heartbeat from core client for 30 sec - exiting 11:16:46 (5360): No heartbeat from core client for 30 sec - exiting 11:16:47 (5360): No heartbeat from core client for 30 sec - exiting 11:16:48 (5360): No heartbeat from core client for 30 sec - exiting 11:16:49 (5360): No heartbeat from core client for 30 sec - exiting 11:16:50 (5360): No heartbeat from core client for 30 sec - exiting 11:16:51 (5360): No heartbeat from core client for 30 sec - exiting 11:16:52 (5360): No heartbeat from core client for 30 sec - exiting 11:16:53 (5360): No heartbeat from core client for 30 sec - exiting 11:16:54 (5360): No heartbeat from core client for 30 sec - exiting 11:16:56 (5360): No heartbeat from core client for 30 sec - exiting 11:16:57 (5360): No heartbeat from core client for 30 sec - exiting 11:16:58 (5360): No heartbeat from core client for 30 sec - exiting 11:16:59 (5360): No heartbeat from core client for 30 sec - exiting 11:17:00 (5360): No heartbeat from core client for 30 sec - exiting 11:17:01 (5360): No heartbeat from core client for 30 sec - exiting 11:17:02 (5360): No heartbeat from core client for 30 sec - exiting 11:17:03 (5360): No heartbeat from core client for 30 sec - exiting 11:17:04 (5360): No heartbeat from core client for 30 sec - exiting 11:17:06 (5360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:40:31 (4148): No heartbeat from core client for 30 sec - exiting 09:40:32 (4148): No heartbeat from core client for 30 sec - exiting 09:40:33 (4148): No heartbeat from core client for 30 sec - exiting 09:40:34 (4148): No heartbeat from core client for 30 sec - exiting 09:40:35 (4148): No heartbeat from core client for 30 sec - exiting 09:40:36 (4148): No heartbeat from core client for 30 sec - exiting 09:40:37 (4148): No heartbeat from core client for 30 sec - exiting 09:40:38 (4148): No heartbeat from core client for 30 sec - exiting 09:40:39 (4148): No heartbeat from core client for 30 sec - exiting 09:40:40 (4148): No heartbeat from core client for 30 sec - exiting 09:40:41 (4148): No heartbeat from core client for 30 sec - exiting 09:40:43 (4148): No heartbeat from core client for 30 sec - exiting 09:40:44 (4148): No heartbeat from core client for 30 sec - exiting 09:40:45 (4148): No heartbeat from core client for 30 sec - exiting 09:40:46 (4148): No heartbeat from core client for 30 sec - exiting 09:40:47 (4148): No heartbeat from core client for 30 sec - exiting 09:40:48 (4148): No heartbeat from core client for 30 sec - exiting 09:40:49 (4148): No heartbeat from core client for 30 sec - exiting 09:40:50 (4148): No heartbeat from core client for 30 sec - exiting 09:40:51 (4148): No heartbeat from core client for 30 sec - exiting 09:40:52 (4148): No heartbeat from core client for 30 sec - exiting 09:40:54 (4148): No heartbeat from core client for 30 sec - exiting 09:40:55 (4148): No heartbeat from core client for 30 sec - exiting 09:40:56 (4148): No heartbeat from core client for 30 sec - exiting 09:40:57 (4148): No heartbeat from core client for 30 sec - exiting 09:40:58 (4148): No heartbeat from core client for 30 sec - exiting 09:40:59 (4148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7080, selfPID=7080, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:45:58 (5040): No heartbeat from core client for 30 sec - exiting 22:45:59 (5040): No heartbeat from core client for 30 sec - exiting 22:46:00 (5040): No heartbeat from core client for 30 sec - exiting 22:46:01 (5040): No heartbeat from core client for 30 sec - exiting 22:46:02 (5040): No heartbeat from core client for 30 sec - exiting 22:46:03 (5040): No heartbeat from core client for 30 sec - exiting 22:46:04 (5040): No heartbeat from core client for 30 sec - exiting 22:46:06 (5040): No heartbeat from core client for 30 sec - exiting 22:46:07 (5040): No heartbeat from core client for 30 sec - exiting 22:46:08 (5040): No heartbeat from core client for 30 sec - exiting 22:46:09 (5040): No heartbeat from core client for 30 sec - exiting 22:46:10 (5040): No heartbeat from core client for 30 sec - exiting 22:46:11 (5040): No heartbeat from core client for 30 sec - exiting 22:46:12 (5040): No heartbeat from core client for 30 sec - exiting 22:46:13 (5040): No heartbeat from core client for 30 sec - exiting 22:46:14 (5040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6548, selfPID=6548, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8072, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=2 Model crash detected, will try to restart... 10:12:51 (3576): No heartbeat from core client for 30 sec - exiting 10:12:52 (3576): No heartbeat from core client for 30 sec - exiting 10:12:53 (3576): No heartbeat from core client for 30 sec - exiting 10:12:54 (3576): No heartbeat from core client for 30 sec - exiting 10:12:55 (3576): No heartbeat from core client for 30 sec - exiting 10:12:56 (3576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:12:57 (3576): No heartbeat from core client for 30 sec - exiting 10:12:59 (3576): No heartbeat from core client for 30 sec - exiting 10:13:00 (3576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2000, selfPID=5904, iMonCtr=1 Model crash detected, will try to restart... 16:54:55 (4976): No heartbeat from core client for 30 sec - exiting 16:54:56 (4976): No heartbeat from core client for 30 sec - exiting 16:54:57 (4976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:04:17 (5036): No heartbeat from core client for 30 sec - exiting 10:04:18 (5036): No heartbeat from core client for 30 sec - exiting 10:04:19 (5036): No heartbeat from core client for 30 sec - exiting 10:04:20 (5036): No heartbeat from core client for 30 sec - exiting 10:04:21 (5036): No heartbeat from core client for 30 sec - exiting 10:04:22 (5036): No heartbeat from core client for 30 sec - exiting 10:04:23 (5036): No heartbeat from core client for 30 sec - exiting 10:04:25 (5036): No heartbeat from core client for 30 sec - exiting 10:04:26 (5036): No heartbeat from core client for 30 sec - exiting 10:04:27 (5036): No heartbeat from core client for 30 sec - exiting 10:04:28 (5036): No heartbeat from core client for 30 sec - exiting 10:04:29 (5036): No heartbeat from core client for 30 sec - exiting 10:04:30 (5036): No heartbeat from core client for 30 sec - exiting 10:04:31 (5036): No heartbeat from core client for 30 sec - exiting 10:04:32 (5036): No heartbeat from core client for 30 sec - exiting 10:04:33 (5036): No heartbeat from core client for 30 sec - exiting 10:04:35 (5036): No heartbeat from core client for 30 sec - exiting 10:04:36 (5036): No heartbeat from core client for 30 sec - exiting 10:04:37 (5036): No heartbeat from core client for 30 sec - exiting 10:04:38 (5036): No heartbeat from core client for 30 sec - exiting 10:04:39 (5036): No heartbeat from core client for 30 sec - exiting 10:04:41 (5036): No heartbeat from core client for 30 sec - exiting 10:04:42 (5036): No heartbeat from core client for 30 sec - exiting 10:04:43 (5036): No heartbeat from core client for 30 sec - exiting 10:04:44 (5036): No heartbeat from core client for 30 sec - exiting 10:04:45 (5036): No heartbeat from core client for 30 sec - exiting 10:04:47 (5036): No heartbeat from core client for 30 sec - exiting 10:04:48 (5036): No heartbeat from core client for 30 sec - exiting 10:04:49 (5036): No heartbeat from core client for 30 sec - exiting 10:04:50 (5036): No heartbeat from core client for 30 sec - exiting 10:04:51 (5036): No heartbeat from core client for 30 sec - exiting 10:04:52 (5036): No heartbeat from core client for 30 sec - exiting 10:04:53 (5036): No heartbeat from core client for 30 sec - exiting 10:04:54 (5036): No heartbeat from core client for 30 sec - exiting 10:04:55 (5036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6760, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:35:44 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Mar 2012 12:15:09 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 103,776 | 274,167 | 2.6419 |
19 Mar 2012 17:11:57 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 92,256 | 244,190 | 2.6469 |
16 Mar 2012 15:22:32 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 80,736 | 214,283 | 2.6541 |
09 Mar 2012 00:14:12 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 69,216 | 183,691 | 2.6539 |
07 Mar 2012 17:36:07 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 57,703 | 152,964 | 2.6509 |
07 Mar 2012 16:33:50 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 57,696 | 152,560 | 2.6442 |
05 Mar 2012 18:01:22 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 46,176 | 122,184 | 2.6460 |
04 Mar 2012 12:37:41 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 34,656 | 91,757 | 2.6477 |
01 Mar 2012 13:52:50 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 23,138 | 60,879 | 2.6311 |
01 Mar 2012 12:50:49 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 23,136 | 60,507 | 2.6153 |
28 Feb 2012 14:34:56 | 1184441 | 14190947 | hadam3p_eu_8gna_2000_1_007750335_2 | 11,616 | 30,597 | 2.6340 |
©2024 cpdn.org