Name | hadam3p_eu_2uh2_1971_1_007377423_2 |
Workunit | 7574853 |
Created | 31 Jul 2011, 2:30:44 UTC |
Sent | 31 Jul 2011, 2:31:55 UTC |
Report deadline | 12 Jul 2012, 7:51:55 UTC |
Received | 7 Feb 2012, 1:26:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1051942 |
Run time | 8 days 6 hours 9 min 2 sec |
CPU time | 6 days 17 hours 28 min 59 sec |
Validate state | Invalid |
Credit | 2,186.01 |
Device peak FLOPS | 1.93 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4068, selfPID=4068, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Restart file copy failed on atmos_restart.day CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3452, selfPID=3452, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6808, selfPID=6808, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5228, selfPID=5228, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5432, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5436, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2784, selfPID=6684, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5340, selfPID=5340, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2456, selfPID=2456, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3916, selfPID=2904, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=2764, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2944, selfPID=2944, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=756, selfPID=756, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4768, selfPID=4768, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6500, selfPID=6500, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5160, selfPID=5160, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5356, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=532, selfPID=532, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=4576, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=3580, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6212, selfPID=5320, iMonCtr=1 Model crash detected, will try to restart... 05:27:05 (5720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=980, selfPID=980, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2200, selfPID=2200, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:19:15 (4720): No heartbeat from core client for 30 sec - exiting 15:19:16 (4720): No heartbeat from core client for 30 sec - exiting 15:19:17 (4720): No heartbeat from core client for 30 sec - exiting 15:19:18 (4720): No heartbeat from core client for 30 sec - exiting 15:19:20 (4720): No heartbeat from core client for 30 sec - exiting 15:19:21 (4720): No heartbeat from core client for 30 sec - exiting 15:19:22 (4720): No heartbeat from core client for 30 sec - exiting 15:19:23 (4720): No heartbeat from core client for 30 sec - exiting 15:19:24 (4720): No heartbeat from core client for 30 sec - exiting 15:19:25 (4720): No heartbeat from core client for 30 sec - exiting 15:19:27 (4720): No heartbeat from core client for 30 sec - exiting 15:19:28 (4720): No heartbeat from core client for 30 sec - exiting 15:19:30 (4720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=6052, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4780, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... 19:03:52 (3988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
14:56:06 (3300): No heartbeat from core client for 30 sec - exiting 14:56:08 (3300): No heartbeat from core client for 30 sec - exiting 14:56:09 (3300): No heartbeat from core client for 30 sec - exiting 14:56:10 (3300): No heartbeat from core client for 30 sec - exiting 14:56:11 (3300): No heartbeat from core client for 30 sec - exiting 14:56:12 (3300): No heartbeat from core client for 30 sec - exiting 14:56:13 (3300): No heartbeat from core client for 30 sec - exiting 14:56:14 (3300): No heartbeat from core client for 30 sec - exiting 14:56:15 (3300): No heartbeat from core client for 30 sec - exiting 14:56:16 (3300): No heartbeat from core client for 30 sec - exiting 14:56:18 (3300): No heartbeat from core client for 30 sec - exiting 14:56:19 (3300): No heartbeat from core client for 30 sec - exiting 14:56:20 (3300): No heartbeat from core client for 30 sec - exiting 14:56:21 (3300): No heartbeat from core client for 30 sec - exiting 14:56:22 (3300): No heartbeat from core client for 30 sec - exiting 14:56:23 (3300): No heartbeat from core client for 30 sec - exiting 14:56:24 (3300): No heartbeat from core client for 30 sec - exiting 14:56:25 (3300): No heartbeat from core client for 30 sec - exiting 14:56:26 (3300): No heartbeat from core client for 30 sec - exiting 14:56:27 (3300): No heartbeat from core client for 30 sec - exiting 14:56:28 (3300): No heartbeat from core client for 30 sec - exiting 14:56:30 (3300): No heartbeat from core client for 30 sec - exiting 14:56:31 (3300): No heartbeat from core client for 30 sec - exiting 14:56:32 (3300): No heartbeat from core client for 30 sec - exiting 14:56:33 (3300): No heartbeat from core client for 30 sec - exiting 14:56:34 (3300): No heartbeat from core client for 30 sec - exiting 14:56:35 (3300): No heartbeat from core client for 30 sec - exiting 14:56:36 (3300): No heartbeat from core client for 30 sec - exiting 14:56:37 (3300): No heartbeat from core client for 30 sec - exiting 14:56:38 (3300): No 
heartbeat from core client for 30 sec - exiting 14:56:39 (3300): No heartbeat from core client for 30 sec - exiting 14:56:41 (3300): No heartbeat from core client for 30 sec - exiting 14:56:42 (3300): No heartbeat from core client for 30 sec - exiting 14:56:43 (3300): No heartbeat from core client for 30 sec - exiting 14:56:44 (3300): No heartbeat from core client for 30 sec - exiting 14:56:45 (3300): No heartbeat from core client for 30 sec - exiting 14:56:46 (3300): No heartbeat from core client for 30 sec - exiting 14:56:47 (3300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:56:48 (3300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5496, selfPID=5496, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6252, selfPID=6868, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5500, selfPID=5500, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=4252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6792, selfPID=6792, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3848, selfPID=3848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2328, selfPID=2328, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6032, selfPID=6032, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3504, selfPID=3504, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
15:56:33 (5512): No heartbeat from core client for 30 sec - exiting 15:56:35 (5512): No heartbeat from core client for 30 sec - exiting 15:56:36 (5512): No heartbeat from core client for 30 sec - exiting 15:57:11 (5512): No heartbeat from core client for 30 sec - exiting 15:57:12 (5512): No heartbeat from core client for 30 sec - exiting 15:57:13 (5512): No heartbeat from core client for 30 sec - exiting 15:57:14 (5512): No heartbeat from core client for 30 sec - exiting 15:57:16 (5512): No heartbeat from core client for 30 sec - exiting 15:57:17 (5512): No heartbeat from core client for 30 sec - exiting 15:57:18 (5512): No heartbeat from core client for 30 sec - exiting 15:57:19 (5512): No heartbeat from core client for 30 sec - exiting 15:57:20 (5512): No heartbeat from core client for 30 sec - exiting 15:57:21 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:22 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:35:54 (4704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:35:55 (4704): No heartbeat from core client for 30 sec - exiting 17:35:56 (4704): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=5988, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2168, selfPID=2168, iMonCtr=2 </stderr_txt> ]]> |
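
The exit status row above gives the same code twice: as a signed decimal (-226) and as hexadecimal (0xFFFFFF1E). The sketch below assumes the hexadecimal form is the 32-bit two's-complement bit pattern of the signed value and converts in both directions. The function names are illustrative and are not part of BOINC.

```python
def to_unsigned32(value: int) -> int:
    """Return the 32-bit two's-complement bit pattern of a signed integer."""
    return value & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Interpret a 32-bit unsigned bit pattern as a signed integer."""
    value &= 0xFFFFFFFF
    # If the sign bit is set, subtract 2**32 to recover the negative value.
    return value - 0x100000000 if value & 0x80000000 else value


exit_status = -226  # reported above as ERR_TOO_MANY_EXITS
print(f"0x{to_unsigned32(exit_status):08X}")  # 0xFFFFFF1E
print(to_signed32(0xFFFFFF1E))                # -226
```
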
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
03 Feb 2012 02:51:36 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 126,720 | 560,250 | 4.4212 |
24 Jan 2012 22:49:40 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 115,200 | 505,577 | 4.3887 |
09 Jan 2012 22:45:57 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 103,680 | 455,378 | 4.3921 |
18 Dec 2011 04:06:40 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 92,160 | 404,462 | 4.3887 |
14 Dec 2011 19:32:26 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 80,640 | 354,160 | 4.3919 |
03 Dec 2011 23:17:38 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 69,120 | 303,339 | 4.3886 |
24 Nov 2011 22:02:54 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 57,600 | 252,821 | 4.3893 |
31 Oct 2011 16:45:25 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 46,080 | 202,025 | 4.3842 |
10 Oct 2011 01:11:56 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 34,560 | 150,760 | 4.3623 |
25 Sep 2011 01:30:48 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 23,040 | 100,237 | 4.3506 |
09 Aug 2011 20:11:52 | 1051942 | 13175762 | hadam3p_eu_2uh2_1971_1_007377423_2 | 11,616 | 51,555 | 4.4383 |
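
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached at that trickle. This is an inference from the numbers, not something the page states. The sketch below checks that assumption against three rows copied from the table; the variable and function names are illustrative.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table
trickles = [
    (126_720, 560_250, 4.4212),
    (69_120, 303_339, 4.3886),
    (11_616, 51_555, 4.4383),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # The reported values carry four decimals, so allow half a unit in the last place.
    matches = abs(computed - reported) <= 5e-5
    print(f"{timestep:>7} {computed:.4f} {reported:.4f} {matches}")
```
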