Name | hadam3p_eu_9va5_1979_1_008069992_0 |
Workunit | 8225106 |
Created | 20 Jul 2012, 0:26:50 UTC |
Sent | 21 Jul 2012, 11:03:04 UTC |
Report deadline | 3 Jul 2013, 16:23:04 UTC |
Received | 30 Jul 2012, 16:27:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1051974 |
Run time | 4 days 4 hours 21 min 11 sec |
CPU time | 3 days 22 hours 2 min 17 sec |
Validate state | Invalid |
Credit | 1,988.94 |
Device peak FLOPS | 2.58 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version>
<![CDATA[
<message> too many exit(0)s </message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6240, selfPID=7452, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6240, selfPID=6240, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8108, selfPID=3664, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8108, selfPID=8108, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6456, selfPID=1680, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6456, selfPID=6456, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9128, selfPID=8364, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9128, selfPID=9128, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8472, selfPID=8196, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8472, selfPID=8472, iMonCtr=1
02:43:31 (8812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:43:32 (8812): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7608, selfPID=7608, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=5728, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=5640, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6728, selfPID=4396, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6728, selfPID=6728, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5132, selfPID=5344, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5132, selfPID=5132, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=4052, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=4440, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=8112, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=8016, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7488, selfPID=3968, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7488, selfPID=7488, iMonCtr=1
06:11:09 (7828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:11:10 (7828): No heartbeat from core client for 30 sec - exiting
06:11:12 (7828): No heartbeat from core client for 30 sec - exiting
06:11:13 (7828): No heartbeat from core client for 30 sec - exiting
06:11:14 (7828): No heartbeat from core client for 30 sec - exiting
06:11:15 (7828): No heartbeat from core client for 30 sec - exiting
06:11:16 (7828): No heartbeat from core client for 30 sec - exiting
06:11:17 (7828): No heartbeat from core client for 30 sec - exiting
06:11:18 (7828): No heartbeat from core client for 30 sec - exiting
06:11:19 (7828): No heartbeat from core client for 30 sec - exiting
06:11:20 (7828): No heartbeat from core client for 30 sec - exiting
06:11:21 (7828): No heartbeat from core client for 30 sec - exiting
06:11:22 (7828): No heartbeat from core client for 30 sec - exiting
06:11:24 (7828): No heartbeat from core client for 30 sec - exiting
06:11:25 (7828): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=5212, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2340, selfPID=5356, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2340, selfPID=2340, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4068, selfPID=5660, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4068, selfPID=4068, iMonCtr=1
08:05:18 (4260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:05:19 (4260): No heartbeat from core client for 30 sec - exiting
08:05:20 (4260): No heartbeat from core client for 30 sec - exiting
08:05:21 (4260): No heartbeat from core client for 30 sec - exiting
08:05:22 (4260): No heartbeat from core client for 30 sec - exiting
08:05:23 (4260): No heartbeat from core client for 30 sec - exiting
08:05:24 (4260): No heartbeat from core client for 30 sec - exiting
08:05:25 (4260): No heartbeat from core client for 30 sec - exiting
08:05:26 (4260): No heartbeat from core client for 30 sec - exiting
08:05:27 (4260): No heartbeat from core client for 30 sec - exiting
08:05:29 (4260): No heartbeat from core client for 30 sec - exiting
08:05:30 (4260): No heartbeat from core client for 30 sec - exiting
08:05:31 (4260): No heartbeat from core client for 30 sec - exiting
08:05:32 (4260): No heartbeat from core client for 30 sec - exiting
08:05:33 (4260): No heartbeat from core client for 30 sec - exiting
08:05:34 (4260): No heartbeat from core client for 30 sec - exiting
08:05:35 (4260): No heartbeat from core client for 30 sec - exiting
08:05:36 (4260): No heartbeat from core client for 30 sec - exiting
08:05:37 (4260): No heartbeat from core client for 30 sec - exiting
08:05:38 (4260): No heartbeat from core client for 30 sec - exiting
08:05:39 (4260): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2972, selfPID=2972, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2972, selfPID=5428, iMonCtr=1
06:25:05 (7468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4464, selfPID=4464, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4464, selfPID=4268, iMonCtr=1
06:30:52 (10352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:30:54 (10352): No heartbeat from core client for 30 sec - exiting
06:30:55 (10352): No heartbeat from core client for 30 sec - exiting
06:30:56 (10352): No heartbeat from core client for 30 sec - exiting
06:30:57 (10352): No heartbeat from core client for 30 sec - exiting
06:30:58 (10352): No heartbeat from core client for 30 sec - exiting
06:30:59 (10352): No heartbeat from core client for 30 sec - exiting
06:31:00 (10352): No heartbeat from core client for 30 sec - exiting
06:31:01 (10352): No heartbeat from core client for 30 sec - exiting
06:31:02 (10352): No heartbeat from core client for 30 sec - exiting
06:31:03 (10352): No heartbeat from core client for 30 sec - exiting
06:31:04 (10352): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7372, selfPID=7372, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7372, selfPID=9252, iMonCtr=1
</stderr_txt>
]]> |
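
The Exit status row gives the same code in two forms: -226 as a signed integer and 0xFFFFFF1E as what appears to be its unsigned 32-bit (two's complement) representation. A minimal Python sketch of that conversion follows; the function names are illustrative and not part of BOINC or CPDN.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed exit code."""
    return value - (1 << 32) if value & 0x80000000 else value


# Exit status reported for this result (ERR_TOO_MANY_EXITS).
print(f"0x{to_unsigned32(-226):08X}")  # 0xFFFFFF1E
print(to_signed32(0xFFFFFF1E))         # -226
```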
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Jul 2012 01:05:19 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 115,296 | 327,607 | 2.8414 |
29 Jul 2012 15:55:46 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 103,776 | 295,484 | 2.8473 |
29 Jul 2012 06:44:01 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 92,256 | 261,948 | 2.8394 |
28 Jul 2012 21:16:47 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 80,736 | 229,256 | 2.8396 |
28 Jul 2012 11:54:40 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 69,216 | 196,741 | 2.8424 |
28 Jul 2012 01:52:55 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 57,696 | 164,316 | 2.8480 |
27 Jul 2012 02:13:54 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 46,176 | 131,540 | 2.8487 |
23 Jul 2012 00:46:02 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 34,656 | 98,479 | 2.8416 |
22 Jul 2012 11:02:20 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 23,136 | 65,931 | 2.8497 |
22 Jul 2012 00:45:22 | 1051974 | 14950053 | hadam3p_eu_9va5_1979_1_008069992_0 | 11,616 | 33,060 | 2.8461 |
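
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The Python sketch below assumes that relationship (it is inferred from the numbers, not stated on the page) and checks it against three rows copied from the table above; the function name is illustrative.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, listed_average)
TRICKLES = [
    (115_296, 327_607, 2.8414),
    (57_696, 164_316, 2.8480),
    (11_616, 33_060, 2.8461),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 places (assumed definition)."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, listed in TRICKLES:
    computed = average_sec_per_timestep(timestep, cpu_time_sec)
    # Print computed and listed values side by side for comparison.
    print(f"{timestep:>7} {computed:.4f} {listed:.4f}")
```

For these three rows the computed value matches the listed one to four decimal places.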
©2024 cpdn.org