Name | hadam3p_eu_a3pc_1990_1_007872189_0 |
Workunit | 8027301 |
Created | 14 Apr 2012, 20:32:57 UTC |
Sent | 14 Apr 2012, 20:33:30 UTC |
Report deadline | 28 Mar 2013, 1:53:30 UTC |
Received | 27 Apr 2012, 18:45:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -2 (0xFFFFFFFE) Unknown error code |
Computer ID | 1051974 |
Run time | 4 days 14 hours 18 min 57 sec |
CPU time | 4 days 6 hours 2 min 21 sec |
Validate state | Invalid |
Credit | 2,187.67 |
Device peak FLOPS | 2.67 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> - exit code -2 (0xfffffffe) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2864, selfPID=2864, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2864, selfPID=9136, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6856, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... 13:17:39 (6596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:17:40 (6596): No heartbeat from core client for 30 sec - exiting 13:17:41 (6596): No heartbeat from core client for 30 sec - exiting 13:17:42 (6596): No heartbeat from core client for 30 sec - exiting 13:17:43 (6596): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5456, selfPID=5456, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9816, selfPID=7144, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9816, selfPID=9816, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10472, selfPID=11104, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10472, selfPID=10472, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1596, selfPID=1596, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1596, selfPID=4712, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=5916, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=4512, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9352, selfPID=9360, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9352, selfPID=9352, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8504, selfPID=6060, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8504, selfPID=8504, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6328, selfPID=7052, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6328, selfPID=6328, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8048, selfPID=8048, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8048, selfPID=8056, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7112, selfPID=2540, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7112, selfPID=7112, iMonCtr=1 14:49:54 (5616): No heartbeat from core client for 30 sec - exiting 14:49:55 (5616): No heartbeat from core client for 30 sec - exiting 14:49:56 (5616): No heartbeat from core client for 30 sec - exiting Could not launch model process. Last Error=5 Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Apr 2012 07:56:01 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 126,816 | 354,663 | 2.7967 |
26 Apr 2012 22:22:52 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 115,296 | 322,330 | 2.7957 |
26 Apr 2012 01:21:32 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 103,776 | 293,748 | 2.8306 |
23 Apr 2012 04:16:56 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 92,256 | 260,967 | 2.8287 |
22 Apr 2012 18:29:36 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 80,736 | 228,388 | 2.8288 |
21 Apr 2012 21:56:09 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 69,216 | 198,523 | 2.8682 |
21 Apr 2012 10:07:49 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 57,696 | 167,434 | 2.9020 |
21 Apr 2012 00:05:58 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 46,176 | 133,224 | 2.8851 |
16 Apr 2012 03:48:21 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 34,656 | 98,870 | 2.8529 |
15 Apr 2012 17:07:56 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 23,136 | 66,068 | 2.8556 |
15 Apr 2012 06:41:59 | 1051974 | 14403657 | hadam3p_eu_a3pc_1990_1_007872189_0 | 11,616 | 33,523 | 2.8859 |
©2024 cpdn.org