Name | hadam3p_eu_l3fy_2013_1_008815687_0 |
Workunit | 8961616 |
Created | 8 Jul 2014, 9:08:32 UTC |
Sent | 31 Jul 2014, 7:34:16 UTC |
Report deadline | 13 Jul 2015, 12:54:16 UTC |
Received | 14 Sep 2014, 16:25:33 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1326499 |
Run time | 10 days 19 hours 54 min 57 sec |
CPU time | 4 days 23 hours 1 min 37 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 2.57 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=2 Model crash detected, will try to restart... 18:19:58 (5072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:34:06 (7028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:34:07 (7028): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6132, selfPID=6240, iMonCtr=1 Model crash detected, will try to restart... 18:09:37 (5584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5388, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4568, selfPID=908, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3560, selfPID=5264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5784, selfPID=5064, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4340, selfPID=3804, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4620, selfPID=928, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5564, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2228, selfPID=4540, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=2924, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9192, selfPID=8608, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3500, selfPID=3588, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5564, selfPID=4820, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5496, selfPID=4308, iMonCtr=1 Model crash detected, will try to restart... 08:22:19 (4636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:22:20 (4636): No heartbeat from core client for 30 sec - exiting 08:22:21 (4636): No heartbeat from core client for 30 sec - exiting 08:22:22 (4636): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5952, selfPID=4396, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5780, selfPID=400, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1464, selfPID=4804, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:20:52 (4296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5480, selfPID=5480, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5004, selfPID=388, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6104, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4736, selfPID=4936, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5992, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5876, selfPID=5040, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=3312, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=1936, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3476, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=4904, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=5068, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5500, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5160, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=5040, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Sep 2014 16:33:00 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 138,336 | 427,848 | 3.0928 |
28 Aug 2014 05:51:21 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 126,831 | 382,746 | 3.0178 |
26 Aug 2014 06:58:54 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 126,816 | 382,125 | 3.0132 |
25 Aug 2014 12:18:53 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 115,296 | 347,205 | 3.0114 |
19 Aug 2014 20:47:38 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 103,776 | 311,917 | 3.0057 |
19 Aug 2014 09:36:39 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 92,256 | 276,097 | 2.9927 |
18 Aug 2014 08:24:52 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 80,736 | 240,563 | 2.9796 |
16 Aug 2014 18:42:10 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 69,216 | 205,874 | 2.9744 |
15 Aug 2014 17:53:24 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 57,696 | 172,008 | 2.9813 |
15 Aug 2014 07:54:59 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 46,176 | 138,092 | 2.9906 |
14 Aug 2014 15:27:36 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 34,656 | 103,723 | 2.9929 |
01 Sep 2014 10:51:53 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 23,147 | 72,647 | 3.1385 |
02 Aug 2014 05:05:22 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 23,136 | 69,462 | 3.0023 |
31 Aug 2014 08:36:49 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 11,625 | 36,847 | 3.1696 |
01 Aug 2014 07:12:18 | 1326499 | 16734295 | hadam3p_eu_l3fy_2013_1_008815687_0 | 11,616 | 34,711 | 2.9882 |
©2024 cpdn.org