Name | hadam3p_anz_m625_2012_1_009270145_0 |
Workunit | 9363061 |
Created | 1 Dec 2014, 16:32:05 UTC |
Sent | 1 Dec 2014, 18:57:48 UTC |
Report deadline | 14 Nov 2015, 0:17:48 UTC |
Received | 19 Dec 2014, 15:57:34 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1295575 |
Run time | 5 days 12 hours 48 min 8 sec |
CPU time | 5 days 11 hours 4 min 29 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 3.54 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6252, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8112, selfPID=4724, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6340, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6556, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7996, selfPID=5160, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6296, selfPID=5352, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 06:28:02 (5184): No heartbeat from core client for 30 sec - exiting 06:28:03 (5184): No heartbeat from core client for 30 sec - exiting 06:28:05 (5184): No heartbeat from core client for 30 sec - exiting 06:28:06 (5184): No heartbeat from core client for 30 sec - exiting 06:28:07 (5184): No heartbeat from core client for 30 sec - exiting 06:28:08 (5184): No heartbeat from core client for 30 sec - exiting 06:28:09 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6516, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=4652, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=4608, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6316, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6460, selfPID=4432, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5124, selfPID=5080, iMonCtr=1 Model crash detected, will try to restart... 19:18:39 (2688): No heartbeat from core client for 30 sec - exiting 19:18:40 (2688): No heartbeat from core client for 30 sec - exiting 19:18:41 (2688): No heartbeat from core client for 30 sec - exiting 19:18:42 (2688): No heartbeat from core client for 30 sec - exiting 19:18:43 (2688): No heartbeat from core client for 30 sec - exiting 19:18:44 (2688): No heartbeat from core client for 30 sec - exiting 19:18:46 (2688): No heartbeat from core client for 30 sec - exiting 19:18:47 (2688): No heartbeat from core client for 30 sec - exiting 19:18:48 (2688): No heartbeat from core client for 30 sec - exiting 19:18:49 (2688): No heartbeat from core client for 30 sec - exiting 19:18:50 (2688): No heartbeat from core client for 30 sec - exiting 19:18:51 (2688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6692, selfPID=4548, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7316, selfPID=4536, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6932, selfPID=4676, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7104, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7136, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6924, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6956, selfPID=5588, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6664, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7340, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7364, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6668, selfPID=5380, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5668, selfPID=5576, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... C14:38:18 (4836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6628, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... 14:35:08 (4956): No heartbeat from core client for 30 sec - exiting 14:35:09 (4956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7232, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7288, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... 12:49:47 (5000): No heartbeat from core client for 30 sec - exiting 12:49:48 (5000): No heartbeat from core client for 30 sec - exiting 12:49:49 (5000): No heartbeat from core client for 30 sec - exiting 12:49:50 (5000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
19 Dec 2014 14:41:48 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 138,539 | 471,540 | 3.4037 |
17 Dec 2014 16:07:56 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 127,019 | 432,161 | 3.4023 |
15 Dec 2014 19:46:15 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 115,499 | 392,472 | 3.3981 |
14 Dec 2014 06:09:17 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 103,979 | 352,897 | 3.3939 |
13 Dec 2014 11:18:29 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 92,459 | 313,252 | 3.3880 |
12 Dec 2014 17:36:11 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 80,939 | 274,061 | 3.3860 |
11 Dec 2014 15:53:35 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 69,419 | 234,175 | 3.3734 |
10 Dec 2014 13:53:19 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 57,899 | 194,621 | 3.3614 |
07 Dec 2014 13:58:58 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 46,379 | 155,338 | 3.3493 |
06 Dec 2014 14:02:36 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 34,859 | 116,723 | 3.3484 |
05 Dec 2014 17:52:20 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 23,339 | 78,365 | 3.3577 |
03 Dec 2014 17:40:34 | 1295575 | 17534634 | hadam3p_anz_m625_2012_1_009270145_0 | 11,819 | 39,648 | 3.3546 |
©2024 cpdn.org