Name | hadam3p_anz_na8m_2012_1_008600886_1 |
Workunit | 8747398 |
Created | 27 Mar 2014, 8:54:47 UTC |
Sent | 27 Mar 2014, 8:57:11 UTC |
Report deadline | 9 Mar 2015, 14:17:11 UTC |
Received | 29 May 2014, 5:54:23 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1241124 |
Run time | 23 days 16 hours 22 min 25 sec |
CPU time | 22 days 18 hours 5 min 6 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 1.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... 19:07:18 (6108): No heartbeat from core client for 30 sec - exiting 19:07:19 (6108): No heartbeat from core client for 30 sec - exiting 19:07:21 (6108): No heartbeat from core client for 30 sec - exiting 19:07:22 (6108): No heartbeat from core client for 30 sec - exiting 19:07:23 (6108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=808, selfPID=2956, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2684, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5636, selfPID=5668, iMonCtr=1 Model crash detected, will try to restart... 06:14:06 (4000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5980, selfPID=5128, iMonCtr=1 Model crash detected, will try to restart... 07:32:57 (6060): No heartbeat from core client for 30 sec - exiting 07:32:59 (6060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6040, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1380, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14188, selfPID=12460, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1344, selfPID=5732, iMonCtr=1 Model crash detected, will try to restart... 06:33:00 (5660): No heartbeat from core client for 30 sec - exiting 06:33:03 (5660): No heartbeat from core client for 30 sec - exiting 06:33:04 (5660): No heartbeat from core client for 30 sec - exiting 06:33:05 (5660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3112, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6512, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14356, selfPID=13592, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1048, selfPID=5956, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=31556, selfPID=31556, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31876, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7816, selfPID=5532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=1540, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=660, selfPID=5608, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4232, selfPID=4428, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5464, selfPID=5720, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=5204, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4336, selfPID=2448, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3536, selfPID=5548, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14668, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4636, selfPID=10948, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5580, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22188, selfPID=22064, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30396, selfPID=13364, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=652, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 08:50:12 (6024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3720, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=7660, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 May 2014 04:30:34 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 138,539 | 1,964,588 | 14.1808 |
25 May 2014 16:23:15 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 127,019 | 1,801,960 | 14.1865 |
21 May 2014 02:59:40 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 115,499 | 1,643,495 | 14.2295 |
18 May 2014 01:32:56 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 103,979 | 1,484,683 | 14.2787 |
14 May 2014 07:53:21 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 92,459 | 1,328,067 | 14.3638 |
11 May 2014 04:05:24 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 80,939 | 1,166,119 | 14.4074 |
07 May 2014 03:10:11 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 69,419 | 1,004,605 | 14.4716 |
04 May 2014 00:34:07 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 57,899 | 844,010 | 14.5773 |
27 Apr 2014 06:13:13 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 46,379 | 679,870 | 14.6590 |
20 Apr 2014 13:29:55 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 34,859 | 504,107 | 14.4613 |
07 Apr 2014 08:30:47 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 23,339 | 331,181 | 14.1900 |
31 Mar 2014 11:59:15 | 1241124 | 16421862 | hadam3p_anz_na8m_2012_1_008600886_1 | 11,819 | 169,391 | 14.3321 |
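
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below is a minimal check of that reading against three rows copied from the trickle table. The relationship is an assumption inferred from the numbers, not something the page states, and the helper name `sec_per_timestep` is hypothetical.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (138539, 1964588, 14.1808),
    (127019, 1801960, 14.1865),
    (11819, 169391, 14.3321),
]


def sec_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = sec_per_timestep(cpu_seconds, timestep)
    # Print the computed value next to the reported one for comparison.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For these three rows the computed values agree with the reported column to four decimal places, which is consistent with the assumed definition.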
©2024 cpdn.org