Name | hadam3p_anz_k1g9_201212_12_306_010265026_0 |
Workunit | 10265026 |
Created | 25 Jan 2016, 12:45:29 UTC |
Sent | 25 Jan 2016, 17:35:47 UTC |
Report deadline | 6 Jan 2017, 22:55:47 UTC |
Received | 5 Mar 2016, 22:21:36 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1221356 |
Run time | 11 days 11 hours 37 min 56 sec |
CPU time | 9 days 18 hours 1 min 9 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 2.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6584, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3932, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2744, selfPID=2044, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7052, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6816, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7056, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2260, selfPID=4572, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6068, selfPID=6320, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6968, selfPID=6492, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5144, selfPID=6924, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6884, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7124, selfPID=6816, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3660, selfPID=3948, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4884, selfPID=3200, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5192, iMonCtr=2 Model crash detected, will try to restart... ColobaltrollerrkerCPDN pro proc iss is nounninnn, exiting,ing, bRetVal = 1, checkPID=0, 1, chID=6PI36,0iM snCtr=2 =2864, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4408, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6352, selfPID=5744, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4140, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=108, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7716, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7748, selfPID=1104, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5948, iMonCtr=2 18:51:38 (6524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6932, selfPID=6932, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6204, iMonCtr=2 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7480, selfPID=3356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=7804, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4692, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6612, selfPID=6960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3240, selfPID=7724, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6276, selfPID=1076, iMonCtr=1 Model crash detected, will try to restart... 
20:58:02 (7060): No heartbeat from core client for 30 sec - exiting 20:58:03 (7060): No heartbeat from core client for 30 sec - exiting 20:58:04 (7060): No heartbeat from core client for 30 sec - exiting 20:58:05 (7060): No heartbeat from core client for 30 sec - exiting 20:58:06 (7060): No heartbeat from core client for 30 sec - exiting 20:58:07 (7060): No heartbeat from core client for 30 sec - exiting 20:58:08 (7060): No heartbeat from core client for 30 sec - exiting 20:58:09 (7060): No heartbeat from core client for 30 sec - exiting 20:58:10 (7060): No heartbeat from core client for 30 sec - exiting 20:58:11 (7060): No heartbeat from core client for 30 sec - exiting 20:58:12 (7060): No heartbeat from core client for 30 sec - exiting 20:58:13 (7060): No heartbeat from core client for 30 sec - exiting 20:58:14 (7060): No heartbeat from core client for 30 sec - exiting 20:58:15 (7060): No heartbeat from core client for 30 sec - exiting 20:58:16 (7060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4604, selfPID=4604, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7564, selfPID=7148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5256, selfPID=5928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1952, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3268, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3244, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2268, selfPID=3044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5256, selfPID=6584, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
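
The stderr text above is long and repetitive, so a tally of its recurring messages can be easier to read than the raw log. The following is a minimal sketch, not a CPDN or BOINC tool: the function name `summarize_stderr` and the category labels are hypothetical, and the message fragments are copied from the log as it appears here. Lines garbled by two processes writing at once will not match and are simply not counted.

```python
from collections import Counter

# Message fragments copied verbatim from the stderr text above.
# The labels on the left are illustrative names chosen for this sketch.
FRAGMENTS = {
    "model_restart": "Model crash detected, will try to restart",
    "no_heartbeat": "No heartbeat from core client",
    "monitor_exit": "Leaving CPDN_Main::Monitor",
}


def summarize_stderr(text: str) -> Counter:
    """Count non-overlapping occurrences of each known fragment in the log text."""
    counts = Counter()
    for label, fragment in FRAGMENTS.items():
        counts[label] = text.count(fragment)
    return counts


if __name__ == "__main__":
    # A short excerpt assembled from lines in the log above.
    sample = (
        "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
        "checkPID=0, selfPID=6584, iMonCtr=2 "
        "Model crash detected, will try to restart... "
        "18:51:38 (6524): No heartbeat from core client for 30 sec - exiting "
        "Leaving CPDN_Main::Monitor... Called boinc_finish"
    )
    for label, n in summarize_stderr(sample).items():
        print(f"{label}: {n}")
```

Pasting the full stderr text into `sample` would give the totals for this task.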
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
05 Mar 2016 22:25:36 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 138,539 | 841,933 | 6.0772 |
02 Mar 2016 18:09:02 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 127,019 | 772,469 | 6.0815 |
28 Feb 2016 20:48:10 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 115,499 | 702,583 | 6.0830 |
26 Feb 2016 12:33:57 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 103,979 | 633,846 | 6.0959 |
23 Feb 2016 21:16:03 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 92,459 | 562,168 | 6.0802 |
21 Feb 2016 15:28:59 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 80,939 | 491,231 | 6.0692 |
16 Feb 2016 20:59:47 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 69,419 | 421,926 | 6.0780 |
14 Feb 2016 10:22:23 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 57,899 | 353,229 | 6.1008 |
10 Feb 2016 20:41:30 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 46,379 | 282,697 | 6.0954 |
05 Feb 2016 23:39:30 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 34,859 | 212,650 | 6.1003 |
31 Jan 2016 23:06:06 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 23,339 | 140,889 | 6.0366 |
29 Jan 2016 19:42:36 | 1221356 | 19210627 | hadam3p_anz_k1g9_201212_12_306_010265026_0 | 11,819 | 70,598 | 5.9733 |
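
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. The sketch below checks that reading against three rows of the table; the function name `seconds_per_timestep` is hypothetical, and the formula is inferred from the numbers, not taken from CPDN documentation.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by timesteps completed, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, average reported in the table) for three trickles.
ROWS = [
    (138_539, 841_933, 6.0772),
    (127_019, 772_469, 6.0815),
    (11_819, 70_598, 5.9733),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in ROWS:
        computed = seconds_per_timestep(cpu_time, timestep)
        print(f"timestep {timestep}: computed {computed}, table {reported}")
```

Consecutive trickles in the table are 11,520 timesteps apart, so the same function can be applied to any row.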
©2024 cpdn.org