Name | hadam3p_anz_f20l_2012_1_009265148_1 |
Workunit | 9358064 |
Created | 6 Dec 2014, 8:48:11 UTC |
Sent | 6 Dec 2014, 8:52:19 UTC |
Report deadline | 18 Nov 2015, 14:12:19 UTC |
Received | 11 Feb 2015, 14:33:07 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1347325 |
Run time | 16 days 13 hours 55 min 53 sec |
CPU time | 14 days 23 hours 9 min 41 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 2.02 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 10:54:59 (1596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:55:00 (1596): No heartbeat from core client for 30 sec - exiting 10:55:02 (1596): No heartbeat from core client for 30 sec - exiting 11:05:44 (2796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:05:47 (2796): No heartbeat from core client for 30 sec - exiting 11:05:48 (2796): No heartbeat from core client for 30 sec - exiting 11:05:49 (2796): No heartbeat from core client for 30 sec - exiting 11:05:50 (2796): No heartbeat from core client for 30 sec - exiting 11:05:51 (2796): No heartbeat from core client for 30 sec - exiting 11:27:19 (1476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:27:20 (1476): No heartbeat from core client for 30 sec - exiting 11:48:41 (1544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:48:42 (1544): No heartbeat from core client for 30 sec - exiting 11:48:43 (1544): No heartbeat from core client for 30 sec - exiting Glonobal Wller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1712, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... GController:: CPDN procesl is not ronnal Workering, exiting, bRetVal = 1, checkPID=0, selfPID=3184, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=420, iMonCtr=2 GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=2 Model crash detected, will try to restart... lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=2 Model crash detected, will try to restart... CGntrollerbal CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3924, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2588, selfPID=2588, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1336, iMonCtr=2 Model crash detected, will try to restart... Global WorkerCPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1992, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4080, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3116, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1172, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:51:12 (3756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:14 (3756): No heartbeat from core client for 30 sec - exiting 08:37:56 (2676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:58 (2676): No heartbeat from core client for 30 sec - exiting 08:38:00 (2676): No heartbeat from core client for 30 sec - exiting 08:38:01 (2676): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3804, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:58:25 (3484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:58:26 (3484): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=848, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=396, selfPID=396, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2748, selfPID=2748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3088, selfPID=3088, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2156, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2108, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global WorkerController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=2 Model crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2276, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2332, selfPID=2668, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2544, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:27:41 (3692): No heartbeat from core client for 30 sec - exiting 07:27:42 (3692): No heartbeat from core client for 30 sec - exiting 07:27:43 (3692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1008, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1804, selfPID=3880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4052, selfPID=2516, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:31:41 (3776): No heartbeat from core client for 30 sec - exiting 07:31:42 (3776): No heartbeat from core client for 30 sec - exiting 07:31:43 (3776): No heartbeat from core client for 30 sec - exiting 07:31:44 (3776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3236, selfPID=2928, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3900, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:42:43 (3536): No heartbeat from core client for 30 sec - exiting 07:42:44 (3536): No heartbeat from core client for 30 sec - exiting 07:42:45 (3536): No heartbeat from core client for 30 sec - exiting 07:42:46 (3536): No heartbeat from core client for 30 sec - exiting 07:42:48 (3536): No heartbeat from core client for 30 sec - exiting 07:42:49 (3536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:21:49 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:21:50 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=3180, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=264, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:48:39 (4712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2748, selfPID=2748, iMonCtr=2 17:48:47 (4712): No heartbeat from core client for 30 sec - exiting 17:48:48 (4712): No heartbeat from core client for 30 sec - exiting 17:48:49 (4712): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4100, selfPID=3876, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2212, selfPID=3504, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=936, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=472, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3816, iMonCtr=2 Model crash detected, will try to restart... 07:36:11 (2608): No heartbeat from core client for 30 sec - exiting 07:36:12 (2608): No heartbeat from core client for 30 sec - exiting 07:36:13 (2608): No heartbeat from core client for 30 sec - exiting 07:36:15 (2608): No heartbeat from core client for 30 sec - exiting 07:36:16 (2608): No heartbeat from core client for 30 sec - exiting 07:36:17 (2608): No heartbeat from core client for 30 sec - exiting 07:36:19 (2608): No heartbeat from core client for 30 sec - exiting 07:36:20 (2608): No heartbeat from core client for 30 sec - exiting 07:36:21 (2608): No heartbeat from core client for 30 sec - exiting 07:36:22 (2608): No heartbeat from core client for 30 sec - exiting 07:36:23 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:36:24 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Feb 2015 12:52:34 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 138,539 | 1,292,010 | 9.3260 |
04 Feb 2015 11:35:49 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 127,019 | 1,185,373 | 9.3322 |
30 Jan 2015 11:39:39 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 115,499 | 1,083,319 | 9.3795 |
26 Jan 2015 09:27:38 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 103,979 | 975,081 | 9.3777 |
22 Jan 2015 07:33:06 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 92,459 | 869,505 | 9.4042 |
09 Jan 2015 06:27:55 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 80,939 | 766,132 | 9.4655 |
05 Jan 2015 08:02:38 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 69,419 | 662,555 | 9.5443 |
29 Dec 2014 14:09:57 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 57,899 | 547,887 | 9.4628 |
24 Dec 2014 12:52:26 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 46,379 | 436,159 | 9.4042 |
19 Dec 2014 14:51:49 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 34,859 | 333,815 | 9.5761 |
15 Dec 2014 16:05:43 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 23,339 | 214,989 | 9.2116 |
10 Dec 2014 06:19:23 | 1347325 | 17554113 | hadam3p_anz_f20l_2012_1_009265148_1 | 11,819 | 107,912 | 9.1304 |
©2024 cpdn.org