Name | hadam3p_anz_d989_2013_1_009726457_0 |
Workunit | 9799754 |
Created | 8 Apr 2015, 18:41:34 UTC |
Sent | 9 Apr 2015, 21:21:08 UTC |
Report deadline | 22 Mar 2016, 2:41:08 UTC |
Received | 21 May 2015, 7:45:46 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1277229 |
Run time | 7 days 14 hours 37 min 37 sec |
CPU time | 7 days 13 hours 39 min 43 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 2.91 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3296, selfPID=4672, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5344, selfPID=3432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5980, selfPID=4544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4092, selfPID=4904, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3240, selfPID=4800, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5656, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5972, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4532, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5844, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5204, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5676, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5532, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2768, selfPID=2768, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3256, selfPID=4184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=4476, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5736, selfPID=2548, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5060, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5728, selfPID=2788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5452, selfPID=1644, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5584, selfPID=1528, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2236, selfPID=1732, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2936, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=1072, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1088, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5428, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5524, selfPID=5524, iMonCtr=2
20:27:12 (5092): No heartbeat from core client for 30 sec - exiting
20:27:14 (5092): No heartbeat from core client for 30 sec - exiting
20:27:15 (5092): No heartbeat from core client for 30 sec - exiting
20:27:16 (5092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:59:44 (4608): No heartbeat from core client for 30 sec - exiting
20:59:46 (4608): No heartbeat from core client for 30 sec - exiting
20:59:47 (4608): No heartbeat from core client for 30 sec - exiting
20:59:48 (4608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:21:20 (5424): No heartbeat from core client for 30 sec - exiting
21:21:22 (5424): No heartbeat from core client for 30 sec - exiting
21:21:23 (5424): No heartbeat from core client for 30 sec - exiting
21:21:24 (5424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:39:18 (5240): No heartbeat from core client for 30 sec - exiting
22:39:19 (5240): No heartbeat from core client for 30 sec - exiting
22:39:20 (5240): No heartbeat from core client for 30 sec - exiting
22:39:22 (5240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Leaving CPDN_Main::Monitor...
Called boinc_finish
</stderr_txt>
]]> |
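The stderr text above repeats a small set of messages many times, so a tally can be easier to read than the raw log. The following is a minimal sketch, not part of BOINC or CPDN: the function name `tally_events` and the event labels are hypothetical, and the message fragments are copied from the log above. It assumes each fragment appears verbatim and counts plain substring matches.

```python
# Message fragments copied from the stderr text above.
# The dictionary keys are hypothetical labels chosen for this sketch.
EVENTS = {
    "crash_restart": "Model crash detected, will try to restart...",
    "quit_request": "CPDN Monitor - Quit request from BOINC...",
    "suspend_request": "CPDN Monitor - Suspend request from BOINC...",
    "heartbeat_lost": "CPDN Monitor - No 'heartbeat' from BOINC...",
}


def tally_events(stderr_text: str) -> dict[str, int]:
    """Count non-overlapping occurrences of each known fragment in the text."""
    return {label: stderr_text.count(fragment) for label, fragment in EVENTS.items()}


if __name__ == "__main__":
    # A short excerpt in the same shape as the log above.
    sample = (
        "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
        "checkPID=3296, selfPID=4672, iMonCtr=1 "
        "Model crash detected, will try to restart... "
        "Leaving CPDN_Main::Monitor... "
        "CPDN Monitor - Quit request from BOINC... "
        "Suspended CPDN Monitor - Suspend request from BOINC..."
    )
    for label, count in tally_events(sample).items():
        print(f"{label}: {count}")
```

Passing the full stderr text instead of `sample` would give per-event totals for this task. Because the match is a substring count, it works whether the log is on one line or split across many.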
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
22 May 2015 08:20:25 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 138,539 | 653,569 | 4.7176 |
08 May 2015 19:26:26 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 127,019 | 600,867 | 4.7305 |
08 May 2015 19:12:52 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 115,499 | 546,841 | 4.7346 |
08 May 2015 19:08:59 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 103,979 | 493,487 | 4.7460 |
08 May 2015 19:04:42 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 92,459 | 439,404 | 4.7524 |
28 Apr 2015 17:00:36 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 80,939 | 384,649 | 4.7523 |
23 Apr 2015 14:57:31 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 69,419 | 330,102 | 4.7552 |
21 Apr 2015 14:37:07 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 57,899 | 275,967 | 4.7664 |
17 Apr 2015 18:42:24 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 46,379 | 221,062 | 4.7664 |
14 Apr 2015 20:17:29 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 34,859 | 167,256 | 4.7981 |
13 Apr 2015 18:43:59 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 23,339 | 111,859 | 4.7928 |
12 Apr 2015 14:10:17 | 1277229 | 18281336 | hadam3p_anz_d989_2013_1_009726457_0 | 11,819 | 56,838 | 4.8090 |
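
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That relationship is an assumption inferred from the rows, not something the page states. A minimal sketch that checks it against three rows copied from the table (the function name `sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (138_539, 653_569, 4.7176),
    (127_019, 600_867, 4.7305),
    (11_819, 56_838, 4.8090),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places.

    Assumes the reported average is a plain ratio of the two columns.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_sec, reported in TRICKLES:
        computed = sec_per_timestep(cpu_sec, timestep)
        print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

For these three rows the computed ratio matches the reported value to four decimal places, which is consistent with the assumption above.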