Name | hadam3p_anz_nanf_2012_1_008601419_2 |
Workunit | 8747931 |
Created | 27 Mar 2014, 8:53:07 UTC |
Sent | 27 Mar 2014, 8:57:11 UTC |
Report deadline | 9 Mar 2015, 14:17:11 UTC |
Received | 29 May 2014, 6:55:05 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1241124 |
Run time | 23 days 18 hours 18 min 12 sec |
CPU time | 22 days 18 hours 51 min 10 sec |
Validate state | Workunit error - check skipped |
Credit | 5,974.74 |
Device peak FLOPS | 1.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr |
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4228, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
19:07:18 (6088): No heartbeat from core client for 30 sec - exiting
19:07:19 (6088): No heartbeat from core client for 30 sec - exiting
19:07:21 (6088): No heartbeat from core client for 30 sec - exiting
19:07:22 (6088): No heartbeat from core client for 30 sec - exiting
19:07:23 (6088): No heartbeat from core client for 30 sec - exiting
19:07:24 (6088): No heartbeat from core client for 30 sec - exiting
19:07:25 (6088): No heartbeat from core client for 30 sec - exiting
19:07:26 (6088): No heartbeat from core client for 30 sec - exiting
19:07:27 (6088): No heartbeat from core client for 30 sec - exiting
19:07:28 (6088): No heartbeat from core client for 30 sec - exiting
19:07:29 (6088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4952, selfPID=4952, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5952, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5540, selfPID=5380, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=5888, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4288, selfPID=3024, iMonCtr=1
Model crash detected, will try to restart...
06:14:05 (6104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=856, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5880, selfPID=6128, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:32:57 (5988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1808, selfPID=1808, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13764, selfPID=12508, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3708, selfPID=5708, iMonCtr=1
Model crash detected, will try to restart...
06:33:00 (4612): No heartbeat from core client for 30 sec - exiting
06:33:03 (4612): No heartbeat from core client for 30 sec - exiting
06:33:04 (4612): No heartbeat from core client for 30 sec - exiting
06:33:05 (4612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2608, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7112, selfPID=6748, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6112, selfPID=14456, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=756, selfPID=1144, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9008, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3812, selfPID=6024, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=4948, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32612, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9496, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3284, selfPID=6076, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=6968, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5348, selfPID=4248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4548, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6908, selfPID=10132, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=21920, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16512, selfPID=16776, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
08:50:11 (6056): No heartbeat from core client for 30 sec - exiting
08:50:13 (6056): No heartbeat from core client for 30 sec - exiting
08:50:14 (6056): No heartbeat from core client for 30 sec - exiting
08:50:15 (6056): No heartbeat from core client for 30 sec - exiting
08:50:16 (6056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5972, selfPID=7644, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Leaving CPDN_Main::Monitor...
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 May 2014 06:55:57 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 138,539 | 1,967,341 | 14.2006 |
25 May 2014 18:54:58 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 127,019 | 1,805,273 | 14.2126 |
21 May 2014 03:59:53 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 115,499 | 1,646,307 | 14.2539 |
17 May 2014 23:52:31 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 103,979 | 1,487,737 | 14.3081 |
14 May 2014 06:16:25 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 92,459 | 1,330,990 | 14.3955 |
11 May 2014 01:44:49 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 80,939 | 1,169,471 | 14.4488 |
07 May 2014 00:59:40 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 69,419 | 1,008,063 | 14.5214 |
03 May 2014 22:18:36 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 57,899 | 846,525 | 14.6207 |
27 Apr 2014 04:12:48 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 46,379 | 681,701 | 14.6985 |
20 Apr 2014 11:43:37 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 34,859 | 504,403 | 14.4698 |
07 Apr 2014 06:30:22 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 23,339 | 331,567 | 14.2066 |
31 Mar 2014 11:59:15 | 1241124 | 16421860 | hadam3p_anz_nanf_2012_1_008601419_2 | 11,819 | 169,105 | 14.3079 |
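
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That is an inference from the figures in the table, not something the page states. The sketch below checks it on three rows copied from the table; the names `TRICKLES` and `sec_per_timestep` are hypothetical and used only for illustration.

```python
# Three rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (138_539, 1_967_341, 14.2006),
    (46_379, 681_701, 14.6985),
    (11_819, 169_105, 14.3079),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed formula: average seconds per timestep = CPU time / timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = sec_per_timestep(cpu_time_sec, timestep)
        # Print the computed value next to the reported one for comparison.
        print(f"timestep={timestep}: computed={computed:.4f}, reported={reported}")
```

If the assumption holds, each computed value agrees with the reported one to four decimal places.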