Name | hadam3p_anz_nc26_2012_1_008589318_0 |
Workunit | 8735830 |
Created | 25 Mar 2014, 20:17:48 UTC |
Sent | 25 Mar 2014, 20:20:56 UTC |
Report deadline | 8 Mar 2015, 1:40:56 UTC |
Received | 20 Aug 2014, 8:05:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Aborted by user |
Exit status | 203 (0x000000CB) EXIT_ABORTED_VIA_GUI |
Computer ID | 1169010 |
Run time | 8 days 15 hours 1 min 55 sec |
CPU time | 5 days 4 hours 23 min 21 sec |
Validate state | Invalid |
Credit | 2,000.18 |
Device peak FLOPS | 2.49 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> aborted by user </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3480, selfPID=4336, iMonCtr=1 Model crash detected, will try to restart... 07:30:26 (5084): No heartbeat from core client for 30 sec - exiting 07:30:28 (5084): No heartbeat from core client for 30 sec - exiting 07:30:29 (5084): No heartbeat from core client for 30 sec - exiting 07:30:30 (5084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 00:49:49 (2128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5976, iMonCtr=2 08:53:12 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5356, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 22:10:52 (4652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:20:16 (4752): No heartbeat from core client for 30 sec - exiting 22:20:17 (4752): No heartbeat from core client for 30 sec - exiting 22:20:18 (4752): No heartbeat from core client for 30 sec - exiting 22:20:19 (4752): No heartbeat from core client for 30 sec - exiting 22:20:20 (4752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:04:53 (5868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=5476, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5848, selfPID=6124, iMonCtr=1 Model crash detected, will try to restart... 09:53:11 (872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GCPDN Monitor - Quit request from BOINC... 19:22:59 (1388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
00:04:59 (4328): No heartbeat from core client for 30 sec - exiting 00:05:00 (4328): No heartbeat from core client for 30 sec - exiting 00:05:01 (4328): No heartbeat from core client for 30 sec - exiting 00:05:02 (4328): No heartbeat from core client for 30 sec - exiting 00:05:03 (4328): No heartbeat from core client for 30 sec - exiting 00:05:04 (4328): No heartbeat from core client for 30 sec - exiting 00:05:05 (4328): No heartbeat from core client for 30 sec - exiting 00:05:06 (4328): No heartbeat from core client for 30 sec - exiting 00:05:07 (4328): No heartbeat from core client for 30 sec - exiting 00:05:08 (4328): No heartbeat from core client for 30 sec - exiting 00:05:09 (4328): No heartbeat from core client for 30 sec - exiting 00:05:10 (4328): No heartbeat from core client for 30 sec - exiting 00:05:11 (4328): No heartbeat from core client for 30 sec - exiting 00:05:12 (4328): No heartbeat from core client for 30 sec - exiting 00:05:13 (4328): No heartbeat from core client for 30 sec - exiting 00:05:14 (4328): No heartbeat from core client for 30 sec - exiting 00:05:15 (4328): No heartbeat from core client for 30 sec - exiting 00:05:16 (4328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10136, selfPID=9696, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4568, selfPID=6088, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=468, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2696, iMonCtr=2 Model crash detected, will try to restart... 
19:56:27 (5956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:08:46 (3176): No heartbeat from core client for 30 sec - exiting 20:08:47 (3176): No heartbeat from core client for 30 sec - exiting 20:08:48 (3176): No heartbeat from core client for 30 sec - exiting 20:08:49 (3176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:43:34 (5484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... 23:27:25 (3960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:27:27 (3960): No heartbeat from core client for 30 sec - exiting 23:27:28 (3960): No heartbeat from core client for 30 sec - exiting 23:27:29 (3960): No heartbeat from core client for 30 sec - exiting 23:27:30 (3960): No heartbeat from core client for 30 sec - exiting 23:27:31 (3960): No heartbeat from core client for 30 sec - exiting 23:27:32 (3960): No heartbeat from core client for 30 sec - exiting 23:27:33 (3960): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5828, selfPID=1532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5900, selfPID=5256, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7052, selfPID=7052, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:29:39 (5976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:29:40 (5976): No heartbeat from core client for 30 sec - exiting 07:29:41 (5976): No heartbeat from core client for 30 sec - exiting 07:29:42 (5976): No heartbeat from core client for 30 sec - exiting 07:29:43 (5976): No heartbeat from core client for 30 sec - exiting 07:29:44 (5976): No heartbeat from core client for 30 sec - exiting 07:29:45 (5976): No heartbeat from core client for 30 sec - exiting 07:29:46 (5976): No heartbeat from core client for 30 sec - exiting 07:29:47 (5976): No heartbeat from core client for 30 sec - exiting 07:29:48 (5976): No heartbeat from core client for 30 sec - exiting 07:29:49 (5976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 18:33:45 (1152): No heartbeat from core client for 30 sec - exiting 18:33:47 (1152): No heartbeat from core client for 30 sec - exiting 18:33:50 (1152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... G23:10:52 (5072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1432, selfPID=1432, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5028, selfPID=5524, iMonCtr=1 Model crash detected, will try to restart... 
23:00:37 (3288): No heartbeat from core client for 30 sec - exiting 23:00:38 (3288): No heartbeat from core client for 30 sec - exiting 23:00:39 (3288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:59:39 (4964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4336, selfPID=4336, iMonCtr=2 16:05:45 (5496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:17:56 (4796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=308, selfPID=308, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5744, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Abort request from BOINC... </stderr_txt> ]]> |
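The stderr text above repeats a small set of message types many times. As a rough aid for reading it, the sketch below tallies those types with regular expressions. The category labels and the `tally_messages` helper are hypothetical names introduced here for illustration, and the sample string is a short excerpt built from messages in the log, not the full output.

```python
import re
from collections import Counter

# Hypothetical category labels mapped to message text seen in the stderr above.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client for 30 sec - exiting"),
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "suspend_request": re.compile(r"CPDN Monitor - Suspend request from BOINC"),
    "abort_request": re.compile(r"CPDN Monitor - Abort request from BOINC"),
    "model_crash": re.compile(r"Model crash detected, will try to restart"),
}


def tally_messages(stderr_text: str) -> Counter:
    """Count non-overlapping occurrences of each known message type."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(pattern.findall(stderr_text))
    return counts


# Short excerpt assembled from messages in the log above (not the full text).
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=3480, selfPID=4336, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "07:30:26 (5084): No heartbeat from core client for 30 sec - exiting "
    "07:30:28 (5084): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - Quit request from BOINC... "
    "CPDN Monitor - Abort request from BOINC..."
)

counts = tally_messages(sample)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

Passing the full contents of the `<stderr_txt>` element instead of `sample` would give totals for the whole task.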
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Aug 2014 17:59:19 | 1169010 | 16404446 | hadam3p_anz_nc26_2012_1_008589318_0 | 46,379 | 438,373 | 9.4520 |
19 Jul 2014 08:06:02 | 1169010 | 16404446 | hadam3p_anz_nc26_2012_1_008589318_0 | 34,859 | 316,104 | 9.0681 |
24 May 2014 18:33:35 | 1169010 | 16404446 | hadam3p_anz_nc26_2012_1_008589318_0 | 23,339 | 206,025 | 8.8275 |
13 Apr 2014 10:21:32 | 1169010 | 16404446 | hadam3p_anz_nc26_2012_1_008589318_0 | 11,819 | 106,767 | 9.0335 |
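The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached. This is an inference from the four rows above, not something the page states. The sketch below recomputes the column from the Timestep and CPU Time values; `seconds_per_timestep` is a hypothetical helper name.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (46379, 438373, 9.4520),
    (34859, 316104, 9.0681),
    (23339, 206025, 8.8275),
    (11819, 106767, 9.0335),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return round(cpu_time_sec / timestep, 4)


recomputed = [seconds_per_timestep(cpu, ts) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, recomputed):
    print(f"timestep {ts}: recomputed {value:.4f}, reported {reported:.4f}")
```

Under that assumption, the recomputed values agree with the reported column to four decimal places for all four trickles.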