Name | hadam3p_anz_a21r_2013_1_009460958_0 |
Workunit | 9543192 |
Created | 14 Jan 2015, 10:48:58 UTC |
Sent | 18 Jan 2015, 22:37:34 UTC |
Report deadline | 1 Jan 2016, 3:57:34 UTC |
Received | 2 May 2015, 17:12:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1308980 |
Run time | 9 days 14 hours 59 min 15 sec |
CPU time | 7 days 13 hours 26 min 46 sec |
Validate state | Invalid |
Credit | 5,477.92 |
Device peak FLOPS | 3.00 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.42</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=5352, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4580, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5052, selfPID=5052, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9924, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6696, selfPID=6696, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3124, selfPID=4380, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6416, selfPID=6416, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9656, selfPID=9656, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2776, selfPID=5580, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:10:25 (2184): No heartbeat from core client for 30 sec - exiting 12:10:26 (2184): No heartbeat from core client for 30 sec - exiting 12:10:27 (2184): No heartbeat from core client for 30 sec - exiting 12:10:28 (2184): No heartbeat from core client for 30 sec - exiting 12:10:29 (2184): No heartbeat from core client for 30 sec - exiting 12:10:30 (2184): No heartbeat from core client for 30 sec - exiting 12:10:31 (2184): No heartbeat from core client for 30 sec - exiting 12:10:32 (2184): No heartbeat from core client for 30 sec - exiting 12:10:33 (2184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2880, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:09:47 (5420): No heartbeat from core client for 30 sec - exiting 22:09:48 (5420): No heartbeat from core client for 30 sec - exiting 22:09:50 (5420): No heartbeat from core client for 30 sec - exiting 22:09:51 (5420): No heartbeat from core client for 30 sec - exiting 22:09:52 (5420): No heartbeat from core client for 30 sec - exiting 22:09:53 (5420): No heartbeat from core client for 30 sec - exiting 22:09:54 (5420): No heartbeat from core client for 30 sec - exiting 22:09:55 (5420): No heartbeat from core client for 30 sec - exiting 22:09:56 (5420): No heartbeat from core client for 30 sec - exiting 22:09:57 (5420): No heartbeat from core client for 30 sec - exiting 22:09:58 (5420): No heartbeat from core client for 30 sec - exiting 22:09:59 (5420): No heartbeat from core client for 30 sec - exiting 22:10:00 (5420): No heartbeat from core client for 30 sec - exiting 22:10:02 (5420): No heartbeat from core client for 30 sec - exiting 22:10:03 (5420): No heartbeat from core client for 30 sec - exiting 22:10:04 (5420): No heartbeat from core client for 30 sec - exiting 22:10:05 (5420): No heartbeat from core client for 30 sec - exiting 22:10:06 (5420): No heartbeat from core client for 30 sec - exiting 22:10:07 (5420): No heartbeat from core client for 30 sec - exiting 22:10:08 (5420): No heartbeat from core client for 30 sec - exiting 22:10:09 (5420): No heartbeat from core client for 30 sec - exiting 22:10:10 (5420): No heartbeat from core client for 30 sec - exiting 22:10:11 (5420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3468, selfPID=3468, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6764, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7988, selfPID=7988, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7988, selfPID=4876, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_a21r_2013_1_009460958_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 May 2015 19:06:30 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 127,019 | 646,952 | 5.0933 |
12 Apr 2015 07:12:29 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 115,499 | 589,108 | 5.1005 |
10 Apr 2015 22:16:48 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 103,979 | 531,609 | 5.1127 |
05 Apr 2015 20:11:25 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 92,459 | 474,374 | 5.1306 |
22 Mar 2015 17:18:44 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 80,939 | 417,082 | 5.1530 |
17 Mar 2015 22:44:53 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 69,419 | 357,347 | 5.1477 |
15 Mar 2015 22:48:17 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 57,899 | 296,749 | 5.1253 |
15 Mar 2015 04:19:58 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 46,379 | 236,747 | 5.1046 |
14 Mar 2015 08:01:49 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 34,859 | 178,299 | 5.1149 |
11 Mar 2015 18:50:06 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 23,339 | 120,132 | 5.1473 |
24 Jan 2015 13:25:21 | 1308980 | 17789086 | hadam3p_anz_a21r_2013_1_009460958_0 | 11,819 | 60,291 | 5.1012 |
©2024 cpdn.org