Name | hadam3p_eu_j6y8_1997_1_008500853_0 |
Workunit | 8651666 |
Created | 3 Dec 2013, 22:29:50 UTC |
Sent | 4 Dec 2013, 0:08:07 UTC |
Report deadline | 16 Nov 2014, 5:28:07 UTC |
Received | 16 Dec 2013, 3:20:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1284014 |
Run time | 4 days 6 hours 24 min 30 sec |
CPU time | 3 days 23 hours 32 min 40 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 1.85 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3412, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3864, selfPID=3324, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4368, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3864, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3884, selfPID=3320, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3504, selfPID=3252, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3904, selfPID=3368, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2612, selfPID=1552, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 08:25:04 (3664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3784, selfPID=3372, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3812, selfPID=3324, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_j6y8_1997_1_008500853_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_j6y8_1997_1_008500853_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_j6y8_1997_1_008500853_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_j6y8_1997_1_008500853_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Dec 2013 07:30:20 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 92,256 | 329,272 | 3.5691 |
13 Dec 2013 03:16:50 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 80,736 | 287,423 | 3.5600 |
12 Dec 2013 01:58:06 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 69,216 | 248,462 | 3.5897 |
11 Dec 2013 00:40:12 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 57,707 | 208,833 | 3.6189 |
10 Dec 2013 10:17:02 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 57,696 | 208,070 | 3.6063 |
09 Dec 2013 07:29:04 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 46,176 | 166,194 | 3.5991 |
08 Dec 2013 03:32:37 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 34,656 | 125,453 | 3.6200 |
06 Dec 2013 05:26:21 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 23,136 | 83,604 | 3.6136 |
05 Dec 2013 03:41:04 | 1284014 | 16128010 | hadam3p_eu_j6y8_1997_1_008500853_0 | 11,616 | 42,594 | 3.6668 |
©2024 cpdn.org