Name | hadam3p_pnw_32se_1984_1_007183766_0 |
Workunit | 7382048 |
Created | 22 Feb 2011, 12:46:53 UTC |
Sent | 27 Feb 2011, 18:22:39 UTC |
Report deadline | 9 Feb 2012, 23:42:39 UTC |
Received | 27 Mar 2011, 9:05:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1045219 |
Run time | 4 days 11 hours 19 min 33 sec |
CPU time | 1 days 7 hours 47 min 23 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 2.91 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2184, selfPID=2836, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=4380, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3960, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4184, selfPID=3492, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 07:02:57 (3492): called boinc_finish Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4608, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3200, selfPID=4716, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4256, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 19:49:19 (3436): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4796, selfPID=4064, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5056, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2964, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 3 21:06:38 (4212): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_32se_1984_1_007183766_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Mar 2011 04:19:19 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 69,216 | 206,408 | 2.9821 |
13 Mar 2011 18:30:54 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 57,696 | 170,891 | 2.9619 |
12 Mar 2011 21:20:19 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 46,176 | 134,298 | 2.9084 |
12 Mar 2011 11:06:42 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 34,656 | 97,872 | 2.8241 |
08 Mar 2011 13:26:13 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 23,136 | 62,174 | 2.6873 |
08 Mar 2011 13:26:13 | 1045219 | 12623194 | hadam3p_pnw_32se_1984_1_007183766_0 | 11,616 | 32,588 | 2.8054 |
©2024 cpdn.org