Name | hadam3p_pnw_bejd_1960_1_007905144_0 |
Workunit | 8060256 |
Created | 17 Apr 2012, 17:52:30 UTC |
Sent | 12 May 2012, 11:26:14 UTC |
Report deadline | 24 Apr 2013, 16:46:14 UTC |
Received | 21 May 2012, 10:01:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1197298 |
Run time | 3 days 8 hours 59 min 33 sec |
CPU time | 3 days 0 hours 38 min 55 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 2.79 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.24</core_client_version> <![CDATA[ <stderr_txt> 10:17:58 (17682): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 forrtl: No space left on device forrtl: severe (38): error during write, unit 0, file /var/lib/boinc-client/projects/climateprediction.net/hadam3p_pnw_bejd_1960_1_007905144/dataout/xaakg.err Image PC Routine Line Source hadrm3p_pnw_um_6. 083C744D Unknown Unknown Unknown hadrm3p_pnw_um_6. 083C6245 Unknown Unknown Unknown hadrm3p_pnw_um_6. 08396C9F Unknown Unknown Unknown hadrm3p_pnw_um_6. 08352E0D Unknown Unknown Unknown hadrm3p_pnw_um_6. 08352757 Unknown Unknown Unknown hadrm3p_pnw_um_6. 0838CD8F Unknown Unknown Unknown hadrm3p_pnw_um_6. 083897C9 Unknown Unknown Unknown hadrm3p_pnw_um_6. 08069968 Unknown Unknown Unknown hadrm3p_pnw_um_6. 082CCDA2 Unknown Unknown Unknown libc.so.6 F0DC64D3 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1424, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Regional yearly means requires 12 input files got 6 zip I/O error: No space left on device zip error: Output file write failure (write error on zip file) Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bejd_1960_1_007905144_0_13.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
15 May 2012 14:19:36 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 69,216 | 236,126 | 3.4114 |
15 May 2012 03:06:35 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 57,696 | 198,724 | 3.4443 |
14 May 2012 15:16:34 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 46,176 | 159,814 | 3.4610 |
14 May 2012 10:35:32 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 34,656 | 117,856 | 3.4007 |
13 May 2012 14:18:51 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 23,136 | 74,941 | 3.2392 |
13 May 2012 01:33:45 | 1197298 | 14443801 | hadam3p_pnw_bejd_1960_1_007905144_0 | 11,616 | 37,679 | 3.2437 |
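
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The following is a minimal sketch of that arithmetic, assuming this interpretation is correct. The values are copied from the table; the names `TRICKLES` and `seconds_per_timestep` are hypothetical and not part of BOINC or CPDN.

```python
# Rows copied from the trickle table, newest first:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (69216, 236126, 3.4114),
    (57696, 198724, 3.4443),
    (46176, 159814, 3.4610),
    (34656, 117856, 3.4007),
    (23136, 74941, 3.2392),
    (11616, 37679, 3.2437),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Hypothetical helper: CPU seconds per timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# Print the recomputed average next to the value reported in the table.
for timestep, cpu_time_sec, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>6} {cpu_time_sec:>7} {computed:.4f} (reported {reported:.4f})")
```

Under this assumption, each recomputed value can be compared directly with the reported column, and the timestep count rises by 11,520 between consecutive trickles.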
©2024 cpdn.org