Name | hadam3p_saf_0ptl_1968_1_006847441_1 |
Workunit | 7050757 |
Created | 6 Apr 2012, 15:51:56 UTC |
Sent | 6 Apr 2012, 15:52:32 UTC |
Report deadline | 19 Mar 2013, 21:12:32 UTC |
Received | 12 Apr 2012, 16:33:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 786542 |
Run time | 5 days 5 hours 33 min 42 sec |
CPU time | 5 days 5 hours 33 min 42 sec |
Validate state | Invalid |
Credit | 2,068.11 |
Device peak FLOPS | 1.81 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>5.10.20</core_client_version> <![CDATA[ <stderr_txt> Precis Restart file copy #1 failed on 0ptlga.dag8c40 Precis Restart file copy #1 failed on 0ptlga.dag8c80 Precis Restart file copy #1 failed on 0ptlga.dag8ca0 Precis Restart file copy #1 failed on 0ptlga.dag8cb0 Precis Restart file copy #1 failed on 0ptlga.dag8cc0 Precis Restart file copy #1 failed on 0ptlga.dag8cd0 Precis Restart file copy #1 failed on 0ptlga.dag8ce0 Precis Restart file copy #1 failed on 0ptlga.dag8ch0 Precis Restart file copy #1 failed on 0ptlga.dag8ci0 Precis Restart file copy #1 failed on 0ptlga.dag8cj0 Precis Restart file copy #1 failed on 0ptlga.dag8cn0 Precis Restart file copy #1 failed on 0ptlga.dag8co0 Precis Restart file copy #1 failed on 0ptlga.dag8cr0 Precis Restart file copy #1 failed on 0ptlga.dag8cs0 Precis Restart file copy #1 failed on 0ptlga.dag8ct0 Precis Restart file copy #1 failed on 0ptlga.dag8cu0 Precis Restart file copy #1 failed on 0ptlga.dag9130 Precis Restart file copy #1 failed on 0ptlga.dag9140 Precis Restart file copy #1 failed on 0ptlga.dag9150 Precis Restart file copy #1 failed on 0ptlga.dag9160 Precis Restart file copy #1 failed on 0ptlga.dag9170 Precis Restart file copy #1 failed on 0ptlga.dag9180 04:53:03 (1796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Precis Restart file copy #1 failed on 0ptlga.dag91b0 Precis Restart file copy #1 failed on 0ptlga.dag91o0 Precis Restart file copy #1 failed on 0ptlga.dag91s0 Precis Restart file copy #1 failed on 0ptlga.dag9270 Precis Restart file copy #1 failed on 0ptlga.dag92d0 Precis Restart file copy #1 failed on 0ptlga.dag92e0 Precis Restart file copy #1 failed on 0ptlga.dag92m0 Precis Restart file copy #1 failed on 0ptlga.dag92p0 Precis Restart file copy #1 failed on 0ptlga.dag92s0 Precis Restart file copy #1 failed on 0ptlga.dag9360 04:27:59 (256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Precis Restart file copy #1 failed on 0ptlga.dag9380 Precis Restart file copy #1 failed on 0ptlga.dag93c0 Precis Restart file copy #1 failed on 0ptlga.dag93e0 Precis Restart file copy #1 failed on 0ptlga.dag93h0 Precis Restart file copy #1 failed on 0ptlga.dag93j0 Precis Restart file copy #1 failed on 0ptlga.dag93k0 Precis Restart file copy #1 failed on 0ptlga.dag93n0 Precis Restart file copy #1 failed on 0ptlga.dag93o0 Precis Restart file copy #1 failed on 0ptlga.dag93p0 Precis Restart file copy #1 failed on 0ptlga.dag93r0 Precis Restart file copy #1 failed on 0ptlga.dag93s0 Precis Restart file copy #1 failed on 0ptlga.dag9410 Precis Restart file copy #1 failed on 0ptlga.dag9420 Precis Restart file copy #1 failed on 0ptlga.dag9470 Precis Restart file copy #1 failed on 0ptlga.dag9490 Precis Restart file copy #1 failed on 0ptlga.dag94b0 Precis Restart file copy #1 failed on 0ptlga.dag94d0 Precis Restart file copy #1 failed on 0ptlga.dag94e0 Precis Restart file copy #1 failed on 0ptlga.dag94g0 Precis Restart file copy #1 failed on 0ptlga.dag94m0 Precis Restart file copy #1 failed on 0ptlga.dag94p0 Precis Restart file copy #1 failed on 0ptlga.dag94s0 Precis Restart file copy #1 failed on 0ptlga.dag94t0 Precis Restart file copy #1 failed on 0ptlga.dag9560 Precis Restart file copy #1 failed on 0ptlga.dag9580 Precis Restart file copy #1 
failed on 0ptlga.dag95b0 04:37:00 (3712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3920, selfPID=3920, iMonCtr=2 Precis Restart file copy #1 failed on 0ptlga.dag95l0 Precis Restart file copy #1 failed on 0ptlga.dag95q0 Precis Restart file copy #1 failed on 0ptlga.dag95u0 Precis Restart file copy #1 failed on 0ptlga.dag9680 Precis Restart file copy #1 failed on 0ptlga.dag96b0 Precis Restart file copy #1 failed on 0ptlga.dag96l0 Precis Restart file copy #1 failed on 0ptlga.dag96n0 CPDN Monitor - Quit request from BOINC... Precis Restart file copy #1 failed on 0ptlga.dag96n0 Precis Restart file copy #1 failed on 0ptlga.dag96p0 Precis Restart file copy #1 failed on 0ptlga.dag96q0 Precis Restart file copy #1 failed on 0ptlga.dag96t0 Precis Restart file copy #1 failed on 0ptlga.dag9710 Precis Restart file copy #1 failed on 0ptlga.dag97b0 Precis Restart file copy #1 failed on 0ptlga.dag97c0 Precis Restart file copy #1 failed on 0ptlga.dag97f0 Precis Restart file copy #1 failed on 0ptlga.dag97q0 Precis Restart file copy #1 failed on 0ptlga.dag9860 Precis Restart file copy #1 failed on 0ptlga.dag98d0 Precis Restart file copy #1 failed on 0ptlga.dag98m0 Precis Restart file copy #1 failed on 0ptlga.dag98s0 Precis Restart file copy #1 failed on 0ptlga.dag9940 Precis Restart file copy #1 failed on 0ptlga.dag9950 Precis Restart file copy #1 failed on 0ptlga.dag9990 Precis Restart file copy #1 failed on 0ptlga.dag99b0 Precis Restart file copy #1 failed on 0ptlga.dag99f0 Precis Restart file copy #1 failed on 0ptlga.dag99k0 Precis Restart file copy #1 failed on 0ptlga.dag99m0 Precis Restart file copy #1 failed on 0ptlga.dag99n0 Precis Restart file copy #1 failed on 0ptlga.dag99o0 04:18:04 (3644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Precis Restart file copy #1 failed on 0ptlga.dag99t0 Precis Restart file copy #1 failed on 0ptlga.dag99u0 Precis Restart file copy #1 failed on 0ptlga.dag9a40 Precis Restart file copy #1 failed on 0ptlga.dag9ac0 Precis Restart file copy #1 failed on 0ptlga.dag9ae0 Precis Restart file copy #1 failed on 0ptlga.dag9af0 Precis Restart file copy #1 failed on 0ptlga.dag9ak0 Precis Restart file copy #1 failed on 0ptlga.dag9ao0 Precis Restart file copy #1 failed on 0ptlga.dag9b10 Precis Restart file copy #1 failed on 0ptlga.dag9b30 Precis Restart file copy #1 failed on 0ptlga.dag9b50 Precis Restart file copy #1 failed on 0ptlga.dag9b60 Precis Restart file copy #1 failed on 0ptlga.dag9b90 Precis Restart file copy #1 failed on 0ptlga.dag9bb0 Precis Restart file copy #1 failed on 0ptlga.dag9bf0 Precis Restart file copy #1 failed on 0ptlga.dag9bg0 Precis Restart file copy #1 failed on 0ptlga.dag9bh0 Precis Restart file copy #1 failed on 0ptlga.dag9bi0 Precis Restart file copy #1 failed on 0ptlga.dag9bl0 Precis Restart file copy #1 failed on 0ptlga.dag9bm0 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_0ptl_1968_1_006847441_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 Apr 2012 23:18:59 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 127,488 | 423,301 | 3.3203 |
11 Apr 2012 13:13:52 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 115,968 | 387,166 | 3.3386 |
11 Apr 2012 02:44:52 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 104,448 | 349,953 | 3.3505 |
10 Apr 2012 16:18:29 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 92,928 | 312,825 | 3.3663 |
10 Apr 2012 04:41:42 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 81,408 | 275,762 | 3.3874 |
09 Apr 2012 18:23:47 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 69,792 | 237,224 | 3.3990 |
09 Apr 2012 06:57:18 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 58,272 | 199,474 | 3.4232 |
08 Apr 2012 20:19:18 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 46,752 | 161,402 | 3.4523 |
08 Apr 2012 09:14:16 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 35,232 | 122,967 | 3.4902 |
07 Apr 2012 22:32:25 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 23,712 | 84,523 | 3.5646 |
07 Apr 2012 09:27:42 | 786542 | 14371515 | hadam3p_saf_0ptl_1968_1_006847441_1 | 11,616 | 38,715 | 3.3329 |
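
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count for each trickle. That is inferred from the values in the table, not from any project documentation. The Python sketch below recomputes the column under that assumption; the names `TRICKLES` and `average_sec_per_timestep` are hypothetical and exist only for this illustration.

```python
# Rows copied from the trickle table above, newest first:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (127_488, 423_301, 3.3203),
    (115_968, 387_166, 3.3386),
    (104_448, 349_953, 3.3505),
    (92_928, 312_825, 3.3663),
    (81_408, 275_762, 3.3874),
    (69_792, 237_224, 3.3990),
    (58_272, 199_474, 3.4232),
    (46_752, 161_402, 3.4523),
    (35_232, 122_967, 3.4902),
    (23_712, 84_523, 3.5646),
    (11_616, 38_715, 3.3329),
]


def average_sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timestep


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = average_sec_per_timestep(timestep, cpu_seconds)
        # Show the recomputed value next to the value the page reported.
        print(f"{timestep:>7} TS  computed {computed:.4f}  reported {reported:.4f}")
```

Because the average is cumulative, it falls slowly from about 3.56 to 3.32 sec/TS over the run even though each trickle adds a similar number of timesteps.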
©2024 cpdn.org