climateprediction.net home page
ANOTHER UPLOAD PROBLEM


Lockleys
Joined: 13 Jan 07
Posts: 195
Credit: 10,581,566
RAC: 0
Message 50568 - Posted: 20 Oct 2014, 12:49:45 UTC

Just have a single _13 upload for an eu model still waiting patiently for a server to accept it.
ID: 50568
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50572 - Posted: 21 Oct 2014, 6:46:02 UTC - in response to Message 50566.  

Bellator

That's a very old type of message. I thought that it had been removed from BOINC years ago.
As it's still there, the best advice is: Ignore it.
If the model IS going to crash, then just let it do that by itself.

I was thinking about this after a question from another user a week or so back, and wondered at the time if I should post a list of thoughts on the matter.
I have now done so, at the top of the Preferences section.

**************

If you're saying that you have several trickle_up files still on your computer, then that IS a problem, as the trickle server is working OK.

What messages do you get explaining why they're not uploading?

ID: 50572
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50573 - Posted: 21 Oct 2014, 6:47:39 UTC - in response to Message 50568.  

Lockleys

The restart server to which zip 13s go should be working.
Any messages about why it's not uploading?

ID: 50573
Lockleys
Joined: 13 Jan 07
Posts: 195
Credit: 10,581,566
RAC: 0
Message 50576 - Posted: 21 Oct 2014, 9:25:00 UTC - in response to Message 50573.  

Les

I just get the standard message in the Event Log whenever the _13 zip attempts an upload. For example:

21/10/2014 10:17:34 | | Resuming network activity
21/10/2014 10:17:34 | climateprediction.net | Started upload of hadam3p_eu_o3b0_2013_1_008830869_1_13.zip
21/10/2014 10:17:56 | climateprediction.net | Temporarily failed upload of hadam3p_eu_o3b0_2013_1_008830869_1_13.zip: connect() failed
21/10/2014 10:17:56 | climateprediction.net | Backing off 05:33:19 on upload of hadam3p_eu_o3b0_2013_1_008830869_1_13.zip
21/10/2014 10:17:59 | | Project communication failed: attempting access to reference site
21/10/2014 10:18:01 | | Internet access OK - project servers may be temporarily down.

This one stays stuck regardless of whether I close and reload CPDN or even reboot the PC.
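
For anyone wanting to see at a glance which files are stuck and why, Event Log lines like the ones above can be summarised with a short script. This is only a sketch: it assumes the log has been copied out as text in the pipe-separated layout shown above, the patterns are based purely on the message wording quoted here, and `summarise_failures` is a made-up name, not part of BOINC.

```python
import re
from collections import defaultdict

# Patterns based only on the message wording quoted in the post above.
FAILED = re.compile(r"Temporarily failed upload of (\S+): (.+)$")
BACKOFF = re.compile(r"Backing off (\d+):(\d\d):(\d\d) on upload of (\S+)$")


def summarise_failures(log_text):
    """Return {filename: {"reasons": [...], "backoff_seconds": int or None}}."""
    summary = defaultdict(lambda: {"reasons": [], "backoff_seconds": None})
    for line in log_text.splitlines():
        # Assumed layout: "<timestamp> | <project> | <message>"
        parts = [part.strip() for part in line.split("|", 2)]
        if len(parts) != 3:
            continue
        message = parts[2]
        failed = FAILED.search(message)
        if failed:
            summary[failed.group(1)]["reasons"].append(failed.group(2))
            continue
        backoff = BACKOFF.search(message)
        if backoff:
            hours, minutes, seconds = (int(g) for g in backoff.groups()[:3])
            # Keep the most recent backoff seen for this file.
            summary[backoff.group(4)]["backoff_seconds"] = (
                hours * 3600 + minutes * 60 + seconds
            )
    return dict(summary)
```

Fed the six lines above, it should report one file with the reason "connect() failed" and a backoff of 05:33:19. A connect() failure suggests the client never reached the server at all, which points at the server end rather than the PC.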
ID: 50576
Bellator
Joined: 31 Mar 05
Posts: 44
Credit: 234,235
RAC: 0
Message 50577 - Posted: 21 Oct 2014, 14:25:16 UTC - in response to Message 50572.  

19 Oct 2014 17:57:21 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 80,939 486,888 6.0155
15 Oct 2014 20:12:09 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 69,419 417,102 6.0085
12 Oct 2014 23:09:49 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 57,899 347,227 5.9971
10 Oct 2014 16:09:27 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 46,379 278,166 5.9977
04 Oct 2014 15:12:23 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 34,859 208,816 5.9903
29 Sep 2014 13:31:20 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 23,339 139,989 5.9981
26 Sep 2014 10:26:47 1288126 17091695 hadam3p_anz_ron8_2012_1_008958519_1 1 11,819 70,387 5.9554

Thank you Les.
As you can see, I have been getting trickles for a month now. This is one of two WUs; the other has identical results.
In my account, the total credit is stuck at 189,838 or 193,838, depending on where you look. I do not get any messages, just a note in the log that a trickle is being uploaded. So I now have two sets of 7 trickles, but no credit as far as I can see.
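
A rough check on the rows above, for anyone curious what the numbers mean. Reading the last three columns as timestep, CPU time in seconds, and average seconds per timestep is an assumption (the column headers are not quoted here), but the arithmetic fits that reading. The function names are made up for illustration.

```python
# Last three columns of each trickle row quoted above, oldest first.
# Assumed meaning (the headers are not shown in the post):
# (timestep, cpu_seconds, listed_seconds_per_timestep)
TRICKLES = [
    (11819, 70387, 5.9554),
    (23339, 139989, 5.9981),
    (34859, 208816, 5.9903),
    (46379, 278166, 5.9977),
    (57899, 347227, 5.9971),
    (69419, 417102, 6.0085),
    (80939, 486888, 6.0155),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Average CPU seconds spent per model timestep so far."""
    return cpu_seconds / timestep


def timestep_gaps(rows):
    """Timesteps completed between consecutive trickles."""
    steps = [timestep for timestep, _, _ in rows]
    return [later - earlier for earlier, later in zip(steps, steps[1:])]


if __name__ == "__main__":
    for timestep, cpu_seconds, listed in TRICKLES:
        computed = seconds_per_timestep(timestep, cpu_seconds)
        print(f"{timestep:>6}  computed {computed:.4f}  listed {listed:.4f}")
    print("gaps between trickles:", timestep_gaps(TRICKLES))
```

On these rows the computed and listed averages agree to four decimal places, and every gap is 11,520 timesteps. So the trickles show steady model progress; they say nothing either way about credit.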
ID: 50577
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50578 - Posted: 21 Oct 2014, 15:39:30 UTC - in response to Message 50577.  

The credit scripts get run manually now and then when Jonathan gets the time to keep an eye on what happens.

At the moment, it's about once per month, so credits can't be used as a measure of anything.

ID: 50578
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50580 - Posted: 21 Oct 2014, 19:56:54 UTC - in response to Message 50576.  

Lockleys

OK, I'll email them.


ID: 50580
Niall
Joined: 18 Dec 13
Posts: 62
Credit: 1,078,935
RAC: 0
Message 50689 - Posted: 30 Oct 2014, 11:18:40 UTC

I assume it's known that the server handling eu WUs is down? I have a pair here (eu_c0 series) that have completed but half the zip files are still sitting on my machine, waiting to upload.
ID: 50689
Byron Leigh Hatch @ team Carl ...
Joined: 17 Aug 04
Posts: 289
Credit: 44,103,664
RAC: 0
Message 50700 - Posted: 31 Oct 2014, 15:48:32 UTC

Hello everyone. I am just reporting some of my latest BOINC Event Log Messages in case it might help.

Thank you to staff, moderators and volunteers for doing your best under the current circumstances.

31/10/2014 8:24:44 AM | climateprediction.net | Started upload of hadam3p_eu_g3h1_2013_1_008852446_2_2.zip
31/10/2014 8:25:01 AM | climateprediction.net | Started upload of hadam3p_eu_g3h1_2013_1_008852446_2_1.zip
31/10/2014 8:25:21 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_g3h1_2013_1_008852446_2_2.zip: transient HTTP error
31/10/2014 8:25:21 AM | climateprediction.net | Backing off 00:09:41 on upload of hadam3p_eu_g3h1_2013_1_008852446_2_2.zip
31/10/2014 8:25:22 AM | | Project communication failed: attempting access to reference site
31/10/2014 8:25:26 AM | | Internet access OK - project servers may be temporarily down.
31/10/2014 8:26:40 AM | | Project communication failed: attempting access to reference site
31/10/2014 8:26:40 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_g3h1_2013_1_008852446_2_1.zip: transient HTTP error
31/10/2014 8:26:40 AM | climateprediction.net | Backing off 05:58:40 on upload of hadam3p_eu_g3h1_2013_1_008852446_2_1.zip
31/10/2014 8:26:41 AM | | Internet access OK - project servers may be temporarily down.
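
A small sketch for working out when the client is next due to retry a file, by adding the "Backing off" interval to the timestamp on the same log line. It assumes day/month/year dates with an AM/PM clock, as in the lines above, and an English locale for parsing AM/PM. `next_retry` is a made-up helper, not a BOINC function.

```python
from datetime import datetime, timedelta

# Timestamp layout assumed from the log lines above: day/month/year, 12-hour clock.
LOG_TIME_FORMAT = "%d/%m/%Y %I:%M:%S %p"


def next_retry(timestamp, backoff):
    """Return the local time at which a backed-off upload is next due.

    timestamp: e.g. "31/10/2014 8:26:40 AM"
    backoff:   e.g. "05:58:40" (hours:minutes:seconds)
    """
    logged_at = datetime.strptime(timestamp, LOG_TIME_FORMAT)
    hours, minutes, seconds = (int(part) for part in backoff.split(":"))
    return logged_at + timedelta(hours=hours, minutes=minutes, seconds=seconds)


if __name__ == "__main__":
    print(next_retry("31/10/2014 8:25:21 AM", "00:09:41"))
    print(next_retry("31/10/2014 8:26:40 AM", "05:58:40"))
```

For the _2_1.zip line above, this puts the next attempt at 14:25:20 local time on 31 Oct 2014. The client may well try sooner than that, for example if a retry is triggered by hand.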
ID: 50700
bl
Joined: 17 Nov 08
Posts: 5
Credit: 1,405,081
RAC: 57,350
Message 50708 - Posted: 2 Nov 2014, 6:38:40 UTC - in response to Message 50700.  

Hello everyone.

Yes, I'm experiencing the same problem and was hoping it would get fixed, yet day after day it's still stuck. The stuck units seem to be preventing a different project that requires all 8 CPUs from executing. I believe this is because of a bug in the BOINC client that makes it think finished tasks queued for upload still require a CPU. (My BOINC client may not be the newest, but it's not ancient; it's only a version or two behind.)

ID: 50708
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4533
Credit: 18,900,368
RAC: 22,763
Message 50709 - Posted: 2 Nov 2014, 7:07:19 UTC

"The stuck units seem to be preventing a different project that requires all 8 cpu's from executing"


Never experienced this, so I am clutching at straws. Is it possible to kickstart the last CPU by temporarily suspending CPDN and/or network activity?
ID: 50709
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50710 - Posted: 2 Nov 2014, 7:34:11 UTC - in response to Message 50708.  

bl

They're stuck because a server is down.
And it's the weekend.

ID: 50710
bl
Joined: 17 Nov 08
Posts: 5
Credit: 1,405,081
RAC: 57,350
Message 50718 - Posted: 2 Nov 2014, 23:08:09 UTC - in response to Message 50709.  

Oops, my bad, and no such BOINC bug after all.

After trying what you suggested I realized my computing prefs were set to allow only 7 CPUs to work on BOINC.

Sorry all, and thanks for the feedback/pointer.
ID: 50718
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4533
Credit: 18,900,368
RAC: 22,763
Message 50721 - Posted: 3 Nov 2014, 11:16:52 UTC

The upload server seems to be back on, according to the Server Status page. For those who haven't read this before: please be patient, as it is probably now getting hammered with upload requests, so you may get transient HTTP error messages.
ID: 50721
Byron Leigh Hatch @ team Carl ...
Joined: 17 Aug 04
Posts: 289
Credit: 44,103,664
RAC: 0
Message 50722 - Posted: 3 Nov 2014, 14:33:16 UTC
Last modified: 3 Nov 2014, 14:50:44 UTC

Thank you Dave. I can report that all my European region files have now uploaded from my BOINC transfer tab, for example:

Finished upload of hadam3p_eu_g3h1_2013_1_008852446_2_7.zip
ID: 50722
Helmer Bryd
Joined: 16 Aug 04
Posts: 156
Credit: 9,035,872
RAC: 2,928
Message 50724 - Posted: 3 Nov 2014, 20:29:56 UTC

Not an upload problem, but a problem downloading the new hadam3prm3pm2t_eu models.

Something is funky on the server; the BOINC client can't find them.
ID: 50724
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4533
Credit: 18,900,368
RAC: 22,763
Message 50733 - Posted: 4 Nov 2014, 6:44:27 UTC
Last modified: 4 Nov 2014, 6:52:52 UTC

"Not an upload problem but Downloading of the new hadam3prm3pm2t_eu.

Something is funky on the server, boinc client can't find them."


Same here. I set my preferences to only these work units after a suggestion elsewhere to download them for testing. As I had "download other tasks if none of the selected are available" switched on, I just filled up with short models. I will alert the admin types on the mailing list.

Edit: Les has also picked up on this.
ID: 50733
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4533
Credit: 18,900,368
RAC: 22,763
Message 50737 - Posted: 5 Nov 2014, 7:38:03 UTC

I see that these models have now all gone so whatever was stopping them from downloading is resolved.
ID: 50737
rjs5
Joined: 16 Jun 05
Posts: 16
Credit: 19,361,204
RAC: 4,749
Message 50739 - Posted: 5 Nov 2014, 15:17:32 UTC - in response to Message 50737.  

My log only goes back to 11/4, but I have some completed ANZ workloads whose uploads continue to hang. Is there something that has to be done on my end to free them?


11/5/2014 3:55:32 AM | climateprediction.net | Backing off 03:39:11 on upload of hadam3p_anz_r0ra_2012_1_008730596_0_13.zip
11/5/2014 3:55:33 AM | | Project communication failed: attempting access to reference site
11/5/2014 3:55:34 AM | | Internet access OK - project servers may be temporarily down.
11/5/2014 6:17:13 AM | climateprediction.net | Started upload of hadam3p_anz_r719_2012_1_008738731_0_13.zip
11/5/2014 6:17:13 AM | climateprediction.net | Started upload of hadam3p_anz_r0ra_2012_1_008730596_0_13.zip
11/5/2014 6:22:20 AM | climateprediction.net | Temporarily failed upload of hadam3p_anz_r719_2012_1_008738731_0_13.zip: transient HTTP error
11/5/2014 6:22:20 AM | climateprediction.net | Backing off 05:57:48 on upload of hadam3p_anz_r719_2012_1_008738731_0_13.zip
11/5/2014 6:22:20 AM | climateprediction.net | Temporarily failed upload of hadam3p_anz_r0ra_2012_1_008730596_0_13.zip: transient HTTP error
11/5/2014 6:22:20 AM | climateprediction.net | Backing off 03:42:10 on upload of hadam3p_anz_r0ra_2012_1_008730596_0_13.zip

ID: 50739
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4533
Credit: 18,900,368
RAC: 22,763
Message 50740 - Posted: 5 Nov 2014, 16:33:12 UTC

"I have some ANZ completed workloads that continue to be hung. Is there something that has to be done on my end to free them?"


Almost certainly not. The ANZ models go to a different server (not an Oxford one), and my guess is they will go when it comes back online.
ID: 50740

©2024 cpdn.org