climateprediction.net home page
ANOTHER UPLOAD PROBLEM

ANOTHER UPLOAD PROBLEM

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 33 · Next

AuthorMessage
elect

Send message
Joined: 28 Jul 11
Posts: 2
Credit: 61,196
RAC: 0
Message 50317 - Posted: 26 Sep 2014, 7:04:36 UTC - in response to Message 50307.  

rapid-watch seems to be up, but with a big backlog. Like locally, here, almost 100 63GB uploads to go.

Please don't try hitting the "retry now" -- that just makes it worse.

The files will be uploaded, the data will be saved and archived.

Patience is a virtue. As I remind myself time to time.

This time a peripheral server was down for about 5 days, and is catching up very well thank you.

Please consider the time scales of heat-death vs "models that anticipate heat-death"

Yup. Please smile. Thanks.



Yep, you were right, thanks :)
ID: 50317 · Report as offensive     Reply Quote
Profile Alan K

Send message
Joined: 22 Feb 06
Posts: 490
Credit: 30,893,022
RAC: 13,587
Message 50321 - Posted: 26 Sep 2014, 9:10:28 UTC - in response to Message 50317.  
Last modified: 26 Sep 2014, 9:10:42 UTC

Mine finally cleared betwen 7:00 and 8:30 (UK time) this morning. No new tasks though:-((
ID: 50321 · Report as offensive     Reply Quote
KWSN - Sir Frank of the Wood

Send message
Joined: 3 Nov 10
Posts: 39
Credit: 2,494,427
RAC: 0
Message 50329 - Posted: 27 Sep 2014, 4:50:22 UTC

...odd behavior of 3n work unit at 100%...elapsed clock still running, completion clock shows "---", status "Running", and no messages about upload attempt...

would this indicate waiting for server or something else ???

frank
ID: 50329 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50331 - Posted: 27 Sep 2014, 6:41:15 UTC - in response to Message 50329.  

The only 3n that you seem to have, has the Time step of 1,036,800 as the last one received.
This is the last trickle_up, so it looks like that one just doesn't know when to call it quits.

You could try Exiting from BOINC and then restarting it, to see if that gets it going and gets you the Over Success Done set of messages, but otherwise just Abort it.

ID: 50331 · Report as offensive     Reply Quote
KWSN - Sir Frank of the Wood

Send message
Joined: 3 Nov 10
Posts: 39
Credit: 2,494,427
RAC: 0
Message 50333 - Posted: 27 Sep 2014, 19:24:34 UTC

hello les

did the exit/restart of BOINC as you suggested...status changed to "Computation error" and message file said that the 4.zip file could not be found...and that was the end...at 11,508 credits out of 12,440 or so...

on the bright side, none of the wingmen on this task got past zero !!!

frank

ID: 50333 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50334 - Posted: 28 Sep 2014, 21:47:39 UTC - in response to Message 50333.  

Strange, but the only other option was to Abort.

Credits are a different matter. The scripts get run occasionally now, as per the discussion in the Credits? thread.

ID: 50334 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 50549 - Posted: 18 Oct 2014, 4:00:48 UTC

What do you know, it�s Friday night and we seem to have a upload problem. I presently have 2 zip files from 2 hadcm3s (1 from each) stuck in my transfer tab. I wonder how many I�ll have by Monday?

ID: 50549 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50550 - Posted: 18 Oct 2014, 4:38:25 UTC - in response to Message 50549.  

Yes, it seems that the server at BADC has failed.
Been reported.


ID: 50550 · Report as offensive     Reply Quote
Eirik Redd

Send message
Joined: 31 Aug 04
Posts: 391
Credit: 219,896,461
RAC: 649
Message 50553 - Posted: 18 Oct 2014, 10:42:12 UTC - in response to Message 50550.  

Friday night server failures are us. :)
Been happening for years.
Hope the denier crew don't get a conspriancy theory out of this

ID: 50553 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4533
Credit: 18,899,540
RAC: 22,776
Message 50554 - Posted: 18 Oct 2014, 12:52:54 UTC

Just had one go through with no problems. :) Someone must have come in on overtime to kick the box.
ID: 50554 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,671,788
RAC: 12,733
Message 50555 - Posted: 18 Oct 2014, 15:04:46 UTC - in response to Message 50554.  

Just had one go through with no problems. :) Someone must have come in on overtime to kick the box.

The BADC server is on an independent site - the British Atmospheric Data Centre - which I would expect would aim for 24/7 operation on normal time.

It's currently showing "The CEDA site and web services have been fullly resolved following this morning's fault interupting services." (though that message is dated two days ago)
ID: 50555 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 50556 - Posted: 18 Oct 2014, 19:35:43 UTC

It�s nice that the the CEDA says that it it fully up and running, but, I now have 4 hadcm3s zip files stuck in my transfer tab. Still getting the transient HTTP error.

Messages follow:

10/18/2014 3:20:21 PM | climateprediction.net | Started upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip
10/18/2014 3:20:21 PM | climateprediction.net | Started upload of hadcm3s_2wro_2003_2_009071879_0_1.zip
10/18/2014 3:23:23 PM | climateprediction.net | Temporarily failed upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip: transient HTTP error
10/18/2014 3:23:23 PM | climateprediction.net | Backing off 00:08:24 on upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip
10/18/2014 3:23:23 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2wro_2003_2_009071879_0_1.zip: transient HTTP error
10/18/2014 3:23:23 PM | climateprediction.net | Backing off 00:06:22 on upload of hadcm3s_2wro_2003_2_009071879_0_1.zip
10/18/2014 3:23:26 PM | | Project communication failed: attempting access to reference site
10/18/2014 3:23:28 PM | | Internet access OK - project servers may be temporarily down.
10/18/2014 3:27:35 PM | climateprediction.net | Started upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip
10/18/2014 3:27:58 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip: transient HTTP error
10/18/2014 3:27:58 PM | climateprediction.net | Backing off 03:32:46 on upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip
10/18/2014 3:28:01 PM | | Project communication failed: attempting access to reference site
10/18/2014 3:28:03 PM | | Internet access OK - project servers may be temporarily down.
10/18/2014 3:29:46 PM | climateprediction.net | Started upload of hadcm3s_2wro_2003_2_009071879_0_1.zip
10/18/2014 3:30:09 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2wro_2003_2_009071879_0_1.zip: transient HTTP error
10/18/2014 3:30:09 PM | climateprediction.net | Backing off 00:13:52 on upload of hadcm3s_2wro_2003_2_009071879_0_1.zip
10/18/2014 3:30:11 PM | | Project communication failed: attempting access to reference site
10/18/2014 3:30:13 PM | | Internet access OK - project servers may be temporarily down.

ID: 50556 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 50557 - Posted: 18 Oct 2014, 20:12:22 UTC - in response to Message 50556.  

That message just says that the data centre is back up. It says nothing about individual programs and services being run on the possibly many servers.

And, as the Badwatch server that the BADC people use to store the data sent from modellers is far away from Oxford, and nothing to do with our Oxford people, our problem will have to wait until Jonathan sends them an email and the BADC IT people do something about it.

ID: 50557 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4533
Credit: 18,899,540
RAC: 22,776
Message 50558 - Posted: 19 Oct 2014, 6:28:22 UTC

Got another coming up this morning, will see what happens. Interestingly, the last one that made it took 15 minutes which which is several times longer than normal. I wonder if this indicates that the BADC server is choked at some point? Could be anywhere from data going in from the interweb to the actual server.
ID: 50558 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4533
Credit: 18,899,540
RAC: 22,776
Message 50560 - Posted: 19 Oct 2014, 9:25:23 UTC - in response to Message 50558.  

Got another coming up this morning, will see what happens.


I am now getting the transient http error as well.
ID: 50560 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 50562 - Posted: 19 Oct 2014, 14:01:21 UTC

Looks like these people work a 5 day work week just the ones at Oxford.Still not movement on the uploads. And it seems that now Seti is down also.


ID: 50562 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4533
Credit: 18,899,540
RAC: 22,776
Message 50563 - Posted: 19 Oct 2014, 17:06:26 UTC

Two more zips gone to BADC server. Again, taking 16 minutes each about twice the time they normally do.
ID: 50563 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,671,788
RAC: 12,733
Message 50564 - Posted: 19 Oct 2014, 17:41:50 UTC - in response to Message 50562.  

Looks like these people work a 5 day work week just the ones at Oxford.Still not movement on the uploads. And it seems that now Seti is down also.

SETI is back up, and all my CPDN uploads have cleared. CPDN upload speed (when I watched the last one uploading) was just about the maximum my ADSL line can sustain, given what else was going on at the time - I can get about 1 Mbit/sec for a single upload, but only 500 Kbit/sec each if there are two uploads active at the same time).
ID: 50564 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 50565 - Posted: 19 Oct 2014, 19:16:43 UTC

All of mine have cleared also.

ID: 50565 · Report as offensive     Reply Quote
Bellator
Avatar

Send message
Joined: 31 Mar 05
Posts: 44
Credit: 234,235
RAC: 0
Message 50566 - Posted: 20 Oct 2014, 9:34:52 UTC - in response to Message 50565.  

20/10/2014 10:46:13 AM | climateprediction.net | Task hadam3p_anz_ron8_2012_1_008958519_1 exited with zero status but no 'finished' file
20/10/2014 10:46:13 AM | climateprediction.net | If this happens repeatedly you may need to reset the project.
20/10/2014 10:46:13 AM | climateprediction.net | Task hadam3p_anz_ron6_2012_1_008958517_1 exited with zero status but no 'finished' file
20/10/2014 10:46:13 AM | climateprediction.net | If this happens repeatedly you may need to reset the project.

Wasn't quite sure where to post this. Every time I restart my computer, I get the above message. I now have a total of 14 trickles, nothing gets updated and now I am beginning to wonder if I should reset as the message says?
ID: 50566 · Report as offensive     Reply Quote
Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 33 · Next

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM

©2024 cpdn.org