Message boards : Number crunching : Completed task fails to upload several times over last few days
Message board moderation
Author | Message |
---|---|
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
This tasks is not uploading (https://www.cpdn.org/workunit.php?wuid=12089722) Anyone has any idea on what is happening Sun 20 Jun 2021 01:36:57 PM WEST | climateprediction.net | Started upload of hadsm4_a0ed_201310_6_911_012089722_2_r478262704_4.zip Sun 20 Jun 2021 01:37:00 PM WEST | | Project communication failed: attempting access to reference site Sun 20 Jun 2021 01:37:00 PM WEST | climateprediction.net | Temporarily failed upload of hadsm4_a0ed_201310_6_911_012089722_2_r478262704_4.zip: transient HTTP error Sun 20 Jun 2021 01:37:00 PM WEST | climateprediction.net | Backing off 05:41:16 on upload of hadsm4_a0ed_201310_6_911_012089722_2_r478262704_4.zip Sun 20 Jun 2021 01:37:02 PM WEST | | Internet access OK - project servers may be temporarily down. Many thanks candido |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
email sent. |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
Thanks Les Bayliss |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
Andy says there are no problems showing on the server. Are you still having problems? I see all 6 tricles have uploaded which means you will get your credit. Did zips 5 and 6 upload or do they have the same problem as 4? |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
Thanks for your reply. I just found out that I had a second WU finished today that was also not uploading. I decided to suspend all tasks and restart the machine. Hopefully it wouldn't break any of the running WU. And fortunately it didn't. And it solved the both uploading problems. Thanks again Candido |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Good old Reboot - it fix's lots of things. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Good old Reboot - it fixes lots of things. _____ Yes, but these WU's hate re-boots. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
Yes, but these WU's hate re-boots. I wonder if this is still true. I just had to replace the UPS on my machine, and that required new software to interface to it. While configuring that I accidentally powered down the machine by turning off the power to it. I did not even press the stop button on the machine, much less doing the normal shutdown. And four CPDN N215 models were running. When the machine came back up, those models resumed without complaint. |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
I wonder if this is still true. I certainly don't lose as many as I used to. I sometimes get away with no failures on a reboot with 8 tasks running but it is still an issue. My anecdotal perception is that reboots involving a kernel upgrade are more likely to produce a failure but I haven't recorded this so it may not make any difference at all. I do get the odd failure on reboots with CPDN and don't remember any with other projects though as the longest tasks I run from other projects are only a couple of days, the chances of them running during a reboot are much lower when mostly my running them means no work from CPDN. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Neither do I lose as much as I used to but I still do, now and then. Current lot three. 1) Computer did an auto-re-boot after the update (Peter. I had set it to update after one month). 2) This one I lost due to power failure. 3) The WU was feeling tetchy. It is quite possible we are getting used to the shenanigans of these WU"s? Three WU's out of thirty-six is not bad. |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
I have another WU not uploading. Tried the "old reboot" fix a few times and didn't work. This is the WU: Tue 06 Jul 2021 08:58:04 PM WEST | climateprediction.net | Started upload of hadsm4_a1cu_201310_6_910_012088963_0_r2057498656_4.zip Tue 06 Jul 2021 08:58:08 PM WEST | | Project communication failed: attempting access to reference site Tue 06 Jul 2021 08:58:08 PM WEST | climateprediction.net | Temporarily failed upload of hadsm4_a1cu_201310_6_910_012088963_0_r2057498656_4.zip: transient HTTP error Tue 06 Jul 2021 08:58:08 PM WEST | climateprediction.net | Backing off 03:10:45 on upload of hadsm4_a1cu_201310_6_910_012088963_0_r2057498656_4.zip Tue 06 Jul 2021 08:58:09 PM WEST | | Internet access OK - project servers may be temporarily down. Any ideas? Thanks candido |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I'll send an email. |
Send message Joined: 26 Oct 11 Posts: 15 Credit: 3,275,889 RAC: 0 |
Hi, Indeed very odd as all of the other uploads for that WU are sitting waiting in the in_progress folder....? Can you forward that zip to me directly by email please? david.wallom at oerc.ox.ac.uk regards David |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
David, I have just sent the file by email to that address, Regards Candido |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
What should I do now. Abort the WU? IT's still trying to upload... Thanks candido |
©2024 cpdn.org