climateprediction.net (CPDN) home page
Thread 'Download stalled WAH2'

Message boards : Number crunching : Download stalled WAH2

bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 54598 - Posted: 1 Aug 2016, 19:44:20 UTC
Last modified: 1 Aug 2016, 19:44:39 UTC

Hi folks,
I receive a "Transient HTTP error" when trying to download the wah2_pnw25_zhlk_200312_24_406_010600701 model and its parts. BOINC reports "Internet access OK - project servers may be temporarily down", but the server status page looks OK. Is anyone else having download problems?

Cheers
ID: 54598
bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 54599 - Posted: 1 Aug 2016, 22:28:49 UTC

A second WU is failing to download on another machine, so I have suspended those tasks and set BOINC to "No new tasks" until it is resolved. The second machine is a Windows one, so the BOINC log is more informative: Failed to connect on port 80 of download.cpdn.org.
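
For anyone who wants to check the "Failed to connect on port 80" symptom outside BOINC, here is a minimal sketch of a plain TCP connection test. The function name can_connect and the 5-second timeout are illustrative choices, not anything BOINC itself uses, and a successful connection only shows that the port is open, not that the download daemons behind it are healthy.

```python
import socket


def can_connect(host: str, port: int = 80, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; it does not speak HTTP, it only
    checks whether the port accepts a connection.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS lookup failures.
        return False


# Example call (needs network access):
#   can_connect("download.cpdn.org", 80)
```

If this returns False while other sites are reachable, that is consistent with the download server, or the route to it, being the problem rather than the local machine.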
ID: 54599
Andrew Sanchez
Joined: 28 May 14
Posts: 34
Credit: 705,936
RAC: 0
Message 54601 - Posted: 2 Aug 2016, 3:45:48 UTC
Last modified: 2 Aug 2016, 3:47:44 UTC

Yeah, I'm having the same issue. I got 2 WUs that won't download; they just keep backing off.
It seems that we can't connect to the download server, but Server Status says it's 'running'.
ID: 54601
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 54602 - Posted: 2 Aug 2016, 3:58:15 UTC

Server Status says that the computer is running, not that all of the daemons are.

One or more of these must have failed.

Email sent.

ID: 54602
bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 54603 - Posted: 2 Aug 2016, 10:23:42 UTC - in response to Message 54602.  

Seems fixed. Units downloaded and now crunching. Thanks Les
ID: 54603
Andrew Sanchez
Joined: 28 May 14
Posts: 34
Credit: 705,936
RAC: 0
Message 54607 - Posted: 3 Aug 2016, 2:47:52 UTC - in response to Message 54602.  

Yep, seems back on track. Thanks, Les.
ID: 54607
Desti
Joined: 6 Aug 04
Posts: 124
Credit: 9,195,838
RAC: 0
Message 54612 - Posted: 4 Aug 2016, 15:52:06 UTC

Why do I get no WU?

04-Aug-2016 13:28:44 [climateprediction.net] No tasks are available for Weather At Home 2 (wah2)

But server status shows 12700 available.
Linux Users Everywhere @ BOINC
ID: 54612
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 54615 - Posted: 7 Aug 2016, 19:56:28 UTC - in response to Message 54612.  

You may have to look at other message lines to work it out.

However, one of your computers is crashing everything, and the other has crashed several.

These 2 links may give you some ideas about your error messages:

"finish file present too long" error

finish file present too long

ID: 54615
Desti
Joined: 6 Aug 04
Posts: 124
Credit: 9,195,838
RAC: 0
Message 54755 - Posted: 6 Sep 2016, 14:15:22 UTC - in response to Message 54615.  

There are no other messages.


06-Sep-2016 04:38:12 [climateprediction.net] Sending scheduler request: To fetch work.
06-Sep-2016 04:38:12 [climateprediction.net] Requesting new tasks for CPU
06-Sep-2016 04:38:14 [Einstein@Home] Started upload of LATeah0003L_656.0_0_0.0_7535850_1_0
06-Sep-2016 04:38:14 [Einstein@Home] Started upload of LATeah0003L_656.0_0_0.0_7535850_1_1
06-Sep-2016 04:38:15 [Einstein@Home] Finished upload of LATeah0003L_656.0_0_0.0_7535850_1_0
06-Sep-2016 04:38:15 [Einstein@Home] Finished upload of LATeah0003L_656.0_0_0.0_7535850_1_1
06-Sep-2016 04:38:15 [climateprediction.net] Scheduler request completed: got 0 new tasks
06-Sep-2016 04:38:15 [climateprediction.net] Project has no tasks available

Linux Users Everywhere @ BOINC
ID: 54755
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4542
Credit: 19,039,635
RAC: 18,944
Message 54756 - Posted: 6 Sep 2016, 15:43:44 UTC

[climateprediction.net] Project has no tasks available


And, consistent with this, the server status page is showing no work at present.
ID: 54756
John Eric Hopkinson
Joined: 27 Jan 05
Posts: 74
Credit: 1,047,809
RAC: 0
Message 54757 - Posted: 6 Sep 2016, 17:16:51 UTC - in response to Message 54756.  

I do not recall having seen the bin empty before.
Dave, yours is the only comment, and one would expect much more.
I presume that the proper procedure now would be "No New Tasks", to allow orderly recovery, if one is planned.
?????
ID: 54757
John Eric Hopkinson
Joined: 27 Jan 05
Posts: 74
Credit: 1,047,809
RAC: 0
Message 54758 - Posted: 6 Sep 2016, 17:21:11 UTC - in response to Message 54757.  

Wrong!
Desti's reply to a previous message solves the problem.

Daemons.
ID: 54758
astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 55160 - Posted: 17 Nov 2016, 23:29:32 UTC - in response to Message 54598.  

Hi folks,
I receive Transient HTTP error when trying to download ...
BOINC suggests that Internet access OK - project servers may be temporarily down, but server status page looks OK. Anyone else having download problems?

Cheers


Reverting to Bernard's original problem: Two of my machines each have a wah2_cafr25 ... batch 468 task with five small download files hung -- swinging in the breeze for more than a day -- analogous to the situation Bernard describes.

Anyone else have the problem? With batch 468 or any other tasks/batches?

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 55160
bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 55216 - Posted: 27 Nov 2016, 8:05:29 UTC - in response to Message 55160.  

I now have wah2_sas50_cqx2_209112_13_432 hanging in download status. The project has backed off for several hours.

Sun 27 Nov 2016 10:02:55 EET | climateprediction.net | Started download of wah2_sas50_cqx2_209112_13_432_010667068.zip
Sun 27 Nov 2016 10:02:55 EET | climateprediction.net | Started download of restart_atmos_s005_1986-1201_rd0001.gz
Sun 27 Nov 2016 10:02:57 EET | | Project communication failed: attempting access to reference site
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Temporarily failed download of wah2_sas50_cqx2_209112_13_432_010667068.zip: connect() failed
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:30:31 on download of wah2_sas50_cqx2_209112_13_432_010667068.zip
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Temporarily failed download of restart_atmos_s005_1986-1201_rd0001.gz: connect() failed
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:15:05 on download of restart_atmos_s005_1986-1201_rd0001.gz
Sun 27 Nov 2016 10:02:59 EET | | Internet access OK - project servers may be temporarily down.
...and it is the weekend.
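
For anyone wanting to see at a glance which files are stuck and for how long, here is a rough sketch that pulls the back-off intervals out of event log text. It assumes the "Backing off HH:MM:SS on download of <file>" wording seen in the lines above; the names BACKOFF_RE and parse_backoffs are made up for this example and are not part of BOINC.

```python
import re
from datetime import timedelta
from typing import Dict

# Matches the back-off wording as it appears in the log excerpt above.
BACKOFF_RE = re.compile(r"Backing off (\d+):(\d{2}):(\d{2}) on download of (\S+)")


def parse_backoffs(log_text: str) -> Dict[str, timedelta]:
    """Map each file name to its most recently logged back-off interval."""
    backoffs: Dict[str, timedelta] = {}
    for line in log_text.splitlines():
        match = BACKOFF_RE.search(line)
        if match:
            hours, minutes, seconds, name = match.groups()
            backoffs[name] = timedelta(
                hours=int(hours), minutes=int(minutes), seconds=int(seconds)
            )
    return backoffs


# Two lines copied from the log excerpt above.
SAMPLE = """\
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:30:31 on download of wah2_sas50_cqx2_209112_13_432_010667068.zip
Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:15:05 on download of restart_atmos_s005_1986-1201_rd0001.gz
"""

for name, delay in parse_backoffs(SAMPLE).items():
    print(name, "retries in", delay)
```

Later lines for the same file overwrite earlier ones, so the result reflects the latest back-off logged for each download.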
ID: 55216
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 55219 - Posted: 28 Nov 2016, 7:35:30 UTC

Perhaps it's to do with some parts being on the new servers, but links are pointing to the old servers.
ID: 55219
Alan K
Joined: 22 Feb 06
Posts: 492
Credit: 31,516,485
RAC: 14,727
Message 55220 - Posted: 28 Nov 2016, 11:19:42 UTC - in response to Message 55219.  

Looks like it. Can't ping the IP address from the log (126.67.195.140) and traceroute only gets as far as the Oxford ja.net address. Does this help?
ID: 55220
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 55232 - Posted: 28 Nov 2016, 20:57:01 UTC

Traceroute and similar programs are blocked at the entry point to the JA network.

My guess is that everyone will have to wait until such time as sufficient parts of our system have been migrated to the new servers, etc.

Which is why my computers are not only set for NNW, they're turned off.
ID: 55232
bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 55236 - Posted: 29 Nov 2016, 9:47:31 UTC - in response to Message 55232.  

I'll wait, hoping BOINC (or some other power) won't finally dismiss the WU that can't be downloaded while waiting for Oxford to get the new servers on. I do not want to abort, as ultimately the WU may be lost due to the error limit.
ID: 55236
MartinNZ
Joined: 22 Mar 06
Posts: 144
Credit: 24,695,428
RAC: 0
Message 55246 - Posted: 1 Dec 2016, 4:11:33 UTC - in response to Message 55236.  

Downloads of 9 tasks have stalled for 4 hours. I'll keep watch, but can't see any server issues.
e.g.
1/12/2016 1:33:12 PM | climateprediction.net | Temporarily failed download of wah2_pnw25_a64n_20399_16_478_010792165.zip: connect() failed

1/12/2016 1:33:13 PM | | Project communication failed: attempting access to reference site
1/12/2016 1:33:15 PM | | Internet access OK - project servers may be temporarily down.
ID: 55246
bernard_ivo
Joined: 18 Jul 13
Posts: 438
Credit: 25,829,527
RAC: 9,480
Message 55247 - Posted: 1 Dec 2016, 8:00:54 UTC - in response to Message 55246.  

Mine is old and has been stuck for 5 days already. Yours seem to be brand new WUs, and if they are stalled then I hope fixing this will become a priority.
ID: 55247