hadam3p_anz - latest models - strange times reported

Dave Roberts

Joined: 15 Jan 11
Posts: 175
Credit: 6,242,691
RAC: 699
Message 53278 - Posted: 17 Jan 2016, 12:05:33 UTC
Last modified: 17 Jan 2016, 12:06:04 UTC

I downloaded a couple of the latest hadam3p_anz models on the 14th and, on checking just now, found the following:

1. Elapsed time is increasing at the rate of 2 hrs for every 12 hrs of real time
2. Time To Completion is increasing
3. No trickles sent

It looks as though I shall have to abort them. Anyone else found the same problem?
ID: 53278
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53279 - Posted: 17 Jan 2016, 12:17:49 UTC - in response to Message 53278.  

It's probably because you had a different type of model running before, which set the Task duration correction factor to a certain value, and BOINC is now totally lost as to what that number should be, so it keeps guessing.

(Yeah, yeah, I'll be finished in a moment.
I need another 5 minutes.
Be with you shortly, just hang on.)

**********

Or it could be something else. But aborting everything that doesn't seem right is not the way to let BOINC learn.
And there's no more work, and may not be for "quite a while".

ID: 53279
Dave Roberts

Joined: 15 Jan 11
Posts: 175
Credit: 6,242,691
RAC: 699
Message 53282 - Posted: 17 Jan 2016, 13:04:44 UTC - in response to Message 53279.  

Thanks Les. I'll just let it run.
ID: 53282
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53283 - Posted: 17 Jan 2016, 22:09:46 UTC

I just had a look at your 2 ANZ, and they took close to 3 days to get to a zip.

On my 3.5 Gig Haswell, running Windows under Wine on Linux, they took 6 hours to get to the first zip.
Just had another look, and the next lot have been sitting there for half an hour, so they're now on their way.
Zips are about 14 Megs.


ID: 53283
Dave Jackson
Volunteer moderator

Joined: 15 May 09
Posts: 4345
Credit: 16,522,041
RAC: 5,856
Message 53287 - Posted: 18 Jan 2016, 15:31:00 UTC

Les Bayliss wrote:
"And there's no more work, and may not be for 'quite a while'."

If only more work appeared every time you posted about not knowing when it would!
ID: 53287
Dave Roberts

Joined: 15 Jan 11
Posts: 175
Credit: 6,242,691
RAC: 699
Message 53295 - Posted: 21 Jan 2016, 21:36:52 UTC

It's all very odd. Each task does a bit of work, stops, "To Completion" increases a bit, the task resumes, "To Completion" drops by more than the increase, the task does a bit more work, stops, and so on.

I may have too little memory since upgrading to 'El Capitan'.

Anyway, when these tasks finish I'll be putting in more memory, so will see the result on the next tasks.
ID: 53295
WB8ILI

Joined: 1 Sep 04
Posts: 161
Credit: 81,421,805
RAC: 1,225
Message 53296 - Posted: 22 Jan 2016, 0:21:28 UTC - in response to Message 53295.  

Dave Roberts -

Just curious. On the BOINC main screen, if you go to Options -> Computing Preferences -

Do you have -

100% of CPUs checked?

100% of CPU Time checked?

Under "When to Suspend", everything NOT checked?



ID: 53296
Don Nicholson

Joined: 31 Aug 04
Posts: 18
Credit: 13,882,347
RAC: 0
Message 53309 - Posted: 24 Jan 2016, 21:51:45 UTC
Last modified: 24 Jan 2016, 22:22:42 UTC

I am running a Haswell 5820K in Windows 10.

I have six hadam3p_anz tasks running. Five are numbered in the hadam3p_anz_xxx_199012_12_287 series.

They have been running for over seven days and are showing two plus days to go.

NONE OF THEM HAVE RECORDED A TRICKLE, and the time to go drops at about a third of the rate at which computer time passes.

The sixth is a 290 series and is behaving normally and will complete in under three days, as did another with a 289 number.

Worth persevering with?
ID: 53309
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53310 - Posted: 24 Jan 2016, 23:22:42 UTC - in response to Message 53309.  

Don

The ANZ models should have graphics, which, while not as good as "in the good old days", should show what is happening to the "model time".
You'll have to watch for some time, but you should be able to tell if the hours and days are constantly increasing, or if they're in a loop.
Perhaps write down the info at intervals.

ID: 53310
Don Nicholson

Joined: 31 Aug 04
Posts: 18
Credit: 13,882,347
RAC: 0
Message 53311 - Posted: 25 Jan 2016, 0:26:25 UTC

Thanks Les

All five -287- series state "No model is currently running".

The remaining time on each has dropped about an hour over the last three hours.

It would appear that, while they have about two and a half days to go, it could take another week or so at that rate, and even then would you get a result?
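
The projection in this post is simple rate arithmetic: if the reported time remaining falls by about one hour for every three hours of real time, then two and a half reported days stretch to roughly seven and a half real days. A minimal sketch in Python, assuming the drop rate stays constant; the function name and its parameters are made up for illustration, and the figures are the ones quoted above:

```python
def real_days_to_finish(reported_days_left, drop_hours, observed_hours):
    """Project real (wall-clock) days to completion, assuming the reported
    'remaining' time keeps falling at the rate seen so far."""
    if drop_hours <= 0:
        # Remaining time is flat or rising, so no finish can be projected.
        raise ValueError("remaining time is not decreasing")
    # Each reported hour costs (observed_hours / drop_hours) real hours.
    return reported_days_left * observed_hours / drop_hours


# About 2.5 days reported; the estimate dropped 1 hour over 3 real hours.
print(real_days_to_finish(2.5, 1.0, 3.0))  # 7.5
```

This says nothing about whether the task will produce a valid result at the end; it only restates the "another week or so" estimate.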


The -289- series example is running a model, sending trickles, and is absolutely normal.
ID: 53311
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53312 - Posted: 25 Jan 2016, 3:14:22 UTC

Hello Don

On the front page, right hand side is this: Script testing.
Perhaps you have one of these, in which case the advice is what we used to say 10 years ago: "Keep running them. Only the researchers know if they're good or bad."

Hopefully you'll be able to do this.

More, regular info would be nice. Then I could point the project people to this thread.

ID: 53312
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53314 - Posted: 25 Jan 2016, 4:16:12 UTC

I've just emailed the project anyway.

ID: 53314
Don Nicholson

Joined: 31 Aug 04
Posts: 18
Credit: 13,882,347
RAC: 0
Message 53315 - Posted: 25 Jan 2016, 5:46:21 UTC

I will keep running them to the end unless I am advised to the contrary.

Please advise what info you would like and I will do my best to oblige.
ID: 53315
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53320 - Posted: 25 Jan 2016, 9:29:36 UTC - in response to Message 53315.  

Ahh. Good question.
I usually play it by ear, and report things that are not going the way that the majority do.

Perhaps report what the graphics figures show re: progress. (Forward or looping.)

I also run with the 'net connection off, so that trickles and zips accumulate, so that I can see what's what. (When they get created, how big, etc.)
One thing that would show (although it will also be in the Event Log/stdoutdae.txt) is whether zips are being created and uploaded.

ID: 53320
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53321 - Posted: 25 Jan 2016, 9:33:20 UTC

I should have checked first.
I've had a reply to my email:

Hi Les,

Thank you for this. Batch 287 is the ANZ batch that was sent out previously and had a number of invalid combinations. My guess is that these runs relate to some of those setups. The invalid combinations have been corrected and resent in a new batch (batch 289).

Best wishes,
Sarah



So, dump batch 287.

ID: 53321
Dave Roberts

Joined: 15 Jan 11
Posts: 175
Credit: 6,242,691
RAC: 699
Message 53331 - Posted: 26 Jan 2016, 8:35:26 UTC - in response to Message 53321.  

Where's the info regarding the batch no.? I've looked at task and work details but can't see it.

Thanks in advance.


ID: 53331
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53332 - Posted: 26 Jan 2016, 8:57:46 UTC - in response to Message 53331.  

This is one of yours:

hadam3p_anz_q2g3_201212_12_290_010259670_2
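
In that name the batch appears to be the third field from the end (290 here), which matches the 287, 289 and 290 numbers mentioned earlier in the thread. A minimal sketch in Python for pulling it out of a task name; the field layout and the group names are assumptions inferred only from this one example, not a documented naming scheme:

```python
import re

# Assumed layout, inferred from the single task name quoted in this thread:
# <app>_<region>_<run id>_<start yyyymm>_<months>_<batch>_<workunit>_<attempt>
# The group names are hypothetical labels, not official field names.
TASK_NAME = re.compile(
    r"^(?P<app>[a-z0-9]+)_(?P<region>[a-z0-9]+)_(?P<run_id>[a-z0-9]+)_"
    r"(?P<start>\d{6})_(?P<months>\d+)_(?P<batch>\d+)_"
    r"(?P<workunit>\d+)_(?P<attempt>\d+)$"
)


def batch_number(task_name):
    """Return the batch number as an int, or None if the name does not fit
    the assumed layout."""
    match = TASK_NAME.match(task_name)
    return int(match.group("batch")) if match else None


print(batch_number("hadam3p_anz_q2g3_201212_12_290_010259670_2"))  # 290
```

Names that do not follow this pattern return None rather than a guess.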


ID: 53332
Alan K

Joined: 22 Feb 06
Posts: 485
Credit: 29,621,544
RAC: 3,291
Message 53333 - Posted: 26 Jan 2016, 10:18:39 UTC - in response to Message 53331.  

Right hand column on the tasks tab in BOINC manager.
ID: 53333


©2024 climateprediction.net