climateprediction.net home page
Need urgent help with this one

Need urgent help with this one

Questions and Answers : Windows : Need urgent help with this one
Message board moderation

To post messages, you must log in.

AuthorMessage
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32470 - Posted: 6 Feb 2008, 14:16:57 UTC
Last modified: 6 Feb 2008, 14:21:33 UTC

I did not fin the exitcode on this one. Is it hardware failure on my PC or is it the model?
Should I restore rom backup?

06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_3.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_4.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_5.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_6.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_7.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_8.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_9.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_10.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_11.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_12.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_13.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_14.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_15.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
06.02.2008 15:08:48|climateprediction.net|Output file hadcm3istd_7sb9_1920_160_05924482_0_16.zip for task hadcm3istd_7sb9_1920_160_05924482_0 absent
ID: 32470 · Report as offensive     Reply Quote
Profile MikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 32473 - Posted: 6 Feb 2008, 18:09:20 UTC



There is nothing mentioned on that model\'s web page:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7192296

I think the server thinks the model is still running.

I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 32473 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32475 - Posted: 6 Feb 2008, 18:38:37 UTC

The model uploaded the files but what else I dont know. I restored from backup anyway to see if it can be saved. The status in boinc says \"computation error\"
I will contiue, and if it crashes again I will post here aout it.
Nice if u can tell me when you know
Thank you

Steinar
ID: 32475 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32479 - Posted: 6 Feb 2008, 20:38:49 UTC

And what does this mean?
06.02.2008 21:39:45|climateprediction.net|Generated new host CPID: a0352200cc647eaf7f5472ab845f2371
It happened right now..
ID: 32479 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 32480 - Posted: 6 Feb 2008, 20:45:12 UTC
Last modified: 6 Feb 2008, 20:51:53 UTC

A record is kept of server contacts. When a lower sequence number is encountered, a new Computer ID is generated. (You can merge the two in your account.) Not sure why a new Cross-project ID was deemed necessary.

[Edited for typo.]
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 32480 · Report as offensive     Reply Quote
Profile mo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32487 - Posted: 7 Feb 2008, 1:11:28 UTC

When you merge computer records, don\'t you always get a new computer CPID on that project?
Cpdn news
ID: 32487 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32551 - Posted: 9 Feb 2008, 23:20:51 UTC

The model that crashed has, after restore, passed the point where it crashed.
I therefore consider it a \"problem on my PC\" and keep on crunching (and take frequent backups :-)
ID: 32551 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 32554 - Posted: 10 Feb 2008, 2:14:08 UTC


Progress, good for you!

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 32554 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32849 - Posted: 4 Mar 2008, 12:46:15 UTC

One of my models finished and uploaded after 38% of it was finished. I dont find the place where I can see if it was my PC or if it was the model\'s \"foul\".

Restored from backup and started it again. Could someone see if everything is OK with the WU and that I should continue to crunch it?

-I assume the \"system\" will recognize the zip-file that uploaded yesterday, and will be uploaded a 2\'nd time..

Thank you

Steinar
ID: 32849 · Report as offensive     Reply Quote
Profile MikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 32850 - Posted: 4 Mar 2008, 13:01:14 UTC
Last modified: 4 Mar 2008, 13:03:35 UTC

Here is the model\'s result page:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7192283


I see one \'negative theta\' followed by a group of six \'ocean UV\' errors. Intermittent errors are often a sign that there was a floating point error or a memory error, but there is only one of these so I\'m not sure. It may be worth running 24 hours of Prime95 (one copy per core, use the -A flag) or similar to confirm that the PC\'s hardware is behaving OK. (Basically I can\'t tell whether it was the PCs or models fault - the negative theta is pointing towards the PC, but there is only one of them so not conclusive).

I\'d give it 50/50 of being able to resume beyond the crash, but only if you have a backup older than a couple of model years or so (it will have already retried a single model year).
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 32850 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32853 - Posted: 4 Mar 2008, 16:32:58 UTC - in response to Message 32850.  

I restored, and after a while one of the models got a blue world, restored from backup again to see what happens. Hard to give up the models... want to run them to the end but will wait and see.
Report here again if the model braks down again.

Thank you
Steinar
ID: 32853 · Report as offensive     Reply Quote
Profile mo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32855 - Posted: 4 Mar 2008, 17:30:44 UTC

I had a beta HADSM-type model where the temperature graph turned blue. I ran it again from a backup and it developed normally. So I had to assume the problem was some instability in the computer. Another member helped me find the probable cause.

But other blue-world models turn blue for every computer that tries them.
Cpdn news
ID: 32855 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32859 - Posted: 5 Mar 2008, 14:14:44 UTC

A model failed again. Is it the same model that failed? I dont know if it is possible to see but a model has failed three times in less than a week.

It was exit code 22 but prime 95 was OK for 24 hrs. is it maybe too much with 4 models at the same time? (Q6600 Asus P5K-VM)

I hope it is 1 model that causes the trouble and not all, cause then it may be the PC
ID: 32859 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 32861 - Posted: 5 Mar 2008, 15:18:26 UTC


All of the models have unique names, so it should be possible to tell if it\'s the same one each time.

And the point of this research is NOT to make a model run for the full amount of time, it\'s to see just how far it gets with it\'s starting values.
Some of these are bound to fail, and one just has to accept this.
They are the ones that say to the physicists: This set of values for this set of parameters doesn\'t replicate a viable climate model. Which is what they want to know.

ID: 32861 · Report as offensive     Reply Quote
Steinar1965

Send message
Joined: 4 Sep 06
Posts: 79
Credit: 5,583,517
RAC: 0
Message 32864 - Posted: 5 Mar 2008, 15:40:34 UTC - in response to Message 32861.  


All of the models have unique names, so it should be possible to tell if it\'s the same one each time.

And the point of this research is NOT to make a model run for the full amount of time, it\'s to see just how far it gets with it\'s starting values.
Some of these are bound to fail, and one just has to accept this.
They are the ones that say to the physicists: This set of values for this set of parameters doesn\'t replicate a viable climate model. Which is what they want to know.



Sure :-) But if it can reach the end it should..
ID: 32864 · Report as offensive     Reply Quote

Questions and Answers : Windows : Need urgent help with this one

©2024 climateprediction.net