Message boards :
climateprediction.net Science :
Misconfiguration e-mail
Message board moderation
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 25 · Next
Author | Message |
---|---|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
For starters, this is your list of models. As you can see, everything is failing. The more recent have been for REPLANCA errors, which affected everyone at the time. As for the other failures, I'm not sure. They seem to be a long way into the model, and the usual cause may not apply. But this thread is about "the usual suspect". And, as Apple keep tightening the security measures with each OS upgrade, it has to be re-applied each time. I believe that a fix may be in the testing stage for our models. Apparently the very latest version of BOINC for Macs has a cure for some problems. I'm not sure what they're up to, as the BOINC site is down for building maintenance, but I think it's 7.0.31 or 32. Updating to this may help. Backups: Here |
Send message Joined: 1 Oct 11 Posts: 4 Credit: 888,758 RAC: 0 |
I'm now on BOINCManager 7.0.31, detached this project and re-attached it. All current jobs were downloaded again and started freshly. At around 12 % three of the models crashed again. So it looks like there is another problem than just the "standard" one. |
Send message Joined: 4 Dec 08 Posts: 27 Credit: 651,211 RAC: 0 |
Ok I got the libraries installed per the sticky and have run all updates to make sure there wasn't anything that might have been missed. Sorry for the multiple posts had a noob moment :/ Hope this got everything back up and running Thanks |
Send message Joined: 9 Oct 10 Posts: 1 Credit: 446,045 RAC: 0 |
Ok, I'm thinking that this issue may be due to missing 32bits libraries. Can you confirm ? |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Ok, I'm thinking that this issue may be due to missing 32bits libraries. Can you confirm ? The error message in stderr for your tasks is: error while loading shared libraries: libstdc++.so.6: cannot open shared object file: No such file or directory This is happening on 6 of your computers (1202949, 1202950, 1202951, 1202953, 1202957 and 1202959) and the solution is linked from this sticky. It's likely that error is being generated before the applications hit the point where they access the 32-bit libraries (links to that solution in this sticky). Three of your computers (1202952, 1202955 and 1231875) seem to be running tasks successfully but are failing to run the post-processing phase because libz.so.1 is missing: Unable to load library hadam3p_eu_se_6.09_i686-pc-linux-gnu.so "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Windows/AMD laptop: 1177286. |
Send message Joined: 1 Oct 11 Posts: 4 Credit: 888,758 RAC: 0 |
Meanwhile I tested running 6 models without shutdown of my computer. That did work without an error. After that I returned to shutting down the computer late in the evening and the "193" error happen again crashing the models. Is there any advice you can give me to overcome that problem? I do not want to leave my computer on all the time. |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Peter, Do you first suspend CPDN tasks? Many files are open and simply cutting power typically doesn't allow time (sufficient residual power) to close all files; that results in a crash on restart. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 25 Feb 05 Posts: 4 Credit: 13,605,157 RAC: 0 |
Dear Adi Your computer (host # 419870) described below appears to have a misconfigured BOINC installation and is crashing models. Would you please have a look at it? If you need assistance, please post in this thread on our BOINC forums and we will suggest a way to fix the problem. You may post in any language: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=6880 Please include this link so that we may more easily find your computer: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=419870 When you have applied the fix please post to say so. Until the problem is fixed no more work will be sent to your computer. Aaaa... Help? :) I see that every task this computer received since 23 Feb 2009 (this is the oldest one I can see on your site) have some errors. Maybe it wasn't OK from the beginning? A reset, detach, reattach might help? A review of my computers shows that many/all of them have errors for some types of CP applications, for example: UK Met Office HADAM3P European Region v6.09 UK Met Office HADAM3P Southern Africa v6.09 UK Met Office HADAM3P ... The newest computers have 0 credit for CP, but have credits for other BOINC projects, without errors. All this new ones have Centos 6 x86_64, and enough CPU power, RAM and HDD space. So please review ALL my computers, or at least the ones active in last 30 days, and give me some advice. Thank you |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Adi, All of your tasks are failing with errors like the following (visible by clicking on the '+' on the stderr line of this page): hadam3p_saf_6.09_i686-pc-linux-gnu: /usr/lib/libstdc++.so.6: version `GLIBCXX_3.4.9' not found (required by hadam3p_saf_6.09_i686-pc-linux-gnu) That's indicating the model requires a more recent version of that library than you have on your system. You might find this post helpful. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 25 Feb 05 Posts: 4 Credit: 13,605,157 RAC: 0 |
Thank you for your quick reply. I'll try to use the solution posted in the forum you pointed at. I'll post the results. |
Send message Joined: 12 Dec 07 Posts: 1 Credit: 1,363,669 RAC: 0 |
http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=944884 |
Send message Joined: 7 Aug 04 Posts: 2184 Credit: 64,822,615 RAC: 5,275 |
@Jay Levenson It looks like a problem with Mac hosts when upgrading boinc versions. See this sticky in the Mac forum of this site for a solution (detach the host, then reattach to cpdn). |
Send message Joined: 19 May 06 Posts: 1 Credit: 2,222,678 RAC: 485 |
Received the note below... interesting since work is running on my computer. Hmmm! Anyway, am posting as per instructions, because I'd like to help in anyway I can. <MV> --- Dear HoopRat Your computer (host # 1214304) described below appears to have a misconfigured BOINC installation and is crashing models. Would you please have a look at it? If you need assistance, please post in this thread on our BOINC forums and we will suggest a way to fix the problem. You may post in any language: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=6880 Please include this link so that we may more easily find your computer: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1214304 When you have applied the fix please post to say so. Until the problem is fixed no more work will be sent to your computer. Sincerely, The climateprediction.net team |
Send message Joined: 7 Aug 04 Posts: 2184 Credit: 64,822,615 RAC: 5,275 |
Hooprat, That Linux PC is crashing many models. The stderr messages on the crashed models has a line about "execv". This is a symptom of a PC with a 64 bit distribution of Linux not having 32 bit compatibility libraries installed. See this sticky in the Linux forum for links on how to install the compatibility libraries. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
HoopRat, The other error reported on your tasks (e.g. this one, errors visible by clicking on the '+' on the stderr line) is: sched_setscheduler: Operation not permitted That's indicating that BOINC can't set the project application's scheduling priority to batch (idle) which is very strange as all users should be able to lower the priority. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 10 May 07 Posts: 2 Credit: 1,586,935 RAC: 0 |
Dear Mike Your computer (host # 1230696) described below appears to have a misconfigured BOINC installation and is crashing models. Would you please have a look at it? If you need assistance, please post in this thread on our BOINC forums and we will suggest a way to fix the problem. You may post in any language: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=6880 Please include this link so that we may more easily find your computer: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1230696 When you have applied the fix please post to say so. Until the problem is fixed no more work will be sent to your computer. I've tried install 32bit libraries, was that the issue? |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
I've tried install 32bit libraries, was that the issue? 32 bit libraries might be a problem Mike, but the stderr messages for your failed tasks show that the project applications are failing to find the libstdc++.so.6 library first. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 10 May 07 Posts: 2 Credit: 1,586,935 RAC: 0 |
Ok, I've installed the libstdc++.so.6.0.13 library, I hope it'll work. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Passed up to the project team for re-enabling of work fetch Mike. Edit: Andy has now done that. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
©2024 cpdn.org