Message boards : Number crunching : Welcome back/checking if everything is working?
Message board moderation
Previous · 1 · 2 · 3
Author | Message |
---|---|
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
How projects play together is something I know little about because I only run CPDN tasks except when none are available so my knowledge of it is nearly all from reading posts here and on the BOINC fora. Currently I am going to Africa Rainfall Project with World Community Grid during breaks in work. I also occasionally get testing work when no main site work is available but it is not unusual to go for a week or even a month or more without work. I can also run Windows tasks under WINE if only windows work is available. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
How projects play together is something I know little about because I only run CPDN tasks except when none are available so my knowledge of it is nearly all from reading posts here and on the BOINC fora. ---------------------------------------------- I have twelve cores and they are dedicated to CPDN only(CPU). I do not do any other CPU project. I also live in an area of the world where the ambient temperatures are high, so I cannot afford to run all cores. Having more cores at least for me is no fun if I end up burning equipment. Having said that, even though dedicated to CPDN I only do one task at a time. The upside to it is faster run times, error-free results, less power cost plus less cooling cost. I have also observed over time if I run all my cores, errors start to creep in. Plus I have to feel that there is also an element of interference across cores especially if other projects are also being run in tandem. Maybe it might be due to the GPU but no, if I run Rosetta then why does the error rate increase? For me at least, one task at a time is the way forward. ------------------------------------ As to the Server State Page and the quantity of work being shown, please someone take a broom to it. Two burnt-up laptops which might have been recycled years ago, their WU"s are still being shown as active. Don't ask me how. |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
I have twelve cores and they are dedicated to CPDN only(CPU). I do not do any other CPU project. I also live in an area of the world where the ambient temperatures are high, so I cannot afford to run all cores.Is it not possible to fit a larger cooling fan to the CPU, or even a watercooler? Running only one core is not doing much processing. Having more cores at least for me is no fun if I end up burning equipment. Having said that, even though dedicated to CPDN I only do one task at a time. The upside to it is faster run times, error-free results, less power cost plus less cooling cost.I think there may be a problem if CPDN tasks are suspended and resumed (due to rebooting, exclusive application like a game, or Boinc switching to Rosetta. I'm noticing the CPDN tasks I'm running have stopped giving me credit, I got credit right near the start, now nothing, maybe they are all damaged? I know three of them said computation error after all I did was restart a machine. Can anyone check please? As to the Server State Page and the quantity of work being shown, please someone take a broom to it. Two burnt-up laptops which might have been recycled years ago, their WU"s are still being shown as active. Don't ask me how.Agreed - that page serves no purpose whatsoever, it's just showing meaningless numbers and helps nobody. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Up until now, credits have been based on the receipt of trickle_up files. No trickles, no credits. A new system is on it's way, but it will be a while yet. |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
I'm noticing the CPDN tasks I'm running have stopped giving me credit, I got credit right near the start, now nothing, maybe they are all damaged? I know three of them said computation error after all I did was restart a machine. Can anyone check please? Currently the credit script only runs on Thursdays so you should get a weekly update? |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
I have twelve cores and they are dedicated to CPDN only(CPU). I do not do any other CPU project. I also live in an area of the world where the ambient temperatures are high, so I cannot afford to run all cores. Having more cores at least for me is no fun if I end up burning equipment. Having said that, even though dedicated to CPDN I only do one task at a time. The upside to it is faster run times, error-free results, less power cost plus less cooling cost. I haven't seen problems with concurrent projects however, running all 8 cores (16 threads) on my Ryzen results in a lower throughput than if I only run 8 tasks at a time with the N216 tasks. This is because they use up a lot of the level3 cache, about 3MB/task or a little more. I don't know if running another project that has heavy use of level 3 cache is the problem? |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
Thanks, that explains it. I think I got my credits this Thursday and assumed I was going to get some for every 10% done or something. I'll work on the assumption that the ones which didn't say "computation error" are doing something useful. I got 36 tasks, and 3 caused errors. I'm putting those down to problems I had with a new GPU that was crashing the OS. It seems CPDN tasks can't cope with that - maybe it's deliberate to take the task away from an unstable machine - fair enough. They're managing ok if they're cleanly paused (eg Boinc swapping projects or a game being played and pausing them with exclusive applications).I'm noticing the CPDN tasks I'm running have stopped giving me credit, I got credit right near the start, now nothing, maybe they are all damaged? I know three of them said computation error after all I did was restart a machine. Can anyone check please?Currently the credit script only runs on Thursdays so you should get a weekly update? |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
Thanks, that explains it. I think I got my credits this Thursday and assumed I was going to get some for every 10% done or something. I'll work on the assumption that the ones which didn't say "computation error" are doing something useful. I got 36 tasks, and 3 caused errors. I'm putting those down to problems I had with a new GPU that was crashing the OS. It seems CPDN tasks can't cope with that - maybe it's deliberate to take the task away from an unstable machine - fair enough. They're managing ok if they're cleanly paused (eg Boinc swapping projects or a game being played and pausing them with exclusive applications).I'm noticing the CPDN tasks I'm running have stopped giving me credit, I got credit right near the start, now nothing, maybe they are all damaged? I know three of them said computation error after all I did was restart a machine. Can anyone check please?Currently the credit script only runs on Thursdays so you should get a weekly update? The trickle ups for which credit is give match the zips files uploaded at the end of each model month. So on this task of yours, wah2_sam50_a09k_201312_25_885_012039607_0 it is a 25 month task so every 4%. The other parts of the task number of interest are sam50 which tells you it is for the South America region and a resolution of 50Km squares, 201312 gives the year and start month of the task and 885 is the batch number. |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
The trickle ups for which credit is give match the zips files uploaded at the end of each model month. So on this task of yours,Thanks, now I can tell exactly what it's working on. That task is running on one of my slower (per core) machines and has only just reached 8%. The faster ones are now at up to 44%, and are also 25 month tasks, so should have done 10 trickles by now, so I assume I have to wait till Thursday to see credits. I'm not one of those credit addicts, I just like to see them to know it's working properly! A complete loss of new credits on one machine with LHC made me look for a problem with VirtualBox. Primegrid (a newly added project for me) gave me way too many tasks when I attached, and got in the way, I've kicked Boinc and made it do the CPDN. Just how soon should I be aiming to get them in? |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
I think uploads from batch 860 go to a server in Tasmania. I have let Andy know. Is this the same problem? I started getting it yesterday and it is all I get from ClimatePrediction since then. The web site works OK. Mon 30 Nov 2020 12:07:29 AM EST | climateprediction.net | Not requesting tasks: don't need (not highest priority project) Mon 30 Nov 2020 12:07:31 AM EST | climateprediction.net | Scheduler request completed Mon 30 Nov 2020 12:07:31 AM EST | climateprediction.net | Project requested delay of 3636 seconds Mon 30 Nov 2020 12:08:39 AM EST | | Project communication failed: attempting access to reference site Mon 30 Nov 2020 12:08:41 AM EST | | Internet access OK - project servers may be temporarily down. |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
I think uploads from batch 860 go to a server in Tasmania. I have let Andy know. Might have been a temporary glitch as I just got scheduler request completed and the delay message without the Project communication failed one but it is a different issue. The servers around the world to send the zips to can go down without anything being wrong at Oxford. I will check again in a few hours time as I have a couple of tasks that should finish then. Edit: Two tasks finished and have uploaded with no problems so whatever it was seems to have cleared. Please do not private message myself or other moderators for help. This limits the number of people who are able to help and deprives others who may benefit from the answer. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Jean That's 2 different things - the ANZ uploads are "uploads", and what your messages are saying, is that your computer is asking for work, which is "downloads". And it's also saying that BOINC doesn't want more work, because it has enough already. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
Might have been a temporary glitch as I just got scheduler request completed and the delay message without the Project communication failed one but it is a different issue. The servers around the world to send the zips to can go down without anything being wrong at Oxford. I will check again in a few hours time as I have a couple of tasks that should finish then. Edit: Two tasks finished and have uploaded with no problems so whatever it was seems to have cleared. I guess it was a long temporary glitch: Lasted all day yesterday, but it is working OK starting, perhaps, this afternoon. |
Send message Joined: 12 May 05 Posts: 34 Credit: 1,436,930 RAC: 2,182 |
Dedicating this Dell Latitude laptop to CPDN WU's. Installed Linux Mint 20 and have run into an issue; climateprediction.net is unaccessable so continuously get 'project communication failed' (although I was able to connect to my account using cpdn.org link). It's not just my issue: https://isdown.me/www.climateprediction.net reports the website is down. This is the most recent thread that a search showed a user mentioning this issue so I posted here instead of a new post. |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
Dedicating this Dell Latitude laptop to CPDN WU's. I e-mailed the cpdn computer people about the problem. Not sure when it will be fixed as it is the weekend already in Oxford. |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
Dedicating this Dell Latitude laptop to CPDN WU's. https://www.climateprediction.net should be back up and will hopefully resolve any problems connecting a computer to it. |
Send message Joined: 12 May 05 Posts: 34 Credit: 1,436,930 RAC: 2,182 |
That laptop is communicating with that domain now. Thanks. I could have edited the project .xml and changed it to cpdn.net to get an immediate fix but I did a normal install (instead of dropping all of BOINC in the user HOME directory) and so the files with the domain were locked. The root account could have taken ownership temporarily ... but I got lazy. Gonna wait for a few days as the new Mint 19.3 install dropped into fallback mode running gaia@home. Maybe one of the widgets for CPU, temps or process on the task bar crashed the OS (didn't seem like overheating issue at 65C). These CPDN WU's demand days of stability from what I've read. Mint 20 is out (20.1 in beta) but for some reason Wine absolutely failed to function. |
©2024 cpdn.org