climateprediction.net (CPDN) home page
Thread 'New here, not receiving work'

Thread 'New here, not receiving work'

Questions and Answers : Getting started : New here, not receiving work
Message board moderation

To post messages, you must log in.

AuthorMessage
David Cueto

Send message
Joined: 7 Jun 21
Posts: 2
Credit: 3,268
RAC: 0
Message 64030 - Posted: 7 Jun 2021, 5:28:40 UTC

Hello,

I'm a BOINC user of some time, having contributed to World Community Grid and CERN a fair bit. I've been eyeing the climate emergency with increasing alarm for years now, and wanted to help out wherever I can. To that end, I've added this project to my Debian box today (June, 7, 2021), but I have yet to receive any work on my machine. I've installed the 32-bit libraries as mentioned in this sticky thread: https://www.cpdn.org/forum_thread.php?id=8916

I'm running Debian Testing on a machine that has a Ryzen 7 3700x and 32GB of RAM.

Any help would be appreciated.
ID: 64030 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64031 - Posted: 7 Jun 2021, 6:42:12 UTC - in response to Message 64030.  
Last modified: 7 Jun 2021, 6:44:18 UTC

What does it say in the event log after you request work?
If you have requested work once, don't hit update till an hour has past as there is an automatic time backoff on the project.

Also once you start getting work, almost certainly going above five or six cores will lead to tasks slowing down quite a bit. Certainly not worth using more than the eight real cores as opposed to vitual ones. I have the same cpu and amount of RAM
ID: 64031 · Report as offensive     Reply Quote
David Cueto

Send message
Joined: 7 Jun 21
Posts: 2
Credit: 3,268
RAC: 0
Message 64032 - Posted: 7 Jun 2021, 9:41:19 UTC - in response to Message 64031.  

Ah, waiting for the hour to pass actually worked in this instance. Shouldn't have been so impatient, lol.

Thanks.
ID: 64032 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1120
Credit: 17,202,915
RAC: 2,154
Message 64033 - Posted: 7 Jun 2021, 16:14:37 UTC - in response to Message 64030.  

I'm a BOINC user of some time, having contributed to World Community Grid and CERN a fair bit. I've been eyeing the climate emergency with increasing alarm for years now, and wanted to help out wherever I can.


Congratulations: my sentiments exactly.

I'm running Debian Testing on a machine that has a Ryzen 7 3700x and 32GB of RAM.


In anticipation of ever-larger ClimatePrediction models in the future, I doubled my RAM from 32 GBytes to 64 GBytes. RAM these days is so cheap I could not resist. My motherboard will take, IIRC, 256 GBytes of RAM, but it is not that cheap.

My processor claims 16 cores but 8 are real and 8 are hyperthreaded, so I told the BoincClient to use a maximum of 8 cores. I also told it to run only 4 ClimatePrediction models at a time. Processor cache is 16896 KB The rest are mainly WCG and Rosetta. My processor is 1511241. Running like this, the N216 models take about 8 days each.

Initially, I tried running all 16 cores, but to do this, I had to run the fans so fast I cold not stand the noise.

$ sensors
coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +75.0°C  (high = +88.0°C, crit = +98.0°C)
Core 1:        +68.0°C  (high = +88.0°C, crit = +98.0°C)
Core 2:        +68.0°C  (high = +88.0°C, crit = +98.0°C)
Core 3:        +65.0°C  (high = +88.0°C, crit = +98.0°C)
Core 5:        +63.0°C  (high = +88.0°C, crit = +98.0°C)
Core 8:        +74.0°C  (high = +88.0°C, crit = +98.0°C)
Core 9:        +75.0°C  (high = +88.0°C, crit = +98.0°C)
Core 11:       +72.0°C  (high = +88.0°C, crit = +98.0°C)
Core 12:       +66.0°C  (high = +88.0°C, crit = +98.0°C)

amdgpu-pci-6500
Adapter: PCI adapter
vddgfx:       +0.69 V  
fan1:        2068 RPM  (min = 1800 RPM, max = 6000 RPM)
edge:         +38.0°C  (crit = +97.0°C, hyst = -273.1°C)
power1:        3.21 W  (cap =  25.00 W)

dell_smm-virtual-0
Adapter: Virtual device
fan1:        4084 RPM
fan2:         898 RPM
fan3:        2759 RPM

ID: 64033 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64034 - Posted: 7 Jun 2021, 18:47:56 UTC

Initially, I tried running all 16 cores, but to do this, I had to run the fans so fast I cold not stand the noise.


I went for liquid cooling on mine :)
ID: 64034 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1120
Credit: 17,202,915
RAC: 2,154
Message 64035 - Posted: 7 Jun 2021, 20:51:59 UTC - in response to Message 64034.  

Initially, I tried running all 16 cores, but to do this, I had to run the fans so fast I cold not stand the noise.



I went for liquid cooling on mine :)


I do not have enough L3 cache to justify running that many Boinc processes, especially the N216 models.
ID: 64035 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64037 - Posted: 8 Jun 2021, 9:09:06 UTC

I do not have enough L3 cache to justify running that many Boinc processes, especially the N216 models.


With the same CPU neither do I but thought it a useful bit of future proofing as well as keeping the noise down when i occasionally render video.
ID: 64037 · Report as offensive     Reply Quote
llskintech

Send message
Joined: 17 Jun 21
Posts: 1
Credit: 0
RAC: 0
Message 64066 - Posted: 19 Jun 2021, 8:46:23 UTC - in response to Message 64030.  

waiting for my task too, there is no work for 3 days, but seems cos I run several others on the same computer
ID: 64066 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 64067 - Posted: 19 Jun 2021, 10:23:18 UTC - in response to Message 64066.  

The work here is mostly for Linux.
The recent batch of Windows work may be all there is for months.
ID: 64067 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64068 - Posted: 19 Jun 2021, 10:54:50 UTC - in response to Message 64067.  

There are posts by people who are using WSL to run the Linux version of BOINC with success. Another option is virtual box. (I use it the other way round with Linux as the host to enable Linux and Windows clients to run at the same time.
ID: 64068 · Report as offensive     Reply Quote
JWW

Send message
Joined: 18 Jul 06
Posts: 4
Credit: 5,584,738
RAC: 0
Message 64203 - Posted: 23 Jul 2021, 3:19:11 UTC - in response to Message 64067.  

Hi Les,

I have been running BOINC+Climateprediction.net years ago and but kind of forgotten about it, after using mainly laptops and re-installing Windows and so on. But today I installed it on a desktop PC again running Windows 10, and it seems it is not fetching any tasks. Maybe there are no unsent Windows tasks? The problem is that on the server status website, it is not obvious at all which of the unsent work is for Windows/Mac/Linux. Instead of volunteers spending time trying to figure why no work is fetched, it would be much better if the "Tasks by Application" table would have another column with W/L/M to indicate which tasks are available for which OS.
If there is no Windows work, I might decide to run another BOINC project, even though I am mainly interested in Climateprediction.net.

Could you suggest that website improvement to the Admins? That would be great.
ID: 64203 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 64204 - Posted: 23 Jul 2021, 7:31:49 UTC - in response to Message 64203.  

It's just a few clicks away on the Project status page:
Go to the line of options at the top of the page, click on Computing, then click on Applications in the drop down menu.
ID: 64204 · Report as offensive     Reply Quote
JWW

Send message
Joined: 18 Jul 06
Posts: 4
Credit: 5,584,738
RAC: 0
Message 64209 - Posted: 25 Jul 2021, 4:03:45 UTC - in response to Message 64204.  

Great, thank you!
ID: 64209 · Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64254 - Posted: 2 Aug 2021, 16:41:12 UTC - in response to Message 64035.  

I do not have enough L3 cache to justify running that many Boinc processes, especially the N216 models.
How much L3 cache is required for each kind of project WU?
ID: 64254 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64257 - Posted: 2 Aug 2021, 18:44:46 UTC - in response to Message 64254.  

Just been looking and failing to find the stuff about how much level3 cache the N216 tasks take up. It is only those that really hammer it. The others, I can run certainly 8 tasks on my Ryzen7 without any slowdown. Going above 5 N216's things start to slow down but I can run a mix of tasks using all 8 real cores and 8 virtual ones without getting errors. It would take some weeks to work out where the sweet spot is in terms of maximum throughput.
ID: 64257 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1120
Credit: 17,202,915
RAC: 2,154
Message 64258 - Posted: 2 Aug 2021, 21:24:09 UTC - in response to Message 64257.  

Going above 5 N216's things start to slow down but I can run a mix of tasks using all 8 real cores and 8 virtual ones without getting errors.


Let me up mine to five for a while. But errors? From what? Slowdown understandable, but errors???
ID: 64258 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 64259 - Posted: 2 Aug 2021, 23:12:01 UTC - in response to Message 64254.  

I do not have enough L3 cache to justify running that many Boinc processes, especially the N216 models.
How much L3 cache is required for each kind of project WU?


I started a thread somewhere, soon after the n216 models first appeared, near the start of last year.
One of the researchers had just posted on our private site, about how the models used a lot of L3 cache, so I passed that on.
Over the course of a dozen or so posts, others wrote about how they were limiting the number of models they ran, and about how something similar was happening on some other project.

The consensus was that 4 megs of L3 cache was good, so people started checking their processor specs to find the amount of cache each had, and limiting the number of models so as not to exceed that.
ID: 64259 · Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64421 - Posted: 5 Sep 2021, 14:23:14 UTC - in response to Message 64254.  

I do not have enough L3 cache to justify running that many Boinc processes, especially the N216 models.
How much L3 cache is required for each kind of project WU?
The project code was not written well and is incompatible with hyperthreading. I disabled HTing in my BIOS for any computer crunching CP. Since I've never found a single clue that anyone responsible for writing the project code or running the servers participate in this forum I doubt they'll ever do anything about it.
ID: 64421 · Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64422 - Posted: 5 Sep 2021, 14:25:07 UTC - in response to Message 64204.  

It's just a few clicks away on the Project status page:
Go to the line of options at the top of the page, click on Computing, then click on Applications in the drop down menu.
The https://www.cpdn.org/apps.php page does not provide the information they asked for. All it has is some historical averages of an undisclosed duration.
ID: 64422 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4541
Credit: 19,039,635
RAC: 18,944
Message 64425 - Posted: 5 Sep 2021, 16:50:28 UTC

The project code was not written well and is incompatible with hyperthreading. I disabled HTing in my BIOS for any computer crunching CP. Since I've never found a single clue that anyone responsible for writing the project code or running the servers participate in this forum I doubt they'll ever do anything about it.


The project code is around one million lines of Fortran written by/for the Met office many years ago to run on their supercomputers. The CPDN programs work by sending various parameters to that compiled program. At some point there will be OpenIFS but don't hold your breath on that one. The met office code is proprietary and not open source and the CPDN licence does not allow them to modify it.

If CPDN is ever to have its own custom built code, it will require a lot more people to hit the donate button on the home page.
ID: 64425 · Report as offensive     Reply Quote

Questions and Answers : Getting started : New here, not receiving work

©2024 cpdn.org