climateprediction.net home page
No Data For Result

No Data For Result

Message boards : Number crunching : No Data For Result
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37523 - Posted: 20 Jul 2009, 17:25:10 UTC

Several of the HadAM3P tasks that I've completed recently all show at the top of the task details page, "No data for result.."

Here is one example:
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=8799790

Trickles show but there are no graphs. I'm sure my result zip files were uploaded successfully. What does this mean? Did the results get lost?
ID: 37523 · Report as offensive     Reply Quote
Profile Iain Inglis

Send message
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 37525 - Posted: 20 Jul 2009, 17:28:36 UTC - in response to Message 37523.  
Last modified: 20 Jul 2009, 17:29:39 UTC

Several of the HadAM3P tasks that I've completed recently all show at the top of the task details page, "No data for result.."

Here is one example:
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=8799790

Trickles show but there are no graphs. I'm sure my result zip files were uploaded successfully. What does this mean? Did the results get lost?

I think it's comment text in the PHP that's not tagged properly and is then promoted to the top of the visible page. It's been passed on to the project but nothing has happened thus far.

In any event it's benign.
ID: 37525 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37527 - Posted: 20 Jul 2009, 18:15:25 UTC - in response to Message 37525.  

OK, I'll ignore the error message. But the fact that there are no graphs for several results.... Is this just an artifact of the upload server difficulties, slow processing, or something else?
ID: 37527 · Report as offensive     Reply Quote
Profile Iain Inglis

Send message
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 37529 - Posted: 20 Jul 2009, 22:31:09 UTC - in response to Message 37527.  

OK, I'll ignore the error message. But the fact that there are no graphs for several results.... Is this just an artifact of the upload server difficulties, slow processing, or something else?

We've not had an explanation for that though, again, "it's been noticed": if the model completes then the graphs seem to be there (for HADSM3/MH at least), but the intermediate ones are missing.

I'll bump these two problems and see what happens: the first is easily neglected but also easily fixed, the second may be because something's been turned off until the new server is functioning properly - which I imagine is very much the priority at the moment.
ID: 37529 · Report as offensive     Reply Quote
old_user294426

Send message
Joined: 20 Feb 06
Posts: 158
Credit: 1,251,176
RAC: 0
Message 37531 - Posted: 21 Jul 2009, 9:04:53 UTC - in response to Message 37527.  
Last modified: 21 Jul 2009, 9:14:31 UTC

OK, I'll ignore the error message. But the fact that there are no graphs for several results.... Is this just an artifact of the upload server difficulties, slow processing, or something else?


I see your computer is hidden, but you are using LINUX.

If you look at the list of tasks, those for HADAM3P are shown as "success" with 72,000 time steps instead of 72,096, and with a credit granted of 1980.00 instead of 1,982.64. They have failed to do the post processing time step that follows the 72,096 time step.

This is the same problem that was happening with Mac OSX using version 6.07, which has now been upgraded to 6.08 and should now be corrected (as from yesterday).

Keith
ID: 37531 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 37532 - Posted: 21 Jul 2009, 9:36:59 UTC

The data used for the graphs is in the last trickle.

"No data for result.." is because not all of the trickles have been received.

ID: 37532 · Report as offensive     Reply Quote
old_user294426

Send message
Joined: 20 Feb 06
Posts: 158
Credit: 1,251,176
RAC: 0
Message 37533 - Posted: 21 Jul 2009, 9:49:41 UTC - in response to Message 37532.  

The data used for the graphs is in the last trickle.

"No data for result.." is because not all of the trickles have been received.



Having looked at other LINUX crunchers in the top computers, I found no others with the same problem, that all mac users were having.

I had monitored the last two "faulty" results on my Mac.
They appeared on the graphics view to complete right up to 72,096.
Then started another T/step of 96 as the post processing step, which failed to count down, and the task was marked as 100% and completed, with next task being started.
Even though the crunching appeared to go to 72,096, the list of tasks only showed 72,000.

It seems strange that this one linux consistently had the same problem as the Macs.

Keith
ID: 37533 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37534 - Posted: 21 Jul 2009, 14:05:22 UTC - in response to Message 37533.  

Based on the applications page, the Linux HadAM3P application has not be updated, but I see the MacOSX one has. Does anyone know what they changed in the new MacOS version of the application? What libraries does the post-processing use? Perhaps the application is using an older library that has not been regression tested (or missing) on the newer unix-based OS. I'm running the latest Fedora 11 linux.

Unfortunately, I don't have access to a linux box at work right now. But could someone run the linker info like this:
ldd hadam3p_se_6.06_i686-pc-linux-gnu.so

When I get home tonight (EST), I'll check to see what 32-bit libraries I have installed. Perhaps I'm missing an older 32-bit library that CPDN needs.
ID: 37534 · Report as offensive     Reply Quote
Profile Iain Inglis

Send message
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 37535 - Posted: 21 Jul 2009, 14:09:43 UTC - in response to Message 37534.  

Does anyone know what they changed in the new MacOS version of the application?
Some of the linking has been changed: it's a re-build rather than a code change.
ID: 37535 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37536 - Posted: 22 Jul 2009, 3:33:14 UTC - in response to Message 37535.  

OK, so I did some research as I said earlier. Here is the shared library readout:
$ ldd hadam3p_se_6.06_i686-pc-linux-gnu.so
linux-gate.so.1 => (0x001f0000)
libz.so.1 => /lib/libz.so.1 (0x00e23000)
libstdc++.so.6 => /usr/lib/libstdc++.so.6 (0x001f1000)
libnsl.so.1 => /lib/libnsl.so.1 (0x00dd8000)
libm.so.6 => /lib/libm.so.6 (0x00857000)
libgcc_s.so.1 => /lib/libgcc_s.so.1 (0x0063b000)
libc.so.6 => /lib/libc.so.6 (0x002de000)
/lib/ld-linux.so.2 (0x00947000)

There is also an "/lib/i686/nosegneg/libm.so.6" on my system.

The error on the tasks are all the same:
Unable to load library hadam3p_se_6.06_i686-pc-linux-gnu.so
dlopen error: 138932776

Perhaps the finish script is not returning to the main project directory and is staying in the ./slots/X directory?
ID: 37536 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37540 - Posted: 24 Jul 2009, 2:00:24 UTC - in response to Message 37533.  

Can anyone explain why HadAM3P models fail to finish on my computer?

Here is a perfect example workunit. Computer #124 (wow an original cruncher) and my computer #979394 did the same model. His finished at timestep 72,096, but mine finished at 72,000. And both the temperature and precipitation graphs are missing for mine. He's running an Intel Q9450 w/ Linux 2.6.25.20-0.4 and I'm running an Intel i7 920 w/ Linux 2.6.29.5-191. So, why do my tasks fail to finish?

Computer #859124 (on that same workunit) is a Mac, so I'm interested to see if that computer finishes the task correctly.
ID: 37540 · Report as offensive     Reply Quote
old_user294426

Send message
Joined: 20 Feb 06
Posts: 158
Credit: 1,251,176
RAC: 0
Message 37541 - Posted: 24 Jul 2009, 8:06:48 UTC - in response to Message 37540.  
Last modified: 24 Jul 2009, 8:18:47 UTC

Can anyone explain why HadAM3P models fail to finish on my computer?
.........


DJStarfox

I think you will find that that task for a Mac is running version 6.07 and will finish at 72,000.
However, I have started 2 tasks on the updated version of 6.08 and should complete at 72,096.
Watch this space on Tuesday 28th July!!! Should have finished at 72,096 by then (or the day before).
You might like to see thread "Good news for Mac users. HadAM3P Latest News???" with more information.

My Mac ran normally on version 6.06, had the 72,000 problem after the 6.07 update, and is now running 6.08 for the first time.

It does seem strange that other LINUX systems do not have a 72,000 problem while yours does.
I did pass on this detail, and it was then reported to Tolu 4 days ago.

Keith
ID: 37541 · Report as offensive     Reply Quote
metalius
Avatar

Send message
Joined: 28 Nov 06
Posts: 89
Credit: 11,414,142
RAC: 3,025
Message 37561 - Posted: 28 Jul 2009, 16:38:13 UTC

Hello!

This WU ID 6351662 has 3 finished tasks, processed on 3 different OS (Darwin, Linux and XP - my task). All 3 are marked as "Success", but all 3 have 72000 problem.

By accident I saw the end of process.
What I know exactly:
1. the last 72096 step was reached;
2. post processing has at least started;
3. 3 zip files were sent.
That is all - I decided the task is OK, and I did not fix additional details.

Now I have 3 ways:
1. just forget this task, because the situation is hopeless OR... in reality the task is finished successful, no matter, what is shown on the task page due missing last trickle;
2. restore the task from backup and try to finish it, because this task crushed, but the situation is not hopeless;
3. restore the task from backup and try to finish it, watch the end of process careful, write a short report here, what I saw, extract system messages and post them here, do something more.

Dear project team!
Which way I must go or which way is the best?
ID: 37561 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 37562 - Posted: 28 Jul 2009, 17:27:51 UTC

First off, please quote the Task Id when reporting problems, not the Work unit ID. It reduces the time needed to search through the computers to find yours.
This is the task in question.

As you say, it didn't upload the final trickle.
I don't think that there's much point in re-running from a backup, which will most likely end the say way, but please keep the backup in case we come up with something.
I'll pass this on to the project people for consideration.

In the meantime, you can go to your climateprediction.net preferences on your account page, and select a different model type to download.

And keep an eye on this thread. You can subscribe to it and get an email when a new post is made here. IF you have email notification turned on. :)


Backups: Here
ID: 37562 · Report as offensive     Reply Quote
metalius
Avatar

Send message
Joined: 28 Nov 06
Posts: 89
Credit: 11,414,142
RAC: 3,025
Message 37563 - Posted: 28 Jul 2009, 17:56:39 UTC - in response to Message 37562.  
Last modified: 28 Jul 2009, 18:16:57 UTC

First off, please quote the Task Id when reporting problems, not the Work unit ID. It reduces the time needed to search through the computers to find yours.

I understand - this time I quoted WU purposely, because my goal was to show the global problem, not my own. 3 parallel tasks on 3 different OS crushed exactly at the same point - looking from this position minimum quorum 3, required for this WU, is reached, just kidding. :-)
And, of course... Thank You very much for Your quick reply (as ever) and recommendations - CPDN forum is the best!
ID: 37563 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37564 - Posted: 28 Jul 2009, 18:41:09 UTC

I was downloading something completely unrelated the other night, when I realized my system was lacking the ncompress tool. This is creates (and expands) .Z compressed files on unix-like systems. I remember this utility being on all my other Linux systems (before this last upgrade). Could that be the problem?
ID: 37564 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37568 - Posted: 29 Jul 2009, 13:38:05 UTC

Update: From my earlier post about "ncompress" library...
Adding this package did not help. Two more HadAM3P tasks ended before the last trickle. If the re-linking of the HadAM3P binary fixed the issue with the MacOS version, could this also be done with the Linux version?

If not, then could the following questions be answered? What linker and parameters do you guys use to build the executable? What versions of shared libraries did you use to link against?
ID: 37568 · Report as offensive     Reply Quote
Profile Ananas
Volunteer moderator

Send message
Joined: 31 Oct 04
Posts: 336
Credit: 3,316,482
RAC: 0
Message 37573 - Posted: 29 Jul 2009, 15:25:54 UTC
Last modified: 29 Jul 2009, 15:47:15 UTC

If it is an dlopen() error from the program source code, would ldd even report it? I think it would not as it cannot know the first argument to the function.

Does the user who runs BOINC have read/execute on all library files that come with BOINC and CPDN?

As "only" the data trickle is missing, my guess would be some zlib thing to look for.


If this is a common problem, it would even be possible that everyone who already has crunched old models has that missing library from earlier downloads, but those who started with HadAM3p never received it.


edit : There are quite a few results with this problem (Google finds 246 entries) :

CPDN Monitor - Quit request from BOINC...
Unable to load library hadam3p_se_6.07_i686-apple-darwin.dylib
dlopen error: 3152085
17:09:08 (180): called boinc_finish

The interesting part is, that those I have checked have all been on Darwins (edit again ... now I found a few dlopen errors on Linuxes too)
ID: 37573 · Report as offensive     Reply Quote
metalius
Avatar

Send message
Joined: 28 Nov 06
Posts: 89
Credit: 11,414,142
RAC: 3,025
Message 37576 - Posted: 29 Jul 2009, 19:34:25 UTC
Last modified: 29 Jul 2009, 19:36:07 UTC

72,000 problem is present on Windows machines too. Next day, next task - 9105039 - with this problem.
I don't know, is this useful or not, I captured all messages from trickle after Timestep 69,120 to end oft task.
They are (PC time was ~UTC+02:55)...
29/07/2009 09:00:37 climateprediction.net Sending scheduler request: To send trickle-up message.
29/07/2009 09:00:37 climateprediction.net Not reporting or requesting tasks
29/07/2009 09:00:42 climateprediction.net Scheduler request completed: got 0 new tasks
29/07/2009 14:13:16 climateprediction.net Sending scheduler request: To send trickle-up message.
29/07/2009 14:13:16 climateprediction.net Not reporting or requesting tasks
29/07/2009 14:13:21 climateprediction.net Scheduler request completed: got 0 new tasks
29/07/2009 14:23:22 climateprediction.net Sending scheduler request: To send trickle-up message.
29/07/2009 14:23:22 climateprediction.net Not reporting or requesting tasks
29/07/2009 14:23:27 climateprediction.net Scheduler request completed: got 0 new tasks
29/07/2009 14:23:30 climateprediction.net Computation for task hadam3p_md2t_1964_2_006118783_3 finished
29/07/2009 14:23:32 climateprediction.net Started upload of hadam3p_md2t_1964_2_006118783_3_1.zip
29/07/2009 14:23:32 climateprediction.net Started upload of hadam3p_md2t_1964_2_006118783_3_2.zip
29/07/2009 14:26:15 climateprediction.net Finished upload of hadam3p_md2t_1964_2_006118783_3_2.zip
29/07/2009 14:26:15 climateprediction.net Started upload of hadam3p_md2t_1964_2_006118783_3_3.zip
29/07/2009 14:26:19 climateprediction.net Finished upload of hadam3p_md2t_1964_2_006118783_3_3.zip
29/07/2009 14:26:46 climateprediction.net Finished upload of hadam3p_md2t_1964_2_006118783_3_1.zip

Looking to the messages, the final trickle was sent (or an attempt to send it was done), but (possible) it was refused by server...
ID: 37576 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 37578 - Posted: 29 Jul 2009, 19:58:22 UTC - in response to Message 37573.  

It is a dlopen() error, but what file is it trying to open? That is the $1M question.

Here are the permissions of my /boinc/projects/climateprediction.net/

[size=8]total 214220
drwxrwxr-x. 5 boinc   boinc     4096 2009-07-29 15:48 .
drwxrwxr-x. 7 boinc   boinc     4096 2009-06-05 13:45 ..
-rw-rw-r--. 1 boinc   boinc  1573376 2008-08-29 07:57 globe.rgb
-rwxr-xr-x. 1 boinc   boinc  2551172 2009-06-19 13:59 hadam3_6.01_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc 32002981 2009-06-19 14:19 hadam3_data_6.01_i686-pc-linux-gnu.zip
-rwxr-xr-x. 1 boinc   boinc 16405162 2009-06-19 14:09 hadam3_graphics_6.01_i686-pc-linux-gnu
drwxrwx--x. 2 boinc   boinc     4096 2009-07-11 07:08 hadam3h_g_089se_005a_005a_2
drwxrwx--x. 2 boinc   boinc     4096 2009-06-20 10:23 hadam3h_g_90s08_005a_005a_2
-rwxr-xr-x. 1 boinc   boinc  1391705 2009-07-10 22:00 hadam3p_6.06_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc 83311850 2009-07-10 23:34 hadam3p_data_6.06_i686-pc-linux-gnu.zip
-rwxr-xr-x. 1 boinc   boinc 16399234 2009-07-10 22:09 hadam3p_graphics_6.06_i686-pc-linux-gnu
-rwxrwxr-x. 1 boinc   boinc  1950297 2009-03-06 08:34 hadam3p_se_6.06_i686-pc-linux-gnu.so
-rw-r--r--. 1 boinc   boinc  1837353 2009-07-10 22:01 hadam3p_se_6.06_i686-pc-linux-gnu.zip
-rwxrwxr-x. 1 boinc   boinc  5504520 2009-03-06 08:53 hadam3p_um_6.06_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc  2246011 2009-07-10 22:11 hadam3p_um_6.06_i686-pc-linux-gnu.zip
-rwxrwxr-x. 1 boinc   boinc  5102878 2008-08-22 06:45 hadam3_um_6.01_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc  3069209 2009-06-19 14:11 hadam3_um_6.01_i686-pc-linux-gnu.zip
-rw-r--r--. 1 boinc   boinc     3782 2009-06-13 10:36 hadcm3_40.png
-rw-r--r--. 1 boinc   boinc    13708 2009-06-13 10:36 hadcm3_banner_290.png
-rw-r--r--. 1 boinc   boinc    49890 2009-06-13 10:36 hadcm3_ss_290_1.png
-rw-r--r--. 1 boinc   boinc    48077 2009-06-13 10:36 hadcm3_ss_290_2.png
-rw-r--r--. 1 boinc   boinc    41929 2009-06-13 10:36 hadcm3_ss_290_3.png
-rwxr-xr-x. 1 boinc   boinc  3098846 2008-07-26 13:51 hadcm3trans_se_6.04_i686-pc-linux-gnu
-rwxrwxr-x. 1 boinc   boinc  5073074 2008-10-21 08:46 hadcm3trans_um_6.04_i686-pc-linux-gnu
-rwxr-xr-x. 1 boinc   boinc  2573442 2009-07-03 15:58 hadsm3mh_6.03_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc    80223 2009-07-03 16:01 hadsm3mh_data_6.03_i686-pc-linux-gnu.zip
-rwxr-xr-x. 1 boinc   boinc 16403992 2009-07-03 16:06 hadsm3mh_graphics_6.03_i686-pc-linux-gnu
-rwxrwxr-x. 1 boinc   boinc  2955099 2008-08-29 07:27 hadsm3mh_se_6.03_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc  2153701 2009-07-03 15:59 hadsm3mh_se_6.03_i686-pc-linux-gnu.zip
-rwxrwxr-x. 1 boinc   boinc 10050414 2008-08-29 07:26 hadsm3mh_um_6.03_i686-pc-linux-gnu
-rw-r--r--. 1 boinc   boinc  3361911 2009-07-03 16:01 hadsm3mh_um_6.03_i686-pc-linux-gnu.zip
-rw-r--r--. 1 boinc   boinc       76 2009-07-29 07:17 slideshow_hadcm3_00
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3_01
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3_02
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3_03
-rw-r--r--. 1 boinc   boinc       76 2009-07-29 07:17 slideshow_hadcm3i_00
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3i_01
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3i_02
-rw-r--r--. 1 boinc   boinc       74 2009-07-29 07:17 slideshow_hadcm3i_03
-rw-r--r--. 1 boinc   boinc       68 2009-07-29 07:17 stat_icon
drwxrwxr-x. 2 boinc   boinc     4096 2009-07-03 16:14 txf
[/size]

ID: 37578 · Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : No Data For Result

©2024 climateprediction.net