Serverproblems ?

Message boards : Number crunching : Serverproblems ?

To post messages, you must log in.

AuthorMessage
Grutte Pier [Wa Oars]~MAB The Frisian
Avatar

Send message
Joined: 6 Nov 05
Posts: 87
Credit: 497,588
RAC: 0
Message 10748 - Posted: 14 Feb 2006, 14:07:02 UTC
Last modified: 14 Feb 2006, 14:08:06 UTC

If so, it's 'getting annoying and therefor time to fix it.
During the time your computer is trying to connect, you can put on the kettle for a cup of tea/coffee, drink it, take nap and you'll still be in time.

Or are we having this problem in the Netherlands only ?

ID: 10748 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Matthew Johnson

Send message
Joined: 11 Dec 05
Posts: 11
Credit: 12,962
RAC: 0
Message 10749 - Posted: 14 Feb 2006, 14:40:23 UTC

its affecting me as well ... i have mangaed to make a cup of tea. and drink it before it even starts attempting to download a WU. its starting to get annoying now as i want to crunch away and cant :(
ID: 10749 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
nairb

Send message
Joined: 8 Dec 05
Posts: 17
Credit: 990,147
RAC: 0
Message 10750 - Posted: 14 Feb 2006, 14:44:12 UTC

Whole page of downloads failed. Been like this for a while.

Someting on the project needs fixing I guess.
ID: 10750 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Matthew Johnson

Send message
Joined: 11 Dec 05
Posts: 11
Credit: 12,962
RAC: 0
Message 10751 - Posted: 14 Feb 2006, 15:05:50 UTC

my 3 computers are waiting for work.. doing my head in they have nuffin to do !
ID: 10751 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 10753 - Posted: 14 Feb 2006, 15:09:37 UTC
Last modified: 14 Feb 2006, 16:12:50 UTC

This issue does not seem to be local to any group of users. It is possible the project team is not aware this is happening, so I have reported the problem to them via e-mail.

EDIT: There is discussion on another thread that is related to this topic. David Baker has responded to that thread stating that they are looking into the problem.


Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 10753 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Keith E. Laidig
Volunteer moderator
Project developer
Avatar

Send message
Joined: 1 Jul 05
Posts: 154
Credit: 117,189,961
RAC: 0
Message 10759 - Posted: 14 Feb 2006, 22:02:00 UTC
Last modified: 14 Feb 2006, 22:11:36 UTC

We have just finished modifying the webserver to address the large number of connection problems in the past week or so... Sorry for the delays. This should improve matters for all.

ID: 10759 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Snake Doctor
Avatar

Send message
Joined: 17 Sep 05
Posts: 182
Credit: 6,401,938
RAC: 0
Message 10761 - Posted: 14 Feb 2006, 23:58:15 UTC

It seems to be running much better now, than it was thins morning.

Thanks for the speedy fix.

Regards
Phil


We Must look for intelligent life on other planets as,
it is becoming increasingly apparent we will not find any on our own.
ID: 10761 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
drsweger

Send message
Joined: 14 Feb 06
Posts: 1
Credit: 16,828
RAC: 0
Message 10762 - Posted: 15 Feb 2006, 0:06:55 UTC

New User here - Got two machines running so far and the third PC connects but gets message : " No work from project."

Is this same problem others are experiencing?


ID: 10762 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Angus

Send message
Joined: 17 Sep 05
Posts: 412
Credit: 321,053
RAC: 0
Message 10765 - Posted: 15 Feb 2006, 2:41:42 UTC - in response to Message 10759.  

We have just finished modifying the webserver to address the large number of connection problems in the past week or so... Sorry for the delays. This should improve matters for all.


The forum has gone dog-slow again.

Proudly Banned from Predictator@Home and now Cosmology@home as well. Added SETI to the list today. Temporary ban only - so need to work harder :)



"You can't fix stupid" (Ron White)
ID: 10765 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Matthew Johnson

Send message
Joined: 11 Dec 05
Posts: 11
Credit: 12,962
RAC: 0
Message 10816 - Posted: 16 Feb 2006, 17:13:34 UTC - in response to Message 10762.  

New User here - Got two machines running so far and the third PC connects but gets message : " No work from project."

Is this same problem others are experiencing?



Yes, No, And Inbetween .. i think were all getting werid silly errors . :( seems a lot lot better now .. still some outages
ID: 10816 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [B^S] ThatGuy

Send message
Joined: 4 Jan 06
Posts: 3
Credit: 24,872
RAC: 0
Message 10817 - Posted: 16 Feb 2006, 19:54:08 UTC
Last modified: 16 Feb 2006, 19:57:29 UTC

I'm having problems downloading to one of the computers that I have running Rosetta, but it is only certain WUs that have a problem. I've had many newer WUs come through fine.

I did a manual "Retry Now" on each of them, and I noticed a common denominator to the files that will not transfer - The actual files are larger (most of them 10-20x) than the "expected" size. My hypothesis is that the file gets downloaded, then a check happens to make sure that the file is "good", but the size / checksum doesn't match expected values, so it fails the transfer. Not much of a reach, I know.

So... do I abort the transfers, or should I turn off the validation? Would that actually help? More importantly, how in the world did the expected sizes become different than the real sizes?

EDIT: There is nothing in the message log that really indicates what is going on - just "Temporarily failed transfer".
ID: 10817 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 10825 - Posted: 16 Feb 2006, 22:02:11 UTC - in response to Message 10817.  

I'm having problems downloading to one of the computers that I have running Rosetta, but it is only certain WUs that have a problem. I've had many newer WUs come through fine.

I did a manual "Retry Now" on each of them, and I noticed a common denominator to the files that will not transfer - The actual files are larger (most of them 10-20x) than the "expected" size. My hypothesis is that the file gets downloaded, then a check happens to make sure that the file is "good", but the size / checksum doesn't match expected values, so it fails the transfer. Not much of a reach, I know.

So... do I abort the transfers, or should I turn off the validation? Would that actually help? More importantly, how in the world did the expected sizes become different than the real sizes?

EDIT: There is nothing in the message log that really indicates what is going on - just "Temporarily failed transfer".


This happens sometimes. It is not really an error in the file size, (though you are correct that it looks like one) it is a problem on the server side, usually a dropped connection. Usually, in time, these will sort out by themselves, but you can stop and start BOINC and that will "sometimes" get them going. In any case if left alone they will eventually download by themselves.
Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 10825 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [B^S] ThatGuy

Send message
Joined: 4 Jan 06
Posts: 3
Credit: 24,872
RAC: 0
Message 10829 - Posted: 16 Feb 2006, 23:45:59 UTC - in response to Message 10825.  

Thanks for the info!

I'm having problems downloading to one of the computers that I have running Rosetta, but it is only certain WUs that have a problem. I've had many newer WUs come through fine.

I did a manual "Retry Now" on each of them, and I noticed a common denominator to the files that will not transfer - The actual files are larger (most of them 10-20x) than the "expected" size. My hypothesis is that the file gets downloaded, then a check happens to make sure that the file is "good", but the size / checksum doesn't match expected values, so it fails the transfer. Not much of a reach, I know.

So... do I abort the transfers, or should I turn off the validation? Would that actually help? More importantly, how in the world did the expected sizes become different than the real sizes?

EDIT: There is nothing in the message log that really indicates what is going on - just "Temporarily failed transfer".


This happens sometimes. It is not really an error in the file size, (though you are correct that it looks like one) it is a problem on the server side, usually a dropped connection. Usually, in time, these will sort out by themselves, but you can stop and start BOINC and that will "sometimes" get them going. In any case if left alone they will eventually download by themselves.


ID: 10829 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BrainMcFly

Send message
Joined: 19 Feb 06
Posts: 2
Credit: 297,824
RAC: 0
Message 11139 - Posted: 21 Feb 2006, 19:18:51 UTC

ive got serveral problems to connect to server, i think your server is running on his bandwith-limit ;)
is there a chance to say boinc: "download work for a week, but try every 24 hours to up- and download workunits" so that rosetta never run out of work cause of connection-problems?
ID: 11139 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 11169 - Posted: 22 Feb 2006, 0:54:19 UTC - in response to Message 11139.  

ive got serveral problems to connect to server, i think your server is running on his bandwith-limit ;)
is there a chance to say boinc: "download work for a week, but try every 24 hours to up- and download workunits" so that rosetta never run out of work cause of connection-problems?


With the new time setting you can run one single WU for 4 days, then connect and get a new one. Or you can download a few and set the time to 4 days and connect any time after one of them is done.
Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 11169 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Morphy375
Avatar

Send message
Joined: 2 Nov 05
Posts: 86
Credit: 1,629,758
RAC: 0
Message 11247 - Posted: 23 Feb 2006, 13:47:50 UTC

Only 7000 WU's left.... Server problems?
Teddies....
ID: 11247 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile eL_nino

Send message
Joined: 20 Jan 06
Posts: 10
Credit: 45,343
RAC: 0
Message 11572 - Posted: 2 Mar 2006, 22:05:00 UTC

There is no work now... :(
ID: 11572 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Serverproblems ?



©2024 University of Washington
https://www.bakerlab.org