Master file fetch failed

Message boards : Number crunching : Master file fetch failed

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2246 - Posted: 4 Nov 2005, 13:21:15 UTC

I'm trying to bring up a new machine and I am getting:

2005-11-04 06:59:45 [rosetta@home] Computer ID: not assigned yet; location: ; project prefs: default
...
2005-11-04 07:00:51 [rosetta@home] Fetching master file
2005-11-04 07:00:56 [rosetta@home] Master file fetch failed
2005-11-04 07:00:56 [rosetta@home] Too many backoffs - fetching master file
2005-11-04 07:01:01 [rosetta@home] Deferring communication with project for 2 minutes and 37 seconds
2005-11-04 07:03:41 [rosetta@home] Fetching master file
2005-11-04 07:03:46 [rosetta@home] Master file fetch failed
2005-11-04 07:03:46 [rosetta@home] Too many backoffs - fetching master file
2005-11-04 07:03:51 [rosetta@home] Deferring communication with project for 5 minutes and 19 seconds
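
To see how quickly the retry interval grows, the "Deferring communication" lines can be converted to seconds. This is only a minimal sketch: `deferral_seconds` is a hypothetical helper, not part of BOINC, and it assumes the interval is always written with the words hours, minutes, and seconds as in the log lines quoted in this thread.

```python
import re

# Seconds per unit, keyed by the singular unit name used in the log text.
_UNIT_SECONDS = {"hour": 3600, "minute": 60, "second": 1}
# Matches pieces such as "1 hours", "23 minutes", "10 seconds".
_PART = re.compile(r"(\d+)\s+(hour|minute|second)s?")
_MARKER = "Deferring communication with project for "


def deferral_seconds(log_line):
    """Return the deferral length in seconds, or None for any other log line."""
    _, found, interval = log_line.partition(_MARKER)
    if not found:
        return None
    return sum(int(n) * _UNIT_SECONDS[unit] for n, unit in _PART.findall(interval))


if __name__ == "__main__":
    sample = [
        "2005-11-04 07:01:01 [rosetta@home] Deferring communication with project for 2 minutes and 37 seconds",
        "2005-11-04 07:03:51 [rosetta@home] Deferring communication with project for 5 minutes and 19 seconds",
    ]
    for line in sample:
        print(deferral_seconds(line))
```

Only the text after the marker is parsed, so the digits in the timestamp are never counted.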

Andrew
Joined: 19 Sep 05
Posts: 162
Credit: 105,512
RAC: 0
Message 2250 - Posted: 4 Nov 2005, 13:47:39 UTC

There was some maintenance work being done on the project this morning.

Are you still having that problem?

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2251 - Posted: 4 Nov 2005, 14:06:28 UTC

Yes, still having the problem.

2005-11-04 07:56:41 [rosetta@home] Too many backoffs - fetching master file
2005-11-04 07:56:46 [rosetta@home] Deferring communication with project for 1 hours, 23 minutes, and 10 seconds

Also tried various versions of the client (4.x, 5.x).
Currently using:
BOINC client version 5.2.6 for i686-pc-linux-gnu

Andrew
Joined: 19 Sep 05
Posts: 162
Credit: 105,512
RAC: 0
Message 2253 - Posted: 4 Nov 2005, 14:16:07 UTC

Check out this wiki page to see if it helps.

What is the URL you're trying to connect to?

Are you running a firewall?

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2256 - Posted: 4 Nov 2005, 14:22:50 UTC

I am connecting to: https://boinc.bakerlab.org/rosetta as in the email from
registration. Just like all the others.

I know connectivity is good. I have other machines on this net working.

I can fetch from bakerlab.
wget https://boinc.bakerlab.org/rosetta
--08:19:35-- https://boinc.bakerlab.org/rosetta
=> `rosetta'
Resolving boinc.bakerlab.org... 140.142.20.103
Connecting to boinc.bakerlab.org|140.142.20.103|:80... connected.
HTTP request sent, awaiting response... 301 Moved Permanently
Location: https://boinc.bakerlab.org/rosetta/ [following]
--08:19:35-- https://boinc.bakerlab.org/rosetta/
=> `index.html'
Connecting to boinc.bakerlab.org|140.142.20.103|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
08:19:36 (124.58 KB/s) - `index.html' saved [11376]

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2257 - Posted: 4 Nov 2005, 14:40:10 UTC

Definitely making it to bakerlab. It just isn't returning anything.

Internet Protocol, Src: 10.134.1.100 (10.134.1.100), Dst: 140.142.20.103 (140.142.20.103)
Transmission Control Protocol, Src Port: 60774 (60774), Dst Port: http (80), Seq: 1, Ack: 1, Len: 138
Hypertext Transfer Protocol
GET /rosetta/ HTTP/1.0\r\n
User-Agent: BOINC client (i686-pc-linux-gnu 4.72)\r\n
Host: boinc.bakerlab.org:80\r\n
Connection: close\r\n
Accept: */*\r\n
\r\n

No. Time Source Destination Protocol Info
47 5.348989 140.142.20.103 10.134.1.100 TCP http > 60774 [FIN, ACK] Seq=1 Ack=139 Win=16384 Len=0 TSV=13843585 TSER=174941719
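
For anyone who wants to replay the request from the capture outside the BOINC client, here is a rough sketch. The function names `build_master_request` and `replay` are made up for illustration, and it assumes each header line ends in CRLF. That assumption fits the capture: the rebuilt request comes out at 138 bytes, the same as the Len: 138 shown above.

```python
import socket

HOST = "boinc.bakerlab.org"  # host taken from the capture
PORT = 80                    # plain HTTP, as in the capture


def build_master_request(client_version="4.72", platform="i686-pc-linux-gnu"):
    """Rebuild the raw HTTP/1.0 request bytes shown in the capture."""
    lines = [
        "GET /rosetta/ HTTP/1.0",
        f"User-Agent: BOINC client ({platform} {client_version})",
        f"Host: {HOST}:{PORT}",
        "Connection: close",
        "Accept: */*",
        "",  # blank line that ends the header block
        "",
    ]
    return "\r\n".join(lines).encode("ascii")


def replay(timeout=10.0):
    """Send the request and return everything the server sends before closing."""
    with socket.create_connection((HOST, PORT), timeout=timeout) as sock:
        sock.sendall(build_master_request())
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # server closed the connection
                break
            chunks.append(data)
    return b"".join(chunks)


if __name__ == "__main__":
    print(len(build_master_request()))  # size of the request in bytes
```

Calling `replay()` needs network access. An empty return value would correspond to what the capture shows: the server answering with FIN, ACK and Len=0 instead of a page.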

Bok
Joined: 17 Sep 05
Posts: 54
Credit: 3,514,973
RAC: 0
Message 2258 - Posted: 4 Nov 2005, 14:41:22 UTC

Do you have other Linux boxen on the network that work OK?

Which distro are you using in this case?

I've had issues similar to this with some, mostly due, I think, to libcurl incompatibilities. Usually a different client version will work, or compiling your own.

Bok


devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2259 - Posted: 4 Nov 2005, 14:48:42 UTC - in response to Message 2258.  

It's running the newest Mandriva, 2006.
Got others running this. Tried 4-5 different clients including one from another working box.

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2260 - Posted: 4 Nov 2005, 14:50:31 UTC

Well OK it just started working. Another one of life's mysteries.
Thanks for your interest and time!

Andrew
Joined: 19 Sep 05
Posts: 162
Credit: 105,512
RAC: 0
Message 2261 - Posted: 4 Nov 2005, 14:52:11 UTC
Last modified: 4 Nov 2005, 14:53:12 UTC

I'm glad it's working :)

(damn gremlins)

devzero
Joined: 8 Oct 05
Posts: 7
Credit: 35,876,636
RAC: 0
Message 2262 - Posted: 4 Nov 2005, 14:54:50 UTC

Maybe I spoke too soon. Now I get work, but I've already had several of these:

2005-11-04 08:51:41 [rosetta@home] Unrecoverable error for result 1hz7A_abrelaxmode_random_only_length10_jitter02_35489_0 (process exited with code 1 (0x1))


2005-11-04 08:52:25 [rosetta@home] Result 1hz7A_abrelaxmode_test_35444_0 exited with zero status but no 'finished' file

Bok
Joined: 17 Sep 05
Posts: 54
Credit: 3,514,973
RAC: 0
Message 2263 - Posted: 4 Nov 2005, 15:19:59 UTC

Reset the project.

I occasionally get that when it's been having difficulty downloading all the files. A reset fixes it.

Bok

Andrew
Joined: 19 Sep 05
Posts: 162
Credit: 105,512
RAC: 0
Message 2264 - Posted: 4 Nov 2005, 15:22:01 UTC

There's also another thread about errors on linux...

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=201

It might be worth a read, but I don't know if it's applicable to you.

:)

Paul D. Buck
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 2455 - Posted: 6 Nov 2005, 15:53:25 UTC

The "no finished file" problem is benign. Some older versions were prone to it if the system clock was changed: the Science Application and the BOINC Daemon would end up on different timelines and lose their lock on each other.

This was changed and seems to be less of a problem (I have not seen it at all in 4.72 ... knock wood), but there is no need to reset the project.

©2024 University of Washington
https://www.bakerlab.org