Report Problems with Rosetta Version 5.22

Message boards : Number crunching : Report Problems with Rosetta Version 5.22

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Ian

Send message
Joined: 14 Apr 06
Posts: 29
Credit: 361,378
RAC: 763
Message 18833 - Posted: 17 Jun 2006, 0:16:35 UTC

Blimey. Whole flurry of errors. All today (well, yesterday - 16 June). Had nothing like this for weeks.

https://boinc.bakerlab.org/rosetta/result.php?resultid=24427279

https://boinc.bakerlab.org/rosetta/result.php?resultid=24460877

https://boinc.bakerlab.org/rosetta/result.php?resultid=24463664

https://boinc.bakerlab.org/rosetta/result.php?resultid=24495408

https://boinc.bakerlab.org/rosetta/result.php?resultid=24513042
Ian Cundell, St Albans, UK
ID: 18833 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lee Carre

Send message
Joined: 6 Oct 05
Posts: 96
Credit: 79,331
RAC: 0
Message 18835 - Posted: 17 Jun 2006, 2:40:05 UTC
Last modified: 17 Jun 2006, 2:40:41 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=24571715

i was viewing the graphics window at the time it failed incase that makes a difference
Want to search the BOINC Wiki, BOINCstats, or various BOINC forums from within firefox? Try the BOINC related Firefox Search Plugins
ID: 18835 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tigher

Send message
Joined: 16 Jun 06
Posts: 5
Credit: 5,814
RAC: 0
Message 18842 - Posted: 17 Jun 2006, 8:49:03 UTC
Last modified: 17 Jun 2006, 8:54:05 UTC

I have just joined the project. On one PC of the 9 WUs it has been sent it has successfully processed 5 but errored out on 4.

from my log:

17/06/2006 04:25:04 Unrecoverable error for result t299__CASP7_JUMPRELAX_SAVE_ALL_OUT_BARCODE_cterm2_nohelix3_hom001__681_83011_0 ( - exit code -1073741819 (0xc0000005))

Clues or advice?


A different unit to that above but some debug info to help devs.
https://boinc.bakerlab.org/rosetta/result.php?resultid=24466158


ID: 18842 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jimi@0wned.org.uk

Send message
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 18844 - Posted: 17 Jun 2006, 11:27:51 UTC

First error ever on this machine (31,000 credit):

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=19785224

stderr out

<core_client_version>5.5.0</core_client_version>
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 3706611
# cpu_run_time_pref: 14400
# cpu_run_time_pref: 14400
ERROR:: Exit at: .dock_structure.cc line:401

</stderr_txt>

btw [BOINCUK]Tigher, (0xc0000005) is usually a memory error, in my experience.

ID: 18844 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tigher

Send message
Joined: 16 Jun 06
Posts: 5
Credit: 5,814
RAC: 0
Message 18855 - Posted: 17 Jun 2006, 15:22:10 UTC - in response to Message 18844.  



btw [BOINCUK]Tigher, (0xc0000005) is usually a memory error, in my experience.



Gulp! Hmmm thanks.

ID: 18855 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Bandit

Send message
Joined: 21 May 06
Posts: 12
Credit: 197,197
RAC: 0
Message 18857 - Posted: 17 Jun 2006, 16:14:38 UTC - in response to Message 18855.  
Last modified: 17 Jun 2006, 16:14:54 UTC

Another problem - looks the same from this end as the other ones I had.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=20802010

When I leave from working on the computer, I'll exit IE to see if that helps.

Bandit's Mom
ID: 18857 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Leonard Kevin Mcguire Jr.

Send message
Joined: 13 Jun 06
Posts: 29
Credit: 14,903
RAC: 0
Message 18861 - Posted: 17 Jun 2006, 18:47:32 UTC
Last modified: 17 Jun 2006, 18:48:14 UTC

https://boinc.bakerlab.org/rosetta/hosts_user.php?userid=94664

I have been accumulating computation errors lately.
ID: 18861 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tralala

Send message
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 18881 - Posted: 18 Jun 2006, 9:01:30 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=24638007

This WU created about three good models with energy minima between -200 and -300. then it failed to do more good models which each succeeding model completing within minutes and always the same energy minimum of about -30. Watching on the graphics showed a stretched protein where no folding was achieved. I "aborted" the model the soft way with 6 restarts of BOINC (to prevent sending out the same WU).

I watched such WU in the past. Perhaps there is a pattern.
ID: 18881 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 18892 - Posted: 18 Jun 2006, 16:44:27 UTC - in response to Message 18881.  

This WU created about three good models with energy minima between -200 and -300. then it failed to do more good models which each succeeding model completing within minutes and always the same energy minimum of about -30.

I for one have been HOPING to see WUs that would act like that. If you knew that a -300 was possible, and you are sitting at a -30, there are cases where it might be SMART to bail on this one and invest the time in pursuing something with more potential.

I don't know that this is what happened in your case, I'll leave that for the project team to assess. I just wanted to point out that it is the TYPE of thing that I think we'll see more of as the algorythm gets smarter.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 18892 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tralala

Send message
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 18903 - Posted: 18 Jun 2006, 20:31:41 UTC - in response to Message 18892.  

This WU created about three good models with energy minima between -200 and -300. then it failed to do more good models which each succeeding model completing within minutes and always the same energy minimum of about -30.

I for one have been HOPING to see WUs that would act like that. If you knew that a -300 was possible, and you are sitting at a -30, there are cases where it might be SMART to bail on this one and invest the time in pursuing something with more potential.

I don't know that this is what happened in your case, I'll leave that for the project team to assess. I just wanted to point out that it is the TYPE of thing that I think we'll see more of as the algorythm gets smarter.


I agree! Using previous result for "pruning" decision is an idea that for a long time crossed my mind. I'm a bit in chess engine programming and in these engines a lot of "pruning" is done in positions where one side is just too worse to have any chance of reaching the current score with any move. However in the case reported it was most certainly something different, since the models finished successively in a few minutes without really folding the protein (it was stretched in the graphics) and with always the same score. In the end I had over 150 models of which only three had not been "aborted".

ID: 18903 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Carlos_Pfitzner
Avatar

Send message
Joined: 22 Dec 05
Posts: 71
Credit: 138,867
RAC: 0
Message 18913 - Posted: 19 Jun 2006, 3:40:47 UTC

stuck at 74.101% Rosetta 5.22 Windows 0.0000% of CPU usage
Thus, aborted by hand after 3 hours of IDLE time!
https://boinc.bakerlab.org/rosetta/result.php?resultid=24659040

Thanks

Click signature for global team stats
ID: 18913 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rriggs

Send message
Joined: 5 Jun 06
Posts: 5
Credit: 48,672
RAC: 0
Message 18931 - Posted: 19 Jun 2006, 14:09:07 UTC

For the past week or so I've been getting 2-3 crashes per day. The failed work units show up as "Compute Error" with no credit. Do I need to report this? Or will the appropriate party see these errors and be able to deal with them on their own?

ID: 18931 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 18934 - Posted: 19 Jun 2006, 15:22:36 UTC - in response to Message 18931.  

Do I need to report this? Or will the appropriate party see these errors and be able to deal with them on their own?


It is "HELPFUL" if you report them. It gives the opportunity to ask you questions about your computing environment so they might learn more about the system that's seeing the failure. It is not "required".

Credit for failed WUs is issued once the daily credit run is made. You will see this when you display the WU details... not on the WU listing. Like this one for example.

It looks like most of them were ended by the "watchdog". One was a -107 error (which is something that's been under review for a while already).

The watchdog is trying to assure your computer doesn't get stuck in an unexpected loop on a work unit. If it notices no progress on a work unit in 5 restarts, then it ends it. Do you restart this computer frequently? Or have a number of other projects running in BOINC?

If you would, go to your General Preferences, and let us know what you have set for "Switch between applications every...minutes", and for "Leave applications in memory while preempted?". And is Rosetta your only BOINC project?
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 18934 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 18939 - Posted: 19 Jun 2006, 15:58:22 UTC

A crash.

https://boinc.bakerlab.org/rosetta/result.php?resultid=24876847

It happend when i was shutting down grafics window.

Anders n
ID: 18939 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 18940 - Posted: 19 Jun 2006, 17:26:32 UTC - in response to Message 18934.  

It looks like most of them were ended by the "watchdog". One was a -107 error (which is something that's been under review for a while already).


Correction, I misread that "watchdog is shutting down" message (again!). I keep thinking this message indicates that the watchdog is shutting down the WU, not just ending itself as a normal end of processing a WU.

Most of their errors were -107s.

Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 18940 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rriggs

Send message
Joined: 5 Jun 06
Posts: 5
Credit: 48,672
RAC: 0
Message 18978 - Posted: 20 Jun 2006, 15:14:25 UTC - in response to Message 18934.  


The watchdog is trying to assure your computer doesn't get stuck in an unexpected loop on a work unit. If it notices no progress on a work unit in 5 restarts, then it ends it. Do you restart this computer frequently? Or have a number of other projects running in BOINC?

If you would, go to your General Preferences, and let us know what you have set for "Switch between applications every...minutes", and for "Leave applications in memory while preempted?". And is Rosetta your only BOINC project?


I'll try to answer your questions here:

Machine is rarely restarted, once every 2-3 days.

This is the only project I have under BOINC. No other background/SETI type applications are installed.

I'm not sure where this "General Preferences" dialog is you're referring to. I don't see anything like this in BOINC.

I am an accomplished C++/Java/.NET developer w/ Visual Studio installed on this box if you need me to grab a stack trace, I'd be happy to next time!
ID: 18978 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 18983 - Posted: 20 Jun 2006, 15:50:45 UTC - in response to Message 18978.  

I'm not sure where this "General Preferences" dialog is you're referring to. I don't see anything like this in BOINC.


Now that you are viewing this message board, click the "Participants" link in the heading of the screen. In the "Preferences" section, click the link for "view or edit" of General preferences. Any changes made there require BOINC to update to the project to take effect. This is done from the projects tab of BOINC, select Rosetta, then click the update button.

Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 18983 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Bandit

Send message
Joined: 21 May 06
Posts: 12
Credit: 197,197
RAC: 0
Message 18998 - Posted: 20 Jun 2006, 17:54:18 UTC - in response to Message 18196.  

In followup to Message ID 18855, as long as I don't have IE running, I don't seem to have any BOINC problems. If I leave IE on, I have intermittant BOINC crashes. For me, it does not seem to be the screensaver at this time.

Bandit's Mom
ID: 18998 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
andrewsi

Send message
Joined: 19 Jun 06
Posts: 1
Credit: 10,139,108
RAC: 0
Message 19008 - Posted: 20 Jun 2006, 19:16:58 UTC
Last modified: 20 Jun 2006, 19:19:37 UTC

Ran into a compute error with 522.

6/20/2006 12:12:35 PM|rosetta@home|Unrecoverable error for result t304__CASP7_JUMPRELAX_SAVE_ALL_OUT_BARCODE_hom001__691_17229_0 ( - exit code -1 (0xffffffff)).

Looks like it was: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=21222160

What other information should I provide?

ID: 19008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rriggs

Send message
Joined: 5 Jun 06
Posts: 5
Credit: 48,672
RAC: 0
Message 19009 - Posted: 20 Jun 2006, 19:20:55 UTC - in response to Message 18983.  


Now that you are viewing this message board, click the "Participants" link in the heading of the screen. In the "Preferences" section, click the link for "view or edit" of General preferences. Any changes made there require BOINC to update to the project to take effect. This is done from the projects tab of BOINC, select Rosetta, then click the update button.


You didn't say what these 'should be' so I'm just reporting what they currently are and not changing anything:

work on batteries: no
work while in use: no
idle: 3 mins
hours: (no restrictions)
leave in memory: no
switch between: 60 mins
multiprocessors: 0 processors (although I have two of them!?)
use at most: 100 percent of CPU


ID: 19009 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : Number crunching : Report Problems with Rosetta Version 5.22



©2025 University of Washington
https://www.bakerlab.org