Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 277 · 278 · 279 · 280 · 281 · 282 · 283 . . . 302 · Next
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
Nope.Server is still dead.It seem mostly up for me. The boinc-process server is still dead, that's according to the Server Staus page & the number of Tasks that are piling up waiting for Validation & Assimilation. Waiting for Validation is over 325,000 now. That's why even though people are returning work, their Credit isn't increasing & their RAC is going down. Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now). Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
I really wish they'd fix the application error handling, or at least the data they send out to process. Got a bunch of Tasks that have errored out. ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.*deep sigh* Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now).And the backlog has cleared. Grant Darwin NT |
Chris Raisin Send message Joined: 18 May 16 Posts: 2 Credit: 5,536,296 RAC: 627 |
I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it. The message (relating solely to Rosetta@Home) is: "Could not determine location of executable. Could not find database. Either specify -database or set variable ROSETTA3_db" Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification? Many thanks, Chris Raisin |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,281,662 RAC: 1,807 |
I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it. I've seen that message many times. Until those workunits get some hard to guess change, expect many more workunits running under Windows to have the same problem. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.Where are those error messages being shown? Looking at your results, there are only 2 that have errored out, ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.Which has been an issue with some Tasks for ages now. Other than what appears to be a heavily loaded system (11.5 hours to do 8 hours work, 4 hrs 15 min to do 3 hrs work), other than the 2 errored Tasks(due to a configuration issue with the Tasks themselves), all the others have processed & Validated without issue. Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
Where are those error messages being shown? Seems the message of the screensaver... |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 6,015 |
A strange error, sadly I can only give a sketchy report but I hope it’s enough :- Host = https://boinc.bakerlab.org/rosetta/results.php?hostid=6231982 Boinc 7.24.1, Ubuntu 22.04.4 I allowed Ubuntu to update and then rebooted, subsequent to this Boinc Manager disconnected after running for about a minute - the event log showed a Rosetta task restarting and immediately Boinc closing having received signal 15. This would repeat each time I restated the host and the Boinc service restarted. I have now aborted all of the Rosetta tasks and this behaviour has now stopped. (How) can a Rosetta task kill Boinc? Just a notification as I’ve never heard this described before. |
MStenholm Send message Joined: 18 Apr 20 Posts: 18 Credit: 26,052,819 RAC: 24,583 |
You ran out of memory. Six jobs of 2.6 GB and you have 16 GB. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.That might do it. I've got half that many cores/threads & twice that amount of RAM and over the last couple of days when i had mostly Rosetta_VS Tasks there have been times i've had over 60% of my RAM in use. Even without the 2GB + Tasks, there were plenty of others using 1-1.5GB. But normally if lack of RAM is an issue, the Taks should have suspended with a "Waiting for memory" note. It shouldn't cause things to crash & burn. Grant Darwin NT |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 6,015 |
You ran out of memory. Six jobs of 2.6 GB and you have 16 GB. Ach, I thought I had 32gb. I remember now, the 2 sticks wouldn't play with each other :-( The other machine has 64gb, I'll update this one to match Thanks |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
Looks like the boinc-process server is having issues yet again- Rosetta beta Validator & Assimilator are down (along with a few other processes). How far behind witll the Validator get this time? Presently 11,825 Workunits waiting for Validation. Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
Looks like the boinc-process server is having issues yet again- Rosetta beta Validator & Assimilator are down (along with a few other processes). How far behind witll the Validator get this time?Backlog is now 20,000, but Validator now shows as running. Will have to wait a while to see if it actually is. Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
Backlog is now 20,000, but Validator now shows as running. Will have to wait a while to see if it actually is. Now is 0. Validator queue is empty. |
Dr Who Fan Send message Joined: 28 May 06 Posts: 70 Credit: 267,358 RAC: 452 |
Me & all wingman Seeing lots of errors on Android due to what appears to be misconfigured Rosetta task: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1395840251 [ ERROR ]: Caught exception: File: src/core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306 chi angle must be between -180 and 180: nan ------------------------ Begin developer's backtrace ------------------------- BACKTRACE: ------------------------- End developer's backtrace -------------------------- |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
Me & all wingman Seeing lots of errors on Android due to what appears to be misconfigured Rosetta task:Been a problem for years now. Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
chi angle must be between -180 and 180: nan A great classic!! |
dcs1955 Send message Joined: 2 Dec 22 Posts: 13 Credit: 5,953,397 RAC: 14,294 |
Waiting for Memory.... For the past two weeks I have had one of four core processes held up for needing memory.. It happens on two of my desktops with 16 GRAM. In over 8 years crunching WCG and Rosetta I have not had this happen. Since all the work is Rosetta Beta 6.04. Is this a known issue?? |
kotenok2000 Send message Joined: 22 Feb 11 Posts: 259 Credit: 497,274 RAC: 1,201 |
RosettaVS tasks use more memory than 8a_hal |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2024 University of Washington
https://www.bakerlab.org