Bug in Rosetta for Linux

Message boards : Number crunching : Bug in Rosetta for Linux

To post messages, you must log in.

AuthorMessage
hurax

Send message
Joined: 24 Sep 05
Posts: 4
Credit: 295,636
RAC: 0
Message 530 - Posted: 26 Sep 2005, 8:10:46 UTC

Hi,
I have encountered a bug with Rosetta 4.77 running on Linux with BOINC 4.43. Sometimes when the scheduling process of BOINC gives control to the Rosetta application, the CPU gets idle, but Rosetta is visible via ps. I also had two computation errors which i very rarely had with other projects.
Greetings, Benno
ID: 530 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Daddygeek
Avatar

Send message
Joined: 17 Sep 05
Posts: 12
Credit: 4,071,353
RAC: 2,220
Message 543 - Posted: 26 Sep 2005, 17:20:59 UTC

I too have been having problems and only on my Linux machines. On my quad server was only using approximately 75% of its processing capability. Checking it this morning, It's down to about 30%. Even after changing the the nice level to -20.(overkill I know) I can't even get one out of four processors to run at 100%. I do plan on putting Microsoft on and see if that makes a difference. but I think the problem is more with BOINC than with Rosetta. Even on other projects, I could not get it to run at 100%.
ID: 543 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The Pirate
Avatar

Send message
Joined: 22 Sep 05
Posts: 20
Credit: 7,090,933
RAC: 0
Message 562 - Posted: 27 Sep 2005, 0:21:59 UTC
Last modified: 27 Sep 2005, 0:24:20 UTC

I think it is with the Rosetta linux client. I am running 3 or 4 distributed projects on four different linux boxes and they all run at 99%. One has dual AMD mp 2600's and both cpu's run at 99%. Currently I am only running Rosetta on Windows XP64 with an AMD 3000 cpu.

ID: 562 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Daddygeek
Avatar

Send message
Joined: 17 Sep 05
Posts: 12
Credit: 4,071,353
RAC: 2,220
Message 647 - Posted: 27 Sep 2005, 18:49:19 UTC

well, I just installed server 2003 started Rosetta and all four processors went to 100%. I will run Windows on this project as long as I can. But I will need to go back to Linux. I see there were asking for assistance in their code, I wish I knew more C++ and what was going on behind Linux to help with their code. As I plan on staying with this project as long as I can.
ID: 647 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Keith E. Laidig
Volunteer moderator
Project developer
Avatar

Send message
Joined: 1 Jul 05
Posts: 154
Credit: 117,189,961
RAC: 0
Message 662 - Posted: 27 Sep 2005, 21:22:28 UTC
Last modified: 27 Sep 2005, 21:25:22 UTC

I haven't seen this sort of problem (although more detail about your troubles would be illuminating). I ran R@H with 64 threads on a 32 node Linux cluster w/o error and with sustained load averages of 1.0 per thread just last night....We've run the app on a wide variety of Linux platforms, could you provide kernel and distro information please?

ID: 662 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Daddygeek
Avatar

Send message
Joined: 17 Sep 05
Posts: 12
Credit: 4,071,353
RAC: 2,220
Message 675 - Posted: 28 Sep 2005, 1:28:39 UTC

I am running a IBM Netfinity 5500. The only change I have made is the OS.

The systems are:

ID: 899 SuSE 10.0 beta (2.6.13-9-smp)

Measured floating point speed 284.71 million ops/sec
Measured integer speed 498.57 million ops/sec
--------------------------------------------------------

ID: 4461 Enterprise Server 2003 (no updates)

Measured floating point speed 493.17 million ops/sec
Measured integer speed 784.71 million ops/sec
--------------------------------------------------------

Yes, If you like Microsoft. This is excellent
But it is almost opposite to all the other benchmarks that I have seen. (well maybe not that good)
ID: 675 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hurax

Send message
Joined: 24 Sep 05
Posts: 4
Credit: 295,636
RAC: 0
Message 688 - Posted: 28 Sep 2005, 9:53:16 UTC - in response to Message 662.  
Last modified: 28 Sep 2005, 9:55:46 UTC

I haven't seen this sort of problem (although more detail about your troubles would be illuminating). I ran R@H with 64 threads on a 32 node Linux cluster w/o error and with sustained load averages of 1.0 per thread just last night....We've run the app on a wide variety of Linux platforms, could you provide kernel and distro information please?

BOINC 4.43
Distribution: Debian testing
Kernel: 2.6.11.10

The only thing I saw was that CPU load dropped to 0 when BOINC scheduled the time from other applications to Rosetta, but three Rosetta processes remained in memory. It worked again after restarting BOINC
ID: 688 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Keith E. Laidig
Volunteer moderator
Project developer
Avatar

Send message
Joined: 1 Jul 05
Posts: 154
Credit: 117,189,961
RAC: 0
Message 697 - Posted: 28 Sep 2005, 15:02:23 UTC - in response to Message 688.  


BOINC 4.43
Distribution: Debian testing
Kernel: 2.6.11.10

The only thing I saw was that CPU load dropped to 0 when BOINC scheduled the time from other applications to Rosetta, but three Rosetta processes remained in memory. It worked again after restarting BOINC


Thanks for the information. When David Kim gets back from holiday I'll see that we look into this.

ID: 697 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Crouse
Avatar

Send message
Joined: 1 Nov 05
Posts: 33
Credit: 67,332
RAC: 0
Message 2305 - Posted: 4 Nov 2005, 23:29:55 UTC

ID: 2305 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Bug in Rosetta for Linux



©2024 University of Washington
https://www.bakerlab.org