Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 216 · 217 · 218 · 219 · 220 · 221 · 222 . . . 303 · Next

AuthorMessage
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106438 - Posted: 28 Jun 2022, 18:41:12 UTC - in response to Message 106436.  
Last modified: 28 Jun 2022, 18:43:01 UTC

While I was trying my last option of swapping the card to another motherboard port, I finally remembered about folding@home.
I rebooted, got some graphics output from the new port, uninstalled folding@home and now I finally think I'm back stable again.

I'm not certain my GTX750 graphics card isn't up to supporting folding@home's GPU tasks, and I'm not sure whether the new GTX1650 will survive it better, but I'm keeping folding@home off the PC just in case.
A pity tbh <sigh>
Random stuff like that can be bad power connectors (check the plug and socket are shiny), or bad heatsink paste (if you want to dismantle it - I made a few cards work fine that were either not being detected or crashing after 2 minutes work, even though they weren't getting hot yet). You can also drop the clock by 10% or so in something like MSI Afterburner to keep an old card plodding on.
ID: 106438 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jdzukley

Send message
Joined: 27 May 11
Posts: 3
Credit: 3,201,191
RAC: 0
Message 106449 - Posted: 30 Jun 2022, 12:26:19 UTC

I am having trouble with Rosetta python tasks completing.

For example I currently have 10 Rosetta jobs running. 4 @ 100%, 0 time left, running from 5 to 6 days, the other jobs are at 70% to 99% complete and have been running from 9 hours to 3 days. System monitor indicates almost no CPU activity. The tasks do slowly increment. Previous forums suggesting make sure the check boxes on disk are not checked. I have not noticed this makes any difference. Most if not all of the tasks currently running have started with the check boxes unchecked. I have seen a few of these jobs finish.

About 1 in 10+ downloaded tasks fall into this situation. The other tasks do complete just fine, usually in less than 2 hours.

Watching the system monitor, once a job stops using more than say 5% of a cpu, the task will be one of these never ending jobs. Also it appears that this happens at the +/- 4 hour mark.

Being summer, I have had BOINC set to 51% cpu to llimit heat generation. However, I did increase the available cpu percentages to allow for 2 python tasks that did consume cpu's to run. At time this was up to 88%

System is Ubuntu, 128 gb RAM 4tb disk, 16 threads.

I really do not want to monitor the system to delete tasks after 8+ hours run time.

No issues running LHC@home vm tasks.

So, if there is something else I could try, please advise. Else I suppose I will join the forum crowd that is just very frustrated with Rosetta.
ID: 106449 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106450 - Posted: 30 Jun 2022, 12:46:06 UTC
Last modified: 30 Jun 2022, 12:47:35 UTC

Some tasks just do that. Maybe someone has a script you can use to monitor a task and kill it, otherwise you have to do it manually. If your computer is doing most of them ok, I wouldn't blame your setup.

As for heat, add more fans! I've got the majority of my Boinc equipment in the garage, so I don't care about noise. If it gets too warm in there I just open windows or even the door.
ID: 106450 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106455 - Posted: 1 Jul 2022, 1:57:47 UTC - in response to Message 106438.  
Last modified: 1 Jul 2022, 1:58:12 UTC

While I was trying my last option of swapping the card to another motherboard port, I finally remembered about folding@home.
I rebooted, got some graphics output from the new port, uninstalled folding@home and now I finally think I'm back stable again.

I'm not certain my GTX750 graphics card isn't up to supporting folding@home's GPU tasks, and I'm not sure whether the new GTX1650 will survive it better, but I'm keeping folding@home off the PC just in case.
A pity tbh <sigh>
Random stuff like that can be bad power connectors (check the plug and socket are shiny), or bad heatsink paste (if you want to dismantle it - I made a few cards work fine that were either not being detected or crashing after 2 minutes work, even though they weren't getting hot yet). You can also drop the clock by 10% or so in something like MSI Afterburner to keep an old card plodding on.

It could be, but this card is 9+yrs old, been on 24/7, has somehow lost two fins on one of its fans (never found the bits that broke off) and when I upgraded the CPU 18 months ago, the guy that put it together for me took the piss out of me for having such an old card even back then. I've decided I've had enough hints and I'm happy about upgrading - not least because I've already bought the replacement so it's too late to change my mind anyway

As suspected (especially after the case clean I did) the PC is stable again and, in non-exceptional use, the graphics card has operated perfectly well
If you're keen or fancy having a play around with fixing it up, I can send it to you if you like. I've still got the original box. Only conditional on confirming the new card works as expected
ID: 106455 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 262
Credit: 499,663
RAC: 802
Message 106457 - Posted: 1 Jul 2022, 4:28:53 UTC - in response to Message 106455.  

I had AMD Radeon HD 6570 untill beginning of 2021.
ID: 106457 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106460 - Posted: 1 Jul 2022, 11:43:08 UTC - in response to Message 106455.  
Last modified: 1 Jul 2022, 11:44:18 UTC

As suspected (especially after the case clean I did) the PC is stable again and, in non-exceptional use, the graphics card has operated perfectly well
If you're keen or fancy having a play around with fixing it up, I can send it to you if you like. I've still got the original box. Only conditional on confirming the new card works as expected
Just checked the specs and it's a quarter of the speed of the 280X cards I mostly use. I'll pass. You could sell it on Ebay for £20 to £30, but only in full working order, otherwise it's only worth £10.
ID: 106460 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106464 - Posted: 2 Jul 2022, 2:57:58 UTC - in response to Message 106460.  

As suspected (especially after the case clean I did) the PC is stable again and, in non-exceptional use, the graphics card has operated perfectly well
If you're keen or fancy having a play around with fixing it up, I can send it to you if you like. I've still got the original box. Only conditional on confirming the new card works as expected
Just checked the specs and it's a quarter of the speed of the 280X cards I mostly use. I'll pass. You could sell it on ebay for £20 to £30, but only in full working order, otherwise it's only worth £10.

I'm sure you're right. More trouble than it's worth to sell so better to give it away to anyone who can make use of it in whatever way.
Better than most on-board graphics, for example, which is why I added a meagre R260X I was given to the PC I'm on at the moment which has U630 onboard graphics
ID: 106464 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106466 - Posted: 2 Jul 2022, 10:52:26 UTC - in response to Message 106464.  
Last modified: 2 Jul 2022, 10:53:22 UTC

I'm sure you're right. More trouble than it's worth to sell so better to give it away to anyone who can make use of it in whatever way.
Better than most on-board graphics, for example, which is why I added a meagre R260X I was given to the PC I'm on at the moment which has U630 onboard graphics
Give it to someone's kid who plays games which aren't the 3D demanding stuff. Depends what the onboard graphics are, the latest ones are probably as good, but there's bound to be someone with a basic PC who could benefit. But didn't you say it was unreliable when it started to think? If someone plays a game on it wouldn't it crash? Maybe you should try one and see before you burden someone with a buggy computer.

Anyway, there's Gumtree and Freecycle you could stick it on to give it away, someone will take it if it's working. Saves you bothering with postage etc.
ID: 106466 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106467 - Posted: 3 Jul 2022, 19:21:10 UTC - in response to Message 106466.  

I'm sure you're right. More trouble than it's worth to sell so better to give it away to anyone who can make use of it in whatever way.
Better than most on-board graphics, for example, which is why I added a meagre R260X I was given to the PC I'm on at the moment which has U630 onboard graphics
Give it to someone's kid who plays games which aren't the 3D demanding stuff. Depends what the onboard graphics are, the latest ones are probably as good, but there's bound to be someone with a basic PC who could benefit. But didn't you say it was unreliable when it started to think? If someone plays a game on it wouldn't it crash? Maybe you should try one and see before you burden someone with a buggy computer.

Anyway, there's Gumtree and Freecycle you could stick it on to give it away, someone will take it if it's working. Saves you bothering with postage etc.

I have quite modern motherboards whose onboard graphics are nowhere near the GTX750 or R260X. It's reliable for WCG GPU tasks, just not for folding@home. I think there's quite some difference there.
I wouldn't anticipate games being anywhere near as demanding in the same way, so while I'm having issues with the GTX750 with distributed computing, I've never had a single issue with more routine demands.

I missed DPD today for the new gear. Went to their pickup point, but they didn't leave it there, so I'll have to sort it out tomorrow.
After that I'll consider gumtreefreecycle. I've got quite the list of discards they might take off my hands
ID: 106467 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106468 - Posted: 3 Jul 2022, 19:56:10 UTC - in response to Message 106467.  

I have quite modern motherboards whose onboard graphics are nowhere near the GTX750 or R260X. It's reliable for WCG GPU tasks, just not for folding@home. I think there's quite some difference there.
I wouldn't anticipate games being anywhere near as demanding in the same way, so while I'm having issues with the GTX750 with distributed computing, I've never had a single issue with more routine demands.

I missed DPD today for the new gear. Went to their pickup point, but they didn't leave it there, so I'll have to sort it out tomorrow.
After that I'll consider gumtreefreecycle. I've got quite the list of discards they might take off my hands
Games use a GPU fully, just like Boinc and Folding.

What new gear are you getting? I just get everything delivered to my garden shed (which has CCTV, not that anyone's ever stolen anything from it).
ID: 106468 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106470 - Posted: 5 Jul 2022, 19:45:43 UTC - in response to Message 106468.  

I have quite modern motherboards whose onboard graphics are nowhere near the GTX750 or R260X. It's reliable for WCG GPU tasks, just not for folding@home. I think there's quite some difference there.
I wouldn't anticipate games being anywhere near as demanding in the same way, so while I'm having issues with the GTX750 with distributed computing, I've never had a single issue with more routine demands.

I missed DPD today for the new gear. Went to their pickup point, but they didn't leave it there, so I'll have to sort it out tomorrow.
After that I'll consider gumtreefreecycle. I've got quite the list of discards they might take off my hands
Games use a GPU fully, just like Boinc and Folding.
Not sure that's entirely right. WCG GPU tasks were trivial here. Folding went bang (not quite, but certainly tipped me over the edge in sort order)

What new gear are you getting? I just get everything delivered to my garden shed (which has CCTV, not that anyone's ever stolen anything from it).
I'm not getting them delivered to your shed...
The box was so badly packed a small 22" monitor and GTX1650 OC graphics card came in a box over 1m squared and about a foot high, so the guy at the pickup point refused to take it in due to its size. Not sure how that works.
DPD don't allow collections from their depot so they redelivered it yesterday. Assembled but not connected up just yet because the PC's still stable. I'll get to it shortly.
£158 for that graphics card was pretty decent, I thought, with prices the way they are atm. £20 off due to it being open box, but all it had was a label part torn off
ID: 106470 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106471 - Posted: 5 Jul 2022, 19:48:14 UTC
Last modified: 5 Jul 2022, 19:50:32 UTC

Lol
05/07/2022 20:27:16 | Rosetta@home | Sending scheduler request: To report completed tasks.
05/07/2022 20:27:16 | Rosetta@home | Reporting 3 completed tasks
05/07/2022 20:27:16 | Rosetta@home | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: )
05/07/2022 20:27:19 | Rosetta@home | Scheduler request completed
05/07/2022 20:27:19 | Rosetta@home | Project requested delay of 31 seconds
05/07/2022 20:27:46 | World Community Grid | Resetting project
05/07/2022 20:27:46 | World Community Grid | Detaching from project
05/07/2022 20:28:51 | | Fetching configuration file from https://master.worldcommunitygrid.org/get_project_config.php
05/07/2022 20:29:48 | World Community Grid | Fetching scheduler list
05/07/2022 20:29:50 | World Community Grid | Master file download succeeded
05/07/2022 20:29:55 | World Community Grid | Sending scheduler request: Project initialization.
05/07/2022 20:29:55 | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
05/07/2022 20:29:57 | World Community Grid | Scheduler request completed: got 0 new tasks

Edit: Got Rosetta tasks at the next attempt. I thought I'd be getting none of either for a minute
ID: 106471 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 191
Credit: 6,475,014
RAC: 6,131
Message 106472 - Posted: 5 Jul 2022, 20:20:44 UTC - in response to Message 106471.  

Edit: Got Rosetta tasks at the next attempt. I thought I'd be getting none of either for a minute


I have not got any work units in about a monrh.
ID: 106472 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106473 - Posted: 5 Jul 2022, 20:42:23 UTC - in response to Message 106470.  
Last modified: 5 Jul 2022, 20:43:01 UTC

Games use a GPU fully, just like Boinc and Folding.
Not sure that's entirely right. WCG GPU tasks were trivial here. Folding went bang (not quite, but certainly tipped me over the edge in sort order)
The fan speed, temperature, power consumption, % usage all go to maximum with a 3D game. Some Boinc tasks need a fair bit of CPU so the GPU might not get maxed out unless you get it to run a few at once.
ID: 106473 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106474 - Posted: 5 Jul 2022, 20:44:36 UTC - in response to Message 106472.  
Last modified: 5 Jul 2022, 20:44:46 UTC

Edit: Got Rosetta tasks at the next attempt. I thought I'd be getting none of either for a minute
I have not got any work units in about a monrh.
Python tasks are always available, check you haven't had your host banned (for only one failure!) If you can't do thopse, it's pot luck, I get some every other day, but I have 10 machines asking.
ID: 106474 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106475 - Posted: 5 Jul 2022, 20:45:58 UTC - in response to Message 106471.  
Last modified: 5 Jul 2022, 20:48:53 UTC

05/07/2022 20:27:46 | World Community Grid | Resetting project
05/07/2022 20:27:46 | World Community Grid | Detaching from project
05/07/2022 20:28:51 | | Fetching configuration file from https://master.worldcommunitygrid.org/get_project_config.php
05/07/2022 20:29:48 | World Community Grid | Fetching scheduler list
05/07/2022 20:29:50 | World Community Grid | Master file download succeeded
05/07/2022 20:29:55 | World Community Grid | Sending scheduler request: Project initialization.
05/07/2022 20:29:55 | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
05/07/2022 20:29:57 | World Community Grid | Scheduler request completed: got 0 new tasks
Ah! They're progressing. My machines have now successfully attached, and the three that were already attached have retrieved my username and points history. Just no tasks yet.

I keep getting this though:

This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/

When I tried that address, it didn't work at all, I got:

Not found in the database
Invalid ID
ID: 106475 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 106494 - Posted: 10 Jul 2022, 4:54:59 UTC - in response to Message 106475.  

05/07/2022 20:27:46 | World Community Grid | Resetting project
05/07/2022 20:27:46 | World Community Grid | Detaching from project
05/07/2022 20:28:51 | | Fetching configuration file from https://master.worldcommunitygrid.org/get_project_config.php
05/07/2022 20:29:48 | World Community Grid | Fetching scheduler list
05/07/2022 20:29:50 | World Community Grid | Master file download succeeded
05/07/2022 20:29:55 | World Community Grid | Sending scheduler request: Project initialization.
05/07/2022 20:29:55 | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
05/07/2022 20:29:57 | World Community Grid | Scheduler request completed: got 0 new tasks
Ah! They're progressing. My machines have now successfully attached, and the three that were already attached have retrieved my username and points history. Just no tasks yet.

I keep getting this though:

This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/

When I tried that address, it didn't work at all, I got:

Not found in the database
Invalid ID

I've changed over 3 of 4 PCs successfully enough. The only difficulty was a work machine where our email address changed after originally joining 12yrs ago. but I worked it out in the end.
I guess I'll have to do my phone as well, now I think about it.
Tasks are in short supply and only for the useless OPN project that I've seen.
But I wasn't expecting tasks until the end of the month, so anything is a bonus right now
ID: 106494 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 191
Credit: 6,475,014
RAC: 6,131
Message 106495 - Posted: 10 Jul 2022, 6:50:47 UTC - in response to Message 106431.  

I still get this from one that managed to attach:

471	World Community Grid	28-06-2022 10:55 AM	update requested by user	
472	World Community Grid	28-06-2022 10:55 AM	Sending scheduler request: Requested by user.	
473	World Community Grid	28-06-2022 10:55 AM	Requesting new tasks for CPU and AMD/ATI GPU	
474	World Community Grid	28-06-2022 10:55 AM	Scheduler request failed: HTTP service unavailable


I wonder why mine looks so different.

Sun 10 Jul 2022 02:37:31 AM EDT | World Community Grid | Sending scheduler request: To fetch work.
Sun 10 Jul 2022 02:37:31 AM EDT | World Community Grid | Requesting new tasks for CPU
Sun 10 Jul 2022 02:37:32 AM EDT | World Community Grid | Scheduler request completed: got 0 new tasks
Sun 10 Jul 2022 02:37:32 AM EDT | World Community Grid | This project is using an old URL.  When convenient, remove the project, then add https://master.worldcommunitygrid.org/
Sun 10 Jul 2022 02:37:32 AM EDT | World Community Grid | Project has no tasks available
Sun 10 Jul 2022 02:37:32 AM EDT | World Community Grid | Project requested delay of 121 seconds

ID: 106495 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,874,133
RAC: 9,493
Message 106496 - Posted: 10 Jul 2022, 11:44:20 UTC - in response to Message 106494.  
Last modified: 10 Jul 2022, 11:45:02 UTC

I keep getting this though:

This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/

When I tried that address, it didn't work at all, I got:

Not found in the database
Invalid ID
I've changed over 3 of 4 PCs successfully enough. The only difficulty was a work machine where our email address changed after originally joining 12yrs ago. but I worked it out in the end.
I guess I'll have to do my phone as well, now I think about it.
Tasks are in short supply and only for the useless OPN project that I've seen.
But I wasn't expecting tasks until the end of the month, so anything is a bonus right now
Are you saying you changed your PCs to master instead of www? That does not work for me, I just tried it again, I'm not found in the database.

I got one OPN yesterday, and 7 OPNs today, that's with 10 devices attached to it, which will check every so often when they're low on work from other projects, since WCG will have a debt on them.

I've read in the WCG forum there's some technical problems. Others have reported an error of a lack of disk space on the server when returning results (odd).
ID: 106496 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
PMH_UK

Send message
Joined: 9 Aug 08
Posts: 16
Credit: 1,243,749
RAC: 0
Message 106497 - Posted: 10 Jul 2022, 13:32:42 UTC - in response to Message 106496.  
Last modified: 10 Jul 2022, 13:33:07 UTC

I keep getting this though:

This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/

When I tried that address, it didn't work at all, I got:

Not found in the database
Invalid ID
I've changed over 3 of 4 PCs successfully enough. The only difficulty was a work machine where our email address changed after originally joining 12yrs ago. but I worked it out in the end.
I guess I'll have to do my phone as well, now I think about it.
Tasks are in short supply and only for the useless OPN project that I've seen.
But I wasn't expecting tasks until the end of the month, so anything is a bonus right now
Are you saying you changed your PCs to master instead of www? That does not work for me, I just tried it again, I'm not found in the database.

I got one OPN yesterday, and 7 OPNs today, that's with 10 devices attached to it, which will check every so often when they're low on work from other projects, since WCG will have a debt on them.

I've read in the WCG forum there's some technical problems. Others have reported an error of a lack of disk space on the server when returning results (odd).


No need to change/remove/add for new URL, that is an error not resolved yet.
Only a few units going out so far for MCM and OPN1 (CPU) for testing so far.

Disk space message was due to a sick server, removed pending fix.

See WCG forum, Recent Threads, for more.

Paul.
ID: 106497 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 216 · 217 · 218 · 219 · 220 · 221 · 222 . . . 303 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org