Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 8 · 9 · 10 · 11 · 12 · 13 · 14 . . . 302 · Next

AuthorMessage
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81309 - Posted: 11 Mar 2017, 19:33:09 UTC - in response to Message 80621.  

All work units errored out. Reset project and all downloads show error in download. Have been doing Rosetta for years but now not sure I want to continue. Sorry, it now says "reached daily quota of 60 units" of course I now have NO work and will have to get a new Boinc project. If you want us to be loyal you must keep the errors fixed and your servers UP. Note my other computer is working fine so the problem is not with my end. Sorry, I'll wait over night then I guess I'm out-of-here. Really sorry because I think your doing the best work on Boinc. Best, John
ID: 81309 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BubbleBoy

Send message
Joined: 5 May 14
Posts: 2
Credit: 634,699,331
RAC: 0
Message 81311 - Posted: 11 Mar 2017, 20:42:25 UTC

I have never seen a project that is administrated so unprofessional.
it's sad, just sad.
ID: 81311 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 81312 - Posted: 11 Mar 2017, 21:35:22 UTC

@John Newbould, your download errors all seem to show "RSA key check failed for file", on numerous different files. As you say, your other machine is working fine, so that would tend to indicate the R@h servers copies are correct. Also, other machines appear to have downloaded the WUs yours failed on.

Is it possible you have some anti-virus software on the failing machine that is corrupting the download files because it believes they match some virus signature? Do you have a different AV between your two machines?
Rosetta Moderator: Mod.Sense
ID: 81312 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81313 - Posted: 12 Mar 2017, 0:08:50 UTC - in response to Message 81312.  

@John Newbould, your download errors all seem to show "RSA key check failed for file", on numerous different files. As you say, your other machine is working fine, so that would tend to indicate the R@h servers copies are correct. Also, other machines appear to have downloaded the WUs yours failed on.

Is it possible you have some anti-virus software on the failing machine that is corrupting the download files because it believes they match some virus signature? Do you have a different AV between your two machines?


Thanks for your quick reply. No both are win7 with Microsoft security essentials only. I saw this problem once before some months ago but it cleared itself after a day or two. Of course I still had valid work and never ran out. If it doesn't clear over night I will remove and reinstall Boinc. If that fails I will post again then. Thanks again. Best, John.
ID: 81313 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81319 - Posted: 12 Mar 2017, 11:50:33 UTC

Ok overnight I got more "RSA key check failed for file" and no valid work to do. I uninstalled Boinc and re-installed it - no joy. I checked I can download from other sites both program and data OK on all tests. I've checked all running processes and found nothing. see below the event log file if that gives any clue.:
3/12/2017 7:01:28 AM | | Starting BOINC client version 7.6.33 for windows_x86_64
3/12/2017 7:01:28 AM | | log flags: file_xfer, sched_ops, task
3/12/2017 7:01:28 AM | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8
3/12/2017 7:01:28 AM | | Data directory: C:ProgramDataBOINC
3/12/2017 7:01:28 AM | | Running under account John
3/12/2017 7:01:28 AM | | CAL: ATI GPU 0: ATI Radeon HD 4350/4550 (R710) (CAL version 1.4.1385, 1024MB, 992MB available, 192 GFLOPS peak)
3/12/2017 7:01:28 AM | | Version change (7.6.22 -> 7.6.33)
3/12/2017 7:01:28 AM | | Host name: TVcomputer
3/12/2017 7:01:28 AM | | Processor: 4 GenuineIntel Intel(R) Core(TM) i3 CPU 540 @ 3.07GHz [Family 6 Model 37 Stepping 5]
3/12/2017 7:01:28 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt syscall nx lm vmx tm2 pbe
3/12/2017 7:01:28 AM | | OS: Microsoft Windows 7: Professional x64 Edition, Service Pack 1, (06.01.7601.00)
3/12/2017 7:01:28 AM | | Memory: 15.93 GB physical, 31.81 GB virtual
3/12/2017 7:01:28 AM | | Disk: 148.95 GB total, 53.51 GB free
3/12/2017 7:01:28 AM | | Local time is UTC -4 hours
3/12/2017 7:01:28 AM | rosetta@home | URL https://boinc.bakerlab.org/rosetta/; Computer ID 1549931; resource share 100
3/12/2017 7:01:28 AM | rosetta@home | General prefs: from rosetta@home (last modified 06-Dec-2015 21:10:37)
3/12/2017 7:01:28 AM | rosetta@home | Computer location: home
3/12/2017 7:01:28 AM | rosetta@home | General prefs: no separate prefs for home; using your defaults
3/12/2017 7:01:28 AM | | Reading preferences override file
3/12/2017 7:01:28 AM | | Preferences:
3/12/2017 7:01:28 AM | | max memory usage when active: 15984.89MB
3/12/2017 7:01:28 AM | | max memory usage when idle: 16311.12MB
3/12/2017 7:01:28 AM | | max disk usage: 40.00GB
3/12/2017 7:01:28 AM | | don't use GPU while active
3/12/2017 7:01:28 AM | | suspend work if non-BOINC CPU load exceeds 25%
3/12/2017 7:01:28 AM | | (to change preferences, visit a project web site or select Preferences in the Manager)
3/12/2017 7:01:28 AM | rosetta@home | Resetting file projects/boinc.bakerlab.org_rosetta/jr9_0084__data.zip: md5 checksum failed for file
3/12/2017 7:01:28 AM | | Running CPU benchmarks
3/12/2017 7:01:28 AM | | Suspending computation - CPU benchmarks in progress
3/12/2017 7:01:28 AM | rosetta@home | Fetching scheduler list
3/12/2017 7:01:30 AM | rosetta@home | Master file download succeeded
3/12/2017 7:01:35 AM | rosetta@home | Sending scheduler request: To report completed tasks.
3/12/2017 7:01:35 AM | rosetta@home | Reporting 42 completed tasks
3/12/2017 7:01:35 AM | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:01:38 AM | rosetta@home | Scheduler request completed: got 0 new tasks
3/12/2017 7:01:38 AM | rosetta@home | No work sent
3/12/2017 7:01:38 AM | rosetta@home | (reached daily quota of 4 results)
3/12/2017 7:02:00 AM | | Benchmark results:
3/12/2017 7:02:00 AM | | Number of CPUs: 4
3/12/2017 7:02:00 AM | | 3091 floating point MIPS (Whetstone) per CPU
3/12/2017 7:02:00 AM | | 7739 integer MIPS (Dhrystone) per CPU
3/12/2017 7:02:01 AM | rosetta@home | Started download of minirosetta_database_d0bf94b.zip
3/12/2017 7:09:43 AM | rosetta@home | Finished download of minirosetta_database_d0bf94b.zip
3/12/2017 7:11:14 AM | rosetta@home | update requested by user
3/12/2017 7:11:19 AM | rosetta@home | Sending scheduler request: Requested by user.
3/12/2017 7:11:19 AM | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:11:20 AM | rosetta@home | Scheduler request completed: got 0 new tasks
3/12/2017 7:11:20 AM | rosetta@home | No work sent
3/12/2017 7:11:20 AM | rosetta@home | (reached daily quota of 4 results)
3/12/2017 7:11:33 AM | rosetta@home | update requested by user
3/12/2017 7:11:35 AM | rosetta@home | Sending scheduler request: Requested by user.
3/12/2017 7:11:35 AM | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:11:36 AM | rosetta@home | Scheduler request completed: got 0 new tasks
3/12/2017 7:11:36 AM | rosetta@home | Not sending work - last request too recent: 16 sec
3/12/2017 7:11:40 AM | rosetta@home | update requested by user
3/12/2017 7:11:41 AM | rosetta@home | Sending scheduler request: Requested by user.
3/12/2017 7:11:41 AM | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:11:43 AM | rosetta@home | Scheduler request completed: got 0 new tasks
3/12/2017 7:11:43 AM | rosetta@home | Not sending work - last request too recent: 7 sec
3/12/2017 7:15:49 AM | rosetta@home | Sending scheduler request: To fetch work.
3/12/2017 7:15:49 AM | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:15:51 AM | rosetta@home | Scheduler request completed: got 0 new tasks
3/12/2017 7:15:51 AM | rosetta@home | No work sent
3/12/2017 7:15:51 AM | rosetta@home | (reached daily quota of 4 results)


Now I will attach a new project. Einstein@Home attached OK got work and is now running the project OK
However Einstein gid have a glitch like we are seeing in the middle' which I paste here:

3/12/2017 7:29:59 AM | Einstein@Home | Finished download of Pulsars_J2007.jpg
3/12/2017 7:29:59 AM | Einstein@Home | Finished download of Pulsars_schem1.jpg
3/12/2017 7:30:00 AM | | Internet access OK - project servers may be temporarily down.
3/12/2017 7:30:12 AM | Einstein@Home | Finished download of JPLEPH.405
3/12/2017 7:30:13 AM | Einstein@Home | [error] MD5 check failed for JPLEPH.405
3/12/2017 7:30:13 AM | Einstein@Home | [error] expected d6ce12bacd2a81a56423f5f238ba84eb, got 19fb3bd06bc71de72e8f3eefbb77136e
3/12/2017 7:30:13 AM | Einstein@Home | [error] Checksum or signature error for JPLEPH.405
3/12/2017 7:30:51 AM | Einstein@Home | File GW_BBH1.jpg exists already, skipping download
3/12/2017 7:31:10 AM | Einstein@Home | update requested by user
3/12/2017 7:31:11 AM | Einstein@Home | Sending scheduler request: Requested by user.
3/12/2017 7:31:11 AM | Einstein@Home | Reporting 1 completed tasks
3/12/2017 7:31:11 AM | Einstein@Home | Requesting new tasks for CPU and AMD/ATI GPU
3/12/2017 7:31:14 AM | Einstein@Home | Scheduler request completed: got 35 new tasks
3/12/2017 7:31:16 AM | Einstein@Home | Started download of JPLEPH.405
3/12/2017 7:31:16 AM | Einstein@Home | Started download of templates_LATeah0015L_1136_12661600.dat
3/12/2017 7:31:18 AM | Einstein@Home | Finished download of templates_LATeah0015L_1136_12661600.dat
3/12/2017 7:31:18 AM | Einstein@Home | Started download of templates_LATeah0015L_1136_12661775.dat



It then completed OK and as I said is now running fine! everything ok the computer (my primary "fastest") is now without Rosetta work for the first time since I bought it!
I will leave both projects on Boinc for now. I can flush all App data if you think that will help? and start from scratch? Reload Boinc to an alternate location? I hope you can help. Thanks Much, John
ID: 81319 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2125
Credit: 41,249,734
RAC: 9,368
Message 81322 - Posted: 13 Mar 2017, 1:55:21 UTC - in response to Message 81319.  

3/12/2017 7:01:38 AM | rosetta@home | (reached daily quota of 4 results)

This is odd. I thought the limit was 100. How did that get in there?
ID: 81322 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81324 - Posted: 13 Mar 2017, 9:17:25 UTC

Yes 4 is strange. Two days ago it was 60 now FOUR? Of course the server must be limited so anyone with errors, like I'm having, would not "lock-up" the server with requests. But it would be nice that a manual "reset project" would clear/reset the limit. As it is now I must wait about 24 hours to try again!
ID: 81324 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Darrell

Send message
Joined: 28 Sep 06
Posts: 25
Credit: 51,934,631
RAC: 0
Message 81325 - Posted: 13 Mar 2017, 13:24:04 UTC

It appears that Rosetta Mini 3.73 is set to tell the BOINC Manager that it is a non-compute-intensive application, but we know it IS compute-intensive.

I use the BOINC Client parameters <process_priority>1</process_priority> and <process_priority_special>3</process_priority_special> at various times. They work as follows: (from the BOINC Wiki)

<process_priority>N</process_priority>, <process_priority_special>N</process_priority_special>
The OS process priority at which tasks are run. Values are 0 (lowest priority, the default), 1 (below normal), 2 (normal), 3 (above normal), 4 (high) and 5 (real-time - not recommended). 'special' process priority is used for coprocessor (GPU) applications, wrapper applications, and non-compute-intensive applications, 'process priority' for all others. The two options can be used independently.

Using my values, Rosetta is running as "Above Normal", i.e., BOINC Manager treats it as a non-compute-intensive application. This should be corrected so that the BOINC Manager can assign the correct (user defined) priority to the tasks.
ID: 81325 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 81326 - Posted: 13 Mar 2017, 14:06:13 UTC - in response to Message 81324.  

Yes 4 is strange. Two days ago it was 60 now FOUR? Of course the server must be limited so anyone with errors, like I'm having, would not "lock-up" the server with requests. But it would be nice that a manual "reset project" would clear/reset the limit. As it is now I must wait about 24 hours to try again!


Exactly, this is one of the server's self-protection mechanisms.

As failures are reported back, the number is reduced and reflected in the messages. Successful WU reports increase it again, as well as passage of time.
Rosetta Moderator: Mod.Sense
ID: 81326 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Darrell

Send message
Joined: 28 Sep 06
Posts: 25
Credit: 51,934,631
RAC: 0
Message 81329 - Posted: 14 Mar 2017, 0:42:08 UTC - in response to Message 81325.  

It appears that Rosetta Mini 3.73 is set to tell the BOINC Manager that it is a non-compute-intensive application, but we know it IS compute-intensive.


I need to retract this claim somewhat, as I have only observed it on one of my computers. Do not investigate further until/unless I can give more details as to why only one was affected.

Thanks.
ID: 81329 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81331 - Posted: 14 Mar 2017, 19:22:32 UTC - in response to Message 81326.  

Yes 4 is strange. Two days ago it was 60 now FOUR? Of course the server must be limited so anyone with errors, like I'm having, would not "lock-up" the server with requests. But it would be nice that a manual "reset project" would clear/reset the limit. As it is now I must wait about 24 hours to try again!


Exactly, this is one of the server's self-protection mechanisms.

As failures are reported back, the number is reduced and reflected in the messages. Successful WU reports increase it again, as well as passage of time.


I understand but that does not my error in downloading. I can't get new work but the Einstein@home works fine and I'm not sure if I shouldn't just abandon Rosetta after ten yeast of effort on the project. if you cant help with some ideas that's all I can do. I asked if detaching and wiping all App data might work? Please, help. John
ID: 81331 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 81332 - Posted: 14 Mar 2017, 20:23:52 UTC - in response to Message 81331.  

Yes 4 is strange. Two days ago it was 60 now FOUR? Of course the server must be limited so anyone with errors, like I'm having, would not "lock-up" the server with requests. But it would be nice that a manual "reset project" would clear/reset the limit. As it is now I must wait about 24 hours to try again!


Exactly, this is one of the server's self-protection mechanisms.

As failures are reported back, the number is reduced and reflected in the messages. Successful WU reports increase it again, as well as passage of time.


I understand but that does not my error in downloading. I can't get new work but the Einstein@home works fine and I'm not sure if I shouldn't just abandon Rosetta after ten yeast of effort on the project. if you cant help with some ideas that's all I can do. I asked if detaching and wiping all App data might work? Please, help. John


Looks like your system now has two WUs that completed their downloads without errors? Did you find a change to make? Or did your AV receive an update?

Rosetta Moderator: Mod.Sense
ID: 81332 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 6,329,749
RAC: 0
Message 81336 - Posted: 15 Mar 2017, 21:02:32 UTC - in response to Message 81332.  

Yes 4 is strange. Two days ago it was 60 now FOUR? Of course the server must be limited so anyone with errors, like I'm having, would not "lock-up" the server with requests. But it would be nice that a manual "reset project" would clear/reset the limit. As it is now I must wait about 24 hours to try again!


Exactly, this is one of the server's self-protection mechanisms.

As failures are reported back, the number is reduced and reflected in the messages. Successful WU reports increase it again, as well as passage of time.


I understand but that does not my error in downloading. I can't get new work but the Einstein@home works fine and I'm not sure if I shouldn't just abandon Rosetta after ten yeast of effort on the project. if you cant help with some ideas that's all I can do. I asked if detaching and wiping all App data might work? Please, help. John


Looks like your system now has two WUs that completed their downloads without errors? Did you find a change to make? Or did your AV receive an update?

I did nothing! but you are right. The problem seems to have cleared overnight. I have stopped the Einstein from getting new tasks so as it runs out of work I hope Rosetta will return to normal. There is really no explanation for what happened... I thank you for your help and you can log this as unexplained. Glad to be back to work. Best, John.
ID: 81336 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Darrell

Send message
Joined: 28 Sep 06
Posts: 25
Credit: 51,934,631
RAC: 0
Message 81337 - Posted: 16 Mar 2017, 4:01:34 UTC - in response to Message 81325.  

See original post here

It appears that Rosetta Mini 3.73 is set to tell the BOINC Manager to use the priority for "... coprocessor (GPU) applications, wrapper applications, and non-compute-intensive applications ...". The priority assigned "follows" (is assigned the changed level) when I change the <process_priority_special>N</process_priority_special> parameter setting, but only in the one computer where I am running LHC applications using Virtual Box. It does not change in my other computers.

Does anyone have a clue as to why, and more importantly, how to separate the priorities between those two applications? Is this a bug in the server for Rosetta Mini 3.73, the application, or the BOINC Manager 7.6.33?

I want the VBox wrapper to run at a higher priority than compute-intensive applications, not at the same level.
ID: 81337 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 81351 - Posted: 20 Mar 2017, 17:58:51 UTC

I'm having a busy day. Could someone please look up the links to the instructions on moving BOINC to another PC harddrive and post back to SHAWN ?
Rosetta Moderator: Mod.Sense
ID: 81351 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Joan Tarshis

Send message
Joined: 17 Mar 17
Posts: 13
Credit: 11,336
RAC: 0
Message 81378 - Posted: 27 Mar 2017, 20:54:50 UTC - in response to Message 80657.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


I've gotten projects from others but none from Rosetta.



LHC gives me no tasks and the Lattice Project downloads fail one at a time.

I'm on a new MacBook.

Joan
ID: 81378 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Joan Tarshis

Send message
Joined: 17 Mar 17
Posts: 13
Credit: 11,336
RAC: 0
Message 81379 - Posted: 27 Mar 2017, 20:55:34 UTC - in response to Message 80657.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


I've gotten projects from others but none from Rosetta.



LHC gives me no tasks and the Lattice Project downloads fail one at a time.

I'm on a new MacBook.

Joan
ID: 81379 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Joan Tarshis

Send message
Joined: 17 Mar 17
Posts: 13
Credit: 11,336
RAC: 0
Message 81380 - Posted: 27 Mar 2017, 20:58:50 UTC - in response to Message 80657.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


I've gotten projects from others but none from Rosetta.



LHC gives me no tasks and the Lattice Project downloads fail one at a time.

I'm on a new MacBook.

Joan
ID: 81380 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Joan Tarshis

Send message
Joined: 17 Mar 17
Posts: 13
Credit: 11,336
RAC: 0
Message 81381 - Posted: 27 Mar 2017, 21:19:34 UTC - in response to Message 80657.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


I've gotten projects from others but none from Rosetta.



LHC gives me no tasks and the Lattice Project downloads fail one at a time.

I'm on a new MacBook.

Joan
ID: 81381 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 81386 - Posted: 28 Mar 2017, 17:57:21 UTC - in response to Message 81381.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


I've gotten projects from others but none from Rosetta.



LHC gives me no tasks and the Lattice Project downloads fail one at a time.

I'm on a new MacBook.

Joan


I'm not sure what's happening but I'll look into it.
ID: 81386 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 8 · 9 · 10 · 11 · 12 · 13 · 14 . . . 302 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org