Message boards : Number crunching : Computation Error
Author | Message |
---|---|
Tom Lee Send message Joined: 10 Dec 05 Posts: 5 Credit: 129,728 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. |
Moderator9 Volunteer moderator Send message Joined: 22 Jan 06 Posts: 1014 Credit: 0 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. Check out this FAQ in the FAQs thread Moderator9 ROSETTA@home FAQ Moderator Contact |
Tom Lee Send message Joined: 10 Dec 05 Posts: 5 Credit: 129,728 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. The status specifically says "computation error". I can't find this in the FAQ. So what's happened when I get this message? |
Moderator9 Volunteer moderator Send message Joined: 22 Jan 06 Posts: 1014 Credit: 0 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. "Computation error" in the work display of BOINC is a catch all term used for a number of different problems. If you look at the workunit in you stats page, you will find the "exception" errors in the errors reported for the WUs. What I am seeing is that with the exception of one of your errors, other systems have also failed to complete the same work units. This tells me that it may be a work unit issue. You really do not have that many errors. and the one that is unique to your system may also turn out to be work unit related but the other system that it was issued to has not reported back yet. Based on what I saw on your stats page, I refered you to that particular FAQ. It is possible that there was some transient probelm on your system that has caused the one error that does not look like a WU issue. Your system does seem marginal on memory, you might try increasing your swap time if you are running more than one project and see if that helps. Moderator9 ROSETTA@home FAQ Moderator Contact |
Morphy375 Send message Joined: 2 Nov 05 Posts: 86 Credit: 1,629,758 RAC: 0 |
I have several machines with 256MB running Rosetta without any problems..... Teddies.... |
Moderator9 Volunteer moderator Send message Joined: 22 Jan 06 Posts: 1014 Credit: 0 RAC: 0 |
I have several machines with 256MB running Rosetta without any problems..... It is certainly possible to run Rosetta with less than the recommended minimum system configuration. However, this can cause errors and Wu failures. One related issue would be page faults which can also cause Rosetta to fail. EDIT: I just found this post from David Kim. Many of the errors you have are for this type of WU. The Batch has been cancelled. This should help your situation. Moderator9 ROSETTA@home FAQ Moderator Contact |
AMD_is_logical Send message Joined: 20 Dec 05 Posts: 299 Credit: 31,460,681 RAC: 0 |
I have several machines with 256MB running Rosetta without any problems..... 256MB is fine for a dedicated cruncher. The 512MB recomendation is for the typical type of computer running rosetta, meaning one with some other purpose and where rosetta is supposed to quietly eat spare cycles without affecting performance. If the system has 512MB+ memory installed then the amount rosetta uses will be too small a fraction of that to have a significant impact on the system's responsiveness. |
[B@H] Ray Send message Joined: 20 Sep 05 Posts: 118 Credit: 100,251 RAC: 0 |
I have several machines with 256MB running Rosetta without any problems..... Really should go with 512 Meg or more for Rosetta. My HD started running hotter part the time Rosetta was running at first, increased ram from 512 to 1024 Megs and it colled off again. Run times also dropped by about 10%. 256 Megs will work but hit the swapp file to often and will burn out the HD. Pizza@Home Rays Place Rays place Forums |
Tom Lee Send message Joined: 10 Dec 05 Posts: 5 Credit: 129,728 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. Thanks for the reply and the time you took to look-up my stats. It's nice to know that it's probably not my machine, but it is discouraging to see the computer time go down the drain. I'm also not sure what you mean by increasing the swap time. If you find time in your busy schedule, maybe you can fill me in. Thanks, Tom. |
Whl. Send message Joined: 29 Dec 05 Posts: 203 Credit: 275,802 RAC: 0 |
Ray Brown wrote:
Putting the swap file on a second physical drive helps a bit also. |
Astro Send message Joined: 2 Oct 05 Posts: 987 Credit: 500,253 RAC: 0 |
I'm also not sure what you mean by increasing the swap time. If you find time in your busy schedule, maybe you can fill me in. Thanks, Tom. I'm not sure what he means but he may mean the "write to disk at most" setting. Below are the two possibly relevant setting in your "general preferences". (These are from my preference) Processor usage Switch between applications every (recommended: 60 minutes) 180 minutes Disk and memory usage Write to disk at most every 600 seconds |
Snake Doctor Send message Joined: 17 Sep 05 Posts: 182 Credit: 6,401,938 RAC: 0 |
I'm having more and more wu that end-up with a computation error status. I've had 15 hours worth in the last 24. I'm not sure what this means. My computer can't add, subtract, and divide, or it's not following program instructions? Do I have a rogue CPU? Could someone enlighten me - I can't find mention of it on the message boards. Thanks. I think he means the time between application swaps. There is a setting for that in the prefs. I think the deafult is every 60 min, but you can set it to other amounts of time. We Must look for intelligent life on other planets as, it is becoming increasingly apparent we will not find any on our own. |
Beezlebub Send message Joined: 18 Oct 05 Posts: 40 Credit: 260,375 RAC: 0 |
This will increase memory usage and make the program need more swap space which will mean slower wu times as writing to disk is vastly slower than RAM. The best solution for faster wu times is more memory, how much is up to you but 512 is the least I would use. If you still want more swap space go: start, settings, control panel, system, advanced, performance, advanced, virtual memory and adjust accordingly. Keep in mind any increase in hard disk activity decreases lifetime so the more RAM you have the better. e6600 quad @ 2.5ghz 2418 floating point 5227 integer e6750 dual @ 3.71ghz 3598 floating point 7918 integer |
AXEL 70 Send message Joined: 12 Dec 05 Posts: 1 Credit: 101,212 RAC: 0 |
excuse me for my english not very good (I'm Italian)... the last WU's that I download (HBLR_1.0_... series) give me all a computation error : |rosetta@home|Unrecoverable error for result HBLR_1.0_1hz6_314_44_0 ( - exit code -164 (0xffffff5c)) |rosetta@home|Unrecoverable error for result HBLR_1.0_1hz6_314_40_0 ( - exit code -164 (0xffffff5c)) |rosetta@home|Unrecoverable error for result HBLR_1.0_1b72_314_76_0 ( - exit code -164 (0xffffff5c)) and that after 6:27, 1:45 and 0:50 hours ... why? it's the first time that it happen thank you for the answer (please answer simply) p.s.: the client is 4.82 |
Message boards :
Number crunching :
Computation Error
©2024 University of Washington
https://www.bakerlab.org