Message boards : Number crunching : Report Problems with Rosetta Version 5.24
Previous · 1 · 2 · 3
Author | Message |
---|---|
Dimitris Hatzopoulos Send message Joined: 5 Jan 06 Posts: 336 Credit: 80,939 RAC: 0 |
Rom, thanks for the feedback. Although I don't know how the current system works (what are the "groups" of jobs sent, e.g. jobs needing 256, 512, 768, 1G? memory ) it seems it'd help to split the queue as you suggest to make sure there are always small jobs available. Apparently many people get this message "there was work, but your PC has less RAM than needed", see e.g. posts by Carlos (a very small percentage of users posts here). Best UFO Resources Wikipedia R@h How-To: Join Distributed Computing projects that benefit humanity |
Fuzzy Hollynoodles Send message Joined: 7 Oct 05 Posts: 234 Credit: 15,020 RAC: 0 |
This WU crashed on restart after being preempted. https://boinc.bakerlab.org/rosetta/workunit.php?wuid=21898739 Result: https://boinc.bakerlab.org/rosetta/result.php?resultid=25804159 It almost gave me a heartattack, I thought my harddisk had crashed! It sounded like that for about 5 minutes untill I exit'ed the BOINC manager and then I realized the harddisk was safe. PHEW!!!! I had a harddisk crash a little more than a year ago and I'll never forget that sound it gives, when this happens! So please don't do this to me again! :-( [b]"I'm trying to maintain a shred of dignity in this world." - Me[/b] |
Frisch Send message Joined: 5 Apr 06 Posts: 4 Credit: 133,315 RAC: 0 |
Just reported some finished jobs in, but one of them didn't get any credit, as it said, "too many jobs reported" never seen this one before. It was reported as a succes, but 0 credit granted. Result ID 25727160 Name t307__CASP7_ABRELAX_SAVE_ALL_OUT_BARCODE_hom001__714_30318_2 Workunit 20880206 Created 25 Jun 2006 7:45:32 UTC Sent 25 Jun 2006 14:39:07 UTC Received 27 Jun 2006 14:21:50 UTC Server state Over Outcome Success Client state Done Exit status 0 (0x0) Computer ID 212157 Report deadline 2 Jul 2006 14:39:07 UTC CPU time 2944.1875 stderr out <core_client_version>5.5.0</core_client_version> <stderr_txt> # random seed: 1551958 # cpu_run_time_pref: 3600 # DONE :: 1 starting structures built 2 (nstruct) times # This process generated 2 decoys from 2 attempts BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> Validate state Workunit error - check skipped Claimed credit 26.2700330744348 Granted credit 0 application version 5.24 |
Astro Send message Joined: 2 Oct 05 Posts: 987 Credit: 500,253 RAC: 0 |
Validate state Workunit error - check skipped This wu errored out for some reason. You're puters are hidden so I can look it up. You should get credit when they run the daily script. The credit won't show on the "Results Page", but will show on the "result ID" page. hope this helps tony |
Divide Overflow Send message Joined: 17 Sep 05 Posts: 82 Credit: 921,382 RAC: 0 |
Is a .pdb file available for download for the 5.24 version of the Rosetta application? I’m only able to find an older version posted online. |
Fuzzy Hollynoodles Send message Joined: 7 Oct 05 Posts: 234 Credit: 15,020 RAC: 0 |
Is a .pdb file available for download for the 5.24 version of the Rosetta application? I’m only able to find an older version posted online. I think it comes with the WU's now. [b]"I'm trying to maintain a shred of dignity in this world." - Me[/b] |
Brian B Send message Joined: 11 Dec 05 Posts: 3 Credit: 10,681 RAC: 0 |
Welcome, Brian Bowles. System Idle Process of zero is a cruncher's goal. A project that runs nicely in the background the way Rosetta does is a dream. Thanks Cureseekers, skutnar, tralala, and Vester for the feedback. My system was running correctly up until the recent batch of releases. I noticed recently that when R@H is running it would not release the processor for local work. I will try and answer the above questions.
|
Brian B Send message Joined: 11 Dec 05 Posts: 3 Credit: 10,681 RAC: 0 |
Sorry for the long post.... |
BennyRop Send message Joined: 17 Dec 05 Posts: 555 Credit: 140,800 RAC: 0 |
Brian: If you use Boinc 5.4.9, does it operate as it used to? |
Nightbird Send message Joined: 17 Sep 05 Posts: 70 Credit: 32,418 RAC: 0 |
wu : t312__CASP7_JUMPRELAX_SAVE_ALL_OUT_BARCODE_hom009__711_8478 estimated completion time : + 41 days Though the wu is suspended, it goes on to run in the background (slowly). cpu time : 9h 55 min xx sec. (always increasing) % done : 1 % |
Frisch Send message Joined: 5 Apr 06 Posts: 4 Credit: 133,315 RAC: 0 |
Did it again, it's in the process, when i report more than one job. It doesn't matter if it's 2 or 12 i send, it leaves one reported without credits, with the reason "too many total results" under work unit ID Link to PC https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=212157 Link to site with job listed https://boinc.bakerlab.org/rosetta/results.php?hostid=212157&offset=20 25941992 Name t312__CASP7_JUMPRELAX_SAVE_ALL_OUT_BARCODE_hom006__711_5945_2 Workunit 21099883 Created 26 Jun 2006 21:22:35 UTC Sent 27 Jun 2006 0:03:28 UTC Received 28 Jun 2006 0:00:59 UTC Server state Over Outcome Success Client state Done Exit status 0 (0x0) Computer ID 212157 Report deadline 4 Jul 2006 0:03:28 UTC CPU time 2651.671875 stderr out <core_client_version>5.5.0</core_client_version> <stderr_txt> # random seed: 2136331 # cpu_run_time_pref: 3600 # DONE :: 1 starting structures built 2 (nstruct) times # This process generated 2 decoys from 2 attempts BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> Validate state Workunit error - check skipped Claimed credit 23.6600107360005 Granted credit 0 application version 5.24 |
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0 |
Nightbird I'd say you need to restart BOINC on that PC. And if the crunching threads don't stop, then I'd reboot it. If the WU runs again for 2hrs without progressing beyond 1%, I'd abort that WU. Also, I note you are running an older version of BOINC. I had similar issues where I'd suspend in BOINC Manager but the crunching thread wouldn't respond. But they seem to have been resolved by the current BOINC releases. You can reference info. on how to get the new release in this QA to check your release, and this QA with download info. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ |
Keith Akins Send message Joined: 22 Oct 05 Posts: 176 Credit: 71,779 RAC: 0 |
Error Result For WU ID: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=22091481 |
Keith Akins Send message Joined: 22 Oct 05 Posts: 176 Credit: 71,779 RAC: 0 |
UPDATE: Result ID 26029052 Name t312__CASP7_ABINITIO_SAVE_ALL_OUT_BARCODE_hom007__812_931_1 Workunit 22091481 Created 27 Jun 2006 12:37:22 UTC Sent 27 Jun 2006 14:56:35 UTC Received 28 Jun 2006 17:25:04 UTC Server state Over Outcome Client error Client state Computing Exit status -1073741819 (0xc0000005) Computer ID 253124 Report deadline 4 Jul 2006 14:56:35 UTC CPU time 23627.59375 stderr out <core_client_version>5.4.9</core_client_version> <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> # cpu_run_time_pref: 28800 # random seed: 2908212 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x008C0DD1 write attempt to address 0x25707280 Engaging BOINC Windows Runtime Debugger... ******************** BOINC Windows Runtime Debugger Version 5.5.0 |
Ubaida Send message Joined: 9 Jun 06 Posts: 3 Credit: 206,886 RAC: 0 |
<core_client_version>5.4.9</core_client_version> <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> # random seed: 2957681 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x0060AAA8 read attempt to address 0x7D7427F9 https://boinc.bakerlab.org/rosetta/result.php?resultid=26281886 |
Mats Petersson Send message Joined: 29 Sep 05 Posts: 225 Credit: 951,788 RAC: 0 |
One interesting thing to note here is that both of these errors are caused by code trying to read/write to address that aren't valid memory addresses, and have something in common too: They consist of ASCII-text... Perhaps some function is not keeping it's text strings within their correct bounds? Of course, it could be completely random that they look like text, but in my experience that is NOT the case. Ubaida's address is: "?'t}" - the question mark is an unknown letter - because the code is most likely accessing an offset away from the address that got overwritten by the string. Keith's address is: "?rp%", again, the first (lowest byte) is unknown, as it's most likely an offset. Theoretically, the offset may be bigger than a byte so the letters further in would possibly also be affected [one likely scenario is that the "'" in Ubaida's text is actually a percent character, if the code is attempting to go 0x200+ bytes into a struct, which isn't entirely unlikely - a 512+ byte struct is not at all unlikely, but of course many common data structures are smaller than this... -- Mats |
Thalus Send message Joined: 1 Jun 06 Posts: 1 Credit: 1,893 RAC: 0 |
<core_client_version>5.3.6</core_client_version> <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> # random seed: 2950187 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x0060AAA8 read attempt to address 0x757D40C9 Engaging BOINC Windows Runtime Debugger... ******************** BOINC Windows Runtime Debugger Version 5.5.0 ____________________________________________________________________ 4 times the same failure at the moment... |
Ubaida Send message Joined: 9 Jun 06 Posts: 3 Credit: 206,886 RAC: 0 |
got another two of those errors https://boinc.bakerlab.org/rosetta/result.php?resultid=26296697 https://boinc.bakerlab.org/rosetta/result.php?resultid=26297155 |
anders n Send message Joined: 19 Sep 05 Posts: 403 Credit: 537,991 RAC: 0 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=25777930 - exit code -1073741819 (0xc0000005) Anders n |
Message boards :
Number crunching :
Report Problems with Rosetta Version 5.24
©2024 University of Washington
https://www.bakerlab.org