Report Problems with Rosetta Version 5.07

Message boards : Number crunching : Report Problems with Rosetta Version 5.07

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

AuthorMessage
Profile arminius

Send message
Joined: 23 Sep 05
Posts: 8
Credit: 805,403
RAC: 0
Message 15531 - Posted: 4 May 2006, 20:27:18 UTC


ID: 15531 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 15550 - Posted: 5 May 2006, 4:45:48 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=18872518
Swap space 1692.22 MB
Total disk space 29.29 GB
Free Disk Space 8.53 GB
---
Use no more than 10 GB disk space
Leave at least 0.01 GB disk space free
Use no more than 50% of total disk space
Write to disk at most every 60 seconds
Use no more than 75% of total virtual memory

----

Generally, when I look at the memory usage on the machine itself, Rosetta is only claiming to use up around 20 megs. None of the partitions have less than 8 gigs free space - so did that WU really eat up the 8.52 gigs of HD space on the C: partition before erroring out?


ID: 15550 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 15555 - Posted: 5 May 2006, 6:41:18 UTC

5/2/2006 5:40:48 PM|rosetta@home|Aborting result JUMPTEST_CLOSECHAINBREAKS_1tul__469_2429_0: exceeded disk limit: 100308693.000000 > 100000000.000000
5/2/2006 5:40:48 PM|rosetta@home|Unrecoverable error for result JUMPTEST_CLOSECHAINBREAKS_1tul__469_2429_0 (Maximum disk usage exceeded)

From the message log, I see that it's whining about going over 100 megs. Where did it get this value from, since I can't see that representing the settings I've chosen for Boinc&Rosetta.
ID: 15555 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jimi@0wned.org.uk

Send message
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 15557 - Posted: 5 May 2006, 8:34:15 UTC

2 WUs with coding errors?

WU 15827716

<core_client_version>5.2.13</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 1048865
# cpu_run_time_pref: 14400
# cpu_run_time_pref: 14400
ERROR:: Exit at: .hbonds.cc line:293

</stderr_txt>

WU 15757279

<core_client_version>5.2.13</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 1767551
# cpu_run_time_pref: 14400
# cpu_run_time_pref: 14400
ID: 15557 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Trog Dog
Avatar

Send message
Joined: 25 Nov 05
Posts: 129
Credit: 57,345
RAC: 0
Message 15559 - Posted: 5 May 2006, 11:41:51 UTC - in response to Message 15278.  


this is the system I am looking at. In the CPU section it shows as a 4 CPU system, but under number of CPUs to use is says 1.

Pardon the intrusion guys, but does'nt the 4 just mean it is a Pentium 4 ?


Yep, its a p4 - number of cpus is only reported as 1 so it doesn't even support hyper threading.
ID: 15559 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jon C Melusky
Avatar

Send message
Joined: 29 Nov 05
Posts: 12
Credit: 192,743
RAC: 103
Message 15587 - Posted: 5 May 2006, 18:30:19 UTC

Hi,

I am running 5 BOINC projects, but here are the Rosetta lines of info. Looks like Rosetta has not worked in 4 days. Or so says the BOINC manager statistics tab.

5/5/2006 5:00:53 AM||Starting BOINC client version 5.2.13 for windows_intelx86
5/5/2006 5:00:53 AM||libcurl/7.14.0 OpenSSL/0.9.8 zlib/1.2.3
5/5/2006 5:00:53 AM||Data directory: C:Program FilesBOINC
5/5/2006 5:00:54 AM||Processor: 1 GenuineIntel Intel(R) Celeron(TM) CPU 1400MHz
5/5/2006 5:00:54 AM||Memory: 382.52 MB physical, 728.66 MB virtual
5/5/2006 5:00:54 AM||Disk: 93.15 GB total, 12.41 GB free
5/5/2006 5:00:54 AM|rosetta@home|Computer ID: 78725; location: home; project prefs: default
5/5/2006 5:00:58 AM|rosetta@home|Deferring computation for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0
5/5/2006 7:30:59 AM|rosetta@home|Restarting result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 using rosetta version 507
5/5/2006 7:35:38 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 ( - exit code -529697949 (0xe06d7363))
5/5/2006 7:35:38 AM|rosetta@home|Computation for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 finished
5/5/2006 7:36:39 AM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
5/5/2006 7:36:39 AM|rosetta@home|Reason: To fetch work
5/5/2006 7:36:39 AM|rosetta@home|Requesting 8640 seconds of new work, and reporting 1 results
5/5/2006 7:36:48 AM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
5/5/2006 7:36:50 AM|rosetta@home|Started download of 1tul_.fasta.gz
5/5/2006 7:36:50 AM|rosetta@home|Started download of 1tul.pdb.gz
5/5/2006 7:36:52 AM|rosetta@home|Finished download of 1tul_.fasta.gz
5/5/2006 7:36:52 AM|rosetta@home|Throughput 476 bytes/sec
5/5/2006 7:36:52 AM|rosetta@home|Finished download of 1tul.pdb.gz
5/5/2006 7:36:52 AM|rosetta@home|Throughput 19178 bytes/sec
5/5/2006 7:36:52 AM|rosetta@home|Started download of 1tul_.psipred_ss2.gz
5/5/2006 7:36:52 AM|rosetta@home|Started download of aa1tul_03_05.200_v1_3.gz
5/5/2006 7:36:54 AM|rosetta@home|Finished download of 1tul_.psipred_ss2.gz
5/5/2006 7:36:54 AM|rosetta@home|Throughput 4763 bytes/sec
5/5/2006 7:36:54 AM|rosetta@home|Started download of aa1tul_09_05.200_v1_3.gz
5/5/2006 7:38:35 AM|rosetta@home|Finished download of aa1tul_03_05.200_v1_3.gz
5/5/2006 7:38:35 AM|rosetta@home|Throughput 13194 bytes/sec
5/5/2006 7:38:35 AM|rosetta@home|Started download of alltopcodes.pdat.gz
5/5/2006 7:38:37 AM|rosetta@home|Finished download of alltopcodes.pdat.gz
5/5/2006 7:38:37 AM|rosetta@home|Throughput 7279 bytes/sec
5/5/2006 7:38:37 AM|rosetta@home|Started download of allbarcodes04.bar.gz
5/5/2006 7:38:43 AM|rosetta@home|Finished download of allbarcodes04.bar.gz
5/5/2006 7:38:43 AM|rosetta@home|Throughput 9792 bytes/sec
5/5/2006 7:39:38 AM|rosetta@home|Finished download of aa1tul_09_05.200_v1_3.gz
5/5/2006 7:39:38 AM|rosetta@home|Throughput 18857 bytes/sec
5/5/2006 7:39:39 AM||request_reschedule_cpus: files downloaded
5/5/2006 7:53:10 AM||request_reschedule_cpus: process exited
5/5/2006 9:39:03 AM|rosetta@home|Starting result JUMP_ALLBARCODE04_1tul__468_8868_0 using rosetta version 507
5/5/2006 9:39:25 AM|rosetta@home|Unrecoverable error for result JUMP_ALLBARCODE04_1tul__468_8868_0 ( - exit code -164 (0xffffff5c))
5/5/2006 9:39:25 AM||request_reschedule_cpus: process exited
5/5/2006 9:39:25 AM|rosetta@home|Computation for result JUMP_ALLBARCODE04_1tul__468_8868_0 finished
5/5/2006 9:40:28 AM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
5/5/2006 9:40:28 AM|rosetta@home|Reason: To fetch work
5/5/2006 9:40:28 AM|rosetta@home|Requesting 8640 seconds of new work, and reporting 1 results
5/5/2006 9:40:44 AM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
5/5/2006 9:40:46 AM|rosetta@home|Started download of 1hz6A.psipred_ss2.gz
5/5/2006 9:40:46 AM|rosetta@home|Started download of aa1hz6A03_05.400_v1_3.gz
5/5/2006 9:40:49 AM|rosetta@home|Finished download of 1hz6A.psipred_ss2.gz
5/5/2006 9:40:49 AM|rosetta@home|Throughput 309 bytes/sec
5/5/2006 9:40:49 AM|rosetta@home|Started download of frags400.txt
5/5/2006 9:41:02 AM|rosetta@home|Finished download of frags400.txt
5/5/2006 9:41:02 AM|rosetta@home|Throughput 91 bytes/sec
5/5/2006 9:41:02 AM|rosetta@home|Started download of 1hz6.pdb.gz
5/5/2006 9:41:09 AM|rosetta@home|Finished download of 1hz6.pdb.gz
5/5/2006 9:41:09 AM|rosetta@home|Throughput 1475 bytes/sec
5/5/2006 9:41:09 AM|rosetta@home|Started download of aa1hz6A09_05.400_v1_3.gz
5/5/2006 9:41:47 AM||request_reschedule_cpus: files downloaded
5/5/2006 9:42:14 AM|rosetta@home|Finished download of aa1hz6A03_05.400_v1_3.gz
5/5/2006 9:42:14 AM|rosetta@home|Throughput 12589 bytes/sec
5/5/2006 9:42:14 AM|rosetta@home|Started download of 1hz6A.fasta
5/5/2006 9:42:16 AM|rosetta@home|Finished download of 1hz6A.fasta
5/5/2006 9:42:16 AM|rosetta@home|Throughput 48 bytes/sec
5/5/2006 9:43:22 AM|rosetta@home|Finished download of aa1hz6A09_05.400_v1_3.gz
5/5/2006 9:43:22 AM|rosetta@home|Throughput 21418 bytes/sec
5/5/2006 9:43:23 AM||request_reschedule_cpus: files downloaded

cheers,

Jonathan
ID: 15587 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Robert Everly

Send message
Joined: 8 Oct 05
Posts: 27
Credit: 665,094
RAC: 0
Message 15590 - Posted: 5 May 2006, 20:33:36 UTC

Two 5.07s died here.

resultid=19196099 died with <core_client_version>5.4.3</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 1535558
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
ERROR:: Exit at: .dock_structure.cc line:401

</stderr_txt>

and

resultid=19101907 died with
<core_client_version>5.4.3</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 3953224
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
ERROR:: Exit at: .hbonds.cc line:293

</stderr_txt>

These are my first errors in a long time, so keep up the good work.
ID: 15590 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rhiju
Volunteer moderator

Send message
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 15597 - Posted: 6 May 2006, 3:46:48 UTC - in response to Message 15488.  

Hi Rebirther and others with "1.04%" after 3 hours or so, please let them run until they go about 4 times your cpu run time preference. (If you haven't set a preference, our default is 3 hours, so let them run 12 hours.) If they're running longer, the jobs should be aborted by the watchdog, but please post here if not!

I have suspend following WU: FA_CASP6_t198__470_5745_0
After 2:13h only 1.04%. Steps increasing very low.
Last entry stdout.txt:
CYCLES::number is 1 x total_residue: 69
initializing full atom coordinates
BOINC :: [2006-05-04 11:46:11] :: checkpoint_decoys() :: saved decoy info :: attempted_decoys: 7 :: num_decoys: 7 :: farlx_stage: 10
dump_fullatom_pdb: farlxcheck
starting score 357.328156 rms 4.70180273
starting full atom minimization
[T/F OPT]Default FALSE value for [-infinite_loop]

Should I running further or abort it? Don`t know how long does it take? Normally 3h for one WU. 200MB RAM usage now.


ID: 15597 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rhiju
Volunteer moderator

Send message
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 15599 - Posted: 6 May 2006, 3:50:59 UTC - in response to Message 15587.  

Hi Jon: thanks for posting. We definitely don't want Rosetta to be dysfunctional on your PC! Can you possibly post here a link to your failed workunits? In the boinc manager, you can hit "Your results" and it will give you the links.

We are now beginning a big push on our test server ralph to track down the final set of bugs in rosetta@home. The app there is getting more debugging machinery added every few days. So if any users out there are seeing repeated failures on rosetta@home (there don't seem to be many -- our error rates are low), please consider attaching your computer to ralph!

Hi,

I am running 5 BOINC projects, but here are the Rosetta lines of info. Looks like Rosetta has not worked in 4 days. Or so says the BOINC manager statistics tab.

5/5/2006 5:00:53 AM||Starting BOINC client version 5.2.13 for windows_intelx86
5/5/2006 5:00:53 AM||libcurl/7.14.0 OpenSSL/0.9.8 zlib/1.2.3
5/5/2006 5:00:53 AM||Data directory: C:Program FilesBOINC
5/5/2006 5:00:54 AM||Processor: 1 GenuineIntel Intel(R) Celeron(TM) CPU 1400MHz
5/5/2006 5:00:54 AM||Memory: 382.52 MB physical, 728.66 MB virtual
5/5/2006 5:00:54 AM||Disk: 93.15 GB total, 12.41 GB free
5/5/2006 5:00:54 AM|rosetta@home|Computer ID: 78725; location: home; project prefs: default
5/5/2006 5:00:58 AM|rosetta@home|Deferring computation for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0
5/5/2006 7:30:59 AM|rosetta@home|Restarting result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 using rosetta version 507
5/5/2006 7:35:38 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 ( - exit code -529697949 (0xe06d7363))
5/5/2006 7:35:38 AM|rosetta@home|Computation for result HBLR_1.0_1ogw_ROT_TRIALS_TRIE_CHECKPOINTS_482_2037_0 finished
5/5/2006 7:36:39 AM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
5/5/2006 7:36:39 AM|rosetta@home|Reason: To fetch work
5/5/2006 7:36:39 AM|rosetta@home|Requesting 8640 seconds of new work, and reporting 1 results
5/5/2006 7:36:48 AM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
5/5/2006 7:36:50 AM|rosetta@home|Started download of 1tul_.fasta.gz
5/5/2006 7:36:50 AM|rosetta@home|Started download of 1tul.pdb.gz
5/5/2006 7:36:52 AM|rosetta@home|Finished download of 1tul_.fasta.gz
5/5/2006 7:36:52 AM|rosetta@home|Throughput 476 bytes/sec
5/5/2006 7:36:52 AM|rosetta@home|Finished download of 1tul.pdb.gz
5/5/2006 7:36:52 AM|rosetta@home|Throughput 19178 bytes/sec
5/5/2006 7:36:52 AM|rosetta@home|Started download of 1tul_.psipred_ss2.gz
5/5/2006 7:36:52 AM|rosetta@home|Started download of aa1tul_03_05.200_v1_3.gz
5/5/2006 7:36:54 AM|rosetta@home|Finished download of 1tul_.psipred_ss2.gz
5/5/2006 7:36:54 AM|rosetta@home|Throughput 4763 bytes/sec
5/5/2006 7:36:54 AM|rosetta@home|Started download of aa1tul_09_05.200_v1_3.gz
5/5/2006 7:38:35 AM|rosetta@home|Finished download of aa1tul_03_05.200_v1_3.gz
5/5/2006 7:38:35 AM|rosetta@home|Throughput 13194 bytes/sec
5/5/2006 7:38:35 AM|rosetta@home|Started download of alltopcodes.pdat.gz
5/5/2006 7:38:37 AM|rosetta@home|Finished download of alltopcodes.pdat.gz
5/5/2006 7:38:37 AM|rosetta@home|Throughput 7279 bytes/sec
5/5/2006 7:38:37 AM|rosetta@home|Started download of allbarcodes04.bar.gz
5/5/2006 7:38:43 AM|rosetta@home|Finished download of allbarcodes04.bar.gz
5/5/2006 7:38:43 AM|rosetta@home|Throughput 9792 bytes/sec
5/5/2006 7:39:38 AM|rosetta@home|Finished download of aa1tul_09_05.200_v1_3.gz
5/5/2006 7:39:38 AM|rosetta@home|Throughput 18857 bytes/sec
5/5/2006 7:39:39 AM||request_reschedule_cpus: files downloaded
5/5/2006 7:53:10 AM||request_reschedule_cpus: process exited
5/5/2006 9:39:03 AM|rosetta@home|Starting result JUMP_ALLBARCODE04_1tul__468_8868_0 using rosetta version 507
5/5/2006 9:39:25 AM|rosetta@home|Unrecoverable error for result JUMP_ALLBARCODE04_1tul__468_8868_0 ( - exit code -164 (0xffffff5c))
5/5/2006 9:39:25 AM||request_reschedule_cpus: process exited
5/5/2006 9:39:25 AM|rosetta@home|Computation for result JUMP_ALLBARCODE04_1tul__468_8868_0 finished
5/5/2006 9:40:28 AM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
5/5/2006 9:40:28 AM|rosetta@home|Reason: To fetch work
5/5/2006 9:40:28 AM|rosetta@home|Requesting 8640 seconds of new work, and reporting 1 results
5/5/2006 9:40:44 AM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
5/5/2006 9:40:46 AM|rosetta@home|Started download of 1hz6A.psipred_ss2.gz
5/5/2006 9:40:46 AM|rosetta@home|Started download of aa1hz6A03_05.400_v1_3.gz
5/5/2006 9:40:49 AM|rosetta@home|Finished download of 1hz6A.psipred_ss2.gz
5/5/2006 9:40:49 AM|rosetta@home|Throughput 309 bytes/sec
5/5/2006 9:40:49 AM|rosetta@home|Started download of frags400.txt
5/5/2006 9:41:02 AM|rosetta@home|Finished download of frags400.txt
5/5/2006 9:41:02 AM|rosetta@home|Throughput 91 bytes/sec
5/5/2006 9:41:02 AM|rosetta@home|Started download of 1hz6.pdb.gz
5/5/2006 9:41:09 AM|rosetta@home|Finished download of 1hz6.pdb.gz
5/5/2006 9:41:09 AM|rosetta@home|Throughput 1475 bytes/sec
5/5/2006 9:41:09 AM|rosetta@home|Started download of aa1hz6A09_05.400_v1_3.gz
5/5/2006 9:41:47 AM||request_reschedule_cpus: files downloaded
5/5/2006 9:42:14 AM|rosetta@home|Finished download of aa1hz6A03_05.400_v1_3.gz
5/5/2006 9:42:14 AM|rosetta@home|Throughput 12589 bytes/sec
5/5/2006 9:42:14 AM|rosetta@home|Started download of 1hz6A.fasta
5/5/2006 9:42:16 AM|rosetta@home|Finished download of 1hz6A.fasta
5/5/2006 9:42:16 AM|rosetta@home|Throughput 48 bytes/sec
5/5/2006 9:43:22 AM|rosetta@home|Finished download of aa1hz6A09_05.400_v1_3.gz
5/5/2006 9:43:22 AM|rosetta@home|Throughput 21418 bytes/sec
5/5/2006 9:43:23 AM||request_reschedule_cpus: files downloaded

cheers,

Jonathan


ID: 15599 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rebirther
Avatar

Send message
Joined: 17 Sep 05
Posts: 116
Credit: 41,315
RAC: 0
Message 15601 - Posted: 6 May 2006, 7:47:22 UTC - in response to Message 15597.  

Hi Rebirther and others with "1.04%" after 3 hours or so, please let them run until they go about 4 times your cpu run time preference. (If you haven't set a preference, our default is 3 hours, so let them run 12 hours.) If they're running longer, the jobs should be aborted by the watchdog, but please post here if not!

I have suspend following WU: FA_CASP6_t198__470_5745_0
After 2:13h only 1.04%. Steps increasing very low.
Last entry stdout.txt:
CYCLES::number is 1 x total_residue: 69
initializing full atom coordinates
BOINC :: [2006-05-04 11:46:11] :: checkpoint_decoys() :: saved decoy info :: attempted_decoys: 7 :: num_decoys: 7 :: farlx_stage: 10
dump_fullatom_pdb: farlxcheck
starting score 357.328156 rms 4.70180273
starting full atom minimization
[T/F OPT]Default FALSE value for [-infinite_loop]

Should I running further or abort it? Don`t know how long does it take? Normally 3h for one WU. 200MB RAM usage now.



Don`t worry Rhiju, I have finished this larger WU in 3h ;)
ID: 15601 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mmn

Send message
Joined: 11 Mar 06
Posts: 1
Credit: 23,115
RAC: 0
Message 15603 - Posted: 6 May 2006, 11:11:02 UTC

MODEL 1 STEP 0
cpu time : 7 min

created 6 May 2006 7:14:58 UTC
name AB_CASP6_t216__486_401
ID: 15603 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Knorr

Send message
Joined: 18 Feb 06
Posts: 21
Credit: 373,953
RAC: 0
Message 15604 - Posted: 6 May 2006, 11:26:15 UTC

Got an exit code 0x1 on this WU:

HBLR_1.0_1n0u_ROT_TRIALS_TRIE_CHECKPOINTS_482_4843_0

Just came out of the blue. Not while rescheduling etc.
ID: 15604 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jon C Melusky
Avatar

Send message
Joined: 29 Nov 05
Posts: 12
Credit: 192,743
RAC: 103
Message 15613 - Posted: 6 May 2006, 17:08:18 UTC

>>>Hi Jon: thanks for posting. We definitely don't want Rosetta to be dysfunctional on your PC! Can you possibly post here a link to your failed workunits? In the boinc manager, you can hit "Your results" and it will give you the links.

We are now beginning a big push on our test server ralph to track down the final set of bugs in rosetta@home. The app there is getting more debugging machinery added every few days. So if any users out there are seeing repeated failures on rosetta@home (there don't seem to be many -- our error rates are low), please consider attaching your computer to ralph!>>>

Hi Rhiju,

Here is the link to my failed work units. Looks like I have had one Rosetta success since April 22nd. I am running XP Home. HP Presario 6000. 384 Ram. Rosetta gets 20% like all my projects do.

I don't know what ralph is. I am attached to 4 of the other main BOINC projects. They run fine. Well, lately they have. (^:

https://boinc.bakerlab.org/rosetta/results.php?userid=23144

cheers,

Jonathan
ID: 15613 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 15616 - Posted: 6 May 2006, 18:06:01 UTC - in response to Message 15613.  
Last modified: 6 May 2006, 18:08:43 UTC

...I don't know what ralph is. I am attached to 4 of the other main BOINC projects. They run fine. Well, lately they have. (^:

https://boinc.bakerlab.org/rosetta/results.php?userid=23144

cheers,

Jonathan

Ralph is the Alpha test project for Rosetta. It is located Here.

Use the homepage as a URL for BOINC when it askes for a project URL during the attach process. What Rhiju is suggesting is that if you attach to Ralph we can help find out what is going wrong on your system. Also this will help find any errors on the application.

Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 15616 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Snake Doctor
Avatar

Send message
Joined: 17 Sep 05
Posts: 182
Credit: 6,401,938
RAC: 0
Message 15631 - Posted: 6 May 2006, 22:07:50 UTC
Last modified: 6 May 2006, 22:10:07 UTC

Well I don't get many errors but I have two within just a few moments of each other. Mac g4 Dual, 1GB of memory. It looked like BOINC tried to start running them before they finished downloading.

here and here

the errors were both -
<core_client_version>5.4.9</core_client_version>
<message>
Couldn't start or resume: -146
</message>

ID: 15631 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bones

Send message
Joined: 16 Sep 05
Posts: 3
Credit: 713,317
RAC: 0
Message 15633 - Posted: 7 May 2006, 1:36:51 UTC

resultid=19261039.

This one hasn't yet failed, but acted strangely in that it was at 0.00% complete after 2 hours and the cpu usage was also 2% (normally 100%) even though the wu was supposedly running. I restarted boinc (5.2.13) and the progress jumped to 66.01% and now appears to be running ok. Not sure if this problem is rosetta app or boinc causing the issue, but thought i'd let you know anyway.

ID: 15633 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 15635 - Posted: 7 May 2006, 3:15:03 UTC - in response to Message 15633.  

Not sure if this problem is rosetta app or boinc causing the issue, but thought i'd let you know anyway.


if it happens again, you might take a look at "task manager" to see what's taking up the other percentage of cpu usage. Maybe some other task was running?
ID: 15635 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Nightbird

Send message
Joined: 17 Sep 05
Posts: 70
Credit: 32,418
RAC: 0
Message 15640 - Posted: 7 May 2006, 8:05:41 UTC
Last modified: 7 May 2006, 8:08:58 UTC

Got after i rebooted my machine : (0x1) - exit code 1 (0x1)

FACONTACTS_NOFILTERS_1vie__441_93_1

stderr out <core_client_version>4.32</core_client_version>
<message>Fonction incorrecte. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 3203608
# cpu_run_time_pref: 21600

</stderr_txt>


Validate state Invalid

https://boinc.bakerlab.org/rosetta/result.php?resultid=18716104






ID: 15640 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
R/B

Send message
Joined: 8 Dec 05
Posts: 195
Credit: 28,095
RAC: 0
Message 15644 - Posted: 7 May 2006, 9:14:50 UTC

Result ID 19329923
Name JUMP_CLOSE_CHAINBREAK_ALLBARCODE_1q7sA_SAVE_ALL_OUT_472_6930_0
Workunit 16022107
Created 6 May 2006 3:45:09 UTC
Sent 6 May 2006 7:44:35 UTC
Received 7 May 2006 8:15:56 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status 0 (0x0)
Computer ID 92884
Report deadline 20 May 2006 7:44:35 UTC
CPU time 19425.453125
stderr out <core_client_version>5.2.13</core_client_version>
<stderr_txt>


ID: 15644 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Snake Doctor
Avatar

Send message
Joined: 17 Sep 05
Posts: 182
Credit: 6,401,938
RAC: 0
Message 15649 - Posted: 7 May 2006, 15:30:02 UTC
Last modified: 7 May 2006, 15:32:37 UTC

All three on Mac OS 10.4.6, Dual G4, 1 GB memory. BOINC Ver 5.4.9, Rosetta 5.07.

AB_CASP6_t272__486_1242_0 - this WU Was killed by watchdog.
HBLR_1.0_1dtj_RDFLAGS_473_8871_0 - this WU failed almost on arrival.
HBLR_1.0_1mky_ROT_TRIALS_TRIE_CHECKPOINTS_482_7412_0 - This Wu Failed almost on arrival.
ID: 15649 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

Message boards : Number crunching : Report Problems with Rosetta Version 5.07



©2024 University of Washington
https://www.bakerlab.org