Report stuck & aborted 5.01 WU here please - III

Message boards : Number crunching : Report stuck & aborted 5.01 WU here please - III

[DPC]NGS~StugIII

Joined: 8 Mar 06
Posts: 2
Credit: 58,616
RAC: 0
Message 14669 - Posted: 26 Apr 2006, 14:02:27 UTC - in response to Message 14662.  

Aborted FACONTACTS_RECENTER_NOFILTERS_256bA_448_973 after 24 hours.

Claimed credit 421.03

I didn't want to wait longer. It is 24 hours of lost computation time and I think that's a lot.
Honza

Joined: 18 Sep 05
Posts: 48
Credit: 173,517
RAC: 0
Message 14673 - Posted: 26 Apr 2006, 14:49:55 UTC

This result ID got only to 1.356 after 15 hours on an X2; a manual abort is in order, I guess.
https://boinc.bakerlab.org/rosetta/result.php?resultid=18214281
On another machine that had the same WU, it was automatically aborted with a -161 error in less than one hour...
John McCallum
Joined: 8 Jan 06
Posts: 12
Credit: 7,841,240
RAC: 5,621
Message 14677 - Posted: 26 Apr 2006, 15:48:56 UTC


26/04/2006 16:31:29|rosetta@home|Unrecoverable error for result FACONTACTS_RECENTER_NOFILTERS_1tul__448_830_1 (aborted via GUI RPC)
This had only done 1.9% after 20:23 (73380+ sec). Hope I have seen the last of this one :) I did a back-of-the-envelope calc: at the current rate of % done increase it "might" have finished after 900+ hours :) Although it might be that only the phase 1 part of the work unit needs fixing: from a quick scan of the postings, all/most of them do not seem to get past it.
If you can't take a joke you should never have joined.
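The back-of-the-envelope estimate in the post above can be reproduced with a simple linear extrapolation. This is only a sketch: it assumes progress stays at the same rate for the whole run, which a stuck work unit may well not do, and the function name `estimate_total_hours` is made up for illustration.

```python
def estimate_total_hours(elapsed_seconds: float, percent_done: float) -> float:
    """Linear extrapolation: total run time in hours if progress keeps its current rate."""
    if percent_done <= 0:
        raise ValueError("no progress yet, cannot extrapolate")
    elapsed_hours = elapsed_seconds / 3600.0
    return elapsed_hours / (percent_done / 100.0)


# Figures quoted in the post: 1.9% done after 73380 seconds (20:23).
elapsed = 73380
total = estimate_total_hours(elapsed, 1.9)
remaining = total - elapsed / 3600.0
print(f"estimated total: {total:.0f} h, remaining: {remaining:.0f} h")
```

With those figures the estimate comes out a little over 1,000 hours, which is in line with the "900+ hours" quoted.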
milw0rm

Joined: 10 Dec 05
Posts: 22
Credit: 6,212,738
RAC: 0
Message 14679 - Posted: 26 Apr 2006, 15:55:51 UTC

I have just checked two machines here and I too have had 2 units stuck, at 116 hours and 102 hours!

I stupidly did not record the unit numbers! :(
Rebel Alliance

Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14684 - Posted: 26 Apr 2006, 16:19:40 UTC

Result ID 17981175
Name FACONTACTS_RECENTER_NOFILTERS_1wit__448_181_1
Workunit 14525452
Created 23 Apr 2006 2:00:28 UTC
Sent 23 Apr 2006 9:28:17 UTC
Received 26 Apr 2006 16:11:30 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 107679
Report deadline 7 May 2006 9:28:17 UTC
CPU time 83730.416483
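For anyone reading these result details: the two numbers on the "Exit status" line appear to be the same value, once as a signed integer and once as hex. The sketch below assumes the hex form is the 32-bit two's complement of the decimal, which is consistent with the values shown, and also converts the CPU time (taken to be seconds) into hours. The helper names are hypothetical.

```python
import re
from typing import Tuple


def parse_exit_status(line: str) -> Tuple[int, int]:
    """Parse a line like 'Exit status -197 (0xffffff3b)' into (signed, hex_value)."""
    m = re.fullmatch(r"Exit status (-?\d+) \(0x([0-9a-fA-F]+)\)", line.strip())
    if m is None:
        raise ValueError(f"unrecognised exit status line: {line!r}")
    return int(m.group(1)), int(m.group(2), 16)


def to_unsigned32(value: int) -> int:
    """Reinterpret a signed integer as an unsigned 32-bit value (two's complement)."""
    return value & 0xFFFFFFFF


signed, hex_value = parse_exit_status("Exit status -197 (0xffffff3b)")
# Assumption being checked: the hex is just the 32-bit form of the decimal.
assert to_unsigned32(signed) == hex_value

cpu_seconds = 83730.416483  # "CPU time" from the result above, assumed to be seconds
print(f"{signed} -> {to_unsigned32(signed):#010x}, CPU time {cpu_seconds / 3600:.1f} h")
```

So the result above ran for roughly 23 hours of CPU time before it ended with that exit status.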
EdMulock
Joined: 14 Mar 06
Posts: 30
Credit: 2,347,485
RAC: 0
Message 14689 - Posted: 26 Apr 2006, 17:30:23 UTC

Work unit 12214678 aborted after 35 hours, showing 4% completion.

Claimed credit 430.094033921575
[DPC]TeamGrazzie~APCIII

Joined: 17 Mar 06
Posts: 1
Credit: 271,636
RAC: 0
Message 14700 - Posted: 26 Apr 2006, 20:25:08 UTC

Aborted HBLR_1.0_1hz6_420_9212 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=13425964 on computer https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=184836.

The WU jumps back to "Ab initio" after model 1, at around step 34500 (full atom relax).
This repeats every time the WU passes step 345xx.

Result ID https://boinc.bakerlab.org/rosetta/result.php?resultid=18221537
[DPC]Division_Brabant~OldButNotSoWise
Joined: 23 Jan 06
Posts: 42
Credit: 371,797
RAC: 0
Message 14701 - Posted: 26 Apr 2006, 20:55:44 UTC
Last modified: 26 Apr 2006, 20:57:39 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=17773392
Maximum CPU time exceeded.
That's no fun: more than 5 days of crunching and suddenly it errors out :(

*peep* happens, just joking, that's the risk when you're crunching :)

This one has given me beautiful red dots in the graphics :D
Los Alcoholicos~La Muis
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 14704 - Posted: 26 Apr 2006, 22:16:38 UTC
Last modified: 26 Apr 2006, 22:21:42 UTC

Aborted HBLR_1.0_1mky_ROT_TRIALS_TRIE_449_22_0

cpu time 45:16 at 8,55%


And another one HBLR_1.0_1di2_420_4823_1

cpu time 55:25 at 33,92%
KWSN - Sir Brian - err sorry - wrong film!
Joined: 23 Feb 06
Posts: 1
Credit: 353,945
RAC: 0
Message 14705 - Posted: 26 Apr 2006, 22:18:14 UTC

Here ye go a few from me to add to the list...


https://boinc.bakerlab.org/rosetta/result.php?resultid=18309752

https://boinc.bakerlab.org/rosetta/result.php?resultid=18068362

https://boinc.bakerlab.org/rosetta/result.php?resultid=17878733


brianwc

Joined: 7 Dec 05
Posts: 3
Credit: 701,894
RAC: 0
Message 14718 - Posted: 27 Apr 2006, 3:50:14 UTC

Aborted work unit: 14555727

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=14555727

Three people errored-out on this one.
Bones

Joined: 16 Sep 05
Posts: 3
Credit: 713,317
RAC: 0
Message 14728 - Posted: 27 Apr 2006, 7:19:17 UTC
Last modified: 27 Apr 2006, 7:19:42 UTC

And another one 13331599 going really slowly at 3.02% and aborted after 15 hours.
msr-berlin

Joined: 28 Nov 05
Posts: 2
Credit: 8,058
RAC: 0
Message 14730 - Posted: 27 Apr 2006, 7:27:02 UTC

Aborted the following WU PROD_ABINITIO_1tul__447_80279
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=15073495

0.0% after 22 hours of work

anders n
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 14740 - Posted: 27 Apr 2006, 10:25:54 UTC

Stuck at 1% and with several "red dots" on the graphics.

https://boinc.bakerlab.org/rosetta/result.php?resultid=18246367

Anders n
mind

Joined: 20 Feb 06
Posts: 1
Credit: 50,095
RAC: 0
Message 14742 - Posted: 27 Apr 2006, 11:04:19 UTC - in response to Message 14740.  
Last modified: 27 Apr 2006, 11:04:40 UTC

FACONTACTS_RECENTER_NOFILTER_1cei__448_591_2.

running 60 hours, aborted.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=14553294
https://boinc.bakerlab.org/rosetta/result.php?resultid=18081512

(Not sure which link I had to post, so I posted both.)
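On the question of which link to post: the two URLs point at different pages, one for the work unit (`workunit.php?wuid=...`) and one for a single result returned for it (`result.php?resultid=...`). A small sketch of building either link from its ID follows; the base URL and query parameter names are copied from the links in this thread, while the helper names are made up for illustration.

```python
from urllib.parse import urlencode

# Base URL as it appears in the links posted in this thread.
BASE = "https://boinc.bakerlab.org/rosetta"


def workunit_url(wuid: int) -> str:
    """Link to the work unit page."""
    return f"{BASE}/workunit.php?{urlencode({'wuid': wuid})}"


def result_url(resultid: int) -> str:
    """Link to one result for a work unit."""
    return f"{BASE}/result.php?{urlencode({'resultid': resultid})}"


print(workunit_url(14553294))
print(result_url(18081512))
```

Posting both, as done above, covers either case.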
Rebel Alliance

Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14757 - Posted: 27 Apr 2006, 14:47:49 UTC

Result ID 18047453
Name HBLR_1.0_1ogw_420_7359_2
Workunit 13416696
Created 23 Apr 2006 19:40:01 UTC
Sent 24 Apr 2006 2:10:22 UTC
Received 27 Apr 2006 14:46:09 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 77284
Report deadline 8 May 2006 2:10:22 UTC
CPU time 127774.709382
Rebel Alliance

Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14758 - Posted: 27 Apr 2006, 15:20:05 UTC

Result ID 17930605
Name HBLR_1.0_1ogw_420_1370_2
Workunit 13336463
Created 22 Apr 2006 12:53:08 UTC
Sent 22 Apr 2006 19:30:38 UTC
Received 27 Apr 2006 15:18:39 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 148992
Report deadline 6 May 2006 19:30:38 UTC
CPU time 107976.460834
Rebel Alliance

Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14759 - Posted: 27 Apr 2006, 15:25:59 UTC

Result ID 17915761
Name FACONTACTS_RECENTER_NOFILTERS_1bk2__448_398_1
Workunit 14540163
Created 22 Apr 2006 9:00:23 UTC
Sent 22 Apr 2006 15:49:23 UTC
Received 27 Apr 2006 15:25:11 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 155638
Report deadline 6 May 2006 15:49:23 UTC
CPU time 129242.046875
Charlie

Joined: 25 Mar 06
Posts: 53
Credit: 424,472
RAC: 0
Message 14776 - Posted: 27 Apr 2006, 17:52:05 UTC

AB_CASP6_t272__456_3679 aborted due to Rosetta@home causing a Windows error. I was unable to track down the error as it caused a reboot of Windows.
Jimi@0wned.org.uk

Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 14783 - Posted: 27 Apr 2006, 18:51:50 UTC

This didn't have a problem except it wouldn't restart after a reboot. Don't know why, don't think it's the unit's fault tbh.

Result ID 18286555
Name PROD_ABINITIO_FAST_1tul__447_82848_0
Workunit 15088910
Created 26 Apr 2006 6:21:10 UTC
Sent 26 Apr 2006 11:16:48 UTC
Received 26 Apr 2006 19:15:45 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 190981
Report deadline 10 May 2006 11:16:48 UTC
CPU time 4557.71875