WU's fail with several error codes

Morphy375
Joined: 2 Nov 05
Posts: 86
Credit: 1,629,758
RAC: 0
Message 2385 - Posted: 5 Nov 2005, 23:45:49 UTC

Having just come over from FaD, I installed BOINC on some machines. The Windows clients run fine with Rosetta, but the two Linux clients produce a lot of failed WUs. The error codes are 131, 26, or 11.
Is there a list of error codes that explains what they mean?

The first machine is running SUSE Linux 9.2 on an AMD XP2200+ with 768 MB RAM.
The second machine is a headless client running LTSP 4.1 on an AMD Sempron 2800+ with 256 MB.
I've stopped Rosetta on these Linux machines and am running Predictor instead, which works fine.


Teddies....
ID: 2385 · Rating: 0
Desti

Joined: 16 Sep 05
Posts: 50
Credit: 3,018
RAC: 0
Message 2506 - Posted: 6 Nov 2005, 22:16:04 UTC

ID: 2506 · Rating: -1
Morphy375
Joined: 2 Nov 05
Posts: 86
Credit: 1,629,758
RAC: 0
Message 2517 - Posted: 6 Nov 2005, 23:16:25 UTC - in response to Message 2506.  

Take a look into the BOINC Wiki http://boinc-doc.net/boinc-wiki/index.php?title=Main_Page


Thanks!

But this doesn't explain why my Linux machines fail with Rosetta but not with other projects. And no, I have no hardware problems... ;-)
Teddies....
ID: 2517 · Rating: 0
Mr_Maniac
Joined: 14 Nov 05
Posts: 10
Credit: 2,253,203
RAC: 0
Message 3600 - Posted: 18 Nov 2005, 12:59:22 UTC

Hello... My Results also fail...

Errors:

These are not processed at all:

<core_client_version>5.2.5</core_client_version>
<message>process exited with code 26 (0x1a)
</message>
<stderr_txt>
2005-11-14 18:42:03 [rosetta@home] execv(../../projects/boinc.bakerlab.org_rosetta/rosetta_4.79_i686-pc-linux-gnu) failed: error -1
execv: Text file busy

</stderr_txt>
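
For what it's worth, on Linux the number 26 is also the errno value for ETXTBSY, whose message is "Text file busy", the same text that execv printed above. Here is a minimal Python sketch for looking up an errno number. It assumes the exit code is simply the errno value passed through, which is a guess and not something confirmed for BOINC here; `describe_errno` is a made-up helper name.

```python
import errno
import os


def describe_errno(code):
    """Return (symbolic name, message) for an errno value on this platform."""
    # errno.errorcode maps numeric values to names such as "ETXTBSY".
    name = errno.errorcode.get(code, "UNKNOWN")
    # os.strerror returns the C library's message for the value.
    return name, os.strerror(code)


if __name__ == "__main__":
    # Assumption: the "exited with code 26" value is an errno number.
    print(describe_errno(26))
```

errno numbering is platform-specific, so the lookup should be run on the same kind of system that produced the error.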

And these are getting processed but... Well... Client error:

<core_client_version>5.2.7</core_client_version>
<message>process got signal 11
</message>
<stderr_txt>
# =====================================
# random seed: 1351821
# =====================================
*** glibc detected *** corrupted double-linked list: 0x08d1a660 ***
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x8740534]
[0x8755386]
[0x8759d4d]
[0x875a323]
[0x875a77f]
[0x8724df5]
[0x8724b61]
[0x8073739]
[0x8225dfd]
[0x8740a3f]
[0x86d236b]
[0x8734dbd]
[0x876cb0a]
# =====================================
# random seed: 1597921
# =====================================
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x875c2fd]
[0x8724b97]
[0x8726631]
[0x808740a]
[0x8447b1d]
[0x84637b0]
[0x85b10cb]
[0x85bc87c]
[0x85d18f8]
[0x85d228f]
[0x83e356a]
[0x8660c80]
[0x85797be]
[0x857aff7]
[0x857bef8]
[0x83a1cf4]
[0x83a2a07]
[0x87393f4]
[0x8048121]
# =====================================
# random seed: 38041
# =====================================
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x875c2fd]
[0x8724b97]
[0x8726631]
[0x8073ad0]
[0x83d49fe]
[0x83d5717]
[0x83e27cd]
[0x83e47d6]
[0x8660f6e]
[0x857983e]
[0x857aff7]
[0x857bef8]
[0x83a1cf4]
[0x83a2a07]
[0x87393f4]
[0x8048121]
# =====================================
# random seed: 1122061
# =====================================
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x875c2fd]
[0x8724b97]
[0x8726631]
[0x8073ad0]
[0x83d49fe]
[0x83d5717]
[0x83e4057]
[0x8660f6e]
[0x857a7c7]
[0x857bef8]
[0x83a1cf4]
[0x83a2a07]
[0x87393f4]
[0x8048121]
# =====================================
# random seed: 1127541
# =====================================
*** glibc detected *** corrupted double-linked list: 0x097b8100 ***
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x8740534]
[0x8755386]
[0x8759d9a]
[0x875aa2c]
[0x875c2fd]
[0x8724b97]
[0x8726631]
[0x8073ad0]
[0x83d49fe]
[0x83d5e88]
[0x83e27cd]
[0x83e47d6]
[0x865fee2]
[0x857a6b7]
[0x857bef8]
[0x83a1cf4]
[0x83a2a07]
[0x87393f4]
[0x8048121]
# =====================================
# random seed: 1435841
# =====================================
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x875a77f]
[0x8724df5]
[0x8724b61]
[0x8227645]
[0x8740a3f]
[0x86d236b]
[0x8734dbd]
[0x876cb0a]
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x84423d7]
[0x8409e60]
[0x83f34a8]
[0x83f635c]
[0x8660f93]
[0x8579798]
[0x857aff7]
[0x857bef8]
[0x83a1cf4]
[0x83a2a07]
[0x87393f4]
[0x8048121]
# =====================================
# random seed: 45981
# =====================================
*** glibc detected *** corrupted double-linked list: 0x095a2e98 ***
[0x86c248b]
[0x8733dac]
[0xffffe420]
[0x8740534]
[0x8755386]
[0x875a5a9]
[0x875a77f]
[0x8724df5]
[0x8724b61]
[0x8227ef9]
[0x8740a3f]
[0x86d236b]
[0x8734dbd]
[0x876cb0a]
# =====================================
# random seed: 1078001
# =====================================

</stderr_txt>
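
As a side note, signal 11 on Linux is SIGSEGV (segmentation fault), which fits the glibc heap-corruption messages in the dump. Below is a rough Python sketch that names a signal number and summarizes a stderr dump like the one above, section by section. It assumes each section starts at a "# random seed:" line and that the lines following it belong to that seed; the helper names `signal_name` and `summarize_stderr` are hypothetical, and the sample text is a shortened excerpt of the log above.

```python
import re
import signal


def signal_name(num):
    """Map a signal number to its symbolic name on this platform."""
    try:
        return signal.Signals(num).name
    except ValueError:
        return "UNKNOWN"


def summarize_stderr(text):
    """Split stderr on '# random seed: N' markers and flag glibc reports.

    Assumption: lines after a seed marker belong to that seed's run.
    """
    results = []
    current = None
    for line in text.splitlines():
        seed = re.match(r"#\s*random seed:\s*(\d+)", line)
        if seed:
            current = {"seed": int(seed.group(1)), "glibc_error": False, "frames": 0}
            results.append(current)
        elif current is not None:
            if line.startswith("*** glibc detected ***"):
                current["glibc_error"] = True
            elif re.fullmatch(r"\[0x[0-9a-fA-F]+\]", line.strip()):
                # A bare "[0x...]" line is one stack frame address.
                current["frames"] += 1
    return results


# Shortened excerpt of the dump above, used only as sample input.
sample = """# random seed: 1351821
# =====================================
*** glibc detected *** corrupted double-linked list: 0x08d1a660 ***
[0x86c248b]
[0x8733dac]
# =====================================
# random seed: 1597921
# =====================================
[0x86c248b]
"""

if __name__ == "__main__":
    print(signal_name(11))
    for section in summarize_stderr(sample):
        print(section)
```

A summary like this only shows which seeds hit a glibc report and how deep the traces are; the raw addresses still need the matching binary to be turned into function names.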


I'm a fresh starter at Rosetta@home (only did SETI@home 'til now) and I already have six failed results out of seven sent...
Well... This one is still being processed... But I think it will fail, too...

My System:
(Gentoo) Linux
Kernel: 2.6.14-gentoo-r4
CPU: Athlon Thunderbird 1.333 GHz
RAM: 512 MB (Non-DDR) SDRAM - 133 MHz
Motherboard: ASUS A7V-133
GCC: 3.4.4
Glibc: glibc-2.3.5-r2
dev-lang/python: 2.4.2
sys-apps/sandbox: 1.2.12
sys-devel/autoconf: 2.13, 2.59-r6
sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils: 2.15.92.0.2-r10
sys-devel/libtool: 1.5.20

Rosetta@home is the only Project where I have that many errors...
ID: 3600 · Rating: 0
Mr_Maniac
Joined: 14 Nov 05
Posts: 10
Credit: 2,253,203
RAC: 0
Message 3805 - Posted: 21 Nov 2005, 14:12:17 UTC - in response to Message 2517.  

Have you found a solution yet?

ID: 3805 · Rating: 0
Mr_Maniac
Joined: 14 Nov 05
Posts: 10
Credit: 2,253,203
RAC: 0
Message 3925 - Posted: 22 Nov 2005, 16:25:07 UTC - in response to Message 3805.  

Hmm... All of a sudden it works...
Already two WUs without errors...
ID: 3925 · Rating: 0
skildude
Joined: 13 Dec 05
Posts: 7
Credit: 1,295,582
RAC: 0
Message 45723 - Posted: 3 Sep 2007, 17:28:00 UTC

I stopped running Rosetta on my Mandriva Linux box. It seems that if the client switched from R@H to SETI and back, it would hang on the Rosetta WU. It wouldn't recognize that it had hung and would sit there idling for hours. I caught one WU that had sat for at least 24 hours. This kind of problem is intolerable. I have the machine specifically to do this type of work, and I get shafted because the program isn't smart enough to continue or quit a WU.

I still have Rosetta running on my Windows boxes, but on this one I will wait for a fix from Rosetta.
Space is a vast empty space. Let us hope that it does not occupy the region between your ears.

Come visit Team Starfire at www.TSWB.org

ID: 45723 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 45742 - Posted: 4 Sep 2007, 13:26:15 UTC

Skildude, you've grabbed an old thread here, and it sounds unlikely that you are encountering the same exit error, so please post your details (including your setting for preferred WU runtime) to the "problems with..." thread in the Number Crunching board for the release you are running. It would also help if you state your BOINC version and the details of your Linux installation.
Rosetta Moderator: Mod.Sense
ID: 45742 · Rating: 0




©2024 University of Washington
https://www.bakerlab.org