Thread: Re: [Aoetools-discuss] Trouble with vblade


Adi Kriegisch wrote:
> Hi Sam!
> 
> Thank you for your quick reply!
> 
>> Just for sanity, what version of vblade are you using?  I didn't think
>> I broke anything with the latest release ...
> I first tried with v13; yesterday I repeated all the tests with v14 with 
> exactly the same results (and so decided to ask for help).
> Module aoe is version 39 running in a stock debian sid kernel with:
> "vermagic: 2.6.18-2-xen-686 SMP mod_unload 686 REGPARM gcc-4.1"
> 
> [SNIP]
>>> This is reproducible in both directions. The funny thing is that it
>>> works like a charm via loopback on the same server.
>> The output of aoe-stat would be helpful to ensure you can actually see
>> what you think you can see.  Also, if you
> Output of aoe-stat (I always checked it and just left it out of my posting 
> to keep it from getting too long and unreadable):
>       e0.0         5.009GB   eth0 up
> At the beginning I tested just by killing vblade and removing and 
> reinserting the aoe module. After a while I decided to reboot the machines 
> after a test run so that nothing was left over.
> 
>> 	cat /dev/etherd/err
> Shows nothing throughout all the tests.
> 
>> you might get an idea of whether communication is having to retransmit
>> a lot.  It could be a cabling issue.
> This happened sometimes with my tests on the loopback device:
> (from /var/log/syslog)
> Nov 16 22:50:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000
> Nov 16 22:50:29 tritium kernel: aoe: e1.0: setting 16384 byte data frames on lo:000000000000
> Nov 16 22:51:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000
> Nov 16 22:52:29 tritium kernel: aoe: e1.0: setting 16384 byte data frames on lo:000000000000
> Nov 16 22:52:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000
> 
> But never ever on ethernet. 
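
To check whether the frame-size messages quoted above really never show up for the Ethernet interface, the syslog could be tallied with a short script. This is only a rough sketch: the regular expression is derived from the five log lines quoted above, it assumes each message sits on a single line in the real /var/log/syslog (the wrapping above comes from the mail client), and the names FRAME_RE, frame_size_history and count_changes are made up for illustration.

```python
import re
from collections import defaultdict

# Pattern derived from the syslog lines quoted above, e.g.
# "aoe: e1.0: setting 1024 byte data frames on lo:000000000000"
FRAME_RE = re.compile(
    r"aoe: (e\d+\.\d+): setting (\d+) byte data frames on (\S+)"
)

# The five messages from the posting, joined back into single lines.
SAMPLE = [
    "Nov 16 22:50:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000",
    "Nov 16 22:50:29 tritium kernel: aoe: e1.0: setting 16384 byte data frames on lo:000000000000",
    "Nov 16 22:51:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000",
    "Nov 16 22:52:29 tritium kernel: aoe: e1.0: setting 16384 byte data frames on lo:000000000000",
    "Nov 16 22:52:29 tritium kernel: aoe: e1.0: setting 1024 byte data frames on lo:000000000000",
]


def frame_size_history(lines):
    """Map (device, interface:mac) to the frame sizes logged, in order."""
    history = defaultdict(list)
    for line in lines:
        match = FRAME_RE.search(line)
        if match:
            device, size, target = match.groups()
            history[(device, target)].append(int(size))
    return dict(history)


def count_changes(sizes):
    """Count how often a logged size differs from the one before it."""
    return sum(1 for prev, cur in zip(sizes, sizes[1:]) if prev != cur)


if __name__ == "__main__":
    for (device, target), sizes in frame_size_history(SAMPLE).items():
        print(device, target, sizes, "changes:", count_changes(sizes))
```

Feeding it the lines of the real syslog instead of SAMPLE would list every device and interface for which the aoe module logged a frame-size change, so an eth0 entry would stand out immediately.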
> 
>>> My hardware configuration:
>>> both servers are dual Intel PIII 1400MHz with 3GB RAM
>>> Network adapter is Ethernet controller: Intel Corporation 82557/8/9
>>> [Ethernet Pro 100] (rev 0c) used with e100 driver. (yes, the network is
>>> 100MBit) Networking using several different protocols works like a charm
>>> with the performance one can expect from a 100MBit Network.
>> You're not perchance connecting the servers directly to each other
>> without an intervening switch, are you?  I've never seen 100MbE that
>> did auto MDIX.
> No; there is a switch in between. Communication via other protocols works at 
> full speed; ping -f isn't losing packets. I even tried to increase the network 
> buffer size as specified for GBit Ethernet in the README file, but this also 
> had no effect.

Try using a crossover cable.

> I had an strace running on the vblade process that just showed read and write 
> operations and was identical to a session using the loopback device. So, 
> nothing unusual there.
> 
> Any automated tests I could run? Anything else I could check? May I provide 
> you with access to the servers (just send me a private mail!)?
> From my point of view there are three things I am not sure about: first, 
> the machine is SMP; second, the kernel is not stock but runs with Xen 
> patches; and third, maybe there is an issue with the NICs and their driver 
> (e100)?!
> 
> Any further hints highly appreciated! :-)

You could also try booting a LiveCD on your server.
BTW, what partition type are you using?

Andrei
-- 
Lan.Art s.r.l.

via Co' del Panico 36/1
35028 Piove di Sacco (PD)

tel. 049-7966424
fax  049-7966600
http://www.lanart.it
