
#637 getmacs ping successful but mac is not displayed

closed
5
2012-09-19
2009-09-30
No

xCAT 2.3 (LINUX 0925 build) on p5 machine with SLES11

The ping against the adapter was successful, but getmacs does not display the MAC address or populate the node database.

c862f1sq01:/var/log/consoles # getmacs c862f2sq02 -D -S 9.114.134.65 -G 9.114.134.126
c862f2sq02:
# Connecting to c862f2sq02
# Connected
# Checking for power off.
# Power off complete.
# Power on c862f2sq02 to Open Firmware.
# Power on complete.
# Client IP address is 9.114.134.74
# Server IP address is 9.114.134.65
# Gateway IP address is 9.114.134.126
# Getting adapter location codes.
# /pci@800000020000001/pci@2,4/ethernet@1 ping successful.
# Type Location Code MAC Address Full Path Name Ping Result
ent U7879.001.DQD04EM-P1-T6 00096bdd0f22 /pci@800000020000001/pci@2,4/ethernet@1 successful physical
# /pci@800000020000001/pci@2,4/ethernet@1,1 ping unsuccessful.
ent U7879.001.DQD04EM-P1-T7 00096bdd0f23 /pci@800000020000001/pci@2,4/ethernet@1,1 unsuccessful physical
lpar_netboot: Timeout

Here is output from the console window. On a successful node it goes through all 4 adapters (1 success and 3 fail). Here it hangs at '1 >' after the second adapter. I tried verbose but I do not understand the output.

...
000000e3ea00: /disk@0
000000d9d6c0: /pci@2
000000e44180: /ethernet@1
000000da49b0: /pci@2,6
ok
0 > dev /packages/net ok
0 > FALSE value OPEN-DEV_DEBUG ok
0 > : new-open-dev ( str len -- true|false ) open-dev_debug if cr ." NEW-OPEN-DEV: Entering, Device : " 2dup type cr then { _str _len ; _n } 0 -> _n get-msecs dup d# 60000 + ( start timeout ) begin ( start timeout ) get-msecs over > if open-dev_debug if ( start timeout ) drop get-msecs swap - cr ." FAILED TO OPEN DEVICE" cr ." NUMBER OF TRIES " _n .d cr ." TIME ELAPSED " ( time ) .d ." MSECONDS" cr else ( start timout ) 2drop then false exit else true then while ( start timeout ) _n 1 + -> _n _str _len open-dev ( ihandle|false ) ?dup if -rot ( ihandle start timeout ) open-dev_debug if ( start timeout ) drop get-msecs swap - cr ." SUCCESSFULLY OPENED DEVICE" cr ." NUMBER OF TRIES " _n .d cr ." TIME ELAPSED " ( time ) .d ." MSECONDS" cr else ( start timeout ) 2drop then ( ihandle ) exit then ( start timeout ) repeat ; ok
0 > patch new-open-dev open-dev net-ping ok
0 > " supported-network-types" 000000dfb4f8 get-package-property ok
3 > . 0 ok
2 > decode-string ok
4 > dump
000000e0ace0: 65 74 68 65 72 6e 65 74 2c 61 75 74 6f 2c 72 6a :ethernet,auto,rj:
000000e0acf0: 34 35 2c 61 75 74 6f :45,auto: ok
2 > .s e0acf8 72
ok
2 > decode-string ok
4 > dump
000000e0acf8: 65 74 68 65 72 6e 65 74 2c 31 30 2c 72 6a 34 35 :ethernet,10,rj45:
000000e0ad08: 2c 68 61 6c 66 :,half: ok
2 > .s e0ad0e 5c
ok
2 > decode-string ok
4 > dump
000000e0ad0e: 65 74 68 65 72 6e 65 74 2c 31 30 2c 72 6a 34 35 :ethernet,10,rj45:
000000e0ad1e: 2c 66 75 6c 6c :,full: ok
2 > .s e0ad24 46
ok
2 > decode-string ok
4 > dump
000000e0ad24: 65 74 68 65 72 6e 65 74 2c 31 30 30 2c 72 6a 34 :ethernet,100,rj4:
000000e0ad34: 35 2c 68 61 6c 66 :5,half: ok
2 > .s e0ad3b 2f
ok
2 > decode-string ok
4 > dump
000000e0ad3b: 65 74 68 65 72 6e 65 74 2c 31 30 30 2c 72 6a 34 :ethernet,100,rj4:
000000e0ad4b: 35 2c 66 75 6c 6c :5,full: ok
2 > .s e0ad52 18
ok
2 > decode-string ok
4 > dump
000000e0ad52: 65 74 68 65 72 6e 65 74 2c 31 30 30 30 2c 72 6a :ethernet,1000,rj:
000000e0ad62: 34 35 2c 66 75 6c 6c :45,full: ok
2 > .s e0ad6a 0
ok
2 > . 0 ok
1 > . e0ad6a ok
0 > dev /pci@800000020000001/pci@2,4/ethernet@1 ok
0 > " ethernet,auto,rj45,auto" encode-string " chosen-network-type" property ok
0 > dev /packages/net ok
0 > ping /pci@800000020000001/pci@2,4/ethernet@1:9.114.134.65,9.114.134.74,9.114.134.126
$PING: args = /pci@800000020000001/pci@2,4/ethernet@1:9.114.134.65,9.114.134.74,9.114.134.126

NET: Device String - /pci@800000020000001/pci@2,4/ethernet@1
NET: Ping timeout - 10000 msec

NET: Ping string - h,9.114.134.65,9.114.134.74,9.114.134.126
PING: chosen-network-type = ethernet,auto,rj45,auto
PING: client IP = 9.114.134.74
PING: server IP = 9.114.134.65
PING: gateway IP = 9.114.134.126

PING: device /pci@800000020000001/pci@2,4/ethernet@1
PING: loc-code U7879.001.DQD04EM-P1-T6

PING: wait 60 seconds for Spanning Tree ...
PING: Ready to ping:
PING: source hardware address is 0 9 6b dd f 22
PING: destination hardware address is 0 9 6b 6b 5 a
PING: source IP address is 9.114.134.74
PING: destination IP address is 9.114.134.65

PING SUCCESS.
ok
0 > 0 to my-self ok
0 > " local-mac-address" 000000dfb4f8 get-package-property ok
3 > . 0 ok
2 > dump
000000e0aec8: 00 09 6b dd 0f 22 :..k..": ok
0 > " ibm,loc-code" 000000dfb4f8 get-package-property ok
3 > . 0 ok
2 > dump
000000dfb598: 55 37 38 37 39 2e 30 30 31 2e 44 51 44 30 34 45 :U7879.001.DQD04E:
000000dfb5a8: 4d 2d 50 31 2d 54 36 00 :M-P1-T6.: ok
0 > " supported-network-types" 000000e0b0d0 get-package-property ok
3 > . 0 ok
2 > decode-string ok
4 > dump
000000e1a8b8: 65 74 68 65 72 6e 65 74 2c 61 75 74 6f 2c 72 6a :ethernet,auto,rj:
000000e1a8c8: 34 35 2c 61 75 74 6f :45,auto: ok
2 > .s e1a8d0 72
ok
2 > decode-string ok
4 > dump
000000e1a8d0: 65 74 68 65 72 6e 65 74 2c 31 30 2c 72 6a 34 35 :ethernet,10,rj45:
000000e1a8e0: 2c 68 61 6c 66 :,half: ok
2 > .s e1a8e6 5c
ok
2 > decode-string ok
4 > dump
000000e1a8e6: 65 74 68 65 72 6e 65 74 2c 31 30 2c 72 6a 34 35 :ethernet,10,rj45:
000000e1a8f6: 2c 66 75 6c 6c :,full: ok
2 > .s e1a8fc 46
ok
2 > decode-string ok
4 > dump
000000e1a8fc: 65 74 68 65 72 6e 65 74 2c 31 30 30 2c 72 6a 34 :ethernet,100,rj4:
000000e1a90c: 35 2c 68 61 6c 66 :5,half: ok
2 > .s e1a913 2f
ok
2 > decode-string ok
4 > dump
000000e1a913: 65 74 68 65 72 6e 65 74 2c 31 30 30 2c 72 6a 34 :ethernet,100,rj4:
000000e1a923: 35 2c 66 75 6c 6c :5,full: ok
2 > .s e1a92a 18
ok
2 > decode-string ok
4 > dump
000000e1a92a: 65 74 68 65 72 6e 65 74 2c 31 30 30 30 2c 72 6a :ethernet,1000,rj:
000000e1a93a: 34 35 2c 66 75 6c 6c :45,full: ok
2 > .s e1a942 0
ok
2 > . 0 ok
1 > . e1a942 ok
0 > dev /pci@800000020000001/pci@2,4/ethernet@1,1 ok
0 > " ethernet,auto,rj45,auto" encode-string " chosen-network-type" property ok
0 > dev /packages/net ok
0 > ping /pci@800000020000001/pci@2,4/ethernet@1,1:9.114.134.65,9.114.134.74,9.114.134.126
$PING: args = /pci@800000020000001/pci@2,4/ethernet@1,1:9.114.134.65,9.114.134.74,9.114.134.126

NET: Device String - /pci@800000020000001/pci@2,4/ethernet@1,1
NET: Ping timeout - 10000 msec

NET: Ping string - h,9.114.134.65,9.114.134.74,9.114.134.126
PING: chosen-network-type = ethernet,auto,rj45,auto
PING: client IP = 9.114.134.74
PING: server IP = 9.114.134.65
PING: gateway IP = 9.114.134.126

PING: device /pci@800000020000001/pci@2,4/ethernet@1,1
PING: loc-code U7879.001.DQD04EM-P1-T7

PING: wait 60 seconds for Spanning Tree ...
PING-ERROR: cannot arp gateway 9.114.134.126 or server 9.114.134.65

PING FAILED.
ok
0 > 0 to my-self ok
0 > " local-mac-address" 000000e0b0d0 get-package-property ok
3 > . 0 ok
2 > dump
000000e1aaa0: 00 09 6b dd 0f 23 :..k..#: ok
0 > " ibm,loc-code" 000000e0b0d0 get-package-property ok
3 > . 0 ok
2 > dump
000000e0b170: 55 37 38 37 39 2e 30 30 31 2e 44 51 44 30 34 45 :U7879.001.DQD04E:
000000e0b180: 4d 2d 50 31 2d 54 37 00 :M-P1-T7.: ok
0 > " supported-network-types" 000000e1aca8 get-package-property ok
1 >
[attached]

It has these adapters:


NIC Adapters
    Device                             Location Code                 Hardware Address
1.  Port 1 - IBM 2 PORT 10/100/100     U7879.001.DQD04EM-P1-T6       00096bdd0f22
2.  Port 2 - IBM 2 PORT 10/100/100     U7879.001.DQD04EM-P1-T7       00096bdd0f23
3.  10/100 Mbps Ethernet PCI Adapt     U7879.001.DQD04EM-P1-C5-T1    000d600a9952
4.  10/100 Mbps Ethernet PCI Adapt     U7879.001.DQD04EM-P1-C1-T1    000d600ac546


So I tried to target the adapter that had the ping success, but getmacs still goes on to try the next adapter, U7879.001.DQD04EM-P1-T7, hangs at the '1 >' prompt again, and then returns the timeout message.

getmacs c862f2sq02 -D -S 9.114.134.65 -G 9.114.134.126 -F Phys_Port_Loc=U7879.001.DQD04EM-P1-T6

c862f2sq02:
# Connecting to c862f2sq02
# Connected
# Checking for power off.
# Power off complete.
# Power on c862f2sq02 to Open Firmware.
# Power on complete.
# Client IP address is 9.114.134.74
# Server IP address is 9.114.134.65
# Gateway IP address is 9.114.134.126
# Getting adapter location codes.
# /pci@800000020000001/pci@2,4/ethernet@1 ping successful.
# Type Location Code MAC Address Full Path Name Ping Result
ent U7879.001.DQD04EM-P1-T6 00096bdd0f22 /pci@800000020000001/pci@2,4/ethernet@1 successful physical
# /pci@800000020000001/pci@2,4/ethernet@1,1 ping unsuccessful.
ent U7879.001.DQD04EM-P1-T7 00096bdd0f23 /pci@800000020000001/pci@2,4/ethernet@1,1 unsuccessful physical
lpar_netboot: Timeout
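
As a workaround while the timeout is being investigated, the MAC of the adapter that pinged successfully can be pulled out of the captured output by hand or with a small script. The following is only a sketch: the row layout (type `ent`, location code, 12-digit MAC, device path, ping result) is inferred from the output quoted above, and the names `parse_adapter_rows` and `format_mac` are hypothetical and not part of xCAT.

```python
import re

# One adapter row: "ent", location code, 12-hex-digit MAC, device path, ping result.
# The layout is an assumption inferred from the getmacs output quoted in this ticket.
ROW = re.compile(
    r"ent\s+(U\S+)\s+([0-9a-fA-F]{12})\s+(\S+)\s+(unsuccessful|successful)"
)


def parse_adapter_rows(output):
    """Return one dict per adapter row found in the captured getmacs output."""
    return [
        {
            "loc_code": loc,
            "mac": mac.lower(),
            "path": path,
            "ping_ok": result == "successful",
        }
        for loc, mac, path, result in ROW.findall(output)
    ]


def format_mac(mac):
    """Turn 00096bdd0f22 into 00:09:6b:dd:0f:22."""
    return ":".join(mac[i:i + 2] for i in range(0, 12, 2))


if __name__ == "__main__":
    sample = (
        "# Type Location Code MAC Address Full Path Name Ping Result\n"
        "ent U7879.001.DQD04EM-P1-T6 00096bdd0f22 "
        "/pci@800000020000001/pci@2,4/ethernet@1 successful physical\n"
        "ent U7879.001.DQD04EM-P1-T7 00096bdd0f23 "
        "/pci@800000020000001/pci@2,4/ethernet@1,1 unsuccessful physical\n"
        "lpar_netboot: Timeout\n"
    )
    for row in parse_adapter_rows(sample):
        if row["ping_ok"]:
            print(row["loc_code"], format_mac(row["mac"]))
```

The pattern does not rely on line breaks, so it also works on output where the rows have been run together on one line.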

Discussion

  • Anonymous

    Anonymous - 2009-10-08

    HuaZhong - I logged onto Kerry's system last week to try to figure out what was going on. This is p5 hardware, and there is no LHEA adapter. I did some verbose runs and saw something in the output about no LHEA adapters being found. I am not sure whether that is the cause of this problem. I tried comparing the trace output to the getmacs expect code and got very lost. This is COMPLICATED code - thank you for looking into this.

     
  • wang huazhong

    wang huazhong - 2009-10-09

    This is a timeout issue in getmacs. See the last two lines:
    "
    0 > " supported-network-types" 000000e1aca8 get-package-property ok
    1 >
    "

    In the good case, after the supported-network-types command returns, the prompt should be 3 >. But here it is 1 >, which our code cannot match. This needs discussion with the Open Firmware team.
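
    To illustrate the mismatch: the number in front of `>` at the Open Firmware prompt is the current stack depth. As I understand the IEEE 1275 definition, `get-package-property` leaves three items on the stack when the property is found and a single flag when it is not, which would explain `3 >` versus `1 >`. The sketch below is not the getmacs expect code; `last_prompt_depth` and `check_property_lookup` are hypothetical names used only to show how a script could report the unexpected depth right away instead of waiting for the timeout.

```python
import re

# An Open Firmware prompt looks like "0 > " or "3 > "; the number is the stack depth.
PROMPT = re.compile(r"^\s*(\d+) >", re.MULTILINE)


def last_prompt_depth(console_text):
    """Return the stack depth shown by the last prompt, or None if no prompt is seen."""
    depths = PROMPT.findall(console_text)
    return int(depths[-1]) if depths else None


def check_property_lookup(console_text, expected_depth=3):
    """Classify the console state after a get-package-property call.

    expected_depth=3 follows the "good case" described in this ticket.
    """
    depth = last_prompt_depth(console_text)
    if depth is None:
        return "no prompt yet"
    if depth == expected_depth:
        return "ok"
    return f"unexpected stack depth {depth} (expected {expected_depth})"


if __name__ == "__main__":
    good = '0 > " supported-network-types" 000000e0b0d0 get-package-property ok\n3 > '
    bad = '0 > " supported-network-types" 000000e1aca8 get-package-property ok\n1 >'
    print(check_property_lookup(good))
    print(check_property_lookup(bad))
```

    Matching any numeric prompt and then comparing the depth lets the caller fail fast with a specific message, rather than waiting for a `3 >` prompt that never arrives.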

     
  • wang huazhong

    wang huazhong - 2009-10-10

    The Open Firmware team confirmed that this happens because the adapter is too old and has FCode. The solution is either to remove or replace that adapter, or to update its microcode.

     
  • wang huazhong

    wang huazhong - 2009-10-10

    Setting this to Pending so that Kerry can confirm it.

     
  • SourceForge Robot

    This Tracker item was closed automatically by the system. It was
    previously set to a Pending status, and the original submitter
    did not respond within 28 days (the time period specified by
    the administrator of this Tracker).