Hi,
Okay, switching to the router port that has USB routing built in did not resolve the problem; it is still the same. PymbleSoftware: the board seems to be fine, and the test with USB over NUMAlink should eliminate the possibility of a bad port.
Now here is another interesting fact: I have two O300 servers, and on BOTH of them the brick with the same serial exhibits the same problem. What are the chances of two bricks going bad at the same time? So on both machines, 001c22 drops its connection to the L2. The bricks by themselves seem to be okay. Funky stuff, eh?
Changing NUMAlink cables, USB cables, router ports, etc. does not help.
I did some more research. It looks like the brick drops its USB connection when it is powered on and regains it when it is powered off. Here is the sequence of events:
oink-001-L2 is the L2 controller
001c22-L1 is the problematic brick
I have tried to keep the other bricks out of the picture and work only with the problematic one to show the issue.
oink-001-L2 Code:
oink-001-L2>cfg
L2 192.168.1.11: - 001 (LOCAL)
L1 192.168.1.11:0:0 - 001c36
L1 192.168.1.11:2:0 - 001c24
L1 192.168.1.11:1:0 - 001c22
L1 192.168.1.11:3:0 - 001r20
L1 192.168.1.11:5:0 - 001c18
L1 192.168.1.11:4:0 - 001c16
L1 192.168.1.11:6:0 - 001c14
001c22-L1 Code:
001c22-L1>cfg
:0 - 001c22
001c22-L1>date
12/31/2010 14:07:19 CST
001c22-L1>pwr u
(about 15 seconds pass)
oink-001-L2 Code:
usb_control/bulk_msg: timeout
hub.c: USB device not accepting new address (error=-110)
SGIL1_RESET_DEVICE error, Connection timed out
WARNING: can't write to connection to 001c22 [0x0ffff00e]: IRouter:write failed
usb.c: USB disconnect on device 90
sgil1.c: SGI L1 disconnected, minor: 65 device: 2.90
INFO: closed connection to 001c22
hub.c: USB new device connect on bus2/1/1, assigned device number 91
usb_control/bulk_msg: timeout
usb.c: USB device not accepting new address=91 (error=-110)
hub.c: USB new device connect on bus2/1/1, assigned device number 92
usb.c: USB device not accepting new address=92 (error=-110)
hub.c: USB new device connect on bus2/1/1, assigned device number 93
usb_control/bulk_msg: timeout
usb.c: USB device not accepting new address=93 (error=-110)
hub.c: USB new device connect on bus2/1/1, assigned device number 94
usb.c: USB device not accepting new address=94 (error=-32)
oink-001-L2>cfg
L2 192.168.1.11: - 001 (LOCAL)
L1 192.168.1.11:0:0 - 001c36
L1 192.168.1.11:2:0 - 001c24
L1 192.168.1.11:3:0 - 001r20
L1 192.168.1.11:5:0 - 001c18
L1 192.168.1.11:4:0 - 001c16
L1 192.168.1.11:6:0 - 001c14
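As a side note, the negative numbers in the hub.c/usb.c lines above look like Linux kernel errno values. That is an assumption on my part, based on the L2 messages looking like Linux USB driver output. Under that assumption, -110 is ETIMEDOUT and -32 is EPIPE, which in the Linux USB stack usually means a stalled endpoint. Here is a rough sketch that decodes them; the lookup table and the function name are my own, not anything from the L2 firmware:

```python
import re

# Linux errno values (from errno.h), hardcoded so this runs on any OS.
# Assumption: the L2 reports standard Linux kernel error numbers.
LINUX_ERRNO = {
    32: ("EPIPE", "Broken pipe (for USB, usually an endpoint stall)"),
    110: ("ETIMEDOUT", "Connection timed out"),
}

def decode_usb_errors(lines):
    """Return (device_address, errno_name) for each line containing 'error=-N'."""
    decoded = []
    for line in lines:
        m = re.search(r"(?:address=(\d+)\s+)?\(error=-(\d+)\)", line)
        if m:
            addr = int(m.group(1)) if m.group(1) else None
            name = LINUX_ERRNO.get(int(m.group(2)), ("UNKNOWN", ""))[0]
            decoded.append((addr, name))
    return decoded

# Messages copied from the L2 output above.
l2_messages = [
    "hub.c: USB device not accepting new address (error=-110)",
    "usb.c: USB device not accepting new address=91 (error=-110)",
    "usb.c: USB device not accepting new address=92 (error=-110)",
    "usb.c: USB device not accepting new address=93 (error=-110)",
    "usb.c: USB device not accepting new address=94 (error=-32)",
]

for addr, name in decode_usb_errors(l2_messages):
    print(addr, name)
```

If that reading is right, the first attempts simply time out, and only the last one (address 94) gets a different answer from the brick.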
001c22-L1 Code:
001c22-L1>date
12/31/2010 14:07:54 CST
001c22-L1>pwr d
oink-001-L2 Code:
hub.c: USB new device connect on bus2/1/1, assigned device number 95
Manufacturer: Silicon Graphics, Inc.
Product: SN1 L1 System Controller
SerialNumber: 00000000
sgil1.c: SGI L1 connected, minor: 65 device: 2.95
INFO: opened USB device at b2;p1/1;d95 (/dev/sgil1_1)
oink-001-L2>cfg
L2 192.168.1.11: - 001 (LOCAL)
L1 192.168.1.11:0:0 - 001c36
L1 192.168.1.11:2:0 - 001c24
L1 192.168.1.11:1:0 - 001c22
L1 192.168.1.11:3:0 - 001r20
L1 192.168.1.11:5:0 - 001c18
L1 192.168.1.11:4:0 - 001c16
L1 192.168.1.11:6:0 - 001c14
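To make the before/after comparison of the cfg listings less error-prone than eyeballing them, here is a small sketch that diffs two pasted cfg outputs. It assumes every brick line has the shape "L1 <address> - <brick id>" as in the listings above; parse_cfg and missing_bricks are just names I made up:

```python
def parse_cfg(text):
    """Map brick ID -> address for each 'L1' line of pasted L2 cfg output."""
    bricks = {}
    for line in text.splitlines():
        parts = line.split()
        # Assumed line shape: L1 <address> - <brick id>
        if len(parts) == 4 and parts[0] == "L1" and parts[2] == "-":
            bricks[parts[3]] = parts[1]
    return bricks

def missing_bricks(before, after):
    """Brick IDs present in the first listing but absent from the second."""
    return sorted(set(parse_cfg(before)) - set(parse_cfg(after)))

# cfg output before "pwr u" on 001c22 (copied from above).
cfg_before = """\
L2 192.168.1.11: - 001 (LOCAL)
L1 192.168.1.11:0:0 - 001c36
L1 192.168.1.11:2:0 - 001c24
L1 192.168.1.11:1:0 - 001c22
L1 192.168.1.11:3:0 - 001r20
L1 192.168.1.11:5:0 - 001c18
L1 192.168.1.11:4:0 - 001c16
L1 192.168.1.11:6:0 - 001c14
"""

# cfg output after the brick was powered up (copied from above).
cfg_after = """\
L2 192.168.1.11: - 001 (LOCAL)
L1 192.168.1.11:0:0 - 001c36
L1 192.168.1.11:2:0 - 001c24
L1 192.168.1.11:3:0 - 001r20
L1 192.168.1.11:5:0 - 001c18
L1 192.168.1.11:4:0 - 001c16
L1 192.168.1.11:6:0 - 001c14
"""

print(missing_bricks(cfg_before, cfg_after))
```

For the listings in this post, the only brick that disappears is 001c22 (at 192.168.1.11:1:0), and it is back in the third listing after the power down.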
001c22-L1 Code:
001c22-L1>log
12/31/10 14:07:01 USB0: opened
12/31/10 14:07:01 USB0: registered for events
12/31/10 14:07:20 power up (COMMAND)
12/31/10 14:07:27 IP45 ASIC clock 08
12/31/10 14:07:27 IP45 XBridge clock 04
12/31/10 14:07:28 reset again MIPS
12/31/10 14:07:33 IP45 ASIC clock 08
12/31/10 14:07:33 IP45 XBridge clock 04
12/31/10 14:07:48 USB-R: USB:connection lost
12/31/10 14:07:48 UNREG: 30004c80 0 7
12/31/10 14:07:48 USB0: unregistered
12/31/10 14:07:48 USB-W: USB:connection lost
12/31/10 14:07:48 USB-W: IRouter:write failed
12/31/10 14:07:49 USB0-R: IRouter:read failed - read error
12/31/10 14:07:49 USB0: waiting on open
12/31/10 14:07:56 power down (COMMAND)
12/31/10 14:08:00 USB0: opened
12/31/10 14:08:00 USB0: registered for events
So, all I can say here is: huh??
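In case it helps anyone compare timings, here is a quick sketch that pulls the intervals out of the L1 log above. It assumes each log line is "MM/DD/YY HH:MM:SS message", as in my paste; the helper names are my own:

```python
from datetime import datetime

def parse_log(text):
    """Turn pasted L1 log text into a list of (timestamp, message) tuples."""
    events = []
    for line in text.splitlines():
        if not line.strip():
            continue
        date, time, msg = line.split(maxsplit=2)
        stamp = datetime.strptime(f"{date} {time}", "%m/%d/%y %H:%M:%S")
        events.append((stamp, msg))
    return events

def seconds_between(events, first_msg, second_msg):
    """Seconds from the first event containing first_msg to the next
    event at or after it that contains second_msg."""
    start = next(t for t, m in events if first_msg in m)
    end = next(t for t, m in events if second_msg in m and t >= start)
    return (end - start).total_seconds()

# Relevant lines copied from the 001c22-L1 log above.
l1_log = """\
12/31/10 14:07:01 USB0: opened
12/31/10 14:07:01 USB0: registered for events
12/31/10 14:07:20 power up (COMMAND)
12/31/10 14:07:28 reset again MIPS
12/31/10 14:07:48 USB-R: USB:connection lost
12/31/10 14:07:49 USB0: waiting on open
12/31/10 14:07:56 power down (COMMAND)
12/31/10 14:08:00 USB0: opened
12/31/10 14:08:00 USB0: registered for events
"""

events = parse_log(l1_log)
print(seconds_between(events, "power up", "connection lost"))
print(seconds_between(events, "power down", "USB0: opened"))
```

By the L1's own clock, the USB link is lost 28 seconds after the power-up command and is reopened 4 seconds after the power-down command.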