From: William S. <wst...@po...> - 2000-11-07 19:54:23
|
Good day, Jeff and all, (Host 2.4.0-test10, rh7, uml root is rh62) Thanks for the infinite printk fix, Jeff. The printk meltdown found in previous versions ("Hmmm, can't printk that message? Must alert the user with a printk!") has disappeared. I'm trying to run test10 with stdin and stdout redirected from/to /dev/null - I'm hoping to use it as a server only platform. When I use e2fsck -v -y root_fs ; ./linux-2.4.0-test10 mem=48M </dev/null the boot proceeds pretty well until it gets to the named boot script. If I don't use the </dev/null, umnl boots to xterms and a login prompt. Once I add </dev/null, it stops somewhere in the named script (a "ping the hotst" script run before named executes, one run after named does not). Partway through that script, fd0 is completely closed on all uml threads. I can watch it with: while /bin/true ; do clear ; for X in $(ps axf | grep [l]inux-2.4.0-test10 | awk '{print $1}') ; do ls -al /proc/$X/fd/0 ; done ; sleep 1 ; done Up until named, I get: lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25751/fd/0 -> /dev/null lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25752/fd/0 -> /dev/null lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25763/fd/0 -> /dev/null ... lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25865/fd/0 -> /dev/null lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25884/fd/0 -> /dev/null lr-x------ 1 wstearns wstearns 64 Nov 7 14:31 /proc/25920/fd/0 -> /dev/null with the odd "No such file or directory" from the delay between looking up the pids and actually taking a dir. Once the named script gets underway, all threads no longer have stdin open: ls: /proc/26443/fd/0: No such file or directory ls: /proc/25751/fd/0: No such file or directory ls: /proc/25752/fd/0: No such file or directory ls: /proc/25763/fd/0: No such file or directory ls: /proc/25768/fd/0: No such file or directory ls: /proc/25769/fd/0: No such file or directory ls: /proc/25764/fd/0: No such file or directory ls: /proc/25765/fd/0: No such file or directory ls: /proc/25766/fd/0: No such file or directory ls: /proc/25767/fd/0: No such file or directory ls: /proc/25795/fd/0: No such file or directory ls: /proc/25828/fd/0: No such file or directory ls: /proc/25884/fd/0: No such file or directory ls: /proc/25920/fd/0: No such file or directory ls: /proc/26074/fd/0: No such file or directory ls: /proc/26266/fd/0: No such file or directory ls: /proc/26319/fd/0: No such file or directory ls: /proc/26330/fd/0: No such file or directory ls: /proc/26345/fd/0: No such file or directory ls: /proc/26346/fd/0: No such file or directory ls: /proc/26347/fd/0: No such file or directory ls: /proc/26348/fd/0: No such file or directory ls: /proc/26349/fd/0: No such file or directory ls: /proc/26365/fd/0: No such file or directory ls: /proc/26381/fd/0: No such file or directory ls: /proc/26385/fd/0: No such file or directory ls: /proc/26397/fd/0: No such file or directory ls: /proc/26398/fd/0: No such file or directory For the brief second between when the network is brought up and when fd0 is closed, I can ping uml from the host. Once fd0 is closed, I can't anymore. I'll guess that named isn't itself the problem; it just happens to be executing and trying to do outside lookups when the fd is closed: tcpdump: listening on tap1 14:47:51.592086 B arp who-has uml-router tell 192.168.210.141 14:47:51.592101 > arp reply uml-router (fe:fd:0:0:0:0) is-at fe:fd:0:0:0:0 (fe:fd:c0:a8:d2:8d) 14:47:51.597167 < 192.168.210.141.1024 > a.root-servers.net.domain: 31496 NS? . (17) (DF) 14:47:51.842545 > a.root-servers.net.domain > 192.168.210.141.1024: 31496*- 13/0/13 NS A.ROOT-SERVERS.NET., NS H.ROOT-SERVERS.NET., NS C.ROOT-SERVERS.NET., NS G.ROOT-SERVERS.NET., NS F.ROOT-SERVERS.NET., (436) 14:47:55.006502 < 192.168.210.141.1024 > d.root-servers.net.domain: 31496 NS? . (17) (DF) 14:47:55.532558 > d.root-servers.net.domain > 192.168.210.141.1024: 31496*- 13/0/13 NS C.ROOT-SERVERS.NET., NS G.ROOT-SERVERS.NET., NS F.ROOT-SERVERS.NET., NS B.ROOT-SERVERS.NET., NS J.ROOT-SERVERS.NET., (436) 14:47:59.007525 < 192.168.210.141.1024 > k.root-servers.net.domain: 31496 NS? . (17) (DF) 14:47:59.302574 > k.root-servers.net.domain > 192.168.210.141.1024: 31496*- 13/0/13 NS K.ROOT-SERVERS.NET., NS L.ROOT-SERVERS.NET., NS M.ROOT-SERVERS.NET., NS I.ROOT-SERVERS.NET., NS E.ROOT-SERVERS.NET., (436) It looks like the responses are being passed back to the uml correctly. (Note that the host is doing Masq/NAT for the uml). I'm not sure if these are all just symptoms of some other problem or even if a closed fd0 matters. I'm bringing it up on the off chance that close is being called accidentally in one of the uml threads, or being called correctly but with an incorrect parameter of 0. Just for reference, here are the fd's for the parent tracing thread once it's stopped at named: [root@sparrow rh6.2]# ls -al /proc/29904/fd total 8 dr-x------ 2 wstearns wstearns 0 Nov 7 14:45 . dr-xr-xr-x 3 wstearns wstearns 0 Nov 7 14:45 .. lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 1 -> /dev/pts/8 lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 10 -> /home/wstearns/uml/rh6.2/root_fs lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 11 -> socket:[12210072] lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 2 -> /dev/pts/8 lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 3 -> /home/wstearns/uml/rh6.2/vm_file (deleted) lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 4 -> /home/wstearns/uml/rh6.2/vm_file (deleted) lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 5 -> /home/wstearns/uml/rh6.2/vm_file (deleted) lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 6 -> /home/wstearns/uml/rh6.2/vm_file (deleted) lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 7 -> /dev/ptyp0 lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 8 -> /dev/ptmx lrwx------ 1 wstearns wstearns 64 Nov 7 14:45 9 -> /dev/pts/20 Cheers, - Bill 'eating red herrings again for dinner?' Stearns --------------------------------------------------------------------------- "Assuming that piece of ear weighed about 1/2 an ounce, Bill [Gates] could afford to eat 443.39 pounds of Evander Holyfield if he were so inclined." -=- http://web.quuxuum.org/~evan/bgnw.html (Courtesy of James Mastros <ro...@je...>) -------------------------------------------------------------------------- William Stearns (wst...@po...). Mason, Buildkernel, named2hosts, and ipfwadm2ipchains are at: http://www.pobox.com/~wstearns LinuxMonth; articles for Linux Enthusiasts! http://www.linuxmonth.com -------------------------------------------------------------------------- |