[geeks] NFS for the last time (I hope)
Joshua D Boyd
geeks at sunhelp.org
Mon Dec 3 00:31:17 CST 2001
I have now pinned down the cause of my NFS trouble. It has something do to
with the packet fragments. Specifically, if I leave it at default (4096 byte
packets sizes on both read and writes), I seem lots of "icmp: ip reassembly
time exceeded" errors (when running tcpdump), and lots of packet fragment
messages. Further, I can always cause the NFS server to freeze up by doing
heavy writing (by freeze up, all programs trying to access NFS shares are
frozen). Heavy reading does nothing. The problem does not occur if all the
test systems are on a hub (although I still see the fragment messages, but
not the icmp reassembly errors), only when they are on the lanplex.
Now, if on the client systems, I set the read and write packet sizes to 1500,
things work perfectly. Since only writes freeze things up, I want to try
setting the packet size on writes to 1500, but leaving the reads at the default
4096. But before I can test that, I have to wait for the frozen processes to
come back so that I can unmount the NFS share without having to reboot.
Anyway, now the question is: what is the real cause? Is the lanplexes buffers
just being overrun by fragments?
Second, is just setting all clients to use a write (and perhaps read) packet
size of 1500 the proper way to fix this, or what? Until someone suggests
something better, this is what I plan to do.
BTW, I do see that my Lanplex has simm sockets. I wonder if beefing up the
memory would help, assuming that the simms really are for ram. I haven't yet
figured out exactly what they are for, but I'd assume they are covered
somewhere in the lanplex documentation online.
--
Joshua D. Boyd
More information about the geeks
mailing list