non-trivial AWK and GNU AWK scripts I've written
Given an email (ie spam) as input, try determine appropriate abuse contact information for the email, including SMTP servers it passed through and for hosts/ISPs of URLs given within the email. Don't think I finished it though, or got it production-ready.
Find duplicated files on a file system according to their size, MD5 and SHA-1 checksums and, optionally, replace any redundant copies with hardlinks in order to save space. I wrote this because a user of mine is incapable of managing his files and has a tendency to copy over folders of photos from his camera multiple times to many different locations - using up literally many many GBs of space for no good reason.
This is a small but powerful and useful script. Also demonstrates ARGV/ARGC parsing in GAWK.
Convert firewalling rules specified in psuedo-english to Linux iptables rules. See the iptables directory for more information.
Generate and install reject routes based on input of several well-known blacklists.