#!/usr/bin/ruby # Creates a map of a given URI. Crawls the URI given and finds every link # in a depth-first search. For each onsite link found, a node is created. # For every link found by visiting that node, an edge is created. And so # on. # # VERY RESOURCE INTENSIVE. I recommend 2G+ of RAM or a limitation on the # depth of the crawl (-r). # # Requires the installation of ruby-graphviz: # # gem install -r ruby-graphviz # # Hawler: # # wget http://spoofed.org/files/hawler/Hawler.gem && gem install Hawler.gem # # grapviz/dot: # # apt-get install graphviz (debian, ubuntu) # # Jon Hart # # Copyright (c) 2008, Jon Hart # All rights reserved. # # Redistribution and use in source and binary forms, with or without # modification, are permitted provided that the following conditions are met: # * Redistributions of source code must retain the above copyright # notice, this list of conditions and the following disclaimer. # * Redistributions in binary form must reproduce the above copyright # notice, this list of conditions and the following disclaimer in the # documentation and/or other materials provided with the distribution. # * Neither the name of the nor the # names of its contributors may be used to endorse or promote products # derived from this software without specific prior written permission. # # THIS SOFTWARE IS PROVIDED BY Jon Hart ``AS IS'' AND ANY # EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED # WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE # DISCLAIMED. IN NO EVENT SHALL BE LIABLE FOR ANY # DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES # (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; # LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER