[Tfug] finding linked pages

Robert Hunter hunter at tfug.org
Thu May 1 00:15:47 MST 2008


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wed, Apr 30, 2008 at 11:08:58PM -0700, christopher wrote:
> Do you know of a way to find all the linked pages on a
> site? Is there such a thing? I'd go through and just do
> it for everything on their server, but they have a ton
> of old pages that don't get used any more, so I thought
> I might be able to skip those. Thanks ~ chris

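Discovering the pages is a graph traversal problem: start at the home
page, follow every link, and keep track of what you've already seen.
Old pages that nothing links to are never reached, so they get skipped
for free.  I wouldn't be surprised if you could do it with wget, one of
the coolest programs ever (its recursive mode, -r, follows links).  But
this may not be necessary if you can use server-side includes, where
you wrap each page with your banner.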
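
For what it's worth, here is a rough sketch of that traversal in
Python, standard library only.  It is an illustration, not a tested
crawler: the names linked_pages, fetch_html and LinkParser are made up
for this example, it assumes "on the site" means "same hostname as the
start URL", and it only looks at <a href> links (no frames, image maps
or JavaScript).

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch_html(url):
    """Download a page; return "" on network errors or non-HTML content."""
    try:
        with urlopen(url, timeout=10) as resp:
            if "html" not in resp.headers.get("Content-Type", ""):
                return ""
            return resp.read().decode("utf-8", errors="replace")
    except (OSError, ValueError):
        return ""


def linked_pages(start_url, fetch=fetch_html):
    """Breadth-first traversal of same-host pages reachable from start_url.

    `fetch` is any callable mapping a URL to its HTML text; it is a
    parameter so the traversal can be tried without touching the network.
    """
    host = urlparse(start_url).netloc
    start = urldefrag(start_url).url      # drop any "#fragment"
    seen = {start}                        # pages already discovered
    queue = deque([start])                # pages still to be read
    while queue:
        page = queue.popleft()
        parser = LinkParser()
        parser.feed(fetch(page))
        for href in parser.hrefs:
            # Resolve relative links against the current page.
            target = urldefrag(urljoin(page, href)).url
            if urlparse(target).netloc == host and target not in seen:
                seen.add(target)
                queue.append(target)
    return sorted(seen)
```

Calling linked_pages() with the site's home page URL should give back
the list of pages that are actually linked; anything orphaned on the
server never enters the queue.  A real run would also want a delay
between requests and some respect for robots.txt.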
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)

iD8DBQFIGW4jJ1pz6tWxufARAshFAJoCcPgvBNsdPR+8YhdmCz6L8g/ehwCePTfH
gtE3h9gD2EG7QBm6YmmZRsk=
=ChtX
-----END PGP SIGNATURE-----



