CPSC 489/IR/Spring 2002/Leggett/Programming Lab 5/Due April 11

WWW Indexing Bots

Write an interactive web-based application that uses Wget to mirror a set of web-based resources in preparation for inclusion in a digital library.

      Input:

            The web-based resources available by following NO MORE THAN two links
            (level 2, in-order, as discussed in class) from the class home page:
            www.csdl.tamu.edu/~leggett/courses/ugir/

            A button that produces the mirror.

      Output:

            The mirrored resources with the same directory structure and relative links.

            A message indicating that the mirror has been built successfully (or not!).

            A message indicating the number of files and total number of bytes downloaded.

            A button that is linked to the home page of the mirror.

Notes:

   1.    The web page(s) should be well-designed.

   2.    Do not retry URLs that fail.

   3.    When you have completed the lab, send me an email which includes your full name, userid, complete URL for testing, and your code (as ascii in the email itself). I will review your code, test the lab by producing and browsing the mirror as discussed above, and return the lab grade sometime after receiving your email.