Ephemeral Paths on the WWW:
The Walden's Paths Lightweight Path Mechanism

Richard Furuta, Frank M. Shipman III, Luis Francisco-Revilla, Haowei Hsieh, Unmil Karadkar, and Shueh-Cheng Hu

Center for the Study of Digital Libraries and
Department of Computer Science
Texas A&M University
College Station, TX 77843-3112

furuta, shipman@csdl.tamu.edu


Walden's Paths provides a mechanism for overlaying linear paths over World-Wide Web material. The path structure does not alter the pre-existing structure of the original material, but provides a means for organizing and contextualizing the collection of material. This paper first reviews the design of Walden's Paths and our experience with it. It then presents a new mechanism that enables the provision of ephemeral "lightweight" paths, which may be generated by a computer process.

1. Introduction

Walden's Paths enables the provision of directed paths over the World-Wide Web. In other words, it permits the specification of a meta-structure over the Web--a structure that brings together materials potentially found at widely-separated locations on the Web, forming a single, cohesive unit while at the same time leaving the pages' original structure and content untouched and accessible. Although the function provided by Walden's Paths is somewhat similar to that provided by bookmark lists, the path implementation provides the reader a stronger dynamic sense of context than do static lists. Navigational context is provided through addition of user controls that enable traversal through the path while representing position within the whole. Additional contextualization can be provided by a path's author through the addition of annotations that can be used to help the reader understand the page's material in the context of their own environment and experience.

The usefulness of paths (particularly linear paths) in organizing and contextualizing hypertextual material has been recognized earlier, including in Vannevar Bush's pioneering article "As We May Think" [Bush 1945]. Trigg [Trigg 1988] and Zellweger [Zellweger 1988, Zellweger 1989] reported on path implementations in stand-alone systems in the late 1980's. Even so, implementing paths over the World-Wide Web has required addressing novel issues, particularly issues related to the heterogeneous ownership of and decentralized administrative control over Web materials.

Our initial application of Walden's Paths, beginning in the Fall of 1995, has been supporting K-12 classroom use of Web materials. Teachers recognize the potential for classroom use of the material found on the World-Wide Web, but at the same time realize that the bulk of the material available today is not tailored for use by their students. Teachers face the challenge of adapting the available material to meet their own curricular goals, to meet their teaching styles, and indeed to resonate with their students' knowledge, reading, and maturity levels. We have reported further on these experiences in earlier papers [Shipman et al. 1996, Shipman et al. 1997, Furuta et al. 1997, Shipman et al. 1998, Shipman et al. 1999]; we will review these findings briefly in the next section of this paper. Another project with similar educational goals has been described by Nichol [Nichol 1996].

A primary use of these paths has been as a communication artifact--as a means for a teacher to organize material for student use, as a means for students to collect material as a virtual term paper, and as a means for students to communicate found resources to other students. Correspondingly, the mechanisms provided by the Walden's Paths system support temporal persistence in paths--authoring and annotation of paths is supported explicitly by a separate interface; paths, once "published", appear in a centralized site directory; and issues of sharing and of path maintenance are being addressed by our current research activities.

Paths, however, have additional uses beyond that of supporting interpersonal communication. They can be useful to an individual as an organizing vehicle--as a means of arranging discovered resources for subsequent inspection. They can provide navigational assistance that can aid a reader in making sure that all selected resources have been inspected. Such applications require paths that can be reconfigured dynamically--paths that perhaps are generated initially by a computer process and then are manipulated and pruned as the reader inspects the resources contained.

To increase our understanding of these applications, we have extended Walden's Paths to incorporate ephemeral paths--paths that provide a service upon creation but that are expected to be regenerated if needed subsequently rather than retained. We consider this to be a "lightweight" path mechanism--lightweight not from the standpoint of implementation but from the standpoint of user expectation. Indeed, the implementation costs of lightweight paths are approximately equal to that of the earlier-mentioned persistent paths. However, since they are not catalogued, no specific user-selectable "handle" is provided that encourages their general access. Consequently, such paths are intended to be a personal, rather than public, artifact.

The next section of this paper contains a discussion of the design, implementation, and use of the publicly-available version of Walden's Paths. Following this, we return our attention to lightweight paths in section 4, presenting examples of their use in section 5. Section 6's discussion and conclusions complete the paper.

2. Walden's Paths

Walden's Paths is our instantiation of a meta-document structure over Web-based information. The system allows for linear sequences of Web pages with associated annotations. The ordering of pages in a path is independent of the normal navigational structure of the Web pages (their embedded links). Path authors can use the annotation to provide context, commentary, focus, or missing information for the path reader.

Walden's Paths includes four primary components, the Path Authoring Tool, the Path Publisher, the Path Database, and the Path Server. Figure 1 shows the Walden's Paths architecture, illustrating the communications that occur between the major components. The two components visible to users, the Path Authoring Tool and the Path Server, will be described in more detail below. (The WWW Browser and the HTTP Server included in the diagram are external to the Walden's Paths system.)

Figure 1: The Walden's Paths Architecture

Figure 2: The Walden's Paths Authoring Tool

The Path Authoring Tool is a Java interface that combines searching for information on the Web with selecting, ordering, and annotating pages into a path. The Path Authoring Tool supports an authoring paradigm that cycles between posing Web queries, examining and evaluating the results of those queries, transferring relevant pages into the path, and annotating and arranging entries in the path.

Figure 2 shows a snapshot taken during authoring. Three primary areas are defined in this display. The upper left list holds the results of a Web query (the Path Authoring Tool allows the query to be posed to any of a number of commonly-available Web search engines). The lower left list is a workspace, holding pages that are being stored for later use. The right hand list is the path that is currently being edited. Pages can be viewed by clicking on any of the listed designators--this causes the page to be displayed in an instance of the Netscape browser running separately on the workstation. Pages in the search list can be moved to the workspace or to the path and pages can be moved back and forth between workspace and path. Additionally, pages can be entered manually into the path, can be moved up and down in the list, and can be annotated.

Completed or partially authored paths are stored in the Path Database. When an author is ready for readers to see his/her path, the path information (an ordered list of URL's, associated annotations, and additional display attributes) is transferred to the Path Server (the path author indicates this by selecting the "publish" option for the path).

The Path Server, implemented in C and invoked as a CGI program, takes the path information and produces the frame set displaying the control-flow buttons in the upper-left, the annotation in the upper-right, and the source Web page from the original site in the lower frame (see Figure 3). The control-flow frame provides the path reader with a sense of where they are in relation to the overall path, access to later pages in the path, and a pointer to where the source page comes from so the page can be viewed directly, without the Walden's Paths frames on top, if so desired.

Figure 3: A stop along a path.

Figure 4: Going off the path provides reader with quick way back.

Links in the source page are active and, besides changing the contents of the bottom frame, change the upper-left frame to a "back to path" button so the reader can quickly get back to where he/she left the path. Figure 4 shows an example of this when browsing off of the page. Here, the reader has selected the "FACILITIES" button in the lower window shown in Figure 3.

The Path Server provides additional services to the reader, including a path-author-customizable caching mechanism that helps speed up access. Further details on these mechanisms can be found in our earlier papers. The Path Server itself (as well as the other programs that make up the Walden's Paths system) are maintained in versions for Sun Solaris and for Windows NT. The paths displayed by the path server is not browser-specific or platform-specific, although the current version requires the provision of frames. In practice, current versions of Netscape and Internet Explorer are both being used to access paths.

Early versions of Walden's Paths have been used by teachers taking part in summer teaching workshops and by teachers and students in eight high-school art and music classrooms. It is these experiences that have determined subsequent development, including the lightweight path mechanism.

In both workshop and actual classroom situations, one of the most striking outcomes of Walden's Paths use is how quickly the path technology fades into the background. Teachers and students find the path metaphor very natural and, after a short period of learning about the system, quickly focus on the content within paths or technological limitations of the Web in general. This is a positive outcome as it indicates the ability for the technology to slip into the background in the classroom, allowing teachers and students to focus on what they are teaching/learning.

Uses of Walden's Paths have varied greatly in the teaching styles of users and the curricular materials being supported. Uses of Walden's Paths include: (1) paths as a reading activity with questions for students to answer; (2) as a slide show for evaluating student knowledge of art history; (3) as a set of materials used to support a constructive activity, like music composition; and (4) as a writing environment for students to create reports on particular topics [Shipman et al. 1999]. These rhetorical structures exhibited by path uses fall into three overlapping categories: (1) extended bookmark lists, (2) extended tours, and (3) curriculum-based expositions [Furuta et al. 1997].

One lesson of our experiences is that while teachers find paths easy to integrate in the classroom, they find it difficult to find time to author paths from scratch. Many of the paths used in the classroom were authored during training sessions rather than during the semester when they are spending most of their time in the classroom. It is this observation that has driven the need for greater authoring support for teachers and the more general notion of lightweight paths.

3. Lightweight paths

The persistent paths described in the previous section suggest the specification, refinement, and annotation of entries by a human author. The investment of time and effort is reflected in an expectation that such paths will be long-lived--that they will be catalogued, reused, shared, and maintained.

Generation of lightweight paths assumes the involvement of a computer process. They are intended to enable the application of Walden's Paths representations to computationally-generated collections of resources--results from queries, subsets or supersets of previously-developed collections, and applications customized for an individual or environment. Since lightweight paths are ephemeral, it seems unwise to support their fine-tuned tailoring or other large-scale investment of human energy in their specification.

Our implementation of lightweight paths fits into the general Walden's Paths architecture shown earlier in Figure 1. For lightweight paths, an additional path insertion interface has been added to the Path-Database, replacing the functionality of the Path Authoring Tool. This interface adds a temporary path to the database, but does not register the name in the path directory (i.e., does not publish the path). The interface then returns a handle to its caller, in the form of a URL, that can be used to access the path.

Lightweight paths use the same description format as persistent paths, but their entries are flagged as ephemeral and include information about their creation and retention times. The URL returned to the caller includes a flag that notifies the Path Server that the path is lightweight rather than persistent. Once the description is located, the Path Server's functionality is unchanged from that described before.

A process wishing to generate a lightweight path first creates a path file, passing it to the insertion interface. Once registered, the returned handle is used to access the lightweight path. A CGI program creating a lightweight path, then, can, in the expected case, generate an HTML page containing a redirection to the lightweight path (i.e., the ONLOAD attribute of the BODY element), or it can carry out further processing to use the lightweight path in whatever way it wishes.

4. Applications of lightweight paths

In this section, we will examine some scenarios of lightweight paths in use.

4.1. Organizing the results from a search query

Another group in our Center is developing a resource for humanities scholars studying the life and works of Miguel de Cervantes Saavendra (1547-1616). Cervantes, perhaps best known as the author of Don Quijote de la Mancha is a figure of immense significance in Spanish culture. Credited as the inventor of the modern novel, his works, and his use of language, receive careful scrutiny by modern-day researchers.

This project maintains a set of Web pages at http://www.csdl.tamu.edu/cervantes/. One of the important components of that site is an on-line, searchable, edition of Cervantes' complete works. A scholar seeking, for example, the uses of the term "fortuna" in the first book of the Don Quijote would pose a query as shown in Figure 5. The traditional result of such a query is a list of responses, as shown as Figure 6. It is likely in this case that the scholar will next examine the use context for each occurrence of the term. Carrying out such an examination will require selection of an element on the list, examination of the page that is then displayed (the page on which the match has occurred), selection of the browser's back button, and finally selection of the next element on the list. Each new selection requires reorientation within the list, to determine first which element had just been selected and then to choose the next in the list. When the list is long, the selection of each element in turn becomes an error-prone operation, as it is quite easy to re-select an element already visited or to skip an element accidentally.

Figure 7 shows the result of the query, implemented as a lightweight Walden's Path. In this implementation, each search result is shown as a separate stop in the path. In the Figure, we are at the first stop. A brief report summarizing the match is shown as the path annotation and the complete text of the page on which the match occurred is shown as the path stop's content (the matching words are highlighted in both places). Selecting the "next" arrow will take the scholar to the next query match. In addition, a link is provided in the matched page display that permits display of the full text of the Don Quijote edition. Selecting this link results in an off-the-path traversal, as shown in Figure 8. Consequently, return to the path can then be accomplished with a single mouse click on the button in the navigation window; this is the standard Walden's Paths mechanism.

Figure 5: A query posed to the Cervantes search engine, seeking all occurrences of the term "fortuna" in the 1605 edition of Don Quijote, book one.

Figure 6: The result of the query, presented in traditional list format.

Figure 7: The first result from the query, presented as a Walden's Paths entry.

Figure 8: Traversing the provided link to see the entire text of the Don Quijote edition.

Figure 9: Selecting pages to retain from the path resulting from the earlier query.

Figure 10: Choosing "persistent" paths for elision.

Path elision

The previous scenario has assumed that all pages selected by the search are relevant to the questioner. What if irrelevant material has been selected, a likely outcome of an automatic selection process, and the researcher wants to pick and choose the items worthy of closer inspection? Path elision permits the selection for display of a subset of the path.

We view lightweight paths as the intermediary representation that permits implementation of a set of small "filters"--relatively simple bits of executable code that accept a path (either lightweight or persistent) as input and produce a modified (lightweight) path as output. One such filter permits a degree of path customization--allowing the selection of pages to retain and those to remove from a previously-generated path. As Figure 9 shows, this allows the modification of the previous search-generated lightweight path. The resulting path, also a lightweight path, would then be browsed with the standard Walden's Paths interface.

Such filters are useful when applied to persistent paths, as well. Figure 10 shows a modified directory listing for the persistent paths stored at a particular site. Application of the elision filter allows the reader to customize the path to be viewed. In this example, the reader might wish, for example, to focus on one of the projects of the several that are reported in the tour of our Center's activities.

4.2. Potential areas of application for lightweight paths

We are considering the development of additional, more sophisticated, path generators and path filters for applications that are inconvenient to implement with persistent paths. These potential applications of lightweight paths include those in which the dynamic nature of the underlying content make persistent paths impractical, those in which individually customized versions of a path are appropriate, and those in which the content of the path may vary depending on the specified or identified characteristics of the reader.

Paths over transient materials

Many forms of information are temporally-transient but persistently structured. The contents of a newspaper change on a continuing basis but the general form of the newspaper remains constant (e.g., the sections into which it is divided, the relative importance of the front and inner pages, the significance of placement of materials on a page). Weather satellite pictures provide a consistent set of information, but visible-light images are only useful for areas of the globe that are in daylight.

We envision path generators that are able to locate the current versions of a set of resources (perhaps using different sources at different times of day), annotate them as appropriate to their context, and arrange them into lightweight paths for presentation.

Paths generated according to specified rules

Persistent paths have been used in creating on-line versions of course examinations (see [Shipman et al. 1998]). We envision applications that use lightweight paths to create customized tests from provided question pools using defined criteria. For example, some examinations for types of Federal licenses draw questions from a publically-available pool, with the distribution of the questions' subject areas specified by regulation. Other criteria might include randomization of order so that no two students had the same exact question sequence.

Paths generated in response to student strengths or weaknesses

A natural extension to testing mechanisms would be the use of the testing results to identify a set of tutorial material to aid the student in correcting deficiencies. We envision the presentation of such collections of material as being implemented by lightweight paths.

Template-generated paths

Customization of materials for students of differing skill levels may also be a possible application of lightweight paths. Here we envision a family of paths each with the same backbone structure, and with each drawing materials from a persistently-defined pool. However, the presentation of the specific topics along the path may be varied based on the individual student's age, grade level, interests, or other factors.

Randomly-generated paths

Finally, we envision applications that will benefit from randomly-generated paths. We have observed the use of persistent paths as a virtual "tour guide"--presenting an overview of an organization, perhaps. Randomized pages could be used, for example, to show selections from an art gallery's collection, or to show some tips about use of a software package.

5. Discussion and conclusions

Our experience with Walden's Paths has illustrated to us the strength and utility of applying meta-structures over the World-Wide Web. Our existing implementation provides a relatively simple linear path structure. This has had the desirable property of unobtrusively enabling the provision of guidance and context to readers. Our current research includes investigation of richer path structures, with the goal of retaining the same degree of path transparency while enriching the expressiveness of the path structures.

While it appears that meta-structures have positive benefit, it is also clear that authoring meta-structure requires effort and attention. This observation is not surprising, as it is a characteristic of any creative and organizational activity. Nevertheless, it does suggest that the benefits gained from authoring should be in proportion to the effort expended.

In this paper, we have reported on a mechanism to allow application of paths in contexts in which the ephemeral nature of the application mandates against large-scale investment of an author's time. We have demonstrated that heuristic approaches to building meta-structure can be effective for some applications in this class. Our assumption is that the paths created using the lightweight path mechanism also will be ephemeral, and consequently will not be catalogued or retained for reuse. In applications suggesting reuse, therefore, the further assumption is that the resources required to rebuild a lightweight path should be similar to the resources required to locate a path in a persistent collection.

Consequently, we have provided mechanisms at two ends of the persistent-ephemeral spectrum. However, the categorization of many documents and structures ranges between fixed and fluid, influenced particularly by the application. Buckland [Buckland 1997] illustrates this clearly in surveying the differing notions of how to define "document". Levy [Levy 1994] considers similar issues in his discussion of literary hypertextual presentations, and Levy and Marshall extend these thoughts into the Digital Library domain [Levy and Marshall 1995].

We see similar influences in the future directions of our project and expect to extend our model of persistent-ephemeral paths as it develops. We are examining a number of such extensions, already. These include extension of the directory mechanism to reflect categorization of paths along multiple dimensions, one of which may be the required temporal persistence of the path. We also see potential for generalizing our model of the authoring process, using lightweight paths to represent intermediate versions of a path while under development.


This material is based, in part, upon work supported by the National Science Foundation under Grant No. IIS-9812040. It also is supported, in part, by an award from Texas A&M University's Interdisciplinary Research Initiatives Program.

The project's Web pages are located at http://www.csdl.tamu.edu/walden/.


[Buckland 1997]
M.K. Buckland, "What is a 'Document'?", Journal of the American Society for Information Science, 48, 9 (September 1997), pp. 804-809.
[Bush 1945]
V. Bush, "As We May Think", Atlantic Monthly, July 1945, pp. 101-108.
[Furuta et al. 1997]
R. Furuta, F. Shipman, C. Marshall, D. Brenner, and H. Hsieh, "Hypertext Paths and the World-Wide Web: Experiences with Walden's Paths", Hypertext'97 Proceedings, 1997, pp. 167-176.
[Levy 1994]
D.M. Levy, "Fixed or Fluid? Document Stability and New Media", European Conference on Hypermedia Technology 1994 Proceedings, ACM, 1994, pp. 24-31.
[Levy and Marshall 1995]
D.M. Levy and C.C. Marshall, "Going Digital: A Look at Assumptions Underlying Digital Libraries", Communications of the ACM, 38, 4 (April 1995), pp. 77-84.
[Nichol 1996]
D. Nichol, "'Footsteps' linearsing hypertext", 1996. http://www-interact.eng.cam.ac.uk/footsteps/footsteps.html
[Shipman et al. 1996]
F. Shipman, C. Marshall, R. furuta, D. Brenner, H. Hsieh, and V. Kumar, "Creating Educational Guided Paths over the World-Wide Web", Proceedings of ED-TELECOM 96, pp. 326-331.
[Shipman et al. 1997]
F. Shipman, C. Marshall, R. Furuta, D. Brenner, H. Hsieh, and V. Kumar, "Using Networked Information to Create Educational Guided Paths", International Journal of Educational Telecommunications (IJET), 3, 4 (1997), pp. 383-400.
[Shipman et al. 1998]
F. Shipman, R. Furuta, D. Brenner, C. Chung, H. Hsieh, "Using Paths in the Classroom: Experiences and Adaptations", ACM Hypertext '98 Proceedings, 1998, pp. 267-276.
[Shipman et al. 1999]
F.M. Shipman III, R. Furuta, H. Hsieh, L. Francisco-Revilla, U. Karadkar, A. Rele, G.V. Shenoy, D.A. Brenner, "Using the Internet in the Classroom: Variety in the Use of Walden's Paths", internal report, submitted for publication, 1999.
[Trigg 1988]
R.H. Trigg, "Guided Tours and Tabletops: Tools for Communicating in a Hypertext Environment", ACM Transactions on Office Information Systems, 6, 4 (October, 1988), pp. 398-414.
[Zellweger 1988]
P.T. Zellweger, "Active paths through multimedia documents", Document Manipulation and Typography (Proceedings of the International Conference on Electronic Publishing, Document Manipulation, and Typography, Nice France, April 20--22, 1988), Cambridge University Press, 1988, pp. 19-34
[Zellweger 1989]
P.T. Zellweger, "Scripted Documents: A Hypertext Path Mechanism", Proceedings of the ACM Hypertext '89 Conference, 1989, pp. 1-26.