Information Today
Volume 19, Issue 7 — July/August 2002
Table of Contents Previous Issues Subscribe Now! ITI Home
IT Report from the Field
Libraries in the Digital Age 2002
This conference focused on information seeking and digital services
by Emil Levine

"Integrating Information Seeking and IR" and "Information Services—Practice and Research" were the themes of the fourth Libraries in the Digital Age (LIDA) conference, held May 21­24 at the Interuniversity Centre in Dubrovnik, Croatia. Speakers contrasted information seeking and information retrieval (IR) and called for new models of relevance for the evaluation of information seeking systems. They also explored the concept of "information literacy" and presented examples of successful programs.

The conference drew 98 participants (70 from Croatia) from 10 countries and included 25 students. The format featured eight workshops, two tutorials, 37 papers, and a poster session. The European Chapter of the American Society for Information Science and Technology awarded cash prizes for the three best student posters. Two tutorials were also held. Tefko Saracevic, from the School of Communication, Information and Library Studies at Rutgers University in New Jersey, led "Perspectives of Information Science in a Digital Age" and Peter Ingwersen, from the Department of Information Studies at the Royal School of Library and Information Science in Copenhagen, Denmark, led "Perspectives of Webometrics: With a Special Emphasis on Small World Phenomena."

The event's co-directors were Saracevic; Ingwersen; Tatjana Aparac, from the Department of Information Sciences' Faculty of Philosophy at the University of Zagreb in Croatia; and Pertii Vakkari, from the Department of Information Studies at the University of Tampere in Finland.

Information Seeking

The opening remarks were provided by Ingwersen, who developed the clear distinction between "information seeking and information retrieval" (IS&R) and "information retrieval" itself. In his paper "Integrating IS&R," Ingwersen explained this difference by noting that (in Denmark) the concepts represent two different research communities that do not communicate with each other and that have few cross-citations. He emphasized, however, that both groups cite the same body of research literature from cognitive researchers. Ingwersen characterized "hard-core IR" by static experimentation, lack of knowledge about user behavior, and only rare cases of interactive information retrieval (IIR) utilization. He believes that members of the information seeking community are infrequent users of "systems" who have no concern for best-match algorithms. He said that the "old hands want to expand the field of information retrieval, while younger researchers are integrating information seeking and information retrieval.... This does not mean abandoning mainstream IR values, but [it] means interaction...."

Ingwersen listed the three revolutions in IR (citing researchers Stephen E. Robertson and Micheline Hancock-Beaulieu): the Interactive Revolution, in which IR systems are becoming increasingly interactive; the Cognitive Revolution, in which cognitive behavior places pressure on algorithmic IR; and the Relevance Revolution, which is expanding the classical concepts of relevance to "higher orders." (New models of relevance were featured within the information seeking theme.)

Ingwersen closed by saying that the "new generation" has a mutual interest in both groups, including dynamic information needs, information behavior applied to algorithmic relevance feedback, and simulated work tasks to keep control of natural experiments.

Following these opening remarks, Saracevic noted that information seeking is not part of the mainstream research in either Europe or the U.S., as "most money from the National Science Foundation (NSF) is going into computer science and less into information seeking research." Christine Borgman, a professor at UCLA's Department of Information Studies, disagreed, saying that in the U.S., six digital library projects recently received $25 million from the NSF for information seeking research and that another 25 to 30 projects are underway.

Carol C. Kuhlthau, a professor at Rutgers University's Library and Information Science Department, presented the "Longitudinal Study Methods in Information Seeking and Retrieval" workshop. Longitudinal studies, she explained, provide data over time and are particularly useful for documenting transition and change. She talked about several implications for the methodology in information retrieval, such as moving beyond the single incident/one-point-in-time approach to the consideration and accommodation of the information search process. She noted, in particular, Rutgers professor Nicholas Belkin's interactive approach to information retrieval.

In his workshop, titled "Evaluation Methods for Interactive Information Retrieval," Belkin offered the reasons that evaluation in this area is problematic and then addressed the issues. The problems stem fromthree sources. First, past measures are based on criteria for the evaluation of non-interactive "batch mode" systems. Second, the nature of interactive systems contradicts general concepts of evaluation (such as replicability). Third, people who are engaged in interactive information retrieval will behave differently. A searcher's prior knowledge will affect his or her use of the system. Other problems include evaluation criteria, replicability, knowledge and salience, and realism. As a model, he discussed these issues as they relate to TREC Interactive IR tracks. Usability, satisfaction, and efficiency criteria are at least as important as performance; user judgments of relevance can be utilized in place of external judges; and salience and realism can be simulated.

Kalervo Jarvelin, from the University of Tampere, presented the results of his paper "Perspectives on Laboratory-Based IR Research." The paper developed objections against traditional IR research and provided responses to them. These objections included uncertainty of needs, the lack of real users and real work tasks, the variety of collections, the insufficiency of recall and precision as measures, heavy averaging, and IR experiments in isolation. In defense of these criticisms, he said that users and tasks are not needed to test realism and that averaging is not a limitation of the model. He agreed that the variety of collections and the uncertainty of needs shouldbe looked at within the models. The concept of relevance was expanded to include algorithmic, topical, cognitive, and situational levels (which were the subject of separate presentations).

Jarvelin concluded by noting that the laboratory model is not dead. Rather, it's limited but is still needed for algorithm development. He said that broader frameworks are necessary; integration with information seeking is required; and that humans, work tasks, relevances, cognition, and novel criteria must be included and understood.

Relevance

In a session titled "Performance Indicators for IS&R," Ingwersen and Jarvelin introduced the use of new performance measures in information seeking and retrieval. They said that evaluations must concern process characteristics of work and that seeking and searching tasks and relevance must be multilayered, situational, and dynamic. These relevance levels should be algorithmic, topical, cognitive, situational, and socio-cognitive and should have degrees of relevance. They noted that in the reassessment of 38 TREC topics using degrees of relevancy, it was found that of the prior (binarily relevant) documents, 50 percent were marginally relevant, 33 percent fairly relevant, and 16 percent highly relevant. Thus, the comparison of binary and graded relevance may affect evaluation results.

Ingwersen then introduced the new relevance types of Intellectual Topicality, which is the aboutness relation of a query; Pertinence (cognitive), the perceived correspondence of information need; Situational, the relation as perceived between the task and the situation/problem; and Socio-cognitive, the task as perceived by a group in context.

In conclusion, Ingwersen stated that the laboratory IR model is needed for comfortable systems evaluation, but it may be extended toward users through the application of their proposed performance indicators. He said that thorough user-oriented evaluation requires broader scenarios and different types of indicators and that "standard metrics may remain a dream in this case."

Discussions of relevance were continued by Pertii Vakkari in a workshop titled "Relevance—Which Criteria?" He further defined system or algorithmic relevance as the ranked output of information objects that are commonly judged against an expert's binary relevance assessment. Vakkari then introduced the idea of "topicality," which asks how well (the topic of) the user's conceptual construct matches (the topic of) conceptual constructs in retrieved documents. He explored the impact of prior knowledge on these utilities, citing studies by Daniel Isenberg, Vimla L. Patel and Marco Ramoni, and Amanda Spink. He also cited longitudinal studies by Kuhlthau that differentiated the types of information needed in task-performance stages. Situational relevance, he concluded, is "a relationship between a person's information problem and the information that contributes to the resolution of that problem. Relevance depends on the contribution."

Vakkari also led a session titled "Task-Based Information Searching: Tactics, Terms and Search Results." He said that ideas from information seeking and information retrieval must be integrated and that this is not an issue for a single study but for a series of studies. He noted that "IR is a broader process of information seeking" and from information retrieval "human interaction with systems must be taken into account."

Vakkari next contrasted the search styles of graduate students in psychology and information science. For example, the number of search terms and tactics increased for information science students, but remained constant for psychology students. One conclusion of this longitudinal research is that the size and specificity of the user's search vocabulary grows during the stages of the task performance and that the use of operators and tactics depends on the searcher's knowledge of the systems.

Katriina Bystrom, from the Swedish School of Library and Information Studies,also discussed the evaluation of a system based on task performance in her paper "Information and Information Sources in a Task Context: Methods and Results." She reported on research that concerns the use of different information types by municipal administrators relative to task complexity. One conclusion based on detailed task diaries was that increasing task complexity fosters the use of people as information sources.

Peiling Wang, an associate professor at the University of Tennessee, presented the results of her paper "Research on Web Searching: User Modeling & System Design." The research was based on both log data and experiments with searchers, which included their verbalization (concurrent withtheir searching and retrospectively). The log data showed that the most-searched categories included entertainment and recreations, career, academe, sports and housing, people, places, and things. Most query structures were simple, did not use advanced features, were short (an average of two to three words), and used natural language. Iterations were rare in the case of failed searches. Wang's conclusions concerning system design were that a one-size-fits-all interface does not work and that instruction on systems use should be embedded in the interaction process. She said, "Users will not come for help."

The first track concluded with a panel discussion, titled "Research Questions and Methodology for IS&R." It was chaired by Jarvelin and featured Belkin, Borgman, Ingwersen, Kuhlthau, Vakkari, and Wang.

Belkin noted: "We are still far from achieving an understanding of information seeking behavior and how IS&R systems should be designed." Performance evaluation should be based on the task of the systems and people. Criteria other than relevance should be developed. Borgman stated that it's time for a "new paradigm for information seeking and information retrieval." She said that information seeking is not yet well-enough understood for laboratory experiments. Ingwersen stated that behavioral aspects should impact on design and called for more longitudinal studies. Kuhlthau said that such studies should concern the process of cognitive development and understanding tasks over time. Wang raised the issue of imbedding instruction as an aid for information seeking, especially in Web-based systems.

Information Services

The second theme, "Information Services—Practice and Research," was briefly introduced by Tefko Saracevic. Borgman's keynote speech, "Whither, or Wither, Libraries?" was based on her book From Gutenberg to the Global Information Infrastructure: Access to Information in the Networked World.

Borgman asked what the roles of libraries are in the information infrastructure and then listed and discussed four challenges: the invisible infrastructure, content and collections, preservation and access, and institutional boundaries. She explored the impact of digital libraries and emphasized that digitization does not equal preservation. Digitization could be a big time bomb, she stated, given the migration of systems and formats.

Borgman also discussed ways that librarians can maintain visibility in the digital environment and encouraged them to become proactive in all aspects of the parent organization. Despite the movement toward digital libraries, she noted that new libraries and facilities are continuing to be built to hold conventional collections, more books are being published, book fairs are flourishing, e-books are not yet doing well, anddual (paper and electronic) journals are doing better than their electronic counterparts.

Marija Dalbello, from Rutgers' School of Communications, Information and Library Studies, presented the results of her paper "Selection Criteria for Building
Digital Collections: Is There an Emerging Model?" She identified and examined over 400 digital projects that concern cultural analysis in the U.S. and Europe and classified each project into various categories for analysis. She used the ARL Digital Initiatives Database to obtain descriptions of 412 sites, most of which reflect digital conversion of historical material. Eighty percent of these could be browsed, but not searched.Despite their growth, Dalbello concluded that "emerging digital collections are not presenting sustained historical narratives in their presentation of material.... Digital libraries springing up in various institutions are as yet unconnected masses of fragments of data and visual information."

Information Literacy

David Bawden, Lyn Robinson, and Polona Vilar, from City University in London, led the session "Literacies for the Digital Library." The concept of "information literacy" was discussed, with its many synonymous terms, including digital literacy, computer/IT/network literacy, library literacy, media literacy/mediacy, and information fluency/information mastery/informacy. They noted the requirement for employment as one of the major motivators in becoming information literate. In discussion, Kuhlthau said: "We do not have the information literacy problem solved. More work is needed."

"Information Literacy in the Wired University" was the subject of a workshop by Marta Deyrup and Beth Bloom, from Seton Hall University Libraries in New Jersey. They described the creation of an information literacy program (which included online and multimedia tools) that's targeted to the undergraduate population.

The results of a paper titled "Accessing Electronic Information in U.K. Further and Higher Education. Findings from JUBILEE, a JISC Funded Research Project" were presented by Kathryn Ray, Graham Coulson, Debbie Proud, and Linda Banwell, from the School of Information Studies at the University of Northumbria in the U.K. The paper reviewed JUBILEE (Joint Information Systems Committee User Behavior in Information Seeking Longitudinal Evaluation of Electronic Information Services), a current 3-year research project. The study found that the strategy of imbedding electronic information services (EIS) into the university curriculum, tailoring user training, raising user awareness, and other factors has increased users' access and their ability to effectively use EIS.

Oliver Obst, from the Central Medical Library at Westphalian Wilhelms University in Germany, led a session titled "LOTSE is Coming Aboard." He described the Library Online Tour & Self-Paced Education (LOTSE), a tutorial designed to teach students library and information literacy using online tools.

Koraljka Golub, from the Department of Information Sciences' Faculty of Philosophy at the University of Zagreb, who last year discussed "Digital Collection for Blind and Visually Impaired People (in Croatia)," presented new research about information accessibility for the disabled in Croatia. Using W3C guidelines, Golub and her co-author, Nikolaj Lazic, found that none of the 16 Croatian public libraries included in the research conform to these guidelines. She then listed a number of libraries on the Web that are compliant.

"Legal Deposit of Digital Documents—Practices and Trends" was led by Nenad Prelog, assistant minister at Croatia's Ministry of Foreign Affairs, and his co-author, Hrvoje Stancic, from the University of Zagreb's Department of Information Sciences. Prelog discussed approaches in othercountries, as well as issues of accessibility, which are complicated by copyright and physical access. He noted other problems, including authenticity and the migration of storage media. There is "no simple all-in-one solution," he concluded.

The linguistic challenge of the Web and the new Europe was discussed by Maja Bratanic and Vesna Turcin, from HIDRA, The Croatian Information and Documentation Agency, in their workshop "Multilingual Searching Using Thesaurus: An Example of Croatian Versions of the European Union Thesaurus EUROVOC." They talked about their creation and use of the Croatian version of EUROVOC.

Robert Hayes, a professor emeritus at UCLA, was recognized at the conference dinner for his 25 years of consecutive service to the Croatia Library and its information science community. Hayes also presented a workshop titled "Models for Library Management, Decision-Making and Planning." This session was based on his recent book, which includes programs on a CD-ROM for modeling libraries.

The next LIDA conference will be held in Dubrovnik in May 2003. Further details about this year's event are available at http://www.ffzg.hr/infoz/lida.

Emil Levine is a library and information consultant living in Vienna, Austria. He recently designed the digital libraries for the United Nations Industrial Development Organization (UNIDO) and the United Nations Comprehensive Test Ban Treaty Organization (CTBTO). His e-mail address is emil.levine@chello.at.

Table of Contents Previous Issues Subscribe Now! ITI Home
© 2002 Information Today, Inc. Home