KMWorld CRM Media Streaming Media Faulkner Speech Technology Unisphere/DBTA
PRIVACY/COOKIES POLICY
Other ITI Websites
American Library Directory Boardwalk Empire Database Trends and Applications DestinationCRM EContentMag Faulkner Information Services Fulltext Sources Online InfoToday Europe Internet@Schools Intranets Today KMWorld Library Resource Literary Market Place OnlineVideo.net Plexus Publishing Smart Customer Service Speech Technology Streaming Media Streaming Media Europe Streaming Media Producer Unisphere Research



For commercial reprints or PDFs contact Lauri Weiss-Rimler (lwrimler@infotoday.com)
Magazines > Online Searcher
Back Forward

ONLINE SEARCHER: Information Discovery, Technology, Strategies

Media Kit [PDF] Rate Card [PDF]
Editorial Calendar [PDF] Author Guidelines
SUBSCRIBE NOW! HOME

Disappearing and Disappeared Data
By
Volume 41, Number 2 - March/April 2017

Information professionals like their data to be stable and not to disappear. In that, they’re no different from researchers in other disciplines, particularly since it’s frequently these other researchers for whom information professionals are finding the data. Whether it’s time series, datasets, research reports, scientific studies, or digital texts, we want continuity.

Websites that change because of updated information are welcomed, of course. Data stability doesn’t preclude the addition of new data. It’s massive changes in older data or its outright vanishing that makes sources suspect and scares researchers.

Instability in data can happen for benign reasons. An agricultural time series can be thrown off kilter when a commodity is added or removed. Perhaps it was recently introduced in a region or no longer grown there. Data can also disappear when it’s wrong. Incorrectly gathered data—a population subset omitted, instrumentation malfunctions, or inaccurate analysis—should be withdrawn. Worse, if fraud is involved, that data should not see the light of day. I find stunning the number of retracted scholarly articles tracked by RetractionWatch.com.

Sometimes data disappears inadvertently. A domain name is not renewed. The information previously there is gone. If the domain name is then picked up by another entity, the original nature of the source becomes entirely different. I know of a respected conference website transmogrified to a porn site. The Indiana personal finance government website mentioned in this issue’s Dollar Sign column is another example.

Loss of funding can make a website unsustainable. Particularly susceptible are academic digitization projects funded by grants. When the grant runs out, local funding may not be sufficient, or even available, to keep the digitized materials online. It’s not just grant funding. Government agencies can decide to no longer fund a project, leading to data disappearing. GLIN, the Global Legal Information Network, lost U.S. funding in 2012 and is now attempting a comeback as the GLIN Foundation (glinf.org).

In a worst-case scenario, data disappears by design. A government decides that scientific environmental research reports, documentation of animal abuse, or earth science and atmospheric datasets will be scrubbed from its websites. When scientific data collides with a politician’s belief, too often it’s the data that loses. Suppression of research findings makes citizens’ access to information impossible. Vaporizing existing data does a disservice to humanity. Cutting back on government data collection, as has happened (and more is threatened) with the U.S. Census, leads to bad public policy. If local governments don’t have data, they can’t adequately plan for public services. The business community loses its ability to obtain essential information for growth.

A letter signed by 66 public interest institutions requests that the U.S. Office of Management and Budget remind government agencies that they are legally required to “give public notice before removing online government information” (openthegovernment.org). Librarians can champion the preservation of website data, either by archiving the sites at the Internet Archive or with newer initiatives such as Data Refuge (ppehlab.org/datarefuge). Data will persist if information professionals remain vigilant. Libraries should be dynamic places, but data should be stable and not disappear.


Marydee Ojala is Editor-in-Chief of Online Searcher (the successor journal to ONLINE) and writes its business research column ("The Dollar Sign"). She contributes feature articles and news stories to Information TodayEContentComputers in LibrariesIntranetsCyberSkeptic's Guide to the InternetBusiness Information Review, and Information Today's NewsBreaks. A long-time observer of the information industry, she speaks frequently at conferences, such as WebSearch University, Internet Librarian, Online Information (London, UK), Internet Librarian International, and national library meetings outside the U.S. She has adjunct faculty status at the School of Library and Information Science at IUPUI (Indiana University Purdue University Indianapolis). Her professional career began at BankAmerica Corporation, San Francisco, directing a worldwide program of research and information services. She established her independent information research business in 1987. Her undergraduate degree is from Brown University and her MLS was earned at the University of Pittsburgh.

 

Comments? Contact the editors at editors@onlinesearcher.net

       Back to top