Research Data Declines Rapidly with Article age
|
By HospiMedica International staff writers Posted on 30 Dec 2013 |
A new study reveals that the vast majority of raw data from old studies is missing, making the reproducibility of results, a cornerstone of science, unavailable.
Researchers at the University of British Columbia (Vancouver, Canada), Université Laval (Canada), and other institutions requested data sets from the authors of a relatively homogenous set of 516 articles published between 2 and 22 years ago, finding that the availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a data set being extant fell by 17% per year. As a result, more 90% of the oldest data were inaccessible, and even in papers published as recently as 2011, they were only able to track down the data for 23% of the studies.
In addition, the odds that the researchers could even locate a working e-mail address for the first, last, or corresponding author fell by 7% per year. Defunct addresses were listed on the paper itself, with web searches not turning up any current ones. For another 38% of studies, the researcher’s queries led to no response. And even when they received a reply, access to another7% of the data sets was inaccessible, or the data itself was lost. The study was published on December 19, 2013, in Current Biology.
“Everybody kind of knows that if you ask a researcher for data from old studies, they’ll hem and haw, because they don’t know where it is,” said lead author zoologist Timothy Vines, PhD, of the University of British Columbia. “Some of the time, for instance, it was saved on three-and-a-half inch floppy disks, so no one could access it, because they no longer had the proper drives.”
“Because the basic idea of keeping data is so that it can be used by others in future research, this sort of obsolescence essentially renders the data useless,” added Dr. Vines. “Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.”
Preserving raw data is important because it is impossible to predict in which directions research will move in the future. Dr. Vines, for instance, has been conducting research on a pair of toad species native to Eastern Europe that seem to be in the process of hybridizing. In the 1980s, a separate team of researchers was working on the same topic, and came across an old paper written in Polish that documented the distribution of the toads in the 1930s. Knowing that their distribution had changed relatively little over the intervening decades allowed the scientists to make calculations that would not have been possible otherwise.
Related Links:
University of British Columbia
Université Laval
Researchers at the University of British Columbia (Vancouver, Canada), Université Laval (Canada), and other institutions requested data sets from the authors of a relatively homogenous set of 516 articles published between 2 and 22 years ago, finding that the availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a data set being extant fell by 17% per year. As a result, more 90% of the oldest data were inaccessible, and even in papers published as recently as 2011, they were only able to track down the data for 23% of the studies.
In addition, the odds that the researchers could even locate a working e-mail address for the first, last, or corresponding author fell by 7% per year. Defunct addresses were listed on the paper itself, with web searches not turning up any current ones. For another 38% of studies, the researcher’s queries led to no response. And even when they received a reply, access to another7% of the data sets was inaccessible, or the data itself was lost. The study was published on December 19, 2013, in Current Biology.
“Everybody kind of knows that if you ask a researcher for data from old studies, they’ll hem and haw, because they don’t know where it is,” said lead author zoologist Timothy Vines, PhD, of the University of British Columbia. “Some of the time, for instance, it was saved on three-and-a-half inch floppy disks, so no one could access it, because they no longer had the proper drives.”
“Because the basic idea of keeping data is so that it can be used by others in future research, this sort of obsolescence essentially renders the data useless,” added Dr. Vines. “Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.”
Preserving raw data is important because it is impossible to predict in which directions research will move in the future. Dr. Vines, for instance, has been conducting research on a pair of toad species native to Eastern Europe that seem to be in the process of hybridizing. In the 1980s, a separate team of researchers was working on the same topic, and came across an old paper written in Polish that documented the distribution of the toads in the 1930s. Knowing that their distribution had changed relatively little over the intervening decades allowed the scientists to make calculations that would not have been possible otherwise.
Related Links:
University of British Columbia
Université Laval
Latest Health IT News
- Automated System Classifies and Tracks Cardiogenic Shock Across Hospital Settings
- Voice-Driven AI System Enables Structured GI Procedure Documentation
- EMR-Based Tool Predicts Graft Failure After Kidney Transplant
- Printable Molecule-Selective Nanoparticles Enable Mass Production of Wearable Biosensors
- Smartwatches Could Detect Congestive Heart Failure
- Versatile Smart Patch Combines Health Monitoring and Drug Delivery
Channels
Artificial Intelligence
view channel
AI Platform Interprets Real-Time Wearable Data for Parkinson’s Management
Parkinson’s disease presents fluctuating motor and non-motor symptoms that complicate day-to-day self-management and clinical decision-making. Care teams require timely, longitudinal insight into medication... Read more
Algorithm Identifies Cardiac Arrest Hotspots to Guide AED Placement
Out-of-hospital sudden cardiac arrest is common and usually fatal, and survival depends on rapid defibrillation. Many communities deploy automated external defibrillators without precise guidance, which... Read moreCritical Care
view channel
Review Advances Precision Care Pathway for Meningioma Management
Meningiomas are the most common primary brain tumors and account for about 37% of central nervous system tumors, often impairing neurological function depending on where they arise. Care has traditionally... Read more
3D Brain Imaging Marker Predicts Surgical Outcomes in Dementia-Causing Hydrocephalus
Idiopathic normal pressure hydrocephalus (iNPH) is an underdiagnosed cause of gait disturbance, urinary symptoms, and cognitive decline in older adults. Cerebrospinal fluid shunt surgery can alleviate... Read moreSurgical Techniques
view channel
Novel GERD Device Demonstrates Long-Term Safety in Real-World Study
Gastroesophageal reflux disease affects over 1 billion people worldwide, and many patients ultimately require procedural management. Surgical options exist but may introduce adverse effects such as dysphagia,... Read more
Pulsed Field Ablation Technology Showcases One-Year AF Outcomes
Catheter ablation for atrial fibrillation is drawing interest in nonthermal energy modalities that aim to spare non‑cellular tissue while creating durable lesions. Pulsed field ablation techniques that... Read more
Flexible Graphene Cortical Interface Enables Real-Time Mapping During Tumor Surgery
Safe, precise functional mapping during brain tumor surgery is critical to preserving speech and movement. Conventional metal electrodes can be rigid and less sensitive, limiting their ability to conform... Read morePatient Care
view channel
Wearable Sleep Data Predict Adherence to Pulmonary Rehabilitation
Chronic obstructive pulmonary disease (COPD) is a long-term lung disorder that makes breathing difficult and often disturbs sleep, reducing energy for daily activities. Limited engagement in pulmonary... Read more
Revolutionary Automatic IV-Line Flushing Device to Enhance Infusion Care
More than 80% of in-hospital patients receive intravenous (IV) therapy. Every dose of IV medicine delivered in a small volume (<250 mL) infusion bag should be followed by subsequent flushing to ensure... Read moreHealth IT
view channel
Automated System Classifies and Tracks Cardiogenic Shock Across Hospital Settings
Cardiogenic shock remains a difficult, time-sensitive emergency, with delayed identification driving poor outcomes and persistently high mortality. Many cases go undocumented even at advanced stages, hindering... Read more
Voice-Driven AI System Enables Structured GI Procedure Documentation
Documentation during gastrointestinal (GI) procedures often competes with real-time clinical decision-making and imposes a significant cognitive burden on physicians. Manual data entry and post-procedure... Read more
EMR-Based Tool Predicts Graft Failure After Kidney Transplant
Kidney transplantation offers patients with end-stage kidney disease longer survival and better quality of life than dialysis, yet graft failure remains a major challenge. Although a successful transplant... Read more
Printable Molecule-Selective Nanoparticles Enable Mass Production of Wearable Biosensors
The future of medicine is likely to focus on the personalization of healthcare—understanding exactly what an individual requires and delivering the appropriate combination of nutrients, metabolites, and... Read morePoint of Care
view channelBusiness
view channel
GE HealthCare, DeepHealth Expand AI Breast Imaging Collaboration
GE HealthCare and DeepHealth, a RadNet subsidiary, have expanded their collaboration to bring enhanced capabilities of DeepHealth’s new AI-powered Breast Suite to market, including ProFound Pro and Safeguard Review.... Read more







