Research Data Declines Rapidly with Article age
|
By HospiMedica International staff writers Posted on 30 Dec 2013 |
A new study reveals that the vast majority of raw data from old studies is missing, making the reproducibility of results, a cornerstone of science, unavailable.
Researchers at the University of British Columbia (Vancouver, Canada), Université Laval (Canada), and other institutions requested data sets from the authors of a relatively homogenous set of 516 articles published between 2 and 22 years ago, finding that the availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a data set being extant fell by 17% per year. As a result, more 90% of the oldest data were inaccessible, and even in papers published as recently as 2011, they were only able to track down the data for 23% of the studies.
In addition, the odds that the researchers could even locate a working e-mail address for the first, last, or corresponding author fell by 7% per year. Defunct addresses were listed on the paper itself, with web searches not turning up any current ones. For another 38% of studies, the researcher’s queries led to no response. And even when they received a reply, access to another7% of the data sets was inaccessible, or the data itself was lost. The study was published on December 19, 2013, in Current Biology.
“Everybody kind of knows that if you ask a researcher for data from old studies, they’ll hem and haw, because they don’t know where it is,” said lead author zoologist Timothy Vines, PhD, of the University of British Columbia. “Some of the time, for instance, it was saved on three-and-a-half inch floppy disks, so no one could access it, because they no longer had the proper drives.”
“Because the basic idea of keeping data is so that it can be used by others in future research, this sort of obsolescence essentially renders the data useless,” added Dr. Vines. “Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.”
Preserving raw data is important because it is impossible to predict in which directions research will move in the future. Dr. Vines, for instance, has been conducting research on a pair of toad species native to Eastern Europe that seem to be in the process of hybridizing. In the 1980s, a separate team of researchers was working on the same topic, and came across an old paper written in Polish that documented the distribution of the toads in the 1930s. Knowing that their distribution had changed relatively little over the intervening decades allowed the scientists to make calculations that would not have been possible otherwise.
Related Links:
University of British Columbia
Université Laval
Researchers at the University of British Columbia (Vancouver, Canada), Université Laval (Canada), and other institutions requested data sets from the authors of a relatively homogenous set of 516 articles published between 2 and 22 years ago, finding that the availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a data set being extant fell by 17% per year. As a result, more 90% of the oldest data were inaccessible, and even in papers published as recently as 2011, they were only able to track down the data for 23% of the studies.
In addition, the odds that the researchers could even locate a working e-mail address for the first, last, or corresponding author fell by 7% per year. Defunct addresses were listed on the paper itself, with web searches not turning up any current ones. For another 38% of studies, the researcher’s queries led to no response. And even when they received a reply, access to another7% of the data sets was inaccessible, or the data itself was lost. The study was published on December 19, 2013, in Current Biology.
“Everybody kind of knows that if you ask a researcher for data from old studies, they’ll hem and haw, because they don’t know where it is,” said lead author zoologist Timothy Vines, PhD, of the University of British Columbia. “Some of the time, for instance, it was saved on three-and-a-half inch floppy disks, so no one could access it, because they no longer had the proper drives.”
“Because the basic idea of keeping data is so that it can be used by others in future research, this sort of obsolescence essentially renders the data useless,” added Dr. Vines. “Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.”
Preserving raw data is important because it is impossible to predict in which directions research will move in the future. Dr. Vines, for instance, has been conducting research on a pair of toad species native to Eastern Europe that seem to be in the process of hybridizing. In the 1980s, a separate team of researchers was working on the same topic, and came across an old paper written in Polish that documented the distribution of the toads in the 1930s. Knowing that their distribution had changed relatively little over the intervening decades allowed the scientists to make calculations that would not have been possible otherwise.
Related Links:
University of British Columbia
Université Laval
Latest Health IT News
- Digital Heart Model Supports Targeted Ablation in Atrial Fibrillation
- AI Framework Helps Clinicians Create Trustworthy Risk Prediction Tools
- AI Tool Screens for Primary Aldosteronism Using Routine EHR Data
- AI-Enabled ECG Software Predicts One-Year Atrial Fibrillation Risk
- AI-Native EHR Achieves EU Medical Device Certification
- EHR-Integrated Screening Workflow Detects Cognitive Impairment at Admission
- AI System Detects and Quantifies Chronic Subdural Hematoma
- Continuous Monitoring Platform Detects Infection Risk Across Care Transitions
- Automated System Classifies and Tracks Cardiogenic Shock Across Hospital Settings
- Voice-Driven AI System Enables Structured GI Procedure Documentation
- EMR-Based Tool Predicts Graft Failure After Kidney Transplant
Channels
Artificial Intelligence
view channel
AI Trends Report Guides Responsible, Effective Healthcare Deployment
Hospitals are under growing pressure to adopt artificial intelligence tools that improve safety, efficiency, and continuity of care without compromising quality. At the same time, clinicians need clearer... Read more
Privacy-Preserving AI Protects Sensitive Information in ECG Data
Artificial intelligence applied to electrocardiography can extract more than cardiac rhythm. Algorithms can infer age, sex, race, and even identity from electrocardiogram (ECG) signals, creating privacy... Read moreCritical Care
view channel
Inhaled Analgesic Matches Morphine for Prehospital Trauma Pain
Managing acute pain in the first minutes of prehospital care is difficult when intravenous access is delayed or impossible. Patients with fractures, falls, or traffic injuries often endure significant... Read more
Medical Drone Program Improves Blood Access and Patient Survival
Delayed access to blood products can be fatal for patients with postpartum hemorrhage and traumatic bleeding, and it also drives waste in hospital inventories. Rwanda’s mountainous terrain has historically... Read more
FDA Clears Tongue-Applied Neuromodulation System for Stroke Gait Rehabilitation
Gait impairment after stroke is a persistent driver of fall risk, loss of independence, and reduced quality of life. Despite structured rehabilitation, many adults continue to experience dynamic gait deficits... Read moreSurgical Techniques
view channel
Aortic Arch Remodeling Device Improves Type I Dissection Repair
Acute DeBakey Type I aortic dissection is an emergent tear of the ascending aorta that can extend into the arch and descending thoracic aorta. Rapid ischemic complications and high early mortality make... Read more
Intravesical CAR T Therapy Shows Promise for Bladder Cancer Treatment
Bladder cancer is common and frequently recurs after initial therapy, exposing patients to repeated procedures and cumulative toxicity. High‑risk disease often progresses despite intravesical drugs or... Read morePatient Care
view channel
AI Avatar Doctor Improves Patient Understanding Before Radiotherapy
Radiation oncology consultations require patients to grasp complex concepts quickly, yet anxiety and information overload often undermine understanding and informed consent. Poor comprehension can also... Read more
Wearable Sleep Data Predict Adherence to Pulmonary Rehabilitation
Chronic obstructive pulmonary disease (COPD) is a long-term lung disorder that makes breathing difficult and often disturbs sleep, reducing energy for daily activities. Limited engagement in pulmonary... Read moreHealth IT
view channel
Digital Heart Model Supports Targeted Ablation in Atrial Fibrillation
Atrial fibrillation is an erratic, quivering heartbeat and a leading cause of stroke. Catheter ablation is widely used to interrupt arrhythmogenic tissue, yet many patients—especially with persistent ... Read moreAI Framework Helps Clinicians Create Trustworthy Risk Prediction Tools
Artificial intelligence (AI) is increasingly used to estimate risks for conditions such as sepsis, heart disease, and cancer, yet many models remain difficult for clinicians to interpret or trust.... Read morePoint of Care
view channel
AI Dermatology Platform Targets Early Detection of Non-Melanoma Skin Cancer
Keratinocyte skin cancers, including basal cell and squamous cell carcinoma, account for a substantial share of dermatology workload in the United States and require accurate triage to guide biopsy decisions.... Read more







