Thomas Lumley 8/14/2022

Tracking down a Real Data Set(tm)

Read Original

The author describes the investigative process of tracking down the original source and context of a dataset on urinary tract infection risk, which is used in multiple R packages (elrm, logistf) and cited in academic papers. The article highlights challenges in data provenance, discrepancies between dataset versions, and the importance of accurate metadata for statistical analysis and reproducibility in scientific computing.

Tracking down a Real Data Set(tm)

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser