Journal of Data Science

Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, April 2024

Order by:

Select: All None Download:

Editorial: Inquire, Investigate, Implement, Innovate – Symposium on Data Science and Statistics 2023

Emily Dodwell Amanda A. Koepke

https://doi.org/10.6339/24-JDS222EDI

Pub. online: 4 Jun 2024 Type: Editorial

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 173–175

Evaluating Perceptual Judgements on 3D Printed Bar Charts

Tyler Wiederich Susan VanderPlas

https://doi.org/10.6339/24-JDS1131

Pub. online: 24 May 2024 Type: Data Science In Action

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 176–190

Abstract

A Multi-Model Framework to Explore ADHD Diagnosis from Neuroimaging Data

Yagmur Yavuz Ozdemir Naga Chandra Padmini Nukala Roberto Molinari All authors (4)

https://doi.org/10.6339/24-JDS1128

Pub. online: 2 May 2024 Type: Data Science In Action

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 191–207

Abstract

A Platform for Large Scale Statistical Modelling in R

Jason Cairns Simon Urbanek Paul Murrell

https://doi.org/10.6339/24-JDS1132

Pub. online: 24 May 2024 Type: Computing In Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 208–220

Abstract

Spatial-Temporal Extreme Modeling for Point-to-Area Random Effects (PARE)

Carlynn Fagnant Julia C. Schedler

Katherine B. Ensor

https://doi.org/10.6339/24-JDS1133

Pub. online: 24 May 2024 Type: Statistical Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 221–238

Abstract

Producing Fast and Convenient Machine Learning Benchmarks in R with the stressor Package

Sam Haycock Brennan Bean Emily Burchfield

https://doi.org/10.6339/24-JDS1123

Pub. online: 4 Jun 2024 Type: Statistical Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 239–258

Abstract

Interaction Selection and Prediction Performance in High-Dimensional Data: A Comparative Study of Statistical and Tree-Based Methods

Chinedu J. Nzekwe Seongtae Kim Sayed A. Mostafa

https://doi.org/10.6339/24-JDS1127

Pub. online: 22 May 2024 Type: Statistical Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 259–279

Abstract

Predictive modeling often ignores interaction effects among predictors in high-dimensional data because of analytical and computational challenges. Research in interaction selection has been galvanized along with methodological and computational advances. In this study, we aim to investigate the performance of two types of predictive algorithms that can perform interaction selection. Specifically, we compare the predictive performance and interaction selection accuracy of both penalty-based and tree-based predictive algorithms. Penalty-based algorithms included in our comparative study are the regularization path algorithm under the marginality principle (RAMP), the least absolute shrinkage selector operator (LASSO), the smoothed clipped absolute deviance (SCAD), and the minimax concave penalty (MCP). The tree-based algorithms considered are random forest (RF) and iterative random forest (iRF). We evaluate the effectiveness of these algorithms under various regression and classification models with varying structures and dimensions. We assess predictive performance using the mean squared error for regression and accuracy, sensitivity, specificity, balanced accuracy, and F1 score for classification. We use interaction coverage to judge the algorithm’s efficacy for interaction selection. Our findings reveal that the effectiveness of the selected algorithms varies depending on the number of predictors (data dimension) and the structure of the data-generating model, i.e., linear or nonlinear, hierarchical or non-hierarchical. There were at least one or more scenarios that favored each of the algorithms included in this study. However, from the general pattern, we are able to recommend one or more specific algorithm(s) for some specific scenarios. Our analysis helps clarify each algorithm’s strengths and limitations, offering guidance to researchers and data analysts in choosing an appropriate algorithm for their predictive modeling task based on their data structure.

Testing Perceptual Accuracy in a U.S. General Population Survey Using Stacked Bar Charts

Kiegan Rice

Heike Hofmann

Nola du Toit

All authors (4)

https://doi.org/10.6339/24-JDS1121

Pub. online: 13 Mar 2024 Type: Statistical Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 280–297

Abstract

Precision Medicine: Interaction Survival Tree for Recurrent Event Data

Yushan Yang Chamila Perera Philip Miller All authors (5)

https://doi.org/10.6339/24-JDS1126

Pub. online: 17 Apr 2024 Type: Statistical Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 298–313

Abstract

Demonstrative Evidence and the Use of Algorithms in Jury Trials

Rachel Rogers

Susan VanderPlas

https://doi.org/10.6339/24-JDS1130

Pub. online: 2 May 2024 Type: Education In Data Science

Open Access

Journal: Journal of Data Science Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, pp. 314–332

Abstract

1 2

Items per page

RSS

Volume 22, Issue 2 (2024): Special Issue: 2023 Symposium on Data Science and Statistics (SDSS): “Inquire, Investigate, Implement, Innovate”, April 2024

Export citation

Copy and paste formatted citation

Download citation in file

Authors