Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 21, Issue 2 (2023): Special Issue: Symposium Data Science and Statistics 2022
  4. ‘You Draw It’: Implementation of Visuall ...

Journal of Data Science

Submit your article Information
  • Article info
  • More
    Article info

‘You Draw It’: Implementation of Visually Fitted Trends with r2d3
Volume 21, Issue 2 (2023): Special Issue: Symposium Data Science and Statistics 2022, pp. 281–294
Emily A. Robinson ORCID icon link to view author Emily A. Robinson details   Reka Howard   Susan VanderPlas  

Authors

 
Placeholder
https://doi.org/10.6339/22-JDS1083
Pub. online: 12 January 2023      Type: Computing In Data Science      Open accessOpen Access

Received
29 July 2022
Accepted
23 December 2022
Published
12 January 2023

Abstract

How do statistical regression results compare to intuitive, visually fitted results? Fitting lines by eye through a set of points has been explored since the 20th century. Common methods of fitting trends by eye involve maneuvering a string, black thread, or ruler until the fit is suitable, then drawing the line through the set of points. In 2015, the New York Times introduced an interactive feature, called ‘You Draw It,’ where readers are asked to input their own assumptions about various metrics and compare how these assumptions relate to reality. This research is intended to implement ‘You Draw It’, adapted from the New York Times, as a way to measure the patterns we see in data. In this paper, we describe the adaptation of an old tool for graphical testing and evaluation, eye-fitting, for use in modern web-applications suitable for testing statistical graphics. We present an empirical evaluation of this testing method for linear regression, and briefly discuss an extension of this method to non-linear applications.

Supplementary material

 Supplementary Material
• ‘You Draw It’ Demonstration Applet: The shiny app used to demonstrate the ‘You Draw It’ method can be accessed at https://emily-robinson.shinyapps.io/can-you-draw-it/. • Participant Data (Linear): De-identified participant data collected in the linear study and used for analyses are available to be downloaded from GitHub at https://github.com/earobinson95/sdss-2022-you-draw-it-manuscript/raw/master/data/eyefitting-model-data.csv. • Participant Data (Non-linear): De-identified participant data collected in the non-linear study and used for analyses are available to be downloaded from GitHub at https://github.com/earobinson95/sdss-2022-you-draw-it-manuscript/raw/master/data/youdrawit-model-data.csv. • Data Analysis Code: The code used to replicate the linear and non-linear study analysis in this paper can be found at https://github.com/earobinson95/sdss-2022-you-draw-it-manuscript/blob/master/analysis/data-analysis.Rmd.

References

 
Aisch G, Cox A, Quealy K (2015). You draw it: How family income predicts children’s college chances. The New York Times, 28.
 
Bates D, Mächler M, Bolker B, Walker S (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1): 1–48. https://doi.org/10.18637/jss.v067.i01.
 
Bostock M, Ogievetsky V, Heer J (2011). D3 data-driven documents. IEEE Transactions on Visualization and Computer Graphics, 17(12): 2301–9.
 
Casey SA (2015). Examining student conceptions of covariation: A focus on the line of best fit. Journal of Statistics Education, 23(1).
 
Casey SA (2016). Finding what fits. Mathematics Teaching in the Middle School, 21(8): 484–91.
 
Casey SA, Nagle C (2016). Students’ use of slope conceptualizations when reasoning about the line of best fit. Educational Studies in Mathematics, 92(2): 163–77.
 
Casey SA, Wasserman NH (2015). Teachers’knowledge about informal line of best fit. Statistics Education Research Journal, 14(1): 8–35.
 
Ciccione L, Dehaene S (2021). Can humans perform mental regression on a graph? Accuracy and bias in the perception of scatterplots. Cognitive Psychology, 128: 101406.
 
Deming EW (1943). Statistical Adjustment of Data.
 
Finney DJ (1951). Subjective judgment in statistical analysis: An experimental study. Journal of the Royal Statistical Society: Series B (Methodological), 13(2): 284–97.
 
Furic B (2017). “You-Draw-It-Graph: The User Can Draw His Own Graph Line and Compare with the Reality.” GitHub. https://github.com/BenoitFuric/you-draw-it-graph.
 
Geidner N, Pjesivac I, Imre I, Coman I, Yuran D (2015). The role of interactive graphics in reducing misperceptions in the electorate. Visual Communication Quarterly, 22(3): 133–45.
 
George-Palilonis J, Spillman M (2011). Interactive graphics development: A framework for studying innovative visual story forms. Visual Communication Quarterly, 18(3): 167–77.
 
George-Palilonis J, Spillman M (2013). Storytelling with interactive graphics: An analysis of editors’ attitudes and practices. Visual Communication Quarterly, 20(1): 20–27.
 
Mosteller F, Siegel AF, Trapido E, Youtz C (1981). Eye fitting straight lines. The American Statistician, 35(3): 150–52.
 
Nagle C, Casey S, Moore-Russo D (2017). Slope and line of best fit: A transfer of knowledge case study. School Science and Mathematics, 117(1–2): 13–26.
 
Pearce A (2019). “You-Draw-It.” Observable. https://bl.ocks.org/1wheel/07d9040c3422dac16bd5be741433ff1e.
 
R Core Team (2022). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
 
Strayer N, Luraschi J, Allaire JJ (2020). R2d3: Interface to ‘D3’ Visualizations. https://CRAN.R-project.org/package=r2d3.
 
Unwin AR, Wills G (1988). Eyeballing time series. In: Proceedings of the 1988 ASA Statistical Computing Section, 263–68.
 
US Energy Information Administration (2022). Weekly U.S. All Grades All Formulations Retail Gasoline Prices (Dollars Per Gallon). US Energy Information Administration Independent Statistics & Analysis. https://www.eia.gov/dnav/pet/hist/LeafHandler.ashx?n=PET&s=EMM_EPM0_PTE_NUS_DPG&f=W.
 
VanderPlas S, Hofmann H (2015). Spatial reasoning and data displays. IEEE Transactions on Visualization and Computer Graphics, 22(1): 459–68.
 
Wattenberger A (2019). Fullstack D3 and Data Visualization: Build Beautiful Data Visualizations and Dashboards with D3. Fullstack.io.
 
Wood SN (2017). Generalized Additive Models: An Introduction with R. 2nd ed. Chapman; Hall/CRC.

PDF XML
PDF XML

Copyright
2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Metrics
since February 2021
681

Article info
views

325

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy