Home
Search

Journal of Data Science

Submit your article Information

Journal home
To appear
Current issue
All issues
More
Journal home To appear Current issue All issues

Detailed search

Title

Author

Types

Abstract

Keywords

Published

Pages

Volumes

Issues

DOI

Affiliation

Search results 892

Order by:

Select: All None Download:

Estimating Small Area Diabetes Prevalence in the US Using the Behavioral Risk Factor Surveillance System

Peter Congdon Patsy Lloyd

https://doi.org/10.6339/JDS.2010.08(2).583

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 8, Issue 2 (2010), pp. 235–252

Abstract

Abstract: Information regarding small area prevalence of chronic disease is important for public health strategy and resourcing equity. This paper develops a prevalence model taking account of survey and census data to derive small area prevalence estimates for diabetes. The application involves 32000 small area subdivisions (zip code census tracts) of the US, with the prevalence estimates taking account of information from the US-wide Behavioral Risk Factor Surveillance System (BRFSS) survey on population prevalence differentials by age, gender, ethnic group and education. The effects of such aspects of population composition on prevalence are widely recognized. However, the model also incorporates spatial or contextual influences via spatially structured effects for each US state; such contextual effects are allowed to differ between ethnic groups and other demographic categories using a multivariate spatial prior. A Bayesian estimation approach is used and analysis demonstrates the considerably improved fit of a fully specified compositional-contextual model as compared to simpler ‘standard’ approaches which are typically limited to age and area effects.

Encouraging Students to Think Critically: Regression Modelling and Goodness-of-Fit

Timothy E. O’Brien Suree Chooprateep2 Gerald M. Funk

https://doi.org/10.6339/JDS.2009.07(2).448

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 7, Issue 2 (2009), pp. 235–253

Abstract

Abstract: This note underscores important considerations that should be taken into account when teaching students to check for inadequacies of a given linear, nonlinear or logistic regression models. Key illustrations are provided which underscore the shortcomings of currently used procedures. A brief overview of nonlinear regression models is given in order to lay the foundation for testing for lack of fit in nonlinear models. This paper also introduces a new ’scaled’ binary logistic regression model to highlight po tential problems with the usual logistic model, and implications for choosing a robust optimal experimental design are also underscored and discussed. Key words: Lack of fit, logistic regression, nonlinear regression, optimal de

Statistical Functional Modeling of Quality Changes of Garlic under Different Storage Regimes

E.T. Castano E.S. Mercado F.G. Leon

https://doi.org/10.6339/JDS.2006.04(2).245

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 4, Issue 2 (2006), pp. 233–246

Abstract

Abstract: In this paper we analyze the weight loss behaviour of Mexican garlic under different storage conditions. Garlic is an important Mexican export product. Quality losses during storage are important to understand due to cost and sale opportunity implications. Weight losses profiles for each experimental conditions, represented as functions, are modeled by means of functional linear models and hypotheses tests are performed to compare treatments. Monte Carlo sampling version of permutation tests are used to obtain p-values. Using the functional approach clearly defined storage regimes that significantly decrease the speed of deterioration of the product relative to traditional Mexican agricultural practices.

On the Radical Views of Coal Miners in the Early Twentieth Century in Southern Illinois

Stephane E. Booth David E. Booth

https://doi.org/10.6339/JDS.2005.03(2).188

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 3, Issue 2 (2005), pp. 233–241

Abstract

Abstract: There has been great interest in the Southern Illinois mine war by historians. An explanation has been that this war was caused by miners who had radical political beliefs. We examine this view by applying four methods of ecological inference to estimate the proportion of coal miners who were socialist voters in this time period. Based on these results (especially considering the assumptions of the methods) we conclude that miners were politically less radical than previously thought.

Predicting Students’ Problem Solving Performance using Support Vector Machine

Young-Jin Lee

https://doi.org/10.6339/JDS.201604_14(2).0003

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 14, Issue 2 (2016), pp. 231–244

Abstract

This study investigates whether Support Vector Machine (SVM) can be used to predict the problem solving performance of students in the computerbased learning environment. The SVM models using RBF, linear, polynomial and sigmoid kernels were developed to estimate the probability for middle school students to get mathematics problems correct at their first attempt without using hints available in the computer-based learning environment based on their problem solving performance observed in the past. The SVM models showed better predictions than the standard Bayesian Knowledge Tracing (BKT) model, one of the most widely used prediction models in educational data mining research, in terms of Area Under the receiver operating characteristic Curve (AUC). Four SVM models got AUC values from 0.73 to 0.77, which is approximately 29% improvement, compared to the standard BKT model whose AUC was 0.58.

Application of the Pattern-Mixture Latent Trajectory Model in an Epidemiological Study with Non-Ignorable Missingness

Hiroko H. Dodge Mary Ganguli Changyu Shen

https://doi.org/10.6339/JDS.2008.06(2).410

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 6, Issue 2 (2008), pp. 231–246

Abstract

Abstract: In longitudinal studies where the same individuals are followed over time, bias caused by unobserved data raises a serious concern, particularly when the data are missing in a non-ignorable manner. One approach to deal with non-ignorable missing data is a pattern mixture model. In this paper, we combine the pattern mixture model with latent trajectory analysis using the SAS TRAJ procedure, which offers a practical solution to many problems of the same nature. Our model assumes a stochastic process that categorizes a relative large number of missing-data patterns into several latent groups, each of which has unique outcome trajectory, which allows patterns with missing values to share information with patterns with more data points. We estimated the longitudinal trajectories of a memory test over 12 years of follow-up, using data from the prospective epidemiological study of dementia. Missing data patterns were created conditional on survival, and final marginal response was obtained by excluding those who had died at each time point. The approach presented here is appealing since it can be easily implemented using common software.

Application of the Pattern-Mixture Latent Trajectory Model in an Epidemiological Study with Non-Ignorable Missingness

Hiroko H. Dodge Changyu Shen Mary Ganguli

https://doi.org/10.6339/JDS.2008.06(3).501

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 6, Issue 3 (2008), pp. 231–246

Estimating Vehicle Speed from Traffic Count and Occupancy Data

Martin L. Hazelton

https://doi.org/10.6339/JDS.2004.02(3).159

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 2, Issue 3 (2004), pp. 231–244

Exploratory Model Selection for Spatially Designed Experiments – Some Examples

Walter T. Federer

https://doi.org/10.6339/JDS.2003.01(3).124

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 1, Issue 3 (2003), pp. 231–248

A Folded Normal Slash Distribution and Its Applications to Non-negative Measurements

Wenhao Gui Pei-Hua Chen Haiyan Wu

https://doi.org/10.6339/JDS.2013.11(2).1142

Pub. online: 4 Aug 2022 Type: Research Article

Open Access

Journal: Journal of Data Science Volume 11, Issue 2 (2013), pp. 231–247

Abstract

Abstract: We introduce a new class of the slash distribution using folded normal distribution. The proposed model defined on non-negative measure ments extends the slashed half normal distribution and has higher kurtosis than the ordinary half normal distribution. We study the characterization and properties involving moments and some measures based on moments of this distribution. Finally, we illustrate the proposed model with a simulation study and a real application.

57 58 59 60 61

Items per page

Export citation

Copy and paste formatted citation

Formatted citation

Placeholder

Citation style

Download citation in file

Export format

Authors

Placeholder

RSS

Journal of data science

Online ISSN: 1683-8602
Print ISSN: 1680-743X

About

About journal

For contributors

Submit
OA Policy
Become a Peer-reviewer

Contact us

JDS@ruc.edu.cn
No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China