Abstract: The development and application of computational data mining techniques for financial fraud detection and business failure prediction has recently become a popular cross-disciplinary research area, involving financial economists, forensic accountants and computational modellers. Several of the computational techniques popularly used for financial fraud detection and business failure prediction can also be effectively applied to the detection of fraudulent insurance claims, and can therefore be of immense practical value to the insurance industry. We provide a comparative analysis of the prediction performance of a battery of data mining techniques using real-life automotive insurance fraud data. While the data used in our paper is US-based, the computational techniques we have tested can be adapted and applied to detect similar insurance frauds in other countries where an organized automotive insurance industry exists.
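The comparative exercise the abstract describes can be sketched in a few lines. This is not the paper's code or data: it uses a synthetic, class-imbalanced dataset as a stand-in for insurance-claim records, and an arbitrary selection of classifiers scored by cross-validated AUC.

```python
# Minimal sketch of a classifier comparison on imbalanced (fraud-like) data.
# Synthetic data and the model list are illustrative assumptions, not the paper's setup.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB

# 10% positive class, mimicking the rarity of fraudulent claims.
X, y = make_classification(n_samples=500, n_features=10,
                           weights=[0.9, 0.1], random_state=0)

models = {
    "logistic": LogisticRegression(max_iter=1000),
    "tree": DecisionTreeClassifier(random_state=0),
    "forest": RandomForestClassifier(random_state=0),
    "naive_bayes": GaussianNB(),
}
# Cross-validated ROC AUC per model; AUC is a sensible metric under class imbalance.
scores = {name: cross_val_score(m, X, y, cv=5, scoring="roc_auc").mean()
          for name, m in models.items()}
```

On real claims data one would additionally handle categorical claim attributes and calibrate decision thresholds to the cost of missed fraud versus false alarms.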
Abstract: Through a series of carefully chosen illustrations from biometry and biomedicine, this note underscores the importance of using appropriate analytical techniques to increase power in statistical modeling and testing. These examples also serve to highlight some of the important recent developments in applied statistics of use to practitioners.
Abstract: Supervised classification of biological samples based on genetic information (e.g., gene expression profiles) is an important problem in biostatistics. In order to find classification rules that are both accurate and interpretable, variable selection is indispensable. This article explores how an assessment of the individual importance of variables (effect size estimation) can be used to perform variable selection. I review recent effect size estimation approaches in the context of linear discriminant analysis (LDA) and propose a new, conceptually simple effect size estimation method that is at the same time computationally efficient. I then show how to use effect sizes to perform variable selection based on the misclassification rate, which is the data-independent expectation of the prediction error. Simulation studies and real data analyses illustrate that the proposed effect size estimation and variable selection methods are competitive. In particular, they lead to both compact and interpretable feature sets. Program files to be used with the statistical software R implementing the variable selection approaches presented in this article are available from my homepage: http://b-klaus.de.
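The general idea of effect-size-based variable selection for LDA can be illustrated as follows. This is a generic sketch, not the article's proposed estimator: it ranks variables by a Cohen's-d-like standardized mean difference on simulated "expression" data and fits LDA on the top-ranked subset.

```python
# Sketch: rank variables by a standardized-mean-difference effect size,
# keep the largest effects, and classify with LDA on that compact subset.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(1)
n, p = 60, 200                       # 60 samples, 200 simulated "genes"
y = rng.integers(0, 2, n)
X = rng.normal(size=(n, p))
X[y == 1, :5] += 1.5                 # only the first 5 genes carry signal

# Per-variable effect size: group-mean difference over pooled within-group SD.
m1, m0 = X[y == 1].mean(0), X[y == 0].mean(0)
s = np.sqrt((X[y == 1].var(0, ddof=1) + X[y == 0].var(0, ddof=1)) / 2)
effect = np.abs(m1 - m0) / s

top = np.argsort(effect)[::-1][:5]   # the 5 variables with the largest effects
lda = LinearDiscriminantAnalysis().fit(X[:, top], y)
acc = lda.score(X[:, top], y)
```

The article's method additionally ties the selection threshold to the misclassification rate rather than to a fixed number of variables; the sketch fixes the subset size for simplicity.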
As the COVID-19 pandemic has strongly disrupted people’s daily work and life, a great amount of scientific research has been conducted to understand the key characteristics of this new epidemic. In this manuscript, we focus on four crucial epidemic metrics of COVID-19, namely the basic reproduction number, the incubation period, the serial interval and the epidemic doubling time. We collect relevant studies based on COVID-19 data in China and conduct a meta-analysis to obtain pooled estimates of the four metrics. From the summary results, we conclude that COVID-19 has stronger transmissibility than SARS, implying that stringent public health strategies are necessary.
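The pooling step of such a meta-analysis is typically an inverse-variance weighted average of the study-level estimates. A minimal fixed-effect sketch, using made-up study estimates of the basic reproduction number (the numbers are illustrative, not from the paper):

```python
# Fixed-effect inverse-variance pooling of study-level estimates.
import numpy as np

# Hypothetical study estimates of R0 with their standard errors.
r0 = np.array([2.2, 2.7, 3.1, 2.5])
se = np.array([0.3, 0.4, 0.5, 0.2])

w = 1.0 / se**2                       # inverse-variance weights
pooled = np.sum(w * r0) / np.sum(w)   # pooled point estimate
pooled_se = np.sqrt(1.0 / np.sum(w))  # SE of the pooled estimate
```

A random-effects model would widen `pooled_se` by adding a between-study variance component to each weight, which is usually preferable when studies differ in setting and method.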
Compound distributions derive their importance from the fact that natural factors have compound effects, as in medical, social and logical experiments. Dubey (1968) introduced the compound Weibull by compounding the Weibull distribution with the gamma distribution. The main aim of this paper is to define a bivariate generalized Burr (compound Weibull) distribution whose marginals have univariate generalized Burr distributions. Several properties of this distribution, such as the marginals, conditional distributions and product moments, are discussed. The maximum likelihood estimates of the unknown parameters of this distribution and their approximate variance-covariance matrix are obtained. Simulations are performed to assess the performance of the MLEs, and a real data analysis is presented for illustrative purposes.
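The Weibull-gamma compounding underlying the (univariate) Burr construction is easy to verify by simulation. In one standard parameterization, if X given theta has Weibull survival exp(-theta x^c) and theta follows a Gamma(k, 1) mixing distribution, the marginal survival is (1 + x^c)^(-k), a Burr XII form; the sketch below checks this empirically (parameter values are arbitrary choices for illustration):

```python
# Simulate the gamma-compounded Weibull and compare its empirical survival
# function to the Burr XII survival (1 + x**c)**(-k).
import numpy as np

rng = np.random.default_rng(0)
k, c, n = 2.0, 1.5, 200_000

theta = rng.gamma(shape=k, scale=1.0, size=n)        # gamma mixing variable
# Given theta, X = (E/theta)**(1/c) with E ~ Exp(1) has survival exp(-theta*x**c).
x = (rng.exponential(size=n) / theta) ** (1.0 / c)

grid = np.array([0.5, 1.0, 2.0])
emp = np.array([(x > g).mean() for g in grid])       # empirical survival
burr = (1.0 + grid**c) ** (-k)                       # Burr XII survival
```

The bivariate generalization in the paper shares a common mixing variable across two Weibull components, which induces dependence while keeping Burr marginals.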
In this paper, we introduce a new lifetime model, called the Generalized Weibull-Burr XII distribution. We discuss some of its mathematical properties, such as the density and hazard rate functions, the quantile function and moments. The maximum likelihood method is used to estimate the model parameters. A simulation study is performed to assess the performance of the maximum likelihood estimators in terms of biases and mean squared errors. Finally, we show that the proposed distribution is very competitive with other classical models by means of an application to a real data set.
Abstract: In this paper, we propose power weighted quantile regression (PWQR), which can effectively reduce the effect of heterogeneity of the conditional densities of the response and improve the efficiency of quantile regression. This article also shows that the weighted proportion of observations whose actual value is less than the PWQR estimate is very close to the corresponding quantile. Finally, this article establishes the relationship between geomagnetic indices and GIC. Motivated by the problems of secure power system operation, we construct a GIC risk value table, which is practical to use and provides important guidance for the secure operation of power systems.
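The quantile-coverage property the abstract refers to can be seen in the simplest (unweighted, intercept-only) case: the minimizer of the pinball (check) loss is the sample quantile, so the fraction of observations below the fit is close to the target level tau. This sketch is a generic illustration of that mechanism, not the PWQR estimator itself:

```python
# Fit a constant by minimizing the pinball loss and check that the fraction
# of observations below the fit is close to the target quantile level tau.
import numpy as np

rng = np.random.default_rng(2)
y = rng.normal(size=5000)
tau = 0.25

def pinball(q):
    # Check-function loss of quantile regression: tau*r for r >= 0, (tau-1)*r otherwise.
    r = y - q
    return np.mean(np.where(r >= 0, tau * r, (tau - 1) * r))

grid = np.linspace(y.min(), y.max(), 2001)
q_hat = grid[np.argmin([pinball(q) for q in grid])]
below = (y < q_hat).mean()   # should be close to tau
```

PWQR replaces the uniform weighting in the loss with power weights so that the same coverage property holds approximately while the estimator gains efficiency under heterogeneous conditional densities.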
Abstract: An individual in a finite population is represented by a random variable whose expectation is linearly composed of explanatory variables and a personal effect. This expectation locates her (his) random variable on a scale when s(he) responds to a questionnaire item or physical instrument. This formulation reinterprets design-based sampling, which represents an individual as a constant waiting to be observed. Retaining constant expectations, however, along with fixed realizations of random variables, preserves and strengthens design-based theory through the Horvitz-Thompson (1952) theorem. This interpretation reaffirms the usual design-based regression estimates, whose normality is seen to be free of any assumptions about the distribution of the outcome variable. It also formulates response error in a way that renders a superpopulation, postulated by model-based sampling, unnecessary. The value of distribution-free regression is illustrated with an analysis of American presidential approval.
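The Horvitz-Thompson theorem invoked above rests on a simple estimator: weight each sampled unit's value by the inverse of its inclusion probability to estimate the population total unbiasedly. A small simulation sketch (hypothetical population values, equal inclusion probabilities under Poisson sampling):

```python
# Horvitz-Thompson estimate of a population total: sum of y_i / pi_i over the sample.
import numpy as np

rng = np.random.default_rng(3)
N = 1000
y_pop = rng.gamma(2.0, 10.0, size=N)    # hypothetical fixed population values
pi = np.full(N, 0.1)                    # inclusion probability of each unit
sampled = rng.random(N) < pi            # Poisson sampling: independent inclusions

ht_total = np.sum(y_pop[sampled] / pi[sampled])  # HT estimator of the total
true_total = y_pop.sum()
```

Unbiasedness requires only that every unit's inclusion probability is known and positive; no distributional assumption on the y values is needed, which is the design-based point the abstract makes.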
The statistical modeling of natural disasters is an indispensable tool for extracting information useful for disaster prevention and the reduction of casualties. The Poisson distribution can reveal the characteristics of a natural disaster. However, this distribution is insufficient for modeling the clustering of natural events and the related casualties. The best approach is to use the Neyman type A (NTA) distribution, which allows two or more events to occur in a short time. We obtain some properties of the NTA distribution and suggest that it provides a suitable description for analyzing the distribution of natural disasters and casualties. We support this argument using disaster events, including earthquakes, floods, landslides, forest fires, avalanches, and rock falls, in Turkey between 1900 and 2013. The data strongly support the NTA distribution as the main tool for handling these disaster data. The findings indicate that approximately three earthquakes, fifteen landslides, five floods, six rock falls, six avalanches, and twenty-nine forest fires are expected in a year. The results from this model suggest that the probability of the total number of casualties is highest for earthquakes and lowest for rock falls. This study also finds that the expected number of natural disasters is approximately 64 per year, and the inter-event time between two successive earthquakes is approximately four months. The inter-event time across all natural disasters is approximately six days in Turkey.
Abstract: The Asian financial crisis that struck most of the East Asian countries in 1997 has caught the attention of many researchers in finance and economics. This is due to the realization that during the crisis the affected countries saw their currencies depreciate by more than 50% and their stock markets fall sharply by about 30% to 50%. In this paper, we investigate the relationship among the stock market returns of three Southeast Asian (ASEAN) countries, Malaysia, Singapore and Thailand, using monthly data between 1990 and 2004. We find that the three stock markets are not cointegrated. Therefore, instead of modelling the returns data using linear vector autoregressive (VAR) models, we assume the returns data are regime-dependent and use a two-regime multivariate Markov switching vector autoregressive (MS-VAR) model, with regime shifts in both the mean and the variance, to extract common regime-shift behaviour from the return series. We find that the MS-VAR model with two regimes manages to detect common shifts in all the stock market return series, which shows evidence of comovement among the three return series. Furthermore, the MS-VAR model also captures the timing of the 1997 financial crisis in the three countries satisfactorily.
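The data-generating mechanism behind such a model, a latent Markov chain switching the mean and variance of returns, can be sketched in miniature. This is a univariate simulation with made-up parameters to illustrate the regime-shift idea, not the paper's multivariate MS-VAR estimation:

```python
# Simulate returns whose mean and variance switch with a hidden 2-state Markov chain
# (state 0: calm regime, state 1: crisis regime). All parameter values are illustrative.
import numpy as np

rng = np.random.default_rng(5)
T = 1000
P = np.array([[0.95, 0.05],        # transition probabilities: rows sum to 1;
              [0.10, 0.90]])       # regimes are persistent, as in crisis episodes
mu = np.array([0.01, -0.03])       # regime-dependent mean return
sigma = np.array([0.02, 0.08])     # regime-dependent volatility

s = np.empty(T, dtype=int)
s[0] = 0
for t in range(1, T):
    s[t] = rng.choice(2, p=P[s[t - 1]])       # latent Markov regime chain
r = rng.normal(mu[s], sigma[s])               # returns with shifts in mean and variance
```

Estimation reverses this simulation: given only `r`, an MS-VAR fit recovers the transition matrix and regime-specific parameters and assigns each date a smoothed regime probability, which is how the 1997 crisis period is dated.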