Abstract: We have developed an automated scheme for linking PUBMED citations with GO terms using the Support Vector Machine (SVM), a classification algorithm. The PUBMED database, with over 12 million citations, has been essential to life science researchers. More recently, GO (Gene Ontology) has provided a graph structure for the biological process, cellular component, and molecular function of genomic data. By text mining the textual content of PUBMED citations and associating them with GO terms, we have built an ontological map between these databases so that users can search PUBMED via GO terms and, conversely, GO entries via PUBMED classification. Consequently, some interesting and unexpected knowledge may be captured from them for further data analysis and biological experimentation. This paper reports our results on the SVM implementation and the need to parallelize the training phase.
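The citation-to-term linking described above is, at its core, supervised text classification. A minimal sketch of that idea, assuming scikit-learn is available; the toy abstracts and GO-term labels below are hypothetical stand-ins for real PUBMED/GO data, not the authors' system:

```python
# Sketch: linear SVM over TF-IDF features for assigning GO-style
# labels to short biomedical texts. Data are illustrative only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
from sklearn.pipeline import make_pipeline

docs = [
    "kinase activity regulates signal transduction",
    "mitochondrial membrane transport of ions",
    "phosphorylation cascade in signal transduction",
    "ion channel activity in the mitochondrial membrane",
]
labels = ["signal transduction", "membrane transport",
          "signal transduction", "membrane transport"]

# TF-IDF turns each citation into a sparse vector; LinearSVC
# learns a separating hyperplane per label.
clf = make_pipeline(TfidfVectorizer(), LinearSVC())
clf.fit(docs, labels)
print(clf.predict(["protein kinase signaling pathway"]))
```

Training a linear SVM over millions of citations is the expensive step, which is what motivates the parallelization discussed in the paper.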
Abstract: Image de-noising is the process of removing noise from an image that has been corrupted by it. Wavelet methods are among the various approaches for recovering infinite-dimensional objects such as curves, densities, and images. Wavelet techniques are very effective at removing noise because of their ability to capture the energy of a signal in a few transform values. These methods are based on shrinking the wavelet coefficients in the wavelet domain. This paper concentrates on selecting a threshold for wavelet function estimation. A new threshold value is proposed for shrinking the wavelet coefficients obtained by wavelet decomposition of a noisy image, under the assumption that the sub-band coefficients follow a generalized Gaussian distribution. The proposed threshold value is based on the power of 2 in the size 2^J × 2^J of the data and can be computed efficiently. Experiments have been conducted on various test images to compare with established threshold parameters. The results show that the proposed threshold value removes the noise significantly.
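The shrinkage idea can be sketched in a few lines. Below is a NumPy-only illustration on a 1-D signal: one level of the Haar transform, soft-thresholding of the detail coefficients, and reconstruction. Note the threshold used here is the classic universal threshold sigma*sqrt(2 log n), not the GGD-based threshold proposed in the paper:

```python
# Sketch: one-level Haar wavelet shrinkage with soft thresholding.
import numpy as np

def haar_denoise(x, sigma):
    n = len(x)                              # assumes n is even
    a = (x[0::2] + x[1::2]) / np.sqrt(2)    # approximation coefficients
    d = (x[0::2] - x[1::2]) / np.sqrt(2)    # detail coefficients
    t = sigma * np.sqrt(2 * np.log(n))      # universal threshold
    d = np.sign(d) * np.maximum(np.abs(d) - t, 0.0)  # soft shrinkage
    y = np.empty(n)                         # inverse Haar transform
    y[0::2] = (a + d) / np.sqrt(2)
    y[1::2] = (a - d) / np.sqrt(2)
    return y

rng = np.random.default_rng(0)
clean = np.sin(np.linspace(0, 4 * np.pi, 256))
noisy = clean + 0.3 * rng.standard_normal(256)
denoised = haar_denoise(noisy, sigma=0.3)
print(np.mean((noisy - clean) ** 2), np.mean((denoised - clean) ** 2))
```

Because the signal's energy concentrates in a few large coefficients while the noise spreads evenly across all of them, thresholding the small coefficients removes mostly noise; the paper's contribution is a better choice of the threshold t.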
The power function distribution is a flexible lifetime distribution with applications in finance and economics. It is also used to model the reliability growth of complex systems or the reliability of repairable systems. A new weighted power function distribution is proposed using a logarithmic weight function. Statistical properties of the weighted power function distribution are obtained and studied. Location measures such as the mode, median, and mean, reliability measures such as the reliability function, hazard and reversed hazard functions, and the mean residual life are derived. Shape indices such as the skewness and kurtosis coefficients and order statistics are obtained. Parametric estimation is performed to obtain estimators for the parameters of the distribution using three estimation methods, namely the maximum likelihood method, the L-moments method, and the method of moments. A numerical simulation is carried out to validate the robustness of the proposed distribution.
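As a point of reference for the estimation step, here is a sketch of maximum likelihood for the baseline (unweighted) power function distribution with density f(x) = a·x^(a-1) on (0, 1); the weighted variant proposed in the paper modifies this density by a logarithmic weight, which is not reproduced here:

```python
# Sketch: closed-form MLE for the shape parameter of the baseline
# power function distribution f(x) = a * x^(a-1), 0 < x < 1.
import numpy as np

def mle_shape(x):
    # Log-likelihood n*log(a) + (a-1)*sum(log x) is maximized at
    # a_hat = -n / sum(log x).
    return -len(x) / np.sum(np.log(x))

rng = np.random.default_rng(2)
a_true = 3.0
# Inverse-CDF sampling: the CDF is x^a, so X = U^(1/a).
x = rng.uniform(0, 1, 5000) ** (1.0 / a_true)
print(mle_shape(x))
```

The L-moments and method-of-moments estimators compared in the paper would replace the closed-form likelihood maximization with moment-matching equations.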
Abstract: Among the many statistical methods for linear models with the multicollinearity problem, partial least squares regression (PLSR) has become increasingly popular in recent years and is very often the best choice. However, while dealing with a prediction problem from the automobile market, we noticed that the results from PLSR appear unstable, though it is still the best among several standard statistical methods. This instability is likely due to information contained in the explanatory variables that is irrelevant to the response variable. Based on the PLSR algorithm, this paper introduces a new method, modified partial least squares regression (MPLSR), to emphasize the impact of the relevant information in the explanatory variables on the response variable. With the MPLSR method, satisfactory prediction results are obtained for the above practical problem. The performance of MPLSR, PLSR, and some standard statistical methods is compared in a set of Monte Carlo experiments. This paper shows that MPLSR is the most stable and accurate method, especially when the ratio of the number of observations to the number of explanatory variables is low.
Abstract: The receiver operating characteristic (ROC) curve is an effective and widely used method for evaluating the discriminating power of a diagnostic test or statistical model. As a useful statistical method, a wealth of literature on its theory and computation has been established. Research on ROC curves, however, has focused mainly on cross-sectional designs. Very little research on estimating ROC curves and their summary statistics, especially significance testing, has been conducted for repeated measures designs. Due to the complexity of estimating the standard error of a ROC curve, no established statistical method currently exists for testing the significance of ROC curves under a repeated measures design. In this paper, we estimate the area under a ROC curve for a repeated measures design through a generalized linear mixed model (GLMM), using the predicted probability of a disease or positivity of a condition, and propose a bootstrap method to estimate the standard error of the area under the curve for such designs. Statistical significance testing of the area under the ROC curve is then conducted using the bootstrapped standard error. The validity of the bootstrap approach and the statistical testing of the area under the ROC curve were confirmed through simulation analyses. Specialized statistical software written in SAS/IML/MACRO v8 was also created to implement the bootstrapping algorithm, conduct the calculations, and perform the statistical testing.
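The core bootstrap idea can be sketched compactly. The example below resamples independent subjects in a cross-sectional setting; it is not the paper's GLMM-based repeated-measures procedure, and the simulated scores are illustrative only:

```python
# Sketch: bootstrap standard error of the AUC, with a z-test
# against the null hypothesis AUC = 0.5 (no discrimination).
import numpy as np

def auc(scores, labels):
    # Mann-Whitney form of the AUC: P(score_pos > score_neg),
    # counting ties as 1/2.
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    gt = (pos[:, None] > neg[None, :]).mean()
    eq = (pos[:, None] == neg[None, :]).mean()
    return gt + 0.5 * eq

rng = np.random.default_rng(0)
labels = rng.integers(0, 2, size=200)
scores = labels + rng.standard_normal(200)   # informative test

boot = []
for _ in range(500):                          # resample subjects
    idx = rng.integers(0, len(labels), size=len(labels))
    boot.append(auc(scores[idx], labels[idx]))
se = np.std(boot, ddof=1)
z = (auc(scores, labels) - 0.5) / se
print(auc(scores, labels), se, z)
```

Under a repeated measures design, the resampling unit would be the subject with all of their repeated observations, so that within-subject correlation is preserved in each bootstrap replicate.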
Abstract: Particulate matter smaller than 2.5 microns (PM2.5) is a commonly measured parameter in ground-based sampling networks designed to assess short- and long-term air quality. The measurement techniques for ground-based PM2.5 are relatively accurate and precise, but monitoring locations are spatially too sparse for many applications. Aerosol optical depth (AOD) is a satellite-based air quality measurement that can be computed for more spatial locations, but it measures light attenuation by particulates throughout the entire air column, not just near the ground. The goal of this paper is to better characterize the spatio-temporal relationship between the two measurements. An informative relationship will aid in imputing PM2.5 values for health studies in a way that accounts for the variability in both sets of measurements, something physics-based models cannot do. We use a data set of Chicago air quality measurements taken during 2007 and 2008 to construct a weekly hierarchical model. We also demonstrate that AOD measurements and a latent spatio-temporal process aggregated weekly can aid in the prediction of PM2.5 measurements.
Abstract: The primary advantage of panel over cross-sectional regression stems from its control for the effects of omitted variables, or "unobserved heterogeneity". However, panel regression is based on the strong assumptions that measurement errors are independently and identically distributed (i.i.d.) and normal. These assumptions are evaded by design-based regression, which dispenses with measurement errors altogether by regarding the response as a fixed real number. The present paper establishes a middle ground between these extreme interpretations of longitudinal data. The individual is now represented as a panel of responses containing dependently, non-identically distributed (d.n.d.) measurement errors. Modeling the expectations of these responses preserves the Neyman randomization theory, rendering panel regression slopes approximately unbiased and normal in the presence of arbitrarily distributed measurement error. The generality of this reinterpretation is illustrated with German Socio-Economic Panel (GSOEP) responses that are discretely distributed on a 3-point scale.
Abstract: This article extends the recent work of Vännman and Albing (2007) regarding the new family of quantile-based process capability indices (qPCI) CMA(τ, v). We develop both asymptotic parametric and nonparametric confidence limits and testing procedures for CMA(τ, v). A kernel density estimator of the process was proposed to find a consistent estimator of the variance of the nonparametric consistent estimator of CMA(τ, v). Therefore, the proposed procedure is ready for practical implementation for any process. Illustrative examples are also provided to show the steps of applying the proposed methods directly to real-life problems. We also present a simulation study on the sample size required for using the asymptotic results.
In this paper, we introduce a new generalized family of distributions with bounded support (0, 1), namely the Topp-Leone-G family. Some mathematical properties of the proposed family have been studied. The new density function can be symmetrical, left-skewed, right-skewed, or reverse-J shaped. Furthermore, the hazard rate function can be constant, increasing, decreasing, J-shaped, or bathtub-shaped. Three special models are discussed. We obtain simple expressions for the ordinary and incomplete moments, quantile and generating functions, mean deviations, and entropies. The method of maximum likelihood is used to estimate the model parameters. The flexibility of the new family is illustrated by means of three real data sets.
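One common construction in the Topp-Leone-G literature composes the Topp-Leone CDF with a baseline CDF G, giving F(x) = [1 - (1 - G(x))^2]^α for a shape parameter α > 0; whether this matches the paper's exact parameterization is an assumption. A minimal sketch with an exponential baseline:

```python
# Sketch: Topp-Leone-G CDF built from a baseline CDF G (assumed
# construction: F(x) = [1 - (1 - G(x))^2]^alpha, alpha > 0).
import numpy as np

def tl_g_cdf(x, alpha, base_cdf):
    g = base_cdf(x)
    return (1.0 - (1.0 - g) ** 2) ** alpha

exp_cdf = lambda x: 1.0 - np.exp(-x)     # exponential(1) baseline
x = np.linspace(0.0, 10.0, 1001)
F = tl_g_cdf(x, alpha=0.5, base_cdf=exp_cdf)
print(F[0], F[-1], bool(np.all(np.diff(F) >= 0)))
```

Since the Topp-Leone transform maps (0, 1) to (0, 1) monotonically, any valid baseline CDF yields a valid generalized CDF, which is what makes the family a flexible wrapper around existing models.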