Abstract: This paper presents a comprehensive statistical approach for answering health-related questions concerning mortality and incidence rates of chronic diseases such as cancer and hypertension. The spatio-temporal models developed here help explain patterns in chronic-disease mortality rates in terms of environmental changes and socioeconomic conditions. In addition to age and time effects, the models include two components of normally distributed residual and spatial effects: one representing average regional effects and another representing changes of subgroups within regions over time. The numerical analysis is based on male lung cancer mortality data from the state of Missouri, with Gibbs sampling used to obtain the posterior quantities. All models discussed in this article perform well in stabilizing the mortality rates, especially in less populated areas. Owing to the richness of the hierarchical settings, the easy interpretation of the parameters, and the ease of implementation, the models proposed in this paper can be applied to other data sets.
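To make the hierarchy concrete, one schematic form consistent with this description (the Poisson likelihood, the particular effect structure, and the priors below are illustrative assumptions, not the authors' exact specification) is

\begin{align*}
y_{ijt} &\sim \mathrm{Poisson}(n_{ijt}\,\lambda_{ijt}), \\
\log \lambda_{ijt} &= \alpha_j + \beta_t + u_i + v_{it}, \\
u_i &\sim N(0, \sigma_u^2), \qquad v_{it} \sim N(0, \sigma_v^2),
\end{align*}

where $i$ indexes region, $j$ age group and $t$ time period, $u_i$ captures the average regional effect and $v_{it}$ the change of subgroups within a region over time; Gibbs sampling then draws each block of parameters from its full conditional distribution in turn.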
Abstract: We consider the Autoregressive Conditional Marked Duration (ACMD) model and apply it to 16 stocks traded on the Hong Kong Stock Exchange (SEHK). By examining the orderings of appropriate sets of model parameters, market microstructure phenomena can be explained. To substantiate these conclusions, a likelihood ratio test is used to test the significance of the parameter orderings of the ACMD model. While some of our results resolve a few controversial market microstructure hypotheses and echo some of the existing empirical evidence, we also discover some interesting market microstructure phenomena that may be characteristic of the SEHK.
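Since a parameter ordering amounts to a restriction on the ACMD parameters, the test is the familiar likelihood ratio comparison of restricted and unrestricted fits. A minimal sketch (the ACMD fitting itself is not shown, and order restrictions on a boundary may call for a chi-bar-square rather than a chi-square reference, so the nominal p-value below is purely illustrative):

    from scipy.stats import chi2

    def lr_test(loglik_unrestricted, loglik_restricted, df):
        # Twice the gap between the maximized log-likelihoods of the two fits
        lr = 2.0 * (loglik_unrestricted - loglik_restricted)
        return lr, chi2.sf(lr, df)  # statistic and nominal chi-square p-value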
Abstract: Believe the Positive (BP) and Believe the Negative (BN) rules for combining two continuous diagnostic tests are compared with procedures based on the likelihood ratio and on a linear combination of the two tests. The sensitivity-specificity relationship for BP/BN is illustrated through a graphical presentation of a "ROC surface", which leads to a natural approach to choosing between BP and BN. With a bivariate normal model, it is shown that the discriminating power of this approach is higher when the correlation between the two tests has different signs for the non-diseased and diseased populations, given that the locations and variances of the two distributions are fixed. The idea is illustrated through an example.
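As a rough illustration of how the two combination rules behave under a bivariate normal model, the following sketch (means, correlations and thresholds are hypothetical choices, not the paper's example) computes the empirical sensitivity and specificity of BP and BN:

    import numpy as np

    rng = np.random.default_rng(0)

    def simulate(n, mean, rho):
        # Bivariate normal scores for two tests with unit variances
        cov = [[1.0, rho], [rho, 1.0]]
        return rng.multivariate_normal(mean, cov, size=n)

    diseased = simulate(5000, mean=[1.0, 1.0], rho=-0.3)  # assumed shift and correlation
    healthy = simulate(5000, mean=[0.0, 0.0], rho=0.3)

    c1, c2 = 0.5, 0.5  # hypothetical positivity thresholds

    def bp_rule(x):  # BP: positive if EITHER test exceeds its threshold
        return (x[:, 0] > c1) | (x[:, 1] > c2)

    def bn_rule(x):  # BN: positive only if BOTH tests exceed their thresholds
        return (x[:, 0] > c1) & (x[:, 1] > c2)

    for name, rule in [("BP", bp_rule), ("BN", bn_rule)]:
        sens = rule(diseased).mean()       # P(combined test positive | diseased)
        spec = 1.0 - rule(healthy).mean()  # P(combined test negative | non-diseased)
        print(f"{name}: sensitivity={sens:.3f}, specificity={spec:.3f}")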
Abstract: Household data are frequently used in estimating vaccine efficacy because they provide information about every individual's exposure to vaccinated and unvaccinated infected household members. This information is essential for reliable estimation of vaccine efficacy for infectiousness (VE_I), in addition to estimating vaccine efficacy for susceptibility (VE_S). However, accurate infection outcome data are not always available on each person, due to high cost or the lack of feasible methods to collect this information. Lack of reliable data on true infection status may result in biased or inefficient estimates of vaccine efficacy. In this paper, a semiparametric method that uses surrogate outcome data and a validation sample is introduced for estimation of VE_S and VE_I from a sample of households. The surrogate outcome data are usually based on illness symptoms. We report the results of simulations conducted to examine the performance of the estimates, compare the proposed semiparametric method with maximum likelihood methods that use either the validation data only or the surrogate data only, and address study design issues. The new method shows improved precision compared to a method based on the validation sample only, and smaller bias compared to a method using surrogate outcome data only. In addition, the use of household data is shown to greatly reduce the attenuation in the estimate of VE_S due to misclassification of the outcome, as compared to the use of a random sample of unrelated individuals.
Abstract: The generalized gamma model has been used in several applied areas such as engineering, economics and survival analysis. We provide an extension of this model, called the transmuted generalized gamma distribution, which includes some lifetime distributions as special cases. The proposed density function can be represented as a mixture of generalized gamma densities. Some mathematical properties of the new model, such as the moments, generating function, mean deviations and Bonferroni and Lorenz curves, are provided. We estimate the model parameters using maximum likelihood. We show, by means of a real data set, that the proposed distribution can be a competitive model in lifetime applications.
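For reference, a transmuted family is obtained from the quadratic rank transmutation map: if $F$ and $f$ denote the baseline generalized gamma cdf and density, the transmuted cdf and density take the form

\[
G(x) = (1+\lambda)F(x) - \lambda F(x)^2, \qquad g(x) = f(x)\bigl[1 + \lambda - 2\lambda F(x)\bigr], \qquad |\lambda| \le 1,
\]

so $\lambda = 0$ recovers the generalized gamma model itself (this is the standard transmutation construction; the paper's notation may differ).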
Abstract: This study applied partial least squares (PLS) path modeling to quantify and identify the determinants of job seekers' acceptance and use of employment websites (EWs), using an aggregate model that combines task-technology fit (TTF) with consumer acceptance and use of information technology (UTAUT2). We propose that the most crucial constructs explaining EW adoption are habit, behavioral intention, performance expectancy, and facilitating conditions. This study verified that a job seeker's habits were a major predictor of intention to use and actual usage of EWs, which involve web-based technology and occasional use. Thus, when job seekers perceive that the technology fits their task, they recognize the value of using the technology and use it habitually.
Abstract: Overdispersion is a common phenomenon in Poisson modelling. The generalized Poisson (GP) distribution accommodates both overdispersion and underdispersion in count data. In this paper, we briefly overview different overdispersed and zero-inflated regression models. To study the impact of fitting an inaccurate model to data simulated from some other model, we simulate data from the zero-inflated generalized Poisson (ZIGP) distribution and fit Poisson, GP, zero-inflated Poisson (ZIP), ZIGP and zero-inflated negative binomial (ZINB) models. We compare the performance of the Poisson, GP, ZIP, ZIGP and ZINB estimates through mean square error, bias and standard error when the samples are generated from the ZIGP distribution. We propose estimators of the parameters of the ZIGP distribution based on the first two sample moments and the proportion of zeros, referred to as the MOZE estimator, and compare its performance with the maximum likelihood estimator (MLE) through a simulation study. It is observed that the MOZE estimators are almost as efficient as, or even more efficient than, the MLEs of the parameters of the ZIGP distribution.
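To sketch the idea, under one common parameterization of the ZIGP distribution (zero-inflation probability $\varphi$ and a GP$(\theta,\lambda)$ component, so the notation here is an assumption rather than the paper's), the MOZE estimator equates the first two sample moments and the sample proportion of zeros to their population counterparts,

\begin{align*}
\bar{Y} &= (1-\varphi)\,\frac{\theta}{1-\lambda}, \\
\overline{Y^2} &= (1-\varphi)\left[\frac{\theta}{(1-\lambda)^3} + \frac{\theta^2}{(1-\lambda)^2}\right], \\
\hat{p}_0 &= \varphi + (1-\varphi)\,e^{-\theta},
\end{align*}

and solves the three equations for $(\theta, \lambda, \varphi)$.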
Abstract: As a useful alternative to the Cox proportional hazards model, the linear regression survival model assumes a linear relationship between the covariates and a known monotone transformation, for example the logarithm, of an event time of interest. In this article, we study the linear regression survival model with right censored survival data when high-dimensional microarray measurements are present. Such data may arise in studies investigating the statistical influence of molecular features on survival risk. We propose using the principal component regression (PCR) technique for model reduction based on the weighted least squares Stute estimate. Compared with other model reduction techniques, the PCR approach is relatively insensitive to the number of covariates and hence suitable for high-dimensional microarray data. Component selection based on the nonparametric bootstrap, and model evaluation using the time-dependent ROC (receiver operating characteristic) technique, are investigated. We demonstrate the proposed approach with data sets from two microarray gene expression profiling studies of lymphoma cancers.
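A minimal sketch of this pipeline, assuming synthetic data, Kaplan-Meier (Stute) jump weights on the ordered observed times, and an arbitrary number of retained components (the bootstrap component selection and time-dependent ROC evaluation from the paper are omitted):

    import numpy as np

    rng = np.random.default_rng(1)
    n, p, k = 100, 500, 5  # samples, genes, retained components (all assumed)

    X = rng.standard_normal((n, p))   # hypothetical expression matrix
    t = rng.exponential(1.0, n)       # hypothetical event times
    c = rng.exponential(1.5, n)       # hypothetical censoring times
    y, delta = np.minimum(t, c), (t <= c).astype(float)

    # Kaplan-Meier (Stute) jump weights on the data ordered by observed time
    order = np.argsort(y)
    y, delta, X = y[order], delta[order], X[order]
    i = np.arange(1, n + 1)
    factors = ((n - i) / (n - i + 1.0)) ** delta
    w = delta / (n - i + 1.0) * np.concatenate(([1.0], np.cumprod(factors)[:-1]))

    # Principal components of the centered covariates
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    Z = Xc @ Vt[:k].T  # scores for the leading k components

    # Weighted least squares of log(time) on the component scores
    Zd = np.column_stack([np.ones(n), Z])
    W = np.diag(w)
    beta = np.linalg.solve(Zd.T @ W @ Zd, Zd.T @ W @ np.log(y))
    print(beta)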
Abstract: Accurately understanding the distribution of sediment measurements within large water bodies such as Lake Michigan is critical for modeling and understanding carbon, nitrogen, silica, and phosphorus dynamics. Several water quality models have been formulated and applied to the Great Lakes to investigate the fate and transport of nutrients and other constituents, as well as plankton dynamics. This paper summarizes the development of spatial statistical tools to study and assess the spatial trends of the sediment data sets collected from Lake Michigan as part of the Lake Michigan Mass Balance Study. Several new spatial measurements were developed to quantify the spatial variation and continuity of the sediment data sets of interest. The application of the newly designed spatial measurements to the sediment data, in conjunction with descriptive statistics, clearly reveals the existence of an intrinsic structure of strata, which is hypothesized based on linear wave theory. Furthermore, a new concept of strata consisting of two components defined based on depth is proposed and justified. The findings presented in this paper may influence future studies of sediment within Lake Michigan and the other Great Lakes as well.