Abstract: Kang (2006) used the log-likelihood function with Lagrangian multipliers for estimation of cell probabilities in two-way incomplete contingency tables. The constraints on cell probabilities can be incorporated through Lagrangian multipliers for the likelihood function. The method can be readily extended to multidimensional tables. Variances of the MLEs are derived from the matrix of second derivatives of the log likelihood with respect to cell probabilities and the Lagrange multiplier. Wald and likelihood ratio tests of independence are derived using the estimates and estimated variances. Simulation results, when data are missing at random, reveal that maximum likelihood estimation (MLE) produces more efficient estimates of population proportions than either multiple imputation (MI) based on data augmentation or complete case (CC) analysis. Neither MLE nor MI, however, leads to an improvement over CC analysis with respect to power of tests for independence in 2×2 tables. Thus, the partially classified marginal information increases precision about proportions, but is not helpful for judging independence.
Abstract: A survival model is derived from the exponential function using the concept of fractional differentiation. The hazard function of the proposed model generates various shapes of curves including increasing, increasing constant-increasing, increasing-decreasing-increasing, and so-called bathtub hazard curve. The model also contains a parameter that is the maximum of the survival time.
In this paper, we define and study a four-parameter model called the transmuted Burr XII distribution. We obtain some of its mathematical properties including explicit expressions for the ordinary and incomplete moments, generating function, order statistics, probability weighted moments and entropies. We formulate and develop a log-linear model using the new distribution so-called the log-transmuted Burr XII distribution for modeling data with a unimodal failure rate function, as an alternative to the log-McDonald Burr XII, log-beta Burr XII, log-Kumaraswamy Burr XII, log-Burr XII and logistic regression models. The flexibility of the proposed models is illustrated by means of three applications to real data sets.
Pub. online:4 Aug 2022Type:Research ArticleOpen Access
Journal:Journal of Data Science
Volume 18, Issue 3 (2020): Special issue: Data Science in Action in Response to the Outbreak of COVID-19, pp. 483–495
Abstract
Coronavirus disease 2019 (COVID-19) is an infectious disease caused by severe acute respiratory syndrome coronvirus, which was declared as a global pandemic by the World Health Organization on March 11, 2020. In this work, we conduct a cross-sectional study to investigate how the infection fatality rate (IFR) of COVID-19 may be associated with possible geographical or demographical features of the infected population. We employ a multiple index model in combination with sliced inverse regression to facilitate the relationship between the IFR and possible risk factors. To select associated features for the infection fatality rate, we utilize an adaptive Lasso penalized sliced inverse regression method, which achieves variable selection and sufficient dimension reduction simultaneously with unimportant features removed automatically. We apply the proposed method to conduct a cross-sectional study for the COVID-19 data obtained from two time points of the outbreak.
Abstract: In this article, we consider a model of time-varying volatility which generalizes the classical Black-Scholes model to include regime-switching properties. Specifically, the unobservable state variables for stock fluctuations are modeled by a Markov process, and the drift and volatility parameters take different values depending on the state of this hidden Markov process. We provide a closed-form formula for the arbitrage-free price of the European call option, when the hidden Markov process has finite number of states. Two simulation methods, the discrete diffusion method and the Markovian tree method, for computing the European call option price are presented for comparison.
In this work, we introduce a new distribution for modeling the extreme values. Some important mathematical properties of the new model are derived. We assess the performance of the maximum likelihood method in terms of biases and mean squared errors by means of a simulation study. The new model is better than some other important competitive models in modeling the repair times data and the breaking stress data.
Abstract: This paper uses a structural time series methodology to test the notion of interconnectedness between the UK and the US credit markets. The empirical tests utilise data on premium for the Banking sector credit default swaps (CDS) and covers the recent period of financial turmoil. The methodology based on Kalman filter is robust in the presence of limited convergence. The long-term steady state convergence in CDS premium is clearly noticeable between these two markets from the results. This observation lends support for the coordinated regulatory policy initiatives to deal with the crisis and offer suggestions for sound operations of the international financial systems.
Abstract: In multivariate regression, interest lies on how the response vector depends on a set of covariates. A multivariate regression model is proposed where the covariates explain variation in the response only in the direction of the first principal component axis. This model is not only parsimonious, but it provides an easy interpretation in allometric growth studies where the first principal component of the log-transformed data corresponds to constants of allometric growth. The proposed model naturally generalizes the two–group allometric extension model to the situation where groups differ according to a set of covariates. A bootstrap test for the model is proposed and a study on plant growth in the Florida Everglades is used to illustrate the model.