Abstract: Data systems collecting information from different sources or over long periods of time can receive multiple reports from the same individual. An important example is public health surveillance systems that monitor conditions with long natural histories. Several state-level systems for surveillance of one such condition, the human immunodeficiency virus (HIV), use codes composed of combinations of non-unique personal characteristics such as birth date, soundex (a code based on last name), and sex as patient identifiers. As a result, these systems cannot distinguish between several different individuals having identical codes and a unique individual erroneously represented several times. We applied results for occupancy models to estimate the potential magnitude of duplicate case counting for AIDS cases reported to the Centers for Disease Control and Prevention with only non-unique partial personal identifiers. Occupancy models with equal and unequal occupancy probabilities are considered. Unbiased estimators for the numbers of true duplicates within and between case reporting areas are provided. Formulas to calculate estimators’ variances are also provided. These results can be applied to evaluating duplicate reporting in other data systems that have no unique identifier for each individual.
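The soundex component of such identifier codes is a standard phonetic encoding of surnames. A minimal sketch of the common American Soundex variant (illustrative only; the surveillance systems described above may use a slightly different variant):

```python
def soundex(name):
    """American Soundex: first letter plus three digits from consonant classes."""
    codes = {}
    for letters, digit in [("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                           ("L", "4"), ("MN", "5"), ("R", "6")]:
        for ch in letters:
            codes[ch] = digit
    name = name.upper()
    first = name[0]
    digits = []
    prev = codes.get(first, "")
    for ch in name[1:]:
        if ch in "HW":          # H and W are transparent: they do not break a run
            continue
        d = codes.get(ch, "")
        if d and d != prev:     # adjacent letters with the same code collapse
            digits.append(d)
        prev = d                # vowels (d == "") reset the run
    return (first + "".join(digits) + "000")[:4]
```

Different surnames can share one code, e.g. `soundex("Smith")` and `soundex("Smyth")` both give `"S530"`, which is exactly the non-uniqueness that makes duplicate counting ambiguous.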
Abstract: The Weibull distribution is the most important distribution for problems in reliability. We study some mathematical properties of the new wider Weibull-G family of distributions. Some special models in the new family are discussed. The properties derived hold for any distribution in this family. We obtain general explicit expressions for the quantile function, ordinary and incomplete moments, generating function and order statistics. We discuss the estimation of the model parameters by maximum likelihood and illustrate the potentiality of the extended family with two applications to real data.
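As an illustration of a quantile function in a Weibull-G-type family, consider one common construction, F(x) = 1 − exp{−α [G(x)/(1 − G(x))]^β}; inverting gives Q(u) = Q_G(t/(1 + t)) with t = (−log(1 − u)/α)^{1/β}. A minimal sketch with an exponential baseline (both the specific construction and the baseline are illustrative assumptions, not necessarily the family studied in the paper):

```python
import math

def weibull_g_quantile(u, alpha, beta, G_inv):
    """Quantile of F(x) = 1 - exp(-alpha * (G(x) / (1 - G(x)))**beta)."""
    t = (-math.log(1.0 - u) / alpha) ** (1.0 / beta)
    return G_inv(t / (1.0 + t))

# Illustrative baseline: exponential with rate lam, G(x) = 1 - exp(-lam * x).
lam = 2.0
G = lambda x: 1.0 - math.exp(-lam * x)
G_inv = lambda p: -math.log(1.0 - p) / lam

def F(x, alpha, beta):
    g = G(x)
    return 1.0 - math.exp(-alpha * (g / (1.0 - g)) ** beta)

x70 = weibull_g_quantile(0.7, alpha=1.5, beta=0.8, G_inv=G_inv)
```

The round trip F(Q(u)) = u verifies the inversion; such a closed-form quantile also enables inverse-transform simulation from any member of the family.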
Abstract: Some specific random fields whose finite-dimensional marginal distributions are multivariate closed skew-normal or multivariate extended skew-t have been studied by many researchers, in both time and spatial domains. In this paper, a necessary and sufficient condition is provided for the applicability of such random fields in spatial interpolation, based on their marginal distributions. Two deficiencies of the random fields generated by some well-known multivariate distributions are pointed out, and in contrast a suitable skewed and heavy-tailed random field is proposed. The efficiency of the proposed random field is illustrated through the interpolation of a real data set.
Abstract: Interval estimation for the proportion parameter in one-sample misclassified binary data has attracted much interest in the literature. Recently, an approximate Bayesian approach has been proposed. This approach is simpler to implement and performs better than existing frequentist approaches. However, because a normal approximation to the marginal posterior density was used in this Bayesian approach, some efficiency may be lost. We develop a closed-form fully Bayesian algorithm which draws a posterior sample of the proportion parameter from the exact marginal posterior distribution. We conducted simulations to show that our fully Bayesian algorithm is easier to implement and has better coverage than the approximate Bayesian approach.
Abstract: Conservation of artifacts is a major concern of museum curators. Light, humidity, and air pollution are responsible for the deterioration of many artifacts and materials. We present here an exploratory analysis of humidity and temperature data that were collected to document the environment of the Bowdoin College Museum of Art, located in the Walker Art Building at Bowdoin College. As a result of this study, funds are being sought to install a climate control system.
Abstract: In Bayesian analysis of mortality rates it is standard practice to present the posterior mean rates in a choropleth map, a stepped statistical surface identified by colored or shaded areas. A natural objection against the posterior mean map is that it may not be the “best” representation of the mortality rates. One should really present the map that has the highest posterior density over the ensemble of areas in the map (i.e., the coordinates that maximize the joint posterior density of the mortality rates). Thus, the posterior modal map maximizes the joint posterior density of the mortality rates. We apply a Poisson regression model, a Bayesian hierarchical model, that has been used to study mortality data and other rare events when there are occurrences from many areas. The model provides convenient Rao-Blackwellized estimators of the mortality rates. Our method enables us to construct the posterior modal map of mortality data from chronic obstructive pulmonary diseases (COPD) in the continental United States. We show how to fit the Poisson regression model using Markov chain Monte Carlo methods (i.e., the Metropolis-Hastings sampler); both the posterior modal map and the posterior mean map are obtained by an output analysis of the Metropolis-Hastings sampler. The COPD data are used to provide an empirical comparison of these two maps. As expected, we have found important differences between the two maps, and we recommend that the posterior modal map be used.
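The two summaries can be sketched for a single area (a deliberate simplification of the multi-area hierarchical model, with illustrative data and a conjugate gamma prior rather than the paper's model): run a random-walk Metropolis sampler for a Poisson rate, then take the average of the draws (posterior mean) and the retained draw with the highest posterior density (the single-area analogue of the posterior modal map, which in the multi-area case maximizes the joint posterior over all areas).

```python
import math
import random

def log_post(lam, y, n, a, b):
    """Log posterior for one area: y ~ Poisson(n * lam), lam ~ Gamma(a, b)."""
    if lam <= 0.0:
        return -math.inf
    return y * math.log(n * lam) - n * lam + (a - 1.0) * math.log(lam) - b * lam

random.seed(1)
y, n, a, b = 12, 1000.0, 1.0, 1.0      # illustrative: 12 deaths in 1000 person-years
lam, draws = 0.01, []
for _ in range(20000):
    prop = lam * math.exp(random.gauss(0.0, 0.3))      # random walk on log(lam)
    # Log-scale proposal: the acceptance ratio includes the Jacobian prop / lam.
    log_acc = (log_post(prop, y, n, a, b) + math.log(prop)
               - log_post(lam, y, n, a, b) - math.log(lam))
    if math.log(random.random()) < log_acc:
        lam = prop
    draws.append(lam)

keep = draws[5000:]                                    # discard burn-in
post_mean = sum(keep) / len(keep)                      # posterior mean summary
post_mode = max(keep, key=lambda l: log_post(l, y, n, a, b))  # highest-density draw
```

With this conjugate toy model the posterior is Gamma(a + y, b + n), so the output analysis can be checked against the exact mean (a + y)/(b + n) and mode (a + y − 1)/(b + n).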
Abstract: The frequency of earthquakes has increased tremendously in recent years. This paper outlines an evaluation of the Cumulative Sum (CUSUM) and Exponentially Weighted Moving Average (EWMA) charting techniques to determine whether the frequency of earthquakes in the world is unusual. Worldwide earthquake frequency is considered for the period 1973 to 2016. Because the data are autocorrelated, regular control charts such as the Shewhart chart cannot be used to detect unusual earthquake frequencies. An approach that has proved useful in dealing with autocorrelated data is to fit a time series model, such as an Autoregressive Integrated Moving Average (ARIMA) model, and apply control charts to the residuals. The EWMA and CUSUM control charts detect unusual earthquake frequencies in the years 2012 and 2013, which are statistically out of control.
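The residual-charting idea can be sketched as follows, using a synthetic AR(1) series with a late level shift as a stand-in for the yearly earthquake counts (the real 1973–2016 data, and the full ARIMA fit, are not reproduced here): fit the time series model, compute residuals, and run an EWMA chart on them.

```python
import math
import random

random.seed(7)

# Synthetic AR(1) series with a level shift at t = 50 (illustrative data only).
phi = 0.6
x = [0.0]
for t in range(1, 60):
    shift = 5.0 if t >= 50 else 0.0
    x.append(phi * x[-1] + random.gauss(0.0, 1.0) + shift)

# Step 1: fit an AR(1) model by least squares on the in-control portion.
num = sum(x[t - 1] * x[t] for t in range(1, 50))
den = sum(x[t - 1] ** 2 for t in range(1, 50))
phi_hat = num / den
resid = [x[t] - phi_hat * x[t - 1] for t in range(1, len(x))]

# Step 2: EWMA chart on the residuals (smoothing lambda = 0.2, width L = 3).
lam, L = 0.2, 3.0
mu = sum(resid[:40]) / 40                                   # in-control mean
sigma = math.sqrt(sum((r - mu) ** 2 for r in resid[:40]) / 39)
z, signals = mu, []
for t, r in enumerate(resid, start=1):
    z = lam * r + (1 - lam) * z
    halfwidth = L * sigma * math.sqrt(lam / (2 - lam) * (1 - (1 - lam) ** (2 * t)))
    if abs(z - mu) > halfwidth:
        signals.append(t)                                   # out-of-control point
```

A CUSUM chart would be applied to the same residuals in an analogous way; either chart flags the shifted period that the raw autocorrelated series would obscure on a Shewhart chart.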
Abstract: Motivated by a situation encountered in the Well Elderly 2 study, the paper considers the problem of robust multiple comparisons based on K independent tests associated with 2K independent groups. A simple strategy is to use an extension of Dunnett’s T3 procedure, which is designed to control the probability of one or more Type I errors. However, this method and related techniques fail to take into account the overall pattern of p-values when making decisions about which hypotheses should be rejected. The paper suggests a multiple comparison procedure that does take the overall pattern into account and then describes general situations where this alternative approach makes a practical difference in terms of both power and the probability of one or more Type I errors. For reasons summarized in the paper, the focus is on 20% trimmed means, but in principle the method considered here is relevant to any situation where the Type I error probability of the individual tests can be controlled reasonably well.
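The 20% trimmed mean on which the procedure focuses discards the smallest and largest 20% of the observations before averaging, which is what gives it robustness to outliers. A minimal sketch (a generic implementation, not the paper's full multiple-comparison procedure):

```python
def trimmed_mean(xs, prop=0.2):
    """Trimmed mean: drop int(prop * n) values from each tail, average the rest."""
    xs = sorted(xs)
    g = int(prop * len(xs))          # number trimmed from each end
    core = xs[g:len(xs) - g]
    return sum(core) / len(core)

print(trimmed_mean([1, 2, 3, 4, 100]))   # the outlier 100 is trimmed away
```

With `prop=0` this reduces to the ordinary mean; with `prop` approaching 0.5 it approaches the median, so the 20% choice is a compromise between efficiency under normality and robustness.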
Abstract: In this paper, we propose a nonparametric approach using the Dirichlet processes (DP) as a class of prior distributions for the distribution G of the random effects in the hierarchical generalized linear mixed model (GLMM). The support of the prior distribution (and the posterior distribution) is large, allowing for a wide range of shapes for G. This provides great flexibility in estimating G and therefore produces a more flexible estimator than does the parametric analysis. We present some computation strategies for posterior computations involved in DP modeling. The proposed method is illustrated with real examples as well as simulations.
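A draw from a Dirichlet process prior can be sketched via the standard stick-breaking construction: weights π_k = V_k ∏_{j<k}(1 − V_j) with V_k ~ Beta(1, α) and atoms θ_k ~ G0. The truncation level, concentration α = 2, and normal base measure below are illustrative assumptions, not choices taken from the paper:

```python
import random

def dp_stick_breaking(alpha, base_draw, K=200):
    """Truncated stick-breaking draw from DP(alpha, G0): (weights, atoms)."""
    weights, atoms, remaining = [], [], 1.0
    for _ in range(K):
        v = random.betavariate(1.0, alpha)   # V_k ~ Beta(1, alpha)
        weights.append(remaining * v)        # pi_k = V_k * prod_{j<k}(1 - V_j)
        atoms.append(base_draw())            # theta_k ~ G0
        remaining *= 1.0 - v
    return weights, atoms

random.seed(0)
# Illustrative base measure G0 = N(0, 1) for the random-effect distribution G.
w, theta = dp_stick_breaking(alpha=2.0, base_draw=lambda: random.gauss(0.0, 1.0))
effects = random.choices(theta, weights=w, k=5)   # random effects drawn from G
```

Each realization of G is a discrete distribution over the atoms, and its shape varies freely across realizations, which is the flexibility for G that the nonparametric prior provides relative to a fixed parametric family.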