Pub. online:4 Aug 2022Type:Research ArticleOpen Access
Journal:Journal of Data Science
Volume 18, Issue 5 (2020): Special Issue S1 in Chinese (with abstract in English), pp. 889–906
Abstract
The new coronavirus disease (COVID-19), as a new infectious disease, has relatively strong ability to spread from person to person. This paper studies several meteorological factors and air quality indicators between Shenzhen and Wenzhou, China, and conducts modelling analysis on whether the transmission of COVID-19 is affected by atmosphere. A comparative assessment is made on the characteristics of meteorological factors and air quality in these two typical cities in China and their impacts on the spread of COVID-19. The article uses meteorological data and air quality data, including 7 variables: daily average temperature, daily average relative humidity, daily average wind speed, nitrogen dioxide (NO2), atmospheric fine particulate matter (PM2.5), carbon monoxide (CO) and ozone (O3), a distributed lag non-linear model (DLNM) is constructed to explore the correlation between atmospheric conditions and non-imported confirmed cases of COVID-19, and the relative risk is introduced to measure the lagging effects of meteorological factors and air pollution on the number of non-imported confirmed cases. Our finding indicates that there is significant differences in the relationship between 7 predictors and the transmission of COVID-19 in Shenzhen and Wenzhou. However, all predictors between the two cities have a non-linear relationship with the number of non-imported confirmed cases. The lower daily average temperature has increased the risk of epidemic transmission in the two cities. As the temperature rises, the risk of epidemic transmission in both cities will significantly decrease. The average daily relative humidity has no significant effects on the epidemic situation in Shenzhen, but the lower relative humidity reduces the risk of epidemic spread in Wenzhou. In contrast, meteorological data have significant impacts on the spread of COVID-19 in Wenzhou. The four predictors (NO2, PM2.5, CO, and O3) have significant effects on the number of nonimported confirmed cases. Among them, PM2.5 has a significant positive correlation with the number of non-imported confirmed cases. Daily average wind speed, NO2 and O3 have different effects on the number of non-imported confirmed cases in different cities.
Abstract: A prospective, multi-institutional and randomized surgical trial involving 724 early stage melanoma patients was conducted to determine whether excision margins for intermediate-thickness melanomas (1.0 to 4.0 mm) could be safely reduced from the standard 4-cm radius. Patients with 1- to 4-mm-thick melanomas on the trunk or proximal extremities were randomly assigned to receive either a 2- or 4-cm surgical margin with or without immediate node dissection (i.e. immediate vs. later -within 6 months). The median follow-up time was 6 years. Recurrence rates did not correlate with surgical margins, even among stratified thickness groups. The hospital stay was shortened from 7.0 days for patients receiving 4-cm surgical margins to 5.2 days for those receiving 2-cm margins (p = 0.0001). This reduction was largely due to reduced need for skin grafting in the 2cm group. The overall conclusion was that the narrower margins significantly reduced the need for skin grafting and shortened the hospital stay. Due to the adequacy of subject follow up, recently a statistical focus was on what prognostics factors usually called covariates actually determined recurrence. As was anticipated, the thickness of the lesion (p = 0.0091) and whether or not the lesion was ulcerated (p = 0.0079), were determined to be significantly associated with recurrence events using the logistic regression model. This type of fixed effect analysis is rather a routine. The authors have determined that a Bayesian consideration of the results would afford a more coherent interpretation of the effect of the model assuming a random effect of the covariates of thickness and ulceration. Thus, using a Markov Chain Monte Carlo method of parameter estimation with non informative priors, one is able to obtain the posterior estimates and credible regions of estimates of these effects as well as their interaction on recurrence outcome. Graphical displays of convergence history and posterior densities affirm the stability of the results. We demonstrate how the model performs under relevant clinical conditions. The conditions are all tested using a Bayesian statistical approach allowing for the robust testing of the model parameters under various recursive partitioning conditions of the covariates and hyper parameters which we introduce into the model. The convergence of the parameters to stable values are seen in trace plots which follow the convergence patterns This allows for precise estimation for determining clinical conditions under which the response pattern will change.
Abstract: Meta-analytic methods for diagnostic test performance, Bayesian methods in particular, have not been well developed. The most commonly used method for meta-analysis of diagnostic test performance is the Summary Receiver Operator Characteristic (SROC) curve approach of Moses, Shapiro and Littenberg. In this paper, we provide a brief summary of the SROC method, then present a case study of a Bayesian adaptation of their SROC curve method that retains the simplicity of the original model while additionally incorporating uncertainty in the parameters, and can also easily be extended to incorporate the effect of covariates. We further derive a simple transformation which facilitates prior elicitation from clinicians. The method is applied to two datasets: an assessment of computed tomography for detecting metastases in non-small-cell lung cancer, and a novel dataset to assess the diagnostic performance of endoscopic ultrasound (EUS) in the detection of biliary obstructions relative to the current gold standard of endoscopic retrograde cholangiopancreatography (ERCP).
Abstract: The assessment of modality or “bumps” in distributions is of in terest to scientists in many areas. We compare the performance of four statistical methods to test for departures from unimodality in simulations, and further illustrate the four methods using well-known ecological datasets on body mass published by Holling in 1992 to illustrate their advantages and disadvantages. Silverman’s kernel density method was found to be very conservative. The excess mass test and a Bayesian mixture model approach showed agreement among the data sets, whereas Hall and York’s test pro vided strong evidence for the existence of two or more modes in all data sets. The Bayesian mixture model also provided a way to quantify the un certainty associated with the number of modes. This work demonstrates the inherent richness of animal body mass distributions but also the difficulties for characterizing it, and ultimately understanding the processes underlying them.
Abstract: An analysis of air quality data is provided for the municipal area of Taranto characterized by high environmental risks, due to the massive presence of industrial sites with elevated environmental impact activities. The present study is focused on particulate matter as measured by PM10 concentrations. Preliminary analysis involved addressing several data problems, mainly: (i) an imputation techniques were considered to cope with the large number of missing data, due to both different working periods for groups of monitoring stations and occasional malfunction of PM10 sensors; (ii) due to the use of different validation techniques for each of the three monitoring networks, a calibration procedure was devised to allow for data comparability. Missing data imputation and calibration were addressed by three alternative procedures sharing a leave-one-out type mechanism and based on ad hoc exploratory tools and on the recursive Bayesian estimation and prediction of spatial linear mixed effects models. The three procedures are introduced by motivating issues and compared in terms of performance.