    •   ETD Home
    • Faculty of Natural Science
    • Department of Statistics & Population Studies
    • Magister Scientiae - MSc (Statistics and Population Studies)
    • View Item

    The statistical analysis of complex sampling data

    View/Open
    paulse_msc_nsc_2018 (16.41Mb)
    Date
    2018
    Author
    Paulse, Bradley
    Abstract
    Most standard statistical techniques illustrated in textbooks assume that the data are collected from a simple random sample (SRS) and hence are independently and identically distributed (i.i.d.). In reality, data are often sourced through complex sampling (CS) designs, which combine stratification and clustering at different levels of the design. Consequently, CS data are not i.i.d., and sampling weights, developed over the different stages of the design, are calculated and included in the analysis to account for the sampling design. Logistic regression is often employed in modelling survey data since the response under investigation is typically dichotomous. Furthermore, since the logistic regression model makes no homogeneity or normality assumptions, it is appealing for modelling a dichotomous response from survey data. This research compares the estimates of the logistic regression model parameters when the CS design is accounted for, i.e. weighting is present, with those obtained when the data are modelled as if from an SRS design, i.e. no weighting. In addition, the standard errors of the estimators are obtained using three different variance estimation techniques, viz. Taylor series linearization, the jackknife and the bootstrap. The estimated standard errors are used to calculate the standard (asymptotic) interval, which is compared with the bootstrap percentile interval in terms of interval coverage probability. A further level of comparison contrasts the results obtained using only design weights with those obtained using calibrated and integrated sampling weights. The simulation study is based on the Income and Expenditure Survey (IES) of 2005/2006. The results showed that the estimators generally performed better when weighting was used than when the design was ignored, i.e. under the assumption of SRS, with the results for the Taylor series linearization being more stable.
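
    The interval comparison described in the abstract can be illustrated with a minimal sketch. This is not the thesis's code or data: to stay short it estimates a weighted proportion rather than logistic regression parameters, the clusters and weights are made-up toy values, and the function names `weighted_proportion` and `bootstrap_intervals` are hypothetical. It resamples whole clusters with replacement (one common way to bootstrap clustered data, assumed here) and reports both a standard interval built from the bootstrap standard error and a bootstrap percentile interval.

```python
import random
import statistics


def weighted_proportion(records):
    """Weighted estimate of P(y = 1); each record is a (y, weight) pair."""
    total_weight = sum(w for _, w in records)
    return sum(y * w for y, w in records) / total_weight


def bootstrap_intervals(clusters, n_boot=2000, seed=0):
    """Return (estimate, standard_ci, percentile_ci) at the 95% level.

    Whole clusters are resampled with replacement so that the
    within-cluster dependence is preserved in every replicate.
    """
    rng = random.Random(seed)
    estimate = weighted_proportion([rec for c in clusters for rec in c])

    replicates = []
    for _ in range(n_boot):
        sample = rng.choices(clusters, k=len(clusters))
        replicates.append(
            weighted_proportion([rec for c in sample for rec in c])
        )
    replicates.sort()

    # Standard (asymptotic) interval: estimate +/- 1.96 * bootstrap SE.
    se = statistics.stdev(replicates)
    standard_ci = (estimate - 1.96 * se, estimate + 1.96 * se)

    # Percentile interval: empirical 2.5% and 97.5% points of the replicates.
    lower = replicates[int(0.025 * n_boot)]
    upper = replicates[int(0.975 * n_boot) - 1]
    return estimate, standard_ci, (lower, upper)


# Toy data (assumption): four clusters of (outcome, sampling weight) pairs.
clusters = [
    [(1, 120.0), (0, 120.0), (1, 120.0)],
    [(0, 80.0), (0, 80.0), (1, 80.0), (0, 80.0)],
    [(1, 200.0), (1, 200.0), (0, 200.0)],
    [(0, 50.0), (1, 50.0), (0, 50.0), (0, 50.0), (1, 50.0)],
]

estimate, standard_ci, percentile_ci = bootstrap_intervals(clusters)
print("weighted estimate:", round(estimate, 4))
print("standard interval:", tuple(round(x, 4) for x in standard_ci))
print("percentile interval:", tuple(round(x, 4) for x in percentile_ci))
```

    Setting every weight to 1.0 in the toy data gives the unweighted (SRS-style) estimate, so the same sketch can be used to see how ignoring the weights shifts the estimate and both intervals.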
    URI
    http://hdl.handle.net/11394/6754
    Collections
    • Magister Scientiae - MSc (Statistics and Population Studies)

    DSpace 6.3 | Ubuntu | Copyright © University of the Western Cape