    •   ETD Home
    • Faculty of Natural Science
    • South African National Bioinformatics Institute (SANBI)
    • Magister Scientiae - MSc (Bioinformatics)

    SNP based literature and data retrieval

    Date
    2016
    Author
    Veldsman, Werner Pieter
    Abstract
    Reference single nucleotide polymorphism (refSNP) identifiers are used to earmark SNPs in the human genome. These identifiers are often found in variant call format (VCF) files. RefSNP identifiers can be useful search terms when sourcing biomedical literature. In this thesis, the development of a bioinformatics software package is motivated, planned and implemented as a web application (http://sniphunter.sanbi.ac.za) with an application programming interface (API). The purpose is to allow scientists searching for relevant literature to query a database using refSNP identifiers and potential keywords assigned to scientific literature by the authors. Multiple queries can be launched simultaneously using either the web interface or the API. In addition, a VCF file parser was developed and packaged with the application to allow users to upload VCF files, extract information from them, and write it to a file format that can be interpreted by the novel search engine created during this project. The parsing feature is seamlessly integrated with the web application's user interface, so the user is not expected to learn a scripting language. This multi-faceted software system, called SNiPhunter, is intended to save researchers time during life sciences literature procurement by suggesting articles based on the number of times a refSNP identifier is mentioned in an article. This allows the user to make a quantitative estimate of the relevance of an article. A second novel feature is the inclusion of the email address of a corresponding author in the results returned to the user, which promotes communication between scientists. Moreover, links to external functional information are provided to allow researchers to examine annotations associated with their refSNP identifier of interest. Standard information typically provided by other search engines, such as digital object identifiers and publication dates, is also included in the results returned to the user.
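
    The two ideas described in the abstract, extracting refSNP identifiers from a VCF file and ranking articles by how often an identifier is mentioned, can be illustrated with a minimal Python sketch. This is not SNiPhunter's actual code: the function names `rsids_from_vcf` and `rank_articles`, the in-memory `articles` dictionary, and the plain-text matching are all assumptions made for illustration. The only fact relied on is that VCF data lines are tab-separated and carry identifiers in the third (ID) column, separated by semicolons.

```python
import re
from collections import Counter

# A refSNP identifier is "rs" followed by digits, e.g. rs123.
RSID = re.compile(r"rs\d+")


def rsids_from_vcf(lines):
    """Return unique refSNP identifiers from the ID column of VCF data lines.

    Hypothetical helper: header lines (starting with '#') are skipped, and
    the ID column may hold several identifiers separated by ';' or '.'
    when no identifier is available.
    """
    found = []
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # malformed record; ignore in this sketch
        for ident in fields[2].split(";"):
            if RSID.fullmatch(ident) and ident not in found:
                found.append(ident)
    return found


def rank_articles(articles, rsid):
    """Rank articles by how many times `rsid` occurs in their text.

    `articles` maps a (hypothetical) article ID to its text. Word
    boundaries prevent rs123 from matching inside rs1234.
    """
    pattern = re.compile(r"\b" + re.escape(rsid) + r"\b")
    counts = Counter()
    for article_id, text in articles.items():
        mentions = len(pattern.findall(text))
        if mentions:
            counts[article_id] = mentions
    return counts.most_common()  # highest mention count first
```

    Counting whole-word matches is the simplest possible relevance signal; a real system would presumably index the literature in a database rather than scan text on each query.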
    URI
    http://hdl.handle.net/11394/5345
    Collections
    • Magister Scientiae - MSc (Bioinformatics) [24]

    DSpace 5.5 | Ubuntu 14.04 | Copyright © University of the Western Cape