FUMA practical
Background
In this practical you will explore FUMA results for functional analysis of a GWAS of LDL cholesterol from Willer et al 2013 Nature Genetics.
The original dataset was downloaded here.
Because it can take some time to run FUMA, the GWAS summary data has already been formatted, uploaded to FUMA and results made public.
Go to the list of publicly available FUMA results and click on the results with ID 610 (title: GLGC_Willeretl_Submission2) to see the FUMA results for this GWAS.
Aim: The purpose of this practical is to get you to explore and get familiar with FUMA results by answering the questions below, so that you are able to run and interpret such analyses with your own data.
Questions:
Exploring SNP2GENE Genome-wide plots
What is the most significantly associated gene from the gene-based analysis?
Based on MAGMA gene-set analysis, what processes are enriched in the GWAS risk regions?
Based on MAGMA gene-set analysis, which tissue is significantly enriched amongst GWAS loci, and is that expected a priori?
Exploring SNP2GENE Results Summary
How many genomic risk loci are associated with LDL-C?
How may genes are mapped to these regions?
Which genomic region (e.g. intronic, intergenic, 3’UTR etc) is significantly underrepresented amongst candidate GWAS SNPs?
What proportion of the candidate SNPs are intronic?
Which genomic region is the most enriched amongst candidate SNPs?
Exploring SNP2GENE Results
What is the rs ID for the most significantly associated SNP?
Which chromosome is this SNP located on?
What genomic region is the SNP located in (e.g. coding region for gene X, UTR3 for gene Y, downstream of gene X or intergenic between gene X and gene Y)?
What is the CADD score for this SNP?
What is the RegulomeDB score for this SNP and what does the score mean?
Which genes is the above SNP an eQTL for in GTEx Liver data?
Of the above identified genes, which have evidence of physical interaction between the GWAS associated region and the gene’s promoter (use the circus plot to determine this)?
Exploring GENE2FUNC Gene-Set Results
Which Wiki pathways related to medication are enriched in LDL-C-associated loci?
Which non-cardiovascular GWAS data also show an enrichment of LDL-C-associated loci and which genes overlap with both traits?