Showing posts with label METSIM. Show all posts
Showing posts with label METSIM. Show all posts

Friday, April 27, 2018

New T2DKP release adds individual-level data for interactive analysis

With the April release of the Type 2 Diabetes Knowledge Portal, we are increasing the number of datasets and samples available for interactive analysis via the LocusZoom and GAIT tools. These tools now access individual-level data from three additional datasets, all of which were quality controlled and analyzed at the Accelerating Medicines Partnership in Type 2 Diabetes (AMP T2D) Data Coordinating Center (DCC):
  • CAMP GWAS: 3,628 multi-ancestry samples from the MGH Cardiology and Metabolic Patient cohort, generated by a public-private partnership between Pfizer Inc. and Massachusetts General Hospital;
  • METSIM GWAS: 8,791 European ancestry samples from the Metabolic Syndrome in Men study.
These individual-level data are available as "dynamic" datasets, powered by Hail software, in LocusZoom on Gene pages and Variant pages of the T2DKP, for the following phenotypes: 
  • BioMe AMP T2D GWAS: type 2 diabetes, BMI, diastolic blood pressure, fasting glucose, HbA1c, HDL cholesterol, LDL cholesterol, systolic blood pressure
  • CAMP GWAS: type 2 diabetes, BMI, fasting glucose, fasting insulin
  • METSIM GWAS: type 2 diabetes, BMI, diastolic blood pressure, fasting glucose, fasting insulin, HbA1c, HDL cholesterol, LDL cholesterol, systolic blood pressure
To perform interactive analyses on these data in LocusZoom, select one of the available phenotypes in step 1 and then choose a "dynamic" dataset in step 2.


When you click on a variant in the resulting LocusZoom plot, the option to condition on that variant appears in the tooltip:


Clicking on that link starts on-the-fly association analysis for the region while conditioning on that variant, which can reveal whether association signals are independent of each other. You can choose to condition on multiple variants. The variants of your choice are listed in the upper left-hand corner of the plot, and the list may be edited:



Individual-level data from these three datasets are also available for interactive analysis via the Genetic Association Interactive Tool (GAIT) on Variant Pages. After selecting one of the datasets, you will be able to choose a phenotype for association analysis, filter the sample pool by specifying a range of values for one or more phenotypes, choose custom covariates, and then run on-the-fly association analysis for your chosen subset of samples. Find all of the details about how to use this tool in our GAIT guide.

We hope that the increased ability to interact with individual-level data in the T2DKP will be helpful to your research. As always, we are happy to answer any questions about these or other data and tools; please contact us for help.

Thursday, March 1, 2018

New release today, as the KPN moves to a regular release schedule

At the Knowledge Portal Network (consisting of the Type 2 Diabetes, Cardiovascular Disease, and Cerebrovascular Disease Knowledge Portals), we are establishing a regular bimonthly release schedule. Every other month, new data and features will be incorporated into the Portals. Today, we are pleased to announce the first of these releases.

New data in the Type 2 Diabetes Knowledge Portal

This release adds two new datasets to the T2DKP. The Diabetic Cohort - Singapore Prospective Study Program is a T2D case-control study to identify genetic and environmental risk factors for diabetes in Singapore Chinese. The DC-SP2 GWAS set, a meta-analysis of summary level T2D associations from 3,951 individuals, was contributed by Drs. Rob Martinus Van Dam, E Shyong Tai, and Xueling Sim from the National University of Singapore. They have also submitted individual-level data from this study to the Accelerating Medicines Partnership Data Coordinating Center (AMP DCC), and these data will be incorporated into the T2DKP after quality control and analysis are complete.

In addition to this set, we have incorporated the publicly available summary statistics from the DIAGRAM 1000G GWAS. This dataset, from the DIAGRAM (DIAbetes Genetics Replication And Meta-analysis) consortium, is a meta-analysis of 26,676 T2D cases and 132,532 control participants from 18 GWAS (Scott RA, et al. An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. (2017) Diabetes 66:2888). Samples were imputed using the all ancestries 1000 Genomes Project reference panel.

More details about both of these datasets are available on our Data page.

New features specific to the Type 2 Diabetes Knowledge Portal

We have expanded the range of data available for interactive analysis by adding individual-level data from the CAMP GWAS, BioMe AMP T2D GWAS, and METSIM GWAS datasets to the dynamic analysis modules LocusZoom and GAIT (Genetic Association Interactive Tool). LocusZoom, powered by the Hail software developed at the Broad Institute as part of the AMP T2D project, allows you to perform custom association analysis while conditioning on specific variants or sets of variants.

GAIT offers alternative options for custom association analysis, such as filtering samples by their phenotypic characteristics (e.g., age, BMI, cholesterol levels) and choosing specific covariates. To date, seven different datasets comprised of over 67,000 samples are available for dynamic analysis in GAIT. These include datasets housed both at the AMP DCC (19k exome sequence analysis; CAMP GWAS; BioMe AMP T2D GWAS; METSIM GWAS) and at the EBI Federated node (EXTEND GWAS; Oxford Biobank exome chip analysis; GoDARTS Affymetrix GWAS).

We have also taken an initial step towards integration of the T2DKP with a new federated node, the T2DREAM database of epigenomic data relevant to T2D. In the near future, epigenomic data displayed in the T2DKP will be drawn dynamically from T2DREAM. In the meantime, we have added gene- and variant-specific links to T2DREAM from the re-styled External Resources section at the bottom of Gene and Variant pages.

New features for all Knowledge Portals

Some of the improvements in this release are visible in all the Portals of the Knowledge Portal Network. One of the most significant affects LocusZoom, the dynamic plot that displays variant associations along with their genomic coordinates, linkage disequilibrium, and other information. Previously, the only way to select a phenotype was to scroll through a long list. Now, a new phenotype filter lets you enter one or more search criteria and filter the list by those criteria. Once you have selected a phenotype, the datasets that include associations for that phenotype are presented for selection. Previously, only one dataset (the one with the largest sample size) was available for each phenotype; now, associations from all relevant datasets may be viewed in LocusZoom.


Portion of the updated LocusZoom interface, showing phenotype filtering capability.


The sample filtering panel of the user interface for the custom burden test and GAIT (Genetic Association Interactive Tool) has also been improved to make it more intuitive to use. The External Resources sections of Gene and Variant pages have been re-styled, and gene- and variant-specific links to PheWeb have been added. PheWeb displays phenotypes most significantly associated with the gene or variant, based on a GWAS for over 2,400 phenotypes in UK Biobank data that was performed by Ben Neale's group. Finally, the home pages of all the Portals have been redesigned to make the appearance of the disease-specific portals more distinct.


Please browse these new data and features, and let us know what you think!

Friday, January 19, 2018

New METSIM dataset adds individual-level GWAS data to the T2DKP

The Finnish population is a valuable genetic resource. Having undergone multiple population bottlenecks, this relatively homogeneous population is enriched in low-frequency and loss-of-function variants. Even better, Finns are generally willing to participate in research studies, and many measures of their health are detailed in comprehensive electronic health records.

To take advantage of these characteristics, the METSIM (Metabolic Syndrome in Men) study (Laakso et al. 2017, J. Lipid Res. 58, 481-493) was initiated in 2005. Over 10,000 Finnish men were examined between 2005 and 2010. All of the subjects were phenotyped extensively, with an emphasis on traits associated with type 2 diabetes (T2D), cardiovascular disease, and insulin resistance, and their genotypes and exome sequences were determined. Subsets of the group have been characterized in more detail, with whole-genome sequencing and detailed analyses of transcripts and gene expression, DNA methylation, gut microbiome composition, and other phenotypes.

Now, you can easily access results from the METSIM cohort in the T2D Knowledge Portal. Variant associations with T2D, fasting glucose levels, and fasting insulin levels are available, both unadjusted or adjusted for body mass index. The individual-level data are also available for interactive analyses using our Genetic Association Interactive Tool (GAIT; see below), which allows you to design and run custom association analyses using custom subsets of the samples, while always protecting patient privacy. The addition of METSIM data brings to nearly 68,000 the number of samples available for analysis in GAIT.

The Foundation for the NIH and the Accelerating Medicines Partnership in Type 2 Diabetes were instrumental in bringing these data, generated by researchers in Finland and the U.S., to the T2DKP. Individual-level genotype data from 1,185 T2D cases and 7,357 controls were deposited into the Data Coordinating Center (AMP T2D DCC), and analysis and quality control were performed by the DCC analysis team. The experiment design and analysis are summarized on our Data page, and detailed reports that fully document the analysis are available for download.

The METSIM GWAS dataset currently has "Early Access Phase 1" status in the T2DKP, which is assigned to new data. This status denotes that although analysis and quality control checks have been performed, the data are not yet considered to be in their final state. During the early access period, users may analyze the data but may not submit the results of these analyses for publication. Find full details about the different phases of data release on our Policies page.

Results from METSIM GWAS may be viewed at these locations in the T2D Knowledge Portal:

• On Gene Pages (e.g., MTNR1B) in the Common variants and High-impact variants tables and in LocusZoom static plots, for the phenotypes T2D, T2D adjusted for BMI, fasting glucose, fasting glucose adjusted for BMI, fasting insulin, and fasting insulin adjusted for BMI;

• On Variant Pages (e.g.rs579060) in the Associations at a glance section, the Association statistics across traits table, and in LocusZoom static plots;

• From the View full genetic association results for a phenotype search on the home page: first select one of the phenotypes listed above, and then on the resulting page, select the METSIM GWAS dataset.

Individual-level METSIM GWAS data may be used for custom interactive analyses using these tools in the T2DKP:

• Using the Variant Finder tool, you may specify multiple criteria and retrieve the set of variants meeting those criteria;

• Using the Genetic Association Interactive Tool (GAIT) on Variant Pages, you may select the METSIM GWAS dataset, choose one of 5 phenotypes for association analysis, choose custom covariates, and filter the sample pool by specifying a range of values for one or more of 8 different phenotypes, then run on-the-fly analysis.

Phenotypes available for association analysis of METSIM GWAS data in GAIT


Covariates available for selection when analyzing METSIM GWAS data in GAIT


Samples may be filtered by setting ranges for one or more of 8 phenotypes for the METSIM GWAS dataset