PGC bioinformatics

PF Sullivan

20 September 2016

Apologies

To the whole of the US for having this at an inaccessible time

Then again, the US presidential election has polluted global consciousness

Next one will be more US-friendly

Motivation

• The PGC is no longer a “one-and-done” organization: not just a handful of GWAS mega-analyses, but dozens

• Interpretation of results & downstream analyses should be routine but is often cumbersome, incomplete, and error-prone

• This seems pretty clear, obvious, unobjectionable

• Develop, test, & deploy command line tools to interpret GWAS results

• Integrated with ricopili

• Implement on LISA, initially for use within PGC (open later), /home/pgcbioif (thanks Danielle)

• Open-source, community effort, full documentation

• Some databases will need to be updated

• Two types:

  • Mature, best-in-class

  • Experimental/investigative (to be used cautiously)

Proposal, mature: for a set of GWAS results

Type | Content | Status
Lookups, general | Genes, OMIM, GWAS catalog, CNV, ID, DD, ASD | TIEFIghter java, beta test
Lookups, vs all PGC findings | Find SNP results in all prior PGC studies | Available now, gwasLibrary on LISA
SNP-h2 | Use LDSR to compute SNP-h2 for a range of K (0.001 to 0.01 by 0.001, 0.01 to 0.15 by 0.01) | Need a script
Local SNP h2 | Bogdan Pasaniuc, HESS | Available now
rg vs PGC & vs LD-Hub | Genetic correlations vs all PGC & LD-Hub (need both b/c LD-Hub update status unclear) | Available now
Partitioned LDSR | Hilary Finucane (pmid 26414678) | Available now, but input data
TWAS | Eg, Sasha Gusev, Alkes Price (submitted): impute brain expression levels, case-control; other methods exist | Available now, but input data
MAGMA | de Leeuw, Posthuma | Available now

Others? SMR. GCTA. Popcorn. Credible SNPs. eQTLs. I personally wouldn’t do much with ENCODE/RoadMap – better data coming.
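The SNP-h2 entry above still needs a script. As a minimal sketch of what it might tabulate, the Python below builds the stated grid of population prevalences K and rescales an observed-scale SNP-h2 to the liability scale with the standard observed-to-liability transformation. The function names, the observed h2 of 0.4, and the sample case fraction P of 0.5 are illustrative assumptions, not PGC values; whether the eventual script would rerun LDSR for each K or rescale a single observed-scale estimate is also an assumption.

```python
from statistics import NormalDist


def prevalence_grid():
    """K values: 0.001 to 0.01 by 0.001, then 0.01 to 0.15 by 0.01."""
    fine = [round(i * 0.001, 3) for i in range(1, 11)]
    coarse = [round(i * 0.01, 2) for i in range(1, 16)]
    return sorted(set(fine + coarse))  # 0.01 occurs in both ranges; keep one copy


def observed_to_liability(h2_obs, K, P):
    """Rescale observed-scale h2 given population prevalence K and
    sample case fraction P (standard liability-threshold conversion)."""
    std_normal = NormalDist()
    threshold = std_normal.inv_cdf(1.0 - K)  # liability threshold for prevalence K
    z = std_normal.pdf(threshold)            # normal density at that threshold
    return h2_obs * (K * (1.0 - K)) ** 2 / (P * (1.0 - P) * z ** 2)


H2_OBS = 0.4    # hypothetical observed-scale estimate
P_SAMPLE = 0.5  # hypothetical proportion of cases in the GWAS sample

table = [(K, observed_to_liability(H2_OBS, K, P_SAMPLE)) for K in prevalence_grid()]
for K, h2_liab in table:
    print(f"K={K:<6} liability-scale h2={h2_liab:.3f}")
```

Reporting the whole grid rather than a single K makes it visible how strongly the liability-scale estimate depends on the assumed prevalence.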

Discussion

Correct current mature set? Others?

geneMatrix

Sven Stringer, VU Amsterdam

• Pipeline to automatically create psychiatric genetics-focused annotated geneMatrix

• Usable in PGC and COSYN project

• Gene matrix should be

  • useful for most of the people most of the time (not 100%)

  • easy to update

  • well-documented

  • directly usable in Excel as well as other analytic environments (R, MATLAB, Python, Linux, etc.)

• Housed on LISA /home/pgcbioif (thanks Danielle)

• Original by PF Sullivan beginning 2004 (not general)

• Create flexible geneMatrix pipeline

• Pipeline will be

  • fully portable across Linux environments

  • run from the LISA cluster on its own account (/home/pgcbioif)

  • easy to configure

  • well-documented

  • able to create a gene matrix suitable for human and computer consumption (probably .csv format)

• Implementation mostly in R

• Update and distribution policy for gene matrix will be put in place

Design

[Pipeline diagram. Boxes: HGNC data; GENCODE V24 back-mapped to hg19; preprocess; merge and format; create gene translation table (GTT); GTT; core matrix; external annotations; output settings; merge and format; output matrix]
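The merge steps in the design can be sketched as follows. This is an illustration only: the slides say the real pipeline is mostly in R, and the field names (gene_id, symbol, aliases), the toy genes, and the pLI value are all hypothetical. The sketch builds a gene translation table from symbols and aliases, left-joins one external annotation onto a core matrix, and writes CSV.

```python
import csv
import io


def build_gtt(hgnc_rows):
    """Map every official symbol and alias to one gene ID."""
    gtt = {}
    for row in hgnc_rows:
        gtt[row["symbol"]] = row["gene_id"]        # official symbols always win
        for alias in row.get("aliases", []):
            gtt.setdefault(alias, row["gene_id"])  # aliases never overwrite
    return gtt


def merge_annotation(core, gtt, annotation, column):
    """Left-join an annotation keyed by any known name onto the core matrix."""
    by_id = {}
    for name, value in annotation.items():
        gene_id = gtt.get(name)
        if gene_id is not None:  # names missing from the GTT are dropped
            by_id[gene_id] = value
    return [dict(row, **{column: by_id.get(row["gene_id"], "")}) for row in core]


def to_csv(rows):
    """Serialize the matrix so it opens directly in Excel, R, or Python."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Toy inputs (hypothetical genes and values)
hgnc = [
    {"gene_id": "G1", "symbol": "GENE_A", "aliases": ["A1"]},
    {"gene_id": "G2", "symbol": "GENE_B", "aliases": []},
]
core = [
    {"gene_id": "G1", "symbol": "GENE_A", "chr": "1", "start": 100, "end": 500},
    {"gene_id": "G2", "symbol": "GENE_B", "chr": "2", "start": 900, "end": 1500},
]
pli = {"A1": 0.98}  # annotation source that happens to use an alias

matrix = merge_annotation(core, build_gtt(hgnc), pli, "pLI")
csv_text = to_csv(matrix)
print(csv_text)
```

Keying every join on one stable gene ID, with the translation table absorbing naming differences, is what would let external sources with inconsistent symbols be merged automatically.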

Main annotations

• Gene names (official HUGO symbol and aliases)

• Location on hg19 (GENCODE v24)

• Information about LD and SNP density

• ExAC constraint score (pLI)

• Associated OMIM diseases & NHGRI/EBI GWAS catalog traits

• Gene-based p-values from large psychiatric GWAS

• Disease-specific manually curated annotations (ID, DD, ASD, brain expression, community flags)

Manual curation

• Extracting information from important disease-specific papers

• Distribute tasks/responsibilities across stakeholders

• Data from curators will need to conform to specific format to be included automatically in pipeline

• Policies and conventions will be put in place to make this manual curation work
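The required curator format has not been defined yet. As a sketch of how conformance might be checked automatically before a file enters the pipeline, the Python below validates a CSV submission against a hypothetical four-column layout (gene_symbol, annotation, value, source); the column names and rules are assumptions for illustration.

```python
import csv
import io

REQUIRED = ("gene_symbol", "annotation", "value", "source")  # hypothetical layout


def validate_submission(text, known_symbols):
    """Return a list of problems; an empty list means the file is acceptable."""
    reader = csv.DictReader(io.StringIO(text))
    missing = [c for c in REQUIRED if c not in (reader.fieldnames or [])]
    if missing:
        return [f"missing columns: {', '.join(missing)}"]
    problems = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        if row["gene_symbol"] not in known_symbols:
            problems.append(f"line {line_no}: unknown gene symbol {row['gene_symbol']!r}")
        if not (row["source"] or "").strip():
            problems.append(f"line {line_no}: empty source")
    return problems


good = "gene_symbol,annotation,value,source\nGENE_A,ID,1,curator note\n"
bad = "gene_symbol,annotation,value,source\nGENE_X,ASD,1,\n"

print(validate_submission(good, {"GENE_A"}))
print(validate_submission(bad, {"GENE_A"}))
```

Returning every problem at once, rather than stopping at the first, lets a curator fix a file in one pass.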

Limitations

• Quality of information obviously depends on data sources used (GENCODE, HGNC, etc.)

• Gene matrix is provided “as-is”, no guarantees

• However

  • care is taken to ensure quality as much as possible

  • sanity checks are performed

  • pipeline is transparent and documented
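The slides do not list the sanity checks. A few plausible ones, sketched in Python under assumed column names (gene_id, start, end, pLI): gene IDs are unique, each gene starts before it ends, and pLI, being a probability, lies in [0, 1].

```python
def sanity_check(matrix):
    """Return a list of problems found in a list-of-dicts gene matrix."""
    problems = []
    seen = set()
    for row in matrix:
        gene_id = row["gene_id"]
        if gene_id in seen:
            problems.append(f"duplicate gene_id {gene_id}")
        seen.add(gene_id)
        if not row["start"] < row["end"]:
            problems.append(f"{gene_id}: start is not below end")
        pli = row.get("pLI")
        if pli is not None and not 0.0 <= pli <= 1.0:
            problems.append(f"{gene_id}: pLI outside [0, 1]")
    return problems


# Hypothetical rows: the second is a duplicate with swapped coordinates
# and an impossible pLI.
rows = [
    {"gene_id": "G1", "start": 100, "end": 500, "pLI": 0.98},
    {"gene_id": "G1", "start": 700, "end": 600, "pLI": 1.2},
]
for problem in sanity_check(rows):
    print(problem)
```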

We need a core team to take responsibility

• Suggest based in PGC Stats Group, need steering group – two leaders to be responsible

• PGC has employed analysts & data wranglers

• Use our paid PGC consultants (advice only)

• Implementation / update data / add new data & features

• Interface with PGC Data Access Committee, Pathway group

• Simple standard formats (please, let’s not get fancy)

(PGC liberalizing results access policy – in progress)

How can I get involved with the PGC?

Much of PGC leadership is 55+. Turnover is good for an organization.

The people who are in leadership roles in PGC stepped up:

• Volunteered, followed through, did tasks well

• Took on small roles, did them well, got more to do

• Volunteered to write parts of papers

• Were consistently on calls

PGC FAQ

Let’s get phase 1 done

Then we can move from there.

Particularly working with psychENCODE on functional genomic data.

I can’t do this too… if people don’t step up, it won’t happen
