ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of...

28
ALFRED: the AL ALlele FRE FREquency D Database Kenneth K. Kidd and the ALFRED Team artment of Genetics and Center for Medical Informat Yale University School of Medicine pported by the U.S. National Science Foundat
  • date post

    21-Dec-2015
  • Category

    Documents

  • view

    230
  • download

    0

Transcript of ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of...

Page 1: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFRED: the ALALlele FREFREquency DDatabase

ALFRED: the ALALlele FREFREquency DDatabase

Kenneth K. Kidd and the ALFRED Team

Department of Genetics and Center for Medical InformaticsYale University School of Medicine

Supported by the U.S. National Science Foundation

Page 2: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

IntroductionIntroductionIntroductionIntroduction

What is in ALFRED?What is in ALFRED?

How to access ALFREDHow to access ALFRED

What is ALFRED?What is ALFRED?

Why is it necessary?Why is it necessary?

What we are doing now?What we are doing now?

Page 3: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

What is ALFRED?What is ALFRED?What is ALFRED?What is ALFRED?

ALFRED, ALFRED, the ALlele FREquency the ALlele FREquency DatabaseDatabase, is designed to integrate into , is designed to integrate into a single source information on the a single source information on the frequencies of human DNA sequence frequencies of human DNA sequence variants.variants.

Page 4: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFRED Home PageALFRED Home Pagehttp://alfred.med.yale.edu/alfred/ALFRED Home PageALFRED Home Pagehttp://alfred.med.yale.edu/alfred/

Page 5: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFRED is designed to allow ALFRED is designed to allow reference of frequencies to:reference of frequencies to:ALFRED is designed to allow ALFRED is designed to allow reference of frequencies to:reference of frequencies to:

A specific typing protocol for a specific polymorphism at a specific A specific typing protocol for a specific polymorphism at a specific locus.locus.

A specific sampling of an ethnic group.A specific sampling of an ethnic group.

Cross reference to the literature for other publications of frequencies based Cross reference to the literature for other publications of frequencies based on the same sample or ethnic group.on the same sample or ethnic group.

Page 6: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Why is it necessary?Why is it necessary?Why is it necessary?Why is it necessary?

ALFRED is designed to serve as a ALFRED is designed to serve as a central repository of frequencies for central repository of frequencies for variation in the human genomevariation in the human genome––curated curated and cross referenced to molecular and and cross referenced to molecular and ethnographic databasesethnographic databases––by assembling by assembling in one place data that are dispersed in one place data that are dispersed very widely in the scientific literature.very widely in the scientific literature.

Page 7: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Why is it necessary?Why is it necessary?Why is it necessary?Why is it necessary?

ALFRED is web-based, publicly ALFRED is web-based, publicly available, with easy to download data available, with easy to download data thus serving as a resource for many thus serving as a resource for many types of research projects.types of research projects.

With its graphic displays of data, ALFRED With its graphic displays of data, ALFRED can also serve as an educational can also serve as an educational resource for physical anthropology and resource for physical anthropology and human population genetics.human population genetics.

Page 8: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

What are we doing now?What are we doing now? Data content - Quality controlData content - Quality control

What are we doing now?What are we doing now? Data content - Quality controlData content - Quality control

Criteria - a minimal typed sample size of Criteria - a minimal typed sample size of 20 individuals, minimization of missing 20 individuals, minimization of missing data, time-stamped frequency data (i.e., data, time-stamped frequency data (i.e., different versions of frequency data are different versions of frequency data are available).available).

Page 9: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

What are we doing now?What are we doing now? Data integration and accumulationData integration and accumulation

What are we doing now?What are we doing now? Data integration and accumulationData integration and accumulation

ALFRED curators are currently ALFRED curators are currently uploading allele frequency data from uploading allele frequency data from published literature throughout the published literature throughout the physical anthropology and population physical anthropology and population genetics peer reviewed journals.genetics peer reviewed journals.

Page 10: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

What are we doing now?What are we doing now? Data managementData management

What are we doing now?What are we doing now? Data managementData management

ALFRED programmers are currently ALFRED programmers are currently working on the migration of ALFRED working on the migration of ALFRED from Access to Oracle in order to from Access to Oracle in order to handle the rapidly growing database.handle the rapidly growing database.

Page 11: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

What is in ALFRED?What is in ALFRED?What is in ALFRED?What is in ALFRED?

ALFRED stores allele frequencies and ALFRED stores allele frequencies and information on a wide range of loci, information on a wide range of loci, polymorphic sites, populations, and polymorphic sites, populations, and samples.samples.

Page 12: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Table Summary NumbersTable Summary NumbersAs of April 9, 2002As of April 9, 2002Table Summary NumbersTable Summary NumbersAs of April 9, 2002As of April 9, 2002

Page 13: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

LociLociExample: Chromosome 22Example: Chromosome 22LociLociExample: Chromosome 22Example: Chromosome 22

Page 14: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

LocusLocusExample: Catechol-O-Methyl TransferaseExample: Catechol-O-Methyl TransferaseLocusLocusExample: Catechol-O-Methyl TransferaseExample: Catechol-O-Methyl Transferase

Page 15: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

LocusLocusExample: Catechol-O-Methyl TransferaseExample: Catechol-O-Methyl TransferaseLocusLocusExample: Catechol-O-Methyl TransferaseExample: Catechol-O-Methyl Transferase

Page 16: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Definition of the PolymorphismDefinition of the PolymorphismDefinition of the PolymorphismDefinition of the Polymorphism

A clear protocolPCR primers and product sizes for In/Dels and

STRPs

PCR primers and fragment sizes after enzyme digestion for RSPs

Unambiguous definition of varying nucleotides based on flanking sequence.

Page 17: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

PolymorphismsPolymorphismsExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotypePolymorphismsPolymorphismsExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotype

Page 18: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Allele FrequenciesAllele FrequenciesExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotypeAllele FrequenciesAllele FrequenciesExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotype

Page 19: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Allele FrequenciesAllele FrequenciesExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotypeAllele FrequenciesAllele FrequenciesExample: COMT, 3-site haplotypeExample: COMT, 3-site haplotype

Page 20: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

PopulationsPopulationsExample: North AmericaExample: North AmericaPopulationsPopulationsExample: North AmericaExample: North America

Page 21: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

PopulationsPopulationsExample: Maya, YucatanExample: Maya, YucatanPopulationsPopulationsExample: Maya, YucatanExample: Maya, Yucatan

Page 22: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

SamplesSamplesExample: Maya, YucatanExample: Maya, YucatanSamplesSamplesExample: Maya, YucatanExample: Maya, Yucatan

Page 23: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Frequency data retrievalFrequency data retrievalSearchSearch Frequency data retrievalFrequency data retrievalSearchSearch

Page 24: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

Frequency Variation for Frequency Variation for Four SNPS in 33 PopulationsFour SNPS in 33 PopulationsFrequency Variation for Frequency Variation for Four SNPS in 33 PopulationsFour SNPS in 33 Populations

00.10.20.30.40.50.60.70.80.9

1

DRD2 TaqI"D" RET TaqI APOB XbaI RBP3 MspI

AfricaAfrica Europe/Middle EastEurope/Middle East East AsiaEast AsiaNorthNorth

AmericaAmericaSouthSouth

AmericaAmericaP.P. S.S.

Page 25: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFRED System ImplementationALFRED System ImplementationALFRED System ImplementationALFRED System Implementation

Microsoft Access (migration to Oracle)Microsoft Access (migration to Oracle)

Microsoft NT Server with Internet Microsoft NT Server with Internet

Information Server (IIS)Information Server (IIS)

Scripts Written in Server-side ASP (VB Script)Scripts Written in Server-side ASP (VB Script)

Microsoft Access (migration to Oracle)Microsoft Access (migration to Oracle)

Microsoft NT Server with Internet Microsoft NT Server with Internet

Information Server (IIS)Information Server (IIS)

Scripts Written in Server-side ASP (VB Script)Scripts Written in Server-side ASP (VB Script)

Page 26: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFRED System OverviewALFRED System OverviewALFRED System OverviewALFRED System Overview

PhenoDB

ALFREDWeb Server

(ASP)ODBC

ClientBrowser

ExternalData

Resources

Collaborators HAPLOProgram

Others (e.g.literature)

Input Data Sources

NT Server

Kidd Lab Data

Page 27: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

ALFREDALFREDTheThe ALALlelelele FREFREquency quency DDatabaseatabase

from Kidd Labfrom Kidd Lab

ALFREDALFREDTheThe ALALlelelele FREFREquency quency DDatabaseatabase

from Kidd Labfrom Kidd Lab

http://alfred.med.yale.edu/alfred/index.asphttp://alfred.med.yale.edu/alfred/index.asp

Suggestions and comments are welcome.Suggestions and comments are welcome.

http://alfred.med.yale.edu/alfred/index.asphttp://alfred.med.yale.edu/alfred/index.asp

Suggestions and comments are welcome.Suggestions and comments are welcome.

Page 28: ALFRED ALFRED: the ALlele FREquency Database Kenneth K. Kidd and the ALFRED Team Department of Genetics and Center for Medical Informatics Yale University.

The ALFRED TeamThe ALFRED TeamThe ALFRED TeamThe ALFRED TeamSenior FacultySenior FacultyKenneth K. Kidd, Ph.D., Professor of Genetics and Psychiatry (ALFRED P.I.)Perry Miller, M.D., Ph.D., Director of Center for Medical Informatics

CuratorsCuratorsChen-Chen Yeh, M.S., Research AssociateRebekah Heinzen, B.A., Research Assistant

ProgrammersProgrammersMichael V. Osier, Ph.D. Candidate, Graduate StudentHaseena Rajeevan, Ph. D., Systems ProgrammerNicholas P. Tosches, M.D., Associate Research ScientistLyudmila Druskin, M.D., Postdoctoral Fellow and Associate

ConsultantsConsultantsAndrew J. Pakstis, Ph. D., Research Scientist

Judith R. Kidd, Ph. D., Research Scientist

Kei-Hoi Cheung, Ph. D., Assistant Professor