CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.
![Page 1: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/1.jpg)
CACAO - Remote training
Gene Function and Gene Ontology
Fall 2011
http://gowiki.tamu.edu/wiki/index.php/Category:CACAO
![Page 2: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/2.jpg)
“Scientists find gene that ...”
![Page 3: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/3.jpg)
An avalanche of genes
• High throughput sequencing is finding genes faster than we can understand them
• Goals for annotation:
  – Where the genes are in the genome
  – What their functions are
![Page 4: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/4.jpg)
Function annotation
• Allows us to
  – Infer the functions of genes
    • Related by common descent
    • Related by similar expression patterns
    • Related by phylogenetic profiles
    • ...
![Page 5: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/5.jpg)
Function annotation
• Allows us to
  – Understand the capabilities of organisms' genomes
  – Understand patterns of gene expression
    • In different environments
    • In different tissues
    • In disease states
  – ...
![Page 6: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/6.jpg)
Classic MODel
Literature
Datasets
Curators (rate limiting)
Database
![Page 7: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/7.jpg)
Requirements
• Accurate functional annotation for as many genes as possible
• A system of assigning function that allows both humans and computers to compare, contrast, analyze, and predict gene function
• Curators to make and/or check these assignments
  – For CACAO, we will teach you what biocurators do.
![Page 8: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/8.jpg)
CACAO
• Community
• Assessment
• Community
• Annotation with
• Ontologies
How well can you (with our coaching) assign gene functions using GO?
![Page 9: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/9.jpg)
CACAO is competitive
• Teams get points for complete annotations
  – GO term (right level of specificity)
  – reference
  – evidence code
  – identify where in the paper the evidence comes from
• Teams can take away points from competitors by challenging annotations
  – finding a problem
  – suggesting a better alternative
![Page 10: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/10.jpg)
What’s in it for you (besides credit)?
We hope you will
• learn how we think about gene function
• gain skills that will help your future career
• enjoy contributing to a resource used by people all over the world
• have fun!
![Page 11: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/11.jpg)
The gist of CACAO…
Finding evidence (in papers)
Making annotations
Using GO terms
![Page 12: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/12.jpg)
GO = Gene Ontology
• Controlled vocabulary
  – Everyone uses the same terms
  – Terms have IDs that computers can understand
• Relationships between functions
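
To make the two points on this slide concrete, here is a minimal sketch. It assumes only that GO IDs are written as "GO:" followed by seven digits. The hierarchy below is a toy, simplified chain of term names chosen for illustration; the real ontology has more intermediate terms and a term can have more than one parent. The names `is_go_id`, `TOY_PARENTS` and `ancestors` are made up for this example.

```python
import re

# GO identifiers are written as "GO:" followed by seven digits.
GO_ID_PATTERN = re.compile(r"GO:\d{7}")


def is_go_id(text):
    """Return True if text has the shape of a GO identifier."""
    return GO_ID_PATTERN.fullmatch(text) is not None


# Toy, simplified parent links (term names only, for illustration).
TOY_PARENTS = {
    "protein tyrosine kinase activity": ["protein kinase activity"],
    "protein kinase activity": ["kinase activity"],
    "kinase activity": ["catalytic activity"],
    "catalytic activity": ["molecular_function"],
    "molecular_function": [],
}


def ancestors(term, parents):
    """Collect every term reachable by following parent links upward."""
    found = []
    stack = list(parents.get(term, []))
    while stack:
        current = stack.pop()
        if current not in found:
            found.append(current)
            stack.extend(parents.get(current, []))
    return found
```

Because the terms are related, a gene product annotated to a specific term is implicitly also described by every broader term above it.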
![Page 13: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/13.jpg)
Gene Ontology
A common system for describing gene function
![Page 14: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/14.jpg)
GO
• 3 aspects (ontologies) for gene products
  1. Biological Process
  2. Molecular Function
  3. Cellular Component
• Used to make annotations
  – aka Gene associations
  – Term + qualifiers + evidence code + reference etc.
![Page 15: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/15.jpg)
Molecular Function
• activities or “jobs” of a gene product
glucose-6-phosphate isomerase activity
Figure from GO Consortium presentations
![Page 16: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/16.jpg)
Biological Process
a commonly recognized series of events
cell division
Figure from Nature Reviews Microbiology 6, 28-40 (January 2008)
![Page 17: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/17.jpg)
Cellular Component
• where a gene product acts
![Page 18: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/18.jpg)
Key elements of a GO annotation
Submitted to GO consortium
Viewable on GONUTS
**Don’t worry - I will cover this again (several times)!
![Page 19: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/19.jpg)
GO Annotation
• To make an annotation, you need to
  – Assign GO terms to genes (gene products)
    • At appropriate level of specificity
    • Sometimes with Qualifiers
      – NOT
      – Contributes_to
      – Colocalizes_with
  – Record the evidence
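
One way to picture an annotation is as a small record. The sketch below is only an illustration, not how GONUTS stores anything: the class name `GoAnnotation` and its field names are invented here, and the qualifier check is case-insensitive because capitalization varies between tools (an assumption). The example values reuse identifiers that appear elsewhere in these slides.

```python
from dataclasses import dataclass, field

# The three qualifiers named on the slide, compared case-insensitively.
ALLOWED_QUALIFIERS = {"not", "contributes_to", "colocalizes_with"}


@dataclass
class GoAnnotation:
    """Hypothetical record: one GO term assigned to one gene product."""

    gene_product: str
    go_id: str
    evidence_code: str
    reference: str
    qualifiers: list = field(default_factory=list)

    def __post_init__(self):
        # Reject any qualifier that is not one of the three listed above.
        unknown = [q for q in self.qualifiers if q.lower() not in ALLOWED_QUALIFIERS]
        if unknown:
            raise ValueError(f"unknown qualifier(s): {unknown}")


# Example: a plain annotation with no qualifiers.
example = GoAnnotation("P012A9", "GO:0004713", "IDA", "PMID:1111")
```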
![Page 20: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/20.jpg)
Record the evidence
• Where it came from:
  – Reference (database accession)
    • PMID:6987663
• Kind of evidence:
  – Evidence codes
    • IMP: Inferred from Mutant Phenotype
    • IDA: Inferred from Direct Assay
    • …
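
As a rough sketch of how these two pieces of evidence can be handled by a program: the table below holds only the two evidence codes named on this slide (GO defines more), and the helper names `split_reference` and `describe_evidence` are made up for this example.

```python
# Only the two codes named on the slide; GO defines more.
EVIDENCE_CODES = {
    "IMP": "Inferred from Mutant Phenotype",
    "IDA": "Inferred from Direct Assay",
}


def split_reference(reference):
    """Split a 'DATABASE:accession' reference into its two parts."""
    database, separator, accession = reference.partition(":")
    if not separator or not database or not accession:
        raise ValueError(f"expected 'DATABASE:accession', got {reference!r}")
    return database, accession


def describe_evidence(code):
    """Return the long name for a known evidence code, or None."""
    return EVIDENCE_CODES.get(code.upper())
```

For the reference on this slide, `split_reference("PMID:6987663")` gives the database name `PMID` and the accession `6987663`.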
![Page 21: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/21.jpg)
CACAO - the “Community Annotation” part
What I am going to tell you about next is:
1. How to choose proteins to annotate
2. Finding GO terms & navigating a GO term page
3. Finding UniProt accessions
4. Making gene pages on GONUTS & the anatomy of a gene page
5. How and where to add an annotation
6. Where to look for your annotations & other teams’ annotations … (& the challenges!)
http://gowiki.tamu.edu/wiki/index.php/
![Page 22: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/22.jpg)
Deciding what to annotate
1. randomly
2. topics of interest (e.g. efflux pump proteins, biofilms)
3. papers you have come across while doing other stuff
4. methods you know or want to learn
5. phenotypes and mutants you are interested in
6. by author
7. by pathway or regulon
8. suggested by another (e.g. high IEA:manual annotation ratio)
9. current paper mentions another gene product
10. review papers (e.g. Annual Reviews are excellent sources)
EXAMPLE #1: let’s say you have a great paper (PMID:1111) that characterizes the tyrosine kinase activity of your favorite protein (human p53)…
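
Option 8 in the list mentions a high IEA:manual annotation ratio. IEA is the evidence code for annotations Inferred from Electronic Annotation. A minimal sketch of that ratio, assuming (for illustration only) that every non-IEA evidence code counts as a manual annotation; the function name is made up:

```python
def iea_to_manual_ratio(evidence_codes):
    """Return (#IEA) / (#non-IEA), or None when there are no manual annotations."""
    iea = sum(1 for code in evidence_codes if code == "IEA")
    manual = len(evidence_codes) - iea
    return iea / manual if manual else None
```

A gene page with many IEA rows and few or no manual ones is a candidate worth a literature search.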
![Page 23: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/23.jpg)
Part I: Where do you search for GO terms? GONUTS
http://gowiki.tamu.edu
![Page 24: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/24.jpg)
![Page 25: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/25.jpg)
![Page 26: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/26.jpg)
![Page 27: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/27.jpg)
![Page 28: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/28.jpg)
![Page 29: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/29.jpg)
![Page 30: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/30.jpg)
![Page 31: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/31.jpg)
• CHICK - AgBase (Gallus gallus)
• dictyBase - dictyBase (Dictyostelium discoideum - slime mold)
• FB - FlyBase (Drosophila melanogaster)
• HUMAN - Reactome, BHF-UCL
• MGI - Mouse Genome Informatics (Mus musculus - house mouse)
• SGD - Saccharomyces Genome Database (Saccharomyces cerevisiae - yeast)
• TAIR - The Arabidopsis Information Resource (Arabidopsis thaliana)
• WB - WormBase (Caenorhabditis elegans)
• ZFIN - Zebrafish model organism database (Danio rerio)
![Page 32: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/32.jpg)
What do you actually need once you have found the correct term?
GO:0004713
![Page 33: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/33.jpg)
Part II: You now have a paper, a protein & you found a suitable GO term… what next?
• UniProt accession - http://www.uniprot.org
- Search (“Query”) & find the correct UniProt accession for your protein
- It looks something like: P012A9
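
If you want a quick sanity check that something you copied has the shape of a UniProt accession, a pattern match is enough. The sketch below uses the accession format as I understand it from UniProt's help pages; treat the exact pattern as an assumption and note that it only checks the shape, not whether the entry exists. The function name is made up.

```python
import re

# Shape of a UniProt accession (6 or 10 characters), per UniProt's
# published format as understood here; verify against their help pages.
UNIPROT_ACCESSION = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def looks_like_uniprot_accession(text):
    """Return True if text has the shape of a UniProt accession."""
    return UNIPROT_ACCESSION.fullmatch(text.strip()) is not None
```

Gene names such as p53 and GO IDs such as GO:0004713 do not match, which helps catch a value pasted into the wrong field.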
![Page 34: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/34.jpg)
Part III: Where are you going to add your annotations? GONUTS
http://gowiki.tamu.edu
![Page 35: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/35.jpg)
How do you make a new gene page in GONUTS?
• Use the UniProt accession to make a page that you will be able to add your own annotation to.
• GoPageMaker will:
  1. Check if the page exists in GONUTS & take you there if it does.
  2. Make a page & pull all of the annotations from UniProt into a table that you can edit.
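
The two GoPageMaker steps amount to "get the page, or make it if it is missing". The sketch below only imitates that logic with a plain dictionary; it is not GoPageMaker's code and does not talk to GONUTS or UniProt. The names `get_or_make_page`, `pages` and `fetch_annotations` are hypothetical stand-ins.

```python
def get_or_make_page(accession, pages, fetch_annotations):
    """Return (annotation_table, created) for a UniProt accession.

    pages: dict mapping accession -> list of annotation rows
           (a stand-in for the wiki).
    fetch_annotations: callable returning the existing annotation rows
           for an accession (a stand-in for the import from UniProt).
    """
    # Step 1: the page already exists, so just go there.
    if accession in pages:
        return pages[accession], False
    # Step 2: make the page and fill its table with existing annotations.
    pages[accession] = list(fetch_annotations(accession))
    return pages[accession], True
```

Either way you end up on a page with an editable annotation table, which is where your own row goes.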
![Page 36: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/36.jpg)
…
![Page 37: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/37.jpg)
…
![Page 38: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/38.jpg)
…
![Page 39: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/39.jpg)
…
Where do you add an annotation? Add a row in the table.
![Page 40: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/40.jpg)
![Page 41: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/41.jpg)
What you must fill in (for every annotation)
GO:0004713
PMID:1111
IDA: Inferred from direct assay
Figure 2a
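
Since points depend on complete annotations, it can help to check a row before you save it. This is a small sketch of such a check, using the four values from this slide; the field names and the function `missing_fields` are invented for the example and are not the column names used on GONUTS.

```python
# Hypothetical field names for the four things every annotation needs.
REQUIRED_FIELDS = ("go_id", "reference", "evidence_code", "evidence_location")


def missing_fields(row):
    """Return the required fields that are absent or blank in row."""
    missing = []
    for name in REQUIRED_FIELDS:
        value = row.get(name)
        if value is None or not str(value).strip():
            missing.append(name)
    return missing


example_row = {
    "go_id": "GO:0004713",
    "reference": "PMID:1111",
    "evidence_code": "IDA",
    "evidence_location": "Figure 2a",
}
```

For `example_row` the function returns an empty list, meaning nothing required is missing.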
![Page 42: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/42.jpg)
What you might also have to fill in
Not sure? Check the competition guidelines. Ask a coach (Jim, Debby, Adrienne or usually me)!
![Page 43: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/43.jpg)
Where will your annotation now show up?
1. In the “Annotation” table on the gene page you just edited
2. In the table on your user page
http://gowiki.tamu.edu/wiki/index.php/User:Oherrera
3. In the table on your team page
http://gowiki.tamu.edu/wiki/index.php/Category:Team_That_Will_Beat_You!!!
4. As points on the scoreboard
http://gowiki.tamu.edu/wiki/index.php/Category:CACAO_UW_Parkside
![Page 44: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/44.jpg)
Questions?
At this point, you should be able to:
1. Find GO terms on GONUTS
2. Find UniProt accessions on UniProt
3. Make a gene page on GONUTS
4. Add an annotation
![Page 45: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/45.jpg)
CACAO - the “Community Assessment” part
![Page 46: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/46.jpg)
![Page 47: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/47.jpg)
![Page 48: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/48.jpg)
http://gowiki.tamu.edu/wiki/index.php/Category:CACAO_UW_Parkside
![Page 49: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/49.jpg)
Example starting from a topic: Shiga toxin
PMID:2677606
Make page on GONUTS for Q7BQ98. Has PMID:2677606 been annotated?
![Page 50: CACAO - Remote training Gene Function and Gene Ontology Fall 2011 CACAO.](https://reader031.fdocuments.net/reader031/viewer/2022032309/56649d2d5503460f94a03ffd/html5/thumbnails/50.jpg)
If it has, search PubMed for a different article.
http://www.ncbi.nlm.nih.gov/pubmed?term=2677606
GO:0009405 ?
What GO term? (hint: search GONUTS)
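
The two checks in this example (has the paper already been used on the gene page, and where to read it) can be sketched in a few lines. The row layout and both function names are assumptions made for illustration; the link format is the one shown on this slide.

```python
def reference_already_used(rows, reference):
    """True if any annotation row on the page cites the given reference."""
    return any(row.get("reference") == reference for row in rows)


def pubmed_url(reference):
    """Turn a 'PMID:<digits>' reference into the PubMed link format from the slide."""
    prefix = "PMID:"
    number = reference[len(prefix):]
    if not reference.startswith(prefix) or not number.isdigit():
        raise ValueError(f"not a PMID reference: {reference!r}")
    return "http://www.ncbi.nlm.nih.gov/pubmed?term=" + number
```

If `reference_already_used` comes back true for your paper, that is the cue to search PubMed for a different article.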