Getting GO annotation for your dataset

Post on 06-Jan-2016

27 views 0 download

Tags:

description

Getting GO annotation for your dataset. Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of Digital Biology, Mississippi State University. Getting GO:. Primary sources of GO: from the GO Consortium (GOC) & GOC members most up to date most comprehensive - PowerPoint PPT Presentation

Transcript of Getting GO annotation for your dataset

Getting GO annotation for your dataset

Fiona McCarthy, Shane Burgess, Susan BridgesThe AgBase Databases, Institute of Digital Biology, Mississippi State University

Getting GO:1. Primary sources of GO: from the GO Consortium

(GOC) & GOC members most up to date most comprehensive

2. Secondary sources: other resources that use GO provided by GOC members

public databases (eg. NCBI, UniProtKB) genome browsers (eg. Ensembl) array vendors (eg. Affymetrix) GO expression analysis tools

GO Browsers

1.1 GO Browsers QuickGO Browser (EBI GOA Project)

http://www.ebi.ac.uk/ego/can search by GO Term or by UniProt ID includes IEA annotationsbeta version in testing: will expand functionality

AmiGO Browser (GO Consortium Project)http://amigo.geneontology.org/cgi-bin/amigo/go.c

gican search by GO Term or by UniProt IDdoes not include IEA annotations

A. GO Browser demonstration

AIM: To demonstrate the QuickGO & AmiGO browsers and how they can be used to find GO, find GO terms

The QuickGO Browser (EBI-GOA)

This site is under development - feedback is being requested to make this resource useful to you.Coming soon: download GO annotations for your dataset.

http://www.ebi.ac.uk/ego/

1. Searching by Protein

Multiple protein IDs can be separated by a space or a comma.

Click on “Home” to start a new search.

Search for a GO Term

Filtering Results

Displays all annotations to the search term and its child terms

Filtering Results

use to filter results use to filter results

Filter by database.

Filter by GO Terms

Filter by GO evidence code.

Filter by taxon.

Click on “Home” to start a new search.

The AmiGO Browserhttp://amigo.geneontology.org/cgi-bin/amigo/go.cgi

1. Searching by Gene Product

Note: does not search for multiple IDs

Information about the gene product.

Links to GO annotations.

Download gene GO annotations.

2. Searching by GO Term

Results can be filtered by ontology.

Find term details by clicking on the term name.

See how the term fits in the ontology by selecting Term Lineage.

Select ‘gene product associations’ to display the GO annotations for your search results.

3. Filtering Results

Annotations can be filtered using drop down windows & selecting “set filters”.

Filtered results can be selected & downloaded as a gene association file.

Downloading ga files

1.2 Gene Association files The gene association (ga) file is used by GO

groups to link GO function to gene products tab-delimited file (smaller files can be viewed by opening in Excel) 15 columns (‘fields’) GO Annotation File Format Guide explains data in each column:

http://www.geneontology.org/GO.format.annotation.shtml

Gene Association files The gene association (ga) file is used by GO

groups to link function (GO) to gene products tab-delimited file (smaller files can be viewed by opening in Excel) 15 columns (‘fields’) GO Annotation File Format Guide explains data in each column:http://www.geneontology.org/GO.format.annotation.shtml

http://www.geneontology.org/GO.current.annotations.shtml

http://www.ebi.ac.uk/GOA/downloads.html

GOC: submitted to GO Consortium via EBI-GOACommunity: includes annotations not yet submitted, not supported (have QC)

B. Gene Association file Demonstration

AIM: To show how to download gene association files.

http://www.agbase.msstate.edu/

Select “Downloads & Statistics link.

Click on “Download” link and choose “Save to disk” option.

Find file on desktop.

Unzip using Winzip.

Open extracted file in Excel.

WARNING: some files are too large to open in Excel.

2. Secondary sources of GO annotation

EXAMPLES: public databases (eg. NCBI, UniProtKB) genome browsers (eg. Ensembl) array vendors (eg. Affymetrix)

CONSIDERATIONS: What is the original source? When was it last updated? Are evidence codes displayed?

Secondary Sources of GO annotation

C. Obtaining GO annotations from Secondary Sources

AIM: Look at how different public resources and tools display GO.

GO annotation obtained for chick GAPDH (P00356) from different sources:

QuickGO (primary source) UniProtKB database NCBI Entrez Gene page Ensembl browser Affymetrix chick array g:profiler gene expression analysis tool

QuickGO

1.

2.

Comparison of Chicken GAPDH GO annotation obtained from different sources.

GO evidence codes not reported.