Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure...

10
NSilico Life Science Introductory Bioinformatics Course

Transcript of Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure...

Page 1: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life ScienceIntroductoryBioinformaticsCourse

Page 2: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

INTRODUCTORY BIOINFORMATICS COURSE

A public course delivered over three days on the fundamentals of bioinformatics and illustrated with lectures, hands-on software workshops and case studies on prokaryotes, microbial pathogens, eukaryotes, and infection and cancer studies. This NSilico Life Science certified bioinformatics professional course will allow delegates to rapidly enter this burgeoning field.

Audience: Biologists and computer scientists wishing to extend their expertise to the domain of bioinformatics.

COURSE OVERVIEW

An introduction to bioinformatics and the use of online resources in modern molecular biology and also working with workflows in NSilico’s bioinformatics pipeline software called Simplicity™ and with the statistical programming package R. The student will cover methods for gene sequencing (Illumina next-generation and others), reads quality control, adapter trimming and cleaning, assembly, assembly quality assessment, gene prediction, codon and amino acid usage tables, similarity searching with BLAST, Gene Ontology database and GO terms, protein structure and family classification, multiple sequence alignment, phylogenetic analysis for evolutionary relatedness.

Simplicity™ is an easy-to-use, cloud-based high performance system for the automatic annotation, analysis and visualisation of prokaryote genetic data. It is aimed at researchers who lack an in-depth knowledge of bioinformatics and enables the generation of comprehensive reports from raw data with just a few mouse clicks.

NSilico Life Science Introductory Bioinformatics Course

www.nsilico.com 2

Page 3: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

LEARNING OUTCOMES

1. Interpret information from a range of sources and apply strategies for the analysis and management of next generation sequence data.

2. Critically evaluate the strengths and weaknesses of the bioinformatics tools available for genome analysis.

3. Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance of the results.

4. Synthesise multiple sequence alignments and highlight the significant features of the alignment using an alignment editor.

5. Design appropriate work flows for annotation of large amounts of sequence data.

6. Design and implement pipelines for gene expression analysis.7. Demonstrate an ability to interpret complex sequence analysis data and

report the findings in publishable format.

www.nsilico.com

NSilico Life Science Introductory Bioinformatics Course

3

Page 4: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

WHY ATTEND THIS COURSE?

Bioinformatics is a rapidly moving field with numerous practical applications in different areas of biology, drug discovery, medicine and more. This course seeks to equip the participants with a clear methodology and practical tools and techniques designed to make bioinformatics easy to handle. During the seminar, participants will undertake numerous exercises in small groups giving them the opportunities to apply the theory blocks and reinforce the learning.

THIS HIGHLY INTERACTIVE HANDS-ON COURSE REQUIRES USE OF A PERSONAL LAPTOP.

COURSE IS HELD IN HOUSE OR IN OUR TRAINING CENTRE WITH SOCIAL EVENTS IN HISTORIC DUBLIN.

NSilico Life Science Introductory Bioinformatics Course

4www.nsilico.com

Page 5: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life Science Introductory Bioinformatics Course

5

COURSE CONTENTDAY 1

INTRODUCTION TO COMPUTATIONAL BIOLOGY

READS QUALITY CONTROL

Overview, cell biology, genes, genomes, environment and epigenetics, history of genomics, biology for computer scientists, computer science for biologists, research strategies, algorithms, sequencing technology, file formats, FastQ files, file orientation, encoding type, depth of coverage, paired end, mate pair, unpaired, bioinformatics databases, software tools, online services.

Hands on Workshop: Online bioinformatics tools.

Sequence data and quality issues, finding how many reads are in the file, percentage GC of the entire dataset, sequence length of the reads, finding the top over-represented sequence, per-base sequence quality, per sequence quality, per base sequence content, per-base GC content, per-sequence GC content, per-base N content, sequence length distribution, duplicate sequences, over-represented sequences, over-represented k-mers.

Hands on Workshop: Reads quality.

www.nsilico.com

Page 6: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life Science Introductory Bioinformatics Course

6

ADAPTER TRIMMING

ASSEMBLY ASSESSMENT

ASSEMBLING NEXT GENERATION SEQUENCES

GENE PREDICTION, CODON AND AMINO ACID USAGE

Dealing with contamination, phred quality scores, forward and reverse adapters, handling mismatch error rate (%), quality cut off (%), synchronised paired files. Choosing trimming tools and configuring parameters.

Hands on Workshop: Adapter trimming.

GC% content, no. of contigs, largest contig length, total length of all contigs, total length of contigs ≥ 1000 bp, N50 score, N75 score, L50 score, L75 score, SNPs, INDELs and variant calling.

Hands on Workshop: Assembly assessment with cases studies on SNPs and variants.

De novo, mapping to reference genome, De Bruijn graphs, k-mers, contigs, scaffolds, mismatch error correction, coverage, file orientation, unsorted contigs and gaps.

Hands on Workshop: De novo assembly.

Gene finding, open reading frames, exon, codon tables, hidden markov models, interpolated markov model, empirical methods, ab initio methods, codon and amino usage tables, codon frequency, amino acid frequency, organism codon preferences, GC content.

Hands on Workshop: Gene predication assessment.

DAY 2

www.nsilico.com

Page 7: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life Science Introductory Bioinformatics Course

7

GENE EXPRESSION AND VISUALISATION

GENE ONTOLOGIES

Basic LOCAL ALIGNMENT SEARCH TOOL (BLAST)

PROTEIN CLASSIFICATION

Gene expression analysis, representing output, descriptive statistics, charting, gene map, circular map, linear map, heat maps.

Hands on Workshop: Visualising genomic data.

GC% content, number of contigs, largest contig length, total length of all contigsGo terms, MySQL database, integrated into EBI QuickGo, universal standard terminology for biology, gene product properties, protein domains or structural features, protein-protein interactions, cellular components, molecular functions, biological processes.

Hands on Workshop: Gene Ontology.

BLASTn, BLASTp, BLASTx, tBLASTn, tBLASTx, hit, heuristic algorithms, % identity, homologues, infer functional and evolutionary relationships between sequences, BLOSUM/PAM matrixes (proteins), match/mismatch(dna), gap open, gap extend, e-value, scores, evidence/confidence.

Hands on Workshop: Basic local alignment.

Class, Architecture, Topology, Homology, Protein Data Bank, domains, protein structures are classified using a combination of automated and manual procedures, homologous super families, fold groups, CATH code, similarity search with BLASTp, pattern databases, profile databases, motifs, protein domains and families, hidden Markov model, similarity search with BLASTp.

Hands on Workshop: Protein classification with case study in antibiotics resistance and virulence.

DAY 3

www.nsilico.com

Page 8: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life Science Introductory Bioinformatics Course

8

MULTIPLE SEQUENCE ALIGNMENT

WRAP UP

PHYLOGENETIC ANALYSIS

Pairwise alignment, dot plot, local alignment, global alignment, dynamic programming, progressive alignment, dealign input sequences, Clustal format, PHYLIP format, MSF format, max guide tree iterations, max HMM iterations. Hands on Workshop: Multiple alignment.

Final assessment and research project mentoring.

Tree construction, phylogenies, phylogenetic inference, root, node, outgroup, branch length, leaf, progressive alignment, Newick format, orthologous genes, paralogous genes, distance and character based methods, NJ, UPGMA, maximum parsimony, maximum likelihood, bootstrap analysis.

Hands on Workshop: Phylogenetic analysis.

Dr Paul Walsh is chief technology officer at NSilico with over 20 years’ experience in high performance computing, medical applications and bioinformatics. He has led National and EU Framework projects for major projects in bioinformatics, microbial biomarker detection and cancer genomics. He has an extensive list of publications in both computer science, biology and management. He is a certified project manager and has over 20 years’ experience in the training sector.

Course Director

www.nsilico.com

Page 9: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Life Science Introductory Bioinformatics Course

9

BOOKING FORM

DELEGATE DETAILS

COMPANY DETAILS

I have read and agreed to the following terms and conditions Signature: __________________

1. Please Invoice my CompanyVisa Master Card

Card Number : ________________________________CVS/CCV Number: _____________Exp Date : ____/____/____Name on card : ________________________________Signature : ________________________________

2. Please change my Credit Card

• € 1,650 per delegate • 6 or more delegates 20% discount• 12 or more delegates 25% discount

Phone: +353 21 234 0232 Email: [email protected] Web: http://www.nsilico.com/Contact

PRICING GROUP DISCOUNTS

Name:

Tel:

Name:

Tel:

Job title:

Fax:

Job title:

Fax:

E-mail:

Mob:

E-mail:

Mob:

Company:

Post code:

Tel:

Address:

Country:

Fax:

__________________ ________________ _________________

__________________ ________________ _________________

__________________ ________________ _________________

__________________

____________________________ _____________________________

____________________________ _____________________________

____________________________ _____________________________

________________ _________________

www.nsilico.com

Page 10: Introductory Bioinformatics Course - NSilico · INTRODUCTORY BIOINFORMATICS COURSE ... Configure bioinformatics tools to aid in the evaluation of hypotheses and explain the significance

NSilico Lifescience Ltd.Grange Erin Lodge, Grange Road, Douglas, Cork Ireland

+353 (021) 234 0232

[email protected]

www.nsilico.com