Puton bosc2010 bio_python-modules-rna

16
BioPython modules for handling RNA sequences containing modified nucleosides. Tomasz Puton, Kristian Rother, Magdalena Rother, Janusz M. Bujnicki Laboratory of Structural Bioinformatics Adam Mickiewicz University, Poznań, Poland http://bioinformatics.amu.edu.pl/Lab/ International Institute of Molecular and Cell Biology Warsaw, Poland http://iimcb.genesilico.pl/

Transcript of Puton bosc2010 bio_python-modules-rna

Page 1: Puton bosc2010 bio_python-modules-rna

BioPython modules for handling RNA sequences containing modified

nucleosides. Tomasz Puton, Kristian Rother,

Magdalena Rother, Janusz M. Bujnicki

Laboratory of Structural Bioinformatics Adam Mickiewicz University, Poznań, Poland

http://bioinformatics.amu.edu.pl/Lab/

International Institute of Molecular and Cell Biology Warsaw, Poland http://iimcb.genesilico.pl/

Page 2: Puton bosc2010 bio_python-modules-rna

BioPython – a set of freely available tools for biological computation written in Python.

http://biopython.org/

Page 3: Puton bosc2010 bio_python-modules-rna

ACUGAUC

Page 4: Puton bosc2010 bio_python-modules-rna

ACUGAUC

Adenosine (A) Uridine (U)

Cytidine (C)

Guanosine (G)

Page 5: Puton bosc2010 bio_python-modules-rna

ACUGAUC

>>> from Bio.Alphabet.IUPAC import unambiguous_rna >>> from Bio.Seq import Seq

>>> seq = Seq(‘ACUGAUC’, unambiguous_rna)

Page 6: Puton bosc2010 bio_python-modules-rna

ACUGAUC

>>> from Bio.Alphabet.IUPAC import unambiguous_rna >>> from Bio.Seq import Seq

>>> seq = Seq(‘ACUGAUC’, unambiguous_rna) >>> print seq.reverse_complement() GAUCAGU

Page 7: Puton bosc2010 bio_python-modules-rna

Problem:

The linked image cannot be displayed. The file may have been moved, renamed, or deleted. Verify that the link points to the correct file and location.

There are 115 known post-transcriptionally modified nucleosides in RNA.

Moreover, several nomenclature schemes exist in parallel.

Known as:  wybutosine  yW  Y  16G

Page 8: Puton bosc2010 bio_python-modules-rna

http://github.com/krother/biopython branch rna_alphabet

$ git clone git://github.com/krother/biopython.git $ cd biopython $ git checkout rna_alphabet

Solution:

Page 9: Puton bosc2010 bio_python-modules-rna

>>> from Bio.Alphabet.ModifiedRNAAlphabet import modified_rna >>> from Bio.RNA.RNASeq import RNASeq

$ python

Solution:

Page 10: Puton bosc2010 bio_python-modules-rna

>>> from Bio.Alphabet.ModifiedRNAAlphabet import modified_rna >>> from Bio.RNA.RNASeq import RNASeq

>>> seq = RNASeq('AG:7CU', modified_rna)

$ python

Solution:

Page 11: Puton bosc2010 bio_python-modules-rna

>>> from Bio.Alphabet.ModifiedRNAAlphabet import modified_rna >>> from Bio.RNA.RNASeq import RNASeq

>>> seq = RNASeq('AG:7CU', modified_rna)

>>> print seq[2].full_name 2-O-methyloadenosine

$ python

Solution:

Page 12: Puton bosc2010 bio_python-modules-rna

>>> from Bio.Alphabet.ModifiedRNAAlphabet import modified_rna >>> from Bio.RNA.RNASeq import RNASeq

>>> seq = RNASeq('AG:7CU', modified_rna)

>>> print seq[2].full_name 2-O-methyloadenosine

>>> print seq[3].long_abbrev m7G

$ python

Solution:

Page 13: Puton bosc2010 bio_python-modules-rna

http://modomics.genesilico.pl/

Page 14: Puton bosc2010 bio_python-modules-rna

Example applications (part 1):

ModeRNA: A tool for comparative modeling of RNA 3D structure.

http://iimcb.genesilico.pl/moderna/

Our software models modified RNA 3D structures!

Open source project!

Page 15: Puton bosc2010 bio_python-modules-rna

Example applications (part 2):

CompaRNA: A server for continuous benchmarking of automated methods for RNA structure prediction

POSTER NUMBER: W17 SUNDAY, JULY 11: 12:40 p.m. – 2.30 p.m.

http://comparna.amu.edu.pl/

Uses open source software e.g. BioPython, PyCogent & ModeRNA

Page 16: Puton bosc2010 bio_python-modules-rna

Thank you for attention!

Tomasz Puton [email protected]

Magdalena Rother [email protected]

Kristian Rother [email protected]

Janusz M. Bujnicki [email protected]