Appendix C - ars.els-cdn.com8: ssPSMA4.1b 51 50 50 50 50 49 100 100 85 46 46 46 46 38 37 9:...
Transcript of Appendix C - ars.els-cdn.com8: ssPSMA4.1b 51 50 50 50 50 49 100 100 85 46 46 46 46 38 37 9:...
Appendix C
1
Appendix C
Content Page
C.1. Phylogeny of duplicate Atlantic salmon proteasome amino acid sequences 2 C.2. Nucleotide sequence identity between DE Atlantic salmon proteasome subunit paralog sequences 3
C.3. Expression patterns of Atlantic salmon proteasome subunit duplicates in various normal tissues 4 C.4. Alignment of selected deduced PSMA7 and PSMA8 amino acid sequences 5
C.5. Deduced proteasome amino acid sequences used in phylogenies 8
Appendix C
2
Appendix C.1. Phylogeny of DE Atlantic salmon proteasome paralog amino
acid sequences
The evolutionary history was inferred by using the Maximum Likelihood method based on the Le_Gascuel_2008 model [1]. The tree with the highest log likelihood (-3481.0643) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 8.6048)). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 20 amino acid sequences. All positions with less than 95% site coverage were eliminated. That is, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 219 positions in the final dataset. Evolutionary analyses were conducted in MEGA7 [2]. Sequence references can be found in main text Table 1 and Appendix C.5 and hs is Homo sapiens and ss is Salmo salar. Blue sharing defines differentially expressed genes shown as downregulated with arrows.
1. Le S.Q. and Gascuel O. (2008). An Improved General Amino Acid Replacement Matrix. Mol Biol Evol 25(7):1307-1320. 2. Kumar S., Stecher G., and Tamura K. (2016). MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets.Molecular Biology and Evolution 33:1870-1874.
Appendix C
3
Appendix C.2. Nucleotide sequence identity between DE Atlantic salmon proteasome subunit sequences.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
1: ssPSMA8.1a 100 93 82 83 74 74 51 51 51 45 45 44 45 41 39
2: ssPSMA7.1b 93 100 81 83 73 72 50 50 51 44 44 44 45 41 40
3: ssPSMA8.2a 82 81 100 93 73 72 50 50 50 45 45 46 46 39 38
4: ssPSMA8.2b 83 83 93 100 73 72 50 50 51 47 46 46 46 39 38
5: ssPSMA7a 74 73 73 73 100 94 50 50 50 47 47 46 45 41 40
6: ssPSMA7b 74 72 72 72 94 100 49 49 49 46 46 45 45 39 39
7: ssPSMA4.1a 51 50 50 50 50 49 100 100 85 45 45 46 46 38 37
8: ssPSMA4.1b 51 50 50 50 50 49 100 100 85 46 46 46 46 38 37
9: ssPSMA4.2 51 51 50 51 50 49 85 85 100 46 46 47 46 37 37
10: ssPSMA6.1a 45 44 45 47 47 46 45 46 46 100 94 86 71 42 40
11: ssPSMA6.1b 45 44 45 46 47 46 45 46 46 94 100 85 71 41 40
12: ssPSMA6.2 44 44 46 46 46 45 46 46 47 86 85 100 72 40 40
13: ssPSMA6.3 45 45 46 46 45 45 46 46 46 71 71 72 100 39 39
14: ssPSMB6a 41 41 39 39 41 39 38 38 37 42 41 40 39 100 94
15: ssPSMB6b 39 40 38 38 40 39 37 37 37 40 40 40 39 94 100
Only open reading frame nucleotide sequences were included. Percent Identity Matrix was created by Clustal2.1.
Appendix C
4
Appendix C.3. Expression patterns of Atlantic salmon proteasome subunit duplicates in various normal tissues New name Genome name HK Spleen Gills Heart Liver Nose Gut Brain Ovary Testis Query length (bp)
PSMB6a PSMB6_Like 15,81 7,06 18,26 7,34 2,40 15,64 9,11 21,87 110,72 38,52 696
PSMB6b PSMB6 38,40 25,64 37,54 22,28 9,88 32,87 26,43 26,72 22,46 89,00 699
PSMA4.1a/-b PSMA4/-4_Like 36,11 23,65 19,94 13,30 8,79 19,70 32,25 18,52 29,96 37,74 786
PSMA4.2 PSMA4 23,65 21,15 15,78 4,01 3,34 14,03 14,87 7,41 29,41 46,07 831
PSMA6.1a PSMA6 11,44 6,07 6,25 3,64 2,23 7,84 5,71 7,24 27,78 49,66 741
PSMA6.1b PSMA6_Like 17,40 11,54 9,82 3,76 3,26 11,15 8,62 9,57 61,33 40,60 741
PSMA6.2 PSMA6_Like 20,35 14,08 18,94 9,26 6,82 20,65 20,56 11,95 18,09 42,21 741
PSMA6.3 PSMA6_Like 20,83 16,50 17,45 1,42 1,56 8,50 16,95 0,69 0,23 4,87 741
PSMA7a PSMA7_Like 9,39 6,05 8,60 0,79 0,96 6,14 14,42 1,53 0 2,36 741
PSMA7b PSMA7 38,42 38,48 14,98 2,00 1,73 3,80 9,56 0,79 0,15 4,20 729
PSMA8.1a PSMA7_Like 31,92 25,86 16,26 8,01 2,75 18,99 12,78 10,23 40,23 59,65 756
PSMA8.1b PSMA7_Like 34,54 21,36 18,56 9,26 5,74 21,97 23,91 19,05 48,69 54,49 756
PSMA8.2a PSMA7_Like 2,04 1,23 1,68 2,52 0,65 3,00 0,80 2,60 50,95 22,15 756
PSMA8.2b PSMA7 8,31 5,30 6,50 4,66 1,73 7,64 8,60 9,31 29,41 35,88 756
Total # reads 59084708 60203316 59793962 58163180 58784272 59545012 59806348 58939250 35803964 185311952
Transcriptional values are given in RPKM (Reads Per Kilo base per Million mapped reads). Mapping reads back to Atlantic salmon tissue reference transcriptomes was done with the CLC v 6.0.5 software. Reads
were mapped with high stringency i.e. greater than 98% identity over more than 95% of the total length of the query read and only uniquely matched reads were counted. The reference transcriptomes are described
in main text reference Lien et al. (2016) and have the SRA accession numbers as follows: Head kidney SRS639992, spleen SRS640021, gill SRS639861, heart SRS639994, liver SRS640002, nose SRS640009, gut
SRS639990, brain SRS639037, ovary SRS640030 and testis SRS640031. As nucleotide sequence identity between PSMA4.1a and PSMA4.1b was 100%, these RPKM values are merged. Cells are color coded
according to RPKM values where grey is negligible expression with RPKM values below 1, white is low expression levels with RPKM values 1-10, blue is medium levels 10-50, green is high expression levels
50-100 and orange is very high levels with RPKM values >100. Yellow colored cells show the down-regulated genes identified in resistant animals. Gene annotations in the Sally genome alongside our unique gene
identifiers are shown
Appendix C
5
Appendix C.4. Alignment of selected deduced PSMA7 and PSMA8 amino acid sequences
* 20 * 40 * 60 * 80
hsPSMA8 : MASRYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGTNIVVLGVEKKSVAKLQDERTVRKICALDDHVCMAFAGLTADARVV : 85
SasaPSMA8.1b : ..A....................................KD...............E..........................I. : 85
SasaPSMA8.1a : ..A....................................KD...............E..........................I. : 85
SasaPSMA8.2a : ..A....................................KD...............E....C...T.................I. : 85
SasaPSMA8.2b : ..A....................................KD...............E..........................I. : 85
EsluPSMA7.Like: ..A....................................KD...............E..........................I. : 85
EsluPSMA7(2) : ..A....................................KD...............E..........................I. : 85
DarePSMA8 : ..A....................................KD...............E...........E..............I. : 85
AsmePSMA8 : ..A....................................KD...............E...........E..............I. : 85
LeocPSMA8 : ..A....................................RD...............E...........E..............I. : 85
OrmePSMA8 : ..A....................................KD...............E...........E..............I. : 85
OrniPSMA8 : ..A....................................KD...............E...........E..............I. : 85
GaacPSMA8 : ..A....................................KD...............E...........E..............I. : 85
hsPSMA7 : --MS................................V..RD............................N.............I. : 83
SasaPSMA7a : --MS................................V..K.V........T......D...........N.F...........I. : 83
SasaPSMA7b : --MS.................D..............V..K..........T.....ED.......S...N.F...........II : 83
EsluPSMA7(1) : --MS................................V..KK..........I.....D...........N.FL............ : 83
LeocPSMA7 : --MS................................V..K...........M.....D...........N.F...........I. : 83
IcpuPSMA7 : --MS................................V..K...........A.M...............N.F...........I. : 83
AsmePSM7 : --MS................................V..K...I.......A.M...D...........N.F...........II : 83
ClhaPSMA7 : --MS.............................V..V..KT...V......L.....D.......P...N.F...........I. : 83
Appendix C
6
* 100 * 120 * 140 * 160 *
hsPSMA7.Like : INRARVECQSHKLTVEDPVTVEYITRFIATLKQKYTQSNGRRPFGISALIVGFDDDGISRLYQTDPSGTYHAWKANAIGRSAKTV : 170
SasaPSMA8.1b : ...........R..............H......R....................N..TP.................S........ : 170
SasaPSMA8.1a : ...........R..............H......R....................C..TP.......................... : 170
SasaPSMA8.2a : ...........R..............H......R....................C..TP.......................... : 170
SasaPSMA8.2b : ...........R..............H......R....................Y..TP.......................... : 170
EsluPSMA7.Like: ...........R..............H......R....................Y..TP.......................... : 170
EsluPSMA7(2) : ...........R..............H......R....................Y..TP.......................... : 170
DarePSMA8 : ...........R..............Y......R....................Y..TP.......................... : 170
AsmePSMA8 : ...........R..............Y......R....................Y..TP.......................... : 170
LeocPSMA8 : ...........R..............Y......R....................Y..TP.......................... : 170
OrmePSMA8 : ...........R..............Y......R....................Y..TP.......................... : 170
OrniPSMA8 : ...........R..............Y......R....................Y..TP.......................... : 170
GaacPSMA8 : ...........R..............Y......R.......................TP.......................... : 170
hsPSMA7 : ...........R..............Y..S...R....................F..TP.....................G..S. : 168
SasaPSMA7a : V..........R................SSI..R.............S......F..TPH......................... : 168
SasaPSMA7b : V..........R......A.......Y.SSI..R.............S......F..TPH......................... : 168
EsluPSMA7(1) : V..........R................SSI..R.............S......F..TPH....E.................... : 168
LeocPSMA7 : ...........R..............Y.SS...R.............S......F..TP.....................A.... : 168
IcpuPSMA7 : ...........R......A.......Y.SNM..R.............S..M...F..TPN.......................S. : 168
AsmePSM7 : ...........R........I.....Y.SSM..R....................F..TPN..............VGD.------- : 161
ClhaPSMA7 : ...........R................CN...R.............S..A...F..TPH.....................G... : 168
Appendix C
7
180 * 200 * 220 * 240 *
hsPSMA7.Like : REFLEKNYTEDAIASDSEAIKLAIKALLEVVQSGGKNIELAIIRRNQPLKMFSAKEVELYVTEIEKEK-EEAEKKKSKKSV-- : 250
SasaPSMA8.1b : .........DE...T.N........................V........LLES..I.TL.A......E.......P...T-- : 251
SasaPSMA8.1a : ..........ES..T.N........................V.....S..LLES..I.TL........E.......Q...T-- : 251
SasaPSMA8.2a : .........DE...T.N.S......................V........LLES..I.TL...M....ED......Q...T-- : 251
SasaPSMA8.2b : ..........E...T.N.....................D..V........LLES..I.TL.A......ED......Q...T-- : 251
EsluPSMA7.Like: .........DE...T.ND....S..................V........LLES..I.TL.A......E.......Q...T-- : 251
EsluPSMA7(2) : .........DE...T.ND.......................V........LLES..I.TL.A......E.......Q...T-- : 251
DarePSMA8 : .........DE.....ND.......................V........ILES..I.TL.A......E.......Q...S-- : 251
AsmePSMA8 : .........DE.L...ND.......................V........ILES..I.TL.A......E.......Q..AS-- : 251
LeocPSMA8 : ..........E...T.ND.......................V........ILES..I.TL.A......E.......Q..AT-- : 251
OrmePSMA8 : .........DE...G.N........................V........ILES..I.TL.S......E.......Q...T-- : 251
OrniPSMA8 : ..........E...G.N........................V........ILES..I.TL........E.......Q...T-- : 251
GaacPSMA8 : .........DE...G.N........................V........ILES..I.TL.A......E.......Q...T-- : 251
hsPSMA7 : .........DE..ET.DLT...V..................VM..D.S..ILNPE.I.K..A......-..N....Q..AS-- : 248
SasaPSMA7a : ........K.EHME..TDT.....R.................MK.DESM.ILVQE.I.E...A.....E.AEKQ..KT----- : 246
SasaPSMA7b : ........K..-ME..TDT.....R.................MK.DESM.ILVQE.I.E...A..T..--AEKQ..P------ : 242
EsluPSMA7(1) : ......H.K.EDME..T.V.....R.................MKQDESM.ILDQE.IKK...A..N..E.AEK...T------ : 245
LeocPSMA7 : ........KDEDME..K.TV..V.R................VM..DE...ILDQE.I.K..S......E.TEKQ..KT----- : 246
IcpuPSMA7 : ........K.EDMDT.ELVL...VR................TMK.GEAM.ILDPE...KI.SAL....E.AEK...AAT.TTS : 251
AsmePSM7 : -----------------------------............TMKNGETM.ILEPA...NI.AA.....E.ADK...ATA.AST : 215
ClhaPSMA7 : ......S.K.EDVE..EG.......................VMKQ.EA..ILDPE..DKH.LAL....E.ADK...T------ : 245
Amino acid alignment of selected proteasome PSMA7 and PSMA8 (or 7-like) sequences used to produce phylogenetic tree shown in main text Figure 4. Numbering above the alignment refers
to mature human PSMA7.Like (alias hsPSMA8). Amino acids are colored according to physiochemical properties. Gaps shown with a dash are introduced to optimize the alignment using the
human sequence PSMA.Like sequence as a reference. Species are reflected in sequence names as follows: Leoc is Lepisosteus oculatus (Spotted gar), Sasa is Salmo salar (Atlantic salmon),
Eslu is Esox lucius (Northern pike), Asme is Astyanax mexicanus (Mexican tetra), Clha is Clupea harengus (Atlantic herring), Dare is Danio rerio (zebrafish), Icpu is Ictalurus punctatus
(catfish), Orme is Oryzias melastigma (marine medaka), Gaac is Gasterosteus aculeatus (stickleback) and Orni is Oreochromis niloticus (tilapia). Sequence references are shown in Appendix
C.5.
Appendix C
8
Appendix C.5. Deduced proteasome amino acid sequences used in phylogenies
>SasaPSMB6a ssa04 NC_027303.1:35.985.620-35.988.142 XP_014051784.1
MAAYMSASHSENAFSTHDFVPEWAQEEVSTGTTIMAVEFDGGVVIAADSRTTTGAYIANR
VTDKLTPIHDHIFCCRSGSAADTQAVADIVTYQLGFHSIELDEPPLVQTAANLFKQTCYR
YREELMAGIIVAGWDKRRGGQVYTVPMGGMIVRQPVSVGGSGSSYIYGFMDSNYKPGMTK
EECLHFCTQALALAMERDGSSGGVARLAAITEEGLERRVVLGNQLPRFSNA
>SasaPSMB6b ssa11 NC_027310.1: 77.336.949-77.339.681 XP_013984457.1
MATAYMSANRSENAFSTADLVPEWAQEEVSTGTTIMAVEFDGGVVIGADSRTTTGAYIAN
RVTDKLTPIHDRIFCCRSGSAADTQAVADIVTYQLGFHSIELDEPPLVQTAANLFKQTCY
RYREELMAGIIVAGWDKRRGGQVYTVPMGGMIVRQPVSVGGSGSSYIYGFMDSNYKPGMT
KEECLHFCTQALALAMERDGSSGGVARLAAITEEGLERRVVLGNQLPKFSNA
>SasaPSMA4.1a ssa26 NC_027325.1:32939806-32943155 NP_001134515.1
MSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGSCLGILANDGVLLAAERRNIHKLLDEVF
FSEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQEPIPCEQLVTALCDIKQA
YTQFGGKRPFGVSLLYMGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSAAAVSMLKQDFK
EGEMSLSSALALAVKVLNKTMDVSKLSAEKVEIATLTREDGKTKIKVLKQKDVEELIKRH
EAEEAKAEKDKKDKEQKEKDK
>SasaPSMA4.1b ssa11 NC_027310.1:38380642-38383977 XP_013983107.1
MSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGSCLGILANDGVLLAAERRNIHKLLDEVF
FSEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQEPIPCEQLVTALCDIKQA
YTQFGGKRPFGVSLLYMGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSAAAVSMLKQDFK
EGEMSLSSALALAVKVLNKTMDVSKLSAEKVEIATLTREDGKTKIKVLKQKEVEELIKRH
EAEEAKAEKDKKDKEQKEKD
>SasaPSMA4.2 ssa10 NC_027309.1:62947128-62979393 XP_013979513.1
MLFPQMELILKETTKMSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVLL
AAERRNIHKLLDEVFFSEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQEPI
PCEQLVTALCDIKQAYTQFGGKRPFGVSLLYMGWDKHYGFQLYQSDPSGNYGGWKATCIG
NNSAAAVSMLKQDFKEGEMSLSSALALAVKVLNKTMDVSKLSAEKVEIATLTRENGKTCI
KVLKQKEVDELIKKHEAEEAKAEKEKKEKEQKEKDK
>SasaPSMA6.1a ssa06 NC_027305.1:48395986-48398391 XP_014060163.1
MSRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSVAVRGKDCAVVVTQKKVPDKL
LDASTVTHLFRITENIGCVMSGMTADSKSQVQRARYEAANWKYKYGYEIPVDMLCKRIAD
ISQVYTQNAEMRPLGCCMIVIGVDEECGPQVYKCDPAGYYCGFKATAAGVKQTEATSFLE
KKVKKKLDWTFKETVETAITCLSTVLSIDFKPSELEIGVITTEDPKFRILTESEVDIHLV
SLAERD
>SasaPSMA6.1b ssa15 NC_027314.1:24839610-24842186 XP_013999490.1
MSRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSVAVRGKDCAVVVTQKKVPDTL
LDSSTVTHLFRITENIGCVMSGMTADGKSQVQRARYEAANWNYKYGYEIPVDMLCKRIAD
ISQVYTQNAEMRPLGCCMIVIGVDDECGPQVYKCDPAGYYCGFKATAAGVKQTEATSFLE
Appendix C
9
KKVKKKLDLTFKETVETAITCLSTVLSIDFKPSELEIGVISTEDPKFRILTESEVDTHLV
SLAERD
>SasaPSMA6.2 ssa09 NC_027308.1:31342996-31358744 XP_014066593.1
MSRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSLAIRGQDCAVVVTQKKVPDKL
LDASTVTHLFKITENIGCVMSGMTADSKSQVQRARYEAANWKYKYGYEIPVDMLCKRMAD
ISQVYTQNAEMRPLGCCMIVIGVDEECGPQVYKCDPAGYYCGFKATAAGVKQTEATTFLE
KKVKKKLDWTFAQTIETAITCLSTVLSIDFKPSELEIGVITTEDPKFRILTETEIDVHLV
SLSERD
>SasaPSMA6.3 ssa09 NC_027308.1:105413536-105420909 XP_014069427.1
MSRGSNAGFDRHITIFSPEGRLYQVEYAFKAIAQGGLTSLGVRGTDCAVVVTQKKVPDKL
LDSATLTNMYPLTPRIGCVMTGHNADSRSQVHRARVEAAEWKYKFGYDITADMLCRRMAD
ISQVYTQNAEMRPLGCCMILVAMDPQLGPMLYKCDPAGYFCGFRATSVGAKQTEANSYLE
KKLKKRPELSYEMTVELAISCLSSILSMDFKPSEIEVGVVTKESPKFRTLSETEIDTHLV
AIAERE
>SasaPSMA7a ssa15 NC_027314.1:60419652-60421283 XP_014000965.1
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGKNVVVLGVEKKTVAKLQDDRTVR
KICALDDNVFMAFAGLTADARIVVNRARVECQSHRLTVEDPVTVEYITRFISSIKQRYTQ
SNGRRPFGISSLIVGFDFDGTPHLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYKEEH
MESDTDTIKLAIRALLEVVQSGGKNIELAIMKRDESMKILVQEEIEEYVTAIEKEKEEAE
KQKKKT
>SasaPSMA7b ssa13 NC_027312.1:19630566-19634058 XP_013990556.1
MSYDRAITVFSPDGHLFQVDYAQEAVKKGSTAVGVRGKNIVVLGVEKKTVAKLQEDRTVR
KICSLDDNVFMAFAGLTADARIIVNRARVECQSHRLTVEDPATVEYITRYISSIKQRYTQ
SNGRRPFGISSLIVGFDFDGTPHLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYKEDM
ESDTDTIKLAIRALLEVVQSGGKNIELAIMKRDESMKILVQEEIEEYVTAIETEKAEKQK
KP
>SasaPSMA8.1a ssa23 NC_027322.1:14663633-14665712 XP_014024650.1
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
TQSNGRRPFGISALIVGFDCDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTE
ESIATDNEAIKLAIKALLEVVQSGGKNIELAVIRRNQSLKLLESKEIETLVTEIEKEKEE
EAEKKKQKKST
>SasaPSMA8.1b ssa10 NC_027309.1:35937788-35939901 XP_013978661.1
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
TQSNGRRPFGISALIVGFDNDGTPRLYQTDPSGTYHAWKANSIGRSAKTVREFLEKNYTD
EAIATDNEAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKLLESKEIETLVAEIEKEKEE
EAEKKKPKKST
>SasaPSMA8.2a ssa14 NC_027313.1:27557984-27577773 XP_013995708.1
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VCKICTLDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
Appendix C
10
TQSNGRRPFGISALIVGFDCDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIATDNESIKLAIKALLEVVQSGGKNIELAVIRRNQPLKLLESKEIETLVTEMEKEKED
EAEKKKQKKST
>SasaPSMA8.2b ssa03 NC_027302.1:27740291-27753411 XP_014046464.1
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTE
EAIATDNEAIKLAIKALLEVVQSGGKNIDLAVIRRNQPLKLLESKEIETLVAEIEKEKED
EAEKKKQKKST
>OrmePSMA8 [Oryzias melastigma] marine medaka TSA:GDGP01030614 Om_CDS_27313-1 transcribed RNA sequence
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIAGDNEAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVSEIEKEKEE
EAEKKKQKKST*
>OrniPSMA8 [Oreochromis niloticus] proteasome subunit alpha 8 ENSONIG00000009734 Scaffold GL831134.1: 5,204,529-5,208,745
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTE
EAIAGDNEAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVTEIEKEKEE
EAEKKKQKKST
>GaacPSMA8 [Gasterosteus aculeatus] proteasome subunit alpha 8 ENSGACT00000017525.1 groupIII: 948,765-953,424 251 bp
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDDDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIAGDNEAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVAEIEKEKEE
EAEKKKQKKST
>IcpuPSMA7 [Ictalurus punctatus] PREDICTED: proteasome subunit alpha type-7 XP_017343612.1
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGKNIVVLGVEKKSAAMLQDERTVR
KICALDDNVFMAFAGLTADARIVINRARVECQSHRLTVEDPATVEYITRYISNMKQRYTQ
SNGRRPFGISSLIMGFDFDGTPNLYQTDPSGTYHAWKANAIGRSAKSVREFLEKNYKEED
MDTDELVLKLAVRALLEVVQSGGKNIELATMKRGEAMKILDPEEVEKIVSALEKEKEEAE
KKKKAATSTTS
>EsluPSMA7 (1) [Esox Lucius] PREDICTED: proteasome subunit alpha type-7 XP_010894973.1 LG12:27743363-27748418
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGKKIVVLGVEKKSIAKLQDDRTVR
KICALDDNVFLAFAGLTADARVVVNRARVECQSHRLTVEDPVTVEYITRFISSIKQRYTQ
SNGRRPFGISSLIVGFDFDGTPHLYQTEPSGTYHAWKANAIGRSAKTVREFLEKHYKEED
MESDTEVIKLAIRALLEVVQSGGKNIELAIMKQDESMKILDQEEIKKYVTAIENEKEEAE
KKKKT
>EsluPSMA7 (2) [Esox Lucius] proteasome subunit alpha type-7 NP_001290983.1 LG08:24735569-24737952
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
Appendix C
11
VRKICALDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIATDNDAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKLLESKEIETLVAEIEKEKEE
EAEKKKQKKST
>EsluPSMA7-Like [Esox Lucius] PREDICTED: proteasome subunit alpha type-7-like XP_010882855.1 LG03:9081749-9084178
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDDHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRHIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIATDNDAIKLSIKALLEVVQSGGKNIELAVIRRNQPLKLLESKEIETLVAEIEKEKEE
EAEKKKQKKST
>DarePSMA8 [Danio rerio] proteasome subunit alpha 8 ENSDARG00000010965 Chr.20: 18,406,539-18,414,281
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EAIASDNDAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVAEIEKEKEE
EAEKKKQKKSS
>AsmePSMA7 [Astyanax mexicanus] proteasome subunit alpha 7 XP_015457589.1 ENSAMXG00000002859 Scaffold KB871891.1: 375,519-
378,990
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGKNIVILGVEKKSAAMLQDDRTVR
KICALDDNVFMAFAGLTADARIIINRARVECQSHRLTVEDPVTIEYITRYISSMKQRYTQ
SNGRRPFGISALIVGFDFDGTPNLYQTDPSGTYHAWKVGDIVVQSGGKNIELATMKNGET
MKILEPAEVENIVAAIEKEKEEADKKKKATASAST
>AsmePSMA8 [Astyanax mexicanus] proteasome subunit alpha 8 ENSAMXG00000019432 Scaffold KB871746.1: 9,366-16,256
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGKDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTD
EALASDNDAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVAEIEKEKEE
EAEKKKQKKAS
>ClhaPSMA7 [Clupea harengus] PREDICTED: proteasome subunit alpha type-7 XP_012686185.1
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTVVGVRGKTIVVVGVEKKSLAKLQDDRTVR
KICPLDDNVFMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRFICNLKQRYTQ
SNGRRPFGISSLIAGFDFDGTPHLYQTDPSGTYHAWKANAIGRSGKTVREFLEKSYKEED
VESDEGAIKLAIKALLEVVQSGGKNIELAVMKQNEALKILDPEEVDKHVLALEKEKEEAD
KKKKT
>LeocPSMA7 [Lepisosteus oculates] proteasome subunit alpha 7 ENSLOCG00000003735 Chr.LG18: 7,557,365-7,561,553
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGKNIVVLGVEKKSMAKLQDDRTVR
KICALDDNVFMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYISSLKQRYTQ
SNGRRPFGISSLIVGFDFDGTPRLYQTDPSGTYHAWKANAIGRAAKTVREFLEKNYKDED
MESDKETVKLVIRALLEVVQSGGKNIELAVMRRDEPLKILDQEEIEKYVSEIEKEKEETE
KQKKKT
>LeocPSMA8 [Lepisosteus oculates] proteasome subunit alpha 8 ENSLOCG00000006356 Chr.LG9: 12,636,489-12,646,112
Appendix C
12
MAARYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGRDIVVLGVEKKSVAKLQEERT
VRKICALDEHVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIATLKQRY
TQSNGRRPFGISALIVGFDYDGTPRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTE
EAIATDNDAIKLAIKALLEVVQSGGKNIELAVIRRNQPLKILESKEIETLVAEIEKEKEE
EAEKKKQKKAT
>hsPSMA4 [Homo sapiens] NP_002780 proteasome subunit alpha type-4
MSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNIHKLLDEVF
FSEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQEPIPCEQLVTALCDIKQA
YTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSAAAVSMLKQDYK
EGEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLTRENGKTVIRVLKQKEVEQLIKKH
EEEEAKAEREKKEKEQKEKDK
>hsPSMA6 [Homo sapiens] NP_002782 proteasome subunit alpha type-6
MSRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSVAVRGKDCAVIVTQKKVPDKL
LDSSTVTHLFKITENIGCVMTGMTADSRSQVQRARYEAANWKYKYGYEIPVDMLCKRIAD
ISQVYTQNAEMRPLGCCMILIGIDEEQGPQVYKCDPAGYYCGFKATAAGVKQTESTSFLE
KKVKKKFDWTFEQTVETAITCLSTVLSIDFKPSEIEVGVVTVENPKFRILTEAEIDAHLV
ALAERD
>hsPSMA7 [Homo sapiens] proteasome subunit alpha type-7 NP_002783 Chr.20: 62,136,735-62,143,430
MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGRDIVVLGVEKKSVAKLQDERTVR
KICALDDNVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIASLKQRYTQ
SNGRRPFGISALIVGFDFDGTPRLYQTDPSGTYHAWKANAIGRGAKSVREFLEKNYTDEA
IETDDLTIKLVIKALLEVVQSGGKNIELAVMRRDQSLKILNPEEIEKYVAEIEKEKEENE
KKKQKKAS
>hsPSMA7_Like (alias PSMA8) [Homo sapiens] proteasome subunit alpha type-7-like NP_001020267.1 Chr.18: 26,133,877-26,193,355
MASRYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGTNIVVLGVEKKSVAKLQDERT
VRKICALDDHVCMAFAGLTADARVVINRARVECQSHKLTVEDPVTVEYITRFIATLKQKY
TQSNGRRPFGISALIVGFDDDGISRLYQTDPSGTYHAWKANAIGRSAKTVREFLEKNYTE
DAIASDSEAIKLAIKALLEVVQSGGKNIELAIIRRNQPLKMFSAKEVELYVTEIEKEKEE
AEKKKSKKSV
>hsPSMB6 [Homo sapiens] NP_002789 proteasome subunit beta type-6
MAATLLAARGAGPAPAWGPEAFTPDWESREVSTGTTIMAVQFDGGVVLGADSRTTTGSYI
ANRVTDKLTPIHDRIFCCRSGSAADTQAVADAVTYQLGFHSIELNEPPLVHTAASLFKEM
CYRYREDLMAGIIIAGWDPQEGGQVYSVPMGGMMVRQSFAIGGSGSSYIYGYVDATYREG
MTKEECLQFTANALALAMERDGSSGGVIRLAAIAESGVERQVLLGDQIPKFAVATLPPA