BLASTX nr result
ID: Papaver30_contig00005824
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver30_contig00005824
         (747 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342...  273   7e-71
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...  270   1e-69
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...  269   2e-69
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]  268   3e-69
emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]  267   5e-69
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                  267   6e-69
ref|XP_013695570.1| PREDICTED: uncharacterized protein LOC106399...  266   8e-69
ref|XP_013645649.1| PREDICTED: uncharacterized protein LOC106350...  266   1e-68
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...  265   3e-68
ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus ...  262   2e-67
ref|XP_012567311.1| PREDICTED: uncharacterized protein LOC105851...  261   3e-67
emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]  261   5e-67
ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417...  258   4e-66
ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [The...  255   2e-65
ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] g...  255   3e-65
ref|XP_010541787.1| PREDICTED: uncharacterized protein LOC104815...  254   4e-65
gb|AAD20658.1| putative retroelement pol polyprotein [Arabidopsi...  254   4e-65
emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]  254   6e-65
gb|AAD37020.1| putative retroelement pol polyprotein [Arabidopsi...
253 7e-65 emb|CAC44142.1| putative polyprotein [Cicer arietinum] 253 1e-64 >ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342989 [Prunus mume] Length = 1162 Score = 273 bits (699), Expect = 7e-71 Identities = 135/252 (53%), Positives = 170/252 (67%), Gaps = 7/252 (2%) Frame = -3 Query: 739 RLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTCQ 560 RL VP DL KE+L H S T+H G TKM DL+R FWW GM +DI FV+ CLTCQ Sbjct: 561 RLYVPEISDLRKEVLKEGHHSFYTIHPGGTKMYLDLKRNFWWNGMKRDIEKFVAKCLTCQ 620 Query: 559 KVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF--- 389 +VK EHQ+P+G LQ LP+AEWKWD I MDF GLP+ +G+D++WVI+DRLTKS HF Sbjct: 621 QVKAEHQKPSGSLQPLPVAEWKWDHITMDFVTGLPRSPKGRDAIWVIVDRLTKSAHFLPV 680 Query: 388 ----SSDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTLSM 221 S++ G+ + +H P + S FTSKFW + ++GT L+ Sbjct: 681 KTTESTENLGKLYVREIVRLH-------GIPVSIVSDRDSKFTSKFWGSLQKALGTQLNF 733 Query: 220 SSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEA 41 S+AFH Q DGQSERTIQ+LEDML+AC L+F G+W + L L EFAYN+SY SSI M +EA Sbjct: 734 STAFHPQTDGQSERTIQILEDMLRACILDFGGSWEDHLILAEFAYNNSYQSSIQMAPYEA 793 Query: 40 LYGRPCRTPLCW 5 LYGRPCR+P+CW Sbjct: 794 LYGRPCRSPVCW 805 >ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702098|gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 811 Score = 270 bits (689), Expect = 1e-69 Identities = 133/256 (51%), Positives = 170/256 (66%), Gaps = 9/256 (3%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + R+CVP D+ L + IL AH S LH GSTKM ++ +WW GM +DIA FV+ CLT Sbjct: 486 RDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYRTIKESYWWPGMKRDIAKFVAKCLT 545 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ++K EHQ+ +G LQ LPI EWKW+ + MDF GLP+ Q GKD++WVI+DRLTKS HF Sbjct: 546 CQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFL 605 Query: 388 ------SSDEAGR*HC*SQ*FVH--PRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGT 233 S + R + +H P + R P FTS+FW +F+ 
++GT Sbjct: 606 AIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPR---------FTSRFWPKFQEALGT 656 Query: 232 TLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMT 53 L S++FH Q DGQSERTIQ LEDML+AC ++F G+W LPLVEFAYN+S+ SSIGM Sbjct: 657 KLRFSTSFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNNSFQSSIGMA 716 Query: 52 LFEALYGRPCRTPLCW 5 +EALYGR CRTPLCW Sbjct: 717 PYEALYGRKCRTPLCW 732 >ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 269 bits (687), Expect = 2e-69 Identities = 134/262 (51%), Positives = 168/262 (64%), Gaps = 15/262 (5%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + R+CVP D+ L + IL AH S LH GSTKM ++ +WW GM +DIA FV+ CLT Sbjct: 1085 RDRICVPKDDQLRRAILEEAHYSAYALHPGSTKMYRTIKESYWWPGMERDIAEFVAKCLT 1144 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 CQ++K EHQ+P+G LQ L I EWKW+ + MDF GLP+ Q GKD++WVI+DRLTKS HF Sbjct: 1145 CQQIKAEHQKPSGTLQPLSIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFL 1204 Query: 385 S---------------DEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERF 251 + DE R H V R+ FTS+FW +F Sbjct: 1205 AIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDL---------------RFTSRFWPKF 1249 Query: 250 KLSMGTTLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY* 71 + ++GT L S+AFH Q DGQSERTIQ LEDML+AC ++F G+W LPLVEFAYN+S+ Sbjct: 1250 QEALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNNSFQ 1309 Query: 70 SSIGMTLFEALYGRPCRTPLCW 5 SSIGM +EALYGR CRTPLCW Sbjct: 1310 SSIGMAPYEALYGRKCRTPLCW 1331 >emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera] Length = 1573 Score = 268 bits (685), Expect = 3e-69 Identities = 135/254 (53%), Positives = 172/254 (67%), Gaps = 7/254 (2%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 KGRLCVP D +L E+L AH + T+H G+TKM DL+RQF W GM +DIA FV++C Sbjct: 1183 KGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFXWSGMKRDIAQFVANCQI 1242 Query: 
565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ+VK EHQRPA LQ LPI +WKWD+I MDF GLP+ + K+ VWVI+DRLTKS HF Sbjct: 1243 CQQVKAEHQRPAELLQPLPIPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKSAHFL 1302 Query: 388 ------SSDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTL 227 S + + + Q V ++ + D FTS+FW+ + ++GT L Sbjct: 1303 AMKTTDSMNSLAKLYI--QEIVRLHGIPVSIVSDRD-----PKFTSQFWQSLQRALGTQL 1355 Query: 226 SMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLF 47 + S+ FH Q DGQSER IQ+LEDML+AC L+F GNW++ LPL EFAYN+ Y SSIGM + Sbjct: 1356 NFSTVFHPQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNNXYQSSIGMAPY 1415 Query: 46 EALYGRPCRTPLCW 5 EALYGRPCR+PLCW Sbjct: 1416 EALYGRPCRSPLCW 1429 >emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera] Length = 1313 Score = 267 bits (683), Expect = 5e-69 Identities = 135/254 (53%), Positives = 172/254 (67%), Gaps = 7/254 (2%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 KGRLCVP D +L E+L AH + T+H G+TKM DL+RQFWW GM +DIA FV++ Sbjct: 877 KGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFWWSGMKRDIAQFVANFQI 936 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ+VK EHQRPAG LQ LPI EWKWD+I MDF GLP+ + K+ VWVI+D LTKS HF Sbjct: 937 CQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFVIGLPRTRSKKNGVWVIVDCLTKSAHFL 996 Query: 388 ------SSDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTL 227 S + + + Q V ++ + D FTS+FW+ + ++GT L Sbjct: 997 AMKTTDSMNSLAKLYI--QEIVRLHGILVSIVSDRD-----PKFTSQFWQSLQRALGTQL 1049 Query: 226 SMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLF 47 + ++AFH Q DGQSER IQ+LEDML+AC L+F GNW++ LPL EFAYN+SY SSI + Sbjct: 1050 NFNTAFHPQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNNSYQSSIXXAPY 1109 Query: 46 EALYGRPCRTPLCW 5 EALYGRPCR+PLCW Sbjct: 1110 EALYGRPCRSPLCW 1123 >gb|AIG55302.1| gag-pol, partial [Camellia sinensis] Length = 923 Score = 267 bits (682), Expect = 6e-69 Identities = 136/255 (53%), Positives = 171/255 (67%), Gaps = 7/255 (2%) Frame = -3 Query: 745 
KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + RL VP E +E+L H S L +H G TKM DL RQFWWRGM +D+A+FVS CLT Sbjct: 489 RDRLFVP--ESCREEVLGEFHHSRLAVHPGGTKMYQDLGRQFWWRGMKRDVAVFVSKCLT 546 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ+VK EHQRPAG LQ LPIAEWKW+ I MDF GLP+ Q+G D++WV++DRLTKS HF Sbjct: 547 CQQVKAEHQRPAGLLQPLPIAEWKWEHITMDFVVGLPRTQRGSDAIWVVVDRLTKSAHFI 606 Query: 388 ------SSDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTL 227 S D + + V +T + D FT++ W+ + ++GT L Sbjct: 607 PMRVRDSMDHLADLYI--RDVVRLHGVPVTIVSDRD-----PCFTARLWQSLQSALGTKL 659 Query: 226 SMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLF 47 + S+A+H Q DGQSERTIQ+LEDML+ C L+F G W LPLVEFAYN+S+ SSIGM F Sbjct: 660 TFSTAYHPQTDGQSERTIQILEDMLRGCVLDFSGTWERHLPLVEFAYNNSFQSSIGMAPF 719 Query: 46 EALYGRPCRTPLCWA 2 EALYGRPCR+P+ WA Sbjct: 720 EALYGRPCRSPVFWA 734 >ref|XP_013695570.1| PREDICTED: uncharacterized protein LOC106399662 [Brassica napus] Length = 1869 Score = 266 bits (681), Expect = 8e-69 Identities = 132/251 (52%), Positives = 174/251 (69%), Gaps = 4/251 (1%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + R+CVP DE L KEIL AH S ++H G+TKM DL+R + W GM +D+A F+S C T Sbjct: 1427 RNRVCVPNDELLKKEILQQAHHSRFSIHPGNTKMYKDLKRYYHWPGMKRDVASFISQCQT 1486 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ VK EHQ P+G LQ LP+ EWKWD + MDF GLP GK+++WVI+DRLTKS HF Sbjct: 1487 CQMVKAEHQVPSGLLQNLPLPEWKWDMVTMDFVTGLPTTLGGKNAIWVIVDRLTKSSHFL 1546 Query: 388 SSDEAGR*HC*SQ*FVHPRNC*ITRYPNID---CFRQRSLFTSKFWERFKLSMGTTLSMS 218 + + R +Q +++ I R + + + FTS+FW F+ ++GT + MS Sbjct: 1547 TIKKTDRADQLAQTYINE----IVRLHGVPVSIVSDRDTKFTSEFWRAFQKALGTKVHMS 1602 Query: 217 SAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEAL 38 +A+H Q DGQSERTIQ LEDML+AC L+++G+W++ LPL EFAYN+SY SSI M +EAL Sbjct: 1603 TAYHPQTDGQSERTIQTLEDMLRACVLDWEGSWAKYLPLTEFAYNNSYHSSIKMAPYEAL 1662 Query: 37 YGRPCRTPLCW 5 YGRPCRTPLCW Sbjct: 1663 YGRPCRTPLCW 1673 
>ref|XP_013645649.1| PREDICTED: uncharacterized protein LOC106350287 [Brassica napus] Length = 3063 Score = 266 bits (680), Expect = 1e-68 Identities = 134/249 (53%), Positives = 172/249 (69%), Gaps = 4/249 (1%) Frame = -3 Query: 739 RLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTCQ 560 R+CVP +E L KEIL AH S ++H G+TKM DL+R + W GM +D+A FVS C TCQ Sbjct: 1904 RVCVPDNEPLRKEILRQAHHSNFSIHPGNTKMYRDLKRYYHWPGMKRDVASFVSQCQTCQ 1963 Query: 559 KVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF-SS 383 VK EHQ P+G LQ LP+ EWKWD + MDF GLP GK+++WVI+DRLTKS HF + Sbjct: 1964 MVKAEHQVPSGLLQNLPLPEWKWDMVTMDFVTGLPTTSGGKNAIWVIVDRLTKSAHFLAI 2023 Query: 382 DEAGR*HC*SQ*FVHPRNC*ITRYPN--IDCFRQRSL-FTSKFWERFKLSMGTTLSMSSA 212 + R +Q ++ I R + R + FTS+FW F+ ++GT + MS+A Sbjct: 2024 KKTDRADQLAQIYISE----IVRLHGVPVSIVSDRDVKFTSEFWRAFQKALGTKVHMSTA 2079 Query: 211 FHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYG 32 +H Q DGQSERTIQ LEDML+AC L+++G+W + LPL EFAYN+SY SSIGM +EALYG Sbjct: 2080 YHPQTDGQSERTIQTLEDMLRACVLDWEGSWVKYLPLAEFAYNNSYHSSIGMAPYEALYG 2139 Query: 31 RPCRTPLCW 5 RPCRTPLCW Sbjct: 2140 RPCRTPLCW 2148 Score = 266 bits (680), Expect = 1e-68 Identities = 134/249 (53%), Positives = 172/249 (69%), Gaps = 4/249 (1%) Frame = -3 Query: 739 RLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTCQ 560 R+CVP +E L KEIL AH S ++H G+TKM DL+R + W GM +D+A FVS C TCQ Sbjct: 2647 RVCVPDNEPLRKEILRQAHHSNFSIHPGNTKMYRDLKRYYHWPGMKRDVASFVSQCQTCQ 2706 Query: 559 KVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF-SS 383 VK EHQ P+G LQ LP+ EWKWD + MDF GLP GK+++WVI+DRLTKS HF + Sbjct: 2707 MVKAEHQVPSGLLQNLPLPEWKWDMVTMDFVTGLPTTSGGKNAIWVIVDRLTKSAHFLAI 2766 Query: 382 DEAGR*HC*SQ*FVHPRNC*ITRYPN--IDCFRQRSL-FTSKFWERFKLSMGTTLSMSSA 212 + R +Q ++ I R + R + FTS+FW F+ ++GT + MS+A Sbjct: 2767 KKTDRADQLAQIYISE----IVRLHGVPVSIVSDRDVKFTSEFWRAFQKALGTKVHMSTA 2822 Query: 211 FHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYG 32 +H Q DGQSERTIQ LEDML+AC L+++G+W + LPL EFAYN+SY SSIGM +EALYG Sbjct: 2823 
YHPQTDGQSERTIQTLEDMLRACVLDWEGSWVKYLPLAEFAYNNSYHSSIGMAPYEALYG 2882 Query: 31 RPCRTPLCW 5 RPCRTPLCW Sbjct: 2883 RPCRTPLCW 2891 Score = 263 bits (673), Expect = 7e-68 Identities = 133/249 (53%), Positives = 171/249 (68%), Gaps = 4/249 (1%) Frame = -3 Query: 739 RLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTCQ 560 R+CVP +E L KEIL AH S ++H G+TKM DL+R + W GM +D+A FVS C TCQ Sbjct: 888 RVCVPDNEPLRKEILRQAHHSNFSIHPGNTKMYRDLKRYYHWPGMKRDVASFVSQCQTCQ 947 Query: 559 KVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF-SS 383 VK EHQ P+G LQ LP+ EWKWD + MDF GLP GK+++WVI+DRLTKS HF + Sbjct: 948 MVKAEHQVPSGLLQNLPLPEWKWDMVTMDFVTGLPTTSGGKNAIWVIVDRLTKSAHFLAI 1007 Query: 382 DEAGR*HC*SQ*FVHPRNC*ITRYPN--IDCFRQRSL-FTSKFWERFKLSMGTTLSMSSA 212 + R +Q ++ I R + R + FTS+FW F+ ++GT + MS+A Sbjct: 1008 KKTDRADQLAQIYISE----IVRLHGVPVSIVSDRDVKFTSEFWRAFQKALGTKVHMSTA 1063 Query: 211 FHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYG 32 +H Q D QSERTIQ LEDML+AC L+++G+W + LPL EFAYN+SY SSIGM +EALYG Sbjct: 1064 YHPQTDDQSERTIQTLEDMLRACVLDWEGSWVKYLPLAEFAYNNSYHSSIGMAPYEALYG 1123 Query: 31 RPCRTPLCW 5 RPCRTPLCW Sbjct: 1124 RPCRTPLCW 1132 >ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508779254|gb|EOY26510.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1290 Score = 265 bits (676), Expect = 3e-68 Identities = 132/256 (51%), Positives = 168/256 (65%), Gaps = 9/256 (3%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + R+CVP D+ L + IL AH S LH GSTKM ++ +WW GM +DIA FV+ CL Sbjct: 874 RDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLI 933 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ++K EHQ+ +G LQ LPI EWKW+ + MDF GLP+ Q GKD++WVI+ RLTKS HF Sbjct: 934 CQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIMGRLTKSAHFL 993 Query: 388 ------SSDEAGR*HC*SQ*FVH--PRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGT 233 S + R + +H P + R P FTS+FW +F+ ++GT Sbjct: 994 
AIHSTYSIERLARLYIDEVVRLHGVPVSIVSDRDPR---------FTSRFWPKFQEALGT 1044 Query: 232 TLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMT 53 L S+AFH QIDGQSERTIQ LEDML+AC ++F +W LPLVEFAYN+S+ SSIGM Sbjct: 1045 KLRFSTAFHPQIDGQSERTIQTLEDMLRACVIDFIRSWDRHLPLVEFAYNNSFQSSIGMA 1104 Query: 52 LFEALYGRPCRTPLCW 5 +EALYGR CRTPLCW Sbjct: 1105 TYEALYGRKCRTPLCW 1120 >ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis] gi|587945430|gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis] Length = 1088 Score = 262 bits (670), Expect = 2e-67 Identities = 129/256 (50%), Positives = 169/256 (66%), Gaps = 8/256 (3%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 KG+L V D DL +L AH S ++HLGSTKM DL+RQ+WWRGM +D+ FV+ C Sbjct: 808 KGKLVVLNDSDLRDAVLYEAHRSKFSIHLGSTKMYMDLKRQYWWRGMKRDVVNFVAKCSI 867 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 C++VK +HQRP+G LQ LPI +WKWD + MDF GLP+ Q+G D+VWV++DRLTK+ HF Sbjct: 868 CKQVKADHQRPSGELQPLPIPDWKWDHVTMDFVTGLPRTQEGYDAVWVVVDRLTKTAHFI 927 Query: 385 SDEAGR*HC*SQ*FVHPRNC--------*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTT 230 A + P+ C + P + + FTSKFW+ + ++GT Sbjct: 928 PIRAD--------YKVPKLCRLYIERIVTLHGVPVSIVSDRDAQFTSKFWKGLQNALGTE 979 Query: 229 LSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTL 50 L S+AFH Q DGQSER IQ+LED+L+A L+F+G W + LP EFAYN+SY +SI M Sbjct: 980 LRFSTAFHPQTDGQSERVIQILEDILRAYVLDFEGRWGKYLPNAEFAYNNSYQASIRMAP 1039 Query: 49 FEALYGRPCRTPLCWA 2 FEALYGRPCR+PLCWA Sbjct: 1040 FEALYGRPCRSPLCWA 1055 >ref|XP_012567311.1| PREDICTED: uncharacterized protein LOC105851235 [Cicer arietinum] Length = 1114 Score = 261 bits (667), Expect = 3e-67 Identities = 132/247 (53%), Positives = 165/247 (66%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 K RLCVP L ++IL AH S T+H GS KM DLR +WW GM +D+A FVS CL Sbjct: 681 KARLCVPNVGGLRRKILEEAHHSSYTIHPGSNKMYQDLRELYWWEGMKRDVADFVSRCLV 740 Query: 565 
CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 CQ+VK EHQ+PAG LQ + I EWKW+ IAMDF GLP+ Q+G DSVWVIIDRLTKS HF Sbjct: 741 CQQVKAEHQKPAGLLQPVEIPEWKWEGIAMDFVTGLPRTQKGYDSVWVIIDRLTKSAHFL 800 Query: 385 SDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTLSMSSAFH 206 + + + P + + FT++FW+ F+ S+GT L +S+AFH Sbjct: 801 PVKTTYTASQYAKLYLDKIVSLHGVPVSIISDRGAQFTAQFWKSFQTSLGTRLKLSTAFH 860 Query: 205 TQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYGRP 26 Q DGQSERTIQ+LEDM +AC L+ G+W + LPL+EFAYN+SY SSI M FEALYGR Sbjct: 861 PQTDGQSERTIQILEDMFRACVLDLGGSWDQHLPLMEFAYNNSYQSSIQMAPFEALYGRR 920 Query: 25 CRTPLCW 5 CR+P+ W Sbjct: 921 CRSPIGW 927 >emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera] Length = 1495 Score = 261 bits (666), Expect = 5e-67 Identities = 131/247 (53%), Positives = 161/247 (65%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 KGRLCVP D +L E+L AH + T+H G+TKM DL+RQFWW GM +DIA FV +C Sbjct: 1087 KGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFWWSGMKRDIAQFVXNCQI 1146 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 CQ+VK EHQRPAG LQ LPI EWKWD+I MDF GLP+ + K+ VW+ Sbjct: 1147 CQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFVIGLPRTRSKKNGVWI------------ 1194 Query: 385 SDEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTLSMSSAFH 206 E R H V R+ FTS+FW+ + ++GT L+ S+AFH Sbjct: 1195 -QEIVRLHGIPVSIVSDRD---------------PKFTSQFWQSLQRALGTQLNFSTAFH 1238 Query: 205 TQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYGRP 26 Q DGQSER IQ+LEDML+AC L+F GNW++ LPL EFAYN+SY SSIGM +EALYGRP Sbjct: 1239 PQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNNSYQSSIGMAPYEALYGRP 1298 Query: 25 CRTPLCW 5 CR+PLCW Sbjct: 1299 CRSPLCW 1305 >ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417177 [Eucalyptus grandis] Length = 1753 Score = 258 bits (658), Expect = 4e-66 Identities = 134/256 (52%), Positives = 168/256 (65%), Gaps = 9/256 (3%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 
566 +GRL VP D +L +EIL+ AH S ++H GSTKM +LR+ +WW GM DIA V+ CLT Sbjct: 1060 QGRLVVPDDVELREEILSEAHRSNYSIHPGSTKMYQNLRQHYWWCGMKADIAKHVAKCLT 1119 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ+VK +H +P G L+ L I EWKW+ I MDF GLP+ Q+G DS+WV++DRLTKS HF Sbjct: 1120 CQQVKAQHCKPGGLLRPLEIPEWKWEHITMDFVTGLPRSQRGNDSIWVVVDRLTKSAHFI 1179 Query: 388 ------SSDEAGR*HC*SQ*FVHPRNC*IT--RYPNIDCFRQRSLFTSKFWERFKLSMGT 233 S D + +H IT R P FT+ FW+ + ++GT Sbjct: 1180 AVRRDLSLDRLADLYVRQVVRMHGVPVTITSDRDPR---------FTAAFWKSLQSALGT 1230 Query: 232 TLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMT 53 L S+A+H Q DGQSERTIQ LEDML+AC L+FKG+W EQL LVEFAYN+SY SI M Sbjct: 1231 KLQYSTAYHPQTDGQSERTIQTLEDMLRACVLDFKGSWEEQLHLVEFAYNNSYQQSIQMA 1290 Query: 52 LFEALYGRPCRTPLCW 5 FEALYGR CRTP+CW Sbjct: 1291 PFEALYGRACRTPVCW 1306 >ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716756|gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1110 Score = 255 bits (652), Expect = 2e-65 Identities = 129/262 (49%), Positives = 165/262 (62%), Gaps = 15/262 (5%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 + R+CV D+ L + IL AH S LHL STKM ++ +WW GM +DIA FV+ CLT Sbjct: 780 RDRICVLKDDQLRRAILEEAHSSAYALHLESTKMYRTIKESYWWPGMKRDIAEFVAKCLT 839 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 CQ++K EHQ+ +G LQ LPI EWKW+ + MDF GL + Q GKD++WVI+DRLTKS HF Sbjct: 840 CQQIKAEHQKLSGTLQPLPIPEWKWEHVTMDFVLGLLRTQSGKDAIWVIVDRLTKSAHFL 899 Query: 385 S---------------DEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERF 251 + DE R + V R+ FTS+FW +F Sbjct: 900 AIHNTYSIEKLVKLYIDEIVRLYGVPISIVSDRD---------------PRFTSRFWSKF 944 Query: 250 KLSMGTTLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY* 71 + ++GT L S+AFH Q DGQSERTIQ LEDML+AC ++F G+W LPLVEFAYN+S+ Sbjct: 945 QEALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNNSFQ 1004 Query: 70 SSIGMTLFEALYGRPCRTPLCW 5 SSIGM +EALYGR C+TP CW Sbjct: 1005 
SSIGMAPYEALYGRKCQTPFCW 1026 >ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] gi|508702196|gb|EOX94092.1| Gag protease polyprotein [Theobroma cacao] Length = 269 Score = 255 bits (651), Expect = 3e-65 Identities = 126/243 (51%), Positives = 156/243 (64%), Gaps = 15/243 (6%) Frame = -3 Query: 688 AHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTCQKVKGEHQRPAGPLQLLP 509 AH S LH GSTKM ++ +WW GM +D+A FV+ CL CQ+VK EHQRPAG LQ LP Sbjct: 4 AHSSAYALHPGSTKMYRTIKENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQSLP 63 Query: 508 IAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFSS---------------DEA 374 + EWKW+ + MDF GLP+ Q+G D++WVI+DRLTKS HF + DE Sbjct: 64 VPEWKWEHVTMDFVLGLPRTQRGNDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEI 123 Query: 373 GR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTTLSMSSAFHTQID 194 R H V R+ FTS+FW +F+ ++GT L S+AFH Q D Sbjct: 124 VRLHGVPVSIVSDRD---------------PRFTSRFWLKFQEALGTKLKFSTAFHPQTD 168 Query: 193 GQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEALYGRPCRTP 14 GQSERTIQ LEDML+AC ++F G+W LPLVEFAYN+S+ SSIGM +EALYGR CRTP Sbjct: 169 GQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTP 228 Query: 13 LCW 5 LCW Sbjct: 229 LCW 231 >ref|XP_010541787.1| PREDICTED: uncharacterized protein LOC104815170 [Tarenaya hassleriana] Length = 1003 Score = 254 bits (649), Expect = 4e-65 Identities = 132/262 (50%), Positives = 166/262 (63%), Gaps = 15/262 (5%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 +GR VP DE+L KE+L AH + ++H G+TKM +L++ F W GM KDIA FV+ CLT Sbjct: 675 RGRAYVPKDEELRKELLKEAHNTRYSIHPGTTKMYQNLKQYFLWHGMKKDIAKFVTHCLT 734 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHFS 386 CQ VK EHQ AG LQ L I +WKWD + MDF GLP+ +G D++WVI+DRLTKS HF Sbjct: 735 CQLVKAEHQVSAGKLQSLSIPQWKWDLVTMDFIVGLPRKPKGNDAIWVIVDRLTKSAHFI 794 Query: 385 S---------------DEAGR*HC*SQ*FVHPRNC*ITRYPNIDCFRQRSLFTSKFWERF 251 S +E R H V R+ FTS+FW Sbjct: 795 SIIKTFSMPRLAQVYIEEVVRLHGIPISIVSDRD---------------PRFTSRFWNSL 839 Query: 250 
KLSMGTTLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY* 71 + +M T + +S+A+H Q DGQSERTIQ LEDML+AC L++ G W LPLVEFAYN+S+ Sbjct: 840 QEAMRTKVRLSTAYHPQTDGQSERTIQTLEDMLRACVLDWGGEWDRHLPLVEFAYNNSFH 899 Query: 70 SSIGMTLFEALYGRPCRTPLCW 5 SSIGM+ FEALYGRPC+TPLCW Sbjct: 900 SSIGMSPFEALYGRPCKTPLCW 921 >gb|AAD20658.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1611 Score = 254 bits (649), Expect = 4e-65 Identities = 133/252 (52%), Positives = 172/252 (68%), Gaps = 6/252 (2%) Frame = -3 Query: 742 GRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTC 563 GR+CVP D L +EIL AH+S ++H GS KM DL+R + W GM KD+A +V+ C TC Sbjct: 1149 GRVCVPNDRALKEEILREAHQSKFSIHPGSNKMYRDLKRYYHWVGMKKDVARWVAKCPTC 1208 Query: 562 QKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGK-DSVWVIIDRLTKSDHFS 386 Q VK EHQ P+G LQ LPI EWKWD I MDF GLP G + K ++VWV++DRLTKS HF Sbjct: 1209 QLVKAEHQVPSGLLQNLPIPEWKWDHITMDFVTGLPTGIKSKHNAVWVVVDRLTKSAHFM 1268 Query: 385 --SDEAGR*HC*SQ*FVHPRNC*ITRYPNID---CFRQRSLFTSKFWERFKLSMGTTLSM 221 SD+ G ++ I R I + + FTSKFW+ F+ ++GT +++ Sbjct: 1269 AISDKDG-----AEIIAEKYIDEIVRLHGIPVSIVSDRDTRFTSKFWKAFQKALGTRVNL 1323 Query: 220 SSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEA 41 S+A+H Q D QSERTIQ LEDML+AC L++ GNW + L LVEFAYN+S+ +SIGM+ +EA Sbjct: 1324 STAYHPQTDEQSERTIQTLEDMLRACVLDWGGNWEKYLRLVEFAYNNSFQASIGMSPYEA 1383 Query: 40 LYGRPCRTPLCW 5 LYGR CRTPLCW Sbjct: 1384 LYGRACRTPLCW 1395 >emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera] Length = 1387 Score = 254 bits (648), Expect = 6e-65 Identities = 133/256 (51%), Positives = 168/256 (65%), Gaps = 9/256 (3%) Frame = -3 Query: 745 KGRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLT 566 KGRLCVP D +L E+L AH + T+H G+TK+ GM KDIA FV++C Sbjct: 962 KGRLCVPKDVELRNELLADAHRAKYTIHPGNTKI-----------GMKKDIAQFVANCQI 1010 Query: 565 CQKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF- 389 CQ+VK EHQRPAG LQ LPI EWKWD+I MDF GLP+ + K+ VW+I+DRLTKS HF Sbjct: 1011 
CQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFVIGLPRTRSKKNGVWMIVDRLTKSTHFL 1070 Query: 388 ------SSDEAGR*HC*SQ*FVH--PRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGT 233 S + + + +H P + R P FTS+FW+ + ++GT Sbjct: 1071 AMKTIDSMNSLAKLYIQEIVRLHGIPVSIVSDRDPK---------FTSQFWQSLQRTLGT 1121 Query: 232 TLSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMT 53 L+ S+AFH Q DGQSER IQ+LEDML+AC L+F GNW++ LPL EFAYN+SY SSIGM Sbjct: 1122 QLNFSTAFHPQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNNSYQSSIGMX 1181 Query: 52 LFEALYGRPCRTPLCW 5 +EALYGRPCR+PLCW Sbjct: 1182 TYEALYGRPCRSPLCW 1197 >gb|AAD37020.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 949 Score = 253 bits (647), Expect = 7e-65 Identities = 132/252 (52%), Positives = 172/252 (68%), Gaps = 6/252 (2%) Frame = -3 Query: 742 GRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTC 563 GR+CVP D L +EIL AH+S ++H GS KM DL+R + W GM KD+A +V+ C TC Sbjct: 541 GRVCVPNDRALKEEILREAHQSKFSIHPGSNKMYRDLKRYYHWVGMRKDVARWVAKCPTC 600 Query: 562 QKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGK-DSVWVIIDRLTKSDHFS 386 Q VK EHQ P+G LQ LPI+EWKWD I MDF LP G + K ++VWV++DRLTKS HF Sbjct: 601 QLVKAEHQVPSGLLQNLPISEWKWDHITMDFVTRLPTGIKSKHNAVWVVVDRLTKSAHFM 660 Query: 385 --SDEAGR*HC*SQ*FVHPRNC*ITRYPNID---CFRQRSLFTSKFWERFKLSMGTTLSM 221 SD+ G ++ I R I + + FTSKFW F+ ++GT +++ Sbjct: 661 AISDKDG-----AEIIAEKYIDEIMRLHGIPVSIVSDRDTRFTSKFWNAFQKALGTRVNL 715 Query: 220 SSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTLFEA 41 S+A+H Q DGQSERTIQ LEDML+AC L++ GNW + L L+EFAYN+S+ +SIGM+ +EA Sbjct: 716 STAYHPQTDGQSERTIQTLEDMLRACVLDWGGNWEKYLRLIEFAYNNSFQASIGMSPYEA 775 Query: 40 LYGRPCRTPLCW 5 LYGR CRTPLCW Sbjct: 776 LYGRACRTPLCW 787 >emb|CAC44142.1| putative polyprotein [Cicer arietinum] Length = 655 Score = 253 bits (646), Expect = 1e-64 Identities = 130/255 (50%), Positives = 167/255 (65%), Gaps = 9/255 (3%) Frame = -3 Query: 742 GRLCVPYDEDLLKEILTMAHESVLTLHLGSTKMNYDLRRQFWWRGMSKDIALFVSSCLTC 563 GR+CVP + K IL AH+S L++H G+TKM DLR+ +WW GM K +A +VS+CLTC Sbjct: 315 
GRICVPEITAMRKTILEEAHKSKLSIHPGATKMYQDLRQNYWWPGMKKHVAEYVSTCLTC 374 Query: 562 QKVKGEHQRPAGPLQLLPIAEWKWDSIAMDFARGLPKGQQGKDSVWVIIDRLTKSDHF-- 389 QK K EHQRPAG LQ L I EWKWDSI+MDF GLPK ++ DS+WVI+DRLTKS HF Sbjct: 375 QKAKVEHQRPAGMLQPLDIPEWKWDSISMDFITGLPKTRRKNDSIWVIVDRLTKSAHFLP 434 Query: 388 -----SSDEAGR*HC*SQ*FVH--PRNC*ITRYPNIDCFRQRSLFTSKFWERFKLSMGTT 230 D+ + +H P + R P FTS FW ++GT Sbjct: 435 VRTTYKVDQLTEIYIAEIVRLHGVPSSIVSDRDPK---------FTSHFWGALHEALGTK 485 Query: 229 LSMSSAFHTQIDGQSERTIQVLEDMLKACALEFKGNWSEQLPLVEFAYNSSY*SSIGMTL 50 L +SSA+H Q DGQ+ERT Q LED+L+AC L+ +G+W LPL+EF YN+S+ +SIGM Sbjct: 486 LRLSSAYHPQTDGQTERTNQSLEDLLRACVLDDRGSWDHVLPLIEFTYNNSFHTSIGMAP 545 Query: 49 FEALYGRPCRTPLCW 5 ++ALYGR C+TPLCW Sbjct: 546 YQALYGRKCQTPLCW 560