BLASTX nr result
ID: Rehmannia26_contig00018232
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00018232 (848 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

gb|EOY03692.1| Gag protease polyprotein [Theobroma cacao]  108  3e-21
emb|CAN72584.1| hypothetical protein VITISV_001910 [Vitis vinifera]  107  7e-21
gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom...  105  2e-20
gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma ...  104  4e-20
gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus pe...  101  3e-19
gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao]  100  1e-18
emb|CAN81227.1| hypothetical protein VITISV_038888 [Vitis vinifera]  99  2e-18
gb|EOY31906.1| Gag protease polyprotein [Theobroma cacao]  98  4e-18
gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao]  97  6e-18
gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao]  97  1e-17
gb|EOY26650.1| Gag protease polyprotein [Theobroma cacao]  96  2e-17
gb|EOY26606.1| Gag protease polyprotein [Theobroma cacao]  96  2e-17
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...  95  3e-17
gb|EOX98886.1| Gag protease polyprotein [Theobroma cacao]  95  4e-17
emb|CAN68039.1| hypothetical protein VITISV_018924 [Vitis vinifera]  94  5e-17
emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]  94  6e-17
gb|EMJ28586.1| hypothetical protein PRUPE_ppb016975mg [Prunus pe... 
94 8e-17 gb|EOY16714.1| Gag protease polyprotein [Theobroma cacao] 93 1e-16 gb|EOX99639.1| Gag protease polyprotein [Theobroma cacao] 93 1e-16 gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao] 91 4e-16 >gb|EOY03692.1| Gag protease polyprotein [Theobroma cacao] Length = 689 Score = 108 bits (269), Expect = 3e-21 Identities = 80/283 (28%), Positives = 119/283 (42%), Gaps = 24/283 (8%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ +E+ + + FE G Sbjct: 142 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETHFNKLMLYVPDLVKSEQDQASYFEEG 201 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNK------------KRQGD 526 LR IR R+T++ E + ++ A+RAEKL E++ ++ K KR D Sbjct: 202 LRNEIRERMTVTGREPHKEVVQMALRAEKLATENRTIRTEFAKRRNPGMSSSQLVKRGKD 261 Query: 525 ETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGECRL 346 F S + C +CG H G CR Sbjct: 262 SAISGSTTSVFVTSPRPPFPPSQQRPSRFSRSAMTGSRKSFGGSDRCKNCGNYHSGLCRG 321 Query: 345 LTGKCFRCGEPGHIVRNCPKPREENTVE---------QXXXXXXXXXXXXXXXXXXXXXG 193 T +CF+CG+ GHI NCP+ TV Q Sbjct: 322 PT-RCFQCGQTGHIRSNCPRLGRATTVASSSPVHTDMQRRDSSGLPLRQGVAIRSGVESN 380 Query: 192 ISADESQGSRQQTQARVFAITKEDAKATPTVITG--QNLDKEA 70 A + +T RVFA+T+++A+ P +TG DK+A Sbjct: 381 TPAHPPSRPQTRTSTRVFAVTEDEARVRPGAVTGTMSLFDKDA 423 >emb|CAN72584.1| hypothetical protein VITISV_001910 [Vitis vinifera] Length = 279 Score = 107 bits (266), Expect = 7e-21 Identities = 77/261 (29%), Positives = 118/261 (45%), Gaps = 10/261 (3%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F+ YLP+ R QK EF+ L+QG+ +VA+YE F +LS +A +IA E+KK +F+ GL Sbjct: 55 FYKKYLPDNVRRQKVGEFVRLEQGDITVAQYEAKFTELSRFARQLIAIEEKKTLKFQDGL 114 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGGFKNKKEFTQ 487 + ++N+I++ L YS + A+ AEK E +Q Q K + D + K F+ Sbjct: 115 KPYLKNKISILKLNVYSGVVDRALIAEKDNKELQQYREQQRKSSKSDGAHDNQEPKRFSS 174 Query: 486 STQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGE-CRLLTGKCFRCGEPG 310 VTC CGK H+ + C + 
CF CG+ G Sbjct: 175 VESHIKGEVVQNLD-----------------VTCSICGKKHWSKPCYKESEACFDCGKHG 217 Query: 309 HIVRNCP--------KPREENTVEQXXXXXXXXXXXXXXXXXXXXXGISADESQGSRQQT 154 HI+R+CP KP++EN +++ + + Sbjct: 218 HIIRDCPENKKFIIGKPKKENKMDK------------------------------QKPRV 247 Query: 153 QARVFAITKEDAKATPTVITG 91 Q R+FA T +D +AT V TG Sbjct: 248 QGRMFATTHQDTQATSDVTTG 268 >gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 105 bits (262), Expect = 2e-20 Identities = 79/295 (26%), Positives = 126/295 (42%), Gaps = 24/295 (8%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ +E+ + + FE G Sbjct: 109 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPDLVKSEQDQASYFEEG 168 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGG--FKNKKE 496 LR IR R+T++ E + ++ A+RAEKL E+++ ++ K+R + K K+ Sbjct: 169 LRNEIRERMTVNGREPHKEVVQMALRAEKLAIENRRIRIEFAKRRNPGMSSSQPVKRGKD 228 Query: 495 FTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVV----------TCVHCGKNHYGECRL 346 S + C +CG H G CR Sbjct: 229 SAISGSTTSVSVTSPRPPFPPSQQRPSRFSRSDMTGSGKSFGGSDRCRNCGNYHSGLCRE 288 Query: 345 LTGKCFRCGEPGHIVRNCPKPREENTVE---------QXXXXXXXXXXXXXXXXXXXXXG 193 T +CF+CG+ GHI NCP+ V Q Sbjct: 289 PT-RCFQCGQTGHIRSNCPRLGRATVVASSSPARTDIQRRDSSGLPPRQGVAIPSGVESN 347 Query: 192 ISADESQGSRQQTQARVFAITKEDAKATPTVITG--QNLDKEANKVD*TNRDKDF 34 A + +T RVFA+T+++A+ P +TG DK+A + + D+ + Sbjct: 348 TPAHPPSRPQTRTSTRVFAVTEDEAQVRPGAVTGTMSLFDKDAYVLIDSGSDRSY 402 >gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 104 bits (260), Expect = 4e-20 Identities = 79/295 (26%), Positives = 127/295 (43%), Gaps = 24/295 (8%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ +E+ + + FE G Sbjct: 125 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPDLVKSEQDQASYFEEG 184 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGG--FKNKKE 496 LR IR R+T++ E + 
++ A+RAEKL E+++ ++ K+R + K K+ Sbjct: 185 LRNEIRERMTVTGREPHKEVVQMALRAEKLAIENRRIRTEFAKRRNPGMSSSQPVKRGKD 244 Query: 495 FTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVV----------TCVHCGKNHYGECRL 346 S ++ C +CG H G CR Sbjct: 245 SAISGSTTSVSVTSPRPPFPPSQQRPSRFSRSAMTGSGRSFGGSDRCRNCGNYHSGLCRE 304 Query: 345 LTGKCFRCGEPGHIVRNCPKPREENTVE---------QXXXXXXXXXXXXXXXXXXXXXG 193 T +CF+CG+ GHI NCP+ V Q Sbjct: 305 PT-RCFQCGQTGHIRSNCPRLGRATVVASSSPARTDIQRRDSSGLPPRQGVAIRSGVESN 363 Query: 192 ISADESQGSRQQTQARVFAITKEDAKATPTVITG--QNLDKEANKVD*TNRDKDF 34 A + +T RVFA+T+++A+ P +TG DK+A + + D+ + Sbjct: 364 TPAHPPSRPQTRTSTRVFAVTEDEAQVRPGAVTGTISLFDKDAYVLIDSGSDRSY 418 >gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus persica] Length = 505 Score = 101 bits (252), Expect = 3e-19 Identities = 74/265 (27%), Positives = 117/265 (44%), Gaps = 14/265 (5%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F + + P YR K+ EFL LKQG+ SV EYE FN++S +AP ++ATE+ +C RF+ GL Sbjct: 46 FSEQFYPPSYRHAKKSEFLYLKQGSMSVVEYEHKFNEMSRFAPELVATEEDRCRRFDEGL 105 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDETGGFKNKKEFTQS 484 I+ +T + + L A R + + S + + R G + G K+ + S Sbjct: 106 WCEIQAVVTANTYPNMRALAQAIERVSRKLSGSAGRRRRDTPRIGGPSQGPSKKRGSSSS 165 Query: 483 TQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHY------------GECRLLT 340 + +TC +CG+ + G+ + + Sbjct: 166 SASGEWQTG---------------------LTCFNCGQVGHMVKDCPSYTQGGGQSQSSS 204 Query: 339 GKCFRCGEPGHIVRNCPKPREENTVEQXXXXXXXXXXXXXXXXXXXXXGISADESQG--S 166 C+ CG+ GH R+CP + + Q G S +S+G Sbjct: 205 LTCYFCGQVGHTKRSCPIILQSDAAIQRTGAQQGQAGSSNSRALSSSRGRSGRQSRGQPG 264 Query: 165 RQQTQARVFAITKEDAKATPTVITG 91 R TQ RVF++T+++A ATP VITG Sbjct: 265 RSTTQGRVFSMTQQEAHATPDVITG 289 >gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao] Length = 426 Score = 99.8 bits (247), Expect = 1e-18 Identities = 61/200 (30%), Positives = 93/200 (46%), Gaps = 13/200 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V 
EYE FN+L Y P ++ +E+ + + FE G Sbjct: 189 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPDLVKSEQDQASYFEEG 248 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-------------NFQYNKKRQGD 526 LR IR R+T++ E + ++ A+RAEKL E+++ ++ + KR D Sbjct: 249 LRNEIRERMTVTGREPHKEVVQMALRAEKLATENRRIRTEFAKRRNPGMSYSQSVKRGKD 308 Query: 525 ETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGECRL 346 S + C +CG H G CR Sbjct: 309 SAISRSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSFGGSDRCRNCGNYHSGLCRE 368 Query: 345 LTGKCFRCGEPGHIVRNCPK 286 T +CF+CG+ GHI NCP+ Sbjct: 369 PT-RCFQCGQTGHIRSNCPR 387 >emb|CAN81227.1| hypothetical protein VITISV_038888 [Vitis vinifera] Length = 1132 Score = 99.0 bits (245), Expect = 2e-18 Identities = 74/265 (27%), Positives = 116/265 (43%), Gaps = 10/265 (3%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F+ Y P+ R QK EF+ L+Q N +VA+YE F +LS ++P +IAT+++K +F+ GL Sbjct: 116 FYKKYFPDSVRQQKVGEFVRLEQRNLTVAQYEAKFTELSCFSPQLIATKEEKTLKFQDGL 175 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGGFKNKKEFTQ 487 + ++N+ ++ L Y ++ A+ A+K E Q Q K + D G + +K+ T Sbjct: 176 KPYLKNKTSILKLSIYLEVVDRALIAKKDNEELHQYKEQQRKSNRNDGAHGNQARKKPTP 235 Query: 486 STQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGE-CRLLTGKCFRCGEPG 310 S C CGK H G C F CG+ G Sbjct: 236 SRNQNKGKVTQNLDE-----------------ICPTCGKKHGGRPCYREIRAWFGCGKQG 278 Query: 309 HIVRNCP--------KPREENTVEQXXXXXXXXXXXXXXXXXXXXXGISADESQGSRQQT 154 H+VR+CP KP+EEN ++ + + Sbjct: 279 HMVRDCPENKKFVFGKPKEENKEDR------------------------------QKPRA 308 Query: 153 QARVFAITKEDAKATPTVITGQNLD 79 Q RVF++T DA+AT V+ G +D Sbjct: 309 QGRVFSMTHRDAQATSDVVAGMPID 333 >gb|EOY31906.1| Gag protease polyprotein [Theobroma cacao] Length = 389 Score = 97.8 bits (242), Expect = 4e-18 Identities = 62/200 (31%), Positives = 97/200 (48%), Gaps = 13/200 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ +E+ + + FE G Sbjct: 189 
EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPDLVKSEQDQASYFEEG 248 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGG--FKNKKE 496 LR IR R+T+ E + ++ A+RAEKL E+++ ++ K+R + K K+ Sbjct: 249 LRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTEFAKRRNPGMSSSQPVKRGKD 308 Query: 495 FTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVV----------TCVHCGKNHYGECRL 346 S +++ C +CG H G CR Sbjct: 309 SATSGSTTSVSVTSPRPPFPPSQQRPSRFSRSAMIGSGKSLGGSDRCRNCGNYHSGLCRG 368 Query: 345 LTGKCFRCGEPGHIVRNCPK 286 T +CF+CG+ GHI NCP+ Sbjct: 369 PT-RCFQCGQTGHIRSNCPQ 387 >gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao] Length = 404 Score = 97.4 bits (241), Expect = 6e-18 Identities = 63/201 (31%), Positives = 94/201 (46%), Gaps = 14/201 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ +E+ + + FE G Sbjct: 189 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPDLVKSEQDQASYFEEG 248 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ--------------NFQYNKKRQG 529 LR IR R+T+ E + ++ A+RAEKL E+++ + Q K+ + Sbjct: 249 LRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTKFAKRRNLGMSSSQPVKRGKD 308 Query: 528 DETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGECR 349 T G T S + C +CG H G CR Sbjct: 309 SATSGSTTSISVT-SPRPPFPPSQQRPSRFSRSAMTGSGKSLGGFDRCRNCGNYHSGLCR 367 Query: 348 LLTGKCFRCGEPGHIVRNCPK 286 T +CF+CG+ GHI NCP+ Sbjct: 368 GPT-RCFQCGQTGHIRSNCPQ 387 >gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao] Length = 809 Score = 96.7 bits (239), Expect = 1e-17 Identities = 74/275 (26%), Positives = 117/275 (42%), Gaps = 24/275 (8%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P+++ +E+ + + FE G Sbjct: 164 EFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVPNLVKSEQDQASYFEEG 223 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ--------------NFQYNKKRQG 529 LR IR R+T+ E + ++ A+RAEKL E+++ + Q K+ + Sbjct: 224 LRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTEFAKRRNPGMSSSQSVKRGKD 283 Query: 528 
DETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGECR 349 T G T S + C + G H G CR Sbjct: 284 SATSGSTTSVSVT-SPRPPFPPSQQRPSRFSRSAMTGSGKSLGGSDRCRNYGNYHSGLCR 342 Query: 348 LLTGKCFRCGEPGHIVRNCPK----------PREENTVEQXXXXXXXXXXXXXXXXXXXX 199 T +CF+CG+ GHI NCP+ P +++ Sbjct: 343 GPT-RCFQCGQTGHIRSNCPQLGRATVAASSPPARTDIQRRDSSGLPPRQGGAIRSGVES 401 Query: 198 XGISADESQGSRQQTQARVFAITKEDAKATPTVIT 94 S S+ + +T RVFA+T+++A P +T Sbjct: 402 NTPSHPPSR-PQTRTATRVFAVTEDEALVRPGAVT 435 >gb|EOY26650.1| Gag protease polyprotein [Theobroma cacao] Length = 467 Score = 95.9 bits (237), Expect = 2e-17 Identities = 59/201 (29%), Positives = 96/201 (47%), Gaps = 14/201 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL L+QGN ++ EYE FN+L Y P ++ +E+ + + FE G Sbjct: 196 EFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVPDLVKSEQDQASYFEEG 255 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGG---FKNKK 499 LR IR R+T++ E + ++ A+RAEKL E+++ ++ KKR + + + K Sbjct: 256 LRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKKRNPNVSSSQLPKRGKD 315 Query: 498 EFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVT----------CVHCGKNHYGECR 349 F + T C CG+ H GEC Sbjct: 316 TFASESTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGGLNKCEKCGRYHVGECW 375 Query: 348 LLTGKCFRCGEPGHIVRNCPK 286 + +CF C +PGHI +CP+ Sbjct: 376 GI--RCFHCDQPGHIRSDCPQ 394 >gb|EOY26606.1| Gag protease polyprotein [Theobroma cacao] Length = 669 Score = 95.9 bits (237), Expect = 2e-17 Identities = 58/201 (28%), Positives = 93/201 (46%), Gaps = 14/201 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF+ Y ++ +K++EFL L+QGN ++ EYE FN+L Y P ++ +E+ + + FE G Sbjct: 226 EFNGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVPDLVKSEQDQASYFEEG 285 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDETGGF----KNKK 499 LR IR R+T++ E + ++ A+RAEKL E+++ KR+ + K Sbjct: 286 LRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRKRAEFAKRRNPNVSSSQLPKRGKD 345 Query: 498 EFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVT----------CVHCGKNHYGECR 
349 F + T C CG+ H GEC Sbjct: 346 TFASESTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGGLNKCEKCGRYHVGECW 405 Query: 348 LLTGKCFRCGEPGHIVRNCPK 286 + +CF C +PGHI +CP+ Sbjct: 406 GI--RCFHCDQPGHIRSDCPQ 424 >gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa] gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group] gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1230 Score = 95.1 bits (235), Expect = 3e-17 Identities = 73/273 (26%), Positives = 115/273 (42%), Gaps = 19/273 (6%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F+ Y PE + KEKEFLELKQGN+SVAEYE+ F++L+ +AP + T+ K RFE+GL Sbjct: 150 FYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAPEFVQTDGSKARRFESGL 209 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDETGGFKNKKEFTQS 484 R ++ R+ +L + ++ + A EK +Q ++ + ++ +T +N+ F + Sbjct: 210 RQPLKRRVEAFELTIFREVVSKAQLLEK--GYHEQRIEHGQPQKKFKTNNPQNQGRFRGN 267 Query: 483 TQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGE-CRLLTGKCFRCGEPGH 307 C C +H C G+CF CGE GH Sbjct: 268 YSGQMQRKSSENQGR----------------KCPICQGSHVPSICPNCWGRCFECGEAGH 311 Query: 306 IVRNCP-----KPREENTVEQ--------XXXXXXXXXXXXXXXXXXXXXGISADESQGS 166 CP K R +T + + + ++G Sbjct: 312 TRYQCPLLQKGKNRVSSTTQPNTKVLTPVPSLYLPGPSSANNHGPNQGKPLANTNTTRGM 371 Query: 165 RQQ-----TQARVFAITKEDAKATPTVITGQNL 82 R ARV+ +TK A+ + TV+TG L Sbjct: 372 RSNNSQGGNHARVYNLTKSTAEESNTVVTGNVL 404 >gb|EOX98886.1| Gag protease polyprotein [Theobroma cacao] Length = 467 Score = 94.7 bits (234), Expect = 4e-17 Identities = 58/201 (28%), Positives = 94/201 (46%), Gaps = 14/201 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL L+QGN ++ EYE FN+L Y P ++ +E+ + + FE G Sbjct: 196 EFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVPDLVKSEQDQASYFEEG 255 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDETGGF---KNKKE 496 LR IR 
R+T++ E + ++ A+RAEKL E+++ KR+ K K+ Sbjct: 256 LRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKRRNPNVSSIQLPKRGKD 315 Query: 495 FTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVT-----------CVHCGKNHYGECR 349 + S + + C CG+ H GEC Sbjct: 316 TSASESTVSVPVISPRPPFSQLQQRPPRFSRSGMSSTSEKSFGGLNKCEKCGRYHVGECW 375 Query: 348 LLTGKCFRCGEPGHIVRNCPK 286 + +CF C +PGHI +CP+ Sbjct: 376 GI--RCFHCDQPGHIRSDCPQ 394 >emb|CAN68039.1| hypothetical protein VITISV_018924 [Vitis vinifera] Length = 548 Score = 94.4 bits (233), Expect = 5e-17 Identities = 72/262 (27%), Positives = 112/262 (42%), Gaps = 11/262 (4%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F+ Y P+ R QK EF+ L+QG+ VA YE F +LS ++P +IATE++K +F+ GL Sbjct: 208 FYKKYFPDSVRRQKVGEFIRLEQGDMIVAHYEAKFTELSRFSPQLIATEEEKALKFQDGL 267 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDE--TGGFKNKKEFT 490 + ++N+I++ L YS++ A+ EK E Q + +KR + G +NK + Sbjct: 268 KPYLKNKISILKLGVYSEVVDRALIVEKDNEELHQYREQQRKRNRSDGAHGRNQNKGKAA 327 Query: 489 QSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGE-CRLLTGKCFRCGEP 313 Q+ C CGK H G C CF G+ Sbjct: 328 QNLDG----------------------------ACPTCGKKHGGRPCYREIEACFGYGKQ 359 Query: 312 GHIVRNC--------PKPREENTVEQXXXXXXXXXXXXXXXXXXXXXGISADESQGSRQQ 157 H++R+C KP+EEN + + + Sbjct: 360 RHLIRDCLENRKFITGKPKEEN------------------------------KENKQKPK 389 Query: 156 TQARVFAITKEDAKATPTVITG 91 Q RVFA+T DA+A+ V+ G Sbjct: 390 AQGRVFAMTHRDAQASSDVVIG 411 >emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera] Length = 360 Score = 94.0 bits (232), Expect = 6e-17 Identities = 55/190 (28%), Positives = 93/190 (48%), Gaps = 1/190 (0%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F+ Y P+ R QK EF+ L+QG+ +VA+YE F +LS ++P +IATE++K +F+ L Sbjct: 179 FYKKYFPDSVRRQKVGEFIRLEQGDMTVAQYEAKFTELSRFSPQLIATEEEKALKFQDXL 238 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQGDETGGFKNKKEFTQS 484 + ++N+ ++ L YS+ R ++ + N+ ++ +G +NK + Q+ Sbjct: 239 
KPYLKNKXSILXLGXYSE-----YREQQRKRNRSDGAHGNQXQRRSTSGRNQNKGKAAQN 293 Query: 483 TQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGE-CRLLTGKCFRCGEPGH 307 C CGK H G C TG CF CG+ GH Sbjct: 294 LDG----------------------------ACPTCGKKHGGRPCYRETGACFGCGKQGH 325 Query: 306 IVRNCPKPRE 277 ++R+CP+ R+ Sbjct: 326 LIRDCPENRK 335 >gb|EMJ28586.1| hypothetical protein PRUPE_ppb016975mg [Prunus persica] Length = 650 Score = 93.6 bits (231), Expect = 8e-17 Identities = 75/300 (25%), Positives = 117/300 (39%), Gaps = 49/300 (16%) Frame = -3 Query: 843 FHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAGL 664 F D + P Y+ K+ EFL LKQG+ SV EYE FN+LS +AP ++ATE+ +C RFE GL Sbjct: 153 FSDPFYPPSYKQAKKSEFLYLKQGSMSVVEYEHKFNELSRFAPELVATEEDRCRRFEEGL 212 Query: 663 RFGIRNRITLSDLESYSKLKAAAIRAEK-------------------LEAESKQNFQYNK 541 + I+ +T + + L AA R + + SK+ + Sbjct: 213 WWEIQAVVTANTYPNMRALAQAAARVSRKLGGNVSRRRRDTSGIGGPSQGPSKRGGSSSS 272 Query: 540 KRQGDETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHY 361 G +GG + + S + + +TC HCG+ + Sbjct: 273 SASGGWSGG---RGSSSSSGRSGSRSAWTQYSGPQSAASTARAPSRYTGLTCFHCGQVGH 329 Query: 360 ------------GECRLLTGKCFRCGEPGHIVRNCPKPREENTVEQXXXXXXXXXXXXXX 217 G + + C+ CG+ GH R+CP + + Q Sbjct: 330 IAKDCPSYTQGGGPSQSSSLTCYFCGQVGHTKRSCPIILQNEAINQGTGAQQGQGILGQN 389 Query: 216 XXXXXXXGISADES------------------QGSRQQTQARVFAITKEDAKATPTVITG 91 +A S + R TQA VF++++++A ATP VITG Sbjct: 390 QNQGGVSSSAAGSSSSRASSSSRGRCGRQSRGEPGRSTTQAHVFSMSQQEAYATPDVITG 449 >gb|EOY16714.1| Gag protease polyprotein [Theobroma cacao] Length = 447 Score = 93.2 bits (230), Expect = 1e-16 Identities = 62/200 (31%), Positives = 90/200 (45%), Gaps = 13/200 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL LKQGN +V EYE FN+L Y P ++ TE+ + N FE G Sbjct: 176 EFDSQYYTHFHKKEKKREFLSLKQGNLTVEEYETQFNELLSYVPDLVRTEQDQANYFEEG 235 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQNFQYNKKRQG---DETGGFKNKKE 496 LR IR R+T++ E + ++ A+RAEKL E+++ KR+ + K K Sbjct: 236 
LRNEIRERMTVTGREPHKEVVQMALRAEKLANENRRMRAELAKRKNLNMSFSQPLKRSKG 295 Query: 495 FTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVT----------CVHCGKNHYGECRL 346 S +V T C CG+ H G C Sbjct: 296 SFDSRSAPSVSVTSSRPSFSQMQQRLPRFSGSAVTTSEKSFGGFDRCRECGRFHGGVC-W 354 Query: 345 LTGKCFRCGEPGHIVRNCPK 286 +CF CG+ H NCP+ Sbjct: 355 GPLRCFHCGQMSHFRTNCPQ 374 >gb|EOX99639.1| Gag protease polyprotein [Theobroma cacao] Length = 413 Score = 92.8 bits (229), Expect = 1e-16 Identities = 60/200 (30%), Positives = 90/200 (45%), Gaps = 13/200 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 E Y ++ +K++EFL LKQG+ ++ EYE FN+L Y P ++ TE+ + FE G Sbjct: 172 ELDGQYYTHFHQKEKKREFLSLKQGSSTIEEYEARFNELMSYVPDLVKTEQDQVTYFEEG 231 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-------------NFQYNKKRQGD 526 LR IR+R+T++ E Y ++ A+R EKL E+KQ +F K+ D Sbjct: 232 LRNEIRDRMTVTGKEPYKEVVQMAMRVEKLAIENKQIRAEFAKMRNLSISFYQPSKKGKD 291 Query: 525 ETGGFKNKKEFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVTCVHCGKNHYGECRL 346 + ST+ S C +C K H G CR Sbjct: 292 LSTSGSTTAILVASTRPPSQQSQQRPSRFSRSATSAPGKSFKSFDRCRNCKKVHPGPCRE 351 Query: 345 LTGKCFRCGEPGHIVRNCPK 286 +CF+C + GHI CP+ Sbjct: 352 PV-RCFQCEQQGHIRSACPQ 370 >gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao] Length = 474 Score = 91.3 bits (225), Expect = 4e-16 Identities = 57/201 (28%), Positives = 94/201 (46%), Gaps = 14/201 (6%) Frame = -3 Query: 846 EFHDAYLPEVYRDQKEKEFLELKQGNRSVAEYEVLFNQLSHYAPHMIATEKKKCNRFEAG 667 EF Y ++ +K++EFL L+QGN ++ EYE FN+L Y P ++ +E+ + + FE G Sbjct: 202 EFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVPDLVKSEQDQASYFEEG 261 Query: 666 LRFGIRNRITLSDLESYSKLKAAAIRAEKLEAESKQ-NFQYNKKRQGDETGG---FKNKK 499 LR IR R+T++ E + ++ A+RAEKL E+++ ++ K+R + + + K Sbjct: 262 LRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKRRNPNVSSSQLPKRGKD 321 Query: 498 EFTQSTQXXXXXXXXXXXXXXXXXXXXXXXXXXSVVT----------CVHCGKNHYGECR 349 F T C CG+ H GEC Sbjct: 322 TFASENTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGGLNKCEKCGRYHVGECW 381 Query: 348 LLTGKCFRCGEPGHIVRNCPK 286 + +CF C + GHI +CP+ 
Sbjct: 382 GI--RCFHCDQSGHIRSDCPQ 400