BLASTX nr result
ID: Mentha28_contig00019938
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00019938 (1431 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...  186  2e-44
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...  183  1e-43
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...  169  3e-39
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  168  5e-39
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...  166  2e-38
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...  152  3e-34
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...  148  5e-33
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...  144  7e-32
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  140  1e-30
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...  132  3e-28
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...  128  5e-27
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...  127  1e-26
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...  126  2e-26
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...  126  3e-26
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...  123  2e-25
ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein A...  112  5e-22
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...  111  7e-22
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...
110 1e-21 ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232... 110 1e-21 dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like ... 110 2e-21 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 186 bits (473), Expect = 2e-44 Identities = 100/279 (35%), Positives = 147/279 (52%), Gaps = 7/279 (2%) Frame = -3 Query: 1396 RWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLWRDSQCP---- 1229 RWS +LS AG++ELIR+V+QG+ +W+ PLP +V+ I R FLW + Sbjct: 110 RWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIKP 169 Query: 1228 -VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWE 1052 V+W VC P+ EGGLGL +L WN AL S LW++H+K DSLW++ +H Y +G ++W+ Sbjct: 170 LVAWSEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWD 229 Query: 1051 FPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGT--SEAYEHFRVKGEK 878 F D+ + IRD +I N+ AK L W + T + Y++ R Sbjct: 230 FISSSSDSV----FIHIRD-IIISKEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPV 284 Query: 877 KFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCVLCESADETHDHLFFKC 698 W IW IP K S LWLA RL DR + C LC + E+H HLFF C Sbjct: 285 VHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSC 344 Query: 697 DKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGI 581 ++ VW+ I W+ + + ++ ++ R +A SG+ Sbjct: 345 RTSLRVWAHIRDWIPLKRQSISLQHSISALIRRRATSGV 383 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 183 bits (465), Expect = 1e-43 Identities = 102/313 (32%), Positives = 164/313 (52%), Gaps = 7/313 (2%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ +I I+ W+ LS AGRL+L+ SV+ + YWL P P +V+ +I + R FL Sbjct: 159 PLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICRIFL 218 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + PV+WK +C PR GGL + D+ +WNKA K LWN+ +K DSLW+KWI Sbjct: 219 WTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQ 278 Query: 1084 
AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905 A Y++ ++ D+ M IL+ R+ L +++ + ++ G + Y Sbjct: 279 AYYVKRSELMHIEMKNTDSWIMKAILKQREDL-----EKIDNMEELMIRGSINMG--KLY 331 Query: 904 EHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRM-KHSDI-ARGCVLCESA 731 + G++K W ++ + P+ + LWLA HGRL T DR+ K+ I + C C S Sbjct: 332 RKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCFC-SE 390 Query: 730 DETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALG 551 +E+ +HLFF CD + VW + W++ R++ + P+ + G G +A+ Sbjct: 391 EESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTKGKGTRAAVLKMAIA 450 Query: 550 ATVQYLWQARNLK 512 T+ +W RN K Sbjct: 451 ETIYEIWNIRNNK 463 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 169 bits (428), Expect = 3e-39 Identities = 98/310 (31%), Positives = 158/310 (50%), Gaps = 7/310 (2%) Frame = -3 Query: 1426 LLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLW 1247 L+ +I I WS LS AGR++LI+SV+ +W+Q LPLP VI RI + R FLW Sbjct: 602 LIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLW 661 Query: 1246 RDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHA 1082 + + P++W+ VC P+ GGL + +LA+WNK K LWN+ K+D+LWIKW+H Sbjct: 662 IGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWLHT 721 Query: 1081 EYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAYE 902 Y+RG IW + + M++++++R L+ ++++ F K + Y Sbjct: 722 YYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLL--------QYQSRMQDVFKMK---KIYL 770 Query: 901 HFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKH--SDIARGCVLCESAD 728 + EK W + + P+ LW A H RL + DR+ ++ C C S Sbjct: 771 ALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAFCSSM- 829 Query: 727 ETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALGA 548 E+H+HLFF C + +W+ + +WL+ + +T + R+ G G A Sbjct: 830 ESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRKCKGKGWRAMLLKCAFTE 889 Query: 547 TVQYLWQARN 518 T+ ++W RN Sbjct: 890 TIYHIWAYRN 899 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1110 Score = 168 bits (426), Expect = 5e-39 Identities = 107/333 (32%), Positives = 162/333 (48%), Gaps = 11/333 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ I+N Q W LS AGRL+LI+S+L ++ YW PL VI + K+ RKFL Sbjct: 773 PLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFL 832 Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + PV+W T+ P+ GG + ++ WN+A K LW I K D LW++WIH Sbjct: 833 WTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIH 892 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVG-WFAGKGTSEA 908 + Y++ DI + + I++ RD L N+ D +G F+ K +A Sbjct: 893 SYYIKRQDILTVNISNQTTWILRKIVKARDHL-----SNIGDWDEICIGDKFSMK---KA 944 Query: 907 YEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIA--RGCVLCES 734 Y+ GE+ W + I +Y PK LW+ LH RL T DR+ + LC + Sbjct: 945 YKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRN 1004 Query: 733 ADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTT---IPSAVRRFQREKAGSGIIRKAKW 563 ET HLFF C + VWS IC +R N + I S+V R+K G I+ Sbjct: 1005 DGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKKKGKLIV----- 1059 Query: 562 VALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464 + V +W+ RN + + + + ++++I Sbjct: 1060 MLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 166 bits (421), Expect = 2e-38 Identities = 115/336 (34%), Positives = 159/336 (47%), Gaps = 14/336 (4%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL +I + I W N LS AGRL+L+ SV+ + +W+ A LP I I ++ FL Sbjct: 1043 PLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFL 1102 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + + V+W VC P+ EGGLGLR L NK K +W + + SLW+ WI Sbjct: 1103 WSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQ 1162 Query: 1084 
AEYLRGLDIWEFPYPRRDAPHMTNILR-IRDQL-IFDCGGNLNDAKAKLV----GWFAGK 923 +R + E R H +IL I ++L C G + L G F K Sbjct: 1163 NNLIR--TVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAK 1220 Query: 922 GTS-EAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMK--HSDIARG 752 S E + R +G K W+KAIW S PKF+ WLA H RL T D+M + I+ Sbjct: 1221 FFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSV 1280 Query: 751 CVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRK 572 CVLC + E+ DHLFF C+ + +W + L T P+ + + SG R Sbjct: 1281 CVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDF-SGTKRF 1339 Query: 571 AKWVALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464 AT+ LW+ RN + P + HIIK I Sbjct: 1340 LLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375 >ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max] Length = 514 Score = 152 bits (385), Expect = 3e-34 Identities = 85/260 (32%), Positives = 135/260 (51%), Gaps = 9/260 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ +I I+ WS+ LS AGR++L+RS++ + YW+ P+P VI +I + R F+ Sbjct: 262 PLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSICRSFI 321 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W S + V+WK VC P GGL L +L +WN K LWNI +K D+LW+KWIH Sbjct: 322 WSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWVKWIH 381 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTS--E 911 A +L+G ++ + ++++ R Q +N+ + + + S + Sbjct: 382 AYFLKGDNVMSATIKSNSTWILKSVMKQRPQ--------VNNLQLVWIEMLRKRKFSMKQ 433 Query: 910 AYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARG--CVLCE 737 Y K W++ + + P+ +VTLWLA RL T R+K+ ++ + C LC+ Sbjct: 434 VYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQCSLCSLCK 493 Query: 736 SADETHDHLFFKCDKAMAVW 677 DE DHL F C A+W Sbjct: 494 EQDEDLDHLMFSCRVTKAIW 513 >ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max] Length = 947 Score = 148 bits (374), Expect = 5e-33 Identities = 79/266 (29%), Positives = 139/266 (52%), Gaps = 
12/266 (4%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ +I+ I+ W++ L+ GR++++ + + +W+Q LP+P +VI +I M R F+ Sbjct: 601 PLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSVIKKIDSMCRSFV 660 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W S + P++W +VC P+ +GGL + +L VWN LWN+ K D+LW+KWIH Sbjct: 661 WSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLWVKWIH 720 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRD-----QLIFDCGGNLNDAKAKLVGWFAGKG 920 A Y++ + + + N+L R+ Q ++D LN + K+ Sbjct: 721 AHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWD--ELLNSERFKM-------- 770 Query: 919 TSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCV-- 746 +AY+ ++ ++ W + ++ P+ T WLA HGRL T DR+ + + Sbjct: 771 -KKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMITDKIWS 828 Query: 745 LCESADETHDHLFFKCDKAMAVWSGI 668 LC+ +ET +H+ F C A +WS + Sbjct: 829 LCKEVEETQNHILFSCKVATDIWSNV 854 >ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max] Length = 477 Score = 144 bits (364), Expect = 7e-32 Identities = 93/325 (28%), Positives = 140/325 (43%), Gaps = 2/325 (0%) Frame = -3 Query: 1414 ISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLWRDSQ 1235 I++ IQ WS+ LS AG++ELIR+V+QG+ +W PLP V+ RI R FLW ++ Sbjct: 182 ITSLIQGWSSKTLSYAGKVELIRAVIQGIANFWTDIFPLPQFVLDRINVSYRNFLWGKAE 241 Query: 1234 CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIW 1055 +H Y +G ++W Sbjct: 242 ------------------------------------------------VHHNYFKGGNVW 253 Query: 1054 EFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKG--TSEAYEHFRVKGE 881 +F D+ + I+ IRD +I N+ AK L W + + +AY++ R Sbjct: 254 DFISSASDSVLIKKIIHIRD-IITIKEDNVEAAKQTLNSWNSNEQLLAGKAYDYIRGVKP 312 Query: 880 KKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCVLCESADETHDHLFFK 701 W +W IP K S LWLA L T DR + C LC + ++H HLFF Sbjct: 313 AVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFS 372 Query: 700 CDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQAR 521 C ++ VW+ I W+ + ++ + +A SG 
K + +AL V W +R Sbjct: 373 CRISLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLALAIAVYCTWISR 432 Query: 520 NLKYVEKKPFEASHIIKEIKLDVYR 446 NL E PF +II +IK VY+ Sbjct: 433 NLLLFENSPFSVINIINKIKFLVYK 457 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 140 bits (353), Expect = 1e-30 Identities = 93/330 (28%), Positives = 153/330 (46%), Gaps = 8/330 (2%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ +I+ Q W LS AGRL+L++++L ++ YW Q PLP +I + RKFL Sbjct: 776 PLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFL 835 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + + PV+W + P+ GGL + ++ +WNKA K LW I K D LW++W++ Sbjct: 836 WTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVN 895 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905 A Y++ +I + + I R +L+ GG + V + Y Sbjct: 896 AYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGG------WEAVSNHMNFSIKKTY 948 Query: 904 EHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMK--HSDIARGCVLCESA 731 + + E W + I + PK LWLA+ RL T +R+ + D++ C +C + Sbjct: 949 KLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGNE 1008 Query: 730 DETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVAL- 554 ET HLFF C + +W + +L + + A + +KA S R +V + Sbjct: 1009 IETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA--QAKKELAIKKARSTKDRNKLYVMMF 1066 Query: 553 GATVQYLWQARNLKYVEKKPFEASHIIKEI 464 +V +W RN K + +K I Sbjct: 1067 TESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. 
vesca] Length = 958 Score = 132 bits (333), Expect = 3e-28 Identities = 97/333 (29%), Positives = 149/333 (44%), Gaps = 11/333 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL +I I+ W N LS AGRL+LI+SVL ++ YW L LP V+ I K LR FL Sbjct: 610 PLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFL 669 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + V+W +CLP+ EGGLG++DL WNKAL +WN+ + + + W W+ Sbjct: 670 WAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVK 729 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905 L+G W P P + + +L+IR+ C +N ++G G+ TS + Sbjct: 730 VYLLKGNSFWNAPLPSICSWNWRKLLKIRE---LCCSFFVN-----IIG--DGRATSLWF 779 Query: 904 EHFRVKGEKKFWYKAIWRSYI--PPKFSVTLWLALHGRLKTFDRMKHSDIARGCV----L 743 +++ G W S I S + L +G T +R V L Sbjct: 780 DNWHPLGP----LTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRL 835 Query: 742 CESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKW 563 ETH+HLFF C + +W+ + S + + G+ + Sbjct: 836 VWFVAETHNHLFFDCAYSFGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILK 895 Query: 562 VALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464 +AL A V +W+ RN + + + + K I Sbjct: 896 LALQAVVYAIWRERNNRRFRNESLPPAVVFKGI 928 >ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max] Length = 303 Score = 128 bits (322), Expect = 5e-27 Identities = 69/170 (40%), Positives = 97/170 (57%), Gaps = 5/170 (2%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL++I+ IQ WS +LS AG+LELIR+V+QG+ +W+ PLP +V+ RI R FL Sbjct: 111 PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCRNFL 170 Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + V+W VC P+ EGGLGL +L WN AL S LW+ H K DSL W+H Sbjct: 171 WGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSL---WVH 227 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGW 935 Y R D+W + + + I++IRD I + +AK ++ W Sbjct: 228 
HYYFRRSDVWNYNTSSSYSVLIKKIIQIRD-FIISKELSTEEAKKRIQSW 276 >ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 316 Score = 127 bits (319), Expect = 1e-26 Identities = 64/170 (37%), Positives = 96/170 (56%), Gaps = 5/170 (2%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL +I IQ W+ +LS G+LELI++V+QG+ +W++ PLP +V+ RI FL Sbjct: 144 PLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCCNFL 203 Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + V+W VC P+ EGGLGL +L WN AL S LW+ H K DSL ++W+H Sbjct: 204 WSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVRWVH 263 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGW 935 Y R D W + ++ + I++IRD I ++ + K ++ W Sbjct: 264 HYYFRRSDEWNYNISSSNSVLIKKIIQIRD-FIISKELSMEETKKRIQSW 312 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 126 bits (317), Expect = 2e-26 Identities = 106/409 (25%), Positives = 161/409 (39%), Gaps = 87/409 (21%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ QI I W++ LS AGRL LI SVL + +W+ A LP I+ I ++ L Sbjct: 509 PLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSALL 568 Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLA----------VWNKALHSKTLW----- 1130 W + VSW +C P+ EGGLGL+ L +W +LW Sbjct: 569 WSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWTR 628 Query: 1129 -NIHAKADSLWI--------KWI------HAEYLRGL---------------DIWEFPYP 1040 N+ K +S W WI H E + D W P Sbjct: 629 MNL-LKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGP 687 Query: 1039 ---------------------------RRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLV 941 RR H IL ++++ + N + Sbjct: 688 LINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDAI 747 Query: 940 GWFAGK--------GTSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTF 785 W GK T + + H R ++ W+K +W ++ PKFS WLA+ RL T Sbjct: 748 
LW-RGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTG 806 Query: 784 DRMK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRR 611 DRM ++ CV C S ET DHLFF+C + +W+ I + ++ +T SAV Sbjct: 807 DRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVN 865 Query: 610 FQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464 + + I ++ +W+ RN + +K AS++I++I Sbjct: 866 YISDSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQI 914 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 126 bits (316), Expect = 3e-26 Identities = 110/411 (26%), Positives = 165/411 (40%), Gaps = 92/411 (22%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL QI N I W++ LS AGRL LI SVL +W+ A LP + I + FL Sbjct: 193 PLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAFL 252 Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLA----------VWNKALHSKTLW----- 1130 W + VSW +C P+ EGGLGLR L +W + +LW Sbjct: 253 WSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSK 312 Query: 1129 -NIHAKADSLWI--------KWIHAEYLRGLDIWEFPYPRRDAP---------------- 1025 N+ K +S W W+ + L+ + + P+ R + Sbjct: 313 MNL-LKQESFWSLTPNSSLGSWMWKKMLKYRETAK-PFSRVEVNNGARTSFWFDNWSGMG 370 Query: 1024 HMTNILRIRDQLIFDCGGN-------------------LNDAKAKL-------------V 941 H+ ++ R Q+ N LND +A L Sbjct: 371 HLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDA 430 Query: 940 GWFAGKG--------TSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTF 785 + GKG T + + R K + WYK +W S+ PK+ WLAL RL T Sbjct: 431 TLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTG 490 Query: 784 DRMK----HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNEMTTIP 626 RM+ SD+ C C ++ ET DHLFF C A A+W+ I + R + TI Sbjct: 491 YRMQLWNNGSDVK--CTFCSTSIETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTIV 548 Query: 625 SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHII 473 + + Q ++ S + R TV +W+ RN + ++P ++++I Sbjct: 549 NYISETQTDRIRSFLSR----YIFQLTVHTVWKERNDRRHGEEPRTSANLI 595 >ref|XP_006584200.1| PREDICTED: putative ribonuclease H 
protein At1g65750-like [Glycine max] Length = 239 Score = 123 bits (308), Expect = 2e-25 Identities = 56/129 (43%), Positives = 80/129 (62%), Gaps = 5/129 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL++I+ IQ WS +LS AG+LELIR+V+QG+ +W++ PL +V+ RI FL Sbjct: 111 PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSVLDRINASCCNFL 170 Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + ++W VC P+ EGGLGL +L WN L S+ LW+ H K D LW++W+H Sbjct: 171 WGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFHCKKDFLWVRWVH 230 Query: 1084 AEYLRGLDI 1058 Y R D+ Sbjct: 231 HYYFRASDV 239 >ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial [Cucumis sativus] Length = 647 Score = 112 bits (279), Expect = 5e-22 Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 8/176 (4%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ I++ I+ WS LS A L+L+R VL+ ++ YW LP V + K+LR +L Sbjct: 240 PLIQCITSRIRSWSARVLSFASSLQLVRLVLRSLQVYWASVFMLPMKVHKDVDKILRSYL 299 Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 WR + V+W VCLP DEGGL + D + WNKA K LW + K+ SLW+ W+ Sbjct: 300 WRGKEEGRGGAKVAWDEVCLPFDEGGLAICDGSSWNKASTLKILWLLLVKSGSLWVAWVE 359 Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCG---GNLNDAKAKLVGWFAG 926 A L+G +WE + ILR RD L GN+ + L W G Sbjct: 360 AYILKGRSLWEIDAGAGRSWCFRAILRKRDILKAHVEMKLGNVRKCRMLLDAWIQG 415 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 111 bits (278), Expect = 7e-22 Identities = 107/380 (28%), Positives = 155/380 (40%), Gaps = 46/380 (12%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PLL +++ + WS LS AGR++LI SV+ G+ +W+ LP + RI + +FL Sbjct: 714 PLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWISTFILPKGCVKRIEALCARFL 773 Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLW----- 1100 W + V+W VCLP++EGG+GLR V 
N TLW+ K S W Sbjct: 774 WSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLN-----TTLWD--GKKISFWFDNWS 826 Query: 1099 -----IKWIHAEYLRGLDI--------------WEFPYPRRDAP-----HMTNILRIRDQ 992 K + R L I W PR D H+T I Sbjct: 827 PLGPLFKLFGSSGPRALCIPIQAKVADACSDVGWLISPPRTDQALALLIHLTTI------ 880 Query: 991 LIFDCGGNLNDAKAKLVGWFAGKGTSEA--YEHFRVKGEKKFWYKAIWRSYIPPKFSVTL 818 C + D +V F G S A +E R K K W K++W PK + + Sbjct: 881 -ALPCFDSSPDTFVWIVDDFTCHGFSAARTWEAMRPKKPVKDWTKSVWFKGSVPKHAFNM 939 Query: 817 WLALHGRLKTFDRMKHSDI--ARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRN 644 W++ RL T R+ + C LC S E+ DHL C + +W + + R Sbjct: 940 WVSHLNRLPTRQRLAAWGVTTTTDCCLCSSRPESRDHLLLYCVFSAVIWKLV--FFRLTP 997 Query: 643 EMTTIPS-----AVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN---LKYVEKKPFE 488 S + R KA S ++RK +A A+V +LW+ RN + P Sbjct: 998 SQAIFNSWAELLSWTRINSSKAPS-LLRK---IAAQASVFHLWKQRNNVLHNSIFISPAT 1053 Query: 487 ASHIIKEIKLDVYRVLYSLF 428 H I ++YR + LF Sbjct: 1054 VFHFIDRELENLYRYIQILF 1073 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 110 bits (276), Expect = 1e-21 Identities = 50/131 (38%), Positives = 77/131 (58%), Gaps = 5/131 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ ++ I W+ LS AGR +L+++VL GV+ W Q +P +I I + R +L Sbjct: 581 PLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYL 640 Query: 1249 WRD-----SQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W + ++W VC P+ EGGLGL +L +WN++ +K W++ K D LWIKWIH Sbjct: 641 WSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIH 700 Query: 1084 AEYLRGLDIWE 1052 A Y++G W+ Sbjct: 701 AYYIKGQREWK 711 >ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis sativus] Length = 382 Score = 110 bits (276), Expect = 1e-21 Identities = 55/126 (43%), Positives = 77/126 (61%), Gaps = 5/126 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ +I++ I+ WS LS AGRL+L+RSVL+ ++ YW LP V + K+LR +L Sbjct: 55 
PLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPMKVHRDVDKILRSYL 114 Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 WR + V+W VCLP DEGGL +RD + WN A K LW + K+ SLW+ W+ Sbjct: 115 WRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVE 174 Query: 1084 AEYLRG 1067 A L+G Sbjct: 175 AYILKG 180 >dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 489 Score = 110 bits (274), Expect = 2e-21 Identities = 64/163 (39%), Positives = 82/163 (50%), Gaps = 6/163 (3%) Frame = -3 Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250 PL+ I I WS LS AGRL LI SVL + +W+ A LP I I KM +L Sbjct: 170 PLIEHIKKKIGSWSARFLSYAGRLNLISSVLWSICNFWMGAFRLPRECIREIDKMCSAYL 229 Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085 W S+ ++W VC P+DEGGLGLR L N K +W I + ADSLW+KWIH Sbjct: 230 WSGGDLNTSKAKIAWTDVCKPKDEGGLGLRSLKEANDVSCLKLIWRIISHADSLWVKWIH 289 Query: 1084 AEYLRGLDIWEFPYPRRDAPHM-TNILRIRDQLIFDCGGNLND 959 A L+ + W M +L+ RD I C +N+ Sbjct: 290 ATLLKQVSFWAVRENTSLGSWMWKKVLKFRDAAIQLCKAEVNN 332