BLASTX nr result

ID: Mentha28_contig00019938 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00019938
         (1431 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   186   2e-44
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   183   1e-43
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   169   3e-39
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   168   5e-39
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   166   2e-38
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   152   3e-34
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   148   5e-33
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...   144   7e-32
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   140   1e-30
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   132   3e-28
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   128   5e-27
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   127   1e-26
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   126   2e-26
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   126   3e-26
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   123   2e-25
ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein A...   112   5e-22
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   111   7e-22
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   110   1e-21
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...   110   1e-21
dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like ...   110   2e-21

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  186 bits (473), Expect = 2e-44
 Identities = 100/279 (35%), Positives = 147/279 (52%), Gaps = 7/279 (2%)
 Frame = -3

Query: 1396 RWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLWRDSQCP---- 1229
            RWS  +LS AG++ELIR+V+QG+  +W+   PLP +V+  I    R FLW  +       
Sbjct: 110  RWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIKP 169

Query: 1228 -VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWE 1052
             V+W  VC P+ EGGLGL +L  WN AL S  LW++H+K DSLW++ +H  Y +G ++W+
Sbjct: 170  LVAWSEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWD 229

Query: 1051 FPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGT--SEAYEHFRVKGEK 878
            F     D+      + IRD +I     N+  AK  L  W   + T   + Y++ R     
Sbjct: 230  FISSSSDSV----FIHIRD-IIISKEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPV 284

Query: 877  KFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCVLCESADETHDHLFFKC 698
              W   IW   IP K S  LWLA   RL   DR    +    C LC +  E+H HLFF C
Sbjct: 285  VHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSC 344

Query: 697  DKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGI 581
              ++ VW+ I  W+  + +  ++  ++    R +A SG+
Sbjct: 345  RTSLRVWAHIRDWIPLKRQSISLQHSISALIRRRATSGV 383


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  183 bits (465), Expect = 1e-43
 Identities = 102/313 (32%), Positives = 164/313 (52%), Gaps = 7/313 (2%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ +I   I+ W+   LS AGRL+L+ SV+  +  YWL   P P +V+ +I  + R FL
Sbjct: 159  PLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICRIFL 218

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W        + PV+WK +C PR  GGL + D+ +WNKA   K LWN+ +K DSLW+KWI 
Sbjct: 219  WTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQ 278

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905
            A Y++  ++        D+  M  IL+ R+ L       +++ +  ++      G  + Y
Sbjct: 279  AYYVKRSELMHIEMKNTDSWIMKAILKQREDL-----EKIDNMEELMIRGSINMG--KLY 331

Query: 904  EHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRM-KHSDI-ARGCVLCESA 731
               +  G++K W   ++ +   P+ +  LWLA HGRL T DR+ K+  I  + C  C S 
Sbjct: 332  RKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCFC-SE 390

Query: 730  DETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALG 551
            +E+ +HLFF CD +  VW  +  W++ R++ +  P+ +        G G       +A+ 
Sbjct: 391  EESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTKGKGTRAAVLKMAIA 450

Query: 550  ATVQYLWQARNLK 512
             T+  +W  RN K
Sbjct: 451  ETIYEIWNIRNNK 463


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  169 bits (428), Expect = 3e-39
 Identities = 98/310 (31%), Positives = 158/310 (50%), Gaps = 7/310 (2%)
 Frame = -3

Query: 1426 LLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLW 1247
            L+ +I   I  WS   LS AGR++LI+SV+     +W+Q LPLP  VI RI  + R FLW
Sbjct: 602  LIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLW 661

Query: 1246 RDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHA 1082
              +     + P++W+ VC P+  GGL + +LA+WNK    K LWN+  K+D+LWIKW+H 
Sbjct: 662  IGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWLHT 721

Query: 1081 EYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAYE 902
             Y+RG  IW     +  +  M++++++R  L+          ++++   F  K   + Y 
Sbjct: 722  YYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLL--------QYQSRMQDVFKMK---KIYL 770

Query: 901  HFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKH--SDIARGCVLCESAD 728
                + EK  W   +  +   P+    LW A H RL + DR+     ++   C  C S  
Sbjct: 771  ALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAFCSSM- 829

Query: 727  ETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALGA 548
            E+H+HLFF C +   +W+ + +WL+  +  +T    +    R+  G G        A   
Sbjct: 830  ESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRKCKGKGWRAMLLKCAFTE 889

Query: 547  TVQYLWQARN 518
            T+ ++W  RN
Sbjct: 890  TIYHIWAYRN 899


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  168 bits (426), Expect = 5e-39
 Identities = 107/333 (32%), Positives = 162/333 (48%), Gaps = 11/333 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+  I+N  Q W    LS AGRL+LI+S+L  ++ YW    PL   VI  + K+ RKFL
Sbjct: 773  PLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFL 832

Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W        + PV+W T+  P+  GG  + ++  WN+A   K LW I  K D LW++WIH
Sbjct: 833  WTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIH 892

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVG-WFAGKGTSEA 908
            + Y++  DI       +    +  I++ RD L      N+ D     +G  F+ K   +A
Sbjct: 893  SYYIKRQDILTVNISNQTTWILRKIVKARDHL-----SNIGDWDEICIGDKFSMK---KA 944

Query: 907  YEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIA--RGCVLCES 734
            Y+     GE+  W + I  +Y  PK    LW+ LH RL T DR+    +       LC +
Sbjct: 945  YKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRN 1004

Query: 733  ADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTT---IPSAVRRFQREKAGSGIIRKAKW 563
              ET  HLFF C  +  VWS IC  +R  N   +   I S+V    R+K G  I+     
Sbjct: 1005 DGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKKKGKLIV----- 1059

Query: 562  VALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464
            +     V  +W+ RN +    +  + + ++++I
Sbjct: 1060 MLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  166 bits (421), Expect = 2e-38
 Identities = 115/336 (34%), Positives = 159/336 (47%), Gaps = 14/336 (4%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL +I + I  W N  LS AGRL+L+ SV+  +  +W+ A  LP   I  I ++   FL
Sbjct: 1043 PLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFL 1102

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +     +  V+W  VC P+ EGGLGLR L   NK    K +W + +   SLW+ WI 
Sbjct: 1103 WSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQ 1162

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILR-IRDQL-IFDCGGNLNDAKAKLV----GWFAGK 923
               +R   + E     R   H  +IL  I ++L    C G   +    L     G F  K
Sbjct: 1163 NNLIR--TVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAK 1220

Query: 922  GTS-EAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMK--HSDIARG 752
              S E +   R +G  K W+KAIW S   PKF+   WLA H RL T D+M   +  I+  
Sbjct: 1221 FFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSV 1280

Query: 751  CVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRK 572
            CVLC  + E+ DHLFF C+ +  +W  +   L      T  P+ +     +   SG  R 
Sbjct: 1281 CVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDF-SGTKRF 1339

Query: 571  AKWVALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464
                   AT+  LW+ RN +     P  + HIIK I
Sbjct: 1340 LLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  152 bits (385), Expect = 3e-34
 Identities = 85/260 (32%), Positives = 135/260 (51%), Gaps = 9/260 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ +I   I+ WS+  LS AGR++L+RS++  +  YW+   P+P  VI +I  + R F+
Sbjct: 262  PLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSICRSFI 321

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  S     +  V+WK VC P   GGL L +L +WN     K LWNI +K D+LW+KWIH
Sbjct: 322  WSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWVKWIH 381

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTS--E 911
            A +L+G ++            + ++++ R Q        +N+ +   +     +  S  +
Sbjct: 382  AYFLKGDNVMSATIKSNSTWILKSVMKQRPQ--------VNNLQLVWIEMLRKRKFSMKQ 433

Query: 910  AYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARG--CVLCE 737
             Y        K  W++ +  +   P+ +VTLWLA   RL T  R+K+ ++ +   C LC+
Sbjct: 434  VYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQCSLCSLCK 493

Query: 736  SADETHDHLFFKCDKAMAVW 677
              DE  DHL F C    A+W
Sbjct: 494  EQDEDLDHLMFSCRVTKAIW 513


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  148 bits (374), Expect = 5e-33
 Identities = 79/266 (29%), Positives = 139/266 (52%), Gaps = 12/266 (4%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ +I+  I+ W++  L+  GR++++   +  +  +W+Q LP+P +VI +I  M R F+
Sbjct: 601  PLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSVIKKIDSMCRSFV 660

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  S     + P++W +VC P+ +GGL + +L VWN       LWN+  K D+LW+KWIH
Sbjct: 661  WSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLWVKWIH 720

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRD-----QLIFDCGGNLNDAKAKLVGWFAGKG 920
            A Y++   +         +  + N+L  R+     Q ++D    LN  + K+        
Sbjct: 721  AHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWD--ELLNSERFKM-------- 770

Query: 919  TSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCV-- 746
              +AY+   ++ ++  W   + ++   P+   T WLA HGRL T DR+    +    +  
Sbjct: 771  -KKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMITDKIWS 828

Query: 745  LCESADETHDHLFFKCDKAMAVWSGI 668
            LC+  +ET +H+ F C  A  +WS +
Sbjct: 829  LCKEVEETQNHILFSCKVATDIWSNV 854


>ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max]
          Length = 477

 Score =  144 bits (364), Expect = 7e-32
 Identities = 93/325 (28%), Positives = 140/325 (43%), Gaps = 2/325 (0%)
 Frame = -3

Query: 1414 ISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFLWRDSQ 1235
            I++ IQ WS+  LS AG++ELIR+V+QG+  +W    PLP  V+ RI    R FLW  ++
Sbjct: 182  ITSLIQGWSSKTLSYAGKVELIRAVIQGIANFWTDIFPLPQFVLDRINVSYRNFLWGKAE 241

Query: 1234 CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIW 1055
                                                            +H  Y +G ++W
Sbjct: 242  ------------------------------------------------VHHNYFKGGNVW 253

Query: 1054 EFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKG--TSEAYEHFRVKGE 881
            +F     D+  +  I+ IRD +I     N+  AK  L  W + +     +AY++ R    
Sbjct: 254  DFISSASDSVLIKKIIHIRD-IITIKEDNVEAAKQTLNSWNSNEQLLAGKAYDYIRGVKP 312

Query: 880  KKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMKHSDIARGCVLCESADETHDHLFFK 701
               W   +W   IP K S  LWLA    L T DR    +    C LC +  ++H HLFF 
Sbjct: 313  AVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFS 372

Query: 700  CDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQAR 521
            C  ++ VW+ I  W+    +  ++   +      +A SG   K + +AL   V   W +R
Sbjct: 373  CRISLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLALAIAVYCTWISR 432

Query: 520  NLKYVEKKPFEASHIIKEIKLDVYR 446
            NL   E  PF   +II +IK  VY+
Sbjct: 433  NLLLFENSPFSVINIINKIKFLVYK 457


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  140 bits (353), Expect = 1e-30
 Identities = 93/330 (28%), Positives = 153/330 (46%), Gaps = 8/330 (2%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ +I+   Q W    LS AGRL+L++++L  ++ YW Q  PLP  +I  +    RKFL
Sbjct: 776  PLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFL 835

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +     + PV+W  +  P+  GGL + ++ +WNKA   K LW I  K D LW++W++
Sbjct: 836  WTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVN 895

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905
            A Y++  +I         +  +  I   R +L+   GG       + V         + Y
Sbjct: 896  AYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGG------WEAVSNHMNFSIKKTY 948

Query: 904  EHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTFDRMK--HSDIARGCVLCESA 731
            +  +   E   W + I  +   PK    LWLA+  RL T +R+   + D++  C +C + 
Sbjct: 949  KLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGNE 1008

Query: 730  DETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKWVAL- 554
             ET  HLFF C  +  +W  +  +L  + +      A +    +KA S   R   +V + 
Sbjct: 1009 IETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA--QAKKELAIKKARSTKDRNKLYVMMF 1066

Query: 553  GATVQYLWQARNLKYVEKKPFEASHIIKEI 464
              +V  +W  RN K         +  +K I
Sbjct: 1067 TESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  132 bits (333), Expect = 3e-28
 Identities = 97/333 (29%), Positives = 149/333 (44%), Gaps = 11/333 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL +I   I+ W N  LS AGRL+LI+SVL  ++ YW   L LP  V+  I K LR FL
Sbjct: 610  PLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFL 669

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +        V+W  +CLP+ EGGLG++DL  WNKAL    +WN+ + + + W  W+ 
Sbjct: 670  WAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVK 729

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGWFAGKGTSEAY 905
               L+G   W  P P   + +   +L+IR+     C   +N     ++G   G+ TS  +
Sbjct: 730  VYLLKGNSFWNAPLPSICSWNWRKLLKIRE---LCCSFFVN-----IIG--DGRATSLWF 779

Query: 904  EHFRVKGEKKFWYKAIWRSYI--PPKFSVTLWLALHGRLKTFDRMKHSDIARGCV----L 743
            +++   G         W S I      S +  L  +G   T         +R  V    L
Sbjct: 780  DNWHPLGP----LTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRL 835

Query: 742  CESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRRFQREKAGSGIIRKAKW 563
                 ETH+HLFF C  +  +W+ + S       +      +        G+ +      
Sbjct: 836  VWFVAETHNHLFFDCAYSFGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILK 895

Query: 562  VALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464
            +AL A V  +W+ RN +    +    + + K I
Sbjct: 896  LALQAVVYAIWRERNNRRFRNESLPPAVVFKGI 928


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  128 bits (322), Expect = 5e-27
 Identities = 69/170 (40%), Positives = 97/170 (57%), Gaps = 5/170 (2%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL++I+  IQ WS  +LS AG+LELIR+V+QG+  +W+   PLP +V+ RI    R FL
Sbjct: 111  PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCRNFL 170

Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +        V+W  VC P+ EGGLGL +L  WN AL S  LW+ H K DSL   W+H
Sbjct: 171  WGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSL---WVH 227

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGW 935
              Y R  D+W +      +  +  I++IRD  I     +  +AK ++  W
Sbjct: 228  HYYFRRSDVWNYNTSSSYSVLIKKIIQIRD-FIISKELSTEEAKKRIQSW 276


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
            max]
          Length = 316

 Score =  127 bits (319), Expect = 1e-26
 Identities = 64/170 (37%), Positives = 96/170 (56%), Gaps = 5/170 (2%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL +I   IQ W+  +LS  G+LELI++V+QG+  +W++  PLP +V+ RI      FL
Sbjct: 144  PLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCCNFL 203

Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +        V+W  VC P+ EGGLGL +L  WN AL S  LW+ H K DSL ++W+H
Sbjct: 204  WSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVRWVH 263

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLVGW 935
              Y R  D W +     ++  +  I++IRD  I     ++ + K ++  W
Sbjct: 264  HYYFRRSDEWNYNISSSNSVLIKKIIQIRD-FIISKELSMEETKKRIQSW 312


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  126 bits (317), Expect = 2e-26
 Identities = 106/409 (25%), Positives = 161/409 (39%), Gaps = 87/409 (21%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ QI   I  W++  LS AGRL LI SVL  +  +W+ A  LP   I+ I ++    L
Sbjct: 509  PLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSALL 568

Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLA----------VWNKALHSKTLW----- 1130
            W        +  VSW  +C P+ EGGLGL+ L           +W       +LW     
Sbjct: 569  WSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWTR 628

Query: 1129 -NIHAKADSLWI--------KWI------HAEYLRGL---------------DIWEFPYP 1040
             N+  K +S W          WI      H E  +                 D W    P
Sbjct: 629  MNL-LKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGP 687

Query: 1039 ---------------------------RRDAPHMTNILRIRDQLIFDCGGNLNDAKAKLV 941
                                       RR   H   IL   ++++     + N      +
Sbjct: 688  LINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDAI 747

Query: 940  GWFAGK--------GTSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTF 785
             W  GK         T + + H R    ++ W+K +W ++  PKFS   WLA+  RL T 
Sbjct: 748  LW-RGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTG 806

Query: 784  DRMK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNEMTTIPSAVRR 611
            DRM   ++     CV C S  ET DHLFF+C  +  +W+ I   +  ++  +T  SAV  
Sbjct: 807  DRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVN 865

Query: 610  FQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHIIKEI 464
            +  +     I           ++  +W+ RN +   +K   AS++I++I
Sbjct: 866  YISDSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQI 914


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  126 bits (316), Expect = 3e-26
 Identities = 110/411 (26%), Positives = 165/411 (40%), Gaps = 92/411 (22%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL  QI N I  W++  LS AGRL LI SVL     +W+ A  LP   +  I  +   FL
Sbjct: 193  PLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAFL 252

Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLA----------VWNKALHSKTLW----- 1130
            W   +       VSW  +C P+ EGGLGLR L           +W    +  +LW     
Sbjct: 253  WSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSK 312

Query: 1129 -NIHAKADSLWI--------KWIHAEYLRGLDIWEFPYPRRDAP---------------- 1025
             N+  K +S W          W+  + L+  +  + P+ R +                  
Sbjct: 313  MNL-LKQESFWSLTPNSSLGSWMWKKMLKYRETAK-PFSRVEVNNGARTSFWFDNWSGMG 370

Query: 1024 HMTNILRIRDQLIFDCGGN-------------------LNDAKAKL-------------V 941
            H+ ++   R Q+      N                   LND +A L              
Sbjct: 371  HLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDA 430

Query: 940  GWFAGKG--------TSEAYEHFRVKGEKKFWYKAIWRSYIPPKFSVTLWLALHGRLKTF 785
              + GKG        T + +   R K  +  WYK +W S+  PK+    WLAL  RL T 
Sbjct: 431  TLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTG 490

Query: 784  DRMK----HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNEMTTIP 626
             RM+     SD+   C  C ++ ET DHLFF C  A A+W+ I   +   R   +  TI 
Sbjct: 491  YRMQLWNNGSDVK--CTFCSTSIETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTIV 548

Query: 625  SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHII 473
            + +   Q ++  S + R         TV  +W+ RN +   ++P  ++++I
Sbjct: 549  NYISETQTDRIRSFLSR----YIFQLTVHTVWKERNDRRHGEEPRTSANLI 595


>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
            max]
          Length = 239

 Score =  123 bits (308), Expect = 2e-25
 Identities = 56/129 (43%), Positives = 80/129 (62%), Gaps = 5/129 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL++I+  IQ WS  +LS AG+LELIR+V+QG+  +W++  PL  +V+ RI      FL
Sbjct: 111  PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSVLDRINASCCNFL 170

Query: 1249 WRDSQCP-----VSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W  +        ++W  VC P+ EGGLGL +L  WN  L S+ LW+ H K D LW++W+H
Sbjct: 171  WGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFHCKKDFLWVRWVH 230

Query: 1084 AEYLRGLDI 1058
              Y R  D+
Sbjct: 231  HYYFRASDV 239


>ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial
            [Cucumis sativus]
          Length = 647

 Score =  112 bits (279), Expect = 5e-22
 Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 8/176 (4%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+  I++ I+ WS   LS A  L+L+R VL+ ++ YW     LP  V   + K+LR +L
Sbjct: 240  PLIQCITSRIRSWSARVLSFASSLQLVRLVLRSLQVYWASVFMLPMKVHKDVDKILRSYL 299

Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            WR  +       V+W  VCLP DEGGL + D + WNKA   K LW +  K+ SLW+ W+ 
Sbjct: 300  WRGKEEGRGGAKVAWDEVCLPFDEGGLAICDGSSWNKASTLKILWLLLVKSGSLWVAWVE 359

Query: 1084 AEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIFDCG---GNLNDAKAKLVGWFAG 926
            A  L+G  +WE       +     ILR RD L        GN+   +  L  W  G
Sbjct: 360  AYILKGRSLWEIDAGAGRSWCFRAILRKRDILKAHVEMKLGNVRKCRMLLDAWIQG 415


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  111 bits (278), Expect = 7e-22
 Identities = 107/380 (28%), Positives = 155/380 (40%), Gaps = 46/380 (12%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PLL +++   + WS   LS AGR++LI SV+ G+  +W+    LP   + RI  +  +FL
Sbjct: 714  PLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWISTFILPKGCVKRIEALCARFL 773

Query: 1249 WRDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLW----- 1100
            W  +        V+W  VCLP++EGG+GLR   V N      TLW+   K  S W     
Sbjct: 774  WSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLN-----TTLWD--GKKISFWFDNWS 826

Query: 1099 -----IKWIHAEYLRGLDI--------------WEFPYPRRDAP-----HMTNILRIRDQ 992
                  K   +   R L I              W    PR D       H+T I      
Sbjct: 827  PLGPLFKLFGSSGPRALCIPIQAKVADACSDVGWLISPPRTDQALALLIHLTTI------ 880

Query: 991  LIFDCGGNLNDAKAKLVGWFAGKGTSEA--YEHFRVKGEKKFWYKAIWRSYIPPKFSVTL 818
                C  +  D    +V  F   G S A  +E  R K   K W K++W     PK +  +
Sbjct: 881  -ALPCFDSSPDTFVWIVDDFTCHGFSAARTWEAMRPKKPVKDWTKSVWFKGSVPKHAFNM 939

Query: 817  WLALHGRLKTFDRMKHSDI--ARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRN 644
            W++   RL T  R+    +     C LC S  E+ DHL   C  +  +W  +  + R   
Sbjct: 940  WVSHLNRLPTRQRLAAWGVTTTTDCCLCSSRPESRDHLLLYCVFSAVIWKLV--FFRLTP 997

Query: 643  EMTTIPS-----AVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN---LKYVEKKPFE 488
                  S     +  R    KA S ++RK   +A  A+V +LW+ RN      +   P  
Sbjct: 998  SQAIFNSWAELLSWTRINSSKAPS-LLRK---IAAQASVFHLWKQRNNVLHNSIFISPAT 1053

Query: 487  ASHIIKEIKLDVYRVLYSLF 428
              H I     ++YR +  LF
Sbjct: 1054 VFHFIDRELENLYRYIQILF 1073


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  110 bits (276), Expect = 1e-21
 Identities = 50/131 (38%), Positives = 77/131 (58%), Gaps = 5/131 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ ++   I  W+   LS AGR +L+++VL GV+  W Q   +P  +I  I  + R +L
Sbjct: 581  PLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYL 640

Query: 1249 WRD-----SQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W        +  ++W  VC P+ EGGLGL +L +WN++  +K  W++  K D LWIKWIH
Sbjct: 641  WSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIH 700

Query: 1084 AEYLRGLDIWE 1052
            A Y++G   W+
Sbjct: 701  AYYIKGQREWK 711


>ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis
            sativus]
          Length = 382

 Score =  110 bits (276), Expect = 1e-21
 Identities = 55/126 (43%), Positives = 77/126 (61%), Gaps = 5/126 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+ +I++ I+ WS   LS AGRL+L+RSVL+ ++ YW     LP  V   + K+LR +L
Sbjct: 55   PLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPMKVHRDVDKILRSYL 114

Query: 1249 WRDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            WR  +       V+W  VCLP DEGGL +RD + WN A   K LW +  K+ SLW+ W+ 
Sbjct: 115  WRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVE 174

Query: 1084 AEYLRG 1067
            A  L+G
Sbjct: 175  AYILKG 180


>dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 489

 Score =  110 bits (274), Expect = 2e-21
 Identities = 64/163 (39%), Positives = 82/163 (50%), Gaps = 6/163 (3%)
 Frame = -3

Query: 1429 PLLAQISNFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVISRITKMLRKFL 1250
            PL+  I   I  WS   LS AGRL LI SVL  +  +W+ A  LP   I  I KM   +L
Sbjct: 170  PLIEHIKKKIGSWSARFLSYAGRLNLISSVLWSICNFWMGAFRLPRECIREIDKMCSAYL 229

Query: 1249 W-----RDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIH 1085
            W       S+  ++W  VC P+DEGGLGLR L   N     K +W I + ADSLW+KWIH
Sbjct: 230  WSGGDLNTSKAKIAWTDVCKPKDEGGLGLRSLKEANDVSCLKLIWRIISHADSLWVKWIH 289

Query: 1084 AEYLRGLDIWEFPYPRRDAPHM-TNILRIRDQLIFDCGGNLND 959
            A  L+ +  W           M   +L+ RD  I  C   +N+
Sbjct: 290  ATLLKQVSFWAVRENTSLGSWMWKKVLKFRDAAIQLCKAEVNN 332

