BLASTX nr result

ID: Mentha29_contig00009729 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009729
         (1694 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   303   1e-79
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   284   1e-73
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   274   8e-71
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   260   1e-66
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   256   2e-65
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   254   8e-65
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   248   6e-63
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   243   1e-61
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             238   5e-60
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   233   2e-58
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   230   1e-57
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       227   1e-56
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   221   8e-55
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   219   3e-54
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   211   6e-52
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   209   3e-51
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               208   7e-51
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   207   9e-51
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   207   1e-50
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           207   1e-50

>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  303 bits (777), Expect = 1e-79
 Identities = 167/458 (36%), Positives = 249/458 (54%), Gaps = 7/458 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG     +  +RGLRQGDP+SP LF++ ME L+  ++    D  F +HPKC     T+L 
Sbjct: 14   NGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNYHPKCDKLKITNLC 73

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDLLLF RGD  S+ ++  A + F+  +GL +N  K  +   G+    KR ILE+ GF
Sbjct: 74   FADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGIDAVTKREILEVSGF 133

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
             EG LP KYLG+P+ SK L+T+ YS L+ +I   I  W+   LS  GRL+L+ SV+  + 
Sbjct: 134  QEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALT 193

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706
             YWL   P   +V+  I  + R FLW        + PV+WK +C PR  GGL + D+ +W
Sbjct: 194  NYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIW 253

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886
            NKA   K LWN+ +K DSLW+KWI A Y++  ++        D+  M  IL+ R+ L   
Sbjct: 254  NKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--- 310

Query: 887  CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066
                +++ +  ++      G  + Y   +  G++K W   ++ +   P+ + ILWLA HG
Sbjct: 311  --EKIDNMEELMIRGSINMG--KLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHG 366

Query: 1067 RLKTFDRL-KHSDI-ARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIP 1240
            RL T DRL K+  I  + C  C S +E+ +HLFF CD +  VW  +  W++ R+  +  P
Sbjct: 367  RLSTKDRLCKYGMIDDKSCCFC-SEEESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWP 425

Query: 1241 SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLK 1354
            + +        G G       +A+  T+  +W  RN K
Sbjct: 426  NELHWLTHHTKGKGTRAAVLKMAIAETIYEIWNIRNNK 463


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  284 bits (726), Expect = 1e-73
 Identities = 159/457 (34%), Positives = 243/457 (53%), Gaps = 8/457 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG     +  +RG+RQGDP+SP LF+L MEYL+ ++        F +H KC     T+L 
Sbjct: 456  NGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSKCEKMKITNLC 515

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDLLLF RGD  S++++ D  + F  + GL +N SK +I+ G V    K  +L + GF
Sbjct: 516  FADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISGF 575

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
             EG +P +YLG+PL+SK L    Y +L+ +I   I  WS   LS  GR++LI+SV+    
Sbjct: 576  KEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATI 635

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +W+Q LPL   VI  I  + R FLW  +     + P++W+ VC P+  GGL + +LA+W
Sbjct: 636  NFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIW 695

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886
            NK    K LWN+  K+D+LWIKW+H  Y+RG+ IW     +  +  M++++++R  L+  
Sbjct: 696  NKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLR-PLLLQ 754

Query: 887  CEGNLNDA-KAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMH 1063
             +  + D  K K +           Y     + EK  W   +  +   P+    LW A H
Sbjct: 755  YQSRMQDVFKMKKI-----------YLALFEESEKMSWRTLMCNNLARPRALFCLWQACH 803

Query: 1064 GRLKTFDRLKH--SDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTI 1237
             RL + DRL     ++   C  C S  E+H+HLFF C +   +W+ + +WL+  +  +T 
Sbjct: 804  FRLASKDRLIKFGLNVDANCAFCSSM-ESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTW 862

Query: 1238 PSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN 1348
               +    R+  G G        A   T+ ++W  RN
Sbjct: 863  SEELNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRN 899


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  274 bits (701), Expect = 8e-71
 Identities = 165/470 (35%), Positives = 239/470 (50%), Gaps = 11/470 (2%)
 Frame = +2

Query: 26   RGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLF 205
            + ++GLRQGDPMSP LF LCMEYLS  +        F  HPKC   + THL FADDLL+F
Sbjct: 636  QARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMF 695

Query: 206  GRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVK 385
             R D  S+  +  A  +F+  SGL  +  KS+I+  GV     R + +      G LP +
Sbjct: 696  CRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFR 755

Query: 386  YLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLP 565
            YLG+PL SK LT      L+  I+N    W    LS  GRL+LI+S+L  ++ YW  + P
Sbjct: 756  YLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFP 815

Query: 566  LQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKT 730
            L   VI  + K+ RKFLW        + PV+W T+  P+  GG  + ++  WN+A   K 
Sbjct: 816  LSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKL 875

Query: 731  LWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCEGNLNDA 910
            LW I  K D LW++WIH+ Y++ +DI       +    +  I++ RD L      N+ D 
Sbjct: 876  LWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL-----SNIGDW 930

Query: 911  KAKLVG-WFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDR 1087
                +G  F+ K   +AY+     GE+  W + I  +Y  PK   ILW+ +H RL T DR
Sbjct: 931  DEICIGDKFSMK---KAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDR 987

Query: 1088 LKHSDIA--RGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTT---IPSAVR 1252
            +    +       LC +  ET  HLFF C  +  VWS IC  +R  N   +   I S+V 
Sbjct: 988  ISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVC 1047

Query: 1253 RFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402
               R+K G  I+     +     V  +W+ RN +    +  + + V+++I
Sbjct: 1048 GQARKKKGKLIV-----MLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  260 bits (665), Expect = 1e-66
 Identities = 140/404 (34%), Positives = 221/404 (54%), Gaps = 12/404 (2%)
 Frame = +2

Query: 23   VRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 202
            +  +RG+RQGDP+SP LF++ MEYL+ L+     D  F HH KC     THL FADD+LL
Sbjct: 463  IAAKRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVLL 522

Query: 203  FGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPV 382
            F RGD  S+ ++   +++F+AT+GL +N +K  I+ GGV    K  I ++  + EG LPV
Sbjct: 523  FCRGDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPV 582

Query: 383  KYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVL 562
            +YLG+PL SK L    Y  L+ +I+  I  W++  L+  GR++++   +  +  +W+Q L
Sbjct: 583  RYLGVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCL 642

Query: 563  PLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSK 727
            P+  +VI  I  M R F+W  S     + P++W +VC P+ +GGL + +L VWN      
Sbjct: 643  PIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLN 702

Query: 728  TLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRD-----QLIFDCE 892
             LWN+  K D+LW+KWIHA Y++   +         +  + N+L  R+     Q ++D  
Sbjct: 703  CLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWD-- 760

Query: 893  GNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRL 1072
              LN  + K+          +AY+    + ++  W   + ++   P+     WLA HGRL
Sbjct: 761  ELLNSERFKM---------KKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHGRL 810

Query: 1073 KTFDRLKHSDIARGCV--LCESADETHDHLFFKCDKAMAVWSGI 1198
             T DRL    +    +  LC+  +ET +H+ F C  A  +WS +
Sbjct: 811  GTKDRLVRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNV 854


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  256 bits (655), Expect = 2e-65
 Identities = 163/469 (34%), Positives = 232/469 (49%), Gaps = 14/469 (2%)
 Frame = +2

Query: 38   GLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGD 217
            GLRQG  +SP LF++CM  LS ++     +  F +HP+C     THL FADD+++F  G 
Sbjct: 910  GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGS 969

Query: 218  PDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLGL 397
              S+  +     +F A SGL I+  KS +F+  +      +IL  F F  G+LPV+YLGL
Sbjct: 970  AHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGL 1029

Query: 398  PLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQGT 577
            PL +K +T  D   LL +I + I  W N  LS  GRL+L+ SV+  +  +W+    L   
Sbjct: 1030 PLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRA 1089

Query: 578  VIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNI 742
             I  I ++   FLW  +     +  V+W  VC P+ EGGLGLR L   NK    K +W +
Sbjct: 1090 CIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRL 1149

Query: 743  HAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILR-IRDQL-IFDCEGNLNDAKA 916
             +   SLW+ WI    +  R + E     R   H  +IL  I ++L    C G   +   
Sbjct: 1150 VSAKHSLWVNWIQNNLI--RTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDR 1207

Query: 917  KLV----GWFAGKGTS-EAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTF 1081
             L     G F  K  S E +   R +G  K W+KAIW S   PKF+ I WLA H RL T 
Sbjct: 1208 SLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTG 1267

Query: 1082 DRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSAVRR 1255
            D++   +  I+  CVLC  + E+ DHLFF C+ +  +W  +   L      T  P+ +  
Sbjct: 1268 DKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLL 1327

Query: 1256 FQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402
               +   SG  R        AT+  LW+ RN +     P  + H+IK I
Sbjct: 1328 LSGQDF-SGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  254 bits (649), Expect = 8e-65
 Identities = 141/405 (34%), Positives = 217/405 (53%), Gaps = 9/405 (2%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG     +  + G+ QGDP+SP LF+L MEY + ++     + +F HH +C     THL+
Sbjct: 117  NGELSNVLETKIGIWQGDPISPLLFVLMMEYFNRIMVKMQRNPSFNHHSQCERLGITHLS 176

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADD+ L  RGD  S++++  A   F+ ++GL IN +K  +F GG+     + I ++ GF
Sbjct: 177  FADDVFLLCRGDKKSIKMIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITGF 236

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
             EGTLPV+YLG+PL+ K L    Y  L+ +I   I  WS+  LS  GR++L+RS++  + 
Sbjct: 237  EEGTLPVRYLGVPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIA 296

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706
             YW+ V P+   VI  I  + R F+W  S     +  V+WK VC P   GGL L +L +W
Sbjct: 297  QYWMSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELW 356

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886
            N     K LWNI +K D+LW+KWIHA +L+G ++            + ++++ R Q    
Sbjct: 357  NVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQ---- 412

Query: 887  CEGNLNDAKAKLVGWFAGKGTS--EAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAM 1060
                +N+ +   +     +  S  + Y        K  W++ +  +   P+ +V LWLA 
Sbjct: 413  ----VNNLQLVWIEMLRKRKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLAC 468

Query: 1061 HGRLKTFDRLKHSDIARG--CVLCESADETHDHLFFKCDKAMAVW 1189
              RL T  RLK+ ++ +   C LC+  DE  DHL F C    A+W
Sbjct: 469  QNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSCRVTKAIW 513


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  248 bits (633), Expect = 6e-63
 Identities = 154/470 (32%), Positives = 228/470 (48%), Gaps = 13/470 (2%)
 Frame = +2

Query: 32   QRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGR 211
            Q+GLRQGDP+SP LF L MEYLS  +     D  F  HPKC     THL FADDLL+F R
Sbjct: 641  QKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFAR 700

Query: 212  GDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYL 391
             D  S+  +  A + F+  SGL  +  KS I+ GGV   E   + +    P G+LP +YL
Sbjct: 701  ADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYL 760

Query: 392  GLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQ 571
            G+PLASK L       L+ +I+     W    LS  GRL+L++++L  ++ YW Q+ PL 
Sbjct: 761  GVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLP 820

Query: 572  GTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLW 736
              +I  +    RKFLW  +     + PV+W  +  P+  GGL + ++ +WNKA   K LW
Sbjct: 821  KKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLW 880

Query: 737  NIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCEGNLNDAKA 916
             I  K D LW++W++A Y++ ++I         +  +  I   R+ L             
Sbjct: 881  AITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELL------------T 928

Query: 917  KLVGWFA-----GKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTF 1081
            +  GW A          + Y+  +   E   W + I  +   PK   ILWLAM  RL T 
Sbjct: 929  RTGGWEAVSNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATA 988

Query: 1082 DRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSAVRR 1255
            +R+   + D++  C +C +  ET  HLFF C  +  +W  +  +L  + Q      A + 
Sbjct: 989  ERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA--QAKKE 1046

Query: 1256 FQREKAGSGIIRKAKWVAL-GATVQYLWQARNLKYVEKKPFEASHVIKEI 1402
               +KA S   R   +V +   +V  +W  RN K         +  +K I
Sbjct: 1047 LAIKKARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  243 bits (621), Expect = 1e-61
 Identities = 160/475 (33%), Positives = 235/475 (49%), Gaps = 8/475 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDST-FMHHPKCSTTDTTHL 178
            NG   GF   +RGLRQGDP+SP LF++ ME LS  I  R + S  F +H +C   + +HL
Sbjct: 464  NGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHL 523

Query: 179  AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358
             FADDLL+F  GD +S+R L DA   F + S L  N S+S IFL GV      ++L++  
Sbjct: 524  CFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTN 583

Query: 359  FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538
            F  GT PV+YLG+PL +  L   D S LL +I   I  W N  LS  GRL+LI+SVL  +
Sbjct: 584  FSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSI 643

Query: 539  ECYWLQVLPLQGTVIATITKMLRKFLW---CDSQC--PVSWKTVCLPRDEGGLGLRDLAV 703
            + YW   L L   V+  I K LR FLW   C  +    V+W  +CLP+ EGGLG++DL  
Sbjct: 644  QVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHC 703

Query: 704  WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIF 883
            WNKAL    +WN+ + + + W  W+    L+G   W  P P   + +   +L+IR+    
Sbjct: 704  WNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRE---L 760

Query: 884  DCEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKF-WYKAIWRSYIPPKFSVILWLAM 1060
             C   +N     ++G   G+ TS  ++++   G     W   I       K +++     
Sbjct: 761  CCSFFVN-----IIG--DGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGF 813

Query: 1061 HGRLKTFDRLKHSD-IARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTI 1237
            +     ++ L+ S  I     L     ETH+HLFF C  +  +W+ + S       +   
Sbjct: 814  YSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLFFDCAYSFGIWTHVLSKCDVSKPLLPW 873

Query: 1238 PSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402
               +        G+ +      +AL A V  +W+ RN +    +    + V K I
Sbjct: 874  SDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRFRNESLPPAVVFKGI 928


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  238 bits (608), Expect = 5e-60
 Identities = 155/474 (32%), Positives = 223/474 (47%), Gaps = 10/474 (2%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   GF + +RGLRQG  +SP LF++ M+ LS L+        F +H +C     THL+
Sbjct: 170  NGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYHSRCKELSLTHLS 229

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++   G   S+  + +  D F   SGL I+  KS I+L GV       I   + F
Sbjct: 230  FADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQF 289

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              G LPV+YLGLPL +K LT  DYS LL  I   I  W+   LS  GRL LI SVL  + 
Sbjct: 290  DVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSIC 349

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +WL    L    I  I K+   FLW        +  V W  VC P+ EGGLGLR L   
Sbjct: 350  NFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEM 409

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886
            N+    K +W I +  +SLW++WI    L+    W      +   +M ++L         
Sbjct: 410  NEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSV----QTTTNMDSVL--------- 456

Query: 887  CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066
              G  ++   K         T + +   R       W+  IW ++  PKFS   WLA+  
Sbjct: 457  WRGRNDEYMPKF-------STRDTWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQN 509

Query: 1067 RLKTFDRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMT 1231
            RL T D++   +  ++  CVLC +  ET +HLFF C     +W  +   +   +     +
Sbjct: 510  RLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWS 569

Query: 1232 TIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVI 1393
            TI ++V    R +  S + R        AT+  +W  RN +   ++   A+H+I
Sbjct: 570  TILTSVSTTWRNRTESFLAR----YIFQATIHTIWHERNGRRHGERSNSATHLI 619


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  233 bits (594), Expect = 2e-58
 Identities = 113/265 (42%), Positives = 165/265 (62%), Gaps = 5/265 (1%)
 Frame = +2

Query: 35   RGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRG 214
            +GLRQGDPMSP LF + MEYLS L+     D +F +HPK +  D THL FADDLLLF RG
Sbjct: 447  KGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLLFSRG 506

Query: 215  DPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLG 394
            D +S++ L+    EF+  SGL  N +KS I+ GGV+   ++ I++  G+    LP KYLG
Sbjct: 507  DLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFKYLG 566

Query: 395  LPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQG 574
            +PL+SK L T+ +  L+ ++   I  W+   LS  GR +L+++VL GV+  W Q+  +  
Sbjct: 567  VPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPA 626

Query: 575  TVIATITKMLRKFLW-----CDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWN 739
             +I  I  + R +LW        +  ++W  VC P+ EGGLGL +L +WN++  +K  W+
Sbjct: 627  KIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWD 686

Query: 740  IHAKADSLWIKWIHAEYLRGRDIWE 814
            +  K D LWIKWIHA Y++G+  W+
Sbjct: 687  LANKEDKLWIKWIHAYYIKGQREWK 711


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  230 bits (587), Expect = 1e-57
 Identities = 160/553 (28%), Positives = 242/553 (43%), Gaps = 86/553 (15%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   G+ R  RGLRQG  +SP LF++ M+ LS ++        F +HP+C T   THL 
Sbjct: 364  NGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLC 423

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++   G   S+  +   L++F A  GL I   K+ ++L GV    ++ +   + F
Sbjct: 424  FADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSF 483

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              G LPV+YLGLPL +K LTT DYS L+ QI   I  W++  LS  GRL LI SVL  + 
Sbjct: 484  GVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSIT 543

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706
             +W+    L    I  I ++    LW   +       VSW  +C P+ EGGLGL+ L   
Sbjct: 544  NFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREA 603

Query: 707  NKALHSKTLWNIHA---------------KADSLWI--------KWIHAEYLRGRDI--- 808
            NK    K +W + +               K +S W          WI    L+ R++   
Sbjct: 604  NKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKS 663

Query: 809  ------------------WEFPYP---------------------------RRDAPHMTN 853
                              W    P                           RR   H   
Sbjct: 664  FCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVE 723

Query: 854  ILRIRDQLIFDCEGNLNDAKAKLVGWFAGK--------GTSEAYEHFRAKGEKKFWYKAI 1009
            IL   ++++     + N      + W  GK         T + + H R    ++ W+K +
Sbjct: 724  ILNEFEEILLQKYQHRNIELEDAILW-RGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGV 782

Query: 1010 WRSYIPPKFSVILWLAMHGRLKTFDRLK--HSDIARGCVLCESADETHDHLFFKCDKAMA 1183
            W ++  PKFS   WLA+  RL T DR+   ++     CV C S  ET DHLFF+C  +  
Sbjct: 783  WFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSE 842

Query: 1184 VWSGICSWLRCRNQMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVE 1363
            +W+ I   +  +++ +T  SAV  +  +     I           ++  +W+ RN +   
Sbjct: 843  IWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHG 901

Query: 1364 KKPFEASHVIKEI 1402
            +K   AS++I++I
Sbjct: 902  EKSRSASNLIRQI 914


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  227 bits (579), Expect = 1e-56
 Identities = 117/294 (39%), Positives = 165/294 (56%), Gaps = 5/294 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NGG+ GF +  +GLRQGDP+SP LF+L ME  S+L+H+R       +HPK S    +HL 
Sbjct: 637  NGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLM 696

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADD+++F  G   S+  + + LD+F + SGL +NK KSH++L G+   E  A    +GF
Sbjct: 697  FADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNA-NAAYGF 755

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
            P GTLP++YLGLPL ++ L   +Y  LL +I+     W N  LS  GR++LI SV+ G  
Sbjct: 756  PIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSI 815

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +W+    L    I  I  +  +FLW  +        VSW  +CLP+ EGGLGLR L  W
Sbjct: 816  NFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEW 875

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIR 868
            NK L  + +W +    DSLW  W H  +L     W     + D+     +L +R
Sbjct: 876  NKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  221 bits (563), Expect = 8e-55
 Identities = 163/555 (29%), Positives = 240/555 (43%), Gaps = 91/555 (16%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   G+ R  RG+RQG  +SP LF++ ME LS ++        F  HPKC     THL 
Sbjct: 48   NGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHLC 107

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++   G   S+  + + ++ F   SGL IN  K+ ++  GV    +  ++  + F
Sbjct: 108  FADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPF 167

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              G LPV+YLGLPL +K LT  D S L  QI N I  W++  LS  GRL LI SVL    
Sbjct: 168  GLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTM 227

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706
             +W+    L    +  I  +   FLW   +       VSW  +C P+ EGGLGLR L   
Sbjct: 228  NFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEA 287

Query: 707  NKALHSKTLWNIHAKADSL---WIK--------------------WIHAEYLRGRDIWEF 817
            N     K +W + +  DSL   W K                    W+  + L+ R+  + 
Sbjct: 288  NVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAK- 346

Query: 818  PYPRRDAP----------------HMTNILRIRDQLIFDCEGN----------------- 898
            P+ R +                  H+ ++   R Q+      N                 
Sbjct: 347  PFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRT 406

Query: 899  --LNDAKAKL-------------VGWFAGKG--------TSEAYEHFRAKGEKKFWYKAI 1009
              LND +A L                + GKG        T + +   R K  +  WYK +
Sbjct: 407  EQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGV 466

Query: 1010 WRSYIPPKFSVILWLAMHGRLKTFDRLK----HSDIARGCVLCESADETHDHLFFKCDKA 1177
            W S+  PK+    WLA+  RL T  R++     SD+   C  C ++ ET DHLFF C  A
Sbjct: 467  WFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVK--CTFCSTSIETRDHLFFSCSYA 524

Query: 1178 MAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN 1348
             A+W+ I   +   R      TI + +   Q ++  S + R         TV  +W+ RN
Sbjct: 525  SAIWTAIAKNVLQHRFSTDWQTIVNYISETQTDRIRSFLSR----YIFQLTVHTVWKERN 580

Query: 1349 LKYVEKKPFEASHVI 1393
             +   ++P  ++++I
Sbjct: 581  DRRHGEEPRTSANLI 595


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  219 bits (558), Expect = 3e-54
 Identities = 112/275 (40%), Positives = 159/275 (57%), Gaps = 5/275 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            +G   G+ +G +GLRQGDP+SP+LF++ ME LS L+  +  D +  +HPK S    + LA
Sbjct: 636  SGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLA 695

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++F  G   S+R ++  L+ F   SGL +N  KS ++  G+   +K   L  FGF
Sbjct: 696  FADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGF 754

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              GT P +YLGLPL  + L   DYS L+ +I+     W+   LS  GRL+LI SV+    
Sbjct: 755  VNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTV 814

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCD-----SQCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +WL    L    + TI +M  +FLW +         VSW+  CLP+ EGGLGLR+   W
Sbjct: 815  NFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIW 811
            NK L+ + +W + A+ DSLW+ W HA  LR  + W
Sbjct: 875  NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFW 909


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  211 bits (538), Expect = 6e-52
 Identities = 110/275 (40%), Positives = 156/275 (56%), Gaps = 5/275 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   GF R +RGLRQG  +SP L+++CM  LS ++     +    +HP+C   + THL 
Sbjct: 790  NGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLC 849

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADD+++F  G   S++      ++F A S L I+  KS IF+ G+ P  K +IL+ F F
Sbjct: 850  FADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPF 909

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              GTLPVKYLGLPL +K +T  DY  L+ +I   I  W+N  LS  GRL+LI+SVL  + 
Sbjct: 910  ELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSIT 969

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +WL V  L    +  I KM   FLW        +  ++W  VC  ++EGGLGL+ L   
Sbjct: 970  NFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEA 1029

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIW 811
            N+    K +W I +  DSLW+KW++   +R    W
Sbjct: 1030 NEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFW 1064



 Score = 67.4 bits (163), Expect = 2e-08
 Identities = 34/90 (37%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
 Frame = +2

Query: 947  TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRL-KHSDIAR-GCV 1120
            +S+ ++  R+   +  WY+ +W S   PK+S + WLA H RL T D++ K +  AR  CV
Sbjct: 1187 SSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCV 1246

Query: 1121 LCESADETHDHLFFKCDKAMAVWSGICSWL 1210
             C    ET DHLFF C  +  VW  +   L
Sbjct: 1247 FCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  209 bits (532), Expect = 3e-51
 Identities = 136/433 (31%), Positives = 202/433 (46%), Gaps = 5/433 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG  +G  +GQRGLRQ DP+SP LF+L +EY +  I +   ++ F  +P C+ T  +HL 
Sbjct: 14   NGSIYGHFKGQRGLRQWDPLSPYLFVLYIEYFARDIQSLKDNANFQFNPNCAVTQLSHLT 73

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTIN---KSKSHIFLGGVRPFEKRAILEL 352
            FADD++L  RGD  S+  +   L  F   SGL+I+     KS  + G V     RA+++ 
Sbjct: 74   FADDIMLLSRGDLPSVSAIYAKLQHFCNVSGLSISSRWSRKSLSYAGKVELI--RAVIQ- 130

Query: 353  FGFPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQ 532
             G     + +     PL    L T     ++A   NF+  W  ++  ++  L        
Sbjct: 131  -GIANFWMSI----FPLPQSVLDT-----IIATCRNFL--WGKADGGKIKPL-------- 170

Query: 533  GVECYWLQVLPLQGTVIATITKMLRKFLWCDSQCPVSWKTVCLPRDEGGLGLRDLAVWNK 712
                                               V+W  VC P+ EGGLGL +L  WN 
Sbjct: 171  -----------------------------------VAWSEVCTPKKEGGLGLFNLKDWNI 195

Query: 713  ALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCE 892
            AL S  LW++H+K DSLW++ +H  Y +G ++W+F     D+      + IRD +I   E
Sbjct: 196  ALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISSSSDSV----FIHIRD-IIISKE 250

Query: 893  GNLNDAKAKLVGWFAGKGT--SEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066
             N+  AK  L  W   + T   + Y++ R       W   IW   IP K S ILWLA   
Sbjct: 251  ENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKN 310

Query: 1067 RLKTFDRLKHSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSA 1246
            RL   DR    +    C LC +  E+H HLFF C  ++ VW+ I  W+  + Q  ++  +
Sbjct: 311  RLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHS 370

Query: 1247 VRRFQREKAGSGI 1285
            +    R +A SG+
Sbjct: 371  ISALIRRRATSGV 383


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  208 bits (529), Expect = 7e-51
 Identities = 118/296 (39%), Positives = 159/296 (53%), Gaps = 6/296 (2%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   G+ + +RGLRQG  +SP LF++CM+ LS ++        F  HPKC     THL+
Sbjct: 290  NGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLTHLS 349

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++   G   S+  + +  DEF   SGL I+  KS +++ GV P  K+ I   F F
Sbjct: 350  FADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLF 409

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              G LPV+YLGLPL +K LT+ DYS LL QI   I  W+    S  GR  LI+SVL  + 
Sbjct: 410  DVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSIC 469

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706
             +WL    L    I  I K+   FLW  S+       +SW  VC P+ EGGLGLR+L   
Sbjct: 470  NFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEA 529

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHM-TNILRIRD 871
            N     K +W I + ++SLW KW+    +R + IW           +   IL+IRD
Sbjct: 530  NDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRD 585



 Score = 71.2 bits (173), Expect = 1e-09
 Identities = 39/156 (25%), Positives = 74/156 (47%), Gaps = 7/156 (4%)
 Frame = +2

Query: 947  TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRL----KHSDIARG 1114
            T + +   +A      W+K +W  +  PK+++  WLA+H RL T DR+        ++  
Sbjct: 687  TRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGN 746

Query: 1115 CVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGI 1285
            CVLC +  +T +HLFF C  A  VW+ +   +   R   + + + + +    +++    +
Sbjct: 747  CVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHFQDRVEGFL 806

Query: 1286 IRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVI 1393
             R        AT+ ++W+ RN +  +  P   + VI
Sbjct: 807  TR----YIFQATIYHVWRERNGRRHDAAPNTPATVI 838


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  207 bits (528), Expect = 9e-51
 Identities = 111/276 (40%), Positives = 151/276 (54%), Gaps = 5/276 (1%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181
            NG   G+ +  RGLRQG  +SP LF++CM+ LS ++        F +HPKC T   THL+
Sbjct: 643  NGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLS 702

Query: 182  FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361
            FADDL++   G   S+  +    DEF   SGL I+  KS ++L G+    +  + + F F
Sbjct: 703  FADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPF 762

Query: 362  PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541
              G LPV+YLGLPL +K L+T D   LL Q+   I  W++  LS  GRL LI SVL  + 
Sbjct: 763  SSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSIC 822

Query: 542  CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706
             +WL    L    I  + KM   FLW       ++  +SW  VC P+DEGGLGLR L   
Sbjct: 823  NFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEA 882

Query: 707  NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814
            N     K +W I + ++SLW+KW+    LR    WE
Sbjct: 883  NDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWE 918



 Score = 81.6 bits (200), Expect = 1e-12
 Identities = 49/168 (29%), Positives = 79/168 (47%), Gaps = 5/168 (2%)
 Frame = +2

Query: 947  TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRLKH--SDIARGCV 1120
            T + + H R+   +  W+K IW S+  PK+S   WLA HGRL T DR+ +  + IA  C+
Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCI 1099

Query: 1121 LCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGIIR 1291
             C+   ET DHLFF C     +W  +   +   +  +   +I  A+   Q  +    + R
Sbjct: 1100 FCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRR 1159

Query: 1292 KAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEIKLDVYRVLYSL 1435
                    AT+  +W+ RN +   + P  AS ++  I   +   L S+
Sbjct: 1160 ----YVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSSI 1203


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  207 bits (527), Expect = 1e-50
 Identities = 111/277 (40%), Positives = 153/277 (55%), Gaps = 6/277 (2%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMH-HPKCSTTDTTHL 178
            NG + GF R  +GLRQGDP+SP LF+L ME  S L+++R +DS ++H HPK      +HL
Sbjct: 497  NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555

Query: 179  AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358
             FADD+++F  G   SM  + + LD+F   SGL +NK KS +F  G+    +R     +G
Sbjct: 556  MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614

Query: 359  FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538
            FP GT P++YLGLPL  + L   DY  LL ++S  +  W +  LS  GR +LI SV+ G+
Sbjct: 615  FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674

Query: 539  ECYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAV 703
              +W+    L    I  I  +  KFLW  S        VSW   CLP+ EGGLG R    
Sbjct: 675  INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734

Query: 704  WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814
            WNK L  + +W +  +  SLW +W     L     W+
Sbjct: 735  WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  207 bits (527), Expect = 1e-50
 Identities = 111/277 (40%), Positives = 153/277 (55%), Gaps = 6/277 (2%)
 Frame = +2

Query: 2    NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMH-HPKCSTTDTTHL 178
            NG + GF R  +GLRQGDP+SP LF+L ME  S L+++R +DS ++H HPK      +HL
Sbjct: 497  NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555

Query: 179  AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358
             FADD+++F  G   SM  + + LD+F   SGL +NK KS +F  G+    +R     +G
Sbjct: 556  MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614

Query: 359  FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538
            FP GT P++YLGLPL  + L   DY  LL ++S  +  W +  LS  GR +LI SV+ G+
Sbjct: 615  FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674

Query: 539  ECYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAV 703
              +W+    L    I  I  +  KFLW  S        VSW   CLP+ EGGLG R    
Sbjct: 675  INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734

Query: 704  WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814
            WNK L  + +W +  +  SLW +W     L     W+
Sbjct: 735  WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771


Top