BLASTX nr result

ID: Alisma22_contig00012011 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00012011
         (1478 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]              266   1e-74
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...   257   2e-71
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]              254   2e-70
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...   253   3e-70
AAK51235.1 polyprotein [Arabidopsis thaliana]                         249   6e-69
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...   247   5e-68
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...   247   5e-68
AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]    246   7e-68
CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] C...   246   7e-68
KZV38965.1 hypothetical protein F511_40717, partial [Dorcoceras ...   234   2e-66
KZV57610.1 hypothetical protein F511_03070 [Dorcoceras hygrometr...   239   4e-66
XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [...   232   9e-66
ACP30598.1 disease resistance protein [Brassica rapa subsp. peki...   235   8e-64
KYP63355.1 Retrovirus-related Pol polyprotein from transposon TN...   233   8e-64
KZV30597.1 hypothetical protein F511_05747 [Dorcoceras hygrometr...   231   8e-63
CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 pu...   223   6e-60
XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [...   213   1e-59
AAC61290.1 putative retroelement pol polyprotein [Arabidopsis th...   217   6e-58
XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [...   216   2e-57
KYP43751.1 Retrovirus-related Pol polyprotein from transposon TN...   202   2e-55

>OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]
          Length = 1996

 Score =  266 bits (680), Expect = 1e-74
 Identities = 167/495 (33%), Positives = 256/495 (51%), Gaps = 5/495 (1%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRPPIPSFEEVTSLLK 1298
            +Y+R FK VCD L AI KPV D  KV+ LL GLG+D+E+F  +M++PPIP++ ++  LL+
Sbjct: 110  DYIRIFKNVCDDLAAIGKPVDDRAKVFGLLRGLGSDYESFITSMLKPPIPTYNDLIPLLQ 169

Query: 1297 YHEMRKTTQENTLVGGNSFAFL-XXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGP 1121
             HE  K+  + +     + AF+                  +  GRG   TYNN       
Sbjct: 170  GHETMKSLHQTSKSPNLNMAFMSQRNTANRNFSKRGRGSFSSRGRGFPQTYNNKFN---- 225

Query: 1120 SSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPL-GNTSFNSAGRCFGPSVRNNETGYR 944
                                 N G +  NS GS   GN S              +++G  
Sbjct: 226  --------------------RNDGYSGSNSNGSSTHGNNS-------------QDDSGKT 252

Query: 943  VVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVD 764
             + CQIC    H AL C+ RF+++Y+S         +A+   A+ +   D    + W  D
Sbjct: 253  NIVCQICKLPKHTALDCYNRFNHAYQS--------EKARQAMAMKL---DGPIDNSWFPD 301

Query: 763  TGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIG-QSVIVQNQDIP*YKVLLVP 587
            T A+ H+T  P ILS++  Y+G D I++G+   + I+H G   + V + ++    VL+VP
Sbjct: 302  TAASAHMTADPGILSSLSQYHGCDKILIGDGSLLDISHTGTMDIPVLDGNLQLNNVLVVP 361

Query: 586  QIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYL 407
            +IK NLLS  +LT+ +P    F      IK  +   ++ KG  ++G+Y L T ++AA++ 
Sbjct: 362  EIKKNLLSAGQLTDDYPYTCEFSSAGVVIKDRETGKMIAKGSKQDGVYALGTKEKAAFFS 421

Query: 406  NKV*GVSYELWHRRLGHANQEVFHIL-QNKGEISTLDNKT-HFCNRFAM*KSHRLPFFSS 233
             +    S E+WH+RLGH   +V  +L +NK   ST  NK  HFC+   M K+ RLPF  S
Sbjct: 422  TRFKTASDEVWHQRLGHPQPKVVELLKKNKLITSTSGNKVEHFCDSCQMAKACRLPFILS 481

Query: 232  HNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFC 53
            +     P D+++CDLWG + + S  ++ YY +FVD+ +RF W+ PLK+KS+F  CF+ F 
Sbjct: 482  NEFCDTPMDVIHCDLWGAAPVASFQKFKYYVIFVDEYSRFTWYFPLKHKSDFFQCFLNFH 541

Query: 52   KMIKTSFMITPKVFQ 8
              ++  F    KVFQ
Sbjct: 542  AYVENQFGKKIKVFQ 556


>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078 [Arabidopsis
            thaliana]
          Length = 1415

 Score =  257 bits (656), Expect = 2e-71
 Identities = 162/497 (32%), Positives = 242/497 (48%), Gaps = 7/497 (1%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTV----TMMRPPIPSFEEVTS 1307
            Y REFK +CD L +I KPV +  K++  L+GLG D++  T     ++ + P P+F +V S
Sbjct: 144  YCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLSKLPTPTFNDVVS 203

Query: 1306 LLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPT 1127
             ++  + +  + E                              P     +L +N     +
Sbjct: 204  EVQGFDSKLQSYEEAA------------------------SVTP-----HLAFNIERSES 234

Query: 1126 G-PSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETG 950
            G P  N   +G                      +G   G   +++ GR F     + +  
Sbjct: 235  GSPQYNPNQKGR-------------------GRSGQNKGRGGYSTRGRGFSQHQSSPQVS 275

Query: 949  YRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
                 CQICG+ GH ALKC+ RFDN+Y++         E + F+ L + +       EWH
Sbjct: 276  GPRPVCQICGRTGHTALKCYNRFDNNYQA---------EIQAFSTLRVSDD---TGKEWH 323

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQ-NQDIP*YKVLL 593
             D+ AT H+T+  + L +   Y G D ++VG+  Y+PI H G + I   N  IP  +VL+
Sbjct: 324  PDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLV 383

Query: 592  VPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAY 413
            VP I+ +LLSVSKL + +PC   F  +   I  LQ   ++  G  +NGLYVL+  +  A 
Sbjct: 384  VPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVAL 443

Query: 412  YLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTH-FCNRFAM*KSHRLPFFS 236
            Y N+    + E+WH RLGHAN +    LQN   I    ++T   C    M KS RLPF  
Sbjct: 444  YSNRQCAATEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLI 503

Query: 235  SHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMF 56
            S +   +P D ++CDLWG S ++S     YYA+FVDD +R+ WF PL NKS F+  F+ F
Sbjct: 504  SDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISF 563

Query: 55   CKMIKTSFMITPKVFQT 5
             K+++       KVFQ+
Sbjct: 564  QKLVENQLNTKIKVFQS 580


>CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  254 bits (649), Expect = 2e-70
 Identities = 162/496 (32%), Positives = 248/496 (50%), Gaps = 6/496 (1%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTV----TMMRPPIPSFEEVTS 1307
            Y R+FK++CD L +I KPV +  K++  L+GLG +++  T     ++ + P P+F +V S
Sbjct: 144  YCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLSKLPAPTFNDVIS 203

Query: 1306 LLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPT 1127
             ++  + +  + ++T+      AF                  AP        YN++ R  
Sbjct: 204  EVQGFDSKLQSYDDTVSVNPHLAF----------NTERSNSGAP-------QYNSNSRGR 246

Query: 1126 GPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGY 947
            G S                              G   G   +++ GR F      + +  
Sbjct: 247  GRS------------------------------GQNRGRGGYSTRGRGFSQHQSASPSSG 276

Query: 946  RVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHV 767
            +   CQICG+ GH A+KC+ RFDN+Y+S  VP       + F+AL + +       EW+ 
Sbjct: 277  QRPVCQICGRIGHTAIKCYNRFDNNYQS-EVP------TQAFSALRVSDE---TGKEWYP 326

Query: 766  DTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQD-IP*YKVLLV 590
            D+ AT HIT     L N  +Y G D ++VG+  Y+PI H+G + I  ++  IP  +VL+ 
Sbjct: 327  DSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVC 386

Query: 589  PQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYY 410
            P I+ +LLSVSKL + +PC   F  +   I  L    ++ KG   NGLY+L+ ++  A Y
Sbjct: 387  PAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALY 446

Query: 409  LNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTH-FCNRFAM*KSHRLPFFSS 233
             N+    S E WH RLGH+N ++   L  + EI    ++T   C    M KS RL FFSS
Sbjct: 447  SNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSS 506

Query: 232  HNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFC 53
               +  P D V+CDLWG S ++S   + YYA+FVDD +RF WF PL+ KS FI  F+ + 
Sbjct: 507  DFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQ 566

Query: 52   KMIKTSFMITPKVFQT 5
            K+++       K FQ+
Sbjct: 567  KLVENQLGTKIKEFQS 582


>OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana]
          Length = 2099

 Score =  253 bits (647), Expect = 3e-70
 Identities = 167/487 (34%), Positives = 236/487 (48%), Gaps = 5/487 (1%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVT 1310
            EYLR+ K +C+QL +I  PV ++ K++ +L GLG ++E   V +       P P+ EEV+
Sbjct: 262  EYLRDIKSICEQLASIGSPVPEKMKIFAVLKGLGREYEPIKVNIEGMIDMYPGPTLEEVS 321

Query: 1309 SLLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRP 1130
            S LK    R  +    +      AF                     G+GN      + +P
Sbjct: 322  SRLKSFSDRLASYNVGMEVSPHLAFYANYSGK--------------GKGN-----QYGKP 362

Query: 1129 TGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETG 950
             G      +QG               GN S    G P   +S  S           N T 
Sbjct: 363  GG------NQG-------------KSGNYSTKGRGFPQQISSSTSGSY--------NNTE 395

Query: 949  YRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
             RVV CQICGK GH ALKCW+RF+NSY+   +P          TA+ I +    N ++W 
Sbjct: 396  NRVV-CQICGKPGHPALKCWHRFNNSYQYEELP-------AALTAMRITDVTDHNGNKWV 447

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQDIP*YK-VLL 593
             D+GAT H+TN    L     Y G D +MVGN  ++PI H G + +  +  I   K VL+
Sbjct: 448  GDSGATAHVTNSTHNLQQSQPYGGSDSVMVGNGDFLPITHTGSTTLPSSSGILSLKDVLV 507

Query: 592  VPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAY 413
             P I  +L+SVSKLT  +PC   F  D   +       LL +G   NGLYVLK +   A+
Sbjct: 508  CPNIGKSLVSVSKLTRDYPCSVDFDCDYVRVTDKATKKLLAQGNNFNGLYVLKDSSVHAF 567

Query: 412  YLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSHRLPFFSS 233
            Y ++    S ++WH RLGH NQ++  +L     ++   +    C      KS RLPF SS
Sbjct: 568  YSSRQQTTSEDVWHMRLGHPNQQILQLLHKNKAVNISKSSKGICEACQYGKSSRLPFSSS 627

Query: 232  HNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFC 53
             +  + P   ++CDLWG + I SV  + YYA+FVD+ +RF WF PLK KS+F   F +F 
Sbjct: 628  CSTISKPLQKIHCDLWGPAPIKSVQGFSYYAIFVDNYSRFCWFYPLKFKSDFFKIFTIFQ 687

Query: 52   KMIKTSF 32
             +++  F
Sbjct: 688  ALVENQF 694


>AAK51235.1 polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  249 bits (637), Expect = 6e-69
 Identities = 159/497 (31%), Positives = 248/497 (49%), Gaps = 7/497 (1%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTV----TMMRPPIPSFEEVTS 1307
            Y REF  VCD L +I KPV +  K++  L+GLG +++  T     ++ +   P+F +V S
Sbjct: 144  YCREFIAVCDALSSIGKPVDESMKIFGFLNGLGREYDPITTVIQSSLSKISPPTFRDVIS 203

Query: 1306 LLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPT 1127
             +K  +++  + E ++      AF                 +   GRG Y          
Sbjct: 204  EVKGFDVKLQSYEESVTANPHMAFNTQRSEYTDNYTSGNRGK---GRGGY---------- 250

Query: 1126 GPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRN-NETG 950
                                             G   G + +++ GR F     N N TG
Sbjct: 251  ---------------------------------GQNRGRSGYSTRGRGFSQHQTNSNNTG 277

Query: 949  YRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
             R V CQICG+ GH ALKC+ RFD++Y+S        + A+ F++L + ++   +  EW 
Sbjct: 278  ERPV-CQICGRTGHTALKCYNRFDHNYQSV-------DTAQAFSSLRVSDS---SGKEWV 326

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQD-IP*YKVLL 593
             D+ AT H+T+  + L     YNG D ++VG+  Y+PI H+G + I  +   +P  +VL+
Sbjct: 327  PDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLNEVLV 386

Query: 592  VPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAY 413
             P I+ +LLSVSKL + +PC   F  +   I  +    ++ KG   NGLYVL+  +  A+
Sbjct: 387  CPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAF 446

Query: 412  YLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNK-THFCNRFAM*KSHRLPFFS 236
            Y N+    S E+WH RLGH+N  +   L++  EIS   ++ +  C    M KS +L FFS
Sbjct: 447  YSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFS 506

Query: 235  SHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMF 56
            S++   +    ++CDLWG S ++S   + YY +FVDD +R+ WF PLK KS+F   FV F
Sbjct: 507  SNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKSDFFAVFVAF 566

Query: 55   CKMIKTSFMITPKVFQT 5
              +++  F    KVFQ+
Sbjct: 567  QNLVENQFNTKIKVFQS 583


>AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  247 bits (630), Expect = 5e-68
 Identities = 167/503 (33%), Positives = 239/503 (47%), Gaps = 13/503 (2%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVT 1310
            EYL++ K +CDQL ++  PV+++ K++  L+GLG ++E    T+       P PS E+V 
Sbjct: 142  EYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPIKTTIENSMDALPGPSLEDVI 201

Query: 1309 -SLLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVR 1133
              L  Y +  +   E T V                         +P    N  T ++   
Sbjct: 202  PKLTGYDDRLQGYLEETAV-------------------------SPHVAFNITTSDD--- 233

Query: 1132 PTGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRN--- 962
                           NA    N   N G    N      G  SF++ GR F   + +   
Sbjct: 234  --------------SNASGYFNAY-NRGKGKSNR-----GRNSFSTRGRGFHQQISSTNS 273

Query: 961  ---NETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADK 791
               +++G   V CQICGK GH ALKCW+RF+NSY+   +P  +A       A+ I +   
Sbjct: 274  SSGSQSGGTSVVCQICGKMGHPALKCWHRFNNSYQYEELPRALA-------AMRITDITD 326

Query: 790  LNTDEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQ-DI 614
             + +EW  D+ AT H+TN P  L     Y+G D +MV +  ++PI H G + +  +  ++
Sbjct: 327  QHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNV 386

Query: 613  P*YKVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLK 434
            P   VL+ P I  +LLSVSKLT  +PC   F  D   I       LL+ G   +GLY LK
Sbjct: 387  PLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLK 446

Query: 433  TTKR-AAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KS 257
               +  A++  +    S E+WHRRLGH + +V   L     IS        C    + KS
Sbjct: 447  DDSQFKAFFSTRQQSASDEVWHRRLGHPHPQVLQQLVKTNSISINKTSKSLCEACQLGKS 506

Query: 256  HRLPFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNF 77
             RLPF SS   S  P + V+CDLWG S I SV  + YYA+F+D  +RF W  PLK KS+F
Sbjct: 507  TRLPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSRFSWIYPLKLKSDF 566

Query: 76   IDCFVMFCKMIKTSFMITPKVFQ 8
             + FV F K+++        VFQ
Sbjct: 567  YNIFVAFHKLVENQLNHKISVFQ 589


>AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana]
          Length = 1522

 Score =  247 bits (630), Expect = 5e-68
 Identities = 163/500 (32%), Positives = 239/500 (47%), Gaps = 11/500 (2%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVTS 1307
            +L++ K +CDQL ++  PV ++ K++  L+GLG ++E    T+       P  S +EV S
Sbjct: 142  FLKDLKHICDQLASVGSPVPEKMKIFSALNGLGREYEPIKTTIENSVDSNPSLSLDEVAS 201

Query: 1306 LLKYHE--MRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVR 1133
             L+ ++  ++    E T+                                 ++ +N    
Sbjct: 202  KLRGYDDRLQSYVTEPTI-------------------------------SPHVAFN---- 226

Query: 1132 PTGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVR---N 962
             T   S   H               N G    NS     G +SF++ GR F   +     
Sbjct: 227  VTHSDSGYYHNN-------------NRGKGRSNSGS---GKSSFSTRGRGFHQQISPTSG 270

Query: 961  NETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNT 782
            ++ G   + CQICGK GH ALKCW+RFDNSY+   +P  +A        + I +    + 
Sbjct: 271  SQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHEDLPMALAT-------MRITDVTDHHG 323

Query: 781  DEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQD-IP*Y 605
             EW  D+ A+ H+TN   +L     Y+G D IMV +  ++PI H G   I  +   IP  
Sbjct: 324  HEWIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLK 383

Query: 604  KVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTK 425
            +VL+ P I  +LLSVSKLT+ +PC   F  D+  I       LLV G  ++GLY L+  K
Sbjct: 384  EVLVCPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPK 443

Query: 424  RAAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTH-FCNRFAM*KSHRL 248
                Y  +    S E+WHRRLGHAN EV H L +   I  ++      C    + KS RL
Sbjct: 444  LQVLYSTRQNSASSEVWHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRL 503

Query: 247  PFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDC 68
            PF  S   ++ P + ++CDLWG S   SV  + YY +F+D  +RF WF PLK KS+F   
Sbjct: 504  PFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFFST 563

Query: 67   FVMFCKMIKTSFMITPKVFQ 8
            FVMF K+++       K+FQ
Sbjct: 564  FVMFQKLVENQLGHKIKIFQ 583


>AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]
          Length = 1392

 Score =  246 bits (629), Expect = 7e-68
 Identities = 156/498 (31%), Positives = 237/498 (47%), Gaps = 9/498 (1%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVTS 1307
            YL E K +CDQL +I  PV++++K++ +L+GLG ++E+    +       P P F++V  
Sbjct: 143  YLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDVYPGPCFDDVVY 202

Query: 1306 LLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPT 1127
             L   + + +T                                        T N+ V P 
Sbjct: 203  KLTTFDDKLSTY---------------------------------------TANSEVTPH 223

Query: 1126 GPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCF----GPSVRNN 959
                  +     GN            N+     G+  G  S++S GR F    G    N 
Sbjct: 224  LAFYTDKSYSSRGN-----------NNSRGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNG 272

Query: 958  ETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTD 779
                    CQIC K GH A KC+ RF+ +Y    +P+        F A+ + + ++ ++ 
Sbjct: 273  SGNGSKPTCQICRKYGHSAFKCYTRFEENYLPEDLPN-------AFAAMRVSDQNQASSH 325

Query: 778  EWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVI-VQNQDIP*YK 602
            EW  D+ AT HITN  D L N  +Y+G D ++VGN  ++PI HIG   + +    +P   
Sbjct: 326  EWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLED 385

Query: 601  VLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKR 422
            VL+ P I  +LLSVSKLT+ +PC F F  D+  IK  +   LL +G    GLYVLK    
Sbjct: 386  VLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPF 445

Query: 421  AAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSHRLPF 242
              YY  +      E+WH+RLGH N+EV   L     I      ++ C    M K  RLPF
Sbjct: 446  QTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPF 505

Query: 241  FSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFV 62
             +S  +S+ P + ++CDLWG + + S   + YY +F+D+ +RF WF PLK KS+F   FV
Sbjct: 506  VASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFV 565

Query: 61   MFCKMIKTSFMITPKVFQ 8
            +F ++++  +     +FQ
Sbjct: 566  LFQQLVENQYQHKIAMFQ 583


>CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] CAB81170.1
            retrotransposon like protein [Arabidopsis thaliana]
          Length = 1515

 Score =  246 bits (629), Expect = 7e-68
 Identities = 156/498 (31%), Positives = 237/498 (47%), Gaps = 9/498 (1%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVTS 1307
            YL E K +CDQL +I  PV++++K++ +L+GLG ++E+    +       P P F++V  
Sbjct: 140  YLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDVYPGPCFDDVVY 199

Query: 1306 LLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPT 1127
             L   + + +T                                        T N+ V P 
Sbjct: 200  KLTTFDDKLSTY---------------------------------------TANSEVTPH 220

Query: 1126 GPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCF----GPSVRNN 959
                  +     GN            N+     G+  G  S++S GR F    G    N 
Sbjct: 221  LAFYTDKSYSSRGN-----------NNSRGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNG 269

Query: 958  ETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTD 779
                    CQIC K GH A KC+ RF+ +Y    +P+        F A+ + + ++ ++ 
Sbjct: 270  SGNGSKPTCQICRKYGHSAFKCYTRFEENYLPEDLPN-------AFAAMRVSDQNQASSH 322

Query: 778  EWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVI-VQNQDIP*YK 602
            EW  D+ AT HITN  D L N  +Y+G D ++VGN  ++PI HIG   + +    +P   
Sbjct: 323  EWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLED 382

Query: 601  VLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKR 422
            VL+ P I  +LLSVSKLT+ +PC F F  D+  IK  +   LL +G    GLYVLK    
Sbjct: 383  VLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPF 442

Query: 421  AAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSHRLPF 242
              YY  +      E+WH+RLGH N+EV   L     I      ++ C    M K  RLPF
Sbjct: 443  QTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPF 502

Query: 241  FSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFV 62
             +S  +S+ P + ++CDLWG + + S   + YY +F+D+ +RF WF PLK KS+F   FV
Sbjct: 503  VASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFV 562

Query: 61   MFCKMIKTSFMITPKVFQ 8
            +F ++++  +     +FQ
Sbjct: 563  LFQQLVENQYQHKIAMFQ 580


>KZV38965.1 hypothetical protein F511_40717, partial [Dorcoceras hygrometricum]
          Length = 593

 Score =  234 bits (596), Expect = 2e-66
 Identities = 152/477 (31%), Positives = 235/477 (49%), Gaps = 3/477 (0%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRPPIPSFEEVTSLLK 1298
            E++R+FK +CD L AI  PV+D+ KV+ LL+ LG  +E+FT TM++PP PS+ E+ SLL+
Sbjct: 65   EHIRKFKSICDNLAAIGSPVTDKVKVFSLLTSLGPKYESFTTTMLKPPPPSYTELVSLLQ 124

Query: 1297 YHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGPS 1118
             +E R+T    T    +  AF                   P G G    +  H       
Sbjct: 125  GYEQRQTWFSTT-APTHQLAFYGQKQRHGSIEPQTNFN--PNGHG----FQAHKHHLSGH 177

Query: 1117 SNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYRVV 938
            +N   Q +  +              S      P G      A R             R  
Sbjct: 178  NNFSQQNVTKD--------------SKLQTPPPPGKRRMTPAEREM----------CRDE 213

Query: 937  FCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVDTG 758
             CQ+CG  GH A  CW+    +     +P       K   ALT++ +  +   EW  DTG
Sbjct: 214  ICQLCGGMGHVAKICWHLSKYTQAQDEIP-------KALAALTLDNS--VLDIEWTSDTG 264

Query: 757  ATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQ-NQDIP*YKVLLVPQI 581
            A+ H+T    +L N   Y G D +++G++  + I  +G + I   NQ +P   VL VP +
Sbjct: 265  ASHHMTGNAGMLKNKRPYFGNDSVLIGDDTLLGIKSVGDTQIKNGNQTLPLNDVLHVPNL 324

Query: 580  KCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYLNK 401
              NLLS+S+LT+ +P         F +K  +    L++G+ K  LYV+ ++    Y+ ++
Sbjct: 325  NRNLLSISQLTDHYPVNSELSNVDFCVKERETGPKLMQGQRKGDLYVI-SSPHELYFSHR 383

Query: 400  V*GVSYELWHRRLGHANQEVFHILQNKGEISTL-DNKTHF-CNRFAM*KSHRLPFFSSHN 227
                + E+WH+RLGH       +LQ KG I     NK  F C+   + K  +LPF+   N
Sbjct: 384  FKSSTAEVWHQRLGHPQISTLKLLQQKGLIDVQGSNKLQFMCDSCQLVKLSKLPFYVLEN 443

Query: 226  ISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMF 56
             S++ F+ ++CDLWG SL++S+ +++YYA  VDD +R+ WF PLK KS+F+D ++ F
Sbjct: 444  SSSSNFNKIHCDLWGPSLVLSLEKFHYYACIVDDFSRYTWFIPLKRKSDFVDAYLAF 500


>KZV57610.1 hypothetical protein F511_03070 [Dorcoceras hygrometricum]
          Length = 1011

 Score =  239 bits (611), Expect = 4e-66
 Identities = 161/494 (32%), Positives = 244/494 (49%), Gaps = 3/494 (0%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRPPIPSFEEVTSLLK 1298
            E++R+FK +CD L AI  PV+D+ KV+ LL+ LG  +E+FT TM++PP PS+ E+ SLLK
Sbjct: 84   EHIRKFKSICDNLAAIGSPVTDKVKVFSLLTSLGPRYESFTTTMLKPPRPSYTELVSLLK 143

Query: 1297 YHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGPS 1118
             +E R+     T    +  AF                     GRG    +  H       
Sbjct: 144  GYEQRQAWFSTT-APTHQLAFYGQKQRLGSVNHKPQTNFNSTGRG----FQAHKHHLNGH 198

Query: 1117 SNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYRVV 938
            +N   Q I+ ++   R   P                       R   PS R     YR  
Sbjct: 199  NNFGQQNIIKDSKMQRPPPPG---------------------KRRMTPSEREM---YRDE 234

Query: 937  FCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVDTG 758
             CQ+CG  GH A  CW+    +     +P  +A       ALT++ +  L+T EW  DTG
Sbjct: 235  TCQLCGGTGHVAKICWHLSKYTQAQDEIPQALA-------ALTLDNS-VLDT-EWTSDTG 285

Query: 757  ATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQ-NQDIP*YKVLLVPQI 581
            A+ H+T    +L NI  Y G D +++G+   + I  +G + I   NQ +P   VL VP +
Sbjct: 286  ASHHMTGNAGMLKNIRPYFGSDSVLIGDGTLLGIKSVGDTQIQNGNQTLPLNNVLHVPNL 345

Query: 580  KCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYLNK 401
              NLLS+S+LT+ +P    F    F +K       +++G+ K  LYV+ ++    ++ ++
Sbjct: 346  NRNLLSISQLTDHYPVNCEFSNVDFCVKERATGHKVMQGQRKGDLYVI-SSPHELHFSHR 404

Query: 400  V*GVSYELWHRRLGHANQEVFHILQNKGEISTL-DNKTHF-CNRFAM*KSHRLPFFSSHN 227
                + E+WH+RLGH       +LQ KG I     NK  F C+   + K  +LPF  S N
Sbjct: 405  FKSGTAEVWHQRLGHPQISTLKLLQQKGLIDVQGSNKLQFMCDSCQLAKLSKLPFSISEN 464

Query: 226  ISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFCKM 47
             S++ F  ++CDLWG + ++S+ ++ YYA  VDD +R+ WF PLK KS+F+D F  F K 
Sbjct: 465  SSSSSFIKIHCDLWGPAPVLSLEKFRYYACIVDDFSRYTWFIPLKKKSDFVDAFFAFEKY 524

Query: 46   IKTSFMITPKVFQT 5
            +   F    K+F +
Sbjct: 525  VARQFDKKIKIFHS 538


>XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [Eucalyptus grandis]
          Length = 616

 Score =  232 bits (592), Expect = 9e-66
 Identities = 156/480 (32%), Positives = 227/480 (47%), Gaps = 30/480 (6%)
 Frame = -2

Query: 1474 YLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMM-RPPIPSFEEVTSLLK 1298
            YLRE+K +CD+L AI KPV D  K++ +L GLGA++E F  T+    P P ++EV + L+
Sbjct: 147  YLREYKYICDRLNAIGKPVDDITKLFGVLEGLGAEYENFRTTIYCLKPQPEYDEVIAQLE 206

Query: 1297 YHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGG-----RGNYLTYNNHVR 1133
              E R      T    N   F                  +        RGNY    ++ R
Sbjct: 207  RFESRMQNYSRTQFNSNMAYFGQRHPQAQFKETTDGEYISQNSGFVAQRGNYRGGRSYGR 266

Query: 1132 PTGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTS---FNSAGRCF-GPSVR 965
              G   N + +G      N+     N      N    P  N+S   F S  R + GPS R
Sbjct: 267  --GRFLNNKGRGFRYPGDNLGYRSSNTSYMQENQRTKPYQNSSDSGFTSGFRPYAGPSQR 324

Query: 964  NNE-------------------TGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLV 842
              E                   T    + CQIC K GHDAL CWYRFDNSY++  +P   
Sbjct: 325  YQEVKSYPSNLTLSKSMQNEKATSSIKLECQICKKPGHDALHCWYRFDNSYQAEEIPT-- 382

Query: 841  ANEAKTFTALTIEEADKLNTDEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYI 662
                 T  A+ +++A      EW+ DTGAT HIT    IL N   Y G D +M+G+  ++
Sbjct: 383  -----TLAAIHLKDA---KGSEWYPDTGATAHITANSSILHNSSKYTGYDTVMIGDGSHL 434

Query: 661  PIAHIGQSVIVQNQDI-P*YKVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQN 485
             +   G +++   + + P   VL+VP IK NLL VSKLT+ + C F+F +   YIK    
Sbjct: 435  SVTCTGNTLLHTGKSLLPLNDVLIVPDIKKNLLLVSKLTDDYHCSFVFDKFGVYIKDNWT 494

Query: 484  HSLLVKGELKNGLYVLKTTKRAAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEIST 305
            ++ L+ G    GLY + +    A+   +   ++ + WH+RL H N  +   LQN+  I  
Sbjct: 495  NTTLLLGRKTKGLYQMNSKTTQAFLAQRHRAIAEDTWHQRLAHTNLNILKYLQNQKLIQC 554

Query: 304  LDNKTHFCNRFAM*KSHRLPFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDD 125
                 + C+   + K+  LPF SS +I+  P   ++CD+WG S + S   + YY LFVD+
Sbjct: 555  SSRMLNVCSSCQVAKAVALPFPSSESITTMPLQKIHCDIWGPSPVTSFQNFKYYVLFVDN 614


>ACP30598.1 disease resistance protein [Brassica rapa subsp. pekinensis]
          Length = 2301

 Score =  235 bits (599), Expect = 8e-64
 Identities = 158/498 (31%), Positives = 230/498 (46%), Gaps = 8/498 (1%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVT----MMRPPIPSFEEVT 1310
            +YLR+ K +CDQL +I +PV +  K++  L GLG ++E    +    M     PSFE+V 
Sbjct: 147  DYLRDIKTICDQLTSIGQPVDERMKIFAALLGLGKEYEPIKTSIEGSMDTQYHPSFEDVV 206

Query: 1309 SLLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRP 1130
              L   E R  +           AF                      RG      N  R 
Sbjct: 207  PRLVAFEDRLKSYTTDTAVSPHLAFNTV-------------------RGRPFFTRNRGRN 247

Query: 1129 TGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETG 950
             G  S    +G            P      ++S+ S   + S +S  R            
Sbjct: 248  RGGRSFFSTRG---------RGFPQH----LSSSSSSRSSVSADSEAR------------ 282

Query: 949  YRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
                 CQICGK GH+A++CW+RFDNSY+   + + +A       A+ + +       EW 
Sbjct: 283  ---PVCQICGKSGHEAMRCWHRFDNSYQLDEMHNALA-------AMRVSDMIDSRGGEWF 332

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQ-DIP*YKVLL 593
             DTGA+ HITN P  L N   Y G D +MVGN +Y+PI H G + I  +  ++    VL+
Sbjct: 333  PDTGASAHITNTPHHLQNAQPYMGSDSVMVGNGEYLPITHTGAASIASSSGNLILNDVLV 392

Query: 592  VPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAY 413
             PQI   LLSVSK T  +PC F F  D   I       +L++G    GLY +K     A+
Sbjct: 393  CPQIAKPLLSVSKFTTDYPCGFDFDADNVCIYDKATKKVLLQGRNTKGLYSIKEPAFHAF 452

Query: 412  YLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTL---DNKTHFCNRFAM*KSHRLPF 242
            +  +    S E+WH+RLGH N    HILQ    I ++         C    M KS RLPF
Sbjct: 453  FSTRQVAASDEVWHQRLGHPNP---HILQRLASIKSVFINKRSKSLCVSCQMAKSSRLPF 509

Query: 241  FSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFV 62
             +S  ++  P + ++CD+WG S ++SV  + YY + +D+ +R+ W  P+K KS+F   F+
Sbjct: 510  SASQFVATRPLERIHCDVWGPSPVVSVQEFKYYVVLIDNYSRYCWMYPMKKKSDFHSIFI 569

Query: 61   MFCKMIKTSFMITPKVFQ 8
             F  +++  F  T   FQ
Sbjct: 570  AFQSLVQNQFHTTIGTFQ 587


>KYP63355.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1040

 Score =  233 bits (595), Expect = 8e-64
 Identities = 160/498 (32%), Positives = 245/498 (49%), Gaps = 7/498 (1%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMM-RPPIPSFEEVTSLL 1301
            EY ++F+ +CDQL AI  P++++DKV++ L GLG  +  F+   + + P P F ++   +
Sbjct: 10   EYGKKFRTICDQLAAIGAPIANDDKVHWFLRGLGPSYANFSTGQLDQVPSPRFTDILCKV 69

Query: 1300 KYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGP 1121
            + H + + + E       +  ++                            NN  R +G 
Sbjct: 70   ESHAIFQASLEEPTPSQPAAFYV----------------------------NNQPRSSGN 101

Query: 1120 SSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYR- 944
             S+                    GN   ++ GS  GN   NS G   G   R N +G R 
Sbjct: 102  QSS------------------RSGNRGHSNDGSNGGNRG-NSNG---GSRQRRNNSGRRN 139

Query: 943  --VVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
              V  CQIC ++GH A KC  R+D S ES       AN A+ F A      D  N  +W+
Sbjct: 140  SYVPRCQICREQGHYANKCPVRWDRSSES-------ANLAQAFAASC--SVDTSNQSDWY 190

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQDIP*YKVLLV 590
            +DTGAT H+T     L++  +YNG D++ VGNE  + I H+G   +  +  +P   VL+V
Sbjct: 191  MDTGATSHMTPSSSQLTDSQNYNGTDHVFVGNETSLNITHVGSRSL--SHTVPLSDVLVV 248

Query: 589  PQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYY 410
            P +  NL+SV+KLT       IF+ D+F I+  +   +L +G    GLYV+    +A   
Sbjct: 249  PNLTKNLVSVNKLTRDNHAKAIFVDDSFVIQNRKTGRVLARGRCDQGLYVMDQGPQALLT 308

Query: 409  LNK-V*GVSYELWHRRLGHANQEVFHILQNKG--EISTLDNKTHFCNRFAM*KSHRLPFF 239
             +  +    +ELWH  LG  N +V + L  +G   +S++     +C    M KS RL F+
Sbjct: 309  TSSSLPRACFELWHSSLGRVNFDVINKLNQQGYLNVSSILPNPIYCTTCQMAKSKRLVFY 368

Query: 238  SSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVM 59
             ++  ++   DL++CDLWG S + SV  Y Y+ +FVDD +RF WF PL++KSNF D  V 
Sbjct: 369  DNNKRASAVLDLIHCDLWGPSPVASVAGYSYFVIFVDDFSRFTWFYPLRHKSNFYDVLVR 428

Query: 58   FCKMIKTSFMITPKVFQT 5
            F   ++  F    KVFQ+
Sbjct: 429  FKVFVENQFSRFIKVFQS 446


>KZV30597.1 hypothetical protein F511_05747 [Dorcoceras hygrometricum]
          Length = 1233

 Score =  231 bits (590), Expect = 8e-63
 Identities = 157/494 (31%), Positives = 241/494 (48%), Gaps = 3/494 (0%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRPPIPSFEEVTSLLK 1298
            E++R+FK +CD L  I  PV+D+ KV+ LL+ LG  +E+FT TM++PP PS+ E+ SLL+
Sbjct: 136  EHIRKFKSICDNLAVIGSPVTDKVKVFSLLTSLGPRYESFTTTMLKPPRPSYTELVSLLQ 195

Query: 1297 YHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGPS 1118
             +E R+     T    +  AF                     GRG    +  H       
Sbjct: 196  GYEQRQAWFSTT-APTHQLAFYGQKQRLGSVNHKPQTNFNSTGRG----FQAHKHHLNGH 250

Query: 1117 SNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYRVV 938
            +N   Q I+ ++   R   P                       R   PS R     YR  
Sbjct: 251  NNFGQQNIIKDSKMQRPPPPG---------------------KRRMTPSEREM---YRDE 286

Query: 937  FCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVDTG 758
             CQ+CG  GH A  CW+    +     +P  +A       ALT++ +  L+T EW  DT 
Sbjct: 287  TCQLCGGTGHVAKICWHLSKYTQAQDEIPQALA-------ALTLDNS-VLDT-EWTSDTR 337

Query: 757  ATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQ-NQDIP*YKVLLVPQI 581
            A+ H+T    +L NI  Y G D +++G+   + I  +G + I   NQ +P   VL VP +
Sbjct: 338  ASHHMTGNAGMLKNIRPYFGSDSVLIGDGTLLGIKSVGDTQIQNGNQTLPLNNVLHVPNL 397

Query: 580  KCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYLNK 401
              NLLS+S+LT+ +P    F    F +K       +++G+ K  LYV+ ++    ++ ++
Sbjct: 398  NRNLLSISQLTDHYPVNCEFSNVDFCVKERATGHKVMQGQRKGDLYVI-SSPHELHFSHR 456

Query: 400  V*GVSYELWHRRLGHANQEVFHILQNKGEISTL-DNKTHF-CNRFAM*KSHRLPFFSSHN 227
                + E+WH+RLGH       +LQ KG I     NK  F C+   + K  +LPF  S N
Sbjct: 457  FKSGTAEVWHQRLGHPQISTLKLLQQKGLIDVQGSNKLQFMCDSCQLAKLSKLPFSISEN 516

Query: 226  ISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFCKM 47
             S++ F  ++CDLWG + ++S  ++ YYA  VDD +R+ WF PLK KS+F+D +  F K 
Sbjct: 517  SSSSSFIKIHCDLWGPAPVLSFEKFRYYACIVDDFSRYTWFIPLKKKSDFVDAYFAFEKY 576

Query: 46   IKTSFMITPKVFQT 5
            +   F    K+F +
Sbjct: 577  VARQFDKKIKIFHS 590


>CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 putative protein
            [Arabidopsis thaliana]
          Length = 1415

 Score =  223 bits (569), Expect = 6e-60
 Identities = 128/346 (36%), Positives = 189/346 (54%), Gaps = 5/346 (1%)
 Frame = -2

Query: 1063 LPNFGNASINS-AGSPLGNTSFNSAGRCFGPSVRN---NETGYRVVFCQICGKRGHDALK 896
            L NF +   N  +G   G  ++ + GR F   + +   +++G R   CQIC K GH A K
Sbjct: 204  LINFDDKLQNGQSGGNRGRNNYTTKGRGFPQQISSGSPSDSGTRPT-CQICNKYGHSAYK 262

Query: 895  CWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVDTGATDHITN*PDILSN 716
            CW RFD++++S          +K F A+ + +     ++ W  D+GAT HITN    L +
Sbjct: 263  CWKRFDHAFQSEDF-------SKAFAAMRVSDQ---KSNPWVTDSGATSHITNSTSQLQS 312

Query: 715  IVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQ-DIP*YKVLLVPQIKCNLLSVSKLTNQF 539
               Y+G D ++VGN  ++PI HIG +V+  NQ ++P   VL+ P I  +LLSVSKLT+ +
Sbjct: 313  AQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPLRDVLVCPNITKSLLSVSKLTSDY 372

Query: 538  PCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYLNKV*GVSYELWHRRLG 359
            PC   F  D   +K      LL KG   N LY+L+  K  A Y ++    S E+WH RLG
Sbjct: 373  PCVIEFDSDGVIVKDKLTKQLLTKGTRHNDLYLLENPKFMACYSSRQQATSDEVWHMRLG 432

Query: 358  HANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSHRLPFFSSHNISANPFDLVYCDLWGK 179
            H NQ+V   L     I         C+   M K  +LPF SS  +S+   + V+CDLWG 
Sbjct: 433  HPNQDVLQQLLRNKAIVISKTSHSLCDACQMGKICKLPFASSDFVSSRLLERVHCDLWGP 492

Query: 178  SLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFIDCFVMFCKMIK 41
            + ++S   + YY +F+D+ +RF WF PL+ KS+F   F+ F KM++
Sbjct: 493  APVVSSQGFRYYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVE 538


>XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [Camelina sativa]
          Length = 475

 Score =  213 bits (541), Expect = 1e-59
 Identities = 140/450 (31%), Positives = 218/450 (48%), Gaps = 6/450 (1%)
 Frame = -2

Query: 1426 KPVSDEDKVYFLLSGLGADFETFTVTMM----RPPIPSFEEVTSLLKYHEMRKTTQENTL 1259
            K + +  K++  L+GLG +++  T  +     + P P+F +V S ++  + +  + +++ 
Sbjct: 82   KHIDESMKIFGFLNGLGREYDPITTVIQNYLSKLPTPTFNDVISEVQGFDTKLQSYDDSP 141

Query: 1258 VGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGPSSNVQHQGILGNAP 1079
                  AF+                                  T P ++ Q+Q      P
Sbjct: 142  SANPHLAFM-------------------------------TEKTNPCAS-QYQ------P 163

Query: 1078 NIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYRVVFCQICGKRGHDAL 899
            N R     F    +           +++ GR F     ++ T      CQICG+ G+ A+
Sbjct: 164  NSRGRGGRFSQNRVRGG--------YSTRGRGFSQHQSSSTTQGERPICQICGRTGYTAI 215

Query: 898  KCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWHVDTGATDHITN*PDILS 719
            KC+  FDN+Y+S  VP      ++ F  L + +    N  EWH+D+ AT HIT     L 
Sbjct: 216  KCYNHFDNNYQS-EVP------SQAFAYLRVSDE---NGREWHLDSAATAHITTLTSGLQ 265

Query: 718  NIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQD-IP*YKVLLVPQIKCNLLSVSKLTNQ 542
            +  SY G D +MVG+  Y+PI HIG + I   +  IP  +VL+ P ++ NLLSVSKL + 
Sbjct: 266  DATSYKGTDAVMVGDGAYLPITHIGSTTISSAKGTIPLNEVLVCPDMQKNLLSVSKLCDD 325

Query: 541  FPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYYLNKV*GVSYELWHRRL 362
            + C   F  D  YI  L    ++ KG  K GLYVL+  +  A+Y N+    + + WH RL
Sbjct: 326  YSCGVFFDSDFVYIIDLTTQKVVSKGPRKKGLYVLQNQEFVAFYSNRQCAATLDTWHHRL 385

Query: 361  GHANQEVFHILQNKGEISTLDNKTH-FCNRFAM*KSHRLPFFSSHNISANPFDLVYCDLW 185
            GH+N  +   L+   EI    ++T   C    M KS++L FFSS +    P + V+CDLW
Sbjct: 386  GHSNSRILQHLRACKEIEVNKSRTSPICEPCQMRKSNKLQFFSSDSRDLQPLERVHCDLW 445

Query: 184  GKSLIMSVHRYYYYALFVDDCTRFMWFCPL 95
            G S ++S   + YYA+FVDD +R+ WF PL
Sbjct: 446  GPSPVVSNKGFKYYAVFVDDHSRYSWFFPL 475


>AAC61290.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1149

 Score =  217 bits (553), Expect = 6e-58
 Identities = 123/342 (35%), Positives = 180/342 (52%), Gaps = 5/342 (1%)
 Frame = -2

Query: 1015 GNTSFNSAGRCFGPSVRNNETGYRVV---FCQICGKRGHDALKCWYRFDNSYESTHVPDL 845
            G  +F++ GR F     ++ +         CQICGKRGH AL+CW+RFD+SY+ +     
Sbjct: 239  GKGNFSTRGRGFQQQFSSSSSSVSASEKPMCQICGKRGHYALQCWHRFDDSYQHSEAA-- 296

Query: 844  VANEAKTFTALTIEEADKLNTDEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKY 665
                A  F+AL I   D  +   W  D+ AT HITN    L  +  Y G D +M  +  +
Sbjct: 297  ----AAAFSALHI--TDVSDDSGWVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNF 350

Query: 664  IPIAHIGQSVIVQNQ-DIP*YKVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQ 488
            +PI HIG + +     ++P   VL+ P I  +LLSVSKLT  +PC F F  D   +K   
Sbjct: 351  LPITHIGSANLPSTSGNLPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKA 410

Query: 487  NHSLLVKGE-LKNGLYVLKTTKRAAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEI 311
               +L KG     GLY L+  K   +Y  +    + E+WH RLGH N +V  +L NK  I
Sbjct: 411  TCKVLTKGSSTSEGLYKLENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQLLANKKAI 470

Query: 310  STLDNKTHFCNRFAM*KSHRLPFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFV 131
                + +  C    + KS RLPF +S  I++ P + V+CDLWG + + S+  + YY +F+
Sbjct: 471  QINKSTSKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQGFQYYVIFI 530

Query: 130  DDCTRFMWFCPLKNKSNFIDCFVMFCKMIKTSFMITPKVFQT 5
            D+ +RF WF PLK+KS+F   F+ F   ++         FQ+
Sbjct: 531  DNRSRFCWFYPLKHKSDFCSLFMKFQSFVENLLQTKIGTFQS 572


>XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [Brassica napus]
          Length = 2800

 Score =  216 bits (550), Expect = 2e-57
 Identities = 149/503 (29%), Positives = 227/503 (45%), Gaps = 12/503 (2%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVT 1310
            EYL E K + DQL +I  P+++ +K+Y +L+GLG ++E  T  +       P P FE+V 
Sbjct: 143  EYLSEIKSLSDQLDSIGAPITEHEKIYGVLNGLGREYEAVTTVIEHSMDVFPGPCFEDVV 202

Query: 1309 -SLLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVR 1133
              L  + +  +T Q+ T V  +   +                      RG Y        
Sbjct: 203  HKLTGFDDKLRTYQQQTDVSPHQAFY--------------------ANRGGYS------- 235

Query: 1132 PTGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCF------GPS 971
                                       G     + G   G  S+++ GR F      G  
Sbjct: 236  ---------------------------GRGRGQNRGGYRGRASYSTQGRGFPQQFGQGAQ 268

Query: 970  VRNNETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADK 791
              ++ +  +   CQICGK    A KC+ RFD ++  +  P     +A   T +       
Sbjct: 269  RTSSASDNQRPTCQICGKYD-PAYKCYKRFDVNFVVSDPPP----QANVLTTVAAHNQST 323

Query: 790  LNTDEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQ-SVIVQNQDI 614
             +  EW+ D+G++ H+TN  D L     Y GLD +MVGN +++PI H+G  S+  Q+  I
Sbjct: 324  PSGAEWYPDSGSSHHVTNSVDHLDTAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKI 383

Query: 613  P*YKVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLK 434
            P   VL+ P I  +LLSVSKLT+ FPC F F   T  +K      +L KG    GLY L 
Sbjct: 384  PLSDVLICPDITKSLLSVSKLTDDFPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLD 443

Query: 433  TTKRAAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSH 254
              +   +Y  +    S  +WH+RLGH N +V   L     IS        C    + K+ 
Sbjct: 444  VPQLLTFYSFRQQVASDGVWHKRLGHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTC 503

Query: 253  RLPFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFI 74
            RLPF SS   S+ P + ++CD+WG + ++S   + +Y +F+D+ +RF W  PLK KS+  
Sbjct: 504  RLPFSSSDFRSSRPLERIHCDVWGPAPVVSTQGFRFYVVFIDNYSRFCWLYPLKMKSDVF 563

Query: 73   DCFVMFCKMIKTSFMITPKVFQT 5
              F  F   ++  +     VFQ+
Sbjct: 564  TIFKAFQSQVENQYKQKISVFQS 586



 Score =  216 bits (550), Expect = 2e-57
 Identities = 149/503 (29%), Positives = 227/503 (45%), Gaps = 12/503 (2%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMMRP----PIPSFEEVT 1310
            EYL E K + DQL +I  P+++ +K+Y +L+GLG ++E  T  +       P P FE+V 
Sbjct: 1532 EYLSEIKSLSDQLDSIGAPITEHEKIYGVLNGLGREYEAVTTVIEHSMDVFPGPCFEDVV 1591

Query: 1309 -SLLKYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVR 1133
              L  + +  +T Q+ T V  +   +                      RG Y        
Sbjct: 1592 HKLTGFDDKLRTYQQQTDVSPHQAFY--------------------ANRGGYS------- 1624

Query: 1132 PTGPSSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCF------GPS 971
                                       G     + G   G  S+++ GR F      G  
Sbjct: 1625 ---------------------------GRGRGQNRGGYRGRASYSTQGRGFPQQFGQGAQ 1657

Query: 970  VRNNETGYRVVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADK 791
              ++ +  +   CQICGK    A KC+ RFD ++  +  P     +A   T +       
Sbjct: 1658 RTSSASDNQRPTCQICGKYD-PAYKCYKRFDVNFVVSDPPP----QANVLTTVAAHNQST 1712

Query: 790  LNTDEWHVDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQ-SVIVQNQDI 614
             +  EW+ D+G++ H+TN  D L     Y GLD +MVGN +++PI H+G  S+  Q+  I
Sbjct: 1713 PSGAEWYPDSGSSHHVTNSVDHLDTAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKI 1772

Query: 613  P*YKVLLVPQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLK 434
            P   VL+ P I  +LLSVSKLT+ FPC F F   T  +K      +L KG    GLY L 
Sbjct: 1773 PLSDVLICPDITKSLLSVSKLTDDFPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLD 1832

Query: 433  TTKRAAYYLNKV*GVSYELWHRRLGHANQEVFHILQNKGEISTLDNKTHFCNRFAM*KSH 254
              +   +Y  +    S  +WH+RLGH N +V   L     IS        C    + K+ 
Sbjct: 1833 VPQLLTFYSFRQQVASDGVWHKRLGHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTC 1892

Query: 253  RLPFFSSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKNKSNFI 74
            RLPF SS   S+ P + ++CD+WG + ++S   + +Y +F+D+ +RF W  PLK KS+  
Sbjct: 1893 RLPFSSSDFRSSRPLERIHCDVWGPAPVVSTQGFRFYVVFIDNYSRFCWLYPLKMKSDVF 1952

Query: 73   DCFVMFCKMIKTSFMITPKVFQT 5
              F  F   ++  +     VFQ+
Sbjct: 1953 TIFKAFQSQVENQYKQKISVFQS 1975


>KYP43751.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 483

 Score =  202 bits (513), Expect = 2e-55
 Identities = 144/470 (30%), Positives = 225/470 (47%), Gaps = 7/470 (1%)
 Frame = -2

Query: 1477 EYLREFKMVCDQLQAIKKPVSDEDKVYFLLSGLGADFETFTVTMM-RPPIPSFEEVTSLL 1301
            EY ++F+ +CDQL AI  P+++++KV++ L GLG  +  F+   +   P+P F ++   +
Sbjct: 75   EYGKKFRTICDQLAAIGAPIANDNKVHWFLRGLGPSYANFSTGQLDHVPLPRFTDILCKV 134

Query: 1300 KYHEMRKTTQENTLVGGNSFAFLXXXXXXXXXXXXXXXXRAPGGRGNYLTYNNHVRPTGP 1121
            + H + + + E       +  +                             NN  R +G 
Sbjct: 135  ESHAIFQASLEEPTPSQPAAFYA----------------------------NNQPRSSGN 166

Query: 1120 SSNVQHQGILGNAPNIRNVLPNFGNASINSAGSPLGNTSFNSAGRCFGPSVRNNETGYR- 944
             S+                    GN   ++ GS  GN   NS G   G   R N +G R 
Sbjct: 167  HSS------------------RSGNRGHSNDGSNGGNRG-NSIG---GSRQRRNNSGRRN 204

Query: 943  --VVFCQICGKRGHDALKCWYRFDNSYESTHVPDLVANEAKTFTALTIEEADKLNTDEWH 770
              V  CQIC ++GH A KC  R+D S ES  +    A      T+         N  + +
Sbjct: 205  SYVPRCQICCEQGHYANKCPVRWDRSSESASLAQAFAGSCSLDTS---------NKSDSY 255

Query: 769  VDTGATDHITN*PDILSNIVSYNGLDYIMVGNEKYIPIAHIGQSVIVQNQDIP*YKVLLV 590
            +DTGAT H+T     L++  +YNG D + VGN   + I H+G   +  +  +P   VL+V
Sbjct: 256  MDTGATSHMTPSSSQLTDSQNYNGTDRVFVGNGTSLNITHVGSRSL--SHTVPLLDVLVV 313

Query: 589  PQIKCNLLSVSKLTNQFPCYFIFLRDTFYIKPLQNHSLLVKGELKNGLYVLKTTKRAAYY 410
            P +  NL+SVSKLT       IF+ D+F I+  +   +L +G    GLYV+    +A   
Sbjct: 314  PNLTKNLVSVSKLTRDNHAKAIFVDDSFVIQNRKTGRVLARGRCDQGLYVMDQGPQALLT 373

Query: 409  LNK-V*GVSYELWHRRLGHANQEVFHILQNKG--EISTLDNKTHFCNRFAM*KSHRLPFF 239
             +  +   S+ELWH RLGH N +V + L  +G   +S++  K   C    M KS RL F+
Sbjct: 374  TSSSLPRASFELWHSRLGHVNFDVINKLNQQGYLNVSSILPKPICCTTCQMAKSKRLVFY 433

Query: 238  SSHNISANPFDLVYCDLWGKSLIMSVHRYYYYALFVDDCTRFMWFCPLKN 89
             ++  ++   DL++CDLWG S + S+  Y Y+ +FVDD +RF  F PL++
Sbjct: 434  DNNKRASAVLDLIHCDLWGPSPVASIVGYSYFVIFVDDFSRFTRFYPLRH 483


Top