BLASTX nr result

ID: Astragalus22_contig00032832 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00032832
         (991 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_013624380.1| PREDICTED: uncharacterized protein LOC106330...   247   8e-72
ref|XP_015960841.1| uncharacterized protein LOC107484813 [Arachi...   243   2e-71
ref|XP_013594438.1| PREDICTED: uncharacterized protein LOC106302...   237   3e-67
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   236   4e-66
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   235   7e-66
gb|KYP69518.1| Retrovirus-related Pol polyprotein from transposo...   225   3e-64
gb|KYP64799.1| Retrovirus-related Pol polyprotein from transposo...   224   7e-64
gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) ...   227   4e-63
gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygro...   214   1e-62
ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804...   221   2e-62
ref|XP_022548561.1| uncharacterized protein LOC111201234 [Brassi...   223   3e-62
gb|KYP41111.1| Retrovirus-related Pol polyprotein from transposo...   214   4e-62
ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804...   221   6e-62
ref|XP_018454140.1| PREDICTED: uncharacterized protein LOC108825...   222   6e-62
gb|KZV17946.1| hypothetical protein F511_10775 [Dorcoceras hygro...   222   7e-62
emb|CAA72989.1| unnamed protein product [Brassica oleracea var. ...   223   7e-62
ref|XP_009121252.1| PREDICTED: uncharacterized protein LOC103846...   221   8e-62
ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804...   221   9e-62
dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis t...   223   1e-61
gb|PRQ38882.1| putative RNA-directed DNA polymerase [Rosa chinen...   223   1e-61

>ref|XP_013624380.1| PREDICTED: uncharacterized protein LOC106330465 [Brassica oleracea
            var. oleracea]
          Length = 803

 Score =  247 bits (630), Expect = 8e-72
 Identities = 121/222 (54%), Positives = 152/222 (68%), Gaps = 4/222 (1%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KS  S +   F +  HTQ+++ IK IRSDNA E   TD LQ+ G  H F CP+ PQQN
Sbjct: 526  KNKSSVSTVFPEFLRLIHTQYNSNIKAIRSDNAPELAFTDLLQEKGIEHYFSCPYTPQQN 585

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+L+FQ+ V L YWGEC+  A+YLIN+TPSPLL N+SPFE+L SK 
Sbjct: 586  SVVERKHQHILNVARALLFQSKVPLIYWGECIQTAIYLINRTPSPLLQNKSPFELLTSKI 645

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P Y  ++VFGCL Y ST    RNKFTP A P +F+GYP GYKG+++ D    K +ISR+V
Sbjct: 646  PSYDHLRVFGCLCYTSTLQKDRNKFTPRANPGVFLGYPHGYKGYRVLDTTTNKIIISRNV 705

Query: 580  IFHESIFPFKKNS-TIDLHLHDPATTVLP---NISDLPSTFD 693
            +FHES FPF K+S  +D   +     VLP     S  P+ FD
Sbjct: 706  VFHESYFPFAKDSNNLDAENYFFDQDVLPMHVPDSSFPAFFD 747


>ref|XP_015960841.1| uncharacterized protein LOC107484813 [Arachis duranensis]
          Length = 672

 Score =  243 bits (620), Expect = 2e-71
 Identities = 134/308 (43%), Positives = 175/308 (56%), Gaps = 10/308 (3%)
 Frame = +1

Query: 67   IKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQNAVVERKHQH 246
            IKAF+    TQFS  IKC RSDNAKE   TDFLQ+ G  H F CP+RPQQNAVVERKHQH
Sbjct: 380  IKAFYAMIKTQFSKKIKCFRSDNAKELAATDFLQEKGVLHHFSCPYRPQQNAVVERKHQH 439

Query: 247  LLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSPDYSLVKVF 426
            LLNVAR+L FQ+ V +T+ GECVS A +LIN+TPS LL  +SPFE+L+ KSP+Y  +K+F
Sbjct: 440  LLNVARALYFQSQVPITFLGECVSTAAFLINRTPSSLLKMKSPFELLFEKSPNYKAMKIF 499

Query: 427  GCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVIFHESIFPF 606
            GCL+Y +T    R KF P A   +F GYP GYKG+KL+++  ++F+IS DVIFHE   PF
Sbjct: 500  GCLAYATTNTSSRLKFDPRADTTVFFGYPFGYKGYKLYNLRTKQFLISMDVIFHEDTMPF 559

Query: 607  KKNS--------TIDLHLHDPA--TTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756
             +N           D+ L +P   +  LPN   +PS                        
Sbjct: 560  AQNPHTQLNNDIFFDVVLPNPILDSEPLPNAPTIPSV----------------------- 596

Query: 757  XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936
                               IPQ +   +T   +  L     +   ++ PRRS+RT + P+
Sbjct: 597  -----------------PKIPQPSTTTNTQSQILPLIENQISPSTSIQPRRSTRTKHTPS 639

Query: 937  YLKEYDCN 960
            YL +Y C+
Sbjct: 640  YLHDYICH 647


>ref|XP_013594438.1| PREDICTED: uncharacterized protein LOC106302482 [Brassica oleracea
            var. oleracea]
          Length = 977

 Score =  237 bits (604), Expect = 3e-67
 Identities = 109/222 (49%), Positives = 154/222 (69%), Gaps = 2/222 (0%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K+KSD   L   F +  +TQ++  +K IRSDNA E      ++ NG  H F C + P+QN
Sbjct: 683  KRKSDVITLFPEFLQRVYTQYNVRVKAIRSDNAPELRFAKLIKTNGMIHYFSCAYTPEQN 742

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQHLLNVAR+L+FQ+ V L YW +C++ AV+LIN+ PSPLLD+++P+E+L  + 
Sbjct: 743  SVVERKHQHLLNVARALLFQSNVPLLYWSDCITTAVFLINRIPSPLLDHRTPYEVLLKRK 802

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDYSL++ FGCL YVST    RNKF+P A PC+F+GYP GYKG+KL D+ +    ISR V
Sbjct: 803  PDYSLLRSFGCLCYVSTLQKDRNKFSPRARPCLFLGYPSGYKGYKLLDLDNNSVSISRHV 862

Query: 580  IFHESIFPFKKNSTI--DLHLHDPATTVLPNISDLPSTFDYS 699
            +FHES++P K +++I  D   H      +P  +DL ++ D++
Sbjct: 863  VFHESVYPLKSSTSIIPDFFSHYILPNSVPYTADLDASIDHN 904


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana]
          Length = 1633

 Score =  236 bits (601), Expect = 4e-66
 Identities = 127/307 (41%), Positives = 172/307 (56%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KS+ S++   F K   TQ++  IK IRSDN KE   T F+++ G  HQF C + PQQN
Sbjct: 595  KNKSEVSNIFPVFVKLIFTQYNAKIKAIRSDNVKELAFTKFVKEQGMIHQFSCAYTPQQN 654

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQHLLN+ARSL+FQ+ V L YW +CV  A YLIN+ PSPLLDN++PFE+L  K 
Sbjct: 655  SVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSPLLDNKTPFELLLKKI 714

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDY+L+K   CL Y ST +  RNKF+P A PC+F+GYP GYKG+K+ D+      I+R+V
Sbjct: 715  PDYTLLK--SCLCYASTNVHDRNKFSPRARPCVFLGYPSGYKGYKVLDLESHSISITRNV 772

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
            +FHE+ FPFK +  +   +     ++LP    LP+   +                     
Sbjct: 773  VFHETKFPFKTSKFLKESVDMFPNSILP----LPAPLHF--------VESMPLDDDLRAD 820

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939
                              +P     Q+T + LD+       + ++VP  R  R    PAY
Sbjct: 821  DNNASTSNSASSASSIPPLPSTVNTQNT-DALDI-------DTNSVPIARPKRNAKAPAY 872

Query: 940  LKEYDCN 960
            L EY CN
Sbjct: 873  LSEYHCN 879


>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
 emb|CAB78488.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score =  235 bits (599), Expect = 7e-66
 Identities = 124/310 (40%), Positives = 174/310 (56%), Gaps = 1/310 (0%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            + K D S +   F K   TQF+  IK IRSDNA E   T+ ++++G  H F C + PQQN
Sbjct: 672  RNKKDVSSVFPEFIKLVSTQFNAKIKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQN 731

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+L+FQ+ + + YW +CV+ AV+LIN+ PSPLL+N+SP+E++ +K 
Sbjct: 732  SVVERKHQHILNVARALLFQSNIPMQYWSDCVTTAVFLINRLPSPLLNNKSPYELILNKQ 791

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDYSL+K FGCL +VST    R KFTP A  C+F+GYP GYKG+K+ D+      +SR+V
Sbjct: 792  PDYSLLKNFGCLCFVSTNAHERTKFTPRARACVFLGYPSGYKGYKVLDLESHSVTVSRNV 851

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPN-ISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756
            +F E +FPFK +      L + A  + PN I  LP+   +                    
Sbjct: 852  VFKEHVFPFKTS-----ELLNKAVDMFPNSILPLPAPLHF-------VETMPLIDEDSLI 899

Query: 757  XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936
                               +P      S +E  D+       + + VP  RS RT   P+
Sbjct: 900  PTTTDSRTADNHASSSSSALPSIIPPSSNTETQDI-------DSNAVPITRSKRTTRAPS 952

Query: 937  YLKEYDCNLL 966
            YL EY C+L+
Sbjct: 953  YLSEYHCSLV 962


>gb|KYP69518.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 731

 Score =  225 bits (574), Expect = 3e-64
 Identities = 127/306 (41%), Positives = 169/306 (55%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            + KSD    I   F Y   QF T IK  RSDN +E    DF  + G  HQF C  RP+QN
Sbjct: 237  QNKSDCIKNIPQIFAYVENQFQTKIKSFRSDNVRELHFKDFFLEKGVLHQFLCVERPKQN 296

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKH H+LN+AR+LMFQ+ V +  WG+ V   V+++N+TPSP+L++ SP+EILY+K 
Sbjct: 297  SVVERKHLHILNIARTLMFQSNVPIKIWGDYVKTVVFIMNRTPSPILNHISPYEILYNKV 356

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P+YS  + FG L Y ST L  R+KF+P A   +FIGYP GYKG+KLFD+   +  IS+DV
Sbjct: 357  PNYSDFRTFGTLCYASTLLSGRHKFSPRAIAAVFIGYPHGYKGYKLFDLTTHQTFISKDV 416

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
             F+E IFPF+ +++ D         + PN++  P+TFD S                    
Sbjct: 417  KFYEHIFPFQNSNSSDRGHGVFNDQITPNLNH-PATFDDSDPIQ---------------- 459

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939
                               P +T  Q T + L    SP   EP   P RRS+R  NPP Y
Sbjct: 460  -------------------PTHTQNQFTIQQL----SP-PNEPDQAPIRRSNRAVNPPGY 495

Query: 940  LKEYDC 957
            L +Y C
Sbjct: 496  LSDYHC 501


>gb|KYP64799.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 714

 Score =  224 bits (571), Expect = 7e-64
 Identities = 132/307 (42%), Positives = 169/307 (55%), Gaps = 1/307 (0%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KSD + +I  F  Y  TQF+ T K  RSDNAKE   TD   + G  HQF C  RPQQN
Sbjct: 407  KNKSDCAIVIPQFISYIETQFNKTPKTFRSDNAKELSFTDLFSKKGIIHQFSCVERPQQN 466

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKH H+LN+AR+LMFQ+ V L +WGECV  AV+L+N+TPS +LD++SPFEILY K 
Sbjct: 467  SVVERKHLHILNIARALMFQSNVPLKFWGECVKTAVFLMNRTPSLILDSKSPFEILYDKI 526

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P+Y   +VFG L Y ST L  R+KFT  A   +F+GYP+GYKG+KL D+  ++  ISRDV
Sbjct: 527  PNYVDFRVFGSLCYASTLLSSRHKFTHRAVAAVFLGYPKGYKGYKLLDLSTKQIFISRDV 586

Query: 580  IFHESIFPFK-KNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756
             F E IF FK  + TI     DP  T  PNI+   +  D                     
Sbjct: 587  KFFEHIFSFKPAHQTIS----DPYIT--PNITYSHTLDDID------------------- 621

Query: 757  XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936
                                P  T +  T  P  ++  P+         RRS++T NPP+
Sbjct: 622  --------------DHNTIAPYTTTIPHT--PPTIMDQPSL--------RRSNKTSNPPS 657

Query: 937  YLKEYDC 957
            YL +Y C
Sbjct: 658  YLNDYHC 664


>gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras
            hygrometricum]
          Length = 1404

 Score =  227 bits (578), Expect = 4e-63
 Identities = 110/215 (51%), Positives = 146/215 (67%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KSD   +   F +   TQF  T+K +RSDNA E    DF  + G TH   C  RPQQN
Sbjct: 613  KSKSDVLSIFPDFCRMVSTQFGVTVKSVRSDNAPELGFADFFAKAGITHYHSCVERPQQN 672

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+L+FQ+ + L YW +C++ +VYLIN+TPSP+L +++PFE+L+ K 
Sbjct: 673  SVVERKHQHILNVARALLFQSHIPLDYWCDCINTSVYLINRTPSPILAHKTPFELLHGKL 732

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P YS +KVFGCL Y ST L  R+KF+P A  C+FIGYP GYKG+KL ++   +  ISRDV
Sbjct: 733  PSYSHLKVFGCLCYASTLLSSRHKFSPRAIRCVFIGYPPGYKGYKLLNLETNEIFISRDV 792

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPS 684
            IFHE+ FP++  +T  + L D    V P+    PS
Sbjct: 793  IFHENTFPYQ--NTSPMSLSDMTFEVSPSSQITPS 825


>gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygrometricum]
          Length = 442

 Score =  214 bits (546), Expect = 1e-62
 Identities = 100/190 (52%), Positives = 133/190 (70%)
 Frame = +1

Query: 40  KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
           + KS+ S +   F +  +TQF   IK +RSDNA E    +   + G  H   C  RPQQN
Sbjct: 3   RSKSNVSSIFPTFCQKIYTQFGAKIKAVRSDNAPELGFVNLFNKLGIIHNHSCVERPQQN 62

Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
           +VVERKHQH+LNVAR+LMFQ+ + + Y  +C+  +VYLIN+TPSPLL +Q+PFE+L+ K 
Sbjct: 63  SVVERKHQHILNVARALMFQSHLPIAYGSDCIVTSVYLINRTPSPLLSHQTPFEVLHRKR 122

Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
           P YS +KVFGCL Y ST L  R+KF+P A  C+F+GYP GYKG+KL ++   +  ISRDV
Sbjct: 123 PAYSHLKVFGCLCYASTLLSSRSKFSPRAVKCVFLGYPPGYKGYKLINLDTNEIFISRDV 182

Query: 580 IFHESIFPFK 609
           IFHE +FPF+
Sbjct: 183 IFHEHVFPFQ 192


>ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804180 isoform X6 [Tarenaya
            hassleriana]
          Length = 789

 Score =  221 bits (564), Expect = 2e-62
 Identities = 121/309 (39%), Positives = 163/309 (52%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KSD       F  +   QF+ +IKC+RSDNA E        + G  HQF CP+ PQQN
Sbjct: 187  KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 246

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+  +VYLIN+TPSPLL N++PFE+L   S
Sbjct: 247  SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 306

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P YS ++VFGCL YVST    R+KF P A   +F+GYP G KG+K+ D+     +ISR+V
Sbjct: 307  PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 366

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
            +FHE+ FPFK        L     +V P   +  S  + S                    
Sbjct: 367  VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 406

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939
                               P +      S   D     T++  H   P+R S+T   PAY
Sbjct: 407  -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 456

Query: 940  LKEYDCNLL 966
            L +Y C L+
Sbjct: 457  LSDYHCYLI 465


>ref|XP_022548561.1| uncharacterized protein LOC111201234 [Brassica napus]
          Length = 927

 Score =  223 bits (567), Expect = 3e-62
 Identities = 124/315 (39%), Positives = 174/315 (55%), Gaps = 7/315 (2%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            + K +   +   F     TQ+ T ++ +RSDNAKE + TD  +  G      CP  PQQN
Sbjct: 657  RTKDEVLRVFPEFITMVETQYKTKVRGVRSDNAKELMFTDLYRAKGIKAFHSCPETPQQN 716

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+LMFQ+ +SL YW +CV  AV+LIN+ PSPLL ++SP+++L+ K 
Sbjct: 717  SVVERKHQHILNVARALMFQSKLSLEYWSDCVLTAVFLINRLPSPLLQDKSPYQLLHKKK 776

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDYS +KVFGCL YVST+  +R+KF P + PC+F+GYP G+KG+K+ D+      +SR+V
Sbjct: 777  PDYSEIKVFGCLCYVSTSSKNRHKFQPRSRPCLFLGYPAGFKGYKVMDLDTNIISVSRNV 836

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNI-------SDLPSTFDYSXXXXXXXXXXXXX 738
            +FHE IFPF  + + DLH HD    + P +       SD+P++                 
Sbjct: 837  VFHEDIFPFTCSES-DLH-HDLYPNIDPVVVNKTHIASDVPTS----------------- 877

Query: 739  XXXXXXXXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSR 918
                                     +     V  T EP+     PT AE      + S R
Sbjct: 878  -------------------------VNTEIPVVVTDEPVVDSQIPTKAE------KISKR 906

Query: 919  THNPPAYLKEYDCNL 963
            T   PAYL++Y CN+
Sbjct: 907  TSKQPAYLEDYYCNM 921


>gb|KYP41111.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 476

 Score =  214 bits (545), Expect = 4e-62
 Identities = 106/220 (48%), Positives = 144/220 (65%)
 Frame = +1

Query: 40  KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
           + KSD    I   F Y   QF T IK  RSDN +E    DF  + G  HQF C  RP+QN
Sbjct: 246 QNKSDCIKNIPQIFAYVENQFQTKIKSFRSDNVRELHFKDFFLEKGVLHQFLCVERPKQN 305

Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
           +VVERKH H+LN+AR+LMFQ+ V +  WG+ V   V+++N+TPSP+L++ SP+EILY+K 
Sbjct: 306 SVVERKHLHILNIARTLMFQSNVPIKIWGDYVKTVVFIMNRTPSPILNHISPYEILYNKV 365

Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
           P+YS  + FG L Y ST L  R+KF+P A   +FIGYP GYKG+KLFD+   +  IS+DV
Sbjct: 366 PNYSDFRTFGTLCYASTLLSGRHKFSPRAIAAVFIGYPHGYKGYKLFDLTTHQTFISKDV 425

Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYS 699
            F+E IFPF+ +++ D         + PN++  P+TFD S
Sbjct: 426 KFYEHIFPFQNSNSSDRGHGVFNDQITPNLNH-PATFDDS 464


>ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804180 isoform X5 [Tarenaya
            hassleriana]
          Length = 886

 Score =  221 bits (564), Expect = 6e-62
 Identities = 121/309 (39%), Positives = 163/309 (52%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KSD       F  +   QF+ +IKC+RSDNA E        + G  HQF CP+ PQQN
Sbjct: 284  KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 343

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+  +VYLIN+TPSPLL N++PFE+L   S
Sbjct: 344  SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 403

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P YS ++VFGCL YVST    R+KF P A   +F+GYP G KG+K+ D+     +ISR+V
Sbjct: 404  PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 463

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
            +FHE+ FPFK        L     +V P   +  S  + S                    
Sbjct: 464  VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 503

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939
                               P +      S   D     T++  H   P+R S+T   PAY
Sbjct: 504  -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 553

Query: 940  LKEYDCNLL 966
            L +Y C L+
Sbjct: 554  LSDYHCYLI 562


>ref|XP_018454140.1| PREDICTED: uncharacterized protein LOC108825334 [Raphanus sativus]
          Length = 980

 Score =  222 bits (566), Expect = 6e-62
 Identities = 108/214 (50%), Positives = 143/214 (66%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            + K +   +  AF K   T++   +K +RSDNA+E L T F Q  G T    CP  P+QN
Sbjct: 656  RTKDEVIQVFPAFVKQVETKYGVRVKSVRSDNAQELLFTKFYQAQGITAYNSCPETPEQN 715

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVARSLMFQ+ V L++WG+CV  AV+LIN+TP+ LL N++PFE+L   S
Sbjct: 716  SVVERKHQHILNVARSLMFQSHVPLSFWGDCVLTAVFLINRTPAKLLHNKTPFEVLNGTS 775

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDYS +K FGCL Y ST+   R+KF P +  CIF+GYP G KG+KL D+   K  ISR+V
Sbjct: 776  PDYSQLKTFGCLCYGSTSPKQRHKFLPRSRACIFLGYPPGVKGYKLMDLESNKIYISRNV 835

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLP 681
            +FHE +FP KKN   DLH+ +        ++ LP
Sbjct: 836  LFHEDLFPLKKN--YDLHVPEWVNPSSEPLATLP 867


>gb|KZV17946.1| hypothetical protein F511_10775 [Dorcoceras hygrometricum]
          Length = 989

 Score =  222 bits (566), Expect = 7e-62
 Identities = 104/194 (53%), Positives = 137/194 (70%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KS+   +   F +  H QF  +IK +RSDNA E   ++F +  G      C  RPQQN
Sbjct: 552  KSKSEVIDIFPTFCRMIHKQFGKSIKSVRSDNAPELKFSEFFKAEGIVAFHSCVERPQQN 611

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+L+FQ+ + L YW EC+  AVYLIN+TP+PLL N++PFE++++K 
Sbjct: 612  SVVERKHQHILNVARALLFQSGIPLVYWSECILTAVYLINRTPAPLLSNKTPFELMHNKP 671

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P YS ++VFGCL Y ST L  R KF+P AT  IF+GYP GYKG+KL ++   +  ISRDV
Sbjct: 672  PTYSHLRVFGCLCYGSTLLNQRTKFSPRATRSIFLGYPPGYKGYKLLNLDTNEVYISRDV 731

Query: 580  IFHESIFPFKKNST 621
            IFHE++FPFK  ST
Sbjct: 732  IFHETVFPFKNKST 745


>emb|CAA72989.1| unnamed protein product [Brassica oleracea var. viridis]
          Length = 1131

 Score =  223 bits (568), Expect = 7e-62
 Identities = 109/221 (49%), Positives = 141/221 (63%), Gaps = 7/221 (3%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            + KSD  H+   F     TQ++T IK +R DNA E   T+  ++ G      CP   +QN
Sbjct: 613  QSKSDVLHIFPTFVNQIETQYNTKIKSVRRDNAPELSFTELFKEKGIVSYHSCPETLEQN 672

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +V+ERKHQHLLNVAR+LMFQ+ V L YWG+CV  A +LIN+TPSPLL N+SP+E+L  K+
Sbjct: 673  SVLERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLMGKA 732

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P Y  ++ FGCL Y ST+   R+KF P +  C+F+GYP GYKG+KL D+   K  ISR+V
Sbjct: 733  PQYDQLRTFGCLCYGSTSPKQRHKFMPRSRACVFLGYPSGYKGYKLLDLESNKIYISRNV 792

Query: 580  IFHESIFPFKKNSTID---LHLHDPATTV----LPNISDLP 681
             FHE IFP  K+  +D   LH   P  TV     PNIS  P
Sbjct: 793  TFHEDIFPMAKHQKMDESSLHFFPPKVTVPSAPSPNISSSP 833


>ref|XP_009121252.1| PREDICTED: uncharacterized protein LOC103846085 [Brassica rapa]
          Length = 860

 Score =  221 bits (562), Expect = 8e-62
 Identities = 123/303 (40%), Positives = 170/303 (56%), Gaps = 7/303 (2%)
 Frame = +1

Query: 76   FFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQNAVVERKHQHLLN 255
            F     TQ+ T ++ +RSDNAKE + TD  +  G      CP  PQQN+VVERKHQH+LN
Sbjct: 602  FITMVETQYKTKVRGVRSDNAKELMFTDLYRAKGIKAFHSCPETPQQNSVVERKHQHILN 661

Query: 256  VARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSPDYSLVKVFGCL 435
            VAR+LMFQ+ +SL YW +CV  AV+LIN+ PSPLL ++SP+++L+ K PDYS +KVFGCL
Sbjct: 662  VARALMFQSKLSLEYWSDCVLTAVFLINRLPSPLLQDKSPYQLLHKKKPDYSEIKVFGCL 721

Query: 436  SYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVIFHESIFPFKKN 615
             YVST+  +R+KF P + PC+F+GYP G+KG+K+ D+      +SR+V+FHE IFPF  +
Sbjct: 722  CYVSTSSKNRHKFQPRSRPCLFLGYPAGFKGYKVMDLDTNIISVSRNVVFHEDIFPFTCS 781

Query: 616  STIDLHLHDPATTVLPNI-------SDLPSTFDYSXXXXXXXXXXXXXXXXXXXXXXXXX 774
             + DLH HD    + P +       SD+P++                             
Sbjct: 782  ES-DLH-HDLYPNIDPVVVNKTHIASDVPTS----------------------------- 810

Query: 775  XXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAYLKEYD 954
                         +     V  T EP+     PT AE      + S RT   PAYL++Y 
Sbjct: 811  -------------VNTEIPVVVTDEPVVDSQIPTKAE------KISKRTSKQPAYLEDYY 851

Query: 955  CNL 963
            CN+
Sbjct: 852  CNM 854


>ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804180 isoform X4 [Tarenaya
            hassleriana]
          Length = 940

 Score =  221 bits (564), Expect = 9e-62
 Identities = 121/309 (39%), Positives = 163/309 (52%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K KSD       F  +   QF+ +IKC+RSDNA E        + G  HQF CP+ PQQN
Sbjct: 338  KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 397

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+  +VYLIN+TPSPLL N++PFE+L   S
Sbjct: 398  SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 457

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            P YS ++VFGCL YVST    R+KF P A   +F+GYP G KG+K+ D+     +ISR+V
Sbjct: 458  PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 517

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
            +FHE+ FPFK        L     +V P   +  S  + S                    
Sbjct: 518  VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 557

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939
                               P +      S   D     T++  H   P+R S+T   PAY
Sbjct: 558  -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 607

Query: 940  LKEYDCNLL 966
            L +Y C L+
Sbjct: 608  LSDYHCYLI 616


>dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1475

 Score =  223 bits (567), Expect = 1e-61
 Identities = 106/216 (49%), Positives = 141/216 (65%)
 Frame = +1

Query: 40   KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219
            K K+D   +   F K   TQ+ T +K +RSDNA E       Q  G      CP  PQQN
Sbjct: 681  KAKNDVLQIFPDFLKMVETQYGTLVKAVRSDNAPELRFEALYQAKGIISYHSCPETPQQN 740

Query: 220  AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399
            +VVERKHQH+LNVAR+LMF+A + L +WG+C+ +AV+LIN+ P+PLL N+SPFE+L+ K 
Sbjct: 741  SVVERKHQHILNVARALMFEANMPLEFWGDCILSAVFLINRLPTPLLSNKSPFELLHLKV 800

Query: 400  PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579
            PDY+ +KVFGCL Y ST+   R+KF P A  C+F+GYP GYKG+KL D+      ISR V
Sbjct: 801  PDYTSLKVFGCLCYESTSPQQRHKFAPRARACVFLGYPSGYKGYKLLDLETNTIHISRHV 860

Query: 580  IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPST 687
            +F+E++FPF   + I   + D    V  NI + PST
Sbjct: 861  VFYETVFPFTDKTIIPRDVFDLVDPVHENIENPPST 896


>gb|PRQ38882.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 2324

 Score =  223 bits (567), Expect = 1e-61
 Identities = 127/317 (40%), Positives = 174/317 (54%), Gaps = 8/317 (2%)
 Frame = +1

Query: 46   KSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFL-LTDFLQQNGTTHQFYCPHRPQQNA 222
            KS+  +L+K+FF +  TQF+  ++ IRSDN  EFL +  F Q NG  HQ  C + PQQN 
Sbjct: 836  KSETQNLLKSFFAFTETQFNQKVQHIRSDNGSEFLSMRSFFQANGIIHQHSCVYTPQQNG 895

Query: 223  VVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSP 402
            VVERKH+H++ +AR+L+FQA + L +W ECV   VYLIN+ P+PLL  +SPFE ++ + P
Sbjct: 896  VVERKHRHIITIARALLFQANLPLEFWAECVLTVVYLINRLPAPLLSGKSPFEKIFQRVP 955

Query: 403  DYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVI 582
             YS ++VFGCL+Y +   P + KF P A  CIF+GYP G K +KL+D+  +KF  SRDV+
Sbjct: 956  QYSHIRVFGCLAYATNVHP-KQKFDPRAHKCIFVGYPFGQKAYKLYDLTTKKFFTSRDVV 1014

Query: 583  FHESIFPFKKNS-TIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759
            FHE IFP+K++S  + L  HD    VLPN+  +P   D                      
Sbjct: 1015 FHEDIFPYKQDSPNLSLQPHD---AVLPNV--IPEN-DIPQEPLSASRVSPIEHTLPQVD 1068

Query: 760  XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLD-----LL*SPTAAEPHTVPP-RRSSRT 921
                               P + +   +S PLD        SP      TVP  RRS R 
Sbjct: 1069 NSLSPNVLSDHETHPNDQTPPSPSSHHSSPPLDNSSPSSPSSPPVPNEDTVPALRRSERV 1128

Query: 922  HNPPAYLKEYDCNLLYL 972
              P   LK+Y C+ + L
Sbjct: 1129 RKPNVKLKDYVCSHVVL 1145


Top