BLASTX nr result

ID: Rehmannia29_contig00038933 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00038933
         (882 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX96408.1| hypothetical protein L195_g019614 [Trifolium prat...   197   2e-55
ref|XP_024195790.1| uncharacterized protein LOC112198938 [Rosa c...   200   3e-54
ref|XP_024172304.1| uncharacterized protein LOC112178381 [Rosa c...   200   3e-54
ref|XP_024172006.1| uncharacterized protein LOC112178017 [Rosa c...   199   3e-54
gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense]           199   6e-54
ref|XP_024156142.1| uncharacterized protein LOC112164137 [Rosa c...   198   9e-54
ref|XP_023905045.1| uncharacterized protein LOC112016795 [Quercu...   198   1e-53
gb|PNX95563.1| ribonuclease H, partial [Trifolium pratense]           197   2e-53
dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte...   197   3e-53
emb|CDP14470.1| unnamed protein product [Coffea canephora]            193   4e-53
emb|CDP09717.1| unnamed protein product [Coffea canephora]            190   2e-52
gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlise...   192   1e-51
gb|EEC81662.1| hypothetical protein OsI_25211 [Oryza sativa Indi...   186   2e-51
gb|PNX80752.1| ribonuclease H, partial [Trifolium pratense]           188   3e-51
ref|XP_024172251.1| uncharacterized protein LOC112178325 [Rosa c...   191   4e-51
ref|XP_018805736.1| PREDICTED: uncharacterized protein LOC108979...   191   4e-51
gb|PNY15111.1| ribonuclease H [Trifolium pratense]                    190   6e-51
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]           189   6e-51
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   190   8e-51
gb|EEE54600.1| hypothetical protein OsJ_01823 [Oryza sativa Japo...   186   8e-51

>gb|PNX96408.1| hypothetical protein L195_g019614 [Trifolium pratense]
          Length = 548

 Score =  197 bits (501), Expect = 2e-55
 Identities = 92/288 (31%), Positives = 157/288 (54%)
 Frame = +3

Query: 15  MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
           M  F +P  VC+ L ++   FWWG+T ++ R++HW KW K+   K+ GGLGFRD + FN 
Sbjct: 85  MSCFLIPKGVCEQLEKMICNFWWGSTTDQ-RKMHWLKWSKVCNQKRNGGLGFRDLRAFNE 143

Query: 195 SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
           +L+AKQ WR++++P  L+++++K +YFP     + K     S+ WR  M+A  V++ G  
Sbjct: 144 ALLAKQGWRLITKPTSLVAQVLKAKYFPNESFLNAKHKQVMSYTWRSIMQASWVIKRGSY 203

Query: 375 FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
           +S+G G++I IWE  W+ +  +    R       +++V EL++++   WN +++ ++F P
Sbjct: 204 WSIGDGEDINIWEDNWMQQKSATYKGRPKPNNLNLIKVKELMDSNYNEWNTDIINQVFLP 263

Query: 555 NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
            +A++IL IP++     D L W  T  G ++V S Y A+ +   L    P  S       
Sbjct: 264 YEAQMILNIPIIDKTQPDMLTWDCTQDGQYSVKSGYHAIMEWGNL----PNASPSNNSQH 319

Query: 735 XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                          H +WR +++ LP   NL  +G++ D +C  C N
Sbjct: 320 IWNVLWKLKVPPKHSHLLWRVLHNALPVKNNLFKRGVRCDPLCPRCSN 367


>ref|XP_024195790.1| uncharacterized protein LOC112198938 [Rosa chinensis]
          Length = 1175

 Score =  200 bits (508), Expect = 3e-54
 Identities = 103/288 (35%), Positives = 149/288 (51%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F+LP  +C  + +L ++FWWG      R+IHW  WDKL   K+ GGLGFRD   FN 
Sbjct: 646  MSCFELPKHLCDEMHRLMARFWWGEF-GAERKIHWVAWDKLCAPKKEGGLGFRDMHLFNT 704

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++  P  L++++ K RYFP  D      H   S+ WR  MK + +L+ G+R
Sbjct: 705  ALLAKQGWRLICRPDSLLAQVFKARYFPNTDFMHAVLHKGASFSWRSIMKGRDLLKKGLR 764

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            F VG G++I IW  PW+P    F P  + M+    +RV +L++ +   W E L+ E+FTP
Sbjct: 765  FQVGNGEDISIWNDPWVPLPYRFKPFSIPMQGAEDLRVVDLIDEETGDWQEWLLHELFTP 824

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
             +   I+KIPL   G  DRL  H   KG ++V + Y   +    L +   G+S G     
Sbjct: 825  MEVVNIMKIPLSLSGGIDRLVCHFDKKGRYSVKNGYHVARVMDTLERTTSGSSFGADRAR 884

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                         ++   WR V  +LP+ A L  +    D  C  C N
Sbjct: 885  LWGKLWKVNVPPKVRMHAWRLVKGMLPTRATLAKRVQLSDVRCVYCSN 932


>ref|XP_024172304.1| uncharacterized protein LOC112178381 [Rosa chinensis]
          Length = 1602

 Score =  200 bits (508), Expect = 3e-54
 Identities = 102/286 (35%), Positives = 155/286 (54%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F+LP  +CQ + +  ++FWWG ++ K R+IHW  WDK+   K+ GGLGFR+ + FN 
Sbjct: 822  MSCFELPKHLCQEMHRCMAEFWWGDSE-KGRKIHWLAWDKMCVPKEKGGLGFRNMEYFNQ 880

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++  P  L+ K +K +YFP  D      +  DS+ WR  MK K +LE G+R
Sbjct: 881  ALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHASVNQGDSYTWRSLMKGKVLLEKGLR 940

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            F VG G  I +W  PWIP+  SF P    ME    + V +L++ D K W  + + E+F  
Sbjct: 941  FQVGSGTRISVWFDPWIPRPYSFRPYSTVMEGLEDLTVADLIDPDSKDWMVDWLEELFFA 1000

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
            ++ ++I KIPL     +DRL WH   +G ++V S Y  V +    L     TS+      
Sbjct: 1001 DEVDLIRKIPLSLRNPEDRLIWHFDKRGLYSVKSGYH-VARCVASLSSHVSTSNSQGDKD 1059

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSC 872
                         +++F+WR V +++P+  NL  +    + IC  C
Sbjct: 1060 LWRRVWHARVQPKVRNFVWRLVKNIVPTKVNLGRRVNLDERICPFC 1105


>ref|XP_024172006.1| uncharacterized protein LOC112178017 [Rosa chinensis]
          Length = 1045

 Score =  199 bits (507), Expect = 3e-54
 Identities = 103/288 (35%), Positives = 149/288 (51%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F+LP  +C  + +L ++FWWG    + R+IHW  WDKL   K+ GGLGFRD   FN 
Sbjct: 421  MSCFELPKHLCDEMHRLMARFWWGEF-GEERKIHWVAWDKLCAPKKEGGLGFRDMHLFNT 479

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++     L++++ K RYFP  D      H   S+ WR  MK + +L+ G+R
Sbjct: 480  ALLAKQGWRLICRLDSLLAQVFKARYFPNTDFMHAVLHKGASFSWRSIMKGRDLLKKGLR 539

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            F VG G++I IW  PW+P    F P  + M+    +RV +L++ +   W E L+ E+FTP
Sbjct: 540  FQVGNGEDISIWNDPWVPLPYRFKPFSIPMQGAEDLRVVDLIDEETGDWQEWLLHELFTP 599

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
             +   I+KIPL   G  DRL WH   KG ++V + Y   +    L +   G+S G     
Sbjct: 600  MEVVNIMKIPLSLSGGIDRLVWHFDKKGRYSVKNGYHVARVMDTLERTASGSSFGADRAR 659

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                         ++   WR V   LP+ A L  +    D  C  C N
Sbjct: 660  LWGKLWKVNVPPKVRMHAWRLVKGTLPTRAALAKRVQLSDVRCVYCSN 707


>gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1068

 Score =  199 bits (505), Expect = 6e-54
 Identities = 103/288 (35%), Positives = 156/288 (54%), Gaps = 2/288 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  +++P   C  +  + SKFWWG++D K++ IHW  WD+L  +K  GGLGFR F DFN 
Sbjct: 772  MSCYKIPEGSCANIESMLSKFWWGSSDQKNK-IHWMSWDRLGRAKNKGGLGFRGFSDFNK 830

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ KQ WR++S    L+S++ K +YFP       K     S+ WR    AK V++LG+R
Sbjct: 831  ALLGKQCWRLMSNEDSLLSRVFKSKYFPRSCFLKSKCGYQPSYAWRSLFNAKSVIDLGLR 890

Query: 375  FSVGGGKEIKIWETPWIPKFPSF--LPDRVHMEETGVVRVCELLENDGKAWNEELVMEMF 548
            +++G G+++KIW+ PW+P+  SF       +++E  V  V EL++ D K W  ELVM  F
Sbjct: 891  WTIGNGQQVKIWKDPWLPELSSFKVWSPVCNLDEDAV--VAELIDVDLKKWKRELVMNSF 948

Query: 549  TPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXX 728
               +A  IL IPL      D+  W     GN++V SAY  +++       +P T+     
Sbjct: 949  NEFEANQILNIPLSWRLPDDKKVWSWERNGNYSVRSAYHLLKEETLRDIPEPSTAGN--- 1005

Query: 729  XXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSC 872
                           +K+F+WR V  +LP+   L  KG+ +D IC  C
Sbjct: 1006 TGIWKSIWKVQAPQRVKNFLWRVVKRILPTRCRLEQKGVALDPICPLC 1053


>ref|XP_024156142.1| uncharacterized protein LOC112164137 [Rosa chinensis]
          Length = 1293

 Score =  198 bits (504), Expect = 9e-54
 Identities = 102/286 (35%), Positives = 155/286 (54%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F+LP  +CQ + +  ++FWWG ++ K R+IHW  WDK+   K+ GGLGFR+ + FN 
Sbjct: 614  MSCFELPKHLCQEMHRCMAEFWWGDSE-KGRKIHWLAWDKMCVPKEEGGLGFRNMEYFNQ 672

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++  P  L+ K +K +YFP  D      +  DS+ WR  MK K +LE G+R
Sbjct: 673  ALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHASVNQGDSYTWRSLMKGKVLLEKGLR 732

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            F VG G  I +W  PWIP+  SF P    ME    + V +L++ D K W  + + E+F  
Sbjct: 733  FQVGLGTRISVWFDPWIPRPYSFRPYSTVMEGLEDLTVADLIDPDSKDWMVDWLEELFFA 792

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
            ++ ++I KIPL     +DRL WH   +G ++V S Y  V +    L     TS+      
Sbjct: 793  DEVDLIRKIPLSLRNPEDRLIWHFDKRGLYSVKSGYH-VARCVASLSSHVSTSNSQGDKD 851

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSC 872
                         +++F+WR V +++P+  NL  +    + IC  C
Sbjct: 852  LWRRVWHARVQPKVRNFVWRLVKNIVPTKVNLGRRVNLDERICPFC 897


>ref|XP_023905045.1| uncharacterized protein LOC112016795 [Quercus suber]
          Length = 1373

 Score =  198 bits (504), Expect = 1e-53
 Identities = 105/287 (36%), Positives = 153/287 (53%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M VF+LP S+C  ++     FWWG  + +++ + W  W+K+   K+ GGLGFRD + FN+
Sbjct: 812  MSVFKLPNSLCDEMTSTVRNFWWGQKEGRNK-MAWLSWEKMCAPKKDGGLGFRDLKAFNM 870

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR+ S  + L+ +++K RYFP  D    +     S+ WR  M A+ V++ G R
Sbjct: 871  ALLAKQGWRLQSNTRSLVHRVLKARYFPDRDFLHAELGRTPSYAWRSIMAAQDVVKAGHR 930

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            + VG G  I+IW   W+PK  +F              V EL++     WN +LV  +F P
Sbjct: 931  WQVGDGTSIQIWRDKWLPKPSTFRVISTPNTLNEAATVSELIDEVTGEWNVDLVKHVFLP 990

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
            +DA  IL IP  S  ++DR+ W +T KG FTV+SAY  V       K    TSD      
Sbjct: 991  DDAHTILGIPRSSKRNRDRMIWAYTPKGTFTVNSAYK-VALSLSQSKAKEETSDASSHSQ 1049

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCG 875
                         +K F WR   ++LP+ ANLCS+G+  D  C +CG
Sbjct: 1050 FWQKIWSLRIPNKLKTFAWRASRNILPTKANLCSRGVIDDPTCDACG 1096


>gb|PNX95563.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1188

 Score =  197 bits (501), Expect = 2e-53
 Identities = 92/288 (31%), Positives = 157/288 (54%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F +P  VC+ L ++   FWWG+T ++ R++HW KW K+   K+ GGLGFRD + FN 
Sbjct: 627  MSCFLIPKGVCEQLEKMICNFWWGSTTDQ-RKMHWLKWSKVCNQKRNGGLGFRDLRAFNE 685

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++++P  L+++++K +YFP     + K     S+ WR  M+A  V++ G  
Sbjct: 686  ALLAKQGWRLITKPTSLVAQVLKAKYFPNESFLNAKHKQVMSYTWRSIMQASWVIKRGSY 745

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            +S+G G++I IWE  W+ +  +    R       +++V EL++++   WN +++ ++F P
Sbjct: 746  WSIGDGEDINIWEDNWMQQKSATYKGRPKPNNLNLIKVKELMDSNYNEWNTDIINQVFLP 805

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
             +A++IL IP++     D L W  T  G ++V S Y A+ +   L    P  S       
Sbjct: 806  YEAQMILNIPIIDKTQPDMLTWDCTQDGQYSVKSGYHAIMEWGNL----PNASPSNNSQH 861

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                           H +WR +++ LP   NL  +G++ D +C  C N
Sbjct: 862  IWNVLWKLKVPPKHSHLLWRVLHNALPVKNNLFKRGVRCDPLCPRCSN 909


>dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum]
          Length = 1626

 Score =  197 bits (500), Expect = 3e-53
 Identities = 99/292 (33%), Positives = 156/292 (53%), Gaps = 3/292 (1%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  ++LP   C  +  + +KFWWG T+ K R+IHW  W+KL ++K  GGLGFR F+DFN 
Sbjct: 1074 MSCYKLPTGCCDNIEAMLAKFWWGTTE-KQRKIHWVSWNKLGKAKSKGGLGFRSFEDFNK 1132

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ KQ WR++  P  L++K+ K RYFP  +          S+ WR    A++V+++G R
Sbjct: 1133 ALLGKQCWRLLQNPDSLLAKVFKSRYFPRSNFMDASVGYQPSYAWRSLCNAREVIDMGAR 1192

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVV---RVCELLENDGKAWNEELVMEM 545
            + +G G+++ IW   W+P+   F   +V    + +V    V +L+    KAW++ L++  
Sbjct: 1193 WLIGNGQDVHIWNDKWLPEQDKF---KVWSPVSNLVPNAMVSDLINPVTKAWDKNLILNC 1249

Query: 546  FTPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXX 725
            F+P +AE IL IP+      D+L WH    G F+V SAY  + + R   K  P  S    
Sbjct: 1250 FSPFEAEQILNIPISWRLPADKLIWHWEKNGEFSVRSAYHMLSEDRN--KNSPEASSS-R 1306

Query: 726  XXXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGNE 881
                            +K+F+WR   ++LP+   L  KG+ +D  C  C N+
Sbjct: 1307 DQLLWKTIWKVNVPNCIKNFLWRVAKAILPTRGRLEKKGITLDTTCPLCFND 1358


>emb|CDP14470.1| unnamed protein product [Coffea canephora]
          Length = 660

 Score =  193 bits (490), Expect = 4e-53
 Identities = 96/296 (32%), Positives = 155/296 (52%), Gaps = 7/296 (2%)
 Frame = +3

Query: 15  MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
           M VF+LP  +C+ +S L + +WWG  + K++ +HW  W K++ S+  GGLGF+D + +N 
Sbjct: 101 MSVFKLPRKLCKDISALMANYWWGEANGKNK-LHWLSWRKMSLSRNAGGLGFKDIEAYNK 159

Query: 195 SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
           +L+ KQ+WR++++P LL+SK+++ RYFP   + + +P    SW+W+G + A++V+E G+ 
Sbjct: 160 ALLGKQVWRILTKPNLLISKVLRARYFPKDSILTCRPKQNASWIWQGLLGARRVVEKGVI 219

Query: 375 FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
             +G G+   IW   WIP   S  P  +  +   +  V EL+ +    W    + + F  
Sbjct: 220 RRIGNGRSTSIWGHRWIPGSSSGRPTSLGPQSYNLKMVNELISH--HRWKRNTIFQHFNQ 277

Query: 555 NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLK-------GDPGTS 713
           +DAE IL IPL   G +D   W H   G +TVSS Y  + + R  +K       G   T 
Sbjct: 278 SDAEKILNIPLSLMGREDNYYWQHNPGGIYTVSSGYKCIMKERTNVKQIAPEEAGPSITG 337

Query: 714 DGXXXXXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGNE 881
           +                   +K FIW+C+   LP  A +  K    D +C  CG +
Sbjct: 338 EDQQSRQMWTTLWKLNIKHKVKIFIWKCITGALPVRAAIFRKTRMGDPVCRLCGED 393


>emb|CDP09717.1| unnamed protein product [Coffea canephora]
          Length = 613

 Score =  190 bits (483), Expect = 2e-52
 Identities = 99/293 (33%), Positives = 152/293 (51%), Gaps = 4/293 (1%)
 Frame = +3

Query: 15  MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
           M  F+LP  +C+ ++ + + +WWG ++ +++ +HW  W +LA  K+ GGLGFR+ Q+FN 
Sbjct: 59  MSCFKLPNKLCKEVTSIFANYWWGESEGRNK-MHWCSWGRLARDKKEGGLGFRELQNFNK 117

Query: 195 SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
           +L+AKQ+WRV+S+P LL+SK+++ +YF    +   K     SW+W+  M  +  +  G R
Sbjct: 118 ALLAKQVWRVISKPNLLVSKVLRAKYFHKESIFKCKIPKCASWIWQSLMNVRDFVRKGTR 177

Query: 375 FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
             +G GK   IWE  WIP         V  +   + RV EL+   G  W   LV  +F  
Sbjct: 178 RKIGNGKATNIWEDNWIPGNKDGKVTTVMPQSCNIRRVEELI--SGFRWRIPLVSRIFNR 235

Query: 555 NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAV----QQHRRLLKGDPGTSDGX 722
            DA+ IL IP+   G +D   W H+  G +TV+S Y A+     QH+     + GTS   
Sbjct: 236 KDAKEILDIPISIAGREDSNYWLHSGSGTYTVNSGYKALCQETSQHKGRRDNEAGTSSAN 295

Query: 723 XXXXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGNE 881
                            +KHFIWR +  LLP    +  +  + D IC  CG +
Sbjct: 296 SNEKQWKWLWKLKVKSKIKHFIWRSLNGLLPVNDLVFKRIHQGDPICDGCGEQ 348


>gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlisea aurea]
          Length = 1503

 Score =  192 bits (489), Expect = 1e-51
 Identities = 98/288 (34%), Positives = 152/288 (52%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F LP S    L    S++WW   + K   IHW  WD ++ S + GGLGFRD  DFNL
Sbjct: 1144 MSCFALPKSFLGDLQSAISRYWWRNRNGKG--IHWKSWDFISRSFKEGGLGFRDLHDFNL 1201

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ KQ+WR+ S P  ++S++ + +YFP GD+ + +P +  S++W G MK++ ++  G+R
Sbjct: 1202 ALLGKQVWRIASAPHSILSRVFRAKYFPNGDIWTARPCARGSYVWNGIMKSRDLVSKGIR 1261

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
              +G G  + IW  PWIPK P+F P  + + E     V  L+++  K W+   + E F P
Sbjct: 1262 HLIGDGSSVDIWHDPWIPKPPTFKPTNL-LGERRRASVATLIDSRTKWWDVGRIREKFDP 1320

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
             DA  I+ IPL     +D++ WH++  G +TV SAY  V+  R  ++    +SD      
Sbjct: 1321 VDANHIISIPLSESPSEDKILWHYSKSGTYTVRSAYHLVRSLR--VEVSSSSSDSRVTPK 1378

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                         +  F+WR  +  LP+   L  + + ID  C  C N
Sbjct: 1379 VWDLIWKHACCPKIGLFMWRLAHGCLPTNETLWRRRIPIDKECSICLN 1426


>gb|EEC81662.1| hypothetical protein OsI_25211 [Oryza sativa Indica Group]
          Length = 561

 Score =  186 bits (473), Expect = 2e-51
 Identities = 105/293 (35%), Positives = 154/293 (52%), Gaps = 6/293 (2%)
 Frame = +3

Query: 15  MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
           M  F+L   +C  +S++ +K+WW +   K  ++HW  W+KL   K  GGLGFRD   FNL
Sbjct: 1   MGCFELTKDLCDQISKMIAKYWW-SNQEKDNKMHWLSWNKLTLPKNMGGLGFRDIYIFNL 59

Query: 195 SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
           +++AKQ WR++ +P  L S++++ +YFPLGD    K  S  S+ WR   K  +VL+ GM 
Sbjct: 60  AMLAKQGWRLIQDPDSLCSRVLRAKYFPLGDCFRPKQTSNVSYTWRSIQKGLRVLQNGMI 119

Query: 375 FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETG---VVRVCELLENDGKAWNEELVMEM 545
           + +G G +I IW  PWIP+  S  P    M   G   V +V EL++     W+E+L+ + 
Sbjct: 120 WRMGDGSKINIWADPWIPRGWSRKP----MTPRGANLVTKVEELIDPYTGTWDEDLLSQT 175

Query: 546 FTPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQ--QHRRLLKGDPGTSD- 716
           F   D   I  IP +    +D L WH  A+G FTV SAY   +  + R    G PG S+ 
Sbjct: 176 FWEEDVAAIKSIP-VHVEMEDVLAWHFDARGCFTVKSAYKVQREMERRASRNGCPGVSNW 234

Query: 717 GXXXXXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCG 875
                              +KHF+WR  ++ L   ANL  +G+ +D  C  CG
Sbjct: 235 ESGDDDFWKKLWKLGVPGKIKHFLWRMCHNTLALRANLHHRGMDVDTRCVMCG 287


>gb|PNX80752.1| ribonuclease H, partial [Trifolium pratense]
          Length = 696

 Score =  188 bits (478), Expect = 3e-51
 Identities = 97/290 (33%), Positives = 162/290 (55%), Gaps = 2/290 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  ++LP S CQ +  + +KFWWG+ + K R+IHW  W++L++SK+ GG+GFR   +FN 
Sbjct: 366  MSCYKLPESCCQEIESMLAKFWWGSKEGK-RKIHWMSWERLSKSKKGGGMGFRGISNFNS 424

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ K  WR+ +  + LM ++ K RY+P       K     S+ WR    AK+V+ LG R
Sbjct: 425  ALLGKHCWRLTTGEESLMGRVFKSRYYPRASFMEAKIGYQPSYAWRSIQSAKEVIILGSR 484

Query: 375  FSVGGGKEIKIWETPWIPKFPSF-LPDRV-HMEETGVVRVCELLENDGKAWNEELVMEMF 548
            + +G G+++KI +  W+P    F +  R   +EE  +V    L++ D K W  +LV  +F
Sbjct: 485  WRIGNGEKVKICKDKWLPNQVGFKVWSRCDELEEDALVST--LIDPDTKQWKRDLVSHIF 542

Query: 549  TPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXX 728
             P++A+ IL +P+      D++ WH    G ++V SA+  ++QH    +   GTS G   
Sbjct: 543  FPHEAKQILSLPISPRLPSDKIIWHFERNGEYSVRSAHHLLKQHSS--RNAAGTS-GQQT 599

Query: 729  XXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGN 878
                           +++F+WR   ++LP+ ANL  KG++++ +C  C N
Sbjct: 600  DILWREIWXAPVPNRVRNFLWRLGKNILPTRANLSRKGVQVENLCPQCNN 649


>ref|XP_024172251.1| uncharacterized protein LOC112178325 [Rosa chinensis]
          Length = 1211

 Score =  191 bits (484), Expect = 4e-51
 Identities = 107/291 (36%), Positives = 156/291 (53%), Gaps = 2/291 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F LP + C  L Q+ +KFWWG+  N+ R+IHW  W++L  SK+ GG+GFRD    NL
Sbjct: 746  MSCFLLPNNFCDDLHQMCAKFWWGSKPNE-RKIHWMSWERLCRSKEEGGMGFRDLHAHNL 804

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WR++  P  L+S++ K RYFP     +    +  S  WRG   AK VL+ G+R
Sbjct: 805  ALLAKQGWRLIRYPGSLVSRLFKARYFPHSSFLNATTPTHASACWRGIFAAKSVLQAGLR 864

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            + VG G  I+IW+ PWIP+   F P  +    + +V V +L+  DG  WN+EL+ E F  
Sbjct: 865  WQVGNGTSIRIWDDPWIPRPNLFRP--IRYGPSPLVLVSDLMV-DGH-WNKELISENFHA 920

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAY--SAVQQHRRLLKGDPGTSDGXXX 728
            ++A +I  IPL      DRL WH    G FT  SAY  +    H  L+ G    S+    
Sbjct: 921  DEALLICSIPLSRSIVPDRLIWHFDMNGLFTTKSAYKIAFASLHLVLISGSSSNSN---- 976

Query: 729  XXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGNE 881
                           +K  +W+   S+LP+A+ L S+ + ++  C  C +E
Sbjct: 977  PSHWKFIWAAPIPGKVKVHVWKVCASILPTASQLRSRRVPVEDGCLFCNSE 1027


>ref|XP_018805736.1| PREDICTED: uncharacterized protein LOC108979499 [Juglans regia]
          Length = 1227

 Score =  191 bits (484), Expect = 4e-51
 Identities = 103/288 (35%), Positives = 158/288 (54%), Gaps = 2/288 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M VF LP  + + L+ +   FWWG  +   R+IHW  W K+  SK  GGLGFRD + FNL
Sbjct: 676  MSVFMLPAKLLRNLNSIMHNFWWGQHE-AIRKIHWVAWSKMGRSKADGGLGFRDLEGFNL 734

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+AKQ WRV+  P  L+S+++K +YF      + K     S++WR  +KA+ +++ G  
Sbjct: 735  ALLAKQGWRVIHTPFSLVSQVLKAKYFQETSFMNAKLGQKPSYIWRSMLKARNLIDAGSY 794

Query: 375  FSVGGGKEIKIWETPWIPK-FPSFLPDRV-HMEETGVVRVCELLENDGKAWNEELVMEMF 548
            + +G G +++IW   W+PK FPS +   + H++  G  +V ELL+   K WN ELV  +F
Sbjct: 795  WRIGSGDKVRIWGDKWLPKTFPSAVKSPISHLD--GNAKVQELLKPGEKEWNMELVEHLF 852

Query: 549  TPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXX 728
               +A+II++IPL +    D+L W  T+ G FTV SAY   Q+  +  KG P +S     
Sbjct: 853  CKEEADIIVQIPLSTSNRPDQLIWKGTSTGFFTVKSAYHLHQELIQDTKGQPSSSHNSKE 912

Query: 729  XXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSC 872
                            K F+WR  ++ LP+ +NL  + +  +  C  C
Sbjct: 913  VWKTLWQLQVTPGE--KVFLWRACHNALPTQSNLFKRKIVANPKCPIC 958


>gb|PNY15111.1| ribonuclease H [Trifolium pratense]
          Length = 1334

 Score =  190 bits (483), Expect = 6e-51
 Identities = 97/287 (33%), Positives = 158/287 (55%), Gaps = 1/287 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  ++LP S CQ +  + +KFWWG+   + R+IHW  W++L+++K+ GG+GFR   +FN 
Sbjct: 780  MGCYKLPNSCCQEIETMLAKFWWGSKGGE-RKIHWMSWERLSKTKKDGGMGFRGINNFNK 838

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ K  WR+++  + LM +I K RYFP       K     S+ WR    A  V++LG R
Sbjct: 839  ALLGKHCWRLMTGEESLMGRIFKSRYFPRTSFLEAKIGYQPSYAWRSIQSATDVMKLGTR 898

Query: 375  FSVGGGKEIKIWETPWIPKFPSF-LPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFT 551
            + +G G+ +KI E  W+P    F +  R    E G + V  L++ D K WN +LV++ F 
Sbjct: 899  WRIGNGESVKIREDRWLPNQVGFKVWSRGEELENGAL-VSALIDPDTKQWNRQLVVQTFY 957

Query: 552  PNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXX 731
            P++A+ IL IP+      D++ WH+   G ++V SA+  ++QH      D   S G    
Sbjct: 958  PDEAKQILSIPISQRLPADKIIWHYERDGEYSVRSAHHLLKQHN---SRDVAASSGQQMN 1014

Query: 732  XXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSC 872
                          +++F+WR   ++LP+ ANL  KG++I+ +C  C
Sbjct: 1015 NLWREIWKAPVPNRVRNFLWRLGKNILPTRANLVRKGVQIENLCPQC 1061


>gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]
          Length = 894

 Score =  189 bits (481), Expect = 6e-51
 Identities = 95/289 (32%), Positives = 147/289 (50%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  ++LP   C  +  + +KFWWG T+ K R+IHW  W+KL ++K  GGLGFR F+DFN 
Sbjct: 401  MSCYKLPTGCCDNIEAMLAKFWWGTTE-KQRKIHWVSWNKLGKAKSKGGLGFRSFEDFNK 459

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
            +L+ KQ WR++  P  L++K+ K RYFP             S+ WR    +++V+++G R
Sbjct: 460  ALLGKQCWRLLQNPDSLLAKVFKSRYFPRSKFMDANVGYQPSYAWRSLCNSREVIDVGAR 519

Query: 375  FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMFTP 554
            + +G GK++ IW   W+P    F              V +L+  + K W++ LV   F  
Sbjct: 520  WLIGNGKDVHIWNDKWLPAQDKFKVWSPVSNLAPNAMVSDLINLETKMWDKNLVQNCFNS 579

Query: 555  NDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXXXX 734
             +AE IL IPL      D+L WH    G F+V SAY  + + R     +  +S       
Sbjct: 580  FEAEQILNIPLSWRLPADKLIWHWEKNGEFSVRSAYHMLSEIRNQNSPEASSS---RDHL 636

Query: 735  XXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCGNE 881
                         +K+F+WR   ++LP+ + L  KG+ +D  C  C N+
Sbjct: 637  LWKAIWKVKVPNCIKNFLWRLAKAILPTRSRLEKKGITLDTTCPLCFND 685


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score =  190 bits (482), Expect = 8e-51
 Identities = 103/289 (35%), Positives = 156/289 (53%), Gaps = 2/289 (0%)
 Frame = +3

Query: 15   MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
            M  F+LP ++   L+   +KFWW +++  SR +HW  WDKL  SK  GGLGFR+  DFN 
Sbjct: 589  MSCFRLPKAITSKLTSAVAKFWW-SSNGDSRGMHWMAWDKLCSSKSDGGLGFRNVDDFNS 647

Query: 195  SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGD-LASIKPHSFDSWLWRGWMKAKKVLELGM 371
            +L+AKQLWR+++ P  L +K+ KGRYF   + L SIK +S  S+ WR  + A+ ++  G+
Sbjct: 648  ALLAKQLWRLITAPDSLFAKVFKGRYFRKSNPLDSIKSYS-PSYGWRSMISARSLVYKGL 706

Query: 372  RFSVGGGKEIKIWETPWIP-KFPSFLPDRVHMEETGVVRVCELLENDGKAWNEELVMEMF 548
               VG G  I +W  PWIP +FP        + +   ++V  L+++    WN +L+ E+F
Sbjct: 707  IKRVGSGASISVWNDPWIPAQFPRPAKYGGSIVDPS-LKVKSLIDSRSNFWNIDLLKELF 765

Query: 549  TPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQQHRRLLKGDPGTSDGXXX 728
             P D  +I  +P+ +P  +D L WH T  GN+TV S Y       RL   +  T  G   
Sbjct: 766  DPEDVPLISALPIGNPNMEDTLGWHFTKAGNYTVKSGYHTA----RLDLNEGTTLIGPDL 821

Query: 729  XXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCG 875
                           ++HF+W+ +   +P + NL  +G+  D  C SCG
Sbjct: 822  TTLKAYIWKVQCPPKLRHFLWQILSGCVPVSENLRKRGILCDKGCVSCG 870


>gb|EEE54600.1| hypothetical protein OsJ_01823 [Oryza sativa Japonica Group]
          Length = 639

 Score =  186 bits (473), Expect = 8e-51
 Identities = 105/293 (35%), Positives = 154/293 (52%), Gaps = 6/293 (2%)
 Frame = +3

Query: 15  MDVFQLPVSVCQALSQLASKFWWGATDNKSRRIHWAKWDKLAESKQTGGLGFRDFQDFNL 194
           M  F+L   +C  +S++ +K+WW +   K  ++HW  W+KL   K  GGLGFRD   FNL
Sbjct: 1   MGCFELTKDLCDQISKMIAKYWW-SNQEKDNKMHWLSWNKLTLPKNMGGLGFRDIYIFNL 59

Query: 195 SLIAKQLWRVVSEPQLLMSKIIKGRYFPLGDLASIKPHSFDSWLWRGWMKAKKVLELGMR 374
           +++AKQ WR++ +P  L S++++ +YFPLGD    K  S  S+ WR   K  +VL+ GM 
Sbjct: 60  AMLAKQGWRLIQDPDSLCSRVLRAKYFPLGDCFRPKQTSNVSYTWRSIQKGLRVLQNGMI 119

Query: 375 FSVGGGKEIKIWETPWIPKFPSFLPDRVHMEETG---VVRVCELLENDGKAWNEELVMEM 545
           + +G G +I IW  PWIP+  S  P    M   G   V +V EL++     W+E+L+ + 
Sbjct: 120 WRMGDGSKINIWADPWIPRGWSRKP----MTPRGANLVTKVEELIDPYTGTWDEDLLSQT 175

Query: 546 FTPNDAEIILKIPLLSPGDKDRLRWHHTAKGNFTVSSAYSAVQ--QHRRLLKGDPGTSD- 716
           F   D   I  IP +    +D L WH  A+G FTV SAY   +  + R    G PG S+ 
Sbjct: 176 FWEEDVAAIKSIP-VHVEMEDVLAWHFDARGCFTVKSAYKVQREMERRASRNGCPGVSNW 234

Query: 717 GXXXXXXXXXXXXXXXXXXMKHFIWRCVYSLLPSAANLCSKGLKIDAICFSCG 875
                              +KHF+WR  ++ L   ANL  +G+ +D  C  CG
Sbjct: 235 ESGDDDFWKKLWKLGVPGKIKHFLWRMCHNTLALRANLQHRGMDVDTRCVMCG 287


Top