BLASTX nr result

ID: Rehmannia31_contig00019344 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00019344
         (1331 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX96408.1| hypothetical protein L195_g019614 [Trifolium prat...   328   e-103
gb|PNX95563.1| ribonuclease H, partial [Trifolium pratense]           329   1e-98
gb|PNY03092.1| ribonuclease H [Trifolium pratense]                    321   6e-97
gb|PNX71213.1| ribonuclease H [Trifolium pratense]                    317   1e-95
gb|PNX96384.1| ribonuclease H, partial [Trifolium pratense]           322   3e-95
ref|XP_023906330.1| uncharacterized protein LOC112018052 [Quercu...   321   6e-95
ref|XP_024037590.1| uncharacterized protein LOC112097210 [Citrus...   318   9e-95
ref|XP_023884545.1| uncharacterized protein LOC111996780 [Quercu...   316   1e-94
ref|XP_023899813.1| uncharacterized protein LOC112011695 [Quercu...   315   3e-93
gb|PRQ55763.1| putative RNA-directed DNA polymerase [Rosa chinen...   310   3e-91
ref|XP_023913142.1| uncharacterized protein LOC112024740 [Quercu...   309   3e-91
gb|PNX92808.1| ribonuclease H [Trifolium pratense]                    304   7e-91
dbj|GAU51007.1| hypothetical protein TSUD_411560 [Trifolium subt...   310   1e-90
gb|PNY07715.1| ribonuclease H [Trifolium pratense]                    310   1e-90
ref|XP_023914298.1| uncharacterized protein LOC112025844 [Quercu...   309   2e-90
ref|XP_023878301.1| uncharacterized protein LOC111990748 [Quercu...   308   2e-90
ref|XP_024156142.1| uncharacterized protein LOC112164137 [Rosa c...   308   3e-90
ref|XP_024172006.1| uncharacterized protein LOC112178017 [Rosa c...   304   4e-90
ref|XP_024172304.1| uncharacterized protein LOC112178381 [Rosa c...   309   5e-90
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]           301   8e-90

>gb|PNX96408.1| hypothetical protein L195_g019614 [Trifolium pratense]
          Length = 548

 Score =  328 bits (840), Expect = e-103
 Identities = 171/455 (37%), Positives = 248/455 (54%), Gaps = 15/455 (3%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT++MSCFL+P+ VC ++  +I NFWWG   D+R++HW  W  +CN K NG
Sbjct: 72   LIKAVAQAIPTYLMSCFLIPKGVCEQLEKMICNFWWGSTTDQRKMHWLKWSKVCNQKRNG 131

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            GLGFR+L AFN+A+LAKQGWR++   TSL+A+  KA+YFPN  FL+A+     SYTWRSI
Sbjct: 132  GLGFRDLRAFNEALLAKQGWRLITKPTSLVAQVLKAKYFPNESFLNAKHKQVMSYTWRSI 191

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
            +    ++++G  W +GDG  I +W+D W+ +        P     + ++V EL++ + + 
Sbjct: 192  MQASWVIKRGSYWSIGDGEDINIWEDNWMQQKSATYKGRPKPNNLNLIKVKELMDSNYNE 251

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXXX 723
            W+   +  +F   +A  IL IP+ +    D L W  T+ G Y+VKSGY            
Sbjct: 252  WNTDIINQVFLPYEAQMILNIPIIDKTQPDMLTWDCTQDGQYSVKSGYH--AIMEWGNLP 309

Query: 724  XXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVEI 903
                      +W  +W L +PPK    +W+V ++ LPV   L +R    DPLC +C   +
Sbjct: 310  NASPSNNSQHIWNVLWKLKVPPKHSHLLWRVLHNALPVKNNLFKRGVRCDPLCPRCSNSM 369

Query: 904  ETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMWS 1083
            ET  H   DC W+   W AS L L+    +      D I   + N + E  E    +M+ 
Sbjct: 370  ETIHHVFLDCEWAKQTWFASALTLNLG-QNQLTDFYDWINYMINNTNKECIEKITAIMYG 428

Query: 1084 IWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPE--QSAATLQI-WSKPP 1254
            IWYARN L FQGK+L   +  + A    + YQ      H Q P+  ++  +  I WS PP
Sbjct: 429  IWYARNVLVFQGKNLPPQEISSTALNQLQEYQTHGLEQHIQDPQVRRNGCSNDISWSPPP 488

Query: 1255 PGSTKINSDASVV------------RSQGTGIGVS 1323
             G+ KIN DA +             RS G+ +GV+
Sbjct: 489  RGTLKINVDAHLSSDGHWSTGLVLRRSDGSTVGVA 523


>gb|PNX95563.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1188

 Score =  329 bits (844), Expect = 1e-98
 Identities = 172/457 (37%), Positives = 249/457 (54%), Gaps = 15/457 (3%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT++MSCFL+P+ VC ++  +I NFWWG   D+R++HW  W  +CN K NG
Sbjct: 614  LIKAVAQAIPTYLMSCFLIPKGVCEQLEKMICNFWWGSTTDQRKMHWLKWSKVCNQKRNG 673

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            GLGFR+L AFN+A+LAKQGWR++   TSL+A+  KA+YFPN  FL+A+     SYTWRSI
Sbjct: 674  GLGFRDLRAFNEALLAKQGWRLITKPTSLVAQVLKAKYFPNESFLNAKHKQVMSYTWRSI 733

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
            +    ++++G  W +GDG  I +W+D W+ +        P     + ++V EL++ + + 
Sbjct: 734  MQASWVIKRGSYWSIGDGEDINIWEDNWMQQKSATYKGRPKPNNLNLIKVKELMDSNYNE 793

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXXX 723
            W+   +  +F   +A  IL IP+ +    D L W  T+ G Y+VKSGY            
Sbjct: 794  WNTDIINQVFLPYEAQMILNIPIIDKTQPDMLTWDCTQDGQYSVKSGYH--AIMEWGNLP 851

Query: 724  XXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVEI 903
                      +W  +W L +PPK    +W+V ++ LPV   L +R    DPLC +C   +
Sbjct: 852  NASPSNNSQHIWNVLWKLKVPPKHSHLLWRVLHNALPVKNNLFKRGVRCDPLCPRCSNSM 911

Query: 904  ETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMWS 1083
            ET  H   DC W+   W AS L L+    +      D I   + N + E  E    +M+ 
Sbjct: 912  ETIHHVFLDCEWAKQTWFASALTLNLG-QNQLTDFYDWINYMINNTNKECIEKITAIMYG 970

Query: 1084 IWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPE--QSAATLQI-WSKPP 1254
            IWYARN L FQGK+L   +  + A    + YQ      H Q P+  ++  +  I WS PP
Sbjct: 971  IWYARNVLVFQGKNLPPQEISSTALNQLQEYQTHGLEQHIQDPQVRRNGCSNDISWSPPP 1030

Query: 1255 PGSTKINSDASVV------------RSQGTGIGVSIR 1329
             G+ KIN DA +             RS G+ +GV+ R
Sbjct: 1031 RGTLKINVDAHLSSDGHWSTGLVLRRSDGSTVGVATR 1067


>gb|PNY03092.1| ribonuclease H [Trifolium pratense]
          Length = 970

 Score =  321 bits (823), Expect = 6e-97
 Identities = 165/431 (38%), Positives = 238/431 (55%), Gaps = 3/431 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            +LIK+VAQAIP +IMS FL+P+ VC ++  +I NFWWG   D R+IHW +W  +C  K+ 
Sbjct: 395  ILIKAVAQAIPNYIMSSFLIPKEVCAQMEKMICNFWWGSTTDSRKIHWINWQKICRQKKT 454

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFREL AFN+A+LAKQGWRI+   +SL+A+  KA+YFPN  FL A+   + SYTWRS
Sbjct: 455  GGLGFRELRAFNEALLAKQGWRIICQPSSLMAQVLKAKYFPNDQFLQAKPKQHMSYTWRS 514

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            IL  + I++KG  W +G G  + +W D WI +  N           +  +V +++N+  +
Sbjct: 515  ILQARWIIKKGCYWNIGTGEEVDIWKDNWIHQKGNSSTWSSKPNLTTHHKVKDIMNIHNN 574

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXX 720
             W+E  +  +F   +A +IL+IP+      D L W  T  G YTVKSGY+          
Sbjct: 575  SWNETIINQLFLPIEAQKILQIPITGTSQPDALIWAGTIDGHYTVKSGYQ--AITEWSES 632

Query: 721  XXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVE 900
                       +W  +W LNIPPK    +W+  N+ +PV   L ++    DPLC +C   
Sbjct: 633  TQASTSANNDDIWPMLWKLNIPPKHSHLLWRALNEAIPVKGNLFKKGVKCDPLCPRCFNH 692

Query: 901  IETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMW 1080
            +ET  HA  DC W+   W +S L ++      H ++ D +M   K  D  + EL    ++
Sbjct: 693  VETIHHAFLDCVWAKQMWFSSNLTIN-LNHCQHTNLHDWVMHMFKQTDQASRELITSNLY 751

Query: 1081 SIWYARNQLQFQGKDLSHADCFTMADRCFRSYQK---ANEPPHRQSPEQSAATLQIWSKP 1251
             IWYARN L FQGK L   +  ++A    + YQK        ++     +++    WS P
Sbjct: 752  GIWYARNLLVFQGKSLPPHEVSSIALAQLQEYQKHCIKKNFVNQTRATGNSSNNNCWSPP 811

Query: 1252 PPGSTKINSDA 1284
            P G+ KIN DA
Sbjct: 812  PRGTLKINVDA 822


>gb|PNX71213.1| ribonuclease H [Trifolium pratense]
          Length = 951

 Score =  317 bits (813), Expect = 1e-95
 Identities = 162/446 (36%), Positives = 234/446 (52%), Gaps = 4/446 (0%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            L+K+V QAIPT+IMSCFLLP+ +C +I S+ +NFWWG   D++++HW +W  +C  K  G
Sbjct: 372  LLKAVIQAIPTYIMSCFLLPKGLCKQIESMTSNFWWGSNTDKKKLHWINWKKMCKNKTQG 431

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            G GFR    FN+A+LAKQGWRI     SL+A+ YKA+YFP   F+ A+ G   SYTWRSI
Sbjct: 432  GYGFRNTCMFNEALLAKQGWRIATQPDSLVAKVYKAKYFPKCQFMEAKNGNMLSYTWRSI 491

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
            L  + ILQKG  W +G+G ++ +W D W+ + + F+      G      V +LIN  T+ 
Sbjct: 492  LHARWILQKGCFWTIGNGESVNIWKDSWLPKQNGFKVWSTQQGYTRYNLVKDLINPATNQ 551

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXXX 723
            W++  +  IF   +A+QIL++PL    S D+L W  T+ G YTVKSGY            
Sbjct: 552  WNQNLIHQIFLPFEANQILQLPLVEPNSKDELVWSGTKDGLYTVKSGYHAAMEWNHLRHN 611

Query: 724  XXXXXXXXXXL-WKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVE 900
                      L W+ +W L IPPK    +W++ N  LPV + L+ +    +P C +C   
Sbjct: 612  QVSSNAIATDLTWQNLWKLKIPPKHATLIWRILNHSLPVRSSLSSKGIQCNPTCPRCNTS 671

Query: 901  IETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMW 1080
            +ET +H    C W+   W  S + +       H S ++ +   +K ++ E     A L +
Sbjct: 672  LETIDHVFMQCEWAKVVWFGSPMTIHFNTTDRHQSFSEWLSTMLKTKNQECMAPIAALTY 731

Query: 1081 SIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEP---PHRQSPEQSAATLQIWSKP 1251
             IW ARN L FQ K++        A      YQ    P   P   +  Q       W+ P
Sbjct: 732  HIWRARNLLVFQDKNVPVMCVVQQAISSSIEYQTLGHPHRLPMCAAATQPRGNNTNWTPP 791

Query: 1252 PPGSTKINSDASVVRSQGTGIGVSIR 1329
            P  S K+N DA        G+G+ +R
Sbjct: 792  PRNSLKLNVDAHPCGDGRWGLGMVLR 817


>gb|PNX96384.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1376

 Score =  322 bits (826), Expect = 3e-95
 Identities = 166/445 (37%), Positives = 240/445 (53%), Gaps = 3/445 (0%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT++MSCFL+P+ +C+++  +I NFWWG   D R++HW  W  +C  K NG
Sbjct: 799  LIKAVAQAIPTYLMSCFLIPKGICDQLEKMICNFWWGSTTDHRKMHWVKWSNICTQKNNG 858

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            GLGFR+L AFN+A+LAKQ WR++ + TSL+A+  KA+Y+P  D L A+   N SYTWRSI
Sbjct: 859  GLGFRDLRAFNEALLAKQSWRLITNPTSLVAQVLKAKYYPKEDLLHAKYSKNMSYTWRSI 918

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
            +    I+++G  W +G G    +W+D WI +  +     P     +  +V EL++ + + 
Sbjct: 919  IQTNWIIKRGSYWTIGSGQNTNIWEDNWIQQASSTYKCNPKPNNLNLTKVQELMDTNNNA 978

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXXX 723
            W E  +  IF   +A  ILKIP+ +    D L W +T+ G Y+VKSGY            
Sbjct: 979  WKEDIINQIFQPYEAQMILKIPIMDKAQPDTLTWDNTQDGIYSVKSGYH--SIMEWSHTL 1036

Query: 724  XXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVEI 903
                      LWK IW LN+PPK    +W+V  + LPV   L +R    DPLC +C   +
Sbjct: 1037 NATTSNNSQDLWKAIWKLNVPPKHTHLLWRVLKNALPVKNNLFKRGVRCDPLCPRCTNCL 1096

Query: 904  ETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMWS 1083
            ET  H   DC W+   W AS L L+    +    + D I   + N + E  E    +++ 
Sbjct: 1097 ETTNHVFLDCEWTKQVWFASSLNLNLG-QNQITDVYDWIRYMINNTNKECIEQITAIIYG 1155

Query: 1084 IWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPE---QSAATLQIWSKPP 1254
            IWYARN L FQ K L   +  ++A +  + YQ          P+   +S +    WS P 
Sbjct: 1156 IWYARNMLVFQEKLLPPQEISSIATKQLQEYQLHGFEQEIHEPQVRTKSCSNDISWSPPL 1215

Query: 1255 PGSTKINSDASVVRSQGTGIGVSIR 1329
             G+ KIN DA +        G+ +R
Sbjct: 1216 RGTLKINVDAHLSSDGHWSTGLVLR 1240


>ref|XP_023906330.1| uncharacterized protein LOC112018052 [Quercus suber]
          Length = 1301

 Score =  321 bits (822), Expect = 6e-95
 Identities = 159/403 (39%), Positives = 232/403 (57%), Gaps = 2/403 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            +LIK+VAQA+PT+ MS F +P+ +C  INS++A +WWGQ  DE+++HW SW  LC PK+ 
Sbjct: 906  ILIKTVAQAVPTYTMSIFNIPKQICEDINSILARYWWGQLRDEKKVHWMSWRRLCKPKKV 965

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GF++L AFN A+LAKQ W+++Q K SL  R YKARYFP   FL A++G NPSY WRS
Sbjct: 966  GGMGFQDLHAFNLALLAKQAWKLVQKKNSLFYRIYKARYFPTTTFLEAELGHNPSYVWRS 1025

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            +L+ +EI+  G  W VGDG +I+V    W+         +P      DM VHELIN  T 
Sbjct: 1026 LLSAREIVMAGSRWQVGDGKSIKVTSHAWLTHPPRLNGDIP-----EDMWVHELINQQTR 1080

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXX 720
             WD G++  +F  E   +IL +PL  +   D++ W   +  +++VKS Y +         
Sbjct: 1081 QWDRGKITALFGVETRQEILAVPLTRMEDEDRVIWKENKGQSFSVKSAYLVAQRLAQQQS 1140

Query: 721  XXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVE 900
                       +WK IWS+N+PPK+R F+W+   +ILP  A L RR  GVDP C+ C  +
Sbjct: 1141 GESSQSRREEKVWKLIWSMNVPPKVRNFVWRACLNILPTRANLQRRKIGVDPRCEFCKQQ 1200

Query: 901  IETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAG--ELFACL 1074
             ET  H L +CP++   W  +++R       + A+   L++  +++R LE G  E++A  
Sbjct: 1201 PETAGHVLWECPFARNTW--ALVRGSLQKCPNEATDIFLLLKGLQDR-LERGDVEVWAVT 1257

Query: 1075 MWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHR 1203
             W++W ARN+  F+   L        A      YQ   E  +R
Sbjct: 1258 AWALWNARNKYYFERVQLQPTVIACGALTLLSEYQSLMEAQNR 1300


>ref|XP_024037590.1| uncharacterized protein LOC112097210 [Citrus clementina]
          Length = 1154

 Score =  318 bits (816), Expect = 9e-95
 Identities = 168/445 (37%), Positives = 254/445 (57%), Gaps = 2/445 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            VLIK+VAQAIPT+ MS F +P  +C  I   +A FWWG K D + IHW  W  + + K  
Sbjct: 590  VLIKAVAQAIPTYAMSVFKIPLGLCEDIQKAMARFWWGTKQDRKGIHWARWERISHSKAR 649

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GFR+LS+FN+A++AKQGWRI+Q  +SL+AR  KARYF +  F++A +G  PS+ WRS
Sbjct: 650  GGMGFRDLSSFNQALVAKQGWRIMQFPSSLVARVLKARYFKHTGFMNAGLGSKPSFVWRS 709

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRP-QLPNIGENSDMRVHELINMDT 537
            I+ G+++L KG  W +G+G  + V+ + WI     F+P   P++G  +D  V ELI+ + 
Sbjct: 710  IVWGRQVLHKGARWRIGNGQNVLVYGNNWIPRPTTFKPISAPSMG--TDTTVAELID-EK 766

Query: 538  HCWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXX 717
              W E  +   F  EDA+ I++IPL      DQL WH+ + G Y+VKSGY++        
Sbjct: 767  QQWREDLILQHFRPEDAEAIMQIPLPKRPKEDQLIWHYDKKGYYSVKSGYQV--AMRIKF 824

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                        LW++IW L IP K++IF+W+ A+D+LP    L ++    +P+C+ C  
Sbjct: 825  PEDPSCSNHDQNLWRFIWKLAIPEKVKIFLWRAAHDLLPTAENLWKKKVLQEPMCQSCHC 884

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLM 1077
             +ET  HAL +C  +   WR S L  ++        +  ++  + +      G   A L+
Sbjct: 885  HVETVSHALVECNRARKIWRYSNL-AEELRGVYRCDIVWMLQFWPRQHAKVEGAEVAALL 943

Query: 1078 WSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQIWSKPPP 1257
            W+IW ARN+  F+GK  +       A+    S++K  +P      + +A   + WS PP 
Sbjct: 944  WAIWKARNKWLFEGKKENPLRVVANAEAIVESFKKIRQPEMVYKTKGNAERQKQWSPPPN 1003

Query: 1258 GSTKINSDASV-VRSQGTGIGVSIR 1329
            G  K+N DA+V V +Q  G+GV +R
Sbjct: 1004 GWQKVNVDAAVDVENQMAGLGVVVR 1028


>ref|XP_023884545.1| uncharacterized protein LOC111996780 [Quercus suber]
          Length = 1038

 Score =  316 bits (810), Expect = 1e-94
 Identities = 164/449 (36%), Positives = 239/449 (53%), Gaps = 6/449 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            +LIK+V QAIPT+ MSCF LP  +C+++ SLI  FWWGQ+ D R+IHW +W  L  PK  
Sbjct: 525  ILIKAVVQAIPTYTMSCFKLPLGLCSELESLIRKFWWGQRGDRRKIHWVNWETLTQPKSA 584

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GF++L+ FN A+LAKQ WR+L  K SL+ + +K+++FP   F+ A    + SY WRS
Sbjct: 585  GGMGFKDLALFNDALLAKQAWRLLHSKESLLYKVFKSKFFPTCSFMEAPDNSSGSYAWRS 644

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            +L G+E+L +G  W VG G TI++WD  W+    + R   P I    +  V  LIN  T 
Sbjct: 645  LLKGREVLWRGARWRVGTGETIKIWDYSWLPSMEHPRILSPCIEGLEEATVDCLINPTTR 704

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXX 720
             WD   + G F   + D ILKIPL      D+L W H  +G Y+VKSGYR          
Sbjct: 705  SWDRNILTGYFAPMEVDLILKIPLSPTNVEDKLIWPHVPTGVYSVKSGYRFLAEDKPGLL 764

Query: 721  XXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVE 900
                       +W+ IW L++P K++ F+W+   + LPV   L RR    + +C  C ++
Sbjct: 765  PTQFSHGEATNIWRSIWRLSVPNKVKNFLWRACKEALPVKKNLVRRRVLDEDVCCHCKLK 824

Query: 901  IETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMV--FVKNRDLEAGELFACL 1074
             E   HAL DC      W   ++    WL       T+   +  FV  ++ +  ELFA +
Sbjct: 825  AEDGYHALWDCSELSTIWETDVM----WLFCRSKKFTNFFELARFVLEKE-QQPELFASI 879

Query: 1075 MWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQI---WS 1245
             W+IW  RNQL+   +    +     A +    + +     H  +P Q++A  Q    W 
Sbjct: 880  TWTIWSRRNQLRTSNRSFPLSQIIPSAKQMLHEFSEV----HPAAPAQTSAPPQSRPKWE 935

Query: 1246 KPPPGSTKINSDASVVR-SQGTGIGVSIR 1329
             PPP   KIN D +V + ++  G+GV +R
Sbjct: 936  PPPPSLLKINFDGAVFKETEEAGLGVVVR 964


>ref|XP_023899813.1| uncharacterized protein LOC112011695 [Quercus suber]
          Length = 1191

 Score =  315 bits (807), Expect = 3e-93
 Identities = 174/451 (38%), Positives = 244/451 (54%), Gaps = 8/451 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            VLIK+VAQAIPT+ MSCF +P ++C+++ ++I NFWWGQ+ +E +I W  W  +C PK N
Sbjct: 615  VLIKAVAQAIPTYTMSCFKIPDSLCDELTTMIRNFWWGQRKEENKISWLRWEKMCEPKSN 674

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GF+ L  FN A+LAKQGWR+     SL+ R  KARYFP  DF+ A +G NPSYTWRS
Sbjct: 675  GGMGFKNLKFFNLALLAKQGWRLQVGHDSLVYRVLKARYFPRCDFIHASMGNNPSYTWRS 734

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            ++A + ++++GL W VG+G++IRVW+D W+     ++   P +   +D RV ELIN DT 
Sbjct: 735  LMAAQNLVKEGLRWRVGNGASIRVWEDRWLPVPSTYKVTSPRLFLQADTRVQELINEDTA 794

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI-XXXXXXXX 717
             W    V  +F    AD I  IP+ +    D+L W  T +G +TV+S Y +         
Sbjct: 795  EWKFSVVDALFLPHKADIIKSIPISSRLPTDKLIWTETRNGQFTVRSAYHLAMNRSSSIS 854

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                         W  IWS+ +P KIR F W+V +D LP  + L RR    + LCK C  
Sbjct: 855  GGSSSNNSTLKCFWNKIWSIPVPHKIRHFAWRVCHDALPTKSNLLRRKVIQEDLCKSCRE 914

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGE----LF 1065
              ET  H L  CP +   W  S L ++ +    + +  DL+   + N   EAGE      
Sbjct: 915  APETVGHVLWSCPKAKEAWECSKLVIEGF-DRQNLTFQDLMWELLVNG--EAGEDKVAHA 971

Query: 1066 ACLMWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANE--PPHRQSPEQSAATLQI 1239
                W++W+ RN++++ G   S       A    R Y    E   P    P Q A+    
Sbjct: 972  VTTAWALWHNRNEVRYGGARKSGQQLSRWALDYLREYNAVAEVKVPREPVPRQVAS---- 1027

Query: 1240 WSKPPPGSTKINSDASVVRSQ-GTGIGVSIR 1329
            WS P  G  KIN D +V  +Q   G+GV IR
Sbjct: 1028 WSPPRNGHFKINVDGAVFAAQKAVGVGVVIR 1058


>gb|PRQ55763.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1228

 Score =  310 bits (794), Expect = 3e-91
 Identities = 176/451 (39%), Positives = 246/451 (54%), Gaps = 8/451 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            ++IK+VAQ+IPT++MSCF LP+++C +++ L+A FWWG K+++++IHW +W  LC PK  
Sbjct: 642  IMIKAVAQSIPTYVMSCFELPKHLCLEMHRLMAQFWWGDKSNDKKIHWLAWEKLCVPKAE 701

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFR +  FN+A+LAKQGWRI+Q+  SL+AR YKA+YFPN DF+ A+     SY W+S
Sbjct: 702  GGLGFRNMVQFNQALLAKQGWRIIQNPNSLLARLYKAKYFPNCDFMKAEASSGSSYAWKS 761

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRP-QLPNIGENSDMRVHELINMDT 537
            IL G+E+L+KGL   VG+GS I VW D W+   H+FRP   P  G  + +RV ELI+ + 
Sbjct: 762  ILFGRELLRKGLRLQVGNGSQIAVWTDQWLPLPHSFRPISTPKEGLET-LRVDELIDQEE 820

Query: 538  HCWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGY---RIXXXXX 708
            + W    ++ +F   + + I  IPL      D+  WHH + G Y VKSGY   RI     
Sbjct: 821  NEWHMFLLQELFMPLEVNLIASIPLSLRRPVDRWVWHHDKKGMYDVKSGYHIARIVDGAT 880

Query: 709  XXXXXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKK 888
                            WK +W   IPPK+R+F+W++   ILP    L ++    D  C  
Sbjct: 881  ERASCSTGPVGLNGKYWKLVWHARIPPKVRMFIWRLLKGILPTKHELCKKVALPDFECVF 940

Query: 889  CGVEIETREHALRDCPWSFFFWRASILRL---DQWLMSSHASMTDLIMVFVKNRDLEAGE 1059
            C    E   H  RDC +   FW    L L   D    S    + ++I V  ++       
Sbjct: 941  CHGAGEHGLHLFRDCSYIACFWALGELDLKVVDVMAASLEEWVKNVIDVITEHH----RN 996

Query: 1060 LFACLMWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQI 1239
            LF   +W IW  RN L + G   +  +    A +    YQ+ +  P RQ  ++S  T   
Sbjct: 997  LFFVYLWVIWSERNNLVWNGSVFNAYNAAQWASKFLGEYQQVH--PTRQ--KKSGRTRAK 1052

Query: 1240 WSKPPPGSTKINSDASVVRSQGT-GIGVSIR 1329
            W  PP G  KIN D S     G  GIGV IR
Sbjct: 1053 WENPPTGRLKINIDGSYRPESGDGGIGVVIR 1083


>ref|XP_023913142.1| uncharacterized protein LOC112024740 [Quercus suber]
          Length = 1194

 Score =  309 bits (792), Expect = 3e-91
 Identities = 173/446 (38%), Positives = 233/446 (52%), Gaps = 3/446 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            VLIK+VAQ+IPT+ M  FLLP  +CN++++L A FWWGQ  DER+IHWKSW  L  PK+ 
Sbjct: 643  VLIKAVAQSIPTYTMGVFLLPVKLCNELDALCARFWWGQSGDERKIHWKSWKFLTKPKKE 702

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GFR++ +FN AMLAKQGWR++QD+ SL++  +KA+YFP   FL A    N SY W+S
Sbjct: 703  GGMGFRDIRSFNLAMLAKQGWRLIQDQNSLLSLCFKAKYFPRCSFLEATDCPNSSYVWKS 762

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            +LA ++IL+ G  W VG GS+IRV +D WI  +   +   P   +  + RV +LI+   H
Sbjct: 763  VLAAQDILKSGCYWRVGTGSSIRVMEDKWIPNHPENKVLFPTENDEWEWRVSDLIDWRVH 822

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI--XXXXXXX 714
             WD  R+   F   DA+ ILKIPL      D+L W     G Y VKS Y +         
Sbjct: 823  GWDRERIYMCFNQFDAEAILKIPLSRRLIQDRLVWKFCRKGKYEVKSSYHVARMLDGDTN 882

Query: 715  XXXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCG 894
                          W  +W L +P KI++F W+   DILP    LARR    +  C+ C 
Sbjct: 883  GREECSVPRSDHRTWNQLWQLQVPSKIKVFGWRACLDILPSKVNLARRQVLQEDKCELCK 942

Query: 895  VEIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACL 1074
               ET  HA+ DC  +   W  S  R+ Q   S       L    +    +E  E+F   
Sbjct: 943  RSPETTVHAIWDCSAAQDVWAGSTARI-QKCGSVFDDFMQLFQGMMAKLSVEELEIFLVQ 1001

Query: 1075 MWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQIWSKPP 1254
             W IW+ RN L + G           A    + Y+ A       +   S   LQ W KPP
Sbjct: 1002 SWLIWHRRNTLLYGGSMQHPVQLNKRATDYLKEYRDAQ--TLLVAGSASTGFLQNW-KPP 1058

Query: 1255 PGST-KINSDASVVRSQGTGIGVSIR 1329
            PG   K+N DA+     G+G+G  IR
Sbjct: 1059 PGQLYKLNFDAATF-DYGSGVGAVIR 1083


>gb|PNX92808.1| ribonuclease H [Trifolium pratense]
          Length = 930

 Score =  304 bits (779), Expect = 7e-91
 Identities = 163/449 (36%), Positives = 236/449 (52%), Gaps = 7/449 (1%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT++MSCFLLP+ +C+ I S+I  FWWG  ND+R+IHW  W  +C  K+ G
Sbjct: 352  LIKAVAQAIPTYVMSCFLLPKELCSHIESMICKFWWGSNNDKRKIHWIKWSTICKHKKKG 411

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGC-NPSYTWRS 360
            GL FREL AFN+A+LAKQGWR +    S++A+  KA+Y+P    L A IG  N SYTWRS
Sbjct: 412  GLSFRELRAFNEALLAKQGWRCITQPNSMVAQVLKAKYYPKTTLLEADIGSKNVSYTWRS 471

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            I     IL+KG +W +G+G+T  +W D W+      +   P  GE +   V +L+  +  
Sbjct: 472  ISKASWILKKGGLWNIGNGATTNIWTDNWLPRQQGHKIWSPK-GEATQTWVKDLMIPEIR 530

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYR-IXXXXXXXX 717
             W+   +   F   +A+QI +IP+ ++   D+ +W +T+ G YTVKSGY+ I        
Sbjct: 531  SWNRQLIFDTFMQFEAEQITQIPIVHLSRPDEFSWPYTKDGIYTVKSGYQAIQDWKEDPN 590

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                        +W+ +W L IPPK    +W++  D LPV + L +R     PLC +C  
Sbjct: 591  KPSTSYKPKNNSVWQKLWHLKIPPKYTHLIWRILQDALPVQSNLRKRGVNCYPLCPRCKE 650

Query: 898  EIETREHALRDCPWSFFFWRAS--ILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFAC 1071
             IE   H    C W+   W AS   L+ DQ    +  S  + I   ++    +   L   
Sbjct: 651  NIEDLNHVFSGCWWAKQVWFASPLTLKFDQ----NEFSFKNWIEDNIRKDQSQNMNLIGA 706

Query: 1072 LMWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATL---QIW 1242
            + + IW ARN L FQ K++   D    A  C   Y +  +P   Q    + A       W
Sbjct: 707  ICYHIWRARNLLTFQNKNVPVLDVIHRAQECLFEYHRQVKPISEQVNRNTRARSNNDSDW 766

Query: 1243 SKPPPGSTKINSDASVVRSQGTGIGVSIR 1329
              PP  + K+N DA ++     G+G  +R
Sbjct: 767  IPPPKDTLKLNVDAHLMGDGHWGLGWILR 795


>dbj|GAU51007.1| hypothetical protein TSUD_411560 [Trifolium subterraneum]
          Length = 1556

 Score =  310 bits (795), Expect = 1e-90
 Identities = 167/451 (37%), Positives = 243/451 (53%), Gaps = 9/451 (1%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT+IMSCFLLP+ +C+++   I+NFWWG   D+++IHW SW  +C  K+ G
Sbjct: 979  LIKAVAQAIPTYIMSCFLLPRGLCDQMERQISNFWWGSNVDQKKIHWVSWKKVCKQKKMG 1038

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            G+GFR+L AFN+A+LAKQGWR++ +  SL+A   KA+YFP+  FL A+   N SY+W+SI
Sbjct: 1039 GMGFRDLKAFNEALLAKQGWRLITEPDSLVATVLKAKYFPHDQFLQAKQSYNASYSWQSI 1098

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWI---AENHNFRPQLPNIGENSDMRVHELINMD 534
                 IL+KG  W VG G  I +W+D WI   AE   +  +  N   N   +V +LI+  
Sbjct: 1099 RKANWILKKGCYWFVGKGDKINIWEDRWIHPQAEGATWTQKPTNTNIN---KVSDLIDAQ 1155

Query: 535  THCWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYR--IXXXXX 708
             H W+   +R  F   +A++IL IPL N    D+++W  T  G Y+VKSGY   I     
Sbjct: 1156 NHTWNSQIIRENFFPMEANKILDIPLTNSTEEDEISWRGTNDGNYSVKSGYNAMIEWDQA 1215

Query: 709  XXXXXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKK 888
                            W  IW L  PPK    +W+V ++ +PV A L  +    D LC +
Sbjct: 1216 KENPTQQSNTHMADSNWAKIWKLRNPPKQIHLLWRVLHNAIPVKANLIAKGILCDTLCPR 1275

Query: 889  CGVEIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFA 1068
            C    ET +H   +C W+   W +S L ++ +  S + S +D     + N   E  E+  
Sbjct: 1276 CNKSPETTDHTFLNCDWAQKTWFSSPLSINLY-KSRYQSFSDWFYYMLNNTPKECIEMIT 1334

Query: 1069 CLMWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQ----KANEPPHRQSPEQSAATLQ 1236
             + +SIWYARNQ  +  KD+        + +  + YQ       +PP   S   S++   
Sbjct: 1335 TITYSIWYARNQKIYHQKDIQIETNLMRSIQLLQDYQHNCNSLLDPPSVSSHLSSSSNNI 1394

Query: 1237 IWSKPPPGSTKINSDASVVRSQGTGIGVSIR 1329
             WS P     K+N DA +  +   GIG+ +R
Sbjct: 1395 SWSPPSENYLKLNVDAHLRDAGHWGIGMILR 1425


>gb|PNY07715.1| ribonuclease H [Trifolium pratense]
          Length = 1567

 Score =  310 bits (795), Expect = 1e-90
 Identities = 160/448 (35%), Positives = 240/448 (53%), Gaps = 6/448 (1%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIK+VAQAIPT++MS FLLP+ +CN +  + + FWWG   D+R+IHW +W   C  K  G
Sbjct: 973  LIKAVAQAIPTYLMSSFLLPKGLCNLLEQMSSKFWWGSNVDQRKIHWVNWRKTCKQKNQG 1032

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            G+GFR++ AFN+A+LAKQGWRIL +  SL+A+  KA+YFP+G+FL A  G   SY+W+SI
Sbjct: 1033 GMGFRDIRAFNEALLAKQGWRILTEPNSLVAKILKAKYFPHGNFLQATQGKKSSYSWQSI 1092

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
                 IL+KG  WLV +G  I +W+D WI    N     P     +  +V +LIN  TH 
Sbjct: 1093 QKASWILKKGCFWLVENGQNINIWEDRWINPQGNSTTWTPKPINTNLEKVKDLINPTTHT 1152

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGY--RIXXXXXXXX 717
            W+E  +   F   +A+QIL+IPL N  + D ++W  T+ G YTVKSGY  ++        
Sbjct: 1153 WNEPVISKTFFPIEANQILQIPLSNTMTEDIISWQGTKDGNYTVKSGYNAQMEWANVESS 1212

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                         W  IW++ +PPK    +W++ ++ +PV   L ++   +D LC +C  
Sbjct: 1213 QAQTSNNHKDEPFWNKIWNIKVPPKQIHLLWRIMHNAIPVKTNLIKKGILIDSLCPRCNK 1272

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLM 1077
              ET +H    C W+   W  S L +    +  H +  + I   + N   E+ ++ + + 
Sbjct: 1273 GPETIDHLFLHCEWAHLVWFYSPLTIKTTNIQLH-TFHEWIKYMLHNTTKESMQILSSIT 1331

Query: 1078 WSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQ----KANEPPHRQSPEQSAATLQIWS 1245
            +SIW+ARN+  FQ KD+   +    A +    +Q    +A   P    P         W+
Sbjct: 1332 YSIWFARNKKVFQNKDIPVNETVDRALKNIHDFQHHLTEACFAPTNSKPPSVNRHNTSWN 1391

Query: 1246 KPPPGSTKINSDASVVRSQGTGIGVSIR 1329
             PP    K+N DA +      G+G  +R
Sbjct: 1392 PPPRNFLKLNVDAHLSDDGHWGLGWVLR 1419


>ref|XP_023914298.1| uncharacterized protein LOC112025844 [Quercus suber]
          Length = 1362

 Score =  309 bits (791), Expect = 2e-90
 Identities = 169/448 (37%), Positives = 246/448 (54%), Gaps = 5/448 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            +L+K+V QAIPTF MSCF +P  +CN I SLI  FWWGQ+  +R+IHW  W +LC PK  
Sbjct: 796  ILLKAVIQAIPTFAMSCFKIPITLCNDIESLIRKFWWGQRGSQRKIHWTKWSSLCLPKNQ 855

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GG+GF+EL  FN AMLAKQ WR+L++K SL  + +KA++FPNG  L A+ G   S+ W+S
Sbjct: 856  GGMGFKELQKFNDAMLAKQVWRLLENKDSLFHKFFKAKFFPNGSILDAKEGLG-SFAWKS 914

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            IL G+ ++ KGL W VG+G+ I ++ D W+    + +   P    + D RV  LI+ D  
Sbjct: 915  ILKGRAVIVKGLQWRVGNGAAIGIYRDAWLPPPQSSKVISPLNSLDIDARVSVLIDHDRK 974

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXX 720
            CW+EG +   F   DA +I  IPL     +D + W    +G ++VKSGY++         
Sbjct: 975  CWNEGVIDNTFLPSDASRIKAIPLSLTNCDDCVFWPRNPNGIFSVKSGYKLLMESELDDF 1034

Query: 721  XXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVE 900
                       +WK IWSL IP +++  MW+   D LP  A L +R    +  C  C + 
Sbjct: 1035 LTTSDQSMSKKVWKGIWSLRIPNRVKSLMWRAGLDSLPTRANLRKRRLINEDTCPHCNLN 1094

Query: 901  IETREHALRDCPWSFFFWRASILRLDQWLMSSH---ASMTDLIMVFVKNRDLEAGELFAC 1071
             E+  HAL  CP     W+       +WL+       S+ D+  + +++ DL   +LFA 
Sbjct: 1095 SESSLHALWSCPSLLPIWKVHF----EWLIKDSWNCRSLLDVFQLCLESSDLL--DLFAM 1148

Query: 1072 LMWSIWYARNQLQFQGKDLSHAD-CFTMADRCFRSYQKANEPPHRQSPEQSAATLQIWSK 1248
            +   IW  RNQL+  G+  +  D   +MA    + +++A+ PP R +P  S A    W+ 
Sbjct: 1149 ISSLIWARRNQLRV-GESAAPLDRICSMAVANLQEFRRASPPPLRSTPSVSPAK---WTP 1204

Query: 1249 PPPGSTKINSDASVVRSQG-TGIGVSIR 1329
            PP G  KIN D +    +G  G+G  IR
Sbjct: 1205 PPLGWMKINFDGATFAEKGLAGLGAVIR 1232


>ref|XP_023878301.1| uncharacterized protein LOC111990748 [Quercus suber]
          Length = 1325

 Score =  308 bits (790), Expect = 2e-90
 Identities = 169/450 (37%), Positives = 238/450 (52%), Gaps = 7/450 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            +LIK+VAQAIPT+ MSCFLLPQ +C+ +  ++ NFWWGQ+N E ++ W SW  +CN K +
Sbjct: 755  ILIKAVAQAIPTYTMSCFLLPQGLCDDMERMMKNFWWGQRNQETKMGWISWKRMCNSKAS 814

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFR L AFN AMLAKQ WRIL +  SL+ R  KARYFP GD L+A++G +PSY+WRS
Sbjct: 815  GGLGFRNLKAFNLAMLAKQAWRILYNPNSLVGRVLKARYFPTGDLLNAKLGSSPSYSWRS 874

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            I +  E++++G  W VG+G  I +W+D W+     ++   P I       V  LI+ DT 
Sbjct: 875  IHSSLEVIRRGTRWRVGNGKQIHIWEDRWLPTPSTYKVISPQIHNFEFPLVSSLIDPDTK 934

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI-XXXXXXXX 717
             W    +R IF   + + IL+IPL      D+L W   + G ++VKS Y I         
Sbjct: 935  WWKVEALRSIFLPFEVETILRIPLSYNLPEDKLIWIGNKKGEFSVKSAYHIAHSIIDPNE 994

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                        LWK +W LN+P KI+IF W+   D LP    +++R       C  CG+
Sbjct: 995  RGECSNGDPYRLLWKKLWLLNLPGKIKIFAWRACVDGLPTYDNISKRGICCSSTCPICGL 1054

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLM 1077
              E   HAL  C  +   W        +   S + S  D+ +    ++  +  ELF  L 
Sbjct: 1055 VTEDVNHALLYCEAASLVW-CFWSDYPETPQSHNGSFLDMALHLCHSKASQVLELFFVLS 1113

Query: 1078 WSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANE-----PPHRQSPEQSAATLQIW 1242
            W+IWY RN++      LS +  + MA+     ++KA       P H Q           W
Sbjct: 1114 WAIWYNRNKIVHNDSPLSPSQVWLMANNTLEDFKKAASLDIIPPRHSQIR---------W 1164

Query: 1243 SKPPPGSTKINSD-ASVVRSQGTGIGVSIR 1329
              PP G  K+N D A+  + + + IGV IR
Sbjct: 1165 EAPPLGIFKVNVDGATSDQGRNSSIGVIIR 1194


>ref|XP_024156142.1| uncharacterized protein LOC112164137 [Rosa chinensis]
          Length = 1293

 Score =  308 bits (788), Expect = 3e-90
 Identities = 164/446 (36%), Positives = 246/446 (55%), Gaps = 3/446 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            V+IKSV Q++PT++MSCF LP+++C +++  +A FWWG     R+IHW +W  +C PKE 
Sbjct: 600  VMIKSVVQSVPTYVMSCFELPKHLCQEMHRCMAEFWWGDSEKGRKIHWLAWDKMCVPKEE 659

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFR +  FN+A+LAKQGWRIL+   SL+ +T KA+YFPN DF+ A +    SYTWRS
Sbjct: 660  GGLGFRNMEYFNQALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHASVNQGDSYTWRS 719

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            ++ GK +L+KGL + VG G+ I VW DPWI   ++FRP    +    D+ V +LI+ D+ 
Sbjct: 720  LMKGKVLLEKGLRFQVGLGTRISVWFDPWIPRPYSFRPYSTVMEGLEDLTVADLIDPDSK 779

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI-XXXXXXXX 717
             W    +  +F  ++ D I KIPL      D+L WH  + G Y+VKSGY +         
Sbjct: 780  DWMVDWLEELFFADEVDLIRKIPLSLRNPEDRLIWHFDKRGLYSVKSGYHVARCVASLSS 839

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                        LW+ +W   + PK+R F+W++  +I+P    L RR    + +C  C  
Sbjct: 840  HVSTSNSQGDKDLWRRVWHARVQPKVRNFVWRLVKNIVPTKVNLGRRVNLDERICPFCRC 899

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLM 1077
            E ET  H   +C      W  S L L     +++ S+ + ++  +   +    ++F  L+
Sbjct: 900  ESETTLHVFMECNVIACMWLFSSLGLRAKNHTTN-SVKEWVLDMLDVLNKSQVDIFFMLL 958

Query: 1078 WSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKAN-EPPHRQSPEQSAATLQIWSKPP 1254
            W+IW  RN+L + G   +     T +      YQ+ + E    +SP  +AA    W  PP
Sbjct: 959  WAIWSERNKLVWNGGTFNPMHTVTWSMHLLSEYQRCHPEKSTHKSPRGAAAK---WMFPP 1015

Query: 1255 PGSTKINSDASVVRSQGT-GIGVSIR 1329
             G  KIN D +   ++G  GIGV +R
Sbjct: 1016 RGRLKINVDGAYKSNEGCGGIGVVVR 1041


>ref|XP_024172006.1| uncharacterized protein LOC112178017 [Rosa chinensis]
          Length = 1045

 Score =  304 bits (779), Expect = 4e-90
 Identities = 160/448 (35%), Positives = 248/448 (55%), Gaps = 5/448 (1%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            VLIK+V Q+IPT++MSCF LP+++C++++ L+A FWWG+  +ER+IHW +W  LC PK+ 
Sbjct: 407  VLIKAVVQSIPTYVMSCFELPKHLCDEMHRLMARFWWGEFGEERKIHWVAWDKLCAPKKE 466

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFR++  FN A+LAKQGWR++    SL+A+ +KARYFPN DF+ A +    S++WRS
Sbjct: 467  GGLGFRDMHLFNTALLAKQGWRLICRLDSLLAQVFKARYFPNTDFMHAVLHKGASFSWRS 526

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            I+ G+++L+KGL + VG+G  I +W+DPW+   + F+P    +    D+RV +LI+ +T 
Sbjct: 527  IMKGRDLLKKGLRFQVGNGEDISIWNDPWVPLPYRFKPFSIPMQGAEDLRVVDLIDEETG 586

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI--XXXXXXX 714
             W E  +  +FT  +   I+KIPL      D+L WH  + G Y+VK+GY +         
Sbjct: 587  DWQEWLLHELFTPMEVVNIMKIPLSLSGGIDRLVWHFDKKGRYSVKNGYHVARVMDTLER 646

Query: 715  XXXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCG 894
                         LW  +W +N+PPK+R+  W++    LP  A LA+R    D  C  C 
Sbjct: 647  TASGSSFGADRARLWGKLWKVNVPPKVRMHAWRLVKGTLPTRAALAKRVQLSDVRCVYCS 706

Query: 895  VEIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVF--VKNRDLEAGELFA 1068
              +E   H  ++C     FW  S L L       H S+   +  +  V   + E  E F 
Sbjct: 707  NGVEDSLHLFKNCDALRTFWMHSSLELQP---GKHPSIVLDVWFWDMVDALNGEKLEYFL 763

Query: 1069 CLMWSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQIWSK 1248
              +W +W  RN + ++    +  +    A      Y++     H++   ++  +L  W+ 
Sbjct: 764  MALWVVWMERNNIVWKASVSNPTNMHAWAKSFLHEYKQL----HKRQGGKAKRSLIKWTC 819

Query: 1249 PPPGSTKINSDAS-VVRSQGTGIGVSIR 1329
            PP G  KIN D S    + G G+GV +R
Sbjct: 820  PPRGRLKINIDGSFCTETGGGGVGVVVR 847


>ref|XP_024172304.1| uncharacterized protein LOC112178381 [Rosa chinensis]
          Length = 1602

 Score =  309 bits (791), Expect = 5e-90
 Identities = 162/445 (36%), Positives = 243/445 (54%), Gaps = 2/445 (0%)
 Frame = +1

Query: 1    VLIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKEN 180
            V+IKSV Q++PT++MSCF LP+++C +++  +A FWWG     R+IHW +W  +C PKE 
Sbjct: 808  VMIKSVVQSVPTYVMSCFELPKHLCQEMHRCMAEFWWGDSEKGRKIHWLAWDKMCVPKEK 867

Query: 181  GGLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRS 360
            GGLGFR +  FN+A+LAKQGWRIL+   SL+ +T KA+YFPN DF+ A +    SYTWRS
Sbjct: 868  GGLGFRNMEYFNQALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHASVNQGDSYTWRS 927

Query: 361  ILAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTH 540
            ++ GK +L+KGL + VG G+ I VW DPWI   ++FRP    +    D+ V +LI+ D+ 
Sbjct: 928  LMKGKVLLEKGLRFQVGSGTRISVWFDPWIPRPYSFRPYSTVMEGLEDLTVADLIDPDSK 987

Query: 541  CWDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRI-XXXXXXXX 717
             W    +  +F  ++ D I KIPL      D+L WH  + G Y+VKSGY +         
Sbjct: 988  DWMVDWLEELFFADEVDLIRKIPLSLRNPEDRLIWHFDKRGLYSVKSGYHVARCVASLSS 1047

Query: 718  XXXXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGV 897
                        LW+ +W   + PK+R F+W++  +I+P    L RR    + +C  C  
Sbjct: 1048 HVSTSNSQGDKDLWRRVWHARVQPKVRNFVWRLVKNIVPTKVNLGRRVNLDERICPFCRC 1107

Query: 898  EIETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLM 1077
            E ET  H   +C      W  S L L     +++ S+ + ++  +   +    ++F  L+
Sbjct: 1108 ESETTLHVFMECNVIACMWLFSSLGLRAKNHTTN-SVKEWVLDMLDVLNKSQVDIFFMLL 1166

Query: 1078 WSIWYARNQLQFQGKDLSHADCFTMADRCFRSYQKANEPPHRQSPEQSAATLQIWSKPPP 1257
            W+IW  RN+L + G   +     T +      YQ+ +         + AAT   W  PP 
Sbjct: 1167 WAIWSERNKLVWNGGTFNPMHTVTWSMHLLSEYQRCHPEKSTHKSPRGAATK--WMFPPR 1224

Query: 1258 GSTKINSDASVVRSQGT-GIGVSIR 1329
            G  KIN D +   ++G  GIGV +R
Sbjct: 1225 GRLKINVDGAYKSNEGCGGIGVVVR 1249


>gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]
          Length = 894

 Score =  301 bits (770), Expect = 8e-90
 Identities = 159/448 (35%), Positives = 236/448 (52%), Gaps = 6/448 (1%)
 Frame = +1

Query: 4    LIKSVAQAIPTFIMSCFLLPQNVCNKINSLIANFWWGQKNDERRIHWKSWHALCNPKENG 183
            LIKSVAQAIP +IMSC+ LP   C+ I +++A FWWG    +R+IHW SW+ L   K  G
Sbjct: 388  LIKSVAQAIPNYIMSCYKLPTGCCDNIEAMLAKFWWGTTEKQRKIHWVSWNKLGKAKSKG 447

Query: 184  GLGFRELSAFNKAMLAKQGWRILQDKTSLIARTYKARYFPNGDFLSAQIGCNPSYTWRSI 363
            GLGFR    FNKA+L KQ WR+LQ+  SL+A+ +K+RYFP   F+ A +G  PSY WRS+
Sbjct: 448  GLGFRSFEDFNKALLGKQCWRLLQNPDSLLAKVFKSRYFPRSKFMDANVGYQPSYAWRSL 507

Query: 364  LAGKEILQKGLIWLVGDGSTIRVWDDPWIAENHNFRPQLPNIGENSDMRVHELINMDTHC 543
               +E++  G  WL+G+G  + +W+D W+     F+   P      +  V +LIN++T  
Sbjct: 508  CNSREVIDVGARWLIGNGKDVHIWNDKWLPAQDKFKVWSPVSNLAPNAMVSDLINLETKM 567

Query: 544  WDEGRVRGIFTWEDADQILKIPLRNIWSNDQLAWHHTESGTYTVKSGYRIXXXXXXXXXX 723
            WD+  V+  F   +A+QIL IPL      D+L WH  ++G ++V+S Y +          
Sbjct: 568  WDKNLVQNCFNSFEAEQILNIPLSWRLPADKLIWHWEKNGEFSVRSAYHM-LSEIRNQNS 626

Query: 724  XXXXXXXXXXLWKWIWSLNIPPKIRIFMWKVANDILPVNARLARRSFGVDPLCKKCGVEI 903
                      LWK IW + +P  I+ F+W++A  ILP  +RL ++   +D  C  C  +I
Sbjct: 627  PEASSSRDHLLWKAIWKVKVPNCIKNFLWRLAKAILPTRSRLEKKGITLDTTCPLCFNDI 686

Query: 904  ETREHALRDCPWSFFFWRASILRLDQWLMSSHASMTDLIMVFVKNRDLEAGELFACLMWS 1083
            E  EH    CP S   W +S L L      ++  +   + +++ N D  A +LF+  +W 
Sbjct: 687  ECNEHLFMHCPLSKQVWFSSPLGLH---APNNVGLISWMQLWLSNPDKLASQLFSTTLWM 743

Query: 1084 IWYARNQLQFQGKDLSHADCFTMADRC-----FRSYQKANEPP-HRQSPEQSAATLQIWS 1245
            IW  RN+L F  K++     +  A        F S    NE    R++P+        W 
Sbjct: 744  IWKGRNKLIF--KNVKFCPIYVAAASSDFVAEFNSGSCCNESNIVRENPDS-------WE 794

Query: 1246 KPPPGSTKINSDASVVRSQGTGIGVSIR 1329
             P     K+N DA    +  TG G+ +R
Sbjct: 795  PPEQAKFKVNIDAGCFSNGTTGWGMIMR 822


Top