BLASTX nr result

ID: Cheilocostus21_contig00033398 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00033398
         (857 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus]       129   7e-30
ref|XP_020096969.1| uncharacterized protein LOC109716078, partia...   129   7e-30
ref|XP_020088996.1| uncharacterized protein LOC109710675 [Ananas...    82   4e-17
gb|PNX91317.1| ribonuclease H, partial [Trifolium pratense]            80   4e-14
gb|PRQ16242.1| putative ribonuclease H-like domain, reverse tran...    81   2e-13
gb|KYP71696.1| Putative ribonuclease H protein At1g65750 family ...    75   9e-12
ref|XP_013718796.1| uncharacterized protein LOC106422552 [Brassi...    76   9e-12
ref|XP_023909265.1| uncharacterized protein LOC112020926 [Quercu...    71   9e-12
gb|PRQ37611.1| putative reverse transcriptase zinc-binding domai...    74   1e-11
ref|XP_022158377.1| uncharacterized protein LOC111024874 [Momord...    75   2e-11
gb|EMS67607.1| Putative disease resistance protein RGA3 [Triticu...    74   4e-11
ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa c...    65   4e-11
gb|OVA00965.1| Reverse transcriptase zinc-binding domain [Maclea...    74   5e-11
dbj|GAU22350.1| hypothetical protein TSUD_106780 [Trifolium subt...    73   1e-10
gb|OVA11190.1| Reverse transcriptase zinc-binding domain [Maclea...    72   1e-10
ref|XP_019163494.1| PREDICTED: uncharacterized protein LOC109159...    72   3e-10
ref|XP_020109325.1| uncharacterized protein LOC109724810 [Ananas...    72   3e-10
gb|PRQ33081.1| putative RNA-directed DNA polymerase [Rosa chinen...    67   3e-10
gb|OVA20918.1| Endonuclease/exonuclease/phosphatase [Macleaya co...    71   4e-10
dbj|GAU38148.1| hypothetical protein TSUD_395930 [Trifolium subt...    71   4e-10

>gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus]
          Length = 851

 Score =  129 bits (324), Expect = 7e-30
 Identities = 79/249 (31%), Positives = 117/249 (46%), Gaps = 2/249 (0%)
 Frame = +1

Query: 112  LLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKYDFIDPWNLTPQRKS 291
            + +P  +GGL   +L                      W R+   KY  +DPW       +
Sbjct: 337  ITKPLTEGGLSFRNLRKSREAFMARNALKLLNNINLPWVRIMKGKYGELDPWKNPKLTTT 396

Query: 292  SRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFI--DMGSISNQQV 465
            S AW+AL   ++ IK  L   I DG NT+ +  P      F+ KPTFI  ++G    Q +
Sbjct: 397  SWAWRALNYTVQAIKLGLQISISDGLNTNFLEHPWLFTTPFSKKPTFIATNIGE-KVQHI 455

Query: 466  SSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQVLF 645
            + +I N  W   +I+   G+ +V+ I+N  L   P  D W W+ +  G      VY  L 
Sbjct: 456  NQIIENGEWCYNSIEGLVGEDLVDAITNLQLGEGP--DKWVWSLHPQGKARAGSVYSFLN 513

Query: 646  NHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANICVLCKQN 825
             HT      W GWK LW L ++ ++K   WK    +L T DFL +RGL  +N+C LC + 
Sbjct: 514  GHTDNC---WDGWKQLWGLAVAPRVKTFLWKYFWKRLPTKDFLQQRGLTQSNLCALCGEA 570

Query: 826  EESCKHLFY 852
             E+ +HLF+
Sbjct: 571  AENIQHLFF 579


>ref|XP_020096969.1| uncharacterized protein LOC109716078, partial [Ananas comosus]
          Length = 1220

 Score =  129 bits (324), Expect = 7e-30
 Identities = 79/249 (31%), Positives = 117/249 (46%), Gaps = 2/249 (0%)
 Frame = +1

Query: 112  LLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKYDFIDPWNLTPQRKS 291
            + +P  +GGL   +L                      W R+   KY  +DPW       +
Sbjct: 864  ITKPLTEGGLNFRNLRKSREAFMARNALKLLNNINLPWVRIMKGKYGELDPWKNPKLTTT 923

Query: 292  SRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFI--DMGSISNQQV 465
            S AW+AL   ++ IK  L   I DG NT+ +  P      F+ KPTFI  ++G    Q +
Sbjct: 924  SWAWRALNYTVQAIKLGLQISISDGLNTNFLEHPWLFTTPFSKKPTFIATNIGE-KVQHI 982

Query: 466  SSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQVLF 645
            + +I N  W   +I+   G+ +V+ I+N  L   P  D W W+ +  G      VY  L 
Sbjct: 983  NQIIENGEWCYNSIEGLVGEDLVDAITNLQLGEGP--DKWVWSLHPQGKARAGSVYSFLN 1040

Query: 646  NHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANICVLCKQN 825
             HT      W GWK LW L ++ ++K   WK    +L T DFL +RGL  +N+C LC + 
Sbjct: 1041 GHTDNC---WDGWKQLWGLAVAPRVKTFLWKYFWKRLPTKDFLQQRGLTQSNLCALCGEA 1097

Query: 826  EESCKHLFY 852
             E+ +HLF+
Sbjct: 1098 AENIQHLFF 1106


>ref|XP_020088996.1| uncharacterized protein LOC109710675 [Ananas comosus]
          Length = 1113

 Score = 82.4 bits (202), Expect(2) = 4e-17
 Identities = 54/213 (25%), Positives = 94/213 (44%), Gaps = 4/213 (1%)
 Frame = +1

Query: 223  WFRVSSAKYDFIDPWNLTPQR--KSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP* 396
            W ++   KY    PW  T ++  K S  +KAL V + +I+  + K I DG +T I + P 
Sbjct: 636  WVKLVRDKYRGFHPWKFTDRKYKKCSPIFKALRVTMHVIRDGMCKLIGDGKDTTIWDDPW 695

Query: 397  CLDVSFAFKPTFIDMG-SISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPS 573
               +  ++KPT+I+M      ++V+ LI   +W+   +    G  +   I    L     
Sbjct: 696  IFSIPLSWKPTYINMNVRFGKKKVARLIRAGNWNYDRLVEWFGPILAHNICKIILSPDNG 755

Query: 574  TDCWFWNGNSVGLLITKCVY-QVLFNHTIPATISWPGWKILWKLRISAKIKILGWKLIHH 750
            +D W W     G    K +Y  +     +P       W +LW L ++ ++K   WKL+ +
Sbjct: 756  SDEWIWAPKKDGKPSVKSIYHHINQGFYVPQATK---WIVLWSLPVAPRVKNFLWKLLWN 812

Query: 751  KLQTPDFLAKRGLLGANICVLCKQNEESCKHLF 849
            +L T +         +  C+ C   E+   H+F
Sbjct: 813  RLPTNERCYSLNSAPSPFCIYCSTPEDQ-NHIF 844



 Score = 34.7 bits (78), Expect(2) = 4e-17
 Identities = 16/42 (38%), Positives = 24/42 (57%)
 Frame = +3

Query: 3   SLGLSRSELNKLNTPIRDFI*GKNEAKKGLHLKNWETLET*R 128
           +  +S+   N L+  IRDF+  K   KK LHL  W+++ T R
Sbjct: 562 NFAVSKGVTNSLSRKIRDFLWEKRNNKKSLHLLKWDSVTTAR 603


>gb|PNX91317.1| ribonuclease H, partial [Trifolium pratense]
          Length = 232

 Score = 80.1 bits (196), Expect = 4e-14
 Identities = 42/143 (29%), Positives = 71/143 (49%)
 Frame = +1

Query: 424 PTFIDMGSISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNS 603
           P  ID+G++ N  V  +I+N  W +  I +     +   I   T+PS+P+ D   W  + 
Sbjct: 59  PDTIDVGNLQNAMVEDIIVNGEWRIPLIVQQIAPLLASEIRGLTIPSIPTPDSLIWEESI 118

Query: 604 VGLLITKCVYQVLFNHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKR 783
            GL+  K  Y    +++ P     P  K++W   +     ++ W++ H+ + T D L KR
Sbjct: 119 SGLMSFKDAYMNRASNSQPL----PWCKLIWSPIVQPARTLVLWRIFHNIIATDDNLRKR 174

Query: 784 GLLGANICVLCKQNEESCKHLFY 852
           G L  + C LC Q  E+ +HLF+
Sbjct: 175 GFLMVSCCSLCNQAYETSQHLFF 197


>gb|PRQ16242.1| putative ribonuclease H-like domain, reverse transcriptase
           zinc-binding domain-containing protein [Rosa chinensis]
          Length = 606

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 57/199 (28%), Positives = 92/199 (46%), Gaps = 8/199 (4%)
 Frame = +1

Query: 283 RKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGSISNQQ 462
           R SS  W++L+   KL+   L   + DG +  +I Q   + + ++FK        + N+ 
Sbjct: 147 RSSSLIWRSLLWGRKLLLSGLRWRVGDGTSI-LIYQDAWVPIPYSFK-IMSPPNLLVNRP 204

Query: 463 VSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQVL 642
           VS LI N SW+   I     ++ V  I +  LP  P +D   W+    G    K  YQ+ 
Sbjct: 205 VSYLISNGSWNFELISGSFWESEVSAICSIPLPITPCSDKQVWHYTKNGTYTVKSGYQLA 264

Query: 643 FNHTIPATISWPG--------WKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGA 798
               I                W+++WKL+I   +K+  W++IH  L T   L +R LL +
Sbjct: 265 QTLRIRGAGQEGSSTNQLDLVWQVMWKLKIPEGVKVFLWRVIHEILPTALLLQRRHLLAS 324

Query: 799 NICVLCKQNEESCKHLFYS 855
           ++C  C+  EE+  H F+S
Sbjct: 325 SVCSRCQGEEETILHAFWS 343


>gb|KYP71696.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 325

 Score = 75.1 bits (183), Expect = 9e-12
 Identities = 65/257 (25%), Positives = 110/257 (42%), Gaps = 12/257 (4%)
 Frame = +1

Query: 109 RLLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKYDFIDPWNLTPQ-- 282
           ++ RPKE GGL +  +  +                   W +V  AKY   +   + PQ  
Sbjct: 39  KICRPKEAGGLGLRSMRNVNTTYMMKNCWNLIQDPHKLWVQVVRAKYKCKN--GIIPQVM 96

Query: 283 --RKSSRAWKALMVDLKLIKPYLSKCIDDGN------NTHIINQP*CLDVSFAFKPTFID 438
              K S  WK +    +LIKP++   I++G       N  + N    +  S +F P  ++
Sbjct: 97  KTNKMSNLWKGICASWELIKPHICWRINNGRTASFWYNNWLPNHMPIIHYSSSFIPP-VE 155

Query: 439 MGSISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLP-STDCWFWNGNSVGLL 615
           +    ++  S      +W + ++Q    Q I+ETI +F LP+     D   W+ +  G+ 
Sbjct: 156 LFKTIHEYTSG---QGTWQIDHLQSWLPQNILETIYHFPLPTNGYGQDVVLWSPSKDGVF 212

Query: 616 ITKCVYQVLFN-HTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLL 792
            TK    +  + HT+      P +K +W+     +I++L W+++H  L T      RGL 
Sbjct: 213 TTKSAVTIASSSHTVSHP---PPFKAIWRWNGPERIRVLLWRVVHGSLMTNQVRVDRGLG 269

Query: 793 GANICVLCKQNEESCKH 843
               C +C Q  ES  H
Sbjct: 270 TDPTCSVCMQETESNLH 286


>ref|XP_013718796.1| uncharacterized protein LOC106422552 [Brassica napus]
          Length = 459

 Score = 75.9 bits (185), Expect = 9e-12
 Identities = 60/198 (30%), Positives = 89/198 (44%), Gaps = 9/198 (4%)
 Frame = +1

Query: 283 RKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGSISNQQ 462
           +K+S  W+ +     L+  +L K I DG+ T I  +P C+  S A  P        S+  
Sbjct: 17  KKASHGWRGIQAGRDLLVEHLGKFIGDGSTTKIWGEP-CISPSSAILPFGPTKEDESDLY 75

Query: 463 VSSLILNNS--WDVYNIQR*GGQAIVETISNFTLPSLPST-DCWFWNGNSVGLLITKCVY 633
           VS LIL  S  W++  +Q        E +S    PS+    D + W   + G    K  Y
Sbjct: 76  VSDLILRGSGTWNITRLQELLPDLAKEILS--LRPSVTGAQDSYVWYPVASGSYSAKSGY 133

Query: 634 QVLFNHTIPAT------ISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLG 795
                  +PAT       S+   K +W +    K+K+L WK++H  L T   L KRG+L 
Sbjct: 134 AAASASLLPATNQPVVPASFNWHKSVWNVESPPKLKLLIWKILHEALPTGFNLQKRGVLS 193

Query: 796 ANICVLCKQNEESCKHLF 849
             +C+ C Q  E+  HLF
Sbjct: 194 DTMCIQCGQ-VETMDHLF 210


>ref|XP_023909265.1| uncharacterized protein LOC112020926 [Quercus suber]
          Length = 1420

 Score = 71.2 bits (173), Expect(2) = 9e-12
 Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 12/221 (5%)
 Frame = +1

Query: 223  WFRVSSAKYDFIDPW-NLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*C 399
            W RV + KY     +  ++  +++S + +++    ++IK  L    D+G NT I   P  
Sbjct: 792  WVRVCNKKYVRNKKFMRMSMPKEASWSCQSIFGGREVIKKGLCHRKDNGLNTRIWEDPWI 851

Query: 400  LDVSFAFKPTFI-DMGSISNQQ---VSSLILNNS--WDVYNIQR*GGQAIVETISNFTLP 561
             +     +P+ I  + S + Q+   V+ LI+ +S  WD   I      A V  I +  L 
Sbjct: 852  PN-----EPSLIPQVRSRATQEAYLVADLIVKDSRQWDRGLISDLFEPATVNRILSTHLS 906

Query: 562  SLPSTDCWFWNGNSVGLLITKCVYQVLFNHTIPATISWP-----GWKILWKLRISAKIKI 726
               + D  FW  N  G    K  Y  +     P++IS P      WK+LWK++I A++K 
Sbjct: 907  QQSANDQVFWCLNPSGEFSVKSAYNAI---RTPSSISHPVLQNRDWKVLWKMKIHARLKN 963

Query: 727  LGWKLIHHKLQTPDFLAKRGLLGANICVLCKQNEESCKHLF 849
            L WK+    L T   L +R  L +N C LC+++ E+ +HLF
Sbjct: 964  LLWKMAWEILPTASILNRRFTLPSNECHLCEKSPETLEHLF 1004



 Score = 27.7 bits (60), Expect(2) = 9e-12
 Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
 Frame = +3

Query: 3   SLGLSRSELNKLNTPIRDFI*G-KNEAKKGLHLKNWETLET 122
           SL L + +  KL+   R+F  G K E KKG  LK+W+++ T
Sbjct: 717 SLLLPKQDCQKLDALNRNFWWGYKEENKKGCCLKSWDSICT 757


>gb|PRQ37611.1| putative reverse transcriptase zinc-binding domain-containing
           protein [Rosa chinensis]
          Length = 308

 Score = 74.3 bits (181), Expect = 1e-11
 Identities = 63/250 (25%), Positives = 102/250 (40%), Gaps = 6/250 (2%)
 Frame = +1

Query: 118 RPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKY-DFIDPWNLTPQRKSS 294
           +PK  GGL +   A                     W ++   KY   +  +    ++K+S
Sbjct: 44  QPKSTGGLGIRPSAYFNNAAIAKLAWKVINDQDNWWSQIMRRKYLRKVSFFQAKKKQKNS 103

Query: 295 RAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAF-----KPTFIDMGSISNQ 459
            AW  ++    LI   +   I +G N          D          K   I++     +
Sbjct: 104 IAWSGVLDARDLIMKGMRWIIGNGKNIKFWTFNWAFDCHLLHLIPDNKRNMINL----EE 159

Query: 460 QVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQV 639
            V+  I+N  WD   +       IV+ I +  LP     D + W  +S G    K    +
Sbjct: 160 SVAESIINGKWDKAKLAFVLDPNIVKQILSIPLPVCEQEDEYIWGPSSNGKFSIKSATWL 219

Query: 640 LFNHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANICVLCK 819
            ++H    + S    K LWKL +  KIKI GW L+  +L+T D L++ G++  N C+LC 
Sbjct: 220 QYDHLRKHSQSKLINK-LWKLNVQPKIKIFGWLLLRGRLKTRDRLSRFGIINDNSCLLCN 278

Query: 820 QNEESCKHLF 849
           ++ E+  HLF
Sbjct: 279 RDNETADHLF 288


>ref|XP_022158377.1| uncharacterized protein LOC111024874 [Momordica charantia]
          Length = 1459

 Score = 75.1 bits (183), Expect = 2e-11
 Identities = 67/264 (25%), Positives = 102/264 (38%), Gaps = 7/264 (2%)
 Frame = +1

Query: 82   RKACTSKIGRLLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKYDFID 261
            RK    K GR+  PKE GGL   DL                        +V   KY F D
Sbjct: 927  RKLHWMKWGRMCYPKECGGLNFRDLEGFNQALVAKHVWRFLQHPNLLVSKVLKHKY-FKD 985

Query: 262  PWNLTPQR--KSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFI 435
               L      KSS  WK  +    L+   L   + +G+     + P  L     FKP   
Sbjct: 986  TSLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDP-WLPRPTTFKPLRF 1044

Query: 436  DMGSISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLL 615
            + G++     S +  + +WDV +I         + I +  + S    D W W+ +  G  
Sbjct: 1045 NNGALDTTVASFITADGNWDVTSISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRGNY 1104

Query: 616  ITKCVYQVLFN---HTIPATISWPG--WKILWKLRISAKIKILGWKLIHHKLQTPDFLAK 780
              +  Y++  +   +   A+ ++ G  W  +WKL +  KIKI  W+  H  + T   L  
Sbjct: 1105 SVRSGYKLYMHLKCNATSASTNYRGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNLLL 1164

Query: 781  RGLLGANICVLCKQNEESCKHLFY 852
            RG+     C +C    ES  H F+
Sbjct: 1165 RGIGELPACTICGDRRESIIHAFF 1188


>gb|EMS67607.1| Putative disease resistance protein RGA3 [Triticum urartu]
          Length = 1426

 Score = 74.3 bits (181), Expect = 4e-11
 Identities = 53/193 (27%), Positives = 88/193 (45%), Gaps = 8/193 (4%)
 Frame = +1

Query: 289 SSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGSISNQQVS 468
           +S  WKA++ D +++K  L + + DG  T + +    +  S  FKP + +MGS     VS
Sbjct: 33  ASTTWKAIITDREVLKKGLIRRVGDGITTEVWHDN-WIQGSKTFKPVY-NMGSDPIHLVS 90

Query: 469 SLILNNS-WDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQVL- 642
            L+ ++  W+   I+R      V  I N   P +   D W W     G+   +  Y +L 
Sbjct: 91  ELMTDDGDWNEEVIERNFIAPDVLDILNMPRPRVAIPDFWAWGHERSGIFTVRLAYMMLM 150

Query: 643 ------FNHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANI 804
                 FN    +T     WK LW+L++  K++I  W+++   L   + L +R +     
Sbjct: 151 EDRENNFNQVGSSTQGEEVWKSLWRLKVRPKLRIFWWRVLKGFLPAKEELCRRHVGDDLT 210

Query: 805 CVLCKQNEESCKH 843
           C +C   EES  H
Sbjct: 211 CPMCGNQEESLFH 223


>ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa chinensis]
          Length = 1296

 Score = 65.5 bits (158), Expect(2) = 4e-11
 Identities = 55/214 (25%), Positives = 87/214 (40%), Gaps = 6/214 (2%)
 Frame = +1

Query: 226  FRVSSAKYDFIDPWNLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNN----THIINQP 393
            +R    K  ++   N    +  S  W++++  + L+K  L   + DG      T +   P
Sbjct: 809  YREKYLKRGWLFDQNYQQTKDCSSTWRSVLNGVNLLKKGLIWRVGDGRKIKFWTDVWFPP 868

Query: 394  *CLDVSFAFKPTFIDMGSISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPS 573
              L +++A   + I++       V S   +N WD+  +       I + I     P    
Sbjct: 869  TAL-INYALPDSIINI----EDTVCSFWNDNGWDLNLLSDCIPTGITDQILRIP-PGFDG 922

Query: 574  T--DCWFWNGNSVGLLITKCVYQVLFNHTIPATISWPGWKILWKLRISAKIKILGWKLIH 747
               D   W G S G    K  Y + F       +  P WK +WK+++  K+K   W L H
Sbjct: 923  CGDDTQIWGGTSNGSFSVKSAYNIFFEDY--EQMHSP-WKFIWKMQVPPKLKTFLWVLCH 979

Query: 748  HKLQTPDFLAKRGLLGANICVLCKQNEESCKHLF 849
             KL T     KR L   + C +C+ N ES  HLF
Sbjct: 980  GKLLTNAHRVKRNLTDDDTCPICRCNSESLSHLF 1013



 Score = 31.2 bits (69), Expect(2) = 4e-11
 Identities = 14/31 (45%), Positives = 22/31 (70%)
 Frame = +3

Query: 24  ELNKLNTPIRDFI*GKNEAKKGLHLKNWETL 116
           +L+K+N   R+F+ G  E KK +HL NW+T+
Sbjct: 741 DLDKIN---RNFLWGDTENKKKIHLVNWDTV 768


>gb|OVA00965.1| Reverse transcriptase zinc-binding domain [Macleaya cordata]
          Length = 626

 Score = 73.9 bits (180), Expect = 5e-11
 Identities = 67/259 (25%), Positives = 108/259 (41%), Gaps = 12/259 (4%)
 Frame = +1

Query: 109 RLLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKY-DFIDPWNLTPQR 285
           ++ +PKE GGL       +                   W ++ SA+Y    D W +    
Sbjct: 100 KICKPKELGGLGFRSPEKVNEALLAKLAWRFLTHTDSYWVKLLSARYLQRHDFWTVKKMV 159

Query: 286 KSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGSISNQQ- 462
             S  W  +++   +I P++   I DG+   I   P        F P F   GS  ++  
Sbjct: 160 NCSPVWTGILIGRDIIAPHVCWSIGDGSLVKIWTDP-----WVPFLPRFRVEGSSLDRGG 214

Query: 463 ---VSSLILNNS--WDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKC 627
              VS LI  NS  W +  +Q+      +  I    L +  S D   W     G   TK 
Sbjct: 215 LCFVSELICPNSRLWRLDLLQKFFLPHEITAILKIRLSAEASADTLIWMLTPTGDFSTKS 274

Query: 628 VYQVLFN----HTIPATISWPGWKILWKLR-ISAKIKILGWKLIHHKLQTPDFLAKRGLL 792
           VY+ L      H+  +T S+  WK  WK++ IS ++ +  W+L+H+ +     +A+ G  
Sbjct: 275 VYKSLLQGEPAHSESSTFSFD-WKRFWKVQGISPRVHLFLWRLLHNAIAVKVNIARHGTS 333

Query: 793 GANICVLCKQNEESCKHLF 849
               C LC ++EE+ +HLF
Sbjct: 334 VDTCCRLCYKSEETVEHLF 352


>dbj|GAU22350.1| hypothetical protein TSUD_106780 [Trifolium subterraneum]
          Length = 1200

 Score = 73.2 bits (178), Expect = 1e-10
 Identities = 53/194 (27%), Positives = 81/194 (41%), Gaps = 2/194 (1%)
 Frame = +1

Query: 268  NLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGS 447
            N T  R  S  W A+   L  +K        +GN               A    F+D+  
Sbjct: 820  NSTNNRTGSVTWNAIRKALSALK--------EGNGV------------IANLVMFVDIHD 859

Query: 448  ISNQQVSSLILNNSWDVYNIQR*GGQAIVETISNFTLPSLPST-DCWFWNGNSVGLLITK 624
            +   +V  +  +  W   ++     Q I E ++  T+   PS  DC+ W GN  G+   +
Sbjct: 860  LE-MRVRDVYKDGMWLFNSLYTTLSQEIKENLNFITISLNPSVNDCYTWKGNLNGIYTAR 918

Query: 625  CVYQVLFNHTIPAT-ISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGAN 801
              Y  L  H+  AT IS   W  LW +    K+K   W ++H+ L T D LA RG++  N
Sbjct: 919  DGYAWLNRHSFSATTISVASWSWLWHVSAPEKLKFFFWTMLHNSLPTRDMLAHRGIITRN 978

Query: 802  ICVLCKQNEESCKH 843
            +C  C  + E+  H
Sbjct: 979  LCPRCSNHAETTIH 992


>gb|OVA11190.1| Reverse transcriptase zinc-binding domain [Macleaya cordata]
          Length = 565

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 62/221 (28%), Positives = 98/221 (44%), Gaps = 12/221 (5%)
 Frame = +1

Query: 223 WFRVSSAKY-DFIDPWNLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*C 399
           W ++ SA+Y    D W +      S  W  +++   +I P++   I DG+   I   P  
Sbjct: 77  WVKLISARYLQRYDFWTVKKMVNCSPVWTGILIGRDIIAPHVCWSIGDGSLIKIWTDP-- 134

Query: 400 LDVSFAFKPTFIDMGSISNQQ----VSSLILNNS--WDVYNIQR*GGQAIVETISNFTLP 561
                 F P F   GS  ++     VS LI  NS  W +  +Q+      +  I    L 
Sbjct: 135 ---WVPFLPRFRVEGSSLDRGGLCFVSELICPNSRLWRLDLLQQFFLPHEIIAILKIRLS 191

Query: 562 SLPSTDCWFWNGNSVGLLITKCVYQVLFN----HTIPATISWPGWKILWKLR-ISAKIKI 726
           +  S D   W     G   TK VY+ L      H+  +T S+  WK  WK++ IS K+ +
Sbjct: 192 AEASADTLIWMLTPTGDFSTKSVYKSLLQGEPAHSESSTFSFD-WKRFWKMQGISPKVHL 250

Query: 727 LGWKLIHHKLQTPDFLAKRGLLGANICVLCKQNEESCKHLF 849
             W+L+H+ +     +A+ G      C LC ++EE+ +HLF
Sbjct: 251 FLWRLLHNAIAVKVNIARHGTSVDTYCRLCYKSEETVEHLF 291


>ref|XP_019163494.1| PREDICTED: uncharacterized protein LOC109159838 [Ipomoea nil]
          Length = 1628

 Score = 72.0 bits (175), Expect = 3e-10
 Identities = 55/188 (29%), Positives = 86/188 (45%), Gaps = 2/188 (1%)
 Frame = +1

Query: 292  SRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMGSISNQQVSS 471
            S  W+++M   +L+K    + I +G +TH+ + P  L  +          G I N  VS 
Sbjct: 1190 SYVWRSIMASQELVKSGCKRRIGNGKSTHVWSHP-WLPGTQGTLVLASQTGLIQNMLVSD 1248

Query: 472  LILNN--SWDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLITKCVYQVLF 645
            LI  +  SW++  +Q+     +   I    + +L   D W+W G+  G    K  Y+ L 
Sbjct: 1249 LIDVDLCSWNIPYVQQLFDPHVASQIVQLPV-NLMHDDLWYWEGDLRGCYSVKDGYKRLG 1307

Query: 646  NHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANICVLCKQN 825
                  ++ W G   +W+L I  K K L W+ + + L T D L KR +   NIC  C   
Sbjct: 1308 EVHAQGSVVWNG---IWRLNIPPKWKNLMWRALSNILPTLDNLIKRRVDLMNICPACGLL 1364

Query: 826  EESCKHLF 849
            EES  H+F
Sbjct: 1365 EESIMHIF 1372


>ref|XP_020109325.1| uncharacterized protein LOC109724810 [Ananas comosus]
          Length = 562

 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 66/273 (24%), Positives = 107/273 (39%), Gaps = 18/273 (6%)
 Frame = +1

Query: 85   KACTSKIGRLLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKY----- 249
            K C      + + K +GGL ++D+  M                   W RV    Y     
Sbjct: 269  KGCLLAWKNICKSKREGGLGILDVGAMNCALLTKWWWKFQQAPHLQWNRVIRDLYYIRRR 328

Query: 250  DFIDPWNLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPT 429
              ++  +  PQ   S  WK ++    + K   +  + +GN+        C + +      
Sbjct: 329  PLMEGRSFRPQ---SHWWKGVLSLKSIFKWGSTYKLGNGNSIDFWLDRWCGETTLG--SA 383

Query: 430  FIDMGSISNQQ---VSSLILNNSWDVYNIQR*GGQAIV----------ETISNFTLPSLP 570
            F  +  ISN++   VS ++  + W+   I     + ++          + +S+F L   P
Sbjct: 384  FPKIYQISNRRYLKVSEVLTEDGWNWNGIFGIDSEILIGLEDDIAELKDRLSHFYLGQSP 443

Query: 571  STDCWFWNGNSVGLLITKCVYQVLFNHTIPATISWPGWKILWKLRISAKIKILGWKLIHH 750
                W W+ N   L   K  Y  L +         PG   +W L I  K+K+  W  +  
Sbjct: 444  DQIFWRWSSNR--LFSVKSTYLALTD----GGTRDPGLNEIWGLHIPLKVKVFCWLALKK 497

Query: 751  KLQTPDFLAKRGLLGANICVLCKQNEESCKHLF 849
            +L T D LAKRG +G  +CVLC   +ES  HLF
Sbjct: 498  RLPTTDLLAKRGWVGNTVCVLCGVEDESVDHLF 530


>gb|PRQ33081.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1367

 Score = 67.0 bits (162), Expect(2) = 3e-10
 Identities = 62/252 (24%), Positives = 90/252 (35%), Gaps = 8/252 (3%)
 Frame = +1

Query: 118  RPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKY----DFIDPWNLTPQR 285
            +PK+ GGL +    +M                   W  + S KY      +D  N  P  
Sbjct: 845  QPKQLGGLGIKKTEVMNQAMLAKASWRLFINDSGLWANIYSKKYLKDCSLLDE-NYLPPS 903

Query: 286  KSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLD---VSFAFKPTFIDMGSISN 456
              S  W++++    L+K  L   + DG      +    L    ++FA     I+     N
Sbjct: 904  DCSSTWRSIVHGASLLKKNLKWRVGDGKTIKFWSDSWILPNALINFALPSAHIN----PN 959

Query: 457  QQVSSLILNNSWDVYNIQR*GGQAIVETISNF-TLPSLPSTDCWFWNGNSVGLLITKCVY 633
              +     +  WD+  +       I+  I N  T       D   W   S G    K  Y
Sbjct: 960  ATICDFWNDTGWDLDLLSSVVPNEIISLIINVPTGFEGCGDDTLIWGATSNGCFTVKSAY 1019

Query: 634  QVLFNHTIPATISWPGWKILWKLRISAKIKILGWKLIHHKLQTPDFLAKRGLLGANICVL 813
               F+ ++      P WK LWKL    K+    W + H KL T     +RGL     C L
Sbjct: 1020 SSTFDFSVQN----PQWKTLWKLNCPPKLMTFIWTVFHRKLLTNMQRVRRGLTTCATCPL 1075

Query: 814  CKQNEESCKHLF 849
            C   +ES  HLF
Sbjct: 1076 CLSADESLIHLF 1087



 Score = 26.9 bits (58), Expect(2) = 3e-10
 Identities = 13/31 (41%), Positives = 20/31 (64%)
 Frame = +3

Query: 24  ELNKLNTPIRDFI*GKNEAKKGLHLKNWETL 116
           +L+KLN   RDFI G    +K +HL +W+ +
Sbjct: 816 KLDKLN---RDFIWGDTTDRKKIHLVSWDVV 843


>gb|OVA20918.1| Endonuclease/exonuclease/phosphatase [Macleaya cordata]
          Length = 869

 Score = 71.2 bits (173), Expect = 4e-10
 Identities = 53/204 (25%), Positives = 90/204 (44%), Gaps = 9/204 (4%)
 Frame = +1

Query: 265  WNLTPQRKSSRAWKALMVDLKLIKPYLSKCIDDGNNTHIINQP*CLDVSFAFKPTFIDMG 444
            W ++    SS  W  +++   +I P++   I +G+   I   P  +     F    +   
Sbjct: 432  WTVSKASTSSSVWAGILIGRNIIAPHVCWFIGNGSMVKIWEDP-WVPTLPGFHVEGLPQA 490

Query: 445  SISNQQVSSLILNNS--WDVYNIQR*GGQAIVETISNFTLPSLPSTDCWFWNGNSVGLLI 618
            S     V+ LI ++S  W+++ +Q       +  I N  +P+ P  D   W     G   
Sbjct: 491  STEVTYVNELICSSSRHWNIHLLQHVFLPHEISAILNIRIPAEPCPDQLLWMRTPSGDFS 550

Query: 619  TKCVYQVLFNH------TIPATISWPGWKILWKLR-ISAKIKILGWKLIHHKLQTPDFLA 777
            TK VY+ L ++      T  A + W   K  WK+R IS ++ +  W+L+H+ +     +A
Sbjct: 551  TKSVYRCLADNEPSNSRTDSALLDW---KRFWKMRGISPRVHLFLWRLLHNAIALKTNIA 607

Query: 778  KRGLLGANICVLCKQNEESCKHLF 849
            KR      IC  C   EE+ +HLF
Sbjct: 608  KRIPTTDTICQFCFSKEETVEHLF 631


>dbj|GAU38148.1| hypothetical protein TSUD_395930 [Trifolium subterraneum]
          Length = 503

 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 70/283 (24%), Positives = 117/283 (41%), Gaps = 25/283 (8%)
 Frame = +1

Query: 76  KPRKACTSKIGRLLRPKEQGGLYMIDLALMXXXXXXXXXXXXXXXXXXXWFRVSSAKY-- 249
           K  K C  K   + +PK++GGL + DL L+                   W  V  A+Y  
Sbjct: 110 KQTKTCWVKWDVICKPKKEGGLGVRDLRLVNISLLAKWRWKLLTTECEVWKEVVGARYGR 169

Query: 250 DFIDPWNLTP---QRKSSRAWKALMV---DLKLIKPYLSKCIDDGNNTHIINQP*CLDVS 411
           D I   NL      R  S  W+ L +   D++     + K +  G++T   N+    D  
Sbjct: 170 DVIGKVNLGDIDVTRTGSCWWRDLCLLDSDVRWFSSAVGKRVGRGDSTMFWNEIWIGDQP 229

Query: 412 FAFK-PTFIDMGSISNQQVSSL--ILNNSW--------DVYNIQR*GGQAIVETISNFTL 558
              + P    M +  N+ + ++  ++N  W        + +  +       ++ I  F  
Sbjct: 230 LRQRFPRLFGMSTQQNEVICNMGSLVNGLWHWELQWRRNFFTWEEDQYNHFLDIIVQFA- 288

Query: 559 PSLPSTDCWFWNGNSVGLLITKCVYQVLFNHTIPATISWP----GWKILWKLRISAKIKI 726
           P++   D W W G+ V        Y ++ N  +  ++  P     +KILWK    +K+  
Sbjct: 289 PTVQQ-DRWLWLGDGVQGYTANSAYSLVVNKLVTPSVCDPINDLVFKILWKCGAPSKVSA 347

Query: 727 LGWKLIHHKLQTPDFLAKRGLLGAN--ICVLCKQNEESCKHLF 849
             W+L+  +LQT D L KR ++ A+   CV C   +ES  HLF
Sbjct: 348 FSWQLMLDRLQTKDNLMKRRIIQAHHGNCVFCNLAQESASHLF 390


Top