BLASTX nr result

ID: Rehmannia29_contig00024908 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00024908
         (1757 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP66486.1| Retrovirus-related Pol polyprotein from transposo...   723   0.0  
gb|PRQ60431.1| putative RNA-directed DNA polymerase [Rosa chinen...   703   0.0  
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   701   0.0  
gb|AAK29467.1| polyprotein-like [Solanum chilense]                    685   0.0  
dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum]           624   0.0  
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]         623   0.0  
gb|KYP31853.1| Retrovirus-related Pol polyprotein from transposo...   616   0.0  
gb|PNX95528.1| putative gag/pol polyprotein [Trifolium pratense]      610   0.0  
emb|CAN77602.1| hypothetical protein VITISV_024474 [Vitis vinifera]   592   0.0  
gb|KYP40337.1| Retrovirus-related Pol polyprotein from transposo...   566   0.0  
gb|OTG31811.1| putative retrovirus-related Pol polyprotein from ...   554   e-180
ref|XP_017609491.1| PREDICTED: retrovirus-related Pol polyprotei...   531   e-173
emb|CAN65406.1| hypothetical protein VITISV_030853 [Vitis vinifera]   471   e-152
ref|XP_015160015.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-r...   469   e-147
gb|KYP40338.1| Retrovirus-related Pol polyprotein from transposo...   442   e-142
dbj|GAU39612.1| hypothetical protein TSUD_276520 [Trifolium subt...   442   e-137
gb|OAE26943.1| hypothetical protein AXG93_4413s1270 [Marchantia ...   431   e-137
gb|OMO51796.1| Reverse transcriptase, RNA-dependent DNA polymera...   433   e-133
dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subt...   429   e-132
gb|KZV39824.1| hypothetical protein F511_27827 [Dorcoceras hygro...   386   e-127

>gb|KYP66486.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1321

 Score =  723 bits (1866), Expect = 0.0
 Identities = 352/591 (59%), Positives = 433/591 (73%), Gaps = 6/591 (1%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            LN  E +AS NLWH+RL H+SEKGL  L K+ LI   K   L+PC++C+FGK        
Sbjct: 406  LNAVENDASPNLWHRRLAHISEKGLQLLAKQSLIPQAKGDFLNPCDYCVFGKHHRVSFKK 465

Query: 181  XXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLF 360
                       VHSDVCGP+EVESLGGNKYF+TFIDDASRK WVY L+ K QVF+ F+ F
Sbjct: 466  SSNRKKNKLELVHSDVCGPMEVESLGGNKYFVTFIDDASRKTWVYLLQAKSQVFQCFQQF 525

Query: 361  HVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTI 540
            H MVERETG +LKC+R+DNGGEY SK F  YC  YGIRHEKTVP TPQHNG+AERMNRTI
Sbjct: 526  HAMVERETGKQLKCIRTDNGGEYISKEFKDYCSKYGIRHEKTVPGTPQHNGIAERMNRTI 585

Query: 541  MERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFG 720
            +E+VR ML MAKLPKPFWGEAV+ A YLINR PSVPL F++PE++W+GK+ SYSHL+VFG
Sbjct: 586  VEKVRCMLRMAKLPKPFWGEAVQTAVYLINRLPSVPLGFDIPERVWAGKEVSYSHLKVFG 645

Query: 721  CLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIED 900
            C A+ HV KE R KLD +  PC+F+GYG+EEFGY+LWDP+ K+++RSRDV+FHE +TIED
Sbjct: 646  CKAFMHVPKEQRSKLDDKAIPCVFVGYGNEEFGYKLWDPERKRIVRSRDVIFHEHETIED 705

Query: 901  IEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVP 1080
            ++    ++    G   +   P     DG    E                         V 
Sbjct: 706  LKGGEAAKPMEDGVNPTSHVPSECATDGRQTQE------PEHETEEPVFGDEESVDEEVV 759

Query: 1081 IPSESQNDGGSP-----QIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHK 1245
            +P    N+ G P     Q+ P++            Y  S+Y+L+ ++GEPESFQE  SHK
Sbjct: 760  VPDTEANEQGEPSYTSGQVEPQIRRSTRERQPSTKYPSSEYILIADEGEPESFQEVQSHK 819

Query: 1246 DKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVK 1425
            DK  W++AMQ+EM+SL++N+TYE+V+LPKG++ L+NKWVFKLKKDG  K+V+HKARLVVK
Sbjct: 820  DKGCWVKAMQEEMDSLKRNNTYELVQLPKGRRVLKNKWVFKLKKDGD-KLVRHKARLVVK 878

Query: 1426 GFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQP 1605
            GF QK+GIDF+EIFSPVVKM SIRVILGLVASM+LELEQ+DVKT+FLHGDL EEIYMEQP
Sbjct: 879  GFSQKQGIDFEEIFSPVVKMCSIRVILGLVASMDLELEQLDVKTAFLHGDLDEEIYMEQP 938

Query: 1606 EGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            EGFE+ G +++VCKLKKSLYGLKQAPRQWY KFDS + SQG+K+T+AD CV
Sbjct: 939  EGFEVKGKEDMVCKLKKSLYGLKQAPRQWYKKFDSFIKSQGFKRTDADPCV 989


>gb|PRQ60431.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1340

 Score =  703 bits (1815), Expect = 0.0
 Identities = 354/591 (59%), Positives = 420/591 (71%), Gaps = 6/591 (1%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            LN+A+ ++S +LWH+RLGHMSEKGL  L KK LI   K  +LD C++CLFGKQ       
Sbjct: 415  LNVAD-DSSPSLWHKRLGHMSEKGLQVLAKKSLIPFAKGTSLDSCDYCLFGKQHKVSFTR 473

Query: 181  XXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLF 360
                       VHSDVCGP+EVES+G NKYF+T+IDDASRK+WVY LKTKDQVF+ FK F
Sbjct: 474  TFTRKENVLDLVHSDVCGPMEVESVGHNKYFVTYIDDASRKVWVYLLKTKDQVFQTFKEF 533

Query: 361  HVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTI 540
            H MVERETG  LKC+RSDNGGEYTS  F  YC  +GI+H KT+P TPQHNGVAERMNRTI
Sbjct: 534  HAMVERETGRTLKCIRSDNGGEYTSNEFRDYCSKHGIKHVKTIPGTPQHNGVAERMNRTI 593

Query: 541  MERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFG 720
            +E+VRSML  A L K FWGEA+  ACYLINR+P VPL  E PE +W+G+  SYSHLRVFG
Sbjct: 594  LEKVRSMLKTANLTKKFWGEAMTTACYLINRTPCVPLGLETPEGVWTGRSASYSHLRVFG 653

Query: 721  CLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIED 900
            C A+AHV KE R KLD +  PCIF+GYG+EE GYRLW+PK KK+ RSRDVVFHE +TI D
Sbjct: 654  CKAFAHVPKEQRSKLDDKAMPCIFLGYGNEEMGYRLWNPKTKKLFRSRDVVFHEGQTIAD 713

Query: 901  IEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXX----L 1068
             +K  +     +    S    EP  +  D+V ED+P                       L
Sbjct: 714  FDKNEVEHADQLTYDSSPLQEEPSDKAQDMVNEDVPDMAEAEAAEPEDNDQGEPNQGEQL 773

Query: 1069 PNVPIPSESQNDGGSPQI-VPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHK 1245
             N  +P    N    P                   YS   Y+LLT DGEPE F+EA +H 
Sbjct: 774  ENAQVPVRRSNREPKPNTKYHSSQYILVTSDGDDPYSR--YILLTNDGEPECFEEAKTHA 831

Query: 1246 DKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVK 1425
            D +KW+ AM+ EMESL KN TYE+VELPKG+KAL+NKWVFKLK+D + ++ K KARLVVK
Sbjct: 832  DCDKWMLAMKSEMESLLKNDTYELVELPKGRKALKNKWVFKLKRDENEQLTKFKARLVVK 891

Query: 1426 GFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQP 1605
            GF QK+GIDFDEIFSPVVKMTSIR+ILG+ ASM+LE+EQ+DVKT+FLHGDL+EEIYMEQP
Sbjct: 892  GFGQKEGIDFDEIFSPVVKMTSIRIILGMAASMDLEVEQLDVKTAFLHGDLEEEIYMEQP 951

Query: 1606 EGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            EGFE+ G  ++VCKLKKSLYGLKQAPRQWY KFDS MV  GYK+T+AD CV
Sbjct: 952  EGFEVEGKQHMVCKLKKSLYGLKQAPRQWYKKFDSFMVGHGYKRTDADPCV 1002


>sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT
            1-94; Includes: RecName: Full=Protease; Includes:
            RecName: Full=Reverse transcriptase; Includes: RecName:
            Full=Endonuclease
 emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
          Length = 1328

 Score =  701 bits (1810), Expect = 0.0
 Identities = 357/601 (59%), Positives = 435/601 (72%), Gaps = 16/601 (2%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            LN A+ E S +LWH+R+GHMSEKGL  L KK LI+  K   + PC++CLFGKQ       
Sbjct: 413  LNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQT 472

Query: 181  XXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLF 360
                       V+SDVCGP+E+ES+GGNKYF+TFIDDASRKLWVY LKTKDQVF+ F+ F
Sbjct: 473  SSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKF 532

Query: 361  HVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTI 540
            H +VERETG KLK LRSDNGGEYTS+ F+ YC ++GIRHEKTVP TPQHNGVAERMNRTI
Sbjct: 533  HALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTI 592

Query: 541  MERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFG 720
            +E+VRSML MAKLPK FWGEAV+ ACYLINRSPSVPL FE+PE++W+ K+ SYSHL+VFG
Sbjct: 593  VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 652

Query: 721  CLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHES--KTI 894
            C A+AHV KE R KLD ++ PCIFIGYGDEEFGYRLWDP +KKVIRSRDVVF ES  +T 
Sbjct: 653  CRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTA 712

Query: 895  EDIEK-------------PTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXX 1035
             D+ +             P+ S         +D   E   + G+V+ +            
Sbjct: 713  ADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQ------------ 760

Query: 1036 XXXXXXXXXXLPNVPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEP 1215
                      +  V  P++     G  Q  P +            Y  ++Y+L+++D EP
Sbjct: 761  ---GEQLDEGVEEVEHPTQ-----GEEQHQP-LRRSERPRVESRRYPSTEYVLISDDREP 811

Query: 1216 ESFQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKV 1395
            ES +E +SH +K + ++AMQ+EMESLQKN TY++VELPKGK+ L+ KWVFKLKKDG  K+
Sbjct: 812  ESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKL 871

Query: 1396 VKHKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGD 1575
            V++KARLVVKGF+QKKGIDFDEIFSPVVKMTSIR IL L AS++LE+EQ+DVKT+FLHGD
Sbjct: 872  VRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGD 931

Query: 1576 LKEEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADEC 1752
            L+EEIYMEQPEGFE++G  ++VCKL KSLYGLKQAPRQWY KFDS M SQ Y KT +D C
Sbjct: 932  LEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPC 991

Query: 1753 V 1755
            V
Sbjct: 992  V 992


>gb|AAK29467.1| polyprotein-like [Solanum chilense]
          Length = 1328

 Score =  685 bits (1767), Expect = 0.0
 Identities = 339/588 (57%), Positives = 420/588 (71%), Gaps = 3/588 (0%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            LN A +E S +LWH+R+GH SEKGL  L KK LI+  K   + PCN+ LFGKQ       
Sbjct: 414  LNAAHEENSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRVSFQT 473

Query: 181  XXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLF 360
                       V+SDVCGP+E+ES+GGNKYF+TFIDDASRKLWVY  + KDQVF+ F+ F
Sbjct: 474  SSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQVFQKF 533

Query: 361  HVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTI 540
            H +VERETG K K LR+DNGGEYTS+ F+ YC  +GIRHEKTVP TPQHNGVAERMNRTI
Sbjct: 534  HALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAERMNRTI 593

Query: 541  MERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFG 720
            +E+VRSML MAKLPK FWGEAVR ACYLINRSPSVPL F++PE++W+ K+ SYSHL+VFG
Sbjct: 594  VEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSHLKVFG 653

Query: 721  CLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESK--TI 894
            C A+AHV KE R KLD ++ PCIFIGYGDEEFGYRLWD  +KKVIRSRDV+F ES+  T 
Sbjct: 654  CKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRESEVGTA 713

Query: 895  EDIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPN 1074
             D+ +    +   I   ++  +        +   +++                       
Sbjct: 714  ADLSEKAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIVEQGEQLGDNTEQ 773

Query: 1075 VPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKE 1254
            +  P E Q+          +            Y  S+Y+L+  +GEPE+ +E +SH +K 
Sbjct: 774  MEYPEEEQSQ--------PLRRSERQRVESTKYPSSEYVLIKYEGEPENLKEVLSHPEKS 825

Query: 1255 KWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQ 1434
            +W++AM +EM SLQKN TY++VELPKGK+ L+ KWVFKLKKDG+GK+V++KARLVVKGF+
Sbjct: 826  QWMKAMHEEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGKLVRYKARLVVKGFE 885

Query: 1435 QKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGF 1614
            QKKGIDFDEIFSPVVKMTSIR IL + AS++LE+EQ+DVKT+FLHGDL+EEIYMEQ EGF
Sbjct: 886  QKKGIDFDEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGF 945

Query: 1615 EISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            E+SG  ++VCKL KSLYGLKQAPRQWY KFDS M SQ Y+ T +  CV
Sbjct: 946  EVSGKKHMVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCV 993


>dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum]
          Length = 1338

 Score =  624 bits (1609), Expect = 0.0
 Identities = 318/600 (53%), Positives = 406/600 (67%), Gaps = 15/600 (2%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQ-QXXXXX 177
            +N+AE +++  LWH+RLGHMSEK ++ L+KK  +    +  L  C  CL GKQ +     
Sbjct: 411  INVAENDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKR 470

Query: 178  XXXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        VHSDVCGP + +SLGG +YF+TFIDD SRK WVY LKTKDQVF+ FK 
Sbjct: 471  FPPSRRQNVLDLVHSDVCGPFK-KSLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQ 529

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            F  +VERETG KLKC+R+DNGGEY  + FDAYCK +GIRH+ T P+TPQ NG+AERMNRT
Sbjct: 530  FLTLVERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRT 588

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++ER R +LS +KLPK FWGEA+  A Y++N SP VPL ++ PEK+W G+D SY  LRVF
Sbjct: 589  LIERTRCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVF 648

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC AY HV K+ R KLD +T  C+FIGYG +  GY+ +DP EKK++RSRDVVF E +TIE
Sbjct: 649  GCKAYVHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVEDQTIE 708

Query: 898  DIEK-------------PTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXX 1038
            DI+K             P       +G  + D  PE        +P +            
Sbjct: 709  DIDKVEKSTDDSAEFELPPTVVPRQVGDDVQDNQPE-----APGLPNEDELADTEGNEDN 763

Query: 1039 XXXXXXXXXLPNVPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPE 1218
                      P  PI +       S ++V +             YS  +Y+LLT+ GEP+
Sbjct: 764  GDDDADEEDQPQPPILNNPPYHTRSGRVVQQ----------STRYSPHEYVLLTDGGEPD 813

Query: 1219 SFQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVV 1398
            SF+EA+  + KEKW++AMQDE++SL +N T+E+V+LPKGK+AL+NKWVFK+K D    + 
Sbjct: 814  SFEEAIDDEHKEKWIEAMQDEIKSLHENKTFELVKLPKGKRALKNKWVFKMKHDEHNSLP 873

Query: 1399 KHKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDL 1578
            + KARLVVKGF Q+KGIDFDEIFSPVVKMTSIR +LGL AS+NLE+EQMDVKT+FLHGDL
Sbjct: 874  RFKARLVVKGFNQRKGIDFDEIFSPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDL 933

Query: 1579 KEEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            +EEIYMEQP+GF+  G ++ VC+L+KSLYGLKQAPRQWY KF+S M   GYKKT +D CV
Sbjct: 934  EEEIYMEQPDGFQQKGKEDYVCRLRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCV 993


>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
          Length = 1415

 Score =  623 bits (1607), Expect = 0.0
 Identities = 320/599 (53%), Positives = 401/599 (66%), Gaps = 14/599 (2%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            +N+ EKE +  LWH+RLGHMS KG+  L KK  ++  KEA LD C HCL GKQ+      
Sbjct: 408  VNVVEKECASELWHKRLGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMS 467

Query: 181  XXXXXXXXXXX-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        +HSDVCGP++V SLGG  YF+TFIDD SRKLWVY LK K  V   FK 
Sbjct: 468  HPPTRKSEPLDLIHSDVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKE 527

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            FH +VER+TG KLKC+R+DNGGEY    FD YC+ YGIRH+KT P+ PQ NG+AERMNRT
Sbjct: 528  FHALVERQTGKKLKCIRTDNGGEYCGP-FDEYCRRYGIRHQKTPPKIPQLNGLAERMNRT 586

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            IMERVR ML  AKLP  FW EAV  A ++IN SP + L  EVP+K+W GKD SY HLRVF
Sbjct: 587  IMERVRCMLDDAKLPSSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVF 646

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC A+ HV ++ R KLD++T  CIFIGYG +EFGYRL+DP EKK++RSRDVVF E++TIE
Sbjct: 647  GCKAFVHVPRDERSKLDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIE 706

Query: 898  DIEKPTMSQKSNIGAQISDAAP------------EPFVRDGDVVPEDIPXXXXXXXXXXX 1041
            DI+K    +  + G+ + D  P            +  V++GD VP+              
Sbjct: 707  DIDKVKQPESRDSGSLV-DIEPVSRRYTDDVDEVQENVQNGDPVPDYQGDTVDVDGHADD 765

Query: 1042 XXXXXXXXLPNVPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPES 1221
                       VP+    ++D                      YS S Y+LLT+ GEPES
Sbjct: 766  VVHQEQEVPSQVPVDLPRRSD--------------RERRPSTRYSPSQYVLLTDGGEPES 811

Query: 1222 FQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVK 1401
            ++EA+    K +W +AMQ+EM SL  N T+E+V+ PK +KAL+N+WV+++K +    V +
Sbjct: 812  YEEAMESDQKRQWFEAMQEEMNSLYVNDTFELVKAPKNRKALKNRWVYRVKHEEGTSVPR 871

Query: 1402 HKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLK 1581
             KARLVVKGF QKKGIDFDEIFSPVVK +SIRV+LGL A +++E+EQMDVKT+FLHGDL 
Sbjct: 872  FKARLVVKGFSQKKGIDFDEIFSPVVKFSSIRVVLGLAARLDIEIEQMDVKTAFLHGDLD 931

Query: 1582 EEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            EEIYMEQPEGF++ G ++ VC+LKKSLYGLKQAPRQWY KF S M   GYKKT++D CV
Sbjct: 932  EEIYMEQPEGFKVKGKEDYVCRLKKSLYGLKQAPRQWYKKFTSVMSKHGYKKTSSDHCV 990


>gb|KYP31853.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1314

 Score =  616 bits (1589), Expect = 0.0
 Identities = 319/588 (54%), Positives = 401/588 (68%), Gaps = 3/588 (0%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            +N+A+ +AS+ LWH+RLGHMSEKGL  L K  L N+ K   L+ C  CL GKQ       
Sbjct: 408  MNVAQ-DASKELWHRRLGHMSEKGLEILAKDHLPNI-KGQPLESCEDCLAGKQHRVSFRR 465

Query: 181  XXXXXXXXXXX--VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFK 354
                         VHSDVC   E  S+GG +YF+TFIDD SRK+WVY LKTKDQV + FK
Sbjct: 466  PDDARRRKHILDLVHSDVCSTSE-RSIGGAQYFVTFIDDHSRKVWVYPLKTKDQVLQAFK 524

Query: 355  LFHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNR 534
             FH +VER TG KLKC+R+DNGGEY    F+ YCKT+GIRHEK  P+TPQ NGVAERMNR
Sbjct: 525  EFHALVERATGRKLKCIRTDNGGEYLGP-FEYYCKTHGIRHEKVPPKTPQMNGVAERMNR 583

Query: 535  TIMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRV 714
            TI E+VRSMLS AK+PK FWGEAV  A  LIN SPS PLN E+PE++WSGK   Y HL+V
Sbjct: 584  TIAEKVRSMLSHAKIPKSFWGEAVLTAADLINLSPSRPLNGEIPEEVWSGKKAYYGHLKV 643

Query: 715  FGCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTI 894
            FGC A+ H+ K+ R KLDA+   CI++    +E G+RLWDP  KK++RSRDV+F E +TI
Sbjct: 644  FGCRAFVHIPKDERTKLDAKVKECIYLRSPKDELGFRLWDPVNKKIVRSRDVIFFEDQTI 703

Query: 895  EDIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPN 1074
            +DI+KP   ++     ++ D++P      G  + + I                       
Sbjct: 704  QDIKKPEKPKQK----EVQDSSPIVINNSGGEISQRIDEPNQTEQPESEQSQTM------ 753

Query: 1075 VPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKE 1254
                 E Q +    +  PEV            Y   DY+ LT++GEP+SF EA+   DKE
Sbjct: 754  ----QEEQMETNQEEQEPEVRRSMRVRQPSKRYFSDDYVNLTDEGEPQSFIEAIEMNDKE 809

Query: 1255 KWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQ 1434
            KWLQAM++E++SL++N TYE+VELP+G+KAL+NKWVFKLK + +    ++KAR+VVKG  
Sbjct: 810  KWLQAMEEELQSLKENETYELVELPQGRKALKNKWVFKLKTEENNTKPRYKARIVVKGCN 869

Query: 1435 QKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGF 1614
            QKKGIDF+EIFSPVVKMTSIR ILGL A ++LE+EQ+DVKT+FLHGDL+EEIYMEQPEGF
Sbjct: 870  QKKGIDFEEIFSPVVKMTSIRAILGLAAKLDLEIEQLDVKTAFLHGDLEEEIYMEQPEGF 929

Query: 1615 EISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
               G ++LVC+LKKSLYGLKQAPRQWY KFD  M    +KKT+AD+CV
Sbjct: 930  AEPGKEHLVCRLKKSLYGLKQAPRQWYKKFDLFMAQHNFKKTSADQCV 977


>gb|PNX95528.1| putative gag/pol polyprotein [Trifolium pratense]
          Length = 1339

 Score =  610 bits (1574), Expect = 0.0
 Identities = 305/597 (51%), Positives = 408/597 (68%), Gaps = 12/597 (2%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            +N  + ++S  LWH+RLGHMSEKGLS L KK +++   +A L  C+HCL GKQ+      
Sbjct: 417  INTCDNDSSSELWHKRLGHMSEKGLSILAKKNVLHGVSDAKLRKCSHCLAGKQRRVSFKS 476

Query: 181  XXXXXXXXXXX-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        VHSDVCGP++  SLGG  YF+TFIDD SRK W Y LKTKDQV + FK 
Sbjct: 477  SEPKRKSEVLDLVHSDVCGPMKTRSLGGAYYFVTFIDDYSRKTWAYTLKTKDQVLDTFKS 536

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            F   VERETG KLKC+R++NGGEY    FD YC+  GIRH+K+ P+TPQ NG+AERMNRT
Sbjct: 537  FQASVERETGKKLKCIRTNNGGEYVGP-FDKYCQDQGIRHQKSPPKTPQLNGLAERMNRT 595

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++ER+R +LS + LPK FWGEA+    +L+N +P VPL F+VP  +W+GK+ SY HLRVF
Sbjct: 596  LVERMRCLLSQSMLPKYFWGEALNTVVHLLNLTPCVPLKFDVPNHVWNGKEVSYDHLRVF 655

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC+AY H+ K+ R KLD ++  C+F+GYG +EFGYR +DP ++K+IRSRDVVF E  TIE
Sbjct: 656  GCMAYVHIPKDERSKLDEKSKKCVFVGYGLDEFGYRFFDPVQRKLIRSRDVVFMEDYTIE 715

Query: 898  DIEK-----PTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXX 1062
            DI+K     PT   +  +    +  AP P      ++ ED P                  
Sbjct: 716  DIDKVENKDPTFEDEEIVDIGSTTIAPTP------IIFEDAPIENQGMDLNVDNVLN--- 766

Query: 1063 XLPNVPIPSESQNDGGSPQIVPEV-----XXXXXXXXXXXXYSESDYLLLTEDGEPESFQ 1227
              PN  +   + ND    ++V EV                 Y  ++Y+ LT+ GEPE+F+
Sbjct: 767  --PNDVVDVGATNDVVENEVVQEVASNELRRSTRDKRPSVRYPSNEYVFLTDGGEPENFK 824

Query: 1228 EAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHK 1407
            E +  ++K++W+ AM+DEM+SL++N+T+E+V+LPKGK+AL+N+WV+++K+D      ++K
Sbjct: 825  EVLEDENKKEWMDAMEDEMQSLRENNTFELVKLPKGKRALKNRWVYRIKQDECTSQRRYK 884

Query: 1408 ARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEE 1587
            ARLVVKGF+Q++GIDF EIF+PVVKM SIR++LGL AS++LE+E+MDVKT+FLHGDL EE
Sbjct: 885  ARLVVKGFKQREGIDFGEIFAPVVKMQSIRMVLGLAASLDLEVEKMDVKTTFLHGDLHEE 944

Query: 1588 IYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            IYMEQP+GF   G ++ VCKL KSLYGLKQAPRQWY KF+S M+  GYK T AD CV
Sbjct: 945  IYMEQPDGFREKGKEDYVCKLVKSLYGLKQAPRQWYQKFNSVMIEHGYKMTKADHCV 1001


>emb|CAN77602.1| hypothetical protein VITISV_024474 [Vitis vinifera]
          Length = 1207

 Score =  592 bits (1526), Expect = 0.0
 Identities = 301/582 (51%), Positives = 400/582 (68%), Gaps = 19/582 (3%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQ-QXXXXX 177
            +N  + +++  LWH RLGHMSEKGL  L KK L++  K+ +L  C HCL GKQ +     
Sbjct: 293  INAVDDDSTFELWHNRLGHMSEKGLMILAKKNLLSGMKKGSLKRCAHCLAGKQTRVAFKT 352

Query: 178  XXXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        V+SDVCGP++ ++LGG+ YF+TFIDD SRK+WVY LKTKDQV + FK 
Sbjct: 353  LRYTRKPGMLDLVYSDVCGPMKTKTLGGSLYFVTFIDDHSRKIWVYTLKTKDQVLDVFKQ 412

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            FH +VER++G KLKC+R+DNGGEY+S  FD YC+ +GIRH+KT P+TPQ NG+AERMNRT
Sbjct: 413  FHALVERQSGEKLKCIRTDNGGEYSSP-FDEYCRQHGIRHQKTSPKTPQLNGLAERMNRT 471

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++ERVR +LS ++LP+ FWGEA+    +++N +P VPL F+VP+++WS  + SY HLRVF
Sbjct: 472  LVERVRCLLSQSQLPRSFWGEALNTVVHVLNLTPCVPLEFDVPDRIWSKNEISYDHLRVF 531

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC A+ H+ K+ R KLDA+T PC+FIGYG +E GYR +DP +KK++RSRDVVF E  TI+
Sbjct: 532  GCKAFVHIPKDERSKLDAKTRPCVFIGYGQDELGYRFYDPVQKKLVRSRDVVFMEDHTIQ 591

Query: 898  DIEK--PTMSQKS------------NIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXX 1035
            DIEK  P  SQ S            N+  Q+ D A +     GDV   + P         
Sbjct: 592  DIEKTNPMESQHSGDLIDLDPAPLTNLPTQVEDEAHDDQHDMGDV---ETPTQVEDEAHD 648

Query: 1036 XXXXXXXXXXLPNVPIPS---ESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTED 1206
                         V +     E      +P  +P +            YS  DY+LL + 
Sbjct: 649  DQHDMGDVETPTQVEVDDDVHEQSPAAEAPSDIP-LRRSTRDRHPSTRYSVDDYVLLIDG 707

Query: 1207 GEPESFQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGS 1386
            GEPES+ EA+  ++K KW+ AMQDEMESL +N ++E+V+LPKGK+AL+N+WV+++K++  
Sbjct: 708  GEPESYVEAMEDENKMKWVDAMQDEMESLHENHSFELVKLPKGKRALKNRWVYRVKQEEH 767

Query: 1387 GKVVKHKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFL 1566
                ++KARLVVKGF QKKGIDFDEIF PVVKM+SIRV+LGL AS++LE++QMDVKT+FL
Sbjct: 768  TSQPRYKARLVVKGFNQKKGIDFDEIFFPVVKMSSIRVVLGLAASLDLEIQQMDVKTAFL 827

Query: 1567 HGDLKEEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQW 1689
            HG+L +EIYMEQPEGF + G ++ VCKLKKSLYGLKQAPRQW
Sbjct: 828  HGNLDKEIYMEQPEGFVLKGKEDYVCKLKKSLYGLKQAPRQW 869


>gb|KYP40337.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1269

 Score =  566 bits (1459), Expect = 0.0
 Identities = 293/579 (50%), Positives = 369/579 (63%), Gaps = 5/579 (0%)
 Frame = +1

Query: 31   NLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXXXXXXXXXXXX 210
            NLWHQRLGHMSEKG+  +  K  +   +   ++ C  C+FGKQ+                
Sbjct: 389  NLWHQRLGHMSEKGMKIMHSKGKLPGLQSMEIEMCEDCIFGKQKRVSFQKGGRTPKKERL 448

Query: 211  X-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLFHVMVERETG 387
              VHSDV GP  V S+ G +YF+TFIDD SRK+WVYFLK K +VFE FK++  MVE ETG
Sbjct: 449  ELVHSDVWGPTTVSSISGKQYFVTFIDDHSRKVWVYFLKHKSEVFEAFKMWKAMVENETG 508

Query: 388  NKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTIMERVRSMLS 567
             K+K LR+DNGGEY    F  +C  +GIR E+TVP TPQ NGVAERMNRT+ ER RSM  
Sbjct: 509  LKIKKLRTDNGGEYEDTRFKRFCYEHGIRMERTVPGTPQQNGVAERMNRTLTERARSMRM 568

Query: 568  MAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFGCLAYAHVSK 747
             + LPK FW EAV  A YLINR PSVPL  ++PE++WSGK+   SHLRVFGC+AY H+S 
Sbjct: 569  QSGLPKQFWAEAVNTAAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISD 628

Query: 748  ELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIEDIEKPTMSQK 927
            + R KLD ++  C FIGYG++EFGYRLWD + KK+IRSRDV+F+E    +D +  + S  
Sbjct: 629  QGRNKLDPKSKKCTFIGYGEDEFGYRLWDDENKKMIRSRDVIFNEGVMYKDKQNTSASNS 688

Query: 928  SNIG---AQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVPIPSESQ 1098
              I     ++ DA   P V     V                                ES 
Sbjct: 689  KPIEPTYVEVDDALESPPVESSQSV--------------------------------ESI 716

Query: 1099 NDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKEKWLQAMQD 1278
                  Q VPE                 +Y+LLT+ GEPE + EA   +D  KW  AM++
Sbjct: 717  EPDRGQQCVPEPELRRSSRVPVPNRRYMNYMLLTDGGEPEDYSEACQTRDASKWELAMKE 776

Query: 1279 EMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQKKGIDFD 1458
            EM+SL  N T+E+ +LP GKKAL NKWV+++K++  G   ++KARLVVKGFQQK+G+D+ 
Sbjct: 777  EMKSLISNQTWELAKLPMGKKALHNKWVYRVKEEHDGS-KRYKARLVVKGFQQKEGVDYT 835

Query: 1459 EIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGFEISG-DNL 1635
            EIFSPVVK+ +IR +L +VAS  L LEQ+DVKT+FLHGDL EEIYM QPEGF   G +N+
Sbjct: 836  EIFSPVVKLNTIRTVLSIVASEELYLEQLDVKTAFLHGDLDEEIYMHQPEGFSEKGKENM 895

Query: 1636 VCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADEC 1752
            VC+LKKSLYGLKQAPRQWY KF+S M  +G+KK NAD C
Sbjct: 896  VCRLKKSLYGLKQAPRQWYRKFESFMHKEGFKKCNADHC 934


>gb|OTG31811.1| putative retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Helianthus annuus]
          Length = 1307

 Score =  554 bits (1427), Expect = e-180
 Identities = 295/578 (51%), Positives = 376/578 (65%), Gaps = 4/578 (0%)
 Frame = +1

Query: 31   NLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXXXXXXXXXXXX 210
            NLWH RLGHMSEKGL  L +K      K+   + C  C+ GKQ+                
Sbjct: 406  NLWHNRLGHMSEKGLKMLAQKGKFPDLKKVETEFCEPCVLGKQKRVTFVKTGRTPKAQKL 465

Query: 211  X-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLFHVMVERETG 387
              VHSDV GP  V SLGG++Y++TFIDD++RK+WVYFLK K  VF+ FK++   VE ET 
Sbjct: 466  ELVHSDVYGPTSVTSLGGSRYYVTFIDDSTRKVWVYFLKHKSGVFDAFKIWKSAVENETN 525

Query: 388  NKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTIMERVRSMLS 567
             K+KCL+SDNGGEY SK F  YC   GI+  +TVP TPQ NGVAERMNRT+ ER RSM  
Sbjct: 526  LKVKCLKSDNGGEYISKQFTDYCAKEGIKMIRTVPGTPQQNGVAERMNRTLNERARSMRL 585

Query: 568  MAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFGCLAYAHVSK 747
             A +PK FW +AV  A YLINRSPSVPL F++PE++W G + S  HLRVFGC AY  +  
Sbjct: 586  NAGMPKTFWADAVNTAAYLINRSPSVPLGFKLPEEMWQGNEVSLEHLRVFGCSAYDLLEV 645

Query: 748  ELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIEDIE-KPTMSQ 924
              R KLD+++  C FIGYG EE GYRLWD + +KVIRS++VVF+E++  +D   K    Q
Sbjct: 646  GDRDKLDSKSKKCTFIGYGSEEMGYRLWDNEGRKVIRSKNVVFNENELYKDRNTKGPEVQ 705

Query: 925  KSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVPIPSESQND 1104
            K  +  +  +   E  V        DIP                            + N 
Sbjct: 706  KEYVEFESGEKRKEASV--------DIPENGDESSDSGSLGDDSSNDSEEEEATPSTPNS 757

Query: 1105 GGSPQIVPEVXXXXXXXXXXXXYSES-DYLLLTEDGEPESFQEAVSHKDKEKWLQAMQDE 1281
              +PQ+ P              YS S +Y+LLTE+GEP+ + EA+  KD  +W  AM+DE
Sbjct: 758  PETPQVQPR--RSSRATRPPNRYSPSVNYILLTENGEPQCYSEAMGLKDSLQWELAMKDE 815

Query: 1282 MESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQKKGIDFDE 1461
            M+SL+KN T+ +++LP GKKAL+NKWVF++K +  G   ++KARLVVKGFQQK+GIDF+E
Sbjct: 816  MKSLEKNKTWHLIKLPPGKKALQNKWVFRVKDEHDG-AKRYKARLVVKGFQQKEGIDFNE 874

Query: 1462 IFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGFEISG-DNLV 1638
            IFSPVVKMT+IR++L +VA+  L LEQ+DVKT+FLHGDL+E+IYM QPEGF + G +NLV
Sbjct: 875  IFSPVVKMTTIRLVLSIVAAEGLHLEQLDVKTAFLHGDLEEDIYMTQPEGFRVKGKENLV 934

Query: 1639 CKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADEC 1752
            CKLKKSLYGLKQAPRQWY KFD+ M   GYK+ + D C
Sbjct: 935  CKLKKSLYGLKQAPRQWYLKFDNFMGRVGYKRCDNDHC 972


>ref|XP_017609491.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
            1-94 [Gossypium arboreum]
          Length = 1184

 Score =  531 bits (1369), Expect = e-173
 Identities = 282/587 (48%), Positives = 371/587 (63%), Gaps = 2/587 (0%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            +N+   + S  LWH+RL HMSEKGLS L KK  ++  K A L  C HCL GKQ+      
Sbjct: 303  VNVTLNDNSTELWHKRLSHMSEKGLSCLAKKNQLSGLKNATLKNCAHCLAGKQKRVSFRS 362

Query: 181  XXXXXXXXXXX-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        VHSDVCGP++V S GG  YF+TFIDD SRKLWVY LK+K+QVFE FK 
Sbjct: 363  HPPHKKSELLELVHSDVCGPIKVRSYGGALYFVTFIDDCSRKLWVYTLKSKNQVFEVFKQ 422

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            F   VERET  KLKC+R+DNGGEYT ++F  YC   GIRH++T P+TPQ NG+AERMNRT
Sbjct: 423  FQASVERETEKKLKCIRTDNGGEYT-RSFHEYCLRQGIRHQRTPPKTPQLNGLAERMNRT 481

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++ERVR +LS AKLP+ FW EA+    ++IN SPSVPL  +VP+++W  K  SY HLRVF
Sbjct: 482  LIERVRCLLSDAKLPRSFWAEALNTVTHVINLSPSVPLKGDVPDRVWFVKGVSYDHLRVF 541

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC  + H                                   KK++RSRDVVF E +TI+
Sbjct: 542  GCKVFVH-----------------------------------KKLVRSRDVVFIEDQTID 566

Query: 898  DIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNV 1077
            DI+K T    S     + D  P P     D + +D+                     P  
Sbjct: 567  DIDK-TEKVDSQGSGDLIDVNPVPLDSSPDPIQDDV-----HGDVSGDHQTIGDFATPID 620

Query: 1078 PIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKEK 1257
             + ++ Q    +P  VP +            YS  +Y+LLT+ GEP  ++EA+  + K++
Sbjct: 621  DVVNDQQQAPIAPPAVP-LRRSSRDRRSSVKYSPDEYVLLTDGGEPGCYEEAMESECKDQ 679

Query: 1258 WLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQ 1437
            W++AM+D+++SL +N T+E+V+LPKGK+AL+N+WV++LK++      ++KARLVVKG+ Q
Sbjct: 680  WVEAMKDKLQSLHENHTFELVKLPKGKRALKNRWVYRLKQEEKSSSPRYKARLVVKGYTQ 739

Query: 1438 KKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGFE 1617
            KKG+DF+EIFSPVVKM+SIR IL L A  +LE+EQMDVK +FLHGDL++E+YMEQPEGF 
Sbjct: 740  KKGVDFEEIFSPVVKMSSIRTILSLAACYDLEVEQMDVKIAFLHGDLEDELYMEQPEGFV 799

Query: 1618 IS-GDNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
                ++ VC+LKKSLYGLKQAPRQWY KF+S M  Q YKKT +D CV
Sbjct: 800  AQRKEDYVCRLKKSLYGLKQAPRQWYKKFESVMGEQSYKKTTSDHCV 846


>emb|CAN65406.1| hypothetical protein VITISV_030853 [Vitis vinifera]
          Length = 1017

 Score =  471 bits (1213), Expect = e-152
 Identities = 251/574 (43%), Positives = 348/574 (60%), Gaps = 13/574 (2%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQ-QXXXXX 177
            +N  + +++  LWH RL HMSEKGL  + KK L++  K+ +L  C HCL GKQ +     
Sbjct: 164  INAVDDDSTFKLWHNRLSHMSEKGLMIMAKKNLLSGMKKGSLKRCAHCLGGKQTRVAFKT 223

Query: 178  XXXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        V+SDVCGP++ ++LGG+ YF+TFIDD SRK+WVY LKTKDQV + FK 
Sbjct: 224  LHHTRKPGMLDLVYSDVCGPMKTKTLGGSLYFVTFIDDHSRKIWVYTLKTKDQVLDVFKQ 283

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            FH +VER++G KLKC++ DNGGEY+S  FD YC+ +GIRH+KT P+TPQ NG+AE MNRT
Sbjct: 284  FHALVERQSGEKLKCIQIDNGGEYSSP-FDEYCRQHGIRHQKTPPKTPQLNGLAESMNRT 342

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++ERVR +LS ++LP+ FWGEA+    +L+N +P VPL F+VP+++WS  +  Y HLRVF
Sbjct: 343  LVERVRCLLSQSQLPRSFWGEALNTVVHLLNLTPCVPLEFDVPDRIWSNNEICYDHLRVF 402

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC A+ H+ K+   KLDA+T PC+FIGYG +E GYR +DP +KK++RSRDVVF E  TI+
Sbjct: 403  GCKAFVHIPKDEISKLDAKTRPCVFIGYGHDELGYRFYDPMQKKLVRSRDVVFMEDHTIQ 462

Query: 898  DIEK--PTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLP 1071
            DIEK  P  SQ S                 GD++  D+                    + 
Sbjct: 463  DIEKTNPMESQHS-----------------GDLIDLDLAPLTNFPTQVEDEAHDDQHDMG 505

Query: 1072 NVPIPSESQND---------GGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESF 1224
            +V  P++ + D           +P  +P +            YS  DY+LLT+ GEPES+
Sbjct: 506  DVETPTQVEVDDDVHEQSPTAEAPLDIP-LRRSTRDRHLSTRYSVDDYVLLTDGGEPESY 564

Query: 1225 QEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKH 1404
             EA+  ++K KW+ A+Q+EMESL +N ++++V+LPKGK+AL+N+WV+++K++      ++
Sbjct: 565  VEAMEDENKMKWVDAIQNEMESLHENHSFKLVKLPKGKRALKNRWVYRVKQEEHTSQPRY 624

Query: 1405 KARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKE 1584
            KARLVVKGF QKK                                              +
Sbjct: 625  KARLVVKGFNQKK---------------------------------------------DK 639

Query: 1585 EIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPR 1683
            EIYMEQ EGF + G ++ V KLKKSLYGLKQAPR
Sbjct: 640  EIYMEQQEGFVLKGKEDYVSKLKKSLYGLKQAPR 673


>ref|XP_015160015.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-related Pol polyprotein
            from transposon TNT 1-94 [Solanum tuberosum]
          Length = 1417

 Score =  469 bits (1207), Expect = e-147
 Identities = 270/593 (45%), Positives = 360/593 (60%), Gaps = 8/593 (1%)
 Frame = +1

Query: 1    LNIAEKEASQNLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXX 180
            +N+ E + S +LWH+R+ HMSEKG+    KK L+   K++ L+ C HCL GKQ+      
Sbjct: 320  VNVVENDTSSSLWHRRISHMSEKGMDNFAKKNLLYGVKQSKLNKCVHCLAGKQKIVSFKS 379

Query: 181  XXXXXXXXXXX-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        VHSD+CGP +V+S G   YF+TFIDD S KL V+ LK+KDQV + FK 
Sbjct: 380  HLPSRKFDLLELVHSDLCGPFKVKSHGSALYFVTFIDDHSCKLXVFSLKSKDQVLDVFKN 439

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            F  +VER+TG KLKC+R DN GEY    FD Y +  G+RH+KT  +TPQ N +AERMN+T
Sbjct: 440  FQALVERQTGKKLKCIRFDNDGEYIGH-FDRYSREQGVRHQKTPLKTPQLNCLAERMNKT 498

Query: 538  IMERVRSMLSMAKL-PKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRV 714
            ++ERVR MLS AKL P  FW  ++    Y+IN SP+V LN +V + +WSGK+ SY +L+V
Sbjct: 499  LVERVRCMLSNAKLVPDSFWAXSLNTPAYVINLSPTVALNGDVLDIIWSGKNVSYDYLKV 558

Query: 715  FGCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTI 894
            FGC A+ H+ K                   DE               RSRD+VF E +TI
Sbjct: 559  FGCKAFVHIPK-------------------DE---------------RSRDIVFFEDQTI 584

Query: 895  EDIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPN 1074
            ED++K        + +Q S++     + D D VP  IP                      
Sbjct: 585  EDLDK-----VEKVDSQSSES-----LVDVDPVPLTIPPGENLQVDIEDDDHIQN---DQ 631

Query: 1075 VPIPSESQND--GGSPQIV--PE-VXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVS 1239
              I +  QND  G  P I+  PE              YS ++++ LT+ GEPES  EA+ 
Sbjct: 632  YVIDAPVQNDVVGEQPTIIDAPESSQXSTREKIPSYRYSPNEFVRLTDGGEPESLDEAME 691

Query: 1240 HKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLV 1419
             ++ E W  AM+DEM+ L  N T+++V LPK +KAL+N WVF++K +    + ++K RLV
Sbjct: 692  TEENEMWFDAMKDEMKXLYDNDTFDLVMLPKDRKALKNMWVFRVKHEDGNSIPRYKTRLV 751

Query: 1420 VKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYME 1599
            VKGF QKK IDFDEIFSPV+KM+SIRV+LGLVA+++LE+E MDVKT+FLHGDL EEIYME
Sbjct: 752  VKGFSQKKEIDFDEIFSPVMKMSSIRVVLGLVANLDLEVEXMDVKTAFLHGDLDEEIYME 811

Query: 1600 QPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            QPEGFE+ G +N VCKLKKSLY  +    Q + KF S M  QG+KKT+++ CV
Sbjct: 812  QPEGFEVKGKENYVCKLKKSLY--ESTGLQEFMKFGSFMSQQGFKKTSSNHCV 862


>gb|KYP40338.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 864

 Score =  442 bits (1138), Expect = e-142
 Identities = 232/492 (47%), Positives = 298/492 (60%), Gaps = 4/492 (0%)
 Frame = +1

Query: 31   NLWHQRLGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQQXXXXXXXXXXXXXXXX 210
            NLWHQRLGHMSEKG+  +  K  +   +   ++ C  C+FGKQ+                
Sbjct: 389  NLWHQRLGHMSEKGMKIMHSKGKLPGLQSMEIEMCEDCIFGKQKRVSFQKGGRTPKKERL 448

Query: 211  X-VHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLFHVMVERETG 387
              VHSDV GP  V S+ G +YF+TFIDD SRK+WVYFLK K +VFE FK++  MVE ETG
Sbjct: 449  ELVHSDVWGPTTVSSISGKQYFVTFIDDHSRKVWVYFLKHKSEVFEAFKMWKAMVENETG 508

Query: 388  NKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTIMERVRSMLS 567
             K+K LR+DNGGEY    F  +C  +GIR E+TVP TPQ NGVAERMNRT+ ER RSM  
Sbjct: 509  LKIKKLRTDNGGEYEDTRFKRFCYEHGIRMERTVPGTPQQNGVAERMNRTLTERARSMRV 568

Query: 568  MAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFGCLAYAHVSK 747
             + LPK FW E V  A YLINR PSVPL  ++PE++WSGK+   SHLRVFGC+AY H+S 
Sbjct: 569  QSGLPKQFWAETVNTAAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISD 628

Query: 748  ELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIEDIEKPTMSQK 927
            + R KLD ++  C FIGYG++EFGYRLWD + KK+IRSRDV+F+E    +D +  + S  
Sbjct: 629  QGRNKLDPKSKKCTFIGYGEDEFGYRLWDDENKKMIRSRDVIFNEGVMYKDKQNTSASNS 688

Query: 928  SNIG---AQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVPIPSESQ 1098
              I     ++ DA   P V     V                                ES 
Sbjct: 689  KPIEPTYVEVDDALESPPVESSQSV--------------------------------ESI 716

Query: 1099 NDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKEKWLQAMQD 1278
                  Q VPE                 +Y+LLT+ GEPE + EA   +D  KW  AM++
Sbjct: 717  EPDRGQQCVPEPELRRSSRVPVPNRRYMNYMLLTDGGEPEDYSEACQTRDASKWELAMKE 776

Query: 1279 EMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQKKGIDFD 1458
            EM+SL  N T+E+ +LP GKKAL NKWV+++K++  G   ++KARLVVKGFQQK+G+D+ 
Sbjct: 777  EMKSLISNQTWELAKLPMGKKALHNKWVYRVKEEHDGS-KRYKARLVVKGFQQKEGVDYT 835

Query: 1459 EIFSPVVKMTSI 1494
            EIFSPVVK+ +I
Sbjct: 836  EIFSPVVKLNTI 847


>dbj|GAU39612.1| hypothetical protein TSUD_276520 [Trifolium subterraneum]
          Length = 1403

 Score =  442 bits (1136), Expect = e-137
 Identities = 248/580 (42%), Positives = 344/580 (59%), Gaps = 11/580 (1%)
 Frame = +1

Query: 49   LGHMSEKGLSTLIKKELINVDKEAALDPCNHCLFGKQ-QXXXXXXXXXXXXXXXXXVHSD 225
            L H+SEKGL+ L KK+++   K A L+ C+HC+ GKQ +                 VH D
Sbjct: 247  LSHISEKGLNVLAKKDVLPGLKNADLEKCSHCMTGKQTRVSFKKHPPSRKLELLQLVHYD 306

Query: 226  VCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKLFHVMVERETGNKLKCL 405
            VCGPL+                                              +G  LKC+
Sbjct: 307  VCGPLK----------------------------------------------SGKNLKCV 320

Query: 406  RSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRTIMERVRSMLSMAKLPK 585
            RSDNGGEY    FD YCK  GI HEKT P+TPQ NG+AERMNRT++ERVR MLS AKLP+
Sbjct: 321  RSDNGGEYCGP-FDVYCKQQGIAHEKTPPKTPQLNGLAERMNRTLVERVRCMLSEAKLPQ 379

Query: 586  PFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFGCLAYAHVSKELRQKL 765
             +W EA+  A ++IN +P+V LN EVP+K+W  KDPS+                     +
Sbjct: 380  HYWDEALYTAVHVINLTPTVVLNSEVPDKIWM-KDPSW---------------------M 417

Query: 766  DARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIEDIEKPTMSQ-KSNIGA 942
             ++    IFIGYG +EFGY+ +DP  KK+IRSRDVVF + +TIEDI+K   +  K ++  
Sbjct: 418  QSQNNLSIFIGYGKDEFGYKFYDPLWKKLIRSRDVVFMKDQTIEDIDKVEKTTCKKDVTL 477

Query: 943  QISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVPIPSESQND------ 1104
               D+   P V + D +  D+                      N+P  ++ +ND      
Sbjct: 478  SNIDSVRLP-VHNLDTIGGDVQNGEPHEYVDDQQIGEEV----NIPANNDEENDMSHDDN 532

Query: 1105 -GGSPQIVP-EVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKEKWLQAMQD 1278
             G +P+    ++            Y+  +Y+ L ++GEP+ FQ+ +   + +KW+ AM +
Sbjct: 533  LGEAPESSQVQLRRSNKQRQPSTRYNSDEYVTLNDEGEPKYFQDIMESDENQKWMDAMNN 592

Query: 1279 EMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQKKGIDFD 1458
            EM+SL  N TY++VELPKG+KAL N+W++++K + +    + KARLVVKGF+Q+KG+DF+
Sbjct: 593  EMKSLHDNHTYDLVELPKGEKALENRWIYRVKHESNSGSPRCKARLVVKGFRQRKGVDFN 652

Query: 1459 EIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGFEISG-DNL 1635
            EIFSPVVKM+SIR +L L A+++LE+EQMDVKT+FLHGDL+EEIYM+QP+GF + G ++ 
Sbjct: 653  EIFSPVVKMSSIRTVLALAATLDLEVEQMDVKTTFLHGDLEEEIYMKQPDGFLVKGKEDY 712

Query: 1636 VCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKKTNADECV 1755
            VC+L+KSLYGLKQAPRQWY KF+S M  QGYKKT +D CV
Sbjct: 713  VCRLRKSLYGLKQAPRQWYKKFESVMSEQGYKKTTSDCCV 752


>gb|OAE26943.1| hypothetical protein AXG93_4413s1270 [Marchantia polymorpha subsp.
            ruderalis]
          Length = 922

 Score =  431 bits (1107), Expect = e-137
 Identities = 236/554 (42%), Positives = 324/554 (58%), Gaps = 13/554 (2%)
 Frame = +1

Query: 133  CNHCLFGKQQXXXXXXXXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWV 312
            C HC+ GKQ                  VHSDV GP  V SL G  Y+++FIDD SR +WV
Sbjct: 122  CEHCIMGKQHRKTFGVGTHSSKEILEYVHSDVWGPSPVASLSGKWYYVSFIDDYSRYVWV 181

Query: 313  YFLKTKDQVFEYFKLFHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVP 492
            YFL  K +VF  FK + + VE +TG+K+K LRSDNGGEYTS+ F  YC   GI    T  
Sbjct: 182  YFLTHKSEVFSTFKSWRIQVETQTGHKVKYLRSDNGGEYTSEEFQRYCTEEGITRHFTTV 241

Query: 493  RTPQHNGVAERMNRTIMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEK 672
             TPQ N V+ER+NRT++E+VRSMLS + LP  FW EAV  A YL+N SPS  +NF  P +
Sbjct: 242  YTPQQNVVSERLNRTVLEKVRSMLSESGLPGEFWAEAVNTAIYLVNLSPSSAINFSTPFE 301

Query: 673  LWSGKDPSYSHLRVFGCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKV 852
            L   +   YS L++FGC AY  + KE R KLD  +  C F+GY     GYRLWDP  +KV
Sbjct: 302  LRHKRMADYSRLKIFGCTAYPLIPKEQRTKLDPTSKKCRFLGYASGVKGYRLWDPVARKV 361

Query: 853  IRSRDVVFHESKTIEDIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXX 1032
            + SRDV F+E   +++ E    ++  ++ A I     +  + + D   E+ P        
Sbjct: 362  VVSRDVSFNEPDLLKERENVEANKGKSLLADIVVGKFDHSITN-DQTHEEAPIHVEQVLE 420

Query: 1033 XXXXXXXXXXXLPNVPIPSE----------SQNDGGSPQIVPEVXXXXXXXXXXXXYSES 1182
                        P   IP E          SQ    +P+                 + + 
Sbjct: 421  EQELQEQAIVGEPIATIPDERDQTETSTRRSQRSSRAPERFGVWANSSILKDRDLDFEDE 480

Query: 1183 DYL-LLTEDGEPESFQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKW 1359
            D + L+ E+GEP S++EA +  +K +W  AM+ EM+SL  N T+E+VELPK +  + +KW
Sbjct: 481  DGMALILEEGEPSSYREAQASVNKLEWNAAMEREMQSLIDNKTWELVELPKNQTVIDSKW 540

Query: 1360 VFKLKKDGSGKVVK-HKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLEL 1536
            V+KLK + +G   +  KARLV +GF Q+KG+D++E+F PV K  +IR++  L A+ +L +
Sbjct: 541  VYKLKDNPAGDEARIFKARLVARGFTQEKGVDYNEVFLPVAKYATIRLVCALAATFSLVM 600

Query: 1537 EQMDVKTSFLHGDLKEEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAPRQWYTKFDSCM 1713
            +QMDV T+FL+G L+EEIYM QP GFE+ G +  VC+L KSLYGLKQAPRQW T+FD  M
Sbjct: 601  DQMDVVTAFLYGYLEEEIYMRQPIGFEVKGQERWVCRLLKSLYGLKQAPRQWNTRFDEFM 660

Query: 1714 VSQGYKKTNADECV 1755
             +QG+ ++  D CV
Sbjct: 661  KAQGFLRSVYDPCV 674


>gb|OMO51796.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus
            capsularis]
          Length = 1500

 Score =  433 bits (1114), Expect = e-133
 Identities = 217/385 (56%), Positives = 275/385 (71%), Gaps = 1/385 (0%)
 Frame = +1

Query: 604  VRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVFGCLAYAHVSKELRQKLDARTTP 783
            V+VACYLINRSPS PL F++PEK+WS K+PSY+HL+VFGC A+ HV KE R KLD++ TP
Sbjct: 726  VKVACYLINRSPSAPLGFDIPEKVWSDKNPSYAHLKVFGCKAFLHVPKEQRSKLDSKVTP 785

Query: 784  CIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIEDIEKPTMSQKSNIGAQISDAAP 963
            CIF+GYG EEFGYR WDP++K ++RSRDVVFHE +TI D EK     K      + D   
Sbjct: 786  CIFVGYGGEEFGYRFWDPEKKNIVRSRDVVFHEHETIADFEK-----KEKTSRVVHD--- 837

Query: 964  EPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNVPIPSESQNDGGSPQIVPEVXXX 1143
                 D D+ P  +P                       P+P  ++         P++   
Sbjct: 838  -----DDDLTPTTVPPRRATDGGDEQDAVGIEQG-EQPPLPENNE---------PQLRRS 882

Query: 1144 XXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHKDKEKWLQAMQDEMESLQKNSTYEIVE 1323
                     Y  S+++LLT+DGEPESF++  S  DK++WL+AMQ+EM+SLQKN TYE+VE
Sbjct: 883  ARGNIPSTKYPSSEFVLLTDDGEPESFRDVQSTSDKQRWLEAMQEEMDSLQKNGTYELVE 942

Query: 1324 LPKGKKALRNKWVFKLKKDGSGKVVKHKARLVVKGFQQKKGIDFDEIFSPVVKMTSIRVI 1503
            LPKGK+ L+NKWVFKLKKDG+ K+V++K  LVVKGF QK  IDFDEIFSPVVKM+SIRV+
Sbjct: 943  LPKGKRPLKNKWVFKLKKDGN-KLVRYKTCLVVKGFAQKACIDFDEIFSPVVKMSSIRVV 1001

Query: 1504 LGLVASMNLELEQMDVKTSFLHGDLKEEIYMEQPEGFEISG-DNLVCKLKKSLYGLKQAP 1680
            LGL AS+NLE+EQ+DVKT+FLHGDL+EEIYM+QPEGF++ G +++VC+LKKSLYGLKQA 
Sbjct: 1002 LGLAASLNLEIEQLDVKTAFLHGDLQEEIYMDQPEGFKVKGKEHMVCRLKKSLYGLKQAL 1061

Query: 1681 RQWYTKFDSCMVSQGYKKTNADECV 1755
            RQWY KFDS MVS  +K+T  D CV
Sbjct: 1062 RQWYKKFDSFMVSHLFKRTATDPCV 1086


>dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subterraneum]
          Length = 1421

 Score =  429 bits (1104), Expect = e-132
 Identities = 225/585 (38%), Positives = 332/585 (56%), Gaps = 12/585 (2%)
 Frame = +1

Query: 16   KEASQNLWHQRLGHMSEKGLSTLIKKELIN-----VDKEAALDPCNHCLFGKQ-QXXXXX 177
            K  +  LWH R  H+S KGL+TL+KKE++       D E   D C  CL GKQ +     
Sbjct: 420  KMDNNELWHCRYDHLSFKGLNTLVKKEMVKGLPHLQDME---DTCVSCLTGKQHREAIPK 476

Query: 178  XXXXXXXXXXXXVHSDVCGPLEVESLGGNKYFLTFIDDASRKLWVYFLKTKDQVFEYFKL 357
                        VHSD+C P+  +S GGN+YF+TF DD SRK W Y L  K   F+ FK 
Sbjct: 477  SSDWRATRPLELVHSDICEPITPQSNGGNRYFITFTDDFSRKTWTYLLANKACAFDEFKK 536

Query: 358  FHVMVERETGNKLKCLRSDNGGEYTSKAFDAYCKTYGIRHEKTVPRTPQHNGVAERMNRT 537
            F  +VE+E   ++ CLR+D GGE+TS AF+ YC T+GI+ + T   TPQ NGV+ER NRT
Sbjct: 537  FKTLVEKEPNTQIMCLRTDRGGEFTSNAFNEYCSTHGIKRQLTTAYTPQQNGVSERKNRT 596

Query: 538  IMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSHLRVF 717
            ++  VRSML+   +P+ FW EA++ A Y+INR+P++ +    PE+ WSG  P   H RVF
Sbjct: 597  LLNMVRSMLAGRSVPETFWPEALKWATYVINRTPTLSVKDMTPEEAWSGSKPVVQHFRVF 656

Query: 718  GCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHESKTIE 897
            GC+A+AH+    R+KLD+++  CI +G  +E  GY+L+DP  K++I SRDV+F ESK   
Sbjct: 657  GCVAFAHIPDSQRKKLDSKSIQCILLGLSEESKGYKLYDPVSKRIIVSRDVIFDESKGWN 716

Query: 898  DIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXXLPNV 1077
            D +K      ++ G  +     E  + +    PE+ P                       
Sbjct: 717  DDKKQAAKSNNDEGTSLITEDTEIDINE----PENFPLNEHITDQQVNEGNTDSDQHVEE 772

Query: 1078 PIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTE------DGEPESFQEAVS 1239
                  +    S ++ P V              E D  L+          +P ++ EAV 
Sbjct: 773  RSSEVDEESTDSDELPPRVVRKPGYLSDYVSGEEIDESLMQNLAMSGTSDDPVTYDEAVK 832

Query: 1240 HKDKEKWLQAMQDEMESLQKNSTYEIVELPKGKKALRNKWVFKLKKDGSGKVVKHKARLV 1419
                + W QAM  E++++++N T+E+V LP G K +  KW++K K +  G++ K+KARLV
Sbjct: 833  ---CDTWKQAMDQEIDAIERNDTWELVTLPNGAKKIGVKWIYKTKYNEKGEIEKYKARLV 889

Query: 1420 VKGFQQKKGIDFDEIFSPVVKMTSIRVILGLVASMNLELEQMDVKTSFLHGDLKEEIYME 1599
             KG+ Q+ GID++E+F+PV +  +IR++L L AS N  + Q+DVK++FLHG+L E +Y+ 
Sbjct: 890  AKGYSQQYGIDYNEVFAPVARWDTIRLVLSLAASQNWSVHQLDVKSAFLHGELNENVYVA 949

Query: 1600 QPEGFEISGDNLVCKLKKSLYGLKQAPRQWYTKFDSCMVSQGYKK 1734
            Q  G++  G + + KLKK+LYGLKQAPR W  K +S    + ++K
Sbjct: 950  QTLGYQKGGSDKIYKLKKALYGLKQAPRAWDNKIESYFTHEKFEK 994


>gb|KZV39824.1| hypothetical protein F511_27827 [Dorcoceras hygrometricum]
          Length = 325

 Score =  386 bits (992), Expect = e-127
 Identities = 193/258 (74%), Positives = 208/258 (80%)
 Frame = +1

Query: 526  MNRTIMERVRSMLSMAKLPKPFWGEAVRVACYLINRSPSVPLNFEVPEKLWSGKDPSYSH 705
            MNRTIMERVRSMLSMAKLPKPFWGEAVR ACYLINRSPSVPLNFEVPEKLWSGKDPSYSH
Sbjct: 1    MNRTIMERVRSMLSMAKLPKPFWGEAVRAACYLINRSPSVPLNFEVPEKLWSGKDPSYSH 60

Query: 706  LRVFGCLAYAHVSKELRQKLDARTTPCIFIGYGDEEFGYRLWDPKEKKVIRSRDVVFHES 885
            LRVFGCLAYAHVSKELRQKLDARTTPCIF+GYGDEEFGYRLWDPKEKKVIRSRDVVFHES
Sbjct: 61   LRVFGCLAYAHVSKELRQKLDARTTPCIFVGYGDEEFGYRLWDPKEKKVIRSRDVVFHES 120

Query: 886  KTIEDIEKPTMSQKSNIGAQISDAAPEPFVRDGDVVPEDIPXXXXXXXXXXXXXXXXXXX 1065
            +TIEDIEKPTMSQKSN+GAQ SD A  PF R+ ++VPED+P                   
Sbjct: 121  QTIEDIEKPTMSQKSNVGAQSSDVALIPFTRNDEIVPEDMPEAEAEEEGGVEQGEPQPAP 180

Query: 1066 LPNVPIPSESQNDGGSPQIVPEVXXXXXXXXXXXXYSESDYLLLTEDGEPESFQEAVSHK 1245
             P VP PS+S +DG S Q +PEV            Y ES+YLLLTEDGEPESFQE +SHK
Sbjct: 181  -PVVPGPSQSPDDGESSQSIPEVRRSERGRIPSRRYLESEYLLLTEDGEPESFQETLSHK 239

Query: 1246 DKEKWLQAMQDEMESLQK 1299
            DK+KWL AMQDEME + +
Sbjct: 240  DKDKWLLAMQDEMEVMDR 257


Top