BLASTX nr result

ID: Rehmannia31_contig00013155 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00013155
         (1704 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OMO58913.1| reverse transcriptase [Corchorus capsularis]           247   e-110
ref|XP_021603183.1| uncharacterized protein LOC110608278, partia...   251   e-109
ref|XP_021600697.1| LOW QUALITY PROTEIN: uncharacterized protein...   245   e-107
gb|PKU70170.1| RNA-directed DNA polymerase [Dendrobium catenatum]     234   e-105
ref|XP_021607593.1| uncharacterized protein LOC110611516 [Maniho...   255   8e-99
gb|KZV22344.1| hypothetical protein F511_20441 [Dorcoceras hygro...   243   9e-98
ref|XP_023520277.1| LOW QUALITY PROTEIN: uncharacterized protein...   233   1e-96
gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notab...   233   1e-95
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    230   1e-94
ref|XP_022933231.1| LOW QUALITY PROTEIN: uncharacterized protein...   219   2e-92
gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo s...   228   3e-91
ref|XP_023745456.1| uncharacterized protein LOC111893627 [Lactuc...   213   6e-91
ref|XP_024172019.1| uncharacterized protein LOC112178032 [Rosa c...   235   6e-91
ref|XP_015078330.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   200   7e-91
gb|AAX95749.1| Retrotransposon gag protein, putative [Oryza sati...   241   4e-90
ref|XP_024032296.1| uncharacterized protein LOC112094830 [Morus ...   219   1e-89
gb|KZV45872.1| hypothetical protein F511_35060, partial [Dorcoce...   221   4e-89
ref|XP_019071093.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   205   2e-88
gb|ABB46832.2| retrotransposon protein, putative, Ty3-gypsy subc...   242   2e-88
gb|AAK92604.1|AC078944_15 Putative retroelement [Oryza sativa Ja...   242   2e-88

>gb|OMO58913.1| reverse transcriptase [Corchorus capsularis]
          Length = 1477

 Score =  247 bits (630), Expect(2) = e-110
 Identities = 115/170 (67%), Positives = 141/170 (82%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+PVL  KK +G+ RLCIDYR LN VT++N+YPLP ID+LFDQL+GA ++SK+DL+ GY+
Sbjct: 637  GAPVLFVKKNDGSMRLCIDYRELNKVTVRNRYPLPHIDDLFDQLKGAQVFSKIDLRSGYH 696

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            QLKIK E +PK+AF+TRYG YEF+VMP GLTNAP  FMDLM+RVF  YLDKFV++FIDDI
Sbjct: 697  QLKIKVEDVPKSAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFKDYLDKFVVVFIDDI 756

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            +V+SKS E H  HL +VL+ LRE KLYAKF KCEFWL+ V FLGH+ S D
Sbjct: 757  LVYSKSMEEHGEHLRLVLQILREKKLYAKFKKCEFWLDSVAFLGHVVSKD 806



 Score =  184 bits (467), Expect(2) = e-110
 Identities = 107/328 (32%), Positives = 167/328 (50%), Gaps = 6/328 (1%)
 Frame = +3

Query: 168  CSHCRKPGHTPDQCWRKQ---GKCLKCGSDQHQLCDCPMISTPEN--KPAPPYKSGNGPN 332
            C  C + GH    C R +   G C  CG   H+  +CP      +  + + P  + NG N
Sbjct: 319  CLICGEMGHRAASCSRARPTIGFCYNCGQKGHKSFECPQPKKGASVGQTSTPAAAKNG-N 377

Query: 333  NRARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTCM 512
             + +   RV++L  Q+   +  VV G + VS      LFD G+ H FV+P FV KL   +
Sbjct: 378  QKPKVQGRVYSLTQQDAQASNTVVTGMVLVSSVYALTLFDTGASHLFVSPAFVEKLGVIV 437

Query: 513  EHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGHF 692
            E +     + TP G   + + +  +C + IE V   A+L++L +  +D+ILGM  L  ++
Sbjct: 438  EPLDFEFVIDTPTGGDVLVNQVCKSCIVVIEGVSLPADLVVLDMHGFDVILGMGWLDKYY 497

Query: 693  AQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPKD 872
            A  DCH K +             G    S P I+S ++AR+ + +G  G+L  + N    
Sbjct: 498  AILDCHRKRIDFRIPDFEEFSFVGSPAKSPPRIVSMLQARRLLKSGCLGFLVSVQNNLDG 557

Query: 873  E-T*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXXXX 1049
            E   ++ +PIV ++ +VF ++L  +PPDR+V+F+I++ PGT  I++TPYRMA        
Sbjct: 558  ELPSLNSIPIVQDFSDVFPEDLHGLPPDREVEFSIDLIPGTTPISKTPYRMAPTELKELK 617

Query: 1050 XXXXXXMEVGFIRPSTSPWEAQCFCQKK 1133
                  ++ GFIRPS SPW A     KK
Sbjct: 618  EQLQELLDNGFIRPSVSPWGAPVLFVKK 645


>ref|XP_021603183.1| uncharacterized protein LOC110608278, partial [Manihot esculenta]
          Length = 879

 Score =  251 bits (642), Expect(2) = e-109
 Identities = 113/170 (66%), Positives = 147/170 (86%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+PVL  KK +GT RLCIDYR LN VTIKNKYPLPRID+LFDQL+GA+++SK+DL+ GY+
Sbjct: 482  GAPVLFVKKNDGTLRLCIDYRQLNKVTIKNKYPLPRIDDLFDQLKGASVFSKIDLRSGYH 541

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            QLKIK+  + KTAF+TRYG YEF+VMP GLTNAP  FMDLM+R+F PYLD+F+++FIDDI
Sbjct: 542  QLKIKESDVSKTAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRIFHPYLDQFLVVFIDDI 601

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            +++SK++E H+ HL +VL+ LRE +LYAK SKCEFWLN+++FLGH+ S +
Sbjct: 602  LIYSKTKEEHDQHLRIVLQTLREKQLYAKLSKCEFWLNDISFLGHVVSAE 651



 Score =  174 bits (440), Expect(2) = e-109
 Identities = 105/330 (31%), Positives = 156/330 (47%), Gaps = 24/330 (7%)
 Frame = +3

Query: 216  KQGKCLKCGSDQHQLCDCPM----ISTPENKPAPPYKSGNGPNN---------------- 335
            ++   L CGS +H + DCP      + P  +P P    G G                   
Sbjct: 161  QESSLLWCGSTEHLMRDCPRGQVSSAPPIERPIPAGSRGRGRGRGNQTGAASASQRMSEI 220

Query: 336  ----RARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLH 503
                  R PAR +A+  +E  ++ DV+ GT  + D     L DPGS HS++    +++  
Sbjct: 221  VDRPNFRTPARAYAISAKEDRDSPDVIVGTFSIFDKPVHALIDPGSTHSYICLPIINEGK 280

Query: 504  TCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LS 683
               + +   + V+ P+G S I   +   C I I    F  +LI LP  E+D+ILGMD LS
Sbjct: 281  LQADSLNQDIIVTNPLGHSVIVSKVYRDCPISIHGQTFHGDLIELPFREFDVILGMDWLS 340

Query: 684  GHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINK 863
             H    DC +K + L  L    V + G     +  IISA  AR+ I+ G   YL   +  
Sbjct: 341  KHRVIVDCRSKRIILKTLADHDVVVVGERSDYLSNIISATTARRLISKGCEAYLVCALET 400

Query: 864  PKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXX 1043
             K+   + ++  V ++ +VF  +L  +PP+R+V+FAI++ PGTA I+  PYRMA      
Sbjct: 401  KKENPSMRDISTVCDFSDVFPDDLPRLPPEREVEFAIDVIPGTAPISIVPYRMAPTELKE 460

Query: 1044 XXXXXXXXMEVGFIRPSTSPWEAQCFCQKK 1133
                    ++ GFIRPS SPW A     KK
Sbjct: 461  LKIQLQELLDKGFIRPSISPWGAPVLFVKK 490


>ref|XP_021600697.1| LOW QUALITY PROTEIN: uncharacterized protein LOC110606254 [Manihot
            esculenta]
          Length = 1723

 Score =  245 bits (625), Expect(2) = e-107
 Identities = 111/169 (65%), Positives = 146/169 (86%)
 Frame = +2

Query: 1109 SPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQ 1288
            +PVL  KKK+GT RLCIDYR LN VTIKNKYPLPRID+LF+QL+GA ++SK+DL+ GY+Q
Sbjct: 997  APVLFVKKKDGTLRLCIDYRQLNKVTIKNKYPLPRIDDLFNQLKGAIIFSKIDLRSGYHQ 1056

Query: 1289 LKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDII 1468
            LKIK+  + KTAF+TRYG YEF+VMP GLTNAP  FMDLM+R+F PYLD+FV++FIDDI+
Sbjct: 1057 LKIKEIDVSKTAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRIFHPYLDQFVVVFIDDIL 1116

Query: 1469 VHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            ++SK+++ ++ HL +VL+ LRE +LYAK SKCEFWLN+++FLGH+ S +
Sbjct: 1117 IYSKTKKEYDQHLRIVLQTLREKQLYAKISKCEFWLNDISFLGHVVSAE 1165



 Score =  176 bits (445), Expect(2) = e-107
 Identities = 105/329 (31%), Positives = 159/329 (48%), Gaps = 24/329 (7%)
 Frame = +3

Query: 222  GKCLKCGSDQHQLCDCPM----ISTPENKPAPPYKSGNG-----------PNNRA----- 341
            G C +CGS +H + DCP      +TP  +P P    G G            N R      
Sbjct: 677  GACFRCGSTEHLMRDCPRGQVSSATPVERPIPAGSRGRGRGRGNQTGAALANQRVSEIVD 736

Query: 342  ----RAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTC 509
                R PAR +A+  +E  ++  V+     + D     L DPGS +S++    V++    
Sbjct: 737  RPDFRTPARAYAIRAKEDRDSPVVIVDIFSIFDKPVHALIDPGSTYSYICLPIVNEGKLQ 796

Query: 510  MEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGH 689
             + +   + V+ P+G S I   +   C I I    F  +LI LP  E+D+ILGMD LS H
Sbjct: 797  ADSLNQDIIVTNPLGHSVIVSKVYRDCPISIHGHIFHGDLIELPFREFDVILGMDWLSRH 856

Query: 690  FAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPK 869
                DC ++ + L  L    V + G     +  IIS   AR+ I+ G   YL  ++   K
Sbjct: 857  RVIVDCRSRRITLKTLVDHDVVVVGERSDYLSNIISTATARRLISKGCEAYLVCVLETKK 916

Query: 870  DET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXXXX 1049
            +   +H++  V ++ +VF  +L  +PP+R+V+F+I++ PGTA I+  PYRMA        
Sbjct: 917  ENPCVHDISTVCDFSDVFPDDLPRLPPEREVEFSIDVIPGTAPISIAPYRMAPTKLKELK 976

Query: 1050 XXXXXXMEVGFIRPSTSPWEAQCFCQKKK 1136
                  ++ GFIRPS SPW+A     KKK
Sbjct: 977  IQLQELLDKGFIRPSISPWDAPVLFVKKK 1005


>gb|PKU70170.1| RNA-directed DNA polymerase [Dendrobium catenatum]
          Length = 617

 Score =  234 bits (597), Expect(2) = e-105
 Identities = 105/167 (62%), Positives = 142/167 (85%)
 Frame = +2

Query: 1109 SPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQ 1288
            +PVL +KKK+G+ RLCIDYR LN VT+KNKYPLPRIDELFDQL GA+++S++DL+ GY+Q
Sbjct: 299  APVLFAKKKDGSLRLCIDYRELNKVTVKNKYPLPRIDELFDQLAGASVFSRIDLRSGYHQ 358

Query: 1289 LKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDII 1468
            +++K+  + KTAF  RYG YEF+VMP GLTNAP +FM +M+R+F PYLD+FV++FIDDI+
Sbjct: 359  VRVKECDVEKTAFGIRYGHYEFLVMPFGLTNAPAIFMYMMNRIFRPYLDQFVIVFIDDIL 418

Query: 1469 VHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            ++SK+ + H  HL +VL  LR+HKL+AK+SKCEFWL +++FLGHI S
Sbjct: 419  IYSKNDKDHADHLKIVLNLLRQHKLFAKYSKCEFWLKKISFLGHIVS 465



 Score =  179 bits (454), Expect(2) = e-105
 Identities = 114/308 (37%), Positives = 165/308 (53%), Gaps = 3/308 (0%)
 Frame = +3

Query: 222  GKCLKCGSDQHQLCDCPMISTPE-NKPAPPYKSGNGPNNRARAPARVFALVGQEGPNATD 398
            GKC  CG   H   +CP    P+ ++PA   K     +    APARVF L  Q+  +++D
Sbjct: 4    GKCYLCGESGHIKRNCPKGVKPQMHQPALKIKGETSSDADRSAPARVFTLSSQDAKDSSD 63

Query: 399  VVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSI 578
            VV GTL +SD   ++LFD G+ HSF++  F  K++        GL V  P G + +  S 
Sbjct: 64   VVTGTLNISDLSARVLFDSGASHSFISEIFCGKINVEPVSFTPGLHVRLPAG-NYLNAST 122

Query: 579  DNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVEL 758
              + +  I    F A+LI+LP+ E+DIIL MD L  +FA  +C  K V+    G+ V   
Sbjct: 123  ICSIDFSIAGREFQADLIVLPLVEFDIILAMDWLIKYFATIECWKKKVRFDLPGEEVFYF 182

Query: 759  SGRARVSMPPIISAIKARKAITNGAHGYLAFI--INKPKDET*IHEVPIVSEYLNVFSQE 932
                 V +  +IS  +  K++  G   +LA I  +N+  D+  I  +PIV+E+++VFS+E
Sbjct: 183  QCERGV-ISSLISCNQMHKSLKFGELVFLAKIEAMNEKVDD--ISSIPIVNEFIDVFSEE 239

Query: 933  LTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXXXXXXXXXXMEVGFIRPSTSPWEA 1112
               +PPDR+V+F I + PGT  I + PYRMA              +E GFIRPS+SPW A
Sbjct: 240  WNELPPDREVEFLINLQPGTTPIVKAPYRMATKELQELKDQLAELIEKGFIRPSSSPWSA 299

Query: 1113 QCFCQKKK 1136
                 KKK
Sbjct: 300  PVLFAKKK 307


>ref|XP_021607593.1| uncharacterized protein LOC110611516 [Manihot esculenta]
          Length = 1251

 Score =  255 bits (652), Expect(2) = 8e-99
 Identities = 113/173 (65%), Positives = 150/173 (86%)
 Frame = +2

Query: 1097 VTMGSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQ 1276
            + +G+PVL  KKK+G+ RLC+DYR LN VT+KNKYPLPRID+LFDQL+GA+++SK+DL+ 
Sbjct: 872  IEVGAPVLFVKKKDGSLRLCVDYRQLNKVTVKNKYPLPRIDDLFDQLKGASVFSKIDLRS 931

Query: 1277 GYYQLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFI 1456
            GYYQL++K   IPKTAF+TRYG Y+F+VMP GLTNAP  FMDLM+ +F P+LD+FV++FI
Sbjct: 932  GYYQLRVKDADIPKTAFRTRYGHYKFLVMPFGLTNAPAAFMDLMNHIFHPFLDQFVLVFI 991

Query: 1457 DDIIVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            DDI+V+SK++E H+ HL +VL+ LRE +LYAKFSKCEFWLNE++FLGH+ S +
Sbjct: 992  DDILVYSKTKEEHDRHLRIVLQTLREKQLYAKFSKCEFWLNEISFLGHVVSAE 1044



 Score =  136 bits (342), Expect(2) = 8e-99
 Identities = 86/293 (29%), Positives = 133/293 (45%), Gaps = 22/293 (7%)
 Frame = +3

Query: 168  CSHCRKPGHTPDQCWRKQGKCLKCGSDQHQLCDCPMIST---PENKPAPPYKSGNGPNNR 338
            C HCR+  HT   C    G C  CGS  H + DCP   T   P  +   P         R
Sbjct: 584  CDHCRRR-HT-GTCRLLTGACFICGSMDHIMRDCPKKQTGLAPSTERTAPVTQKTRSKGR 641

Query: 339  A-------------------RAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGS 461
            +                   RAPAR + +  +E  ++++V+ G   +       L DPGS
Sbjct: 642  SELTGTSSQRVSKTMDRPESRAPARAYTIKAREDQDSSNVIMGIFSIFGRSVHALIDPGS 701

Query: 462  YHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLP 641
             HS++     +K     + +   + V+ P+G S +   +   C I I    F  +LI LP
Sbjct: 702  THSYICIPITNKKELQADILDQDIVVTNPLGHSVVVSKVFKDCPILIHGHIFHGDLIELP 761

Query: 642  ISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAI 821
              E+D+IL MD L  H    DC  K + L       V + G     +  +ISA  AR+ I
Sbjct: 762  FREFDVILRMDWLFRHQVIVDCRKKRIMLKTPEGEEVVVVGERSDFLSNVISATAARRMI 821

Query: 822  TNGAHGYLAFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEI 980
              G   YLA ++   K++  + ++P + ++ +VF  EL  +PP+R+V+FAIE+
Sbjct: 822  RKGCGAYLACVLEAKKEKHIVQDIPTICDFSDVFPDELPGLPPEREVEFAIEV 874


>gb|KZV22344.1| hypothetical protein F511_20441 [Dorcoceras hygrometricum]
          Length = 790

 Score =  243 bits (619), Expect(2) = 9e-98
 Identities = 108/170 (63%), Positives = 143/170 (84%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+P+L  KK +G+ R+CID+R LN  T+KNKYPLPRID+LFDQLQG+T+YSK+DL+ GY+
Sbjct: 547  GAPILFVKKNDGSMRMCIDFRQLNKATVKNKYPLPRIDDLFDQLQGSTVYSKIDLRFGYH 606

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            QL+++++ + KTAF+TRYG +EF+VMP GLTNAP VFMDLMHRVF  ++D+F+++FIDDI
Sbjct: 607  QLRVREQDVAKTAFRTRYGHFEFLVMPFGLTNAPAVFMDLMHRVFREFIDQFMIVFIDDI 666

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            +V+SK+   H+ HL MVL  LRE +LYAKFSKCEFWL+ V FLGH+ S D
Sbjct: 667  LVYSKTPREHKEHLEMVLHRLREQQLYAKFSKCEFWLDRVVFLGHVISAD 716



 Score =  145 bits (366), Expect(2) = 9e-98
 Identities = 105/352 (29%), Positives = 158/352 (44%), Gaps = 38/352 (10%)
 Frame = +3

Query: 192  HTPDQCWRKQGKCLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGPNNRARA-------- 347
            H   QC   QG C  CG   H    CP   + +    P    G G  +R R+        
Sbjct: 207  HPSTQCVGVQGSCNLCGQYGHFARVCPSAGSQQTAAQP---QGRGGQSRGRSQQFQQPRF 263

Query: 348  ---PARVF-----ALVGQE------GPNATDV---------------VKGTL*VSDHRTK 440
               P R F     +  GQ       GP    V               + GT  + D   +
Sbjct: 264  GETPFRPFQQPDLSRFGQSSQPFFPGPQHAQVNAFTREQAEETPSRVIGGTCFIFDFPAR 323

Query: 441  ILFDPGSYHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFD 620
            +LFD G+ HSF++  FV +   C   +   + VSTP G S  +  I   C ++       
Sbjct: 324  VLFDTGASHSFISDSFVVEHGLCTVPLHDVVSVSTPGGVSLFSQEILLDCVLRFGENALL 383

Query: 621  ANLILLPISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISA 800
            ANLI L + ++  I+GMD LS + A  DC +  V+   L     +L G+   S  P++SA
Sbjct: 384  ANLIRLMLWDFVCIVGMDVLSNYMASVDCFHGIVRFRPLSGEKWDLYGQDSRSKIPLVSA 443

Query: 801  IKARKAITNGAHGYLAFIINKPKDET*-IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIE 977
            ++    ++ G  G++ + ++     +  + +VP+V  + +VF +E+   PP R++ F+IE
Sbjct: 444  MEMFSLLSLGNAGFMIYALDASSSSSVQLSDVPVVRHFPDVFPEEIPGFPPRREIDFSIE 503

Query: 978  IFPGTASIARTPYRMAHVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKK 1133
            + PGTA I+R PYR+A V            +E GFIRPS SPW A     KK
Sbjct: 504  LVPGTAPISRAPYRLAPVELRELKVQLDDLLEKGFIRPSMSPWGAPILFVKK 555


>ref|XP_023520277.1| LOW QUALITY PROTEIN: uncharacterized protein LOC111783585 [Cucurbita
            pepo subsp. pepo]
          Length = 972

 Score =  233 bits (595), Expect(2) = 1e-96
 Identities = 113/168 (67%), Positives = 137/168 (81%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+PVL  KK +G+ RLCIDYR LN  T+KNKYPLPRI++LFDQL+ AT++SK+DL+ GY+
Sbjct: 554  GAPVLFVKKIDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLKEATVFSKIDLRSGYH 613

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            Q+KIK E IPKTAF+TRYG YEFVVM  GLTNAPVVFM+LM+ VF   LD FV++FIDDI
Sbjct: 614  QIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPVVFMELMNHVFKECLDTFVIVFIDDI 673

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            +V+SK+   H+ HL   L  LRE+KLYAKFSKCEFWL E TFLGH+ S
Sbjct: 674  LVYSKTDHEHQLHLRKALTILRENKLYAKFSKCEFWLQEDTFLGHVIS 721



 Score =  150 bits (380), Expect(2) = 1e-96
 Identities = 100/324 (30%), Positives = 150/324 (46%), Gaps = 2/324 (0%)
 Frame = +3

Query: 168  CSHCRKPGHTPDQCWRKQGKCLKCGSDQHQLCDCPMISTPE--NKPAPPYKSGNGPNNRA 341
            C  C K  +   QC  + G C +CG + H   DCP   T +  N+  P   + N P NR 
Sbjct: 287  CKDCGK--NHWGQCLARSGACFRCGKEGHLAKDCPRNHTSDVGNQQKPLRTADNPPPNRP 344

Query: 342  RAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTCMEHM 521
              PAR +A   ++  N    V GT                                    
Sbjct: 345  --PARAYASTSKDTGNPDAAVTGT------------------------------------ 366

Query: 522  PIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGHFAQT 701
            P G+ +   + A ++ DS      + +  V  + +LI+L +  YD+ILGMD L+ + A  
Sbjct: 367  PAGVNM---IAAYRVKDS-----HVLVSGVEIEVDLIVLDMYVYDVILGMDWLAKNHASI 418

Query: 702  DCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPKDET* 881
            DCH K V      K   + +G +  ++P +IS +KA+K + +GA   LA +++  K+E  
Sbjct: 419  DCHKKEVVFTPPSKTRFKFNGTSLGTVPKVISVMKAKKLVQHGAWAILASVVDTRKEEVS 478

Query: 882  IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXXXXXXXX 1061
               +P+VSE+ +VF ++   IPP R+V FAI + PGT  I++ PYRMA            
Sbjct: 479  PDTLPVVSEFPDVFPEDFPGIPPTRQVDFAIXLEPGTGPISKAPYRMATAELKELKTQLQ 538

Query: 1062 XXMEVGFIRPSTSPWEAQCFCQKK 1133
              ++ GFIRPS SPW A     KK
Sbjct: 539  ELLDKGFIRPSVSPWGAPVLFVKK 562


>gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis]
          Length = 1088

 Score =  233 bits (593), Expect(2) = 1e-95
 Identities = 108/188 (57%), Positives = 145/188 (77%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+PVL +KK +G+ RLC+DYR LN VTIKNKYPLP I+ELFDQL  +  +SK+DL+ GY+
Sbjct: 473  GAPVLFAKKHDGSLRLCVDYRQLNRVTIKNKYPLPCINELFDQLGRSRYFSKIDLRSGYH 532

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            QL++++E + K AF+TRYG YEF+VMP GLTNAP  FMDLM+RVF PYLD+F+++FIDDI
Sbjct: 533  QLRVREEDVSKIAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFRPYLDRFIIVFIDDI 592

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGDEDVMWEMEEE 1645
            +++SK++E H  HL + L+ LREH LYAK  KC+FW+ +V FLGH+     DV +  +EE
Sbjct: 593  LIYSKTQEEHAEHLRIALQTLREHSLYAKKEKCDFWMTDVKFLGHV-----DVRFVWDEE 647

Query: 1646 ILRQYPEL 1669
                + EL
Sbjct: 648  CEEAFMEL 655



 Score =  148 bits (373), Expect(2) = 1e-95
 Identities = 92/279 (32%), Positives = 140/279 (50%), Gaps = 7/279 (2%)
 Frame = +3

Query: 318  GNGPNNRARAPARVFALVGQEGPNATD-----VVKGTL*VSDHRTKILFDPGSYHSFVAP 482
            G G  N+ ++  + +A+     P         VV GT+ VS    ++LFD G+ HSF++ 
Sbjct: 206  GRGQKNKGKSHGQAYAVTSTATPGRGQQADHSVVDGTILVSHSWAQVLFDTGATHSFISM 265

Query: 483  QFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDII 662
             F   L   ++     L +STPMG       I + C I + +    A+L +LP++ +D+I
Sbjct: 266  LFASVLQLSVDTHDPPLTLSTPMGGIAEVSMIRSPCCIVLGDHRLSADLFVLPMAGFDVI 325

Query: 663  LGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMP-PIISA-IKARKAITNGAH 836
            LGMD LS + A  DC+ + V L      V++   +     P P++ A I  RK + +   
Sbjct: 326  LGMDWLSKYHATVDCYRRRVTLLTKNGQVIDYQAKTGAVTPSPVLKACIGGRKNLESLG- 384

Query: 837  GYLAFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPY 1016
              + F +    +      VPIV ++ +VF  EL  +PPDR+++F I++ PGT+ I+  PY
Sbjct: 385  --MVFALGGESEANDSSYVPIVDDFQDVFPSELPGLPPDREIEFCIDLVPGTSPISIAPY 442

Query: 1017 RMAHVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKK 1133
            RMA              ME GFIRPSTSPW A     KK
Sbjct: 443  RMAPAKNVELRKQLQKLMEKGFIRPSTSPWGAPVLFAKK 481


>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score =  230 bits (587), Expect(2) = 1e-94
 Identities = 111/179 (62%), Positives = 141/179 (78%)
 Frame = +2

Query: 1127 KKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKE 1306
            K+   + R+C  +  LN VT+KNKYPLPRID+LFDQLQGA  +SK+DL+ GY+QL+I+ E
Sbjct: 689  KELKDSWRIC--WIKLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGYHQLRIRNE 746

Query: 1307 GIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSR 1486
             IPKTAF+TRYG YEF+VM  GLTNAP  FMDLM+RVF PYLDKFV++FIDDI+++SKSR
Sbjct: 747  DIPKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDDILIYSKSR 806

Query: 1487 EHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGDEDVMWEMEEEILRQYP 1663
            E HE HL +VL+ LREH+LYAKFSKCEFWL  V FLGH+ S +   +   + E + ++P
Sbjct: 807  EEHEQHLKIVLQILREHRLYAKFSKCEFWLERVAFLGHVVSREGIQVDTKKIEAVEKWP 865



 Score =  147 bits (371), Expect(2) = 1e-94
 Identities = 94/308 (30%), Positives = 150/308 (48%), Gaps = 34/308 (11%)
 Frame = +3

Query: 204  QCWRKQGKCLKCGSDQHQLCDCPMI---------STPENKPAP----------------- 305
            +C+     C  CG   H + DCPM          ST     AP                 
Sbjct: 378  RCFLTTKTCYGCGQPGHIMKDCPMAHQSPDSARGSTQPASSAPSVAVSSGLEVSGSRGRG 437

Query: 306  --------PYKSGNGPNNRARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGS 461
                    P +SG+  ++  R  ARVFAL  QE   +  VV G L V +   ++LFDPG+
Sbjct: 438  AGTSSQGRPSRSGH-QSSIGRGQARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGA 496

Query: 462  YHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLP 641
             HSF++P F  +L          L VST +    + +    +C +++++     NL++L 
Sbjct: 497  THSFISPCFASRLGRGRVRREEQLVVSTLLKEIFMAEWEYESCVVRVKDKDTSVNLVVLD 556

Query: 642  ISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAI 821
              ++D+ILGMD LS   A  DC++K V+    G+P   + G    +   +IS I AR+ +
Sbjct: 557  TLDFDVILGMDWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDMSNAPTNLISVISARRLL 616

Query: 822  TNGAHGYLAFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASI 1001
              G  GYLA + +       + +V +V E+++VF +EL+  PP+R+++F I++ P T  +
Sbjct: 617  RQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELSGFPPEREIEFCIDLIPDTRPM 676

Query: 1002 ARTPYRMA 1025
            +  PYRMA
Sbjct: 677  SIPPYRMA 684


>ref|XP_022933231.1| LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita
            moschata]
          Length = 1803

 Score =  219 bits (559), Expect(2) = 2e-92
 Identities = 104/161 (64%), Positives = 131/161 (81%)
 Frame = +2

Query: 1133 KNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKEGI 1312
            K+ + RLCI YR LN  T+KNKYPLPRI++LFDQL+GAT++SK+DL+ GY+Q+KIK E I
Sbjct: 574  KDDSMRLCIGYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDI 633

Query: 1313 PKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSREH 1492
            PKTAF+TRYG YEFVVM  GLTNAP VFM+LM+RVF   LD FV++FIDDI+++SK+   
Sbjct: 634  PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLK 693

Query: 1493 HEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            H+ HL   L  LRE+KLYA F+KCEFW+ +V+FLGHI S D
Sbjct: 694  HQEHLRKALTILRENKLYANFTKCEFWIXQVSFLGHIVSKD 734



 Score =  150 bits (380), Expect(2) = 2e-92
 Identities = 106/335 (31%), Positives = 157/335 (46%), Gaps = 11/335 (3%)
 Frame = +3

Query: 54   ASGSKRPIVEKTTG--PPT---KFQ-RGGIGPANNEKQSV*SITCSHCRKPGHTPDQCWR 215
            A G KRP+   T    PP+   ++Q R    P      ++    C +C K  H   +C  
Sbjct: 230  AIGRKRPVEVDTIEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQ-HV-GRCLA 287

Query: 216  KQGKCLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGPNNR-----ARAPARVFALVGQE 380
              G C  CG   H    CP  S     P  P +   GP  R          + +     E
Sbjct: 288  GSGMCYICGHAGHVARTCPTKS-----PGIPREPLRGPVIREPTLQTSPQTKAYVTTSNE 342

Query: 381  GPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGAS 560
               +  VV GTL +  H    LFD GS HSFVA  F+ +    +E +   L V TP G  
Sbjct: 343  AGTSGTVVTGTLSILGHFALTLFDSGSTHSFVASPFIKQAGFVIEPLMHALSVGTPAGVD 402

Query: 561  QITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLG 740
             +T       ++ I       +L ++ ++++D+ILGMD L+ +FA  DCH K V      
Sbjct: 403  LVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPN 462

Query: 741  KPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPKDET*IHEVPIVSEYLNV 920
                +  G +  + P IIS +KAR+ I  G   +LA+ +N    E  I  +P+V+E+++V
Sbjct: 463  GLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDV 522

Query: 921  FSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
            F ++L  IPP R+V F I++  GT  I++ PYRMA
Sbjct: 523  FPEDLPGIPPSREVDFGIDLELGTGPISKAPYRMA 557


>gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo]
          Length = 1359

 Score =  228 bits (582), Expect(2) = 3e-91
 Identities = 105/161 (65%), Positives = 134/161 (83%)
 Frame = +2

Query: 1127 KKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKE 1306
            KKK+ + RLCIDYR LN VT+KN+YPLP+ID+LFDQLQGATL+SK+DL+ GY+QL+IK  
Sbjct: 497  KKKDRSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATLFSKIDLRSGYHQLRIKDR 556

Query: 1307 GIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSR 1486
             +PKTAF +RYG YEF+VM   LTNAP VFMDLM+RVF  +LD FV++FI+DI+++SK  
Sbjct: 557  DVPKTAFHSRYGHYEFIVMSFALTNAPSVFMDLMNRVFREFLDTFVIVFINDILIYSKIE 616

Query: 1487 EHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
              HE HL MVL+ L+++KLYAKF KCEFWL +V+FLGH+ S
Sbjct: 617  AEHEEHLRMVLQTLQDNKLYAKFLKCEFWLKQVSFLGHVVS 657



 Score =  137 bits (346), Expect(2) = 3e-91
 Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 2/226 (0%)
 Frame = +3

Query: 354  RVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTCMEHMPIG- 530
            +VFA    E   A+ VV GTL V  H   +LFD G  HSF++  FV  LH  +E  P+  
Sbjct: 266  KVFATNKTEAERASTVVTGTLPVLGHYALVLFDSGFSHSFISSAFV--LHARLEVEPLHH 323

Query: 531  -LEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGHFAQTDC 707
             L VSTP G   ++      C+I+I     +  L++L + ++D+ILGMD L+ + A  DC
Sbjct: 324  VLSVSTPFGECMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDC 383

Query: 708  HNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPKDET*IH 887
              K +          +       S+P +ISA++A K ++ G    LA +++  + +  + 
Sbjct: 384  SRKEIAFNPPSMANFKFKEEGSRSLPKVISAMRASKLLSQGIWSILASVVDTREVDVSLS 443

Query: 888  EVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
              P+V +Y +VF +EL  +PP R+++FAIE+  GT  I+R PYRMA
Sbjct: 444  SKPMVRDYPDVFPEELPGLPPHREIEFAIELELGTVPISRAPYRMA 489


>ref|XP_023745456.1| uncharacterized protein LOC111893627 [Lactuca sativa]
          Length = 1413

 Score =  213 bits (541), Expect(2) = 6e-91
 Identities = 97/148 (65%), Positives = 123/148 (83%)
 Frame = +2

Query: 1172 LNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKEGIPKTAFQTRYGLYE 1351
            LN +T+KNKYPLPRID+LFDQLQGA  +SK+DL+ GY+Q++++ + +PKTAF+TRYG YE
Sbjct: 656  LNKITVKNKYPLPRIDDLFDQLQGACYFSKIDLRSGYHQVRVRGQDVPKTAFRTRYGHYE 715

Query: 1352 FVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSREHHEHHLIMVLEALR 1531
            F+VMP GLTNAP VFMDLM+RV  P+LDK V++FIDDI+++S+S E H  HL  VL+ LR
Sbjct: 716  FLVMPFGLTNAPAVFMDLMNRVCRPFLDKSVIVFIDDILIYSRSVEDHRRHLSEVLDTLR 775

Query: 1532 EHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            + KLYAKFSKCEFWL EV FLGH+   D
Sbjct: 776  KEKLYAKFSKCEFWLREVQFLGHLVGED 803



 Score =  152 bits (385), Expect(2) = 6e-91
 Identities = 99/329 (30%), Positives = 145/329 (44%)
 Frame = +3

Query: 126  GPANNEKQSV*SITCSHCRKPGHTPDQCWRKQGKCLKCGSDQHQLCDCPMISTPENKPAP 305
            GP +N      +  C  C K GH  + C   +  C  C    H    CP        P  
Sbjct: 335  GPCSNS-----TTRCKRCGKIGHRLEDCKSAEPICYNCRQMGHISNQCP-------NPRV 382

Query: 306  PYKSGNGPNNRARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQ 485
               SG   ++  +  AR F +   E     +V+ GT  V+     +LFD G+  SFV+  
Sbjct: 383  QTGSGGKKDDAPKIKARAFNMTAAEARQHDEVIFGTFLVNSIPATVLFDGGASRSFVSLP 442

Query: 486  FVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIIL 665
            F   +      +   LEV    G         + C I I    F   LI + +  +D+++
Sbjct: 443  FCAHIDIPRTPLETELEVEVATGQLVAVREKYDGCVISIGEHTFPLTLIPIGVGSFDVVI 502

Query: 666  GMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYL 845
            GMD LS + A   C +K V++           G         IS +KARK I  G   +L
Sbjct: 503  GMDWLSANRAHILCADKLVRIPLPSGDYATAYGEHHSRSTSFISVMKARKCIAKGCPVFL 562

Query: 846  AFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
            A ++N   +E  + EV +V +Y++VF  +L  +PP R+V F I+I PG A IA+ PYR+A
Sbjct: 563  AHVVNSNSEELGLSEVDVVRDYVDVFPNDLPGLPPPRQVDFHIDIIPGAAPIAKAPYRLA 622

Query: 1026 HVXXXXXXXXXXXXMEVGFIRPSTSPWEA 1112
                          ++ GFIRPS+SPW A
Sbjct: 623  PSEMKEMMSQLQELLDKGFIRPSSSPWGA 651


>ref|XP_024172019.1| uncharacterized protein LOC112178032 [Rosa chinensis]
          Length = 1007

 Score =  235 bits (600), Expect(2) = 6e-91
 Identities = 107/169 (63%), Positives = 138/169 (81%)
 Frame = +2

Query: 1109 SPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQ 1288
            +PVL  KKK+ +  LC+DYR LN VTIKN+YPLPRID+LFDQL+ AT++SK+DL+ GY+Q
Sbjct: 573  APVLFVKKKDNSLHLCVDYRQLNKVTIKNRYPLPRIDDLFDQLREATVFSKIDLRSGYHQ 632

Query: 1289 LKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDII 1468
            L++K E IPKTAF+TRYG Y+F+VMP GLTNAP  FMDLM+R F PYLD+FV++F+DDI+
Sbjct: 633  LRVKDEDIPKTAFRTRYGHYQFLVMPFGLTNAPAAFMDLMNRTFSPYLDQFVVVFVDDIL 692

Query: 1469 VHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRSGD 1615
            ++SKS + HE HL +VL+ L+E +LYAK  KCEFW  EV FL H+ S D
Sbjct: 693  IYSKSSDEHEKHLRIVLQTLKEKELYAKLEKCEFWQKEVKFLSHVVSKD 741



 Score =  130 bits (326), Expect(2) = 6e-91
 Identities = 100/330 (30%), Positives = 151/330 (45%), Gaps = 5/330 (1%)
 Frame = +3

Query: 162  ITCSHCRKPGHTPDQCWR-KQGKCLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGPNNR 338
            + C +C + GH    C + K+  C  CG   H   +CP            ++   G  N+
Sbjct: 286  LKCFNCHELGHISRNCLKPKKIVCYTCGQAGHISRECP------------HQWDRGQRNQ 333

Query: 339  AR----APARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHT 506
             R      ARVFA +GQ G      V+GTL + ++  ++LFD G+ HSF++   V  L  
Sbjct: 334  QRQQPQGQARVFA-IGQGGT----WVEGTLSIYNYLARVLFDMGASHSFISSSVVDVLGL 388

Query: 507  CMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSG 686
                +   L V++P+G S   D   N C + I    F A+LI++P   YD+ILGMD LS 
Sbjct: 389  ISIPLTGSLCVTSPLGVSLELDMFCNDCPLWICGKEFSASLIVIPDHTYDVILGMDWLSP 448

Query: 687  HFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKP 866
            + A  DC    V     G+PV                 +++  A+  G   +    I   
Sbjct: 449  NHALIDCFRMIVSFRIPGQPVFH------------YHCLRSDIAMRTGTLAH----IESG 492

Query: 867  KDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMAHVXXXXX 1046
               + I  +P+VSEY +VF QE+  +PP R + F+I++ PGTA +++  YR+A       
Sbjct: 493  SSTSEISGIPVVSEYADVF-QEIPGLPPKRVMDFSIDVIPGTAPVSKALYRLAPAELQEL 551

Query: 1047 XXXXXXXMEVGFIRPSTSPWEAQCFCQKKK 1136
                   +   FI+ S S W A     KKK
Sbjct: 552  KVQIDGLLAQEFIQASVSYWLAPVLFVKKK 581


>ref|XP_015078330.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107022151
            [Solanum pennellii]
          Length = 1581

 Score =  200 bits (508), Expect(2) = 7e-91
 Identities = 94/146 (64%), Positives = 117/146 (80%)
 Frame = +2

Query: 1172 LNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKEGIPKTAFQTRYGLYE 1351
            LN VT+KN Y +PRID+LFDQLQGA+++SK+DL+ GY+QL+I+   IPKTAF+TRYG YE
Sbjct: 717  LNKVTVKNCYLMPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRAADIPKTAFRTRYGHYE 776

Query: 1352 FVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSREHHEHHLIMVLEALR 1531
            F+VM  GLTNAP  FMDLM RVF PY+D FV+IFIDDI+V+S+    HE HL +VL+ L 
Sbjct: 777  FLVMSFGLTNAPAAFMDLMTRVFRPYIDSFVIIFIDDILVYSRCWSEHEQHLRIVLQTLT 836

Query: 1532 EHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            + +LYA FSKCEFWL  V FLGH+ S
Sbjct: 837  DQQLYANFSKCEFWLASVAFLGHVVS 862



 Score =  165 bits (417), Expect(2) = 7e-91
 Identities = 112/350 (32%), Positives = 160/350 (45%), Gaps = 36/350 (10%)
 Frame = +3

Query: 213  RKQGKCLKCGSDQHQLCDCPMISTPENKPAPPYKS------------------------- 317
            R    C +CG+  H   +CP        PAPP                            
Sbjct: 377  RSTSGCYECGALDHWSRECPRRGRGAIVPAPPTSKPVSAVSSSARGGGQIQHSRESRQGT 436

Query: 318  -----GNGPNNRARAPAR-----VFALVGQEGPNATD-VVKGTL*VSDHRTKILFDPGSY 464
                 G     R  AP R      +A   +    A+D V+ GTL +      +LFDPGS 
Sbjct: 437  SGGARGGRSGGRPGAPGRGAQGHFYAAPTRAAAEASDDVISGTLFLCHQPATVLFDPGST 496

Query: 465  HSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPI 644
             S+V+  F  +L    E +   + VSTP+G   + D +  +C + I+     A+LI+L +
Sbjct: 497  FSYVSIYFAPRLGMRSESLAEPIHVSTPIGEFLVVDQVLRSCLVTIQGYDTRADLIMLDM 556

Query: 645  SEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAIT 824
             ++D+ILGMD LS +    DC+ KTV L   G P V        +   IIS I+AR+ + 
Sbjct: 557  IDFDVILGMDWLSPYHVVLDCYAKTVTLSMPGVPPVLWQATYSHTPTGIISFIRARRLVA 616

Query: 825  NGAHGYLAFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIA 1004
            +G   YLA I +  ++   +  VP+V EY +VF  +L  +PP+R + FAI++ PGT  I+
Sbjct: 617  SGCLAYLAHIRDVSREGPSVDSVPVVREYADVFPTDLPGLPPERDIDFAIDLEPGTRPIS 676

Query: 1005 RTPYRMAHVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKKKMEHKGC 1154
              PYRMA              +E GFIRPS SPW A    Q  K+  K C
Sbjct: 677  LPPYRMAPAELRELSVQSKDLLEKGFIRPSVSPWGAPVL-QLNKVTVKNC 725


>gb|AAX95749.1| Retrotransposon gag protein, putative [Oryza sativa Japonica Group]
 gb|AAP52632.2| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1608

 Score =  241 bits (615), Expect(2) = 4e-90
 Identities = 110/163 (67%), Positives = 141/163 (86%)
 Frame = +2

Query: 1121 LSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIK 1300
            + ++K+ TQR+C+DYR LN+VTIKNKYPLPRID+LFDQL+GAT++SK+DL+ GY+QL+IK
Sbjct: 763  VKRQKDHTQRMCVDYRALNDVTIKNKYPLPRIDDLFDQLKGATVFSKIDLRSGYHQLRIK 822

Query: 1301 KEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSK 1480
            +E IPKTAF TRYGL+E  VM  GLTNAP  FM+LM++VF  YLDKFV++FIDDI+++S+
Sbjct: 823  EEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNKVFMEYLDKFVVVFIDDILIYSR 882

Query: 1481 SREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            ++E HE HL + LE LREH+LYAKFSKCEFWL EV FLGH+ S
Sbjct: 883  TKEEHEEHLRLALEKLREHQLYAKFSKCEFWLFEVKFLGHVIS 925



 Score =  121 bits (304), Expect(2) = 4e-90
 Identities = 105/354 (29%), Positives = 155/354 (43%), Gaps = 13/354 (3%)
 Frame = +3

Query: 3    SRVEMGLNRIGQIQQSKASGSKRPIVEKTTGPPTKFQRGGIGPANNEKQSV*SITCSHCR 182
            +R+E    RI Q +  + +  K  +   T GP    Q G       ++Q   +   ++ R
Sbjct: 440  NRMEQKKRRIAQFKTQQGNNQKPRL---TLGPQPMPQGGSSSVVRPQRQFFNNNAGNNIR 496

Query: 183  ----KPGHTPDQ---CWRKQGK----CLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGP 329
                +P   P Q     R+QG     C  CG  +H    CP     +  PA         
Sbjct: 497  NQAPRPVAAPAQQQPAKREQGNKPVVCFNCGDPRHYADKCPKPRRVKVVPAQ-------- 548

Query: 330  NNRA--RAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLH 503
            NN A   + ARV  +   E  +A DV+ GT  V+     +LFD G+ HSF++  F     
Sbjct: 549  NNSAVPASKARVNHVAAAEAQDAPDVILGTFLVNSVPATVLFDSGATHSFLSMSFAGNHG 608

Query: 504  TCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LS 683
              +E +   L VSTP   + ++     +  I+I+ V F ANLILL   + D+ILGMD L+
Sbjct: 609  MEVEDLRRPLMVSTPSNQA-LSLQRSPSVRIEIQGVPFLANLILLESKDLDVILGMDWLA 667

Query: 684  GHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINK 863
             H    DC NK V L      VV +   +  S+   ++ I                    
Sbjct: 668  KHKGVIDCANKKVTLTSYDGRVVTVHALSSESLRSRLNQIT------------------- 708

Query: 864  PKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
                  + E+PIV EY +VF  +L  +PP R ++F I++ PGT  I + PYRMA
Sbjct: 709  ------LEEIPIVREYPDVFPDDLPGMPPKRDIEFRIDLVPGTTPIHKRPYRMA 756


>ref|XP_024032296.1| uncharacterized protein LOC112094830 [Morus notabilis]
          Length = 771

 Score =  219 bits (559), Expect(2) = 1e-89
 Identities = 103/154 (66%), Positives = 128/154 (83%)
 Frame = +2

Query: 1106 GSPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYY 1285
            G+P+L +KK +G+ RLCIDYR LN VT+KNKYPLPRIDELFDQL G+  YSK+DL+ GY+
Sbjct: 525  GAPILFAKKHDGSLRLCIDYRQLNRVTVKNKYPLPRIDELFDQLGGSRYYSKIDLRSGYH 584

Query: 1286 QLKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDI 1465
            QLKI+K+ IPKTAF+TRYG YEF+VMP G T AP  FMDLM+RVF PYLD+FV++FIDDI
Sbjct: 585  QLKIRKDDIPKTAFKTRYGHYEFLVMPFGWTKAPAAFMDLMNRVFRPYLDQFVIVFIDDI 644

Query: 1466 IVHSKSREHHEHHLIMVLEALREHKLYAKFSKCE 1567
            +V+SK+ E HE HL +VL+ LREH+LYA   K +
Sbjct: 645  LVYSKTWEEHEQHLRIVLQTLREHQLYANKEKLD 678



 Score =  141 bits (355), Expect(2) = 1e-89
 Identities = 97/297 (32%), Positives = 147/297 (49%), Gaps = 11/297 (3%)
 Frame = +3

Query: 276  ISTPENKPAPPYKSGNG-PNNRARAPARVFALV-----GQEGPNA-TDVVKGTL*VSDHR 434
            +S     P   Y+  N  PNN  R   +    +     G  G +  + VVKG + +S   
Sbjct: 242  VSVQSTPPQQQYRPVNQRPNNNEREKGKALGQMHTMAGGSSGAHTGSPVVKGMVSISHSF 301

Query: 435  TKILFDPGSYHSFVAPQFVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVH 614
             ++LFD GS HSFV+  FV  L    + +   + +S+P+G  ++T S+  +C I I +  
Sbjct: 302  ARVLFDTGSTHSFVSTSFVKILGLKPDDLETSMFISSPLGCMEVT-SVCRSCVITIGSEK 360

Query: 615  FDANLILLPISEYDIILGMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPII 794
              A+LI+LP++++D++LGMD LS + A  DCH   V L       V   G     + P+I
Sbjct: 361  LKADLIILPMNQFDVVLGMDWLSRYGAIVDCHRMRVTLTIGSGTTVTYQG----GVNPVI 416

Query: 795  SAIKARKAITNGAHGYLAFIINKPKDET*IH----EVPIVSEYLNVFSQELTTIPPDRKV 962
                 R ++    +      ++  + E+ I     EVP+V +Y +VF  EL  + PDR++
Sbjct: 417  EERLLRHSVGGRQNLACFSFLSALEGESSIAGENIEVPVVDKYADVFPDELLGLLPDREI 476

Query: 963  QFAIEIFPGTASIARTPYRMAHVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKK 1133
            +F I++ P TA I+  PYRMA V             E GFIR STSPW A     KK
Sbjct: 477  EFCIDLLPETAPISIAPYRMAPVEMKELRKQLGELAEKGFIRNSTSPWGAPILFAKK 533


>gb|KZV45872.1| hypothetical protein F511_35060, partial [Dorcoceras hygrometricum]
          Length = 859

 Score =  221 bits (562), Expect(2) = 4e-89
 Identities = 103/157 (65%), Positives = 130/157 (82%)
 Frame = +2

Query: 1109 SPVLLSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQ 1288
            +PVL  +KK+G+ RLCIDYR LN  T+KNKYPLPRID+LF QLQG+++YSK+DL+ GY+Q
Sbjct: 542  APVLFVRKKDGSMRLCIDYRQLNKATVKNKYPLPRIDDLFYQLQGSSVYSKIDLRSGYHQ 601

Query: 1289 LKIKKEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDII 1468
            L+++ E I KTAF+TRYG YEF+VMP GLTNAP VFM LM+RVF  YLD FV+IFIDDI+
Sbjct: 602  LRVRDEDISKTAFRTRYGHYEFIVMPFGLTNAPAVFMSLMNRVFQRYLDDFVIIFIDDIL 661

Query: 1469 VHSKSREHHEHHLIMVLEALREHKLYAKFSKCEFWLN 1579
            ++SK+   H +HL  VL+ LR+ KLYAK SKCEFWL+
Sbjct: 662  IYSKNMCDHANHLRTVLQTLRDEKLYAKLSKCEFWLD 698



 Score =  138 bits (348), Expect(2) = 4e-89
 Identities = 100/397 (25%), Positives = 175/397 (44%), Gaps = 29/397 (7%)
 Frame = +3

Query: 33   GQIQQSKASGSKRPIVEKTTGPPTKFQRGGIGPANNEKQSV*SITCSHCRKPG--HTPDQ 206
            G+ +Q K SG+       ++   ++F++ G G  + +          +C K G  H  +Q
Sbjct: 177  GKGKQFKRSGTS------SSSSSSEFKQLGAGQKSGD----------YCTKCGGKHNTEQ 220

Query: 207  CWRKQGKCLKCGSDQHQLCDCPM--------------ISTPENK--------PAPPYKSG 320
            C    G C  C    H    CP               +  PE +        P    +S 
Sbjct: 221  CRGVFGLCRICNQPGHFARICPQRGAGNSQNTGASRSLPPPERQASSVHSFQPQNQQQSR 280

Query: 321  NGPNNRARAP----ARVFALVGQEGPNATD-VVKGTL*VSDHRTKILFDPGSYHSFVAPQ 485
             G +     P    ARVFAL  ++   A D V  G   +      +L D G+ H+F++ +
Sbjct: 281  QGGSQTVSQPPKQQARVFALTEEQAQAAPDNVTAGNCCLCSFPAYVLVDTGASHTFISEK 340

Query: 486  FVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIIL 665
                L   +        +S+P+G   ++      C ++ E    + + I+L +S++D I+
Sbjct: 341  TTESLTEVVS-------ISSPLGRGILSVKTFRNCILQFEGHEIEIDCIVLGLSDFDCII 393

Query: 666  GMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYL 845
            G+D L+ + A  DC  K V+         +  G+   +  P++S I     +  GA G+L
Sbjct: 394  GIDMLTKYRATVDCFQKVVRFKPEKTDEWKFYGKGSRARIPLVSVISMTNLLQKGAEGFL 453

Query: 846  AFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
             + ++  K+   + ++P+V ++ +VF  E+  +PP R+V F+IE+ PGT  I++ PY+MA
Sbjct: 454  IYAVDVNKNSPNLVDIPVVCDFADVFPDEIPGLPPYREVDFSIELIPGTQPISKAPYQMA 513

Query: 1026 HVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKKK 1136
             +            +  G+IRPS SPW A     +KK
Sbjct: 514  PIELKELKEQLEDLLAKGYIRPSVSPWSAPVLFVRKK 550


>ref|XP_019071093.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC104649181
            [Solanum lycopersicum]
          Length = 1149

 Score =  205 bits (522), Expect(2) = 2e-88
 Identities = 98/146 (67%), Positives = 119/146 (81%)
 Frame = +2

Query: 1172 LNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIKKEGIPKTAFQTRYGLYE 1351
            LN VT+KN YP+PRID+LFDQLQGA ++SK+DL+ GY+QL+I+   I KTAF+TRYG YE
Sbjct: 696  LNKVTVKNCYPMPRIDDLFDQLQGAVVFSKIDLRSGYHQLRIRAADIAKTAFRTRYGDYE 755

Query: 1352 FVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSKSREHHEHHLIMVLEALR 1531
            F+VM  GLTNAP  FMDLM RVF PYLD FV+IFIDDI+V+S+SR  HE HL +VL+ LR
Sbjct: 756  FLVMSFGLTNAPAAFMDLMTRVFRPYLDSFVIIFIDDILVYSRSRSEHEQHLRIVLQTLR 815

Query: 1532 EHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            + +LYAKFSKCEF L  V FLGH+ S
Sbjct: 816  DQQLYAKFSKCEFXLASVAFLGHVVS 841



 Score =  152 bits (383), Expect(2) = 2e-88
 Identities = 98/283 (34%), Positives = 144/283 (50%), Gaps = 2/283 (0%)
 Frame = +3

Query: 312  KSGNGPNNRAR-APARVFALVGQEGPNATD-VVKGTL*VSDHRTKILFDPGSYHSFVAPQ 485
            +SG  P    R A    +A   +    A+D V+ GTL +      +LFDPGS  S+V+  
Sbjct: 423  RSGGRPGAPGRGAQGHFYAAPTRAAAEASDDVISGTLFLCHQPATVLFDPGSTFSYVSIY 482

Query: 486  FVHKLHTCMEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIIL 665
            F  +L    E +   + VST +G   + D +  +C + I+     A+LI+L + ++D+IL
Sbjct: 483  FAPRLGMRSESLAEPVHVSTLIGEFLVVDQVLRSCLVTIQGYDTRADLIMLDMIDFDVIL 542

Query: 666  GMD*LSGHFAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYL 845
            GMD LS +    DC+ KTV L   G P V        +   IIS I+AR+ + +G   YL
Sbjct: 543  GMDWLSPYHVVLDCYAKTVTLSMPGVPPVLWQAAYSHTPTGIISFIRARRLVASGCLAYL 602

Query: 846  AFIINKPKDET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
            A I +  ++   +  VP+V EY +VF  +L  +PP+R + FAI++ P T  I+  PYRMA
Sbjct: 603  AHIRDVSREGPSVDSVPVVREYEDVFPTDLPGLPPERDIDFAIDLEPATRPISIPPYRMA 662

Query: 1026 HVXXXXXXXXXXXXMEVGFIRPSTSPWEAQCFCQKKKMEHKGC 1154
                          +E GFIR S SPW A    Q  K+  K C
Sbjct: 663  PAELRELSVQLKDLLEKGFIRRSVSPWRAPVL-QLNKVTVKNC 704


>gb|ABB46832.2| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1662

 Score =  242 bits (618), Expect(2) = 2e-88
 Identities = 110/163 (67%), Positives = 142/163 (87%)
 Frame = +2

Query: 1121 LSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIK 1300
            + ++K+ TQR+C+DYR LN+VTIKNKYPLPRID+LFDQL+GAT++SK+DL+ GY+QL+IK
Sbjct: 822  VKRQKDHTQRMCVDYRALNDVTIKNKYPLPRIDDLFDQLKGATVFSKIDLRSGYHQLRIK 881

Query: 1301 KEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSK 1480
            +E IPKTAF TRYGL+E  VM  GLTNAP  FM+LM++VF  YLDKFV++FIDDI+++S+
Sbjct: 882  EEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNQVFMEYLDKFVVVFIDDILIYSR 941

Query: 1481 SREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            ++E HE HL + LE LREH+LYAKFSKCEFWL+EV FLGH+ S
Sbjct: 942  TKEEHEEHLRLALEKLREHQLYAKFSKCEFWLSEVKFLGHVIS 984



 Score =  114 bits (286), Expect(2) = 2e-88
 Identities = 101/352 (28%), Positives = 155/352 (44%), Gaps = 11/352 (3%)
 Frame = +3

Query: 3    SRVEMGLNRIGQIQQSKASGSKRPIVEKTTGPPTKFQRGGIGPANNEKQSV*SITCSHCR 182
            +R+E    RI Q +  + + ++RP +  T GP +    G       ++Q   +   ++ R
Sbjct: 499  NRMEQKKRRIAQFKTQQGN-NQRPRL--TLGPQSMPHGGSSSVVRPQRQFFNNNAGNNIR 555

Query: 183  ----KPGHTPDQ---CWRKQGK----CLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGP 329
                +P   P Q     R+ G     C  CG   H    CP    P      P +S +  
Sbjct: 556  NQALRPVAAPTQQQPAKREHGSKPVVCFNCGDPGHYADKCPK---PRRMKVVPVQSNS-- 610

Query: 330  NNRARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTC 509
                 + ARV  +   E  +A DV+ GT  V+     +LFD G+ HSF++  F       
Sbjct: 611  -TAPASKARVNHVAAAEAQDAPDVILGTFLVNLVPATVLFDSGATHSFLSMSFAGNHGMK 669

Query: 510  MEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGH 689
            +E +   L VSTP   + ++     +  I+I+ V F ANLILL   + D+ILGMD L+ H
Sbjct: 670  VEDLRRPLMVSTPSNQA-LSLQRSPSVRIEIKGVPFLANLILLESKDLDVILGMDWLARH 728

Query: 690  FAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPK 869
                DC N+ V L      VV +   +  S+   ++ I                      
Sbjct: 729  KGVIDCANRKVTLTSNDGRVVTVHALSSESLRSRLNQIT--------------------- 767

Query: 870  DET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
                + E+PIV EY +VF  +L  +PP R ++F I++ PGT  I + PYRMA
Sbjct: 768  ----LEEIPIVREYPDVFPDDLPGMPPKRDIEFRIDLVPGTTPIHKRPYRMA 815


>gb|AAK92604.1|AC078944_15 Putative retroelement [Oryza sativa Japonica Group]
          Length = 1571

 Score =  242 bits (618), Expect(2) = 2e-88
 Identities = 110/163 (67%), Positives = 142/163 (87%)
 Frame = +2

Query: 1121 LSKKKNGTQRLCIDYRGLNNVTIKNKYPLPRIDELFDQLQGATLYSKLDLQQGYYQLKIK 1300
            + ++K+ TQR+C+DYR LN+VTIKNKYPLPRID+LFDQL+GAT++SK+DL+ GY+QL+IK
Sbjct: 830  VKRQKDHTQRMCVDYRALNDVTIKNKYPLPRIDDLFDQLKGATVFSKIDLRSGYHQLRIK 889

Query: 1301 KEGIPKTAFQTRYGLYEFVVMPVGLTNAPVVFMDLMHRVF*PYLDKFVMIFIDDIIVHSK 1480
            +E IPKTAF TRYGL+E  VM  GLTNAP  FM+LM++VF  YLDKFV++FIDDI+++S+
Sbjct: 890  EEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNQVFMEYLDKFVVVFIDDILIYSR 949

Query: 1481 SREHHEHHLIMVLEALREHKLYAKFSKCEFWLNEVTFLGHIRS 1609
            ++E HE HL + LE LREH+LYAKFSKCEFWL+EV FLGH+ S
Sbjct: 950  TKEEHEEHLRLALEKLREHQLYAKFSKCEFWLSEVKFLGHVIS 992



 Score =  114 bits (286), Expect(2) = 2e-88
 Identities = 101/352 (28%), Positives = 155/352 (44%), Gaps = 11/352 (3%)
 Frame = +3

Query: 3    SRVEMGLNRIGQIQQSKASGSKRPIVEKTTGPPTKFQRGGIGPANNEKQSV*SITCSHCR 182
            +R+E    RI Q +  + + ++RP +  T GP +    G       ++Q   +   ++ R
Sbjct: 507  NRMEQKKRRIAQFKTQQGN-NQRPRL--TLGPQSMPHGGSSSVVRPQRQFFNNNAGNNIR 563

Query: 183  ----KPGHTPDQ---CWRKQGK----CLKCGSDQHQLCDCPMISTPENKPAPPYKSGNGP 329
                +P   P Q     R+ G     C  CG   H    CP    P      P +S +  
Sbjct: 564  NQALRPVAAPTQQQPAKREHGSKPVVCFNCGDPGHYADKCPK---PRRMKVVPVQSNS-- 618

Query: 330  NNRARAPARVFALVGQEGPNATDVVKGTL*VSDHRTKILFDPGSYHSFVAPQFVHKLHTC 509
                 + ARV  +   E  +A DV+ GT  V+     +LFD G+ HSF++  F       
Sbjct: 619  -TAPASKARVNHVAAAEAQDAPDVILGTFLVNLVPATVLFDSGATHSFLSMSFAGNHGMK 677

Query: 510  MEHMPIGLEVSTPMGASQITDSIDNTCEIKIENVHFDANLILLPISEYDIILGMD*LSGH 689
            +E +   L VSTP   + ++     +  I+I+ V F ANLILL   + D+ILGMD L+ H
Sbjct: 678  VEDLRRPLMVSTPSNQA-LSLQRSPSVRIEIKGVPFLANLILLESKDLDVILGMDWLARH 736

Query: 690  FAQTDCHNKTVKLCKLGKPVVELSGRARVSMPPIISAIKARKAITNGAHGYLAFIINKPK 869
                DC N+ V L      VV +   +  S+   ++ I                      
Sbjct: 737  KGVIDCANRKVTLTSNDGRVVTVHALSSESLRSRLNQIT--------------------- 775

Query: 870  DET*IHEVPIVSEYLNVFSQELTTIPPDRKVQFAIEIFPGTASIARTPYRMA 1025
                + E+PIV EY +VF  +L  +PP R ++F I++ PGT  I + PYRMA
Sbjct: 776  ----LEEIPIVREYPDVFPDDLPGMPPKRDIEFRIDLVPGTTPIHKRPYRMA 823

