BLASTX nr result

ID: Rehmannia30_contig00004423 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00004423
         (3282 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_019196472.1| PREDICTED: uncharacterized protein LOC109190...   575   0.0  
dbj|BAV56701.1| transposase [Ipomoea nil]                             372   e-111
ref|XP_019150668.1| PREDICTED: uncharacterized protein LOC109147...   370   e-111
ref|XP_024185677.1| uncharacterized protein LOC112190477 [Rosa c...   366   e-109
ref|XP_019177120.1| PREDICTED: uncharacterized protein LOC109172...   365   e-108
ref|XP_011102064.1| uncharacterized protein LOC105180111 isoform...   366   e-104
ref|XP_019174725.1| PREDICTED: uncharacterized protein LOC109170...   365   e-104
ref|XP_011102062.1| uncharacterized protein LOC105180111 isoform...   366   e-104
ref|XP_011084232.1| uncharacterized protein LOC105166542 isoform...   365   e-104
ref|XP_011084230.1| uncharacterized protein LOC105166542 isoform...   365   e-104
ref|XP_011102063.1| uncharacterized protein LOC105180111 isoform...   361   e-102
ref|XP_011084231.1| uncharacterized protein LOC105166542 isoform...   360   e-102
gb|PRQ17143.1| putative Ulp1 protease family catalytic domain, p...   342   2e-99
gb|PRQ17594.1| putative Ulp1 protease family catalytic domain, p...   342   2e-99
ref|XP_012837747.1| PREDICTED: uncharacterized protein LOC105958...   347   2e-97
ref|XP_012837746.1| PREDICTED: uncharacterized protein LOC105958...   347   2e-97
ref|XP_011076941.1| uncharacterized protein LOC105161066 isoform...   344   1e-96
ref|XP_020548914.1| uncharacterized protein LOC105161066 isoform...   344   2e-96
ref|XP_011076937.1| uncharacterized protein LOC105161066 isoform...   344   2e-96
gb|PRQ20360.1| putative Ulp1 protease family catalytic domain, p...   324   2e-93

>ref|XP_019196472.1| PREDICTED: uncharacterized protein LOC109190440 [Ipomoea nil]
 dbj|BAV56710.1| transposase [Ipomoea nil]
          Length = 677

 Score =  575 bits (1482), Expect = 0.0
 Identities = 324/759 (42%), Positives = 462/759 (60%), Gaps = 11/759 (1%)
 Frame = +3

Query: 888  MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046
            MA  RK K    + QKG+ +V SD  H  +G  +++D       S+++Q   STRGRT M
Sbjct: 1    MAGRRKKKIVQQE-QKGQ-EVHSDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56

Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226
            ++LA+Q+A+G+K D++ N+LGQ +G  AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ 
Sbjct: 57   HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116

Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406
            IW++VNL + V  I+KK CLSSA  KWRQYKT LT  F+W              GYGI  
Sbjct: 117  IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176

Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586
            ++W+ FVISRMSE F KLSE+QK +R  N+YPHRLAR+GYA LA EI +ELCDD E+NRA
Sbjct: 177  DEWSQFVISRMSEDFKKLSEQQKVQRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236

Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766
            ILWKKGR +K GEIEGD LK    KID YI+QK++G L+++G  EDILT+ALE++EH GR
Sbjct: 237  ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296

Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946
            VR IGGH+ PSTYFR+  G  P      ++N L+  +  + +          R+ KLE +
Sbjct: 297  VRAIGGHVNPSTYFRLGKGMLP----NHEKNVLLRRQATVED----------RVAKLENL 342

Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALEGSAIPLTFKSKNEVDAYGTI 2126
             ++            +V  K  S +++      D K A++ S   + F  K ++D     
Sbjct: 343  VLQ------------NVAFKSSSIEEKGSCTAKDAKGAMKLSEEEIGFM-KQKLD----- 384

Query: 2127 VHADEPDNFLDDEPIPTNCMYIANNQAMTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLI 2306
                    F DD+                                D++    +D    L 
Sbjct: 385  --------FEDDD--------------------------------DEL--QFIDKEDVLE 402

Query: 2307 KLQNEKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEK 2486
            K   +KPS         K ++ +S MP+ L++ Y   K  L +G+S+ I LD++VFG E 
Sbjct: 403  KQCKKKPSKEV-----KKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEEC 457

Query: 2487 VIHVDLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTD 2663
             ++V   D+T FC+L  IS   I VYIW+LYKKM ED  ++ F F+ P H+GHVPTTRTD
Sbjct: 458  TLYVHDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTD 517

Query: 2664 KSYLQQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRI 2843
            K++L + + +RAR LADRL +   +  +L PCN+G+HWILTVI+  K+ V++ DPL  RI
Sbjct: 518  KNFLDKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINLSKDIVYLWDPLSHRI 577

Query: 2844 RDETWRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHE 3017
            RD+ W+ VV +A+K+ +A    G+KG+ K  WE+++AP QPD  QCGF+VM Y++ ++  
Sbjct: 578  RDDDWKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIEN 637

Query: 3018 CIN-TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131
              +  +K S++ +F++ +Y +A ID VR EWA  +  ++
Sbjct: 638  MPDIDDKDSVQALFQQVEYDKAVIDLVRSEWADILSSYI 676


>dbj|BAV56701.1| transposase [Ipomoea nil]
          Length = 677

 Score =  372 bits (954), Expect = e-111
 Identities = 199/415 (47%), Positives = 268/415 (64%), Gaps = 8/415 (1%)
 Frame = +3

Query: 888  MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046
            MA  RK K    + QKG+ +V SD  H  +G  +++D       S+++Q   STRGRT M
Sbjct: 1    MAGRRKKKIVQQE-QKGQ-EVHSDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56

Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226
            ++LA+Q+A+G+K D++ N+LGQ +G  AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ 
Sbjct: 57   HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116

Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406
            IW++VNL + V  I+KK CLSSA  KWRQYKT LT  F+W              GYGI  
Sbjct: 117  IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176

Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586
            ++W+ FVISRMSE F KLSE+QK RR  N+YPHRLAR+GYA LA EI +ELCDD E+NRA
Sbjct: 177  DEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236

Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766
            ILWKKGR +K GEIEGD LK    KID YI+QK++G L+++G  EDILT+ALE++EH GR
Sbjct: 237  ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296

Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946
            VR IGGH+ PSTYFR+  G  P              K ++  +   ++++  +LE L   
Sbjct: 297  VRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQATVEDRVAKLENLVLQ 345

Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPLTFKSKNEV 2108
             +    +  +EK SC+ K    + K  E +   + +K+  E     L F  K +V
Sbjct: 346  NVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDDDELQFIDKEDV 400



 Score =  243 bits (620), Expect = 8e-65
 Identities = 120/275 (43%), Positives = 182/275 (66%), Gaps = 4/275 (1%)
 Frame = +3

Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498
            EK    K      K ++ +S MP+ L++ Y   K  L +G+S+ I LD++VFG E  ++V
Sbjct: 402  EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 461

Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675
               D+T FC+L  IS   I VYIW+LYKKM ED  ++ F F+ P H+GHVPTTRTDK++L
Sbjct: 462  HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 521

Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855
             + + +RAR LADRL +   +  +L PCN+G+HWILTVI+  K+ V++ DPL  RIRD+ 
Sbjct: 522  DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 581

Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026
            W+ VV +A+K+ +A    G+KG+ K  WE+++AP QPD  QCGF+VM Y++ ++    + 
Sbjct: 582  WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 641

Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131
             +K S++ +F++ +Y +A ID VR EWA  +  ++
Sbjct: 642  DDKDSVQALFQQVEYDKAVIDLVRSEWADILSSYI 676


>ref|XP_019150668.1| PREDICTED: uncharacterized protein LOC109147518 [Ipomoea nil]
 dbj|BAV56708.1| transposase [Ipomoea nil]
          Length = 677

 Score =  370 bits (950), Expect = e-111
 Identities = 198/415 (47%), Positives = 267/415 (64%), Gaps = 8/415 (1%)
 Frame = +3

Query: 888  MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046
            MA  RK K    + QKG+ +V  D  H  +G  +++D       S+++Q   STRGRT M
Sbjct: 1    MAGRRKKKIVQQE-QKGQ-EVHGDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56

Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226
            ++LA+Q+A+G+K D++ N+LGQ +G  AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ 
Sbjct: 57   HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116

Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406
            IW++VNL + V  I+KK CLSSA  KWRQYKT LT  F+W              GYGI  
Sbjct: 117  IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176

Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586
            ++W+ FVISRMSE F KLSE+QK RR  N+YPHRLAR+GYA LA EI +ELCDD E+NRA
Sbjct: 177  DEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236

Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766
            ILWKKGR +K GEIEGD LK    KID YI+QK++G L+++G  EDILT+ALE++EH GR
Sbjct: 237  ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296

Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946
            VR IGGH+ PSTYFR+  G  P              K ++  +   ++++  +LE L   
Sbjct: 297  VRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQATVEDRVAKLENLVLQ 345

Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPLTFKSKNEV 2108
             +    +  +EK SC+ K    + K  E +   + +K+  E     L F  K +V
Sbjct: 346  NVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDDDELQFIDKEDV 400



 Score =  244 bits (622), Expect = 4e-65
 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%)
 Frame = +3

Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498
            EK    K      K ++ +S MP+ L++ Y   K  L +G+S+ I LD++VFG E  ++V
Sbjct: 402  EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 461

Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675
               D+T FC+L  IS   I VYIW+LYKKM ED  ++ F F+ P H+GHVPTTRTDK++L
Sbjct: 462  HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 521

Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855
             + + +RAR LADRL +   +  +L PCN+G+HWILTVI+  K+ V++ DPL  RIRD+ 
Sbjct: 522  DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 581

Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026
            W+ VV +A+K+ +A    G+KG+ K  WE+++AP QPD  QCGF+VM Y++ ++    + 
Sbjct: 582  WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 641

Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131
             +K S++ +F++ +Y +A ID VR EWA  I  ++
Sbjct: 642  DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 676


>ref|XP_024185677.1| uncharacterized protein LOC112190477 [Rosa chinensis]
 gb|PRQ48579.1| putative Ulp1 protease family catalytic domain, putative transposase,
            Ptta/En/Spm, plant [Rosa chinensis]
          Length = 725

 Score =  366 bits (940), Expect = e-109
 Identities = 236/723 (32%), Positives = 373/723 (51%), Gaps = 22/723 (3%)
 Frame = +3

Query: 1029 RGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVP 1208
            RGRT M R+  +  RG K+ +  N  G   GK AAE+ SYIGV+ R  V I  +SW  V 
Sbjct: 29   RGRTSMERIVNRALRGKKSVVEFNPKGVPFGKAAAEMASYIGVIVRTTVPIIVESWPKVE 88

Query: 1209 KDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXX 1388
            KD+K  IW+SV + + + P  +K  LSSA +KWRQ+K+ LT ++V               
Sbjct: 89   KDLKNEIWKSVEMAFVLAPRCRKMVLSSAANKWRQFKSELTTKYVLPYKDQPDALKDPPE 148

Query: 1389 GYG-ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCD 1565
             Y  I Q+DW  FV SR++  F KL  EQKERR      HR++R+GYA L  E++  + +
Sbjct: 149  EYDFIKQQDWEQFVKSRLTTDFQKLHMEQKERRGKLQNAHRMSRKGYAGLEAELKKTMNE 208

Query: 1566 DAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALE 1745
            D E++ A+LWKKGR +KNG I  + + E  A++D  +  + +       +++D+ +  L 
Sbjct: 209  D-ELDLAVLWKKGREDKNGNISHETVGEQAAEMDTLMNNEGDISNSNSRSSDDVRSMGLG 267

Query: 1746 NQEHAGRVRGIGGHITPSTYFRVLIGKKPVD---RRAEQRNELMEAKKLIAE---QGDLI 1907
              EH+ RVR  G  + P+     L  +  +D   +  EQ+    EAK  + E    GD  
Sbjct: 268  TPEHSSRVRSAGECLMPNVSPPQLERESVLDEVRKMIEQQRLWFEAKISLLEAKISGDCP 327

Query: 1908 KEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPL 2084
                     L A           +K  CS K   + N+ D   F  + R+  ++G +  L
Sbjct: 328  ATSITLPTPLLA--------KPSKKGRCSGKTNVEDNEIDSEAFSFVGRREFMKGKSCKL 379

Query: 2085 TFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQAMTESTPLPVKIPMTRDCLD 2264
               S N V ++GTI+  D  ++ +   P+    + +A + A+ E   LP+ +      + 
Sbjct: 380  AVGSINNVVSHGTIIEMDVANHKVHGVPLGEGNIRVAIDNALDEQALLPIPVTGELATVG 439

Query: 2265 DVVGNHVDSPTHLIKLQNEKP---STMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQD 2435
              VG+HV  P HL+KL NE+    S++K  D  N+  +    +P+ L + Y   +  + D
Sbjct: 440  QAVGSHVAWPKHLVKLMNEEERGNSSIKPRDLPNQDVI----LPKSLKLLYRYAERAMTD 495

Query: 2436 GKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSIIVYIWHLYKKMKEDFVDNFL 2615
            G+ IS+ +++ +FG  K +++   D+  F E++ I    I VY+ HLY  +K+  + N +
Sbjct: 496  GEPISVFMEEAIFGIAKTLNIFKEDVMQFMEMKEIPPRCITVYMRHLYDMLKQSNMANMV 555

Query: 2616 -FVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVI 2792
              +DP  I  V    +D          R++ LA RL   S +Q++L P N GYHW+LT+I
Sbjct: 556  GLMDPSSIS-VGEGNSDH---------RSQVLATRLQQGSADQILLVPYNSGYHWMLTII 605

Query: 2793 DPYKETV--------HVLDPLGPRIRDETWRDVVNLALKLFNADKGRKGKKKPQWEVIRA 2948
               KE          + +DPL   +R+E W+ VVN  ++ FN + GR  +K+P W+V+  
Sbjct: 606  SEDKEVCYFMDPLQRYFMDPLRRSMREEEWKYVVNNGIRQFNIETGRGFRKQPLWKVLMG 665

Query: 2949 PIQPDEKQCGFFVMRYMREIL--HECINTNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQ 3122
            P QP   +CG++VMRYM+EI+  H+     K   R   K   YT+ ++DEVR EW   + 
Sbjct: 666  PKQPSNMECGYYVMRYMKEIIEGHDLSFATKWDGR---KLNAYTQTELDEVRCEWTDFVS 722

Query: 3123 DHM 3131
            +++
Sbjct: 723  NYV 725


>ref|XP_019177120.1| PREDICTED: uncharacterized protein LOC109172424 [Ipomoea nil]
          Length = 786

 Score =  365 bits (937), Expect = e-108
 Identities = 185/371 (49%), Positives = 247/371 (66%), Gaps = 1/371 (0%)
 Frame = +3

Query: 999  SRDSQPRVSTRGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVK 1178
            S+++Q   STRGRT M++LA+Q+A+G+K D++ N+LGQ +G  AAELQSYIGVLARE VK
Sbjct: 150  SQETQSTRSTRGRTQMHKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVK 209

Query: 1179 ITYKSWKHVPKDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXX 1358
            + +K+WKHVP+D+K+ IW++VNL + V  I+KK CLSSA  KWRQYKT LT  F+W    
Sbjct: 210  LNFKTWKHVPQDIKDKIWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLN 269

Query: 1359 XXXXXXXXXXGYGISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALA 1538
                      GYGI  ++W+ FVISRMSE F KLSE+QK RR  N+YPHRLAR+GYA LA
Sbjct: 270  DEENLHKPPPGYGIMGDEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLA 329

Query: 1539 EEIESELCDDAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGAT 1718
             EI +ELCDD E+NRAILWKKGR +K GEIEGD LK    KID YI+QK++G L+++G  
Sbjct: 330  SEISTELCDDDEVNRAILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPN 389

Query: 1719 EDILTKALENQEHAGRVRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQG 1898
            EDILT+ALE++EH GRVR IGGH+ PSTYFR+  G  P              K ++  + 
Sbjct: 390  EDILTQALESKEHGGRVRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQ 438

Query: 1899 DLIKEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSA 2075
              ++++  +LE L    +    +  +EK SC+ K    + K  E +   + +K+  E   
Sbjct: 439  ATVEDRVAKLENLVLQNVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDD 498

Query: 2076 IPLTFKSKNEV 2108
              L F  K +V
Sbjct: 499  DELQFIDKEDV 509



 Score =  244 bits (622), Expect = 3e-64
 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%)
 Frame = +3

Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498
            EK    K      K ++ +S MP+ L++ Y   K  L +G+S+ I LD++VFG E  ++V
Sbjct: 511  EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 570

Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675
               D+T FC+L  IS   I VYIW+LYKKM ED  ++ F F+ P H+GHVPTTRTDK++L
Sbjct: 571  HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 630

Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855
             + + +RAR LADRL +   +  +L PCN+G+HWILTVI+  K+ V++ DPL  RIRD+ 
Sbjct: 631  DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 690

Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026
            W+ VV +A+K+ +A    G+KG+ K  WE+++AP QPD  QCGF+VM Y++ ++    + 
Sbjct: 691  WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 750

Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131
             +K S++ +F++ +Y +A ID VR EWA  I  ++
Sbjct: 751  DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 785


>ref|XP_011102064.1| uncharacterized protein LOC105180111 isoform X3 [Sesamum indicum]
          Length = 1254

 Score =  366 bits (939), Expect = e-104
 Identities = 202/301 (67%), Positives = 232/301 (77%), Gaps = 10/301 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AHLKNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312

Query: 873 R 875
           R
Sbjct: 313 R 313


>ref|XP_019174725.1| PREDICTED: uncharacterized protein LOC109170159 [Ipomoea nil]
          Length = 1211

 Score =  365 bits (937), Expect = e-104
 Identities = 185/371 (49%), Positives = 247/371 (66%), Gaps = 1/371 (0%)
 Frame = +3

Query: 999  SRDSQPRVSTRGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVK 1178
            S+++Q   STRGRT M++LA+Q+A+G+K D++ N+LGQ +G  AAELQSYIGVLARE VK
Sbjct: 575  SQETQSTRSTRGRTQMHKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVK 634

Query: 1179 ITYKSWKHVPKDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXX 1358
            + +K+WKHVP+D+K+ IW++VNL + V  I+KK CLSSA  KWRQYKT LT  F+W    
Sbjct: 635  LNFKTWKHVPQDIKDKIWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLN 694

Query: 1359 XXXXXXXXXXGYGISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALA 1538
                      GYGI  ++W+ FVISRMSE F KLSE+QK RR  N+YPHRLAR+GYA LA
Sbjct: 695  DEENLHKPPPGYGIMGDEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLA 754

Query: 1539 EEIESELCDDAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGAT 1718
             EI +ELCDD E+NRAILWKKGR +K GEIEGD LK    KID YI+QK++G L+++G  
Sbjct: 755  SEISTELCDDDEVNRAILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPN 814

Query: 1719 EDILTKALENQEHAGRVRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQG 1898
            EDILT+ALE++EH GRVR IGGH+ PSTYFR+  G  P              K ++  + 
Sbjct: 815  EDILTQALESKEHGGRVRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQ 863

Query: 1899 DLIKEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSA 2075
              ++++  +LE L    +    +  +EK SC+ K    + K  E +   + +K+  E   
Sbjct: 864  ATVEDRVAKLENLVLQNVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDD 923

Query: 2076 IPLTFKSKNEV 2108
              L F  K +V
Sbjct: 924  DELQFIDKEDV 934



 Score =  244 bits (622), Expect = 1e-62
 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%)
 Frame = +3

Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498
            EK    K      K ++ +S MP+ L++ Y   K  L +G+S+ I LD++VFG E  ++V
Sbjct: 936  EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 995

Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675
               D+T FC+L  IS   I VYIW+LYKKM ED  ++ F F+ P H+GHVPTTRTDK++L
Sbjct: 996  HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 1055

Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855
             + + +RAR LADRL +   +  +L PCN+G+HWILTVI+  K+ V++ DPL  RIRD+ 
Sbjct: 1056 DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 1115

Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026
            W+ VV +A+K+ +A    G+KG+ K  WE+++AP QPD  QCGF+VM Y++ ++    + 
Sbjct: 1116 WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 1175

Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131
             +K S++ +F++ +Y +A ID VR EWA  I  ++
Sbjct: 1176 DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 1210


>ref|XP_011102062.1| uncharacterized protein LOC105180111 isoform X1 [Sesamum indicum]
          Length = 1301

 Score =  366 bits (939), Expect = e-104
 Identities = 202/301 (67%), Positives = 232/301 (77%), Gaps = 10/301 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AHLKNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312

Query: 873 R 875
           R
Sbjct: 313 R 313


>ref|XP_011084232.1| uncharacterized protein LOC105166542 isoform X3 [Sesamum indicum]
          Length = 1254

 Score =  365 bits (937), Expect = e-104
 Identities = 201/301 (66%), Positives = 232/301 (77%), Gaps = 10/301 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AH+KNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312

Query: 873 R 875
           R
Sbjct: 313 R 313


>ref|XP_011084230.1| uncharacterized protein LOC105166542 isoform X1 [Sesamum indicum]
          Length = 1301

 Score =  365 bits (937), Expect = e-104
 Identities = 201/301 (66%), Positives = 232/301 (77%), Gaps = 10/301 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AH+KNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312

Query: 873 R 875
           R
Sbjct: 313 R 313


>ref|XP_011102063.1| uncharacterized protein LOC105180111 isoform X2 [Sesamum indicum]
          Length = 1297

 Score =  361 bits (927), Expect = e-102
 Identities = 200/296 (67%), Positives = 229/296 (77%), Gaps = 10/296 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AHLKNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDS 860
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDS 308


>ref|XP_011084231.1| uncharacterized protein LOC105166542 isoform X2 [Sesamum indicum]
          Length = 1297

 Score =  360 bits (925), Expect = e-102
 Identities = 199/296 (67%), Positives = 229/296 (77%), Gaps = 10/296 (3%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+  TLSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335
            K  RSGDL+RVLG S         FG AH+KNSS G         AVEELKRLRASVAD
Sbjct: 83  PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133

Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512
           TC KAS RA            Y E++SSKKQQQ N+M+TN+RS  STLKIGSL+HR  TE
Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192

Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692
           FGS+K DDRPK+VG++ RLRTSVAETRA+C  SG LRQPL V+KERDLLKD+NAD D+V+
Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252

Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDS 860
            K RR PAGGE  +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS
Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDS 308


>gb|PRQ17143.1| putative Ulp1 protease family catalytic domain, putative transposase,
            Ptta/En/Spm, plant [Rosa chinensis]
          Length = 775

 Score =  342 bits (877), Expect = 2e-99
 Identities = 240/789 (30%), Positives = 394/789 (49%), Gaps = 41/789 (5%)
 Frame = +3

Query: 879  LDSMAALRKLKGGHDDVQKGKADVS---SDASHEEDGDSMEIDSRDSQPRVSTRGRTH-- 1043
            + S   +RK   G    +K   + S   +D   EE  +S+  ++  S   V TRG+ +  
Sbjct: 1    MGSKKGIRKSPRGKKLKRKADLETSHPETDEVLEEKEESVSANTITSTESVKTRGKRNVV 60

Query: 1044 -MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVK 1220
             +Y++ ++KA G K  +   + G   G+    LQSYIG+LAR  V I   SW  V  D+K
Sbjct: 61   ALYKVLVKKALGKKFKVSYTETGNPNGRIRHTLQSYIGMLARTKVPINIVSWPEVDGDLK 120

Query: 1221 ELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG- 1397
            + +W  V   + V P  KK  L+SAG+KWR +KT LTR++V                Y  
Sbjct: 121  DKLWLDVQETFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYNF 180

Query: 1398 ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEI 1577
            + +E W  FV  R +E F++L  EQ ER     Y HRL+R+GY  L EE+   L +   I
Sbjct: 181  VGREPWKDFVKERTTEKFLQLHNEQSERVKKRKYHHRLSRKGYIGLEEELRKTLPEGEVI 240

Query: 1578 NRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEH 1757
            +RAI+WKK R  K+G+I+ +  +    KID+ +E+K +G+L+I G   D+L++ALE  EH
Sbjct: 241  DRAIMWKKARQRKDGDID-EEARGVATKIDDLLEKKSKGELEISG-NSDVLSQALETPEH 298

Query: 1758 AGRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLIAEQGDLIKE- 1913
            +GRVRG+GG + PSTYF      R+ I K  +  R  +R+ EL E KK++A Q    +E 
Sbjct: 299  SGRVRGVGGFVNPSTYFKMPKQKRIRITKAELLARDRERDRELEETKKMLAAQQARTEEL 358

Query: 1914 QNVRLEKLEAIFIKK---------YDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALE 2066
             + ++ +LEA+   K         +    +  +  S K   Q  +     ++ + KV  E
Sbjct: 359  LHKKIAQLEALITGKTPYTSPLNVHVVGENTISPISDKGSFQDIRRNTSNILDEPKVKQE 418

Query: 2067 -------------GSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQA 2207
                         G    L   + + + A+GT+   ++    +   P+   C+ ++ + A
Sbjct: 419  VDDCEVVPPPTEMGGTCELAVDTISNIVAFGTVFDEEDVSRVIHGVPMKEGCVRVSVDGA 478

Query: 2208 MTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVNSD 2381
            + E   LP  +    + +   VG+HV  P  L+  +  K    K+   K       +N  
Sbjct: 479  IQEEARLPFPVGDEMELVGQAVGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDKAELNPF 538

Query: 2382 MPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSII 2558
            +P+   + Y   K ++ Q  +SI   LDD VFG +K + +   ++    E+  I    I 
Sbjct: 539  VPKRCKLLYKHAKTIMSQTNESIRTMLDDSVFGVQKQLFILTENVIDLLEMNKIGQGVIA 598

Query: 2559 VYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASI 2735
             Y+ +L++ ++E D +D F F+DP       T   ++S           +L +RL     
Sbjct: 599  AYMANLHETLRERDELDTFGFIDP-----AATYMCERSEF-------VPYLVNRLKEGKG 646

Query: 2736 NQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLALKLFNADKGRKG 2915
            +++ L   N G HWILT+I  +++ ++++DPL   +  + W + V  A+K +NA+KGR  
Sbjct: 647  DRIFLMAYNPGEHWILTII--WEDEIYIVDPLPKPVHYKPWENAVINAVKTYNAEKGRVT 704

Query: 2916 KKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFKKGDYTRAQIDEV 3095
            K      +  AP QP   +CG++VMRYM++I+++   +         +KG YT+ Q+DEV
Sbjct: 705  KVPKLKLLPGAPKQPGGVECGYYVMRYMKDIINDDTLSFSTKWAVKDRKG-YTQQQLDEV 763

Query: 3096 RLEWAKCIQ 3122
            R+E A  +Q
Sbjct: 764  RIEVADYLQ 772


>gb|PRQ17594.1| putative Ulp1 protease family catalytic domain, putative transposase,
            Ptta/En/Spm, plant [Rosa chinensis]
          Length = 775

 Score =  342 bits (876), Expect = 2e-99
 Identities = 239/789 (30%), Positives = 396/789 (50%), Gaps = 41/789 (5%)
 Frame = +3

Query: 879  LDSMAALRKLKGGHDDVQKGKADVS---SDASHEEDGDSMEIDSRDSQPRVSTRGRTH-- 1043
            + S   +RK   G    +K   + S   +D   EE  + +  ++  S   V TRG+ +  
Sbjct: 1    MGSKKGIRKSPRGKKLKRKADLETSQPETDEVLEEKEEFVSANTITSTESVKTRGKRNVV 60

Query: 1044 -MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVK 1220
             +Y++ ++KA G K  +   + G   G+    LQSYIG+LAR  V I   SW  V  D+K
Sbjct: 61   ALYKVLVKKALGKKFKVSYTETGNPNGRIRHTLQSYIGMLARTKVPINIVSWPEVDGDLK 120

Query: 1221 ELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG- 1397
            + +W  V   + V P  KK  L+SAG+KWR +KT LTR++V                Y  
Sbjct: 121  DKLWLDVQETFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYNF 180

Query: 1398 ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEI 1577
            + +E W  FV  R +E F++L  EQ ER     Y HRL+R+GY  L EE+   L +   I
Sbjct: 181  VGREPWKDFVKERTTEKFLQLHNEQSERVKKRKYHHRLSRKGYIGLEEELRKTLPEGEVI 240

Query: 1578 NRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEH 1757
            +RAI+WKK R  K+G+I+ +  +    KID+ +E+K +G+L+I G++ D+L++ALE  EH
Sbjct: 241  DRAIMWKKARQRKDGDID-EEARGVATKIDDLLEKKSKGELEISGSS-DVLSQALETPEH 298

Query: 1758 AGRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLIAEQGDLIKEQ 1916
            +GRVRG+GG + PSTYF      R+ I K  +  R  +R+ EL E KK++A Q    +E+
Sbjct: 299  SGRVRGVGGFVNPSTYFKMPKQKRIRITKAELLARDRERDRELEETKKMLAAQQARTEER 358

Query: 1917 -NVRLEKLEAIFIKK---------YDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALE 2066
             + ++ +LEA+   K         +    +  +  S K   Q  +     ++ + KV  E
Sbjct: 359  LHKKIAQLEALITGKTPYTSPLNVHVVGENTISPISDKGSFQDIRRNTSNILDEPKVKQE 418

Query: 2067 -------------GSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQA 2207
                         G    L   + + + A+GT+   ++    +   P+   C+ ++ + A
Sbjct: 419  VDDCEVVPPPTEMGGTCELAVDTISNIVAFGTVFDEEDVSRVIHGVPMKEGCVRVSVDGA 478

Query: 2208 MTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVNSD 2381
            + E   LP  +    + +   VG+HV  P  L+  +  K    K+   K       +N  
Sbjct: 479  IQEEARLPFPVGDEMELVGQAVGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDKAELNPF 538

Query: 2382 MPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSII 2558
            +P+   + Y   K ++ Q  +SI   LDD VFG +K + +   ++    E+  I    I 
Sbjct: 539  VPKRCKLLYKHAKTIMSQTNESIRTMLDDSVFGVQKQLFILTENVIDLLEMNKIGQGVIA 598

Query: 2559 VYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASI 2735
             Y+ +L++ ++E D +D F F+DP       T   ++S           +L +RL     
Sbjct: 599  AYMANLHETLRERDELDTFGFIDP-----AATYMCERSEF-------VPYLVNRLKEGKG 646

Query: 2736 NQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLALKLFNADKGRKG 2915
            +++ L   N G HWILT+I  +++ ++++DPL   +  + W + V  A+K +NA+KGR  
Sbjct: 647  DRIFLMAYNPGEHWILTII--WEDEIYIVDPLPKPVHYKPWENAVINAVKTYNAEKGRVT 704

Query: 2916 KKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFKKGDYTRAQIDEV 3095
            K      +  AP QP   +CG++VMRYM++I+++   +         +KG YT+ Q+DEV
Sbjct: 705  KVPKLKLLPGAPKQPGGVECGYYVMRYMKDIINDDTLSFSTKWAVKDRKG-YTQQQLDEV 763

Query: 3096 RLEWAKCIQ 3122
            R+E A  +Q
Sbjct: 764  RIEVADYLQ 772


>ref|XP_012837747.1| PREDICTED: uncharacterized protein LOC105958287 isoform X2
           [Erythranthe guttata]
          Length = 1261

 Score =  347 bits (889), Expect = 2e-97
 Identities = 190/292 (65%), Positives = 224/292 (76%), Gaps = 1/292 (0%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGYS  TL+RSTSFRE  D++NF SGK+NSRGSA++SGD   L+QCL+L+PV + D
Sbjct: 23  QNGQRGYSAATLDRSTSFREGTDSKNFTSGKANSRGSASSSGDVTALTQCLMLDPVALCD 82

Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362
            K+ RS +L+R+LGFS G    +NS S    +     AVEELKRLRASVADTCVKAS RA
Sbjct: 83  LKHPRSNELKRLLGFSVGSGSEENSFSAAHLKNTSPVAVEELKRLRASVADTCVKASGRA 142

Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539
                       + ES+SSKKQQQ NE+LTN+RSS S LK GSL+HR  +EFG++K DDR
Sbjct: 143 KKLDDHLSKLNKFVESVSSKKQQQRNEILTNERSSGSNLKSGSLMHRNPSEFGNQKFDDR 202

Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719
           PKN G++ RLRTSVAETRA+C  +GVLRQ L VTKERDLLKD +ADSDIV+ K RR PAG
Sbjct: 203 PKNGGVNKRLRTSVAETRAECRNNGVLRQSLMVTKERDLLKDVSADSDIVEEKIRRLPAG 262

Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875
           GE  +KKMK K SVGAV SRSV+NDGELKRTMH+KL  ESSLQSSDS  SFR
Sbjct: 263 GEGWDKKMKRKRSVGAVFSRSVDNDGELKRTMHNKLTNESSLQSSDSNLSFR 314


>ref|XP_012837746.1| PREDICTED: uncharacterized protein LOC105958287 isoform X1
           [Erythranthe guttata]
          Length = 1262

 Score =  347 bits (889), Expect = 2e-97
 Identities = 190/292 (65%), Positives = 224/292 (76%), Gaps = 1/292 (0%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGYS  TL+RSTSFRE  D++NF SGK+NSRGSA++SGD   L+QCL+L+PV + D
Sbjct: 23  QNGQRGYSAATLDRSTSFREGTDSKNFTSGKANSRGSASSSGDVTALTQCLMLDPVALCD 82

Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362
            K+ RS +L+R+LGFS G    +NS S    +     AVEELKRLRASVADTCVKAS RA
Sbjct: 83  LKHPRSNELKRLLGFSVGSGSEENSFSAAHLKNTSPVAVEELKRLRASVADTCVKASGRA 142

Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539
                       + ES+SSKKQQQ NE+LTN+RSS S LK GSL+HR  +EFG++K DDR
Sbjct: 143 KKLDDHLSKLNKFVESVSSKKQQQRNEILTNERSSGSNLKSGSLMHRNPSEFGNQKFDDR 202

Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719
           PKN G++ RLRTSVAETRA+C  +GVLRQ L VTKERDLLKD +ADSDIV+ K RR PAG
Sbjct: 203 PKNGGVNKRLRTSVAETRAECRNNGVLRQSLMVTKERDLLKDVSADSDIVEEKIRRLPAG 262

Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875
           GE  +KKMK K SVGAV SRSV+NDGELKRTMH+KL  ESSLQSSDS  SFR
Sbjct: 263 GEGWDKKMKRKRSVGAVFSRSVDNDGELKRTMHNKLTNESSLQSSDSNLSFR 314


>ref|XP_011076941.1| uncharacterized protein LOC105161066 isoform X3 [Sesamum indicum]
          Length = 1264

 Score =  344 bits (883), Expect = 1e-96
 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGYS   L RS+SFRE +++RN AS K NSRGSAT+SGD  +LSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362
            KY RSGDLRRVLGFS G    + +S           AVEELKRLRASVADTCVKAS R 
Sbjct: 83  PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134

Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539
                       +FE+M  KKQQQ NE+L N+RSS STLKIGS IHR  +E  S+K +DR
Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194

Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719
           PKN G++ RLRTSVAETRA+C  +GVLRQPL  TKERD+ KD+NADSD+V+ K RR PAG
Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253

Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875
           GE  +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR
Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305


>ref|XP_020548914.1| uncharacterized protein LOC105161066 isoform X2 [Sesamum indicum]
          Length = 1294

 Score =  344 bits (883), Expect = 2e-96
 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGYS   L RS+SFRE +++RN AS K NSRGSAT+SGD  +LSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362
            KY RSGDLRRVLGFS G    + +S           AVEELKRLRASVADTCVKAS R 
Sbjct: 83  PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134

Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539
                       +FE+M  KKQQQ NE+L N+RSS STLKIGS IHR  +E  S+K +DR
Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194

Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719
           PKN G++ RLRTSVAETRA+C  +GVLRQPL  TKERD+ KD+NADSD+V+ K RR PAG
Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253

Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875
           GE  +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR
Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305


>ref|XP_011076937.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum]
 ref|XP_011076938.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum]
 ref|XP_011076939.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum]
          Length = 1297

 Score =  344 bits (883), Expect = 2e-96
 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%)
 Frame = +3

Query: 3   QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182
           Q+GQRGYS   L RS+SFRE +++RN AS K NSRGSAT+SGD  +LSQCL+LEP+V+ D
Sbjct: 23  QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82

Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362
            KY RSGDLRRVLGFS G    + +S           AVEELKRLRASVADTCVKAS R 
Sbjct: 83  PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134

Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539
                       +FE+M  KKQQQ NE+L N+RSS STLKIGS IHR  +E  S+K +DR
Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194

Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719
           PKN G++ RLRTSVAETRA+C  +GVLRQPL  TKERD+ KD+NADSD+V+ K RR PAG
Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253

Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875
           GE  +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR
Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305


>gb|PRQ20360.1| putative Ulp1 protease family catalytic domain, putative transposase,
            Ptta/En/Spm, plant [Rosa chinensis]
          Length = 724

 Score =  324 bits (830), Expect = 2e-93
 Identities = 230/741 (31%), Positives = 363/741 (48%), Gaps = 48/741 (6%)
 Frame = +3

Query: 1044 MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKE 1223
            MY++ ++KA G K  +     G   G     LQSYIG+LAR  V I   SW +V  D+K 
Sbjct: 1    MYKVLVKKALGKKFKVTYTDTGNLNGSIRHTLQSYIGMLARTKVPINIVSWPNVDGDLKN 60

Query: 1224 LIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG-I 1400
             +W  V   + V P  KK  L+SAG+KWR +KT LTR++V                Y  +
Sbjct: 61   KLWLDVKDTFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYAFV 120

Query: 1401 SQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEIN 1580
             ++ W  FV  R +E +++L  +Q ER     Y HRL+R+GY  L EE++  L +   I+
Sbjct: 121  GRQPWRQFVKERTTEKWLELHNKQSERVRKRKYHHRLSRKGYIGLEEELKKTLPEGEVID 180

Query: 1581 RAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHA 1760
             AI+WKK R  K+G+   +  +  V KID+ +E+K +G+L+I G++ D+L++ALE  EH+
Sbjct: 181  CAIMWKKARQRKDGD-RDEKARAVVTKIDDLLEKKSKGELEISGSS-DVLSQALETLEHS 238

Query: 1761 GRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLI-AEQGDLIKEQ 1916
            GRVRG+GG I PSTYF      R+ I K  +  R  +R+ EL E KK++ A+Q    +  
Sbjct: 239  GRVRGVGGFINPSTYFKLPKLKRIRITKADLLARDRERDRELEETKKMLTAQQAKAEELL 298

Query: 1917 NVRLEKLEAIFIKKYDTDNDEK------ASCSVKP------------KHQSNKDEA---- 2030
            N R+  LE +   K  T N           C + P               +N DEA    
Sbjct: 299  NKRIAALEVMITGK--TPNTPPLNVHVLGDCRISPISDKGSIHDRTLNTSNNLDEAKVKE 356

Query: 2031 ---DFVILDRKVALEGSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANN 2201
               D  ++     + G    L   + N + A+GT+   ++ +  +   P+   C+ ++ +
Sbjct: 357  EVQDCEVVPPPTEM-GGTCELAVDTINNIVAFGTVFEDEDVNRMIHGVPLKEGCVRVSVD 415

Query: 2202 QAMTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVN 2375
             A+     LP  +      +   +G+HV  P  L+  +  K    K+   K       +N
Sbjct: 416  GAIQAEARLPFLVEGEMGLVGQAIGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDQAELN 475

Query: 2376 SDMPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYS 2552
            S +P+   + Y   K ++ Q  + IS  LDD VFG  K + +   ++T   E++ I    
Sbjct: 476  SFVPKRCKLLYKHAKTIMSQTSELISTVLDDKVFGLHKELFILTENVTDLLEMKKIGQGV 535

Query: 2553 IIVYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNA 2729
            I  Y+ HL++ + E D +D F F+DP       T   ++S           +L DRL   
Sbjct: 536  IAAYMAHLHETLTERDELDTFTFIDP-----AATYNCERS-------GFGPYLVDRLKEG 583

Query: 2730 SINQVVLAPCNIG----------YHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLA 2879
              +++   P N G           HWILT+I  +++ V++LDPL   +    W   V  A
Sbjct: 584  KADRIFFMPYNPGCIMWAMKYYKEHWILTII--WEDEVYILDPLPNPVHYTAWETAVMNA 641

Query: 2880 LKLFNADKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFK 3059
            +K +NA+KGR  K      +   P QP   +CG++VMRYM++I+++   +         +
Sbjct: 642  VKSYNAEKGRANKVPKLRLLPGVPKQPGGIECGYYVMRYMKDIINDDTLSFSTKWAVKTR 701

Query: 3060 KGDYTRAQIDEVRLEWAKCIQ 3122
            KG YT+ Q+DEVR+E A  +Q
Sbjct: 702  KG-YTQQQLDEVRMEVANYLQ 721

