BLASTX nr result

ID: Chrysanthemum22_contig00016769 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00016769
         (2666 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI12220.1| hypothetical protein Ccrd_009354 [Cynara carduncu...   646   0.0  
ref|XP_022036223.1| uncharacterized protein LOC110938060 isoform...   597   0.0  
ref|XP_022036222.1| uncharacterized protein LOC110938060 isoform...   597   0.0  
ref|XP_021970383.1| uncharacterized protein LOC110865445 isoform...   592   0.0  
ref|XP_021970382.1| uncharacterized protein LOC110865445 isoform...   592   0.0  
ref|XP_023771266.1| uncharacterized protein LOC111919938 isoform...   530   e-168
ref|XP_023771265.1| uncharacterized protein LOC111919938 isoform...   530   e-168
gb|KVI01149.1| hypothetical protein Ccrd_020564 [Cynara carduncu...   509   e-159
ref|XP_017241512.1| PREDICTED: uncharacterized protein LOC108214...   464   e-142
gb|OIT08269.1| hypothetical protein A4A49_17381 [Nicotiana atten...   447   e-136
ref|XP_016486861.1| PREDICTED: uncharacterized protein LOC107807...   447   e-135
ref|XP_016486853.1| PREDICTED: uncharacterized protein LOC107807...   447   e-135
ref|XP_009797848.1| PREDICTED: uncharacterized protein LOC104244...   447   e-135
ref|XP_019258758.1| PREDICTED: uncharacterized protein LOC109236...   447   e-135
ref|XP_011093773.1| uncharacterized protein LOC105173644 isoform...   444   e-134
ref|XP_011093770.1| uncharacterized protein LOC105173644 isoform...   444   e-134
gb|KZV50855.1| hypothetical protein F511_27621 [Dorcoceras hygro...   441   e-133
ref|XP_021275591.1| uncharacterized protein LOC110410286 [Herran...   437   e-132
ref|XP_017982811.1| PREDICTED: uncharacterized protein LOC185883...   437   e-132
gb|EOY30366.1| Serine/arginine repetitive matrix protein 2 isofo...   437   e-132

>gb|KVI12220.1| hypothetical protein Ccrd_009354 [Cynara cardunculus var. scolymus]
          Length = 1294

 Score =  646 bits (1666), Expect = 0.0
 Identities = 333/449 (74%), Positives = 365/449 (81%), Gaps = 7/449 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MASSGKFDLSS SPDRPLYNSAQRG SYT ASLDRSSSFRENMENPI             
Sbjct: 1    MASSGKFDLSSVSPDRPLYNSAQRG-SYTAASLDRSSSFRENMENPILSSLPSMSRSTST 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXXX 1700
            VTQ DVTNFL CLRFDPK MA EHKFNR+GDFKRLASAV+GSPD                
Sbjct: 60   VTQVDVTNFLHCLRFDPKLMAAEHKFNRHGDFKRLASAVLGSPDASPSGSSKGKLPSSSP 119

Query: 1701 DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLLN 1880
            +DLKRL+AGLRES+IKARERVKVFSETLSV+NKCFPSIPSRKRSRPD+LPGDR+SGLLLN
Sbjct: 120  EDLKRLKAGLRESTIKARERVKVFSETLSVINKCFPSIPSRKRSRPDALPGDRSSGLLLN 179

Query: 1881 RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPAR 2060
            R+P G GVGKMGTQSHS    FDFE  KVEERGK+AIPNKRTRTS+VDQRAEVRPNTPAR
Sbjct: 180  RAPTGPGVGKMGTQSHSLTSAFDFEPQKVEERGKNAIPNKRTRTSMVDQRAEVRPNTPAR 239

Query: 2061 SSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLSTK 2240
            S+G++DRDKE LRLPNSNA QG+DR L +V DGWEKAKMKKKRTGIK DAA SPSS+STK
Sbjct: 240  SAGNVDRDKEVLRLPNSNALQGDDRALPIVADGWEKAKMKKKRTGIKVDAAPSPSSVSTK 299

Query: 2241 AVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---MRSSIP 2411
            A+DGYRE KQG HPRHLPDAMSRLND+ GFRP               + Q    +RSSIP
Sbjct: 300  AIDGYREPKQGMHPRHLPDAMSRLNDSHGFRPGAANGVVGGGKADGSSQQASVGIRSSIP 359

Query: 2412 RPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSG 2579
            RP+QEN+ L+HDKRDRSTSSEK    +RS+NKSNVR+EFISGSPTSSTK+HA ARGPRSG
Sbjct: 360  RPEQENTSLLHDKRDRSTSSEKERTNLRSINKSNVREEFISGSPTSSTKMHATARGPRSG 419

Query: 2580 SSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            S++V KSS VVQRAT SSDW+L HGT+KN
Sbjct: 420  SNIVPKSSTVVQRATASSDWELTHGTNKN 448


>ref|XP_022036223.1| uncharacterized protein LOC110938060 isoform X2 [Helianthus annuus]
 gb|OTG29794.1| hypothetical protein HannXRQ_Chr04g0126081 [Helianthus annuus]
          Length = 1258

 Score =  597 bits (1539), Expect = 0.0
 Identities = 319/450 (70%), Positives = 348/450 (77%), Gaps = 8/450 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MASSGKFDLSS SPDRPLY+SAQRG SY+ ASLDRSSSFRENMENPI             
Sbjct: 1    MASSGKFDLSSVSPDRPLYSSAQRG-SYSAASLDRSSSFRENMENPILSSLPSMSRSNST 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXXX 1700
            VTQ DVTNF QCLRFDPK+MA EHKFNRYGDFKRLASAV+GSPDE               
Sbjct: 60   VTQVDVTNFFQCLRFDPKSMAAEHKFNRYGDFKRLASAVVGSPDESPSGLLKGKLRNSSP 119

Query: 1701 DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLLN 1880
            +DLKRL+AGLRES+IKARERVKVFSETLSV+NKCFPSIPSRKRSRPD+LPGDR+SGLLLN
Sbjct: 120  EDLKRLKAGLRESTIKARERVKVFSETLSVINKCFPSIPSRKRSRPDALPGDRSSGLLLN 179

Query: 1881 RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPAR 2060
            R+PMGAGVGKMGTQSHS  +TFDFEQ KVEERGK+AIPNKRTRTS+VDQRAEVRPNTPAR
Sbjct: 180  RAPMGAGVGKMGTQSHSLTNTFDFEQQKVEERGKNAIPNKRTRTSMVDQRAEVRPNTPAR 239

Query: 2061 SSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLSTK 2240
            SSG          LPN N  Q EDR+L +V DGWEK KMKKKRTGIKAD A SP+S+STK
Sbjct: 240  SSG----------LPNGNTPQSEDRSLHIVADGWEKTKMKKKRTGIKADNAPSPNSMSTK 289

Query: 2241 AVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---MRSSIP 2411
            A++GY+E KQG H RHL D+MSRLND+ G RP                 Q    +RSSIP
Sbjct: 290  AINGYKEPKQGMHARHLSDSMSRLNDSHGLRPGAANGIIGGVKIDGPAQQASVGIRSSIP 349

Query: 2412 RPDQENSPLMHDKRDRSTSSEK----VRSV-NKSNVRDEFISGSPTSSTKLHANARGPRS 2576
            RP+QE + L HDKRDRSTSSEK    +RSV NKS+ RDEFISGSPTS TKLH  ARGPRS
Sbjct: 350  RPEQETTSLHHDKRDRSTSSEKERTNLRSVNNKSSFRDEFISGSPTSGTKLHGPARGPRS 409

Query: 2577 GSSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            GSSVV KSS VVQRA  SSDWD+ H T KN
Sbjct: 410  GSSVVPKSSTVVQRANASSDWDMGHVTHKN 439


>ref|XP_022036222.1| uncharacterized protein LOC110938060 isoform X1 [Helianthus annuus]
          Length = 1259

 Score =  597 bits (1539), Expect = 0.0
 Identities = 319/450 (70%), Positives = 348/450 (77%), Gaps = 8/450 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MASSGKFDLSS SPDRPLY+SAQRG SY+ ASLDRSSSFRENMENPI             
Sbjct: 1    MASSGKFDLSSVSPDRPLYSSAQRG-SYSAASLDRSSSFRENMENPILSSLPSMSRSNST 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXXX 1700
            VTQ DVTNF QCLRFDPK+MA EHKFNRYGDFKRLASAV+GSPDE               
Sbjct: 60   VTQVDVTNFFQCLRFDPKSMAAEHKFNRYGDFKRLASAVVGSPDESPSGLLKGKLRNSSP 119

Query: 1701 DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLLN 1880
            +DLKRL+AGLRES+IKARERVKVFSETLSV+NKCFPSIPSRKRSRPD+LPGDR+SGLLLN
Sbjct: 120  EDLKRLKAGLRESTIKARERVKVFSETLSVINKCFPSIPSRKRSRPDALPGDRSSGLLLN 179

Query: 1881 RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPAR 2060
            R+PMGAGVGKMGTQSHS  +TFDFEQ KVEERGK+AIPNKRTRTS+VDQRAEVRPNTPAR
Sbjct: 180  RAPMGAGVGKMGTQSHSLTNTFDFEQQKVEERGKNAIPNKRTRTSMVDQRAEVRPNTPAR 239

Query: 2061 SSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLSTK 2240
            SSG          LPN N  Q EDR+L +V DGWEK KMKKKRTGIKAD A SP+S+STK
Sbjct: 240  SSG----------LPNGNTPQSEDRSLHIVADGWEKTKMKKKRTGIKADNAPSPNSMSTK 289

Query: 2241 AVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---MRSSIP 2411
            A++GY+E KQG H RHL D+MSRLND+ G RP                 Q    +RSSIP
Sbjct: 290  AINGYKEPKQGMHARHLSDSMSRLNDSHGLRPGAANGIIGGVKIDGPAQQASVGIRSSIP 349

Query: 2412 RPDQENSPLMHDKRDRSTSSEK----VRSV-NKSNVRDEFISGSPTSSTKLHANARGPRS 2576
            RP+QE + L HDKRDRSTSSEK    +RSV NKS+ RDEFISGSPTS TKLH  ARGPRS
Sbjct: 350  RPEQETTSLHHDKRDRSTSSEKERTNLRSVNNKSSFRDEFISGSPTSGTKLHGPARGPRS 409

Query: 2577 GSSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            GSSVV KSS VVQRA  SSDWD+ H T KN
Sbjct: 410  GSSVVPKSSTVVQRANASSDWDMGHVTHKN 439


>ref|XP_021970383.1| uncharacterized protein LOC110865445 isoform X2 [Helianthus annuus]
 gb|OTG23059.1| hypothetical protein HannXRQ_Chr06g0178201 [Helianthus annuus]
          Length = 1238

 Score =  592 bits (1527), Expect = 0.0
 Identities = 316/454 (69%), Positives = 349/454 (76%), Gaps = 12/454 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSA-SPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXX 1517
            MASSGKFD+SS+ S DR LYNSA+  GSYT ASLDRSSSF +NMENPI            
Sbjct: 1    MASSGKFDMSSSVSTDRSLYNSAR--GSYTAASLDRSSSFHDNMENPILSSLPSMSRSAS 58

Query: 1518 XVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXX 1697
             VTQ DVTNF QCLRFDPK+MA E KF+R+GDFKRLASAV GSPDE              
Sbjct: 59   TVTQVDVTNFFQCLRFDPKSMAAELKFHRHGDFKRLASAVFGSPDESPSGSLKGNLSNFS 118

Query: 1698 XDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLL 1877
             +D+KR +AGLRES+IKARERVK FSETLSV+NKCFPSIPSRKRSRPD LPGDR+SGLLL
Sbjct: 119  PEDVKRFKAGLRESTIKARERVKAFSETLSVINKCFPSIPSRKRSRPDVLPGDRSSGLLL 178

Query: 1878 NRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPA 2057
            NR+PMG GVGKMGTQSHS  H FDFE  KVEERGK+ IPNKRTRTS+VDQR EVRPNTPA
Sbjct: 179  NRAPMGQGVGKMGTQSHSLTHAFDFENQKVEERGKNVIPNKRTRTSMVDQRVEVRPNTPA 238

Query: 2058 RSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLST 2237
            RSS ++DRDKE  RLPN  ++QGEDR+L +V DGWEKAKMKKKRTGIKAD A+SP S+ST
Sbjct: 239  RSSVNMDRDKETSRLPNGTSSQGEDRSLPIVADGWEKAKMKKKRTGIKADTAASPRSMST 298

Query: 2238 KAVDGYRESKQG----THPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---M 2396
            KAVDGYRE+KQG     H RHLPDAMSRLND+ G RP                 Q    M
Sbjct: 299  KAVDGYREAKQGVHARVHARHLPDAMSRLNDSHGLRPGTANGTVGGGKTDGPAQQASMGM 358

Query: 2397 RSSIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANAR 2564
            RSSIPRP+QE + L+HDKRDRSTSSEK    +RSVNKSNVR+EFISGSPTSSTKLH  AR
Sbjct: 359  RSSIPRPEQETTALLHDKRDRSTSSEKERTNLRSVNKSNVREEFISGSPTSSTKLHGPAR 418

Query: 2565 GPRSGSSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            GPRSG     K S V+QRAT SSDWDLVHGT+KN
Sbjct: 419  GPRSGP----KLSTVIQRATASSDWDLVHGTNKN 448


>ref|XP_021970382.1| uncharacterized protein LOC110865445 isoform X1 [Helianthus annuus]
          Length = 1239

 Score =  592 bits (1527), Expect = 0.0
 Identities = 316/454 (69%), Positives = 349/454 (76%), Gaps = 12/454 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSA-SPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXX 1517
            MASSGKFD+SS+ S DR LYNSA+  GSYT ASLDRSSSF +NMENPI            
Sbjct: 1    MASSGKFDMSSSVSTDRSLYNSAR--GSYTAASLDRSSSFHDNMENPILSSLPSMSRSAS 58

Query: 1518 XVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXX 1697
             VTQ DVTNF QCLRFDPK+MA E KF+R+GDFKRLASAV GSPDE              
Sbjct: 59   TVTQVDVTNFFQCLRFDPKSMAAELKFHRHGDFKRLASAVFGSPDESPSGSLKGNLSNFS 118

Query: 1698 XDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLL 1877
             +D+KR +AGLRES+IKARERVK FSETLSV+NKCFPSIPSRKRSRPD LPGDR+SGLLL
Sbjct: 119  PEDVKRFKAGLRESTIKARERVKAFSETLSVINKCFPSIPSRKRSRPDVLPGDRSSGLLL 178

Query: 1878 NRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPA 2057
            NR+PMG GVGKMGTQSHS  H FDFE  KVEERGK+ IPNKRTRTS+VDQR EVRPNTPA
Sbjct: 179  NRAPMGQGVGKMGTQSHSLTHAFDFENQKVEERGKNVIPNKRTRTSMVDQRVEVRPNTPA 238

Query: 2058 RSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLST 2237
            RSS ++DRDKE  RLPN  ++QGEDR+L +V DGWEKAKMKKKRTGIKAD A+SP S+ST
Sbjct: 239  RSSVNMDRDKETSRLPNGTSSQGEDRSLPIVADGWEKAKMKKKRTGIKADTAASPRSMST 298

Query: 2238 KAVDGYRESKQG----THPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---M 2396
            KAVDGYRE+KQG     H RHLPDAMSRLND+ G RP                 Q    M
Sbjct: 299  KAVDGYREAKQGVHARVHARHLPDAMSRLNDSHGLRPGTANGTVGGGKTDGPAQQASMGM 358

Query: 2397 RSSIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANAR 2564
            RSSIPRP+QE + L+HDKRDRSTSSEK    +RSVNKSNVR+EFISGSPTSSTKLH  AR
Sbjct: 359  RSSIPRPEQETTALLHDKRDRSTSSEKERTNLRSVNKSNVREEFISGSPTSSTKLHGPAR 418

Query: 2565 GPRSGSSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            GPRSG     K S V+QRAT SSDWDLVHGT+KN
Sbjct: 419  GPRSGP----KLSTVIQRATASSDWDLVHGTNKN 448


>ref|XP_023771266.1| uncharacterized protein LOC111919938 isoform X2 [Lactuca sativa]
          Length = 1214

 Score =  530 bits (1364), Expect = e-168
 Identities = 292/448 (65%), Positives = 338/448 (75%), Gaps = 6/448 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFREN----MENPIXXXXXXXXX 1508
            MASSGKFDLS+ SPDRPLYNSAQRG SYT ASLDRSSSFR+N    MENPI         
Sbjct: 1    MASSGKFDLSTVSPDRPLYNSAQRG-SYTSASLDRSSSFRDNNNNNMENPILSSLPSMSR 59

Query: 1509 XXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXX 1688
                VTQ DVTNFLQCLRFDPK+MA +HKFNR+GDF+RLASAV+GSPDE           
Sbjct: 60   STSTVTQLDVTNFLQCLRFDPKSMAVDHKFNRHGDFRRLASAVLGSPDESPSSKTKLPNS 119

Query: 1689 XXXXDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSR-PDSLPGDRAS 1865
                DDLKRL+AGLRES+IK+RERVK FSETLSV+NK FPSIPSRKRSR PD LPGDR++
Sbjct: 120  SP--DDLKRLKAGLRESTIKSRERVKAFSETLSVINKFFPSIPSRKRSRGPDGLPGDRST 177

Query: 1866 GLLLNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRP 2045
            GLLLNR+PMG       TQ+HS +++FDF+Q KVEERGK+ IPNKRTRTS+     EVRP
Sbjct: 178  GLLLNRAPMGTPKMGAHTQTHSLSNSFDFDQQKVEERGKNVIPNKRTRTSM-----EVRP 232

Query: 2046 NTPARSSGSLDRDKEALRLPNSNATQGEDR-TLSMVTDGWEKAKMKKKRTGIKADAASSP 2222
            NTPARSSG++DRDKE LRLP SN    EDR +L +V DGWEK+KMKKKRTGIK DA+ SP
Sbjct: 233  NTPARSSGNVDRDKEPLRLPTSN----EDRASLPIVADGWEKSKMKKKRTGIKVDASPSP 288

Query: 2223 SSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQTMRS 2402
            SS STK +DGYRE KQG HPRH+PD ++RLND+ GFRP                 + MR 
Sbjct: 289  SSGSTKPIDGYREPKQGMHPRHVPDGITRLNDSHGFRPGAANGVAGGG-----KAEVMRP 343

Query: 2403 SIPRPDQENSPLMHDKRDRSTSSEKVRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGS 2582
            SIPRP+ EN+ L+  +RDRSTSSEK R+  +SNVR+EF SGSPTSSTKLH NARGPRSGS
Sbjct: 344  SIPRPEIENTSLL--QRDRSTSSEKERT-KRSNVREEFASGSPTSSTKLHTNARGPRSGS 400

Query: 2583 SVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            +VV KSS  + +  T++DW+L HGT  N
Sbjct: 401  NVVPKSSVSMGQRATNNDWELTHGTVTN 428


>ref|XP_023771265.1| uncharacterized protein LOC111919938 isoform X1 [Lactuca sativa]
 gb|PLY79704.1| hypothetical protein LSAT_8X86180 [Lactuca sativa]
          Length = 1215

 Score =  530 bits (1364), Expect = e-168
 Identities = 292/448 (65%), Positives = 338/448 (75%), Gaps = 6/448 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFREN----MENPIXXXXXXXXX 1508
            MASSGKFDLS+ SPDRPLYNSAQRG SYT ASLDRSSSFR+N    MENPI         
Sbjct: 1    MASSGKFDLSTVSPDRPLYNSAQRG-SYTSASLDRSSSFRDNNNNNMENPILSSLPSMSR 59

Query: 1509 XXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXX 1688
                VTQ DVTNFLQCLRFDPK+MA +HKFNR+GDF+RLASAV+GSPDE           
Sbjct: 60   STSTVTQLDVTNFLQCLRFDPKSMAVDHKFNRHGDFRRLASAVLGSPDESPSSKTKLPNS 119

Query: 1689 XXXXDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSR-PDSLPGDRAS 1865
                DDLKRL+AGLRES+IK+RERVK FSETLSV+NK FPSIPSRKRSR PD LPGDR++
Sbjct: 120  SP--DDLKRLKAGLRESTIKSRERVKAFSETLSVINKFFPSIPSRKRSRGPDGLPGDRST 177

Query: 1866 GLLLNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRP 2045
            GLLLNR+PMG       TQ+HS +++FDF+Q KVEERGK+ IPNKRTRTS+     EVRP
Sbjct: 178  GLLLNRAPMGTPKMGAHTQTHSLSNSFDFDQQKVEERGKNVIPNKRTRTSM-----EVRP 232

Query: 2046 NTPARSSGSLDRDKEALRLPNSNATQGEDR-TLSMVTDGWEKAKMKKKRTGIKADAASSP 2222
            NTPARSSG++DRDKE LRLP SN    EDR +L +V DGWEK+KMKKKRTGIK DA+ SP
Sbjct: 233  NTPARSSGNVDRDKEPLRLPTSN----EDRASLPIVADGWEKSKMKKKRTGIKVDASPSP 288

Query: 2223 SSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQTMRS 2402
            SS STK +DGYRE KQG HPRH+PD ++RLND+ GFRP                 + MR 
Sbjct: 289  SSGSTKPIDGYREPKQGMHPRHVPDGITRLNDSHGFRPGAANGVAGGG-----KAEVMRP 343

Query: 2403 SIPRPDQENSPLMHDKRDRSTSSEKVRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGS 2582
            SIPRP+ EN+ L+  +RDRSTSSEK R+  +SNVR+EF SGSPTSSTKLH NARGPRSGS
Sbjct: 344  SIPRPEIENTSLL--QRDRSTSSEKERT-KRSNVREEFASGSPTSSTKLHTNARGPRSGS 400

Query: 2583 SVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            +VV KSS  + +  T++DW+L HGT  N
Sbjct: 401  NVVPKSSVSMGQRATNNDWELTHGTVTN 428


>gb|KVI01149.1| hypothetical protein Ccrd_020564 [Cynara cardunculus var. scolymus]
          Length = 1258

 Score =  509 bits (1311), Expect = e-159
 Identities = 276/455 (60%), Positives = 327/455 (71%), Gaps = 13/455 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MASSGKFDLSS+SP RPLY S +RG SYT AS+DRS+SFRENM+NPI             
Sbjct: 1    MASSGKFDLSSSSPSRPLYTSGKRG-SYTAASMDRSASFRENMDNPILSSLPSMSRSTIN 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXXX 1700
            V+Q DVTNF QCLRFD K+MA E+K NR+GDFKRLA+A + +PD+               
Sbjct: 60   VSQGDVTNFFQCLRFDLKSMAAEYKCNRHGDFKRLANAALCAPDDSSSGSLKGKLPSSSP 119

Query: 1701 DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLLN 1880
            +DLKR + GLRES+IKARERVK+FSE LSV+NKCFPSIPSRKRSRPD+  GDR++ LL +
Sbjct: 120  EDLKRFKVGLRESTIKARERVKIFSEVLSVINKCFPSIPSRKRSRPDAFSGDRSNALLAD 179

Query: 1881 RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPAR 2060
            R+     VGKMGTQSH+    FD+EQ K+EERGK+ IPNKRTRTSLVDQRA+VRPNTPAR
Sbjct: 180  RT----AVGKMGTQSHASTGAFDYEQQKIEERGKNTIPNKRTRTSLVDQRADVRPNTPAR 235

Query: 2061 SSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLSTK 2240
            SSGS+DRD+E LRLPNS A Q EDR L +V DGWEKAKMKKKR+GIKADAA S S LS K
Sbjct: 236  SSGSVDRDREVLRLPNSTALQVEDRVLPIVADGWEKAKMKKKRSGIKADAAPSTSLLSAK 295

Query: 2241 AVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQTM--RSSIPR 2414
             +DG RE +QG HPR LPDA SRLND+ G+RP                P +M  RSSIPR
Sbjct: 296  PIDGCREPRQGMHPRSLPDAKSRLNDSHGYRPGAANGGIGAGKADSAQPASMGIRSSIPR 355

Query: 2415 PDQENSPLMHDKRDRSTSSE----KVRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGS 2582
            P+QEN+  +HDKRD + + E     VR++NK+NVR++FISG+PT  TKLHA ARGPRSGS
Sbjct: 356  PEQENTSFLHDKRDPTINLELERTNVRALNKANVREDFISGTPT--TKLHAAARGPRSGS 413

Query: 2583 -------SVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
                   S V      VQRAT S+  +  HGT K+
Sbjct: 414  GSGSGSGSGVIPKLCHVQRATVSNGLESPHGTSKS 448


>ref|XP_017241512.1| PREDICTED: uncharacterized protein LOC108214181 [Daucus carota subsp.
            sativus]
 gb|KZN03204.1| hypothetical protein DCAR_011960 [Daucus carota subsp. sativus]
          Length = 1259

 Score =  464 bits (1193), Expect = e-142
 Identities = 251/452 (55%), Positives = 305/452 (67%), Gaps = 10/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MA+S KFDLSSASPDRPLY S QRG SY  AS  RSSSFREN+ENPI             
Sbjct: 1    MATSNKFDLSSASPDRPLYASGQRG-SYMAASFGRSSSFRENVENPILSSLPSMSRSTSS 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIG-SPDEXXXXXXXXXXXXXX 1697
            VTQ DV +FLQCLRFDPK++  +HK NR G+FKR A   IG  PDE              
Sbjct: 60   VTQGDVMSFLQCLRFDPKSIIVDHKLNRQGEFKRFAGLAIGLQPDESPSSSTKSKVPSLS 119

Query: 1698 XDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLLL 1877
             +++KR R GLRES+IKARERVK+F+E L VVNKCFPSIPSRKRSRPD + G+R S L  
Sbjct: 120  PEEVKRFRIGLRESTIKARERVKIFNEGLLVVNKCFPSIPSRKRSRPDGISGERPSALFA 179

Query: 1878 N-RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTP 2054
            + RS +G GVGK+GTQ+H+ A  F+ EQ K EER K+ IPNKRTRTS+VD R +VRP+TP
Sbjct: 180  SDRSAVGPGVGKLGTQNHTLAGGFELEQQKAEERSKNVIPNKRTRTSMVDPRMDVRPSTP 239

Query: 2055 ARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLS 2234
            AR++G+ DRDKE  R P ++  QGED+TL++  DGWEK+KMKKKR+ IKAD A  P S +
Sbjct: 240  ARTAGTADRDKEGSRFPTNSVAQGEDQTLAIGVDGWEKSKMKKKRSVIKADIA--PGSPA 297

Query: 2235 TKAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT----MRS 2402
            TKA+DGYRE KQG HPR L D   R +D+  +RP                 Q     MRS
Sbjct: 298  TKAIDGYREPKQGVHPRLLSDGRPRTSDSYAYRPGGANGIVVVKADGTSQAQQTSTGMRS 357

Query: 2403 SIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARGP 2570
            ++PR DQ++S  + D+RD   +S+K     R +NK+N RDEF SGSPTSS KL+A  R P
Sbjct: 358  TVPRSDQDSSLPLQDRRDHIINSDKERVNARVINKANTRDEFSSGSPTSSAKLNAATRAP 417

Query: 2571 RSGSSVVHKSSPVVQRATTSSDWDLVHGTDKN 2666
            RS S +V K SPVVQRA  + DW+L H T KN
Sbjct: 418  RSSSGIVPKLSPVVQRANAAKDWELSHCTSKN 449


>gb|OIT08269.1| hypothetical protein A4A49_17381 [Nicotiana attenuata]
          Length = 1333

 Score =  447 bits (1151), Expect = e-136
 Identities = 251/482 (52%), Positives = 310/482 (64%), Gaps = 14/482 (2%)
 Frame = +3

Query: 1260 CAVTLQSCHAQLCLHVL*HRDC**IDAMASSGKFDLSSASPDRPLYNSAQRGGSYTGASL 1439
            C V  + CHAQLC     H++C  IDAM++S KFDLSS+SPDRPLY S QRG SY  ASL
Sbjct: 23   CIVVEEQCHAQLCSLGGLHKECQKIDAMSASSKFDLSSSSPDRPLYASGQRG-SYAPASL 81

Query: 1440 DRSSSFRENMENPIXXXXXXXXXXXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFK 1619
            DRS SFRENMENPI             VT+TD  NF QCLRFDPKAM T+HK NR  DFK
Sbjct: 82   DRSGSFRENMENPILSSLPNMTRSTLTVTRTDAVNFFQCLRFDPKAMVTDHKLNRNIDFK 141

Query: 1620 RLASAVIGSP--DEXXXXXXXXXXXXXXXDDLKRLRAGLRESSIKARERVKVFSETLSVV 1793
            RL S  +G+P  D                ++ +RL+AGLRES  KARERVK+F+E+LSV+
Sbjct: 142  RLTSLALGAPVEDSPLVSSKGKLFPSPSAEEARRLKAGLRESCTKARERVKIFTESLSVL 201

Query: 1794 NKCFPSIPSRKRSRPDSLPGDRASGLL-LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVE 1970
            NKCFPSIPSRKRSR DSL  DR   L   +RS  G  +GK GTQSH  A +++ EQ K E
Sbjct: 202  NKCFPSIPSRKRSRSDSLSNDRHVTLFPSDRSVSGTSIGKTGTQSHCTASSYELEQQKSE 261

Query: 1971 ERGKSAIPNKRTRTSLVDQRAEVRPNTPARSSGSLDRDKEALRLPNSNATQGEDRTLSMV 2150
            ER K+A+P+KRTRTS+ D R +VR NTP RS+G++DRD+E LRLPN +  QGEDRT S+ 
Sbjct: 262  ERVKTAVPSKRTRTSMADVRPDVRANTPTRSAGNMDRDREILRLPNGSTIQGEDRTSSIA 321

Query: 2151 TDGWEKAKMKKKRTGIKADAASSPSSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGF 2330
             +GWEK++MKKKR+GIK DA     S++TK +DG+RE KQG  PR   D+ SR  D+ GF
Sbjct: 322  VEGWEKSRMKKKRSGIKPDAT---GSITTKPIDGHREPKQGVQPRLPSDSRSRFTDSHGF 378

Query: 2331 R----PXXXXXXXXXXXXXXLTPQTMRSSIPRPDQENSPLMHDKRDRSTSSEK------- 2477
            R    P              L    +RSS+ + DQ+N   + D+RDR   SEK       
Sbjct: 379  RHGLAPGAVGKADGATQHVTL---GVRSSLSKIDQDNHLHLLDRRDRPLGSEKERVNLKA 435

Query: 2478 VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGSSVVHKSSPVVQRATTSSDWDLVHGT 2657
            V +  K+  R+EF S SPTSSTKL+   R PRSGS V  K SP V RA  ++DW++   T
Sbjct: 436  VSNTMKAAAREEFTSPSPTSSTKLNPATRAPRSGSGVAPKLSPPVHRAAAANDWEISQCT 495

Query: 2658 DK 2663
            +K
Sbjct: 496  NK 497


>ref|XP_016486861.1| PREDICTED: uncharacterized protein LOC107807078 isoform X2 [Nicotiana
            tabacum]
          Length = 1359

 Score =  447 bits (1149), Expect = e-135
 Identities = 252/482 (52%), Positives = 308/482 (63%), Gaps = 14/482 (2%)
 Frame = +3

Query: 1260 CAVTLQSCHAQLCLHVL*HRDC**IDAMASSGKFDLSSASPDRPLYNSAQRGGSYTGASL 1439
            C V  + CHAQLC     H++C  IDAM++S KFDLSS+SPDRPLY S QRG SY  ASL
Sbjct: 52   CIVVEEQCHAQLCSLGGLHKECQKIDAMSASSKFDLSSSSPDRPLYASGQRG-SYAPASL 110

Query: 1440 DRSSSFRENMENPIXXXXXXXXXXXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFK 1619
            DRS SFRENMENPI             VT+TD  NF QCLRFDPKAM T+HK NR  DFK
Sbjct: 111  DRSGSFRENMENPILSSLPNMTRSTSTVTRTDAVNFFQCLRFDPKAMVTDHKLNRNIDFK 170

Query: 1620 RLASAVIGSP--DEXXXXXXXXXXXXXXXDDLKRLRAGLRESSIKARERVKVFSETLSVV 1793
            RL S  +G P  D                ++ +RL+AGLRES  KARERVK+F+E+LSV+
Sbjct: 171  RLTSLALGVPVEDSPLVSSKGKLFPSPSAEESRRLKAGLRESCTKARERVKIFTESLSVL 230

Query: 1794 NKCFPSIPSRKRSRPDSLPGDRASGLL-LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVE 1970
            NKCFPSIPSRKRSR DSL  DR   L   +RS  G  +GKMGTQSH  A +++ EQ K E
Sbjct: 231  NKCFPSIPSRKRSRSDSLANDRHVTLFPSDRSVSGTSIGKMGTQSHCTASSYELEQQKSE 290

Query: 1971 ERGKSAIPNKRTRTSLVDQRAEVRPNTPARSSGSLDRDKEALRLPNSNATQGEDRTLSMV 2150
            ER K+A+P+KRTRTS+ D R +VR NTP RS+G++DRD+E LRLPN +  QGEDRT S+ 
Sbjct: 291  ERVKTAVPSKRTRTSMADVRPDVRANTPTRSAGNMDRDREILRLPNGSTIQGEDRTSSIA 350

Query: 2151 TDGWEKAKMKKKRTGIKADAASSPSSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGF 2330
             +GWEK++MKKKR+GIK DA     S+ TK +DG+RE KQG  PR   D+ SR  DT GF
Sbjct: 351  VEGWEKSRMKKKRSGIKPDAT---GSIITKPIDGHREPKQGVQPRLPSDSRSRFTDTHGF 407

Query: 2331 R----PXXXXXXXXXXXXXXLTPQTMRSSIPRPDQENSPLMHDKRDRSTSSEK------- 2477
            R    P              L    +RSS+ + DQ+N   + D+RDR   SEK       
Sbjct: 408  RHGLAPGAVGKADGATQHVTL---GVRSSLSKIDQDNHLHLLDRRDRPLGSEKERVNLKA 464

Query: 2478 VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGSSVVHKSSPVVQRATTSSDWDLVHGT 2657
            V +  K+  R+EF S SP SSTKL+   R PRSGS V  K SP V RA  ++DW++   T
Sbjct: 465  VSNTMKAAAREEFTSPSPASSTKLNPATRAPRSGSGVAPKLSPPVHRAAAANDWEISQCT 524

Query: 2658 DK 2663
            +K
Sbjct: 525  NK 526


>ref|XP_016486853.1| PREDICTED: uncharacterized protein LOC107807078 isoform X1 [Nicotiana
            tabacum]
          Length = 1362

 Score =  447 bits (1149), Expect = e-135
 Identities = 252/482 (52%), Positives = 308/482 (63%), Gaps = 14/482 (2%)
 Frame = +3

Query: 1260 CAVTLQSCHAQLCLHVL*HRDC**IDAMASSGKFDLSSASPDRPLYNSAQRGGSYTGASL 1439
            C V  + CHAQLC     H++C  IDAM++S KFDLSS+SPDRPLY S QRG SY  ASL
Sbjct: 52   CIVVEEQCHAQLCSLGGLHKECQKIDAMSASSKFDLSSSSPDRPLYASGQRG-SYAPASL 110

Query: 1440 DRSSSFRENMENPIXXXXXXXXXXXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFK 1619
            DRS SFRENMENPI             VT+TD  NF QCLRFDPKAM T+HK NR  DFK
Sbjct: 111  DRSGSFRENMENPILSSLPNMTRSTSTVTRTDAVNFFQCLRFDPKAMVTDHKLNRNIDFK 170

Query: 1620 RLASAVIGSP--DEXXXXXXXXXXXXXXXDDLKRLRAGLRESSIKARERVKVFSETLSVV 1793
            RL S  +G P  D                ++ +RL+AGLRES  KARERVK+F+E+LSV+
Sbjct: 171  RLTSLALGVPVEDSPLVSSKGKLFPSPSAEESRRLKAGLRESCTKARERVKIFTESLSVL 230

Query: 1794 NKCFPSIPSRKRSRPDSLPGDRASGLL-LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVE 1970
            NKCFPSIPSRKRSR DSL  DR   L   +RS  G  +GKMGTQSH  A +++ EQ K E
Sbjct: 231  NKCFPSIPSRKRSRSDSLANDRHVTLFPSDRSVSGTSIGKMGTQSHCTASSYELEQQKSE 290

Query: 1971 ERGKSAIPNKRTRTSLVDQRAEVRPNTPARSSGSLDRDKEALRLPNSNATQGEDRTLSMV 2150
            ER K+A+P+KRTRTS+ D R +VR NTP RS+G++DRD+E LRLPN +  QGEDRT S+ 
Sbjct: 291  ERVKTAVPSKRTRTSMADVRPDVRANTPTRSAGNMDRDREILRLPNGSTIQGEDRTSSIA 350

Query: 2151 TDGWEKAKMKKKRTGIKADAASSPSSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGF 2330
             +GWEK++MKKKR+GIK DA     S+ TK +DG+RE KQG  PR   D+ SR  DT GF
Sbjct: 351  VEGWEKSRMKKKRSGIKPDAT---GSIITKPIDGHREPKQGVQPRLPSDSRSRFTDTHGF 407

Query: 2331 R----PXXXXXXXXXXXXXXLTPQTMRSSIPRPDQENSPLMHDKRDRSTSSEK------- 2477
            R    P              L    +RSS+ + DQ+N   + D+RDR   SEK       
Sbjct: 408  RHGLAPGAVGKADGATQHVTL---GVRSSLSKIDQDNHLHLLDRRDRPLGSEKERVNLKA 464

Query: 2478 VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGSSVVHKSSPVVQRATTSSDWDLVHGT 2657
            V +  K+  R+EF S SP SSTKL+   R PRSGS V  K SP V RA  ++DW++   T
Sbjct: 465  VSNTMKAAAREEFTSPSPASSTKLNPATRAPRSGSGVAPKLSPPVHRAAAANDWEISQCT 524

Query: 2658 DK 2663
            +K
Sbjct: 525  NK 526


>ref|XP_009797848.1| PREDICTED: uncharacterized protein LOC104244185 isoform X1 [Nicotiana
            sylvestris]
          Length = 1362

 Score =  447 bits (1149), Expect = e-135
 Identities = 252/482 (52%), Positives = 308/482 (63%), Gaps = 14/482 (2%)
 Frame = +3

Query: 1260 CAVTLQSCHAQLCLHVL*HRDC**IDAMASSGKFDLSSASPDRPLYNSAQRGGSYTGASL 1439
            C V  + CHAQLC     H++C  IDAM++S KFDLSS+SPDRPLY S QRG SY  ASL
Sbjct: 52   CIVVEEQCHAQLCSLGGLHKECQKIDAMSASSKFDLSSSSPDRPLYASGQRG-SYAPASL 110

Query: 1440 DRSSSFRENMENPIXXXXXXXXXXXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFK 1619
            DRS SFRENMENPI             VT+TD  NF QCLRFDPKAM T+HK NR  DFK
Sbjct: 111  DRSGSFRENMENPILSSLPNMTRSTSTVTRTDAVNFFQCLRFDPKAMVTDHKLNRNIDFK 170

Query: 1620 RLASAVIGSP--DEXXXXXXXXXXXXXXXDDLKRLRAGLRESSIKARERVKVFSETLSVV 1793
            RL S  +G P  D                ++ +RL+AGLRES  KARERVK+F+E+LSV+
Sbjct: 171  RLTSLALGVPVEDSPLVSSKGKLFPSPSAEESRRLKAGLRESCTKARERVKIFTESLSVL 230

Query: 1794 NKCFPSIPSRKRSRPDSLPGDRASGLL-LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVE 1970
            NKCFPSIPSRKRSR DSL  DR   L   +RS  G  +GKMGTQSH  A +++ EQ K E
Sbjct: 231  NKCFPSIPSRKRSRSDSLANDRHVTLFPSDRSVSGTSIGKMGTQSHCTASSYELEQQKSE 290

Query: 1971 ERGKSAIPNKRTRTSLVDQRAEVRPNTPARSSGSLDRDKEALRLPNSNATQGEDRTLSMV 2150
            ER K+A+P+KRTRTS+ D R +VR NTP RS+G++DRD+E LRLPN +  QGEDRT S+ 
Sbjct: 291  ERVKTAVPSKRTRTSMADVRPDVRANTPTRSAGNMDRDREILRLPNGSTIQGEDRTSSIA 350

Query: 2151 TDGWEKAKMKKKRTGIKADAASSPSSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGF 2330
             +GWEK++MKKKR+GIK DA     S+ TK +DG+RE KQG  PR   D+ SR  DT GF
Sbjct: 351  VEGWEKSRMKKKRSGIKPDAT---GSIITKPIDGHREPKQGVQPRLPSDSRSRFTDTHGF 407

Query: 2331 R----PXXXXXXXXXXXXXXLTPQTMRSSIPRPDQENSPLMHDKRDRSTSSEK------- 2477
            R    P              L    +RSS+ + DQ+N   + D+RDR   SEK       
Sbjct: 408  RHGLAPGAVGKADGATQHVTL---GVRSSLSKIDQDNHLHLLDRRDRPLGSEKERVNLKA 464

Query: 2478 VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGSSVVHKSSPVVQRATTSSDWDLVHGT 2657
            V +  K+  R+EF S SP SSTKL+   R PRSGS V  K SP V RA  ++DW++   T
Sbjct: 465  VSNTMKAAAREEFTSPSPASSTKLNPATRAPRSGSGVAPKLSPPVHRAAAANDWEISQCT 524

Query: 2658 DK 2663
            +K
Sbjct: 525  NK 526


>ref|XP_019258758.1| PREDICTED: uncharacterized protein LOC109236971 [Nicotiana attenuata]
          Length = 1436

 Score =  447 bits (1151), Expect = e-135
 Identities = 251/482 (52%), Positives = 310/482 (64%), Gaps = 14/482 (2%)
 Frame = +3

Query: 1260 CAVTLQSCHAQLCLHVL*HRDC**IDAMASSGKFDLSSASPDRPLYNSAQRGGSYTGASL 1439
            C V  + CHAQLC     H++C  IDAM++S KFDLSS+SPDRPLY S QRG SY  ASL
Sbjct: 126  CIVVEEQCHAQLCSLGGLHKECQKIDAMSASSKFDLSSSSPDRPLYASGQRG-SYAPASL 184

Query: 1440 DRSSSFRENMENPIXXXXXXXXXXXXXVTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFK 1619
            DRS SFRENMENPI             VT+TD  NF QCLRFDPKAM T+HK NR  DFK
Sbjct: 185  DRSGSFRENMENPILSSLPNMTRSTLTVTRTDAVNFFQCLRFDPKAMVTDHKLNRNIDFK 244

Query: 1620 RLASAVIGSP--DEXXXXXXXXXXXXXXXDDLKRLRAGLRESSIKARERVKVFSETLSVV 1793
            RL S  +G+P  D                ++ +RL+AGLRES  KARERVK+F+E+LSV+
Sbjct: 245  RLTSLALGAPVEDSPLVSSKGKLFPSPSAEEARRLKAGLRESCTKARERVKIFTESLSVL 304

Query: 1794 NKCFPSIPSRKRSRPDSLPGDRASGLL-LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVE 1970
            NKCFPSIPSRKRSR DSL  DR   L   +RS  G  +GK GTQSH  A +++ EQ K E
Sbjct: 305  NKCFPSIPSRKRSRSDSLSNDRHVTLFPSDRSVSGTSIGKTGTQSHCTASSYELEQQKSE 364

Query: 1971 ERGKSAIPNKRTRTSLVDQRAEVRPNTPARSSGSLDRDKEALRLPNSNATQGEDRTLSMV 2150
            ER K+A+P+KRTRTS+ D R +VR NTP RS+G++DRD+E LRLPN +  QGEDRT S+ 
Sbjct: 365  ERVKTAVPSKRTRTSMADVRPDVRANTPTRSAGNMDRDREILRLPNGSTIQGEDRTSSIA 424

Query: 2151 TDGWEKAKMKKKRTGIKADAASSPSSLSTKAVDGYRESKQGTHPRHLPDAMSRLNDTQGF 2330
             +GWEK++MKKKR+GIK DA     S++TK +DG+RE KQG  PR   D+ SR  D+ GF
Sbjct: 425  VEGWEKSRMKKKRSGIKPDAT---GSITTKPIDGHREPKQGVQPRLPSDSRSRFTDSHGF 481

Query: 2331 R----PXXXXXXXXXXXXXXLTPQTMRSSIPRPDQENSPLMHDKRDRSTSSEK------- 2477
            R    P              L    +RSS+ + DQ+N   + D+RDR   SEK       
Sbjct: 482  RHGLAPGAVGKADGATQHVTL---GVRSSLSKIDQDNHLHLLDRRDRPLGSEKERVNLKA 538

Query: 2478 VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSGSSVVHKSSPVVQRATTSSDWDLVHGT 2657
            V +  K+  R+EF S SPTSSTKL+   R PRSGS V  K SP V RA  ++DW++   T
Sbjct: 539  VSNTMKAAAREEFTSPSPTSSTKLNPATRAPRSGSGVAPKLSPPVHRAAAANDWEISQCT 598

Query: 2658 DK 2663
            +K
Sbjct: 599  NK 600


>ref|XP_011093773.1| uncharacterized protein LOC105173644 isoform X2 [Sesamum indicum]
          Length = 1293

 Score =  444 bits (1141), Expect = e-134
 Identities = 241/452 (53%), Positives = 306/452 (67%), Gaps = 11/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            M++S KFDLSS SPDRPLY S  RG SY+ +SLDRS SFREN+ENP+             
Sbjct: 1    MSASSKFDLSSGSPDRPLYTSGPRG-SYSASSLDRSGSFRENIENPLLSSLPNMTRNGSS 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSP--DEXXXXXXXXXXXXX 1694
            VT  DV NF QC+R DPK+M  EHK NR  +FKRLASA +G P  D              
Sbjct: 60   VTHGDVLNFFQCVRIDPKSMVVEHKLNRPAEFKRLASAAVGIPLEDSMPPSSKSKQLSSP 119

Query: 1695 XXDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL 1874
              +DL+RL++G+RES  KARERVK+F++ LSV+NKCFP+IPSRKRSR D+L  DR++ +L
Sbjct: 120  SLEDLRRLKSGVRESGTKARERVKIFNDCLSVINKCFPTIPSRKRSRLDALSNDRSNTML 179

Query: 1875 -LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNT 2051
             ++RS  G G+GKMG Q+H+    F+ EQ K EER KS IP+KRTRTS+VD R ++R N 
Sbjct: 180  SIDRSASGMGIGKMGPQNHASTSGFEPEQQKSEERTKSTIPSKRTRTSMVDARTDIRANN 239

Query: 2052 PARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSL 2231
            PAR SGS+D+D++ +RL N+ A QGEDRTLS+  DGWE +KMKKKRTGIK D A+  S +
Sbjct: 240  PARPSGSVDKDRDVVRLSNNGAVQGEDRTLSVAVDGWENSKMKKKRTGIKLDVAA--SLM 297

Query: 2232 STKAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---MRS 2402
            +TK VDGYRESKQG HPR   +A SRL D  GFR                 PQT   MRS
Sbjct: 298  ATKPVDGYRESKQGMHPRLPNEARSRLTDLHGFR-SGNANGGLGVGKGEANPQTSSGMRS 356

Query: 2403 SIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARGP 2570
            S+ R D +NS L+H++R+R +  EK    +++ N  N R++F SGSPTS TK +AN RGP
Sbjct: 357  SVSRTDSDNSSLLHERRERPSGQEKERLNLKATNNGNSREDFSSGSPTSGTKFNANVRGP 416

Query: 2571 RSGS-SVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            RSGS   V K S VVQR+ +S+DW+L + T+K
Sbjct: 417  RSGSVGGVSKLSQVVQRSASSNDWELPNCTNK 448


>ref|XP_011093770.1| uncharacterized protein LOC105173644 isoform X1 [Sesamum indicum]
 ref|XP_011093771.1| uncharacterized protein LOC105173644 isoform X1 [Sesamum indicum]
 ref|XP_011093772.1| uncharacterized protein LOC105173644 isoform X1 [Sesamum indicum]
 ref|XP_020553406.1| uncharacterized protein LOC105173644 isoform X1 [Sesamum indicum]
          Length = 1298

 Score =  444 bits (1141), Expect = e-134
 Identities = 241/452 (53%), Positives = 306/452 (67%), Gaps = 11/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            M++S KFDLSS SPDRPLY S  RG SY+ +SLDRS SFREN+ENP+             
Sbjct: 1    MSASSKFDLSSGSPDRPLYTSGPRG-SYSASSLDRSGSFRENIENPLLSSLPNMTRNGSS 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSP--DEXXXXXXXXXXXXX 1694
            VT  DV NF QC+R DPK+M  EHK NR  +FKRLASA +G P  D              
Sbjct: 60   VTHGDVLNFFQCVRIDPKSMVVEHKLNRPAEFKRLASAAVGIPLEDSMPPSSKSKQLSSP 119

Query: 1695 XXDDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL 1874
              +DL+RL++G+RES  KARERVK+F++ LSV+NKCFP+IPSRKRSR D+L  DR++ +L
Sbjct: 120  SLEDLRRLKSGVRESGTKARERVKIFNDCLSVINKCFPTIPSRKRSRLDALSNDRSNTML 179

Query: 1875 -LNRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNT 2051
             ++RS  G G+GKMG Q+H+    F+ EQ K EER KS IP+KRTRTS+VD R ++R N 
Sbjct: 180  SIDRSASGMGIGKMGPQNHASTSGFEPEQQKSEERTKSTIPSKRTRTSMVDARTDIRANN 239

Query: 2052 PARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSL 2231
            PAR SGS+D+D++ +RL N+ A QGEDRTLS+  DGWE +KMKKKRTGIK D A+  S +
Sbjct: 240  PARPSGSVDKDRDVVRLSNNGAVQGEDRTLSVAVDGWENSKMKKKRTGIKLDVAA--SLM 297

Query: 2232 STKAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXLTPQT---MRS 2402
            +TK VDGYRESKQG HPR   +A SRL D  GFR                 PQT   MRS
Sbjct: 298  ATKPVDGYRESKQGMHPRLPNEARSRLTDLHGFR-SGNANGGLGVGKGEANPQTSSGMRS 356

Query: 2403 SIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARGP 2570
            S+ R D +NS L+H++R+R +  EK    +++ N  N R++F SGSPTS TK +AN RGP
Sbjct: 357  SVSRTDSDNSSLLHERRERPSGQEKERLNLKATNNGNSREDFSSGSPTSGTKFNANVRGP 416

Query: 2571 RSGS-SVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            RSGS   V K S VVQR+ +S+DW+L + T+K
Sbjct: 417  RSGSVGGVSKLSQVVQRSASSNDWELPNCTNK 448


>gb|KZV50855.1| hypothetical protein F511_27621 [Dorcoceras hygrometricum]
          Length = 1332

 Score =  441 bits (1134), Expect = e-133
 Identities = 236/449 (52%), Positives = 310/449 (69%), Gaps = 8/449 (1%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            M++S KF+LSSASPDRPLY S  RG SY  A+LDRS SFREN+ENP+             
Sbjct: 1    MSASSKFELSSASPDRPLYASGHRG-SYGAATLDRSGSFRENLENPLLSSLPNMTRSTSS 59

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIGSPDEXXXXXXXXXXXXXXX 1700
            VTQ DV NF QC+RFDPK+M  EHK NR  +FKRLASA +G P E               
Sbjct: 60   VTQGDVLNFFQCVRFDPKSMVVEHKLNRPAEFKRLASAAVGFPLEDSLTPTKNKLPNSAP 119

Query: 1701 DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL-L 1877
            +DL+RL++G+RES  K+RERVK+ ++ LSV+NKCFP+IPSRKRSR D+L  DR++ LL L
Sbjct: 120  EDLRRLKSGVRESGTKSRERVKILNDCLSVINKCFPTIPSRKRSRLDTLSNDRSNTLLPL 179

Query: 1878 NRSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNTPA 2057
             RS    G+GK+G Q+H+    F+ EQ K +ER KSA PNKRTRTS+VD R E+R +TP+
Sbjct: 180  ERSIPTLGIGKIGPQNHASTSGFELEQQKADERSKSAFPNKRTRTSMVDARTEMRASTPS 239

Query: 2058 RSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSLST 2237
            + SG++D+D++A+RL NS+A QGE+R+LS+  DGWEKAKMKKKRTGIK D +S  SS++T
Sbjct: 240  KPSGTMDKDRDAVRLSNSSAVQGEERSLSISVDGWEKAKMKKKRTGIKPDVSS--SSVTT 297

Query: 2238 KAVDGYRESKQGTHPRHLPDAMSRLNDTQGFRPXXXXXXXXXXXXXXL--TPQTMRSSIP 2411
            K +DG+RE+KQG  PR   +A SRLN++ GFRP                 T   +RSSI 
Sbjct: 298  KPIDGFRETKQGMQPRLPTEARSRLNESHGFRPGVANGGLGVGKAEATSQTSSGIRSSIS 357

Query: 2412 RPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARGPRSG 2579
            R D ENS L+H++R+R +  EK    +++VN+++ RD+  SGSPTSS+KL+A+ R PRSG
Sbjct: 358  RTDSENSSLLHERRERPSGQEKERVNLKAVNRASSRDDLSSGSPTSSSKLNASIRAPRSG 417

Query: 2580 S-SVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            S   + K S V QR+T S DW+L + T K
Sbjct: 418  SVGGISKLSQVAQRSTPSDDWELTNCTSK 446


>ref|XP_021275591.1| uncharacterized protein LOC110410286 [Herrania umbratica]
 ref|XP_021275592.1| uncharacterized protein LOC110410286 [Herrania umbratica]
          Length = 1282

 Score =  437 bits (1124), Expect = e-132
 Identities = 247/452 (54%), Positives = 305/452 (67%), Gaps = 11/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MA+S KFDLSS SPDRPLY S QRG ++  A LDRS SFRE MENPI             
Sbjct: 1    MATSSKFDLSSGSPDRPLYTSGQRG-AHLAAQLDRSGSFRETMENPILSSLPGMSRSL-- 57

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIG-SPDEXXXXXXXXXXXXXX 1697
            + Q DV+NF QCLRFDPK +A +HK NR GDFKR  +  +G S DE              
Sbjct: 58   LAQGDVSNFFQCLRFDPKVVAADHKSNRQGDFKRHINVALGISADESPTVLSKGKLLPSP 117

Query: 1698 X-DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL 1874
              +++KR++AGLR+ ++KARER+K F+E LSV NK FPSIPS+KRSR +S  GDR + LL
Sbjct: 118  IPEEIKRVKAGLRDCAVKARERMKTFNEALSVFNKFFPSIPSKKRSRSESFSGDRPNALL 177

Query: 1875 LN-RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNT 2051
             + RS +G  +GKMG  +HS A  F+FEQ K EER KSA+PNKRTRTSLVD R ++R N 
Sbjct: 178  SSDRSVLGPTIGKMGMHNHSIAGAFEFEQQKAEERPKSAVPNKRTRTSLVDVRMDMRNNA 237

Query: 2052 PARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSL 2231
              R  G+ DRD+E LR+ NS A QGEDRTLS   DGWEKAKMKKKR+GIK D   SPS +
Sbjct: 238  LVRQPGNADRDREMLRVSNSGAVQGEDRTLSGGVDGWEKAKMKKKRSGIKPDV--SPSMV 295

Query: 2232 STKAVDGYRESKQGTHPRHLPDAMSRL-NDTQGFRPXXXXXXXXXXXXXXLTPQT---MR 2399
            STK ++GYRESKQG   R + DA SRL ND+ GFR               ++  T    R
Sbjct: 296  STKPIEGYRESKQGMQQRPVTDARSRLNNDSHGFRSGIANGSAGVGKSDGISQPTGLGPR 355

Query: 2400 SSIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARG 2567
            SS+PR D ++SPL++D+RDR  +S+K    +R+VNK +VRDEF S SPTSSTK++A+ RG
Sbjct: 356  SSVPRSDLDSSPLLNDRRDRPVASDKERVNLRTVNKMSVRDEFNSASPTSSTKMNASIRG 415

Query: 2568 PRSGSSVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            PRSGS    K SPVV RAT S+DW+L H T+K
Sbjct: 416  PRSGSGGAPKLSPVVHRATASNDWELSHCTNK 447


>ref|XP_017982811.1| PREDICTED: uncharacterized protein LOC18588345 isoform X1 [Theobroma
            cacao]
 ref|XP_017982812.1| PREDICTED: uncharacterized protein LOC18588345 isoform X1 [Theobroma
            cacao]
 ref|XP_017982813.1| PREDICTED: uncharacterized protein LOC18588345 isoform X1 [Theobroma
            cacao]
          Length = 1282

 Score =  437 bits (1124), Expect = e-132
 Identities = 247/452 (54%), Positives = 306/452 (67%), Gaps = 11/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MA+S KFDLSS SPDRPLY S QRG ++  A LDRS SFRE MENPI             
Sbjct: 1    MATSSKFDLSSGSPDRPLYTSGQRG-AHLAAQLDRSGSFRETMENPILSSLPGMSRSL-- 57

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIG-SPDEXXXXXXXXXXXXXX 1697
            + Q DV+NF QCLRFDPK +A +HK NR GDFKR  +  +G S DE              
Sbjct: 58   LAQGDVSNFFQCLRFDPKVVAADHKSNRQGDFKRHINVALGISADESPTVLSKGKLLPFP 117

Query: 1698 X-DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL 1874
              +++KR++AGLR+ ++KARER+K F+E LSV NK FPSIPS+KRSR +S   DR + LL
Sbjct: 118  IPEEIKRVKAGLRDCAVKARERMKTFNEALSVFNKFFPSIPSKKRSRSESFSSDRPNALL 177

Query: 1875 LN-RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNT 2051
             + RS +G  +GKMG  +HS A  F+FEQ K+EER KSA+PNKRTRTSLVD R ++R N 
Sbjct: 178  SSDRSVLGPTIGKMGMHNHSIAGGFEFEQQKLEERPKSAVPNKRTRTSLVDVRMDMRNNA 237

Query: 2052 PARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSL 2231
              R  G+ DRD+E LR+ NS A QGEDRTLS   DGWEKAKMKKKR+GIK D   SPS +
Sbjct: 238  LVRQPGNADRDREMLRVSNSGAVQGEDRTLSGGVDGWEKAKMKKKRSGIKPDV--SPSMV 295

Query: 2232 STKAVDGYRESKQGTHPRHLPDAMSRL-NDTQGFRPXXXXXXXXXXXXXXLTPQT---MR 2399
            STK ++GYRESKQG   R + DA SRL ND+ GFR               ++  T    R
Sbjct: 296  STKPIEGYRESKQGMQQRPVTDARSRLNNDSHGFRSGIANGSAGVGKSEGISQPTGLGPR 355

Query: 2400 SSIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARG 2567
            SS+PR D ++SPL++D+RDR  +S+K    +R+VNK +VRDEF S SPTSSTK++A+ RG
Sbjct: 356  SSVPRSDLDSSPLLNDRRDRPVASDKERVNLRAVNKMSVRDEFNSASPTSSTKMNASIRG 415

Query: 2568 PRSGSSVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            PRSGS V  K SPVV RAT S+DW+L H T+K
Sbjct: 416  PRSGSGVAPKLSPVVHRATASNDWELSHCTNK 447


>gb|EOY30366.1| Serine/arginine repetitive matrix protein 2 isoform 2 [Theobroma
            cacao]
 gb|EOY30367.1| Serine/arginine repetitive matrix protein 2 isoform 2 [Theobroma
            cacao]
          Length = 1282

 Score =  437 bits (1124), Expect = e-132
 Identities = 247/452 (54%), Positives = 306/452 (67%), Gaps = 11/452 (2%)
 Frame = +3

Query: 1341 MASSGKFDLSSASPDRPLYNSAQRGGSYTGASLDRSSSFRENMENPIXXXXXXXXXXXXX 1520
            MA+S KFDLSS SPDRPLY S QRG ++  A LDRS SFRE MENPI             
Sbjct: 1    MATSSKFDLSSGSPDRPLYTSGQRG-AHLAAQLDRSGSFRETMENPILSSLPGMSRSL-- 57

Query: 1521 VTQTDVTNFLQCLRFDPKAMATEHKFNRYGDFKRLASAVIG-SPDEXXXXXXXXXXXXXX 1697
            + Q DV+NF QCLRFDPK +A +HK NR GDFKR  +  +G S DE              
Sbjct: 58   LAQGDVSNFFQCLRFDPKVVAADHKSNRQGDFKRHINVALGISADESPTVLSKGKLLPFP 117

Query: 1698 X-DDLKRLRAGLRESSIKARERVKVFSETLSVVNKCFPSIPSRKRSRPDSLPGDRASGLL 1874
              +++KR++AGLR+ ++KARER+K F+E LSV NK FPSIPS+KRSR +S   DR + LL
Sbjct: 118  IPEEIKRVKAGLRDCAVKARERMKTFNEALSVFNKFFPSIPSKKRSRSESFSSDRPNALL 177

Query: 1875 LN-RSPMGAGVGKMGTQSHSPAHTFDFEQPKVEERGKSAIPNKRTRTSLVDQRAEVRPNT 2051
             + RS +G  +GKMG  +HS A  F+FEQ K+EER KSA+PNKRTRTSLVD R ++R N 
Sbjct: 178  SSDRSVLGPTIGKMGMHNHSIAGGFEFEQQKLEERPKSAVPNKRTRTSLVDVRMDMRNNA 237

Query: 2052 PARSSGSLDRDKEALRLPNSNATQGEDRTLSMVTDGWEKAKMKKKRTGIKADAASSPSSL 2231
              R  G+ DRD+E LR+ NS A QGEDRTLS   DGWEKAKMKKKR+GIK D   SPS +
Sbjct: 238  LVRQPGNADRDREMLRVSNSGAVQGEDRTLSGGVDGWEKAKMKKKRSGIKPDV--SPSMV 295

Query: 2232 STKAVDGYRESKQGTHPRHLPDAMSRL-NDTQGFRPXXXXXXXXXXXXXXLTPQT---MR 2399
            STK ++GYRESKQG   R + DA SRL ND+ GFR               ++  T    R
Sbjct: 296  STKPIEGYRESKQGMQQRPVTDARSRLNNDSHGFRSGIANGSAGVGKSEGISQPTGLGPR 355

Query: 2400 SSIPRPDQENSPLMHDKRDRSTSSEK----VRSVNKSNVRDEFISGSPTSSTKLHANARG 2567
            SS+PR D ++SPL++D+RDR  +S+K    +R+VNK +VRDEF S SPTSSTK++A+ RG
Sbjct: 356  SSVPRSDLDSSPLLNDRRDRPVASDKERVNLRAVNKMSVRDEFNSASPTSSTKMNASIRG 415

Query: 2568 PRSGSSVVHKSSPVVQRATTSSDWDLVHGTDK 2663
            PRSGS V  K SPVV RAT S+DW+L H T+K
Sbjct: 416  PRSGSGVAPKLSPVVHRATASNDWELSHCTNK 447

