BLASTX nr result

ID: Rehmannia29_contig00019842 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00019842
         (1132 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020547188.1| uncharacterized protein LOC105180044 [Sesamu...   573   0.0  
ref|XP_011101892.1| uncharacterized protein LOC105179935 isoform...   574   0.0  
gb|PIN18708.1| Lycopene beta-cyclase [Handroanthus impetiginosus]     566   0.0  
gb|EYU43127.1| hypothetical protein MIMGU_mgv1a003245mg [Erythra...   530   0.0  
ref|XP_012830101.1| PREDICTED: uncharacterized protein LOC105951...   530   0.0  
ref|XP_011101893.1| uncharacterized protein LOC105179935 isoform...   509   e-175
ref|XP_022859436.1| uncharacterized protein LOC111380178 [Olea e...   489   e-167
emb|CDP11331.1| unnamed protein product [Coffea canephora]            474   e-161
gb|KJB10682.1| hypothetical protein B456_001G216200 [Gossypium r...   464   e-160
ref|XP_012489547.1| PREDICTED: uncharacterized protein LOC105802...   464   e-160
ref|XP_012489527.1| PREDICTED: uncharacterized protein LOC105802...   464   e-159
ref|XP_022759033.1| uncharacterized protein LOC111305610 [Durio ...   469   e-159
ref|XP_017982540.1| PREDICTED: uncharacterized protein LOC186123...   461   e-159
ref|XP_019264510.1| PREDICTED: uncharacterized protein LOC109242...   467   e-158
gb|KJB10681.1| hypothetical protein B456_001G216200 [Gossypium r...   464   e-158
ref|XP_020416480.1| uncharacterized protein LOC18781842 isoform ...   460   e-158
ref|XP_021831442.1| uncharacterized protein LOC110771448 isoform...   458   e-158
ref|XP_021989464.1| uncharacterized protein LOC110886019 isoform...   465   e-158
gb|KJB10680.1| hypothetical protein B456_001G216200 [Gossypium r...   464   e-157
ref|XP_017249143.1| PREDICTED: uncharacterized protein LOC108220...   462   e-157

>ref|XP_020547188.1| uncharacterized protein LOC105180044 [Sesamum indicum]
          Length = 524

 Score =  573 bits (1476), Expect = 0.0
 Identities = 283/369 (76%), Positives = 306/369 (82%)
 Frame = +2

Query: 26   MKNGGLSLFHGSFASPPPLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYN 205
            +KNGGLSLFHGSFA+PPP P+RRK Y+ AQ+T PSRTQRIMESIPVSGEVGGAGGAYSYN
Sbjct: 4    VKNGGLSLFHGSFAAPPPNPSRRKLYLFAQATTPSRTQRIMESIPVSGEVGGAGGAYSYN 63

Query: 206  ALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIA 385
            ALKRLD LWSSIC                 GLF+ESE+ E+C+DMFDV+VCGGTLGIFIA
Sbjct: 64   ALKRLDSLWSSICNSSPVVQEPPQVVSEVPGLFQESERMERCSDMFDVIVCGGTLGIFIA 123

Query: 386  AALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCG 565
             ALS KGLRVGIVEKS+LRGREQEWNISRK              DIDHVT++SFNPNRCG
Sbjct: 124  TALSSKGLRVGIVEKSILRGREQEWNISRKELLELVEVGVLTKGDIDHVTAASFNPNRCG 183

Query: 566  FEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDG 745
            FEGKGEIWVNDILNLGVSP KLIE MR RF  LGG I+EGCSVSHIS+YED+AVL+L DG
Sbjct: 184  FEGKGEIWVNDILNLGVSPGKLIETMRMRFTFLGGIIFEGCSVSHISVYEDSAVLELADG 243

Query: 746  KILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRV 925
            KILSS LIIDAMGNFSP+VKQIRRGRKPDGICLVVGSCCRGF EN+TSDVI+SSASVKRV
Sbjct: 244  KILSSRLIIDAMGNFSPIVKQIRRGRKPDGICLVVGSCCRGFTENSTSDVIYSSASVKRV 303

Query: 926  GQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDL 1105
            GQS AQYFWEAFPAGSGPL+RTTYMFTYLY             DYWDLMP+YQ VSLDDL
Sbjct: 304  GQSEAQYFWEAFPAGSGPLDRTTYMFTYLYFTRGCPLLEELLEDYWDLMPKYQGVSLDDL 363

Query: 1106 EVLRVIYGI 1132
            +VLRVIYGI
Sbjct: 364  DVLRVIYGI 372


>ref|XP_011101892.1| uncharacterized protein LOC105179935 isoform X1 [Sesamum indicum]
          Length = 593

 Score =  574 bits (1480), Expect = 0.0
 Identities = 284/369 (76%), Positives = 307/369 (83%)
 Frame = +2

Query: 26   MKNGGLSLFHGSFASPPPLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYN 205
            +KNGGLSLFHGSFA+PPP P+RRK Y+ AQ+T PSRTQRIMESIPVSGEVGGAGGAYSYN
Sbjct: 4    VKNGGLSLFHGSFAAPPPNPSRRKLYLFAQATTPSRTQRIMESIPVSGEVGGAGGAYSYN 63

Query: 206  ALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIA 385
            ALKRLD LWSSIC                 GLF+ESE+ E+C+DMFDV+VCGGTLGIFIA
Sbjct: 64   ALKRLDSLWSSICNSSPVVQEPPQVVSEVPGLFQESERMERCSDMFDVIVCGGTLGIFIA 123

Query: 386  AALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCG 565
             ALS KGLRVGIVEKS+LRGREQEWNISRK              DIDHVT++SFNPNRCG
Sbjct: 124  TALSSKGLRVGIVEKSILRGREQEWNISRKELLELVEVGVLTKGDIDHVTAASFNPNRCG 183

Query: 566  FEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDG 745
            FEGKGEIWVNDILNLGVSPAKLIE MR RF  LGG I+EGCSVSHIS+YED+AVL+L DG
Sbjct: 184  FEGKGEIWVNDILNLGVSPAKLIETMRMRFTFLGGIIFEGCSVSHISVYEDSAVLELADG 243

Query: 746  KILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRV 925
            KILSS LIIDAMGNFSP+VKQIRRGRKPDGICLVVGSCCRGF EN+TSDVI+SSASVKRV
Sbjct: 244  KILSSRLIIDAMGNFSPIVKQIRRGRKPDGICLVVGSCCRGFTENSTSDVIYSSASVKRV 303

Query: 926  GQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDL 1105
            GQS AQYFWEAFPAGSGPL+RTTYMFTYLY             DYWDLMP+YQ VSLDDL
Sbjct: 304  GQSEAQYFWEAFPAGSGPLDRTTYMFTYLYPEPGCPLLEELLEDYWDLMPKYQGVSLDDL 363

Query: 1106 EVLRVIYGI 1132
            +VLRVIYGI
Sbjct: 364  DVLRVIYGI 372


>gb|PIN18708.1| Lycopene beta-cyclase [Handroanthus impetiginosus]
          Length = 595

 Score =  567 bits (1460), Expect = 0.0
 Identities = 286/375 (76%), Positives = 306/375 (81%), Gaps = 2/375 (0%)
 Frame = +2

Query: 14   MVMQMKNGGLSLFHGSFASPP--PLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAG 187
            MVMQ+K GGL  FHGSF +PP  P P+RR FY+LAQ+TAPSRTQRIMESIPVSGEVGGAG
Sbjct: 1    MVMQVKIGGLFSFHGSFGAPPRSPGPSRRNFYLLAQATAPSRTQRIMESIPVSGEVGGAG 60

Query: 188  GAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGT 367
            GAYSYNALKRLD LWSSIC                 GL  ESE+ +KC + FDV+VCGGT
Sbjct: 61   GAYSYNALKRLDSLWSSICNASPVVQEPQQVVSNVPGLLHESEQGKKCDNTFDVIVCGGT 120

Query: 368  LGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSF 547
            LGIFIA ALS KGLRV IVEK+VL+GREQEWNISRK             DDIDHVT++SF
Sbjct: 121  LGIFIATALSSKGLRVAIVEKAVLKGREQEWNISRKELIELVEVGVLTEDDIDHVTAASF 180

Query: 548  NPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAV 727
            NPNRCGFEGKGEIWVNDILNLGVSP KLIEIMRKRF   GG I+EGCSVSHISIYEDAAV
Sbjct: 181  NPNRCGFEGKGEIWVNDILNLGVSPVKLIEIMRKRFAFFGGVIFEGCSVSHISIYEDAAV 240

Query: 728  LQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSS 907
            LQL DGKILSS LIIDAMGNFSPVV+QIRRGRKPDG+CLVVGSCC GFKENTTSDVIFSS
Sbjct: 241  LQLADGKILSSQLIIDAMGNFSPVVRQIRRGRKPDGVCLVVGSCCHGFKENTTSDVIFSS 300

Query: 908  ASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQD 1087
            A+VKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLY             DYW+LMP+YQ 
Sbjct: 301  AAVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYPEPGCPLLEELLEDYWELMPKYQG 360

Query: 1088 VSLDDLEVLRVIYGI 1132
            VSLDDL+VLRVIYGI
Sbjct: 361  VSLDDLKVLRVIYGI 375


>gb|EYU43127.1| hypothetical protein MIMGU_mgv1a003245mg [Erythranthe guttata]
          Length = 549

 Score =  530 bits (1364), Expect = 0.0
 Identities = 275/379 (72%), Positives = 305/379 (80%), Gaps = 10/379 (2%)
 Frame = +2

Query: 26   MKNGGLSLFHGSFA---SPPPL-PTRRKFYVLAQSTA---PSRTQRIMESIPVSG-EVGG 181
            +KNGGLS+ HGS +   SPPPL P RR FY+LAQ+TA   PSRTQRIMESI VSG EVGG
Sbjct: 2    VKNGGLSICHGSSSFVLSPPPLFPIRRNFYLLAQATAAPPPSRTQRIMESISVSGGEVGG 61

Query: 182  AGGAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKC--TDMFDVLV 355
            AGG+YSYN+LKRLD LWS++                  G+F ESE+A KC   D+FDV+V
Sbjct: 62   AGGSYSYNSLKRLDSLWSTLFSHSPVVQEPQQVVSNIPGIFSESEQAAKCGGDDVFDVVV 121

Query: 356  CGGTLGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVT 535
            CGGTLGIFIA ALS KGLRVGIVEKS+L+GREQ+WNISRK             DDI  VT
Sbjct: 122  CGGTLGIFIATALSSKGLRVGIVEKSILKGREQDWNISRKELLELVEVGVLTEDDIVSVT 181

Query: 536  SSSFNPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYE 715
            S+SFNPNRCGFEGKGEIWVNDILNLGVSPAKLI+IMRKRFDSLGG I+EGCS+S+IS+YE
Sbjct: 182  SASFNPNRCGFEGKGEIWVNDILNLGVSPAKLIDIMRKRFDSLGGVIFEGCSLSNISVYE 241

Query: 716  DAAVLQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDV 895
            D+A+LQL DGKILSS L+IDAMGNFSP+VKQIRRGRKPDG+CLVVGSCCRGFK+N TSDV
Sbjct: 242  DSAILQLADGKILSSRLVIDAMGNFSPIVKQIRRGRKPDGMCLVVGSCCRGFKDNITSDV 301

Query: 896  IFSSASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMP 1075
            IFSSASVK VG+SVAQYFWEAFPAGSGPLERTTYMFTYL              DYWDLMP
Sbjct: 302  IFSSASVKSVGKSVAQYFWEAFPAGSGPLERTTYMFTYLNPEPGCPLLEELLEDYWDLMP 361

Query: 1076 QYQDVSLDDLEVLRVIYGI 1132
            +YQ VSLDDLEVLRVIYGI
Sbjct: 362  KYQGVSLDDLEVLRVIYGI 380


>ref|XP_012830101.1| PREDICTED: uncharacterized protein LOC105951253 [Erythranthe guttata]
 gb|EYU43126.1| hypothetical protein MIMGU_mgv1a003245mg [Erythranthe guttata]
          Length = 597

 Score =  530 bits (1364), Expect = 0.0
 Identities = 275/379 (72%), Positives = 305/379 (80%), Gaps = 10/379 (2%)
 Frame = +2

Query: 26   MKNGGLSLFHGSFA---SPPPL-PTRRKFYVLAQSTA---PSRTQRIMESIPVSG-EVGG 181
            +KNGGLS+ HGS +   SPPPL P RR FY+LAQ+TA   PSRTQRIMESI VSG EVGG
Sbjct: 2    VKNGGLSICHGSSSFVLSPPPLFPIRRNFYLLAQATAAPPPSRTQRIMESISVSGGEVGG 61

Query: 182  AGGAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKC--TDMFDVLV 355
            AGG+YSYN+LKRLD LWS++                  G+F ESE+A KC   D+FDV+V
Sbjct: 62   AGGSYSYNSLKRLDSLWSTLFSHSPVVQEPQQVVSNIPGIFSESEQAAKCGGDDVFDVVV 121

Query: 356  CGGTLGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVT 535
            CGGTLGIFIA ALS KGLRVGIVEKS+L+GREQ+WNISRK             DDI  VT
Sbjct: 122  CGGTLGIFIATALSSKGLRVGIVEKSILKGREQDWNISRKELLELVEVGVLTEDDIVSVT 181

Query: 536  SSSFNPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYE 715
            S+SFNPNRCGFEGKGEIWVNDILNLGVSPAKLI+IMRKRFDSLGG I+EGCS+S+IS+YE
Sbjct: 182  SASFNPNRCGFEGKGEIWVNDILNLGVSPAKLIDIMRKRFDSLGGVIFEGCSLSNISVYE 241

Query: 716  DAAVLQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDV 895
            D+A+LQL DGKILSS L+IDAMGNFSP+VKQIRRGRKPDG+CLVVGSCCRGFK+N TSDV
Sbjct: 242  DSAILQLADGKILSSRLVIDAMGNFSPIVKQIRRGRKPDGMCLVVGSCCRGFKDNITSDV 301

Query: 896  IFSSASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMP 1075
            IFSSASVK VG+SVAQYFWEAFPAGSGPLERTTYMFTYL              DYWDLMP
Sbjct: 302  IFSSASVKSVGKSVAQYFWEAFPAGSGPLERTTYMFTYLNPEPGCPLLEELLEDYWDLMP 361

Query: 1076 QYQDVSLDDLEVLRVIYGI 1132
            +YQ VSLDDLEVLRVIYGI
Sbjct: 362  KYQGVSLDDLEVLRVIYGI 380


>ref|XP_011101893.1| uncharacterized protein LOC105179935 isoform X2 [Sesamum indicum]
          Length = 550

 Score =  509 bits (1310), Expect = e-175
 Identities = 253/329 (76%), Positives = 271/329 (82%)
 Frame = +2

Query: 146  MESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAE 325
            MESIPVSGEVGGAGGAYSYNALKRLD LWSSIC                 GLF+ESE+ E
Sbjct: 1    MESIPVSGEVGGAGGAYSYNALKRLDSLWSSICNSSPVVQEPPQVVSEVPGLFQESERME 60

Query: 326  KCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXX 505
            +C+DMFDV+VCGGTLGIFIA ALS KGLRVGIVEKS+LRGREQEWNISRK          
Sbjct: 61   RCSDMFDVIVCGGTLGIFIATALSSKGLRVGIVEKSILRGREQEWNISRKELLELVEVGV 120

Query: 506  XXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEG 685
                DIDHVT++SFNPNRCGFEGKGEIWVNDILNLGVSPAKLIE MR RF  LGG I+EG
Sbjct: 121  LTKGDIDHVTAASFNPNRCGFEGKGEIWVNDILNLGVSPAKLIETMRMRFTFLGGIIFEG 180

Query: 686  CSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCR 865
            CSVSHIS+YED+AVL+L DGKILSS LIIDAMGNFSP+VKQIRRGRKPDGICLVVGSCCR
Sbjct: 181  CSVSHISVYEDSAVLELADGKILSSRLIIDAMGNFSPIVKQIRRGRKPDGICLVVGSCCR 240

Query: 866  GFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXX 1045
            GF EN+TSDVI+SSASVKRVGQS AQYFWEAFPAGSGPL+RTTYMFTYLY          
Sbjct: 241  GFTENSTSDVIYSSASVKRVGQSEAQYFWEAFPAGSGPLDRTTYMFTYLYPEPGCPLLEE 300

Query: 1046 XXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
               DYWDLMP+YQ VSLDDL+VLRVIYGI
Sbjct: 301  LLEDYWDLMPKYQGVSLDDLDVLRVIYGI 329


>ref|XP_022859436.1| uncharacterized protein LOC111380178 [Olea europaea var. sylvestris]
          Length = 598

 Score =  489 bits (1260), Expect = e-167
 Identities = 246/353 (69%), Positives = 277/353 (78%)
 Frame = +2

Query: 74   PPLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXX 253
            P L  +R+ Y+ AQ++ PSRTQRIME+I VSGEVGGAGGAYSYNALKRLD LWSSIC   
Sbjct: 27   PSLQRKRRNYLNAQAS-PSRTQRIMENIAVSGEVGGAGGAYSYNALKRLDQLWSSICSPP 85

Query: 254  XXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKS 433
                          GLF +S++AEK    FD++VCGGTLGIFIA ALS KGLRVGIVE++
Sbjct: 86   TSIPEPQQVVSKVPGLFHDSDQAEKYVGTFDIVVCGGTLGIFIATALSSKGLRVGIVERT 145

Query: 434  VLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLG 613
            +L+GREQEWNISRK             DDIDH  SSSFNPNRCGFEG+GEIWV DILNLG
Sbjct: 146  ILKGREQEWNISRKELLELVEVGILAEDDIDHAISSSFNPNRCGFEGRGEIWVRDILNLG 205

Query: 614  VSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFS 793
            VSP KLIEIMR+RF+SLGG I+E CSVS I  Y+D+AVLQL DGKILSS LIIDAMGNFS
Sbjct: 206  VSPVKLIEIMRERFNSLGGIIFEDCSVSRICTYDDSAVLQLADGKILSSRLIIDAMGNFS 265

Query: 794  PVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGS 973
            PVVKQIRRGRKPDG+CLVVGSCCRGFK+N+TSDVIFS+ASVK VG S  QYFWEAFPAGS
Sbjct: 266  PVVKQIRRGRKPDGVCLVVGSCCRGFKDNSTSDVIFSNASVKSVGMSSVQYFWEAFPAGS 325

Query: 974  GPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            GPL+RTTYMFTY+              DYWDLMP+YQ  SLD+L++LRVIYGI
Sbjct: 326  GPLDRTTYMFTYVDPQPGCPKLEELLEDYWDLMPKYQGTSLDNLDILRVIYGI 378


>emb|CDP11331.1| unnamed protein product [Coffea canephora]
          Length = 583

 Score =  474 bits (1220), Expect = e-161
 Identities = 240/363 (66%), Positives = 277/363 (76%), Gaps = 3/363 (0%)
 Frame = +2

Query: 53   HGSFASP---PPLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYNALKRLD 223
            +G F SP    P+P RRK ++L+Q+  PSRTQRIMESI VSGEVGGAGGAYSYNALKRLD
Sbjct: 3    NGVFGSPIRASPIPRRRKLFLLSQAI-PSRTQRIMESISVSGEVGGAGGAYSYNALKRLD 61

Query: 224  GLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCK 403
             LWS+IC                 GLF  SE AE+  D FDV+VCGGTLGIFIA ALS K
Sbjct: 62   QLWSTICSASSAVQEPQQVVSNVAGLFTNSEFAERFEDKFDVVVCGGTLGIFIATALSSK 121

Query: 404  GLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGE 583
            GLRVG+VE++VL+GREQEWNISRK             DDID   ++SFNPNRCGFE KGE
Sbjct: 122  GLRVGVVERNVLKGREQEWNISRKELLELVEVGILTEDDIDEAIAASFNPNRCGFESKGE 181

Query: 584  IWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSH 763
            IWV DILNLG+SPAKLIEIMR+RF+ LGG I EG SV+ I +Y+D AVL+L+ G+ILSS 
Sbjct: 182  IWVEDILNLGISPAKLIEIMRRRFEYLGGVILEGYSVASIRVYDDTAVLELSKGRILSSS 241

Query: 764  LIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQ 943
            L+IDAMGNFSP+VKQIRRGRKPDG+CLVVGSC RGFKEN+ SDVI+S+ASVK VGQS  Q
Sbjct: 242  LVIDAMGNFSPIVKQIRRGRKPDGVCLVVGSCGRGFKENSRSDVIYSNASVKEVGQSQVQ 301

Query: 944  YFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVI 1123
            YFWEAFPAGSGP +RTTYMFTY+              DYW+LMP YQ VSLD+LE+LRV+
Sbjct: 302  YFWEAFPAGSGPTDRTTYMFTYVNPQPECPKLEELLEDYWNLMPNYQGVSLDNLEILRVV 361

Query: 1124 YGI 1132
            +GI
Sbjct: 362  FGI 364


>gb|KJB10682.1| hypothetical protein B456_001G216200 [Gossypium raimondii]
          Length = 391

 Score =  464 bits (1193), Expect = e-160
 Identities = 237/349 (67%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = +2

Query: 89   RRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXX 265
            R   Y+ AQ+ A PSRTQRIMESI VSGEVGGAGGAYSYNALKRLDG+WSSIC       
Sbjct: 36   RGTIYMSAQTQAVPSRTQRIMESISVSGEVGGAGGAYSYNALKRLDGIWSSICSTQTVQQ 95

Query: 266  XXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRG 445
                      G+   S  AEK  D  DV+VCGGTLGIFIA AL  KGL+V IVE+++L+G
Sbjct: 96   APQQVVSSFPGVSSRSVLAEKQVDKCDVVVCGGTLGIFIATALIAKGLKVCIVERNILKG 155

Query: 446  REQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPA 625
            REQEWNISRK             DDI+ VT+ SFNPNRCGFE KGEIWV DILNLGVSP 
Sbjct: 156  REQEWNISRKELMELVEAGILDEDDIEEVTAVSFNPNRCGFENKGEIWVEDILNLGVSPV 215

Query: 626  KLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVK 805
            KLIEI++KRF S GG I+EGCSVS ISIY DAA+LQL +G ILSS LIIDAMGNFSPVVK
Sbjct: 216  KLIEIVKKRFVSFGGVIFEGCSVSSISIYNDAAILQLAEGNILSSRLIIDAMGNFSPVVK 275

Query: 806  QIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLE 985
            QIR GRKPDG+CLVVGSC RGFKEN+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+
Sbjct: 276  QIRGGRKPDGVCLVVGSCARGFKENSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLD 335

Query: 986  RTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            RTTYMFTY+              DYWDLMP+YQ VS+D LE+LRVIYGI
Sbjct: 336  RTTYMFTYVNPQPTSPKLEELLEDYWDLMPKYQGVSMDSLEILRVIYGI 384


>ref|XP_012489547.1| PREDICTED: uncharacterized protein LOC105802425 isoform X3 [Gossypium
            raimondii]
          Length = 409

 Score =  464 bits (1193), Expect = e-160
 Identities = 237/349 (67%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = +2

Query: 89   RRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXX 265
            R   Y+ AQ+ A PSRTQRIMESI VSGEVGGAGGAYSYNALKRLDG+WSSIC       
Sbjct: 36   RGTIYMSAQTQAVPSRTQRIMESISVSGEVGGAGGAYSYNALKRLDGIWSSICSTQTVQQ 95

Query: 266  XXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRG 445
                      G+   S  AEK  D  DV+VCGGTLGIFIA AL  KGL+V IVE+++L+G
Sbjct: 96   APQQVVSSFPGVSSRSVLAEKQVDKCDVVVCGGTLGIFIATALIAKGLKVCIVERNILKG 155

Query: 446  REQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPA 625
            REQEWNISRK             DDI+ VT+ SFNPNRCGFE KGEIWV DILNLGVSP 
Sbjct: 156  REQEWNISRKELMELVEAGILDEDDIEEVTAVSFNPNRCGFENKGEIWVEDILNLGVSPV 215

Query: 626  KLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVK 805
            KLIEI++KRF S GG I+EGCSVS ISIY DAA+LQL +G ILSS LIIDAMGNFSPVVK
Sbjct: 216  KLIEIVKKRFVSFGGVIFEGCSVSSISIYNDAAILQLAEGNILSSRLIIDAMGNFSPVVK 275

Query: 806  QIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLE 985
            QIR GRKPDG+CLVVGSC RGFKEN+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+
Sbjct: 276  QIRGGRKPDGVCLVVGSCARGFKENSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLD 335

Query: 986  RTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            RTTYMFTY+              DYWDLMP+YQ VS+D LE+LRVIYGI
Sbjct: 336  RTTYMFTYVNPQPTSPKLEELLEDYWDLMPKYQGVSMDSLEILRVIYGI 384


>ref|XP_012489527.1| PREDICTED: uncharacterized protein LOC105802425 isoform X2 [Gossypium
            raimondii]
          Length = 437

 Score =  464 bits (1193), Expect = e-159
 Identities = 237/349 (67%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = +2

Query: 89   RRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXX 265
            R   Y+ AQ+ A PSRTQRIMESI VSGEVGGAGGAYSYNALKRLDG+WSSIC       
Sbjct: 36   RGTIYMSAQTQAVPSRTQRIMESISVSGEVGGAGGAYSYNALKRLDGIWSSICSTQTVQQ 95

Query: 266  XXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRG 445
                      G+   S  AEK  D  DV+VCGGTLGIFIA AL  KGL+V IVE+++L+G
Sbjct: 96   APQQVVSSFPGVSSRSVLAEKQVDKCDVVVCGGTLGIFIATALIAKGLKVCIVERNILKG 155

Query: 446  REQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPA 625
            REQEWNISRK             DDI+ VT+ SFNPNRCGFE KGEIWV DILNLGVSP 
Sbjct: 156  REQEWNISRKELMELVEAGILDEDDIEEVTAVSFNPNRCGFENKGEIWVEDILNLGVSPV 215

Query: 626  KLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVK 805
            KLIEI++KRF S GG I+EGCSVS ISIY DAA+LQL +G ILSS LIIDAMGNFSPVVK
Sbjct: 216  KLIEIVKKRFVSFGGVIFEGCSVSSISIYNDAAILQLAEGNILSSRLIIDAMGNFSPVVK 275

Query: 806  QIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLE 985
            QIR GRKPDG+CLVVGSC RGFKEN+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+
Sbjct: 276  QIRGGRKPDGVCLVVGSCARGFKENSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLD 335

Query: 986  RTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            RTTYMFTY+              DYWDLMP+YQ VS+D LE+LRVIYGI
Sbjct: 336  RTTYMFTYVNPQPTSPKLEELLEDYWDLMPKYQGVSMDSLEILRVIYGI 384


>ref|XP_022759033.1| uncharacterized protein LOC111305610 [Durio zibethinus]
          Length = 594

 Score =  469 bits (1207), Expect = e-159
 Identities = 246/383 (64%), Positives = 279/383 (72%), Gaps = 8/383 (2%)
 Frame = +2

Query: 8    MAMVMQMKNGGLSLFHGSFASPPPLPTRRK-------FYVLAQSTA-PSRTQRIMESIPV 163
            M MV      G+S     + S PPL  R++        Y+ AQ+ A PSRTQRIMESI V
Sbjct: 6    MVMVSLRPLNGVS----RYPSKPPLVYRKRQRALRNIIYMRAQTQAVPSRTQRIMESISV 61

Query: 164  SGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMF 343
            SGEVGGAGG YSYNALKRLD +WSSIC                 G+F  S  AEK  D  
Sbjct: 62   SGEVGGAGGTYSYNALKRLDKIWSSICSAQTVRQEPQQVVSSFPGVFSHSALAEKEVDKC 121

Query: 344  DVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDI 523
            DV+VCGGTLGIFIA ALS KGL+V IVE+++L+GREQEWNISRK             DDI
Sbjct: 122  DVVVCGGTLGIFIATALSAKGLKVSIVERNILKGREQEWNISRKELMELVEAGILDEDDI 181

Query: 524  DHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHI 703
            D  T+ SFNPNRCGFE KGEIWV DILNLGVSP KL++I+RKRF SLGG I+EGCSVS I
Sbjct: 182  DEATAVSFNPNRCGFENKGEIWVEDILNLGVSPVKLVDIVRKRFVSLGGVIFEGCSVSGI 241

Query: 704  SIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENT 883
            SIY+DAAVLQL +G ILSS LIIDAMGNFSPVVKQIR GRKPDG+CLVVGSC RGFKEN+
Sbjct: 242  SIYDDAAVLQLAEGNILSSRLIIDAMGNFSPVVKQIRGGRKPDGVCLVVGSCARGFKENS 301

Query: 884  TSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYW 1063
            TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+RT YMFTY+              DYW
Sbjct: 302  TSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLDRTIYMFTYVNPQPSSPKLEELLEDYW 361

Query: 1064 DLMPQYQDVSLDDLEVLRVIYGI 1132
            DLMP+YQ VS+D+LE+LRVIYGI
Sbjct: 362  DLMPKYQGVSMDNLEILRVIYGI 384


>ref|XP_017982540.1| PREDICTED: uncharacterized protein LOC18612333 isoform X3 [Theobroma
            cacao]
 ref|XP_017982543.1| PREDICTED: uncharacterized protein LOC18612333 isoform X3 [Theobroma
            cacao]
          Length = 408

 Score =  461 bits (1186), Expect = e-159
 Identities = 233/350 (66%), Positives = 267/350 (76%), Gaps = 1/350 (0%)
 Frame = +2

Query: 86   TRRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXX 262
            T R  Y+ AQ+ A PSRTQRIMESI V GEVGGAGGAYSY+ALKRLD +WSSIC      
Sbjct: 34   TVRNIYMKAQTQAVPSRTQRIMESISVGGEVGGAGGAYSYSALKRLDKIWSSICSAETVQ 93

Query: 263  XXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLR 442
                       G+F  S  AEK    FDV+VCGGTLGIFIA ALS KGL+V +VE+++L+
Sbjct: 94   QEPQQVVSDFPGVFSHSALAEKAVHKFDVVVCGGTLGIFIATALSVKGLKVSVVERNLLK 153

Query: 443  GREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSP 622
            GREQEWNISRK             +DI+  T+ SFNPNRCGFE KGEIWV DILNLGVSP
Sbjct: 154  GREQEWNISRKELMELVEAGILNENDIEEATAVSFNPNRCGFENKGEIWVEDILNLGVSP 213

Query: 623  AKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVV 802
             KLIEI++KRF +LGG I+EGCSVS ISIY+DAAVLQL +G ILSS LIIDAMGNFSPVV
Sbjct: 214  VKLIEIVKKRFIALGGVIFEGCSVSGISIYDDAAVLQLAEGNILSSRLIIDAMGNFSPVV 273

Query: 803  KQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPL 982
            KQIR GRKPDG+CLVVGSC  GFK+N+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL
Sbjct: 274  KQIRGGRKPDGVCLVVGSCAHGFKDNSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPL 333

Query: 983  ERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            +RTTYMFTY+              DYWDLMP+YQ VS+D+LE+LRVIYGI
Sbjct: 334  DRTTYMFTYVNPQPDSPKLEELLEDYWDLMPKYQGVSIDNLEILRVIYGI 383


>ref|XP_019264510.1| PREDICTED: uncharacterized protein LOC109242133 isoform X2 [Nicotiana
            attenuata]
          Length = 609

 Score =  467 bits (1201), Expect = e-158
 Identities = 237/378 (62%), Positives = 281/378 (74%), Gaps = 3/378 (0%)
 Frame = +2

Query: 8    MAMVMQMKNGGLSLFHGSFASPP---PLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVG 178
            MA+ +Q++    ++ +G F  P    P   R+K  + AQ+T P+RTQRIME I VSGEVG
Sbjct: 1    MALQLQLQP---TIKNGVFQYPSAIRPFGNRKKVSIQAQAT-PTRTQRIMEGIAVSGEVG 56

Query: 179  GAGGAYSYNALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVC 358
            GAGGAYSY+ALKRLD LWS IC                 G +++SE      +MFDV+VC
Sbjct: 57   GAGGAYSYSALKRLDQLWSKICSSSTVVEEPQKVVSSVPGSYKDSEHNGNLEEMFDVIVC 116

Query: 359  GGTLGIFIAAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTS 538
            GGTLGIFIA ALS KGL+VG+VE++VL+GREQEWNISRK             DDI+  T+
Sbjct: 117  GGTLGIFIATALSSKGLQVGVVERNVLKGREQEWNISRKELLELVEVGILTEDDIEEATA 176

Query: 539  SSFNPNRCGFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYED 718
            +SFNPNRCGFEGKG+IWV DILNLGVSP KL+EIM+KRF+SLGG  +EG SVS IS+YED
Sbjct: 177  ASFNPNRCGFEGKGDIWVQDILNLGVSPVKLVEIMKKRFNSLGGVTFEGYSVSSISVYED 236

Query: 719  AAVLQLTDGKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVI 898
            +AVL+L +GK L S LIIDAMGNFSP+VKQIR GRKPDG+CLVVG+CCRGFKEN TSDVI
Sbjct: 237  SAVLELKEGKTLFSRLIIDAMGNFSPIVKQIRCGRKPDGMCLVVGTCCRGFKENCTSDVI 296

Query: 899  FSSASVKRVGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQ 1078
            FSSAS+K   QSV QYFWEAFPAGSGP++RTTYMFTY+              DYWDLMP+
Sbjct: 297  FSSASIKETSQSVVQYFWEAFPAGSGPMDRTTYMFTYVDPQPGSPQLEELLEDYWDLMPK 356

Query: 1079 YQDVSLDDLEVLRVIYGI 1132
            YQ VS DDLE+LR+IYGI
Sbjct: 357  YQGVSFDDLEILRIIYGI 374


>gb|KJB10681.1| hypothetical protein B456_001G216200 [Gossypium raimondii]
          Length = 541

 Score =  464 bits (1193), Expect = e-158
 Identities = 237/349 (67%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = +2

Query: 89   RRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXX 265
            R   Y+ AQ+ A PSRTQRIMESI VSGEVGGAGGAYSYNALKRLDG+WSSIC       
Sbjct: 36   RGTIYMSAQTQAVPSRTQRIMESISVSGEVGGAGGAYSYNALKRLDGIWSSICSTQTVQQ 95

Query: 266  XXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRG 445
                      G+   S  AEK  D  DV+VCGGTLGIFIA AL  KGL+V IVE+++L+G
Sbjct: 96   APQQVVSSFPGVSSRSVLAEKQVDKCDVVVCGGTLGIFIATALIAKGLKVCIVERNILKG 155

Query: 446  REQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPA 625
            REQEWNISRK             DDI+ VT+ SFNPNRCGFE KGEIWV DILNLGVSP 
Sbjct: 156  REQEWNISRKELMELVEAGILDEDDIEEVTAVSFNPNRCGFENKGEIWVEDILNLGVSPV 215

Query: 626  KLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVK 805
            KLIEI++KRF S GG I+EGCSVS ISIY DAA+LQL +G ILSS LIIDAMGNFSPVVK
Sbjct: 216  KLIEIVKKRFVSFGGVIFEGCSVSSISIYNDAAILQLAEGNILSSRLIIDAMGNFSPVVK 275

Query: 806  QIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLE 985
            QIR GRKPDG+CLVVGSC RGFKEN+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+
Sbjct: 276  QIRGGRKPDGVCLVVGSCARGFKENSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLD 335

Query: 986  RTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            RTTYMFTY+              DYWDLMP+YQ VS+D LE+LRVIYGI
Sbjct: 336  RTTYMFTYVNPQPTSPKLEELLEDYWDLMPKYQGVSMDSLEILRVIYGI 384


>ref|XP_020416480.1| uncharacterized protein LOC18781842 isoform X3 [Prunus persica]
          Length = 448

 Score =  460 bits (1184), Expect = e-158
 Identities = 236/370 (63%), Positives = 272/370 (73%), Gaps = 6/370 (1%)
 Frame = +2

Query: 41   LSLFHGSFASPP--PLPTRR----KFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSY 202
            + L +G F  P   PL  RR    ++  L     PSRTQRIMES+ VSGEVGGAGGAYSY
Sbjct: 6    VQLINGFFQGPKQSPLSQRRARAGRYLCLQTQANPSRTQRIMESLSVSGEVGGAGGAYSY 65

Query: 203  NALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFI 382
            +ALKRLD LWSSIC                 G F  S+ A+K  D FDVL+CGGTLGIFI
Sbjct: 66   SALKRLDQLWSSICSAQTVVEEPKQVVSSVPGFFSNSDLADKAVDTFDVLICGGTLGIFI 125

Query: 383  AAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRC 562
            A AL  KGLRVGIVE++VL+GREQEWNISRK             DDI+ VT++ FNPNRC
Sbjct: 126  ATALCAKGLRVGIVERNVLKGREQEWNISRKELLELVEIGVLVEDDIELVTAAKFNPNRC 185

Query: 563  GFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTD 742
            GFEGKG+IWV DILNLGVSPAKLIE+++ RF +LGG I+EG SVS ISIYEDAAVLQL +
Sbjct: 186  GFEGKGDIWVEDILNLGVSPAKLIEVVKNRFITLGGVIFEGNSVSSISIYEDAAVLQLNE 245

Query: 743  GKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKR 922
            G IL+S LIIDAMGNFSP+VKQIR GRKPDG+CLVVGSC RGFK+N+TSDVI++S+ VK+
Sbjct: 246  GNILTSRLIIDAMGNFSPIVKQIRSGRKPDGVCLVVGSCARGFKDNSTSDVIYTSSLVKK 305

Query: 923  VGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDD 1102
            VG S AQ FWEAFPAGSGP +RTTYMFTYL              +YW LMP+YQ VSLDD
Sbjct: 306  VGASKAQLFWEAFPAGSGPADRTTYMFTYLAPQPQSPKLEELLEEYWKLMPEYQGVSLDD 365

Query: 1103 LEVLRVIYGI 1132
            LE+ RV+YGI
Sbjct: 366  LEIQRVLYGI 375


>ref|XP_021831442.1| uncharacterized protein LOC110771448 isoform X5 [Prunus avium]
 ref|XP_021831443.1| uncharacterized protein LOC110771448 isoform X5 [Prunus avium]
          Length = 400

 Score =  458 bits (1179), Expect = e-158
 Identities = 235/370 (63%), Positives = 270/370 (72%), Gaps = 6/370 (1%)
 Frame = +2

Query: 41   LSLFHGSFASPP--PLPTRR----KFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSY 202
            + L +G F  P   PL  RR    +   L     PSRTQRIMES+ VSGEVGGAGGAYSY
Sbjct: 6    VQLINGLFQGPKQSPLSQRRARAGRCLCLQTQAPPSRTQRIMESLSVSGEVGGAGGAYSY 65

Query: 203  NALKRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFI 382
            +ALKRLD LWSSIC                 G F  S+ A+K  D FDVL+CGGTLG+FI
Sbjct: 66   SALKRLDQLWSSICSAQTVVEEPKKVVSSVPGFFGNSDLADKAVDTFDVLICGGTLGVFI 125

Query: 383  AAALSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRC 562
            A AL  KGLRVGIVE++VL+GREQEWNISRK             DDI+ VT++ FNPNRC
Sbjct: 126  ATALCAKGLRVGIVERNVLKGREQEWNISRKELLELVEIGVLVEDDIELVTAAKFNPNRC 185

Query: 563  GFEGKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTD 742
            GFEGKG+IWV DILNLGVSPAKLIE+++ RF SLGG I+EG SVS ISIYEDAAVLQL +
Sbjct: 186  GFEGKGDIWVEDILNLGVSPAKLIEVVKNRFISLGGVIFEGNSVSSISIYEDAAVLQLNE 245

Query: 743  GKILSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKR 922
            G ILSS L+IDAMGNFSP+VKQIR GRKPDG+CLVVGSC RGFK+N+T DVI++S+ VK+
Sbjct: 246  GNILSSRLVIDAMGNFSPIVKQIRSGRKPDGVCLVVGSCARGFKDNSTGDVIYTSSLVKK 305

Query: 923  VGQSVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDD 1102
            VG S AQ FWEAFPAGSGP +RTTYMFTYL              +YW LMP+YQ VSLDD
Sbjct: 306  VGASEAQLFWEAFPAGSGPADRTTYMFTYLAPQPQSPKLEELLEEYWKLMPEYQGVSLDD 365

Query: 1103 LEVLRVIYGI 1132
            LE+ RV+YGI
Sbjct: 366  LEIQRVLYGI 375


>ref|XP_021989464.1| uncharacterized protein LOC110886019 isoform X2 [Helianthus annuus]
 gb|OTG12155.1| putative lycopene beta/epsilon cyclase protein [Helianthus annuus]
          Length = 594

 Score =  465 bits (1197), Expect = e-158
 Identities = 235/347 (67%), Positives = 268/347 (77%)
 Frame = +2

Query: 92   RKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXXXX 271
            +K Y+  ++ + SRTQRIMESI V GEVGGAGGAYSY ALKRLD LWSSIC         
Sbjct: 31   KKLYLQPKAMS-SRTQRIMESISVGGEVGGAGGAYSYEALKRLDNLWSSICSAQTVVQEP 89

Query: 272  XXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRGRE 451
                    GLF +S+ A K  D FDVLVCGGTLGIF+A ALS KGLRVG+VE+++L+GRE
Sbjct: 90   RQVVTNTPGLFSQSDMANKEVDKFDVLVCGGTLGIFVATALSLKGLRVGVVERNLLKGRE 149

Query: 452  QEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPAKL 631
            QEWNISRK             DDI+  T+S FNPNRCGFE KGEIWV++ILNLGVSPAKL
Sbjct: 150  QEWNISRKELLELVEVGVLTEDDIEQATTSVFNPNRCGFESKGEIWVSNILNLGVSPAKL 209

Query: 632  IEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVKQI 811
            I+I+R+RF SLGG I+EGCSVS I++Y+D AVLQL +GKILS+ LIIDAMGNFSPV+KQI
Sbjct: 210  IDIVRERFTSLGGVIFEGCSVSSINVYQDIAVLQLMEGKILSACLIIDAMGNFSPVLKQI 269

Query: 812  RRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLERT 991
            R  RKPDG+CLVVGSCCRGFKEN  SDVIFSSA VK VG S AQYFWEAFPAGSGPL+RT
Sbjct: 270  RGNRKPDGVCLVVGSCCRGFKENFKSDVIFSSAEVKPVGNSEAQYFWEAFPAGSGPLDRT 329

Query: 992  TYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            TYMFTY+              DYWDLMP+YQ+VSLDDLE+LRVIYGI
Sbjct: 330  TYMFTYVDPQPNSPKLEELLEDYWDLMPKYQEVSLDDLEILRVIYGI 376


>gb|KJB10680.1| hypothetical protein B456_001G216200 [Gossypium raimondii]
          Length = 563

 Score =  464 bits (1193), Expect = e-157
 Identities = 237/349 (67%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = +2

Query: 89   RRKFYVLAQSTA-PSRTQRIMESIPVSGEVGGAGGAYSYNALKRLDGLWSSICXXXXXXX 265
            R   Y+ AQ+ A PSRTQRIMESI VSGEVGGAGGAYSYNALKRLDG+WSSIC       
Sbjct: 36   RGTIYMSAQTQAVPSRTQRIMESISVSGEVGGAGGAYSYNALKRLDGIWSSICSTQTVQQ 95

Query: 266  XXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAALSCKGLRVGIVEKSVLRG 445
                      G+   S  AEK  D  DV+VCGGTLGIFIA AL  KGL+V IVE+++L+G
Sbjct: 96   APQQVVSSFPGVSSRSVLAEKQVDKCDVVVCGGTLGIFIATALIAKGLKVCIVERNILKG 155

Query: 446  REQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFEGKGEIWVNDILNLGVSPA 625
            REQEWNISRK             DDI+ VT+ SFNPNRCGFE KGEIWV DILNLGVSP 
Sbjct: 156  REQEWNISRKELMELVEAGILDEDDIEEVTAVSFNPNRCGFENKGEIWVEDILNLGVSPV 215

Query: 626  KLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKILSSHLIIDAMGNFSPVVK 805
            KLIEI++KRF S GG I+EGCSVS ISIY DAA+LQL +G ILSS LIIDAMGNFSPVVK
Sbjct: 216  KLIEIVKKRFVSFGGVIFEGCSVSSISIYNDAAILQLAEGNILSSRLIIDAMGNFSPVVK 275

Query: 806  QIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQSVAQYFWEAFPAGSGPLE 985
            QIR GRKPDG+CLVVGSC RGFKEN+TSDVI+SS+SVK+VG +  QYFWEAFPAGSGPL+
Sbjct: 276  QIRGGRKPDGVCLVVGSCARGFKENSTSDVIYSSSSVKKVGNAEVQYFWEAFPAGSGPLD 335

Query: 986  RTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEVLRVIYGI 1132
            RTTYMFTY+              DYWDLMP+YQ VS+D LE+LRVIYGI
Sbjct: 336  RTTYMFTYVNPQPTSPKLEELLEDYWDLMPKYQGVSMDSLEILRVIYGI 384


>ref|XP_017249143.1| PREDICTED: uncharacterized protein LOC108220020 isoform X2 [Daucus
            carota subsp. sativus]
          Length = 512

 Score =  462 bits (1188), Expect = e-157
 Identities = 235/367 (64%), Positives = 268/367 (73%)
 Frame = +2

Query: 32   NGGLSLFHGSFASPPPLPTRRKFYVLAQSTAPSRTQRIMESIPVSGEVGGAGGAYSYNAL 211
            NGG    + S A+ P    R   + +     PSRTQRIMESIPV+GEVGGAGGAYSYNAL
Sbjct: 15   NGGFQ--YKSLATSPQCHKRPTIFAVQAKAIPSRTQRIMESIPVNGEVGGAGGAYSYNAL 72

Query: 212  KRLDGLWSSICXXXXXXXXXXXXXXXXXGLFRESEKAEKCTDMFDVLVCGGTLGIFIAAA 391
            KRLD LWS IC                 GLF +S+ A+K  D FDV+VCGGTLGIFIA A
Sbjct: 73   KRLDKLWSGICSAQEVVDEPKQVVSRIPGLFSQSDLADKEVDTFDVVVCGGTLGIFIATA 132

Query: 392  LSCKGLRVGIVEKSVLRGREQEWNISRKXXXXXXXXXXXXXDDIDHVTSSSFNPNRCGFE 571
            LS KGLRVGIVEK+VL+GREQ+WNISRK             +DI+H TS++FNPNRCGFE
Sbjct: 133  LSSKGLRVGIVEKNVLKGREQDWNISRKEMLELVEVGILEEEDIEHATSATFNPNRCGFE 192

Query: 572  GKGEIWVNDILNLGVSPAKLIEIMRKRFDSLGGAIYEGCSVSHISIYEDAAVLQLTDGKI 751
            GKGEIWV +ILNLGVSP+KLIE M+ RF+S  G I EG  VS I +Y+DAA+LQL  GK 
Sbjct: 193  GKGEIWVENILNLGVSPSKLIEKMKTRFNSFDGVILEGLGVSSICVYDDAAILQLDSGKR 252

Query: 752  LSSHLIIDAMGNFSPVVKQIRRGRKPDGICLVVGSCCRGFKENTTSDVIFSSASVKRVGQ 931
            LSS L+IDAMGNFSPVVKQIR GRKPDG CLVVGSCCRGFK+N TSDVI+SSA V +VG+
Sbjct: 253  LSSRLVIDAMGNFSPVVKQIRGGRKPDGFCLVVGSCCRGFKDNKTSDVIYSSAEVMQVGE 312

Query: 932  SVAQYFWEAFPAGSGPLERTTYMFTYLYXXXXXXXXXXXXXDYWDLMPQYQDVSLDDLEV 1111
            S  QYFWEAFPAGSG ++RTTYMFTY+              DYW+LMP YQ VSLDDLE+
Sbjct: 313  SQVQYFWEAFPAGSGLMDRTTYMFTYVDPQPGSPKLEELLEDYWNLMPDYQGVSLDDLEI 372

Query: 1112 LRVIYGI 1132
            LRVIYGI
Sbjct: 373  LRVIYGI 379

