BLASTX nr result

ID: Magnolia22_contig00011535 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00011535
         (879 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010249429.1 PREDICTED: uncharacterized protein LOC104591969 [...   131   3e-32
XP_010279448.1 PREDICTED: uncharacterized protein LOC104613369 [...   125   3e-30
XP_017602833.1 PREDICTED: uncharacterized protein LOC108449947 i...   102   2e-21
XP_012446400.1 PREDICTED: uncharacterized protein LOC105769961 i...   101   3e-21
XP_018856953.1 PREDICTED: uncharacterized protein LOC109019172 [...   101   4e-21
XP_010101792.1 hypothetical protein L484_018748 [Morus notabilis...   101   4e-21
XP_012446398.1 PREDICTED: uncharacterized protein LOC105769961 i...   100   5e-21
XP_016702501.1 PREDICTED: translation initiation factor IF-2 [Go...   100   8e-21
XP_012458842.1 PREDICTED: dystrophin [Gossypium raimondii] KJB13...   100   8e-21
XP_017602834.1 PREDICTED: uncharacterized protein LOC108449947 i...   100   9e-21
AGV75925.1 CFE protein [Gossypium barbadense]                         100   9e-21
AGV75920.1 CFE protein [Gossypium herbaceum]                          100   9e-21
XP_016710365.1 PREDICTED: uncharacterized protein LOC107924430 i...   100   1e-20
XP_010268822.1 PREDICTED: uncharacterized protein LOC104605671 [...   100   2e-20
XP_008777616.1 PREDICTED: uncharacterized protein LOC103697522 [...    99   3e-20
XP_016698182.1 PREDICTED: uncharacterized protein LOC107913997 i...    99   3e-20
XP_017639521.1 PREDICTED: uncharacterized protein LOC108480902 [...    97   8e-20
XP_007021270.1 PREDICTED: uncharacterized protein LOC18593823 [T...    97   1e-19
XP_006841790.2 PREDICTED: uncharacterized protein LOC18431606 [A...    96   2e-19
GAV58917.1 DUF761 domain-containing protein/DUF4408 domain-conta...    96   2e-19

>XP_010249429.1 PREDICTED: uncharacterized protein LOC104591969 [Nelumbo nucifera]
          Length = 341

 Score =  131 bits (329), Expect = 3e-32
 Identities = 97/294 (32%), Positives = 139/294 (47%), Gaps = 45/294 (15%)
 Frame = -2

Query: 878 ISFRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDS 699
           ISF+ WL+ PYL+++ NF         +FQQK  +K    G EE               +
Sbjct: 57  ISFQRWLARPYLFVVFNFIIIVIAALSNFQQKLLDKS---GSEE---------------A 98

Query: 698 DASSRLETTPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSDITCLSDSFAE 519
           ++    +    +  +W EI+ +      P E        + ESSP  WSD++C++DS  +
Sbjct: 99  ESRPEKQRRKASQGVWLEINEMEEMQVEPDEKRMVFVEKSGESSPGGWSDVSCVTDSDGK 158

Query: 518 S------PSPAVKKT-----------------------VRSLTANQLKSV---APSEPDD 435
           S       S  VK+T                       V++    +++S+   A  E +D
Sbjct: 159 SGLNSSASSAIVKRTGTYLKTARNTATESKPLGHPGRVVKATVEKKVRSIGSEAKREEND 218

Query: 434 TLDATWKAITEGRG--ARQLKKSETWDXXXXXXXXXXXXXXXXE-----LKKSETFNDES 276
           TLDATWK+I + R   ARQL+KS+TWD                      L+KSETFND +
Sbjct: 219 TLDATWKSIADRRNPTARQLRKSDTWDTPPRVVVGAESESTVTTTGGRQLRKSETFNDTA 278

Query: 275 SSASSRGGEPGMTREMSSL------SHDELNRRVEAFIQKFRLQRQDSYRQSMQ 132
           SSASS       +R    L      SHD+LNRRVEAFI+ FRLQRQ+SYR+ ++
Sbjct: 279 SSASSASSGSDSSRRSGGLRREVLMSHDDLNRRVEAFIEMFRLQRQESYRRYLE 332


>XP_010279448.1 PREDICTED: uncharacterized protein LOC104613369 [Nelumbo nucifera]
          Length = 342

 Score =  125 bits (315), Expect = 3e-30
 Identities = 103/296 (34%), Positives = 140/296 (47%), Gaps = 44/296 (14%)
 Frame = -2

Query: 875 SFRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSE-KKIEDGEEETDXXXXXXXXXXQRDS 699
           S +SWL+PPYL+I+ NF         +FQQK +E K  EDG  E +             +
Sbjct: 58  SIQSWLAPPYLFIVFNFIIIVIAVLSNFQQKITEMKNSEDGVIEIENRSLQLRRVNSGST 117

Query: 698 DASSRLETTPTTTEIWSEISSLTASSENPPESDEKPSALAVESSP----ELWSDITCLSD 531
           ++SSR +  P    +W E+       E P ESD+K S ++VE S     E W D + L+D
Sbjct: 118 ESSSRFQRMPPV--VWHEVD------EMPTESDDK-SVISVEKSVQTSLESWFDASSLTD 168

Query: 530 SFAESPSPAV-------------------KKTVRSLTANQLKSVAPSEPDDTLDATWKAI 408
           S  E P P                     +KT R  TA+  K     + ++TLD  WKAI
Sbjct: 169 S-DERPKPKPNPFEGPSAHFSVRKPMETNQKTARVTTASDEKK----DENETLDQKWKAI 223

Query: 407 TEGRG---ARQLKKSETWDXXXXXXXXXXXXXXXXE-----------LKKSETFNDESSS 270
            EG+    ARQLKK ETW+                            L+KSETFND +SS
Sbjct: 224 MEGQKKTQARQLKKCETWEVPHRAPEPDPEPALSSVTTAMTVRSRKVLRKSETFNDTASS 283

Query: 269 ASSRGGEPGMTREMSS--LSHDELNRRVEAFIQK----FRLQRQDSYRQSMQKMNR 120
            SS         + +   +SHDELNRRVEAFI++     RLQRQ+S  + ++ +NR
Sbjct: 284 VSSGSSSSRNAAQRTEVLMSHDELNRRVEAFIKRRYDELRLQRQESENRYLEMVNR 339


>XP_017602833.1 PREDICTED: uncharacterized protein LOC108449947 isoform X1
           [Gossypium arboreum]
          Length = 334

 Score =  102 bits (253), Expect = 2e-21
 Identities = 90/286 (31%), Positives = 126/286 (44%), Gaps = 35/286 (12%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGE-----EETDXXXXXXXXXX 711
           FRSWL PPYLY+++N           F Q + EK ++E  +      E            
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSRFNQNNGEKDQMEQMQPRPKISEDQQPMVEYETKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+ +  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPA--------VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITE 402
            S+I   +    E P+P+        VK    +    +  +VA  +  +TL+ TWK ITE
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKANPEANAGGRALTVAKPKRHETLENTWKMITE 227

Query: 401 GRG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEP 246
           G+    +R LKKS+TW+                 +KKSETF D ++        S     
Sbjct: 228 GKSMPLSRHLKKSDTWENHGRDINMDALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPAS 286

Query: 245 GMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           G  R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 GKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 332


>XP_012446400.1 PREDICTED: uncharacterized protein LOC105769961 isoform X2
           [Gossypium raimondii] AGV75921.1 CFE protein [Gossypium
           raimondii] KJB57628.1 hypothetical protein
           B456_009G172800 [Gossypium raimondii]
          Length = 331

 Score =  101 bits (251), Expect = 3e-21
 Identities = 87/282 (30%), Positives = 121/282 (42%), Gaps = 31/282 (10%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEE------ETDXXXXXXXXXX 711
           FRSWL PPYLY+ +N           F Q + EK   +  +      E            
Sbjct: 49  FRSWLKPPYLYVFINGIIITIAASSRFNQNNGEKDQTEQMQPRPKISEDQQPIVEYDTKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSEN--PPESDEKPSALAVESS----PELWSD 549
             DSDA    +      +   E+++  +  E+    E D   +   +  S    P     
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVATRVSEEESNVAVEDDRDGNEFVISKSEWIPPSRTDS 168

Query: 548 ITCLSDSFA--ESPSPAVKKTVRSLT-----ANQLKSVAPSEPDDTLDATWKAITEGRG- 393
              L D+    E P+P+ +   R L        +   VA  +  +TL+ TWK ITEG+  
Sbjct: 169 SEILLDALLIQEKPAPSSRFGHRKLVKVNPEGGRALKVAKPKRHETLENTWKMITEGKSM 228

Query: 392 --ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEPGMTR 234
             +R LKKS+TW+                 +KKSETF D ++        S     G  R
Sbjct: 229 PLSRHLKKSDTWENHGRDINVEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPASGKLR 287

Query: 233 EMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           +  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 288 KEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 329


>XP_018856953.1 PREDICTED: uncharacterized protein LOC109019172 [Juglans regia]
          Length = 335

 Score =  101 bits (251), Expect = 4e-21
 Identities = 88/292 (30%), Positives = 132/292 (45%), Gaps = 40/292 (13%)
 Frame = -2

Query: 875 SFRSWLSPPYLYIIVNFXXXXXXXXXSFQQKH----SEKKIEDGE----EETDXXXXXXX 720
           S  SWL PPYL+ I+N           F   H    SE+++ D +               
Sbjct: 44  SLLSWLKPPYLFFIINGIIIGIAVTSRFHHNHHHDGSERQVPDDDPPPRRTPTEEISSEL 103

Query: 719 XXXQRDSDASSRLETTPTTTEIWSEISSLT----------ASSENPPESDEKPSALAVES 570
                 +DA+ +L+    T E+ SE++             A +E   + D+   AL  +S
Sbjct: 104 SVYHYSTDATHQLKREEATYEV-SEVTKAVVVMNGYAESEAEAEAEEDDDDNDEALGWKS 162

Query: 569 S----PELWSDITCLSDSFAESPSPAV------KKTVRSLT---ANQLKSVAPSEPDDTL 429
           +      L+   T L++ F+ +  P V      +K V++     + +   VA  +  +T+
Sbjct: 163 TGNQTKRLYPSETQLTEYFSPAEKPLVSVRFGHRKPVKASPEGGSGRTLRVAKPKRHETM 222

Query: 428 DATWKAITEGRG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS--SAS 264
           + TWKAITEGR    +R ++K +TW+                 +KKSETF D ++     
Sbjct: 223 ENTWKAITEGRTMPLSRHMRKCDTWENHGHRQIDFPEDPSP--VKKSETFKDRTNLQQTP 280

Query: 263 SRGGEPGMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           S G      R+  SLS DELNRRVEAFI KF    RLQRQ+S  Q M+ +NR
Sbjct: 281 SPGSSNSKLRKEPSLSQDELNRRVEAFINKFNEEMRLQRQESLNQYMEMINR 332


>XP_010101792.1 hypothetical protein L484_018748 [Morus notabilis] EXB89647.1
           hypothetical protein L484_018748 [Morus notabilis]
          Length = 347

 Score =  101 bits (251), Expect = 4e-21
 Identities = 90/289 (31%), Positives = 127/289 (43%), Gaps = 40/289 (13%)
 Frame = -2

Query: 866 SWLSPPYLYIIVNFXXXXXXXXXSFQQK--------------HSEKKIEDGEEETDXXXX 729
           SWL PPYLY+++N             QK                E KI  GE  TD    
Sbjct: 56  SWLKPPYLYVLINGIIISIVASSKLHQKLEDPSTSPSHVAITAGEVKISGGEVRTDYAVY 115

Query: 728 XXXXXXQRDSDASSRLETTPTTTE--IWSEISSLTASSENPPESDEKPSALAVESSPE-L 558
                     D     + +  T E  +++  + +  +S +  E +E+  A+   S    L
Sbjct: 116 SGVVLTGYGYDPGEVAKVSAMTAESVVYAPAARVVEASVSAAEKEEREEAIVAASERNGL 175

Query: 557 WSDITCLSDSF-AESPSPAV------KKTVRSLTANQLKSVAPSEP--DDTLDATWKAIT 405
               + L   F +E+  P V      +K V++      K++  S+P   DTL++TWK IT
Sbjct: 176 QRKDSSLEYFFPSENEKPPVSARFGHRKAVKASPEGGGKALRVSKPKRQDTLESTWKTIT 235

Query: 404 EGRG---ARQLKKSETWDXXXXXXXXXXXXXXXXE------LKKSETFNDESSSAS-SRG 255
           EGR     R LKKS+TW+                       +KKSETF D + ++S S  
Sbjct: 236 EGRPMPLTRHLKKSDTWESHVRRGGVGIQSQNDQSTPPPAKMKKSETFADRNPNSSLSPS 295

Query: 254 GEPGMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
              G  R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q  + + R
Sbjct: 296 PGSGKLRKEPSLSQDELNRRVEAFIKKFNEEMRLQRQESLNQYQEMIRR 344


>XP_012446398.1 PREDICTED: uncharacterized protein LOC105769961 isoform X1
           [Gossypium raimondii] KJB57629.1 hypothetical protein
           B456_009G172800 [Gossypium raimondii]
          Length = 334

 Score =  100 bits (250), Expect = 5e-21
 Identities = 87/285 (30%), Positives = 121/285 (42%), Gaps = 34/285 (11%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEE------ETDXXXXXXXXXX 711
           FRSWL PPYLY+ +N           F Q + EK   +  +      E            
Sbjct: 49  FRSWLKPPYLYVFINGIIITIAASSRFNQNNGEKDQTEQMQPRPKISEDQQPIVEYDTKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSEN--PPESDEKPSALAVESS----PELWSD 549
             DSDA    +      +   E+++  +  E+    E D   +   +  S    P     
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVATRVSEEESNVAVEDDRDGNEFVISKSEWIPPSRTDS 168

Query: 548 ITCLSDSFA--ESPSPA--------VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITEG 399
              L D+    E P+P+        VK    +    +   VA  +  +TL+ TWK ITEG
Sbjct: 169 SEILLDALLIQEKPAPSSRFGHRKLVKVNPEANAGGRALKVAKPKRHETLENTWKMITEG 228

Query: 398 RG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEPG 243
           +    +R LKKS+TW+                 +KKSETF D ++        S     G
Sbjct: 229 KSMPLSRHLKKSDTWENHGRDINVEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPASG 287

Query: 242 MTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
             R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 288 KLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 332


>XP_016702501.1 PREDICTED: translation initiation factor IF-2 [Gossypium hirsutum]
          Length = 301

 Score = 99.8 bits (247), Expect = 8e-21
 Identities = 86/263 (32%), Positives = 115/263 (43%), Gaps = 14/263 (5%)
 Frame = -2

Query: 866 SWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDSDASS 687
           S+L PPYLY+++N            Q +  E       EE +          +  S+  S
Sbjct: 48  SFLRPPYLYLLINGIIISIVASSKLQAQKPEST-----EEINPSPEIVSPALKVPSEVFS 102

Query: 686 RLET--TPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSDITCLSDSFAESP 513
              +  TP  T + +E    T   E      E   A     S EL S         AE P
Sbjct: 103 NEYSYGTPAATVLVAEEIKRTVEEEKVKVVTEAAPAPLRTESMELIS-------LMAEKP 155

Query: 512 SPA----VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITEGRG---ARQLKKSETWDXX 354
             A     +K V++ T  +   V+  +  DTL+ATWK ITEGR     R LKKS+TW+  
Sbjct: 156 PVARRFGQRKAVKAATEGKALRVSKPKRHDTLEATWKTITEGRPMPLTRHLKKSDTWEQR 215

Query: 353 XXXXXXXXXXXXXXELKKSETFNDESSSAS-SRGGEPGMTREMSSLSHDELNRRVEAFIQ 177
                          +KKS+TFN+ S     +R    G  ++  SLS DELNRRVEAFI 
Sbjct: 216 TQKDHNTPPPPLPNTMKKSDTFNEHSREPPLARSSGSGKLKKDPSLSQDELNRRVEAFIT 275

Query: 176 KF----RLQRQDSYRQSMQKMNR 120
           KF    RLQRQ+S  Q  + + R
Sbjct: 276 KFNEEMRLQRQESLNQYQEMLRR 298


>XP_012458842.1 PREDICTED: dystrophin [Gossypium raimondii] KJB13356.1 hypothetical
           protein B456_002G075800 [Gossypium raimondii]
          Length = 301

 Score = 99.8 bits (247), Expect = 8e-21
 Identities = 86/263 (32%), Positives = 115/263 (43%), Gaps = 14/263 (5%)
 Frame = -2

Query: 866 SWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDSDASS 687
           S+L PPYLY+++N            Q +  E       EE +          +  S+  S
Sbjct: 48  SFLRPPYLYLLINGIIISIVASSKLQAQKPEST-----EEINPSPEIVSPALKVPSEVFS 102

Query: 686 RLET--TPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSDITCLSDSFAESP 513
              +  TP  T + +E    T   E      E   A     S EL S         AE P
Sbjct: 103 NEYSYGTPAATVLVAEEIKRTVEEEQVKVVTEAAPAPLRTESMELIS-------LMAEKP 155

Query: 512 SPA----VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITEGRG---ARQLKKSETWDXX 354
             A     +K V++ T  +   V+  +  DTL+ATWK ITEGR     R LKKS+TW+  
Sbjct: 156 PVARRFGQRKAVKAATEGKALRVSKPKRHDTLEATWKTITEGRPMPLTRHLKKSDTWEQR 215

Query: 353 XXXXXXXXXXXXXXELKKSETFNDESSSAS-SRGGEPGMTREMSSLSHDELNRRVEAFIQ 177
                          +KKS+TFN+ S     +R    G  ++  SLS DELNRRVEAFI 
Sbjct: 216 TQKDHNTPPPPLPNTMKKSDTFNEHSREPPLARSSGSGKLKKDPSLSQDELNRRVEAFIT 275

Query: 176 KF----RLQRQDSYRQSMQKMNR 120
           KF    RLQRQ+S  Q  + + R
Sbjct: 276 KFNEEMRLQRQESLNQYQEMLRR 298


>XP_017602834.1 PREDICTED: uncharacterized protein LOC108449947 isoform X2
           [Gossypium arboreum] KHG20764.1 Thymidylate kinase
           [Gossypium arboreum]
          Length = 331

 Score =  100 bits (248), Expect = 9e-21
 Identities = 91/283 (32%), Positives = 128/283 (45%), Gaps = 32/283 (11%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGE-----EETDXXXXXXXXXX 711
           FRSWL PPYLY+++N           F Q + EK ++E  +      E            
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSRFNQNNGEKDQMEQMQPRPKISEDQQPMVEYETKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+ +  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPAVKKTVRS-LTAN----QLKSVAPSEPDDTLDATWKAITEGRG 393
            S+I   +    E P+P+ +   R  + AN    +  +VA  +  +TL+ TWK ITEG+ 
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKANPEGGRALTVAKPKRHETLENTWKMITEGKS 227

Query: 392 ---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEPGMT 237
              +R LKKS+TW+                 +KKSETF D ++        S     G  
Sbjct: 228 MPLSRHLKKSDTWENHGRDINMDALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPASGKL 286

Query: 236 REMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 RKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 329


>AGV75925.1 CFE protein [Gossypium barbadense]
          Length = 331

 Score =  100 bits (248), Expect = 9e-21
 Identities = 90/283 (31%), Positives = 128/283 (45%), Gaps = 32/283 (11%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGE-----EETDXXXXXXXXXX 711
           FRSWL PPYLY+++N           F Q + EK ++E  +      E            
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSRFSQNNGEKDQMEQMQPRPKISEDQQPIVEYDTKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+++  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVATRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPAVK----KTVR-SLTANQLKSVAPSEPDDTLDATWKAITEGRG 393
            S+I   +    E P+P+ +    K V+ +    +   VA  +  +TL+ TWK ITEG+ 
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKVNPEGGRALKVAKPKRHETLENTWKMITEGKS 227

Query: 392 ---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEPGMT 237
              +R LKKS+TW+                 +KKSETF D ++        S     G  
Sbjct: 228 MPLSRHLKKSDTWENHGRDINVEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPASGKL 286

Query: 236 REMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 RKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 329


>AGV75920.1 CFE protein [Gossypium herbaceum]
          Length = 331

 Score =  100 bits (248), Expect = 9e-21
 Identities = 91/283 (32%), Positives = 128/283 (45%), Gaps = 32/283 (11%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGE-----EETDXXXXXXXXXX 711
           FRSWL PPYLY+++N           F Q + EK ++E  +      E            
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSRFNQNNGEKDQMEQMQPRPKISEDQQPMVEYETKS 108

Query: 710 QRDSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+ +  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPAVKKTVRS-LTAN----QLKSVAPSEPDDTLDATWKAITEGRG 393
            S+I   +    E P+P+ +   R  + AN    +  +VA  +  +TL+ TWK ITEG+ 
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKANPEGGRALTVAKPKRHETLENTWKMITEGKS 227

Query: 392 ---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEPGMT 237
              +R LKKS+TW+                 +KKSETF D ++        S     G  
Sbjct: 228 MPLSRHLKKSDTWENHGRDINMEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPASGKL 286

Query: 236 REMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 RKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 329


>XP_016710365.1 PREDICTED: uncharacterized protein LOC107924430 isoform X1
           [Gossypium hirsutum]
          Length = 334

 Score = 99.8 bits (247), Expect = 1e-20
 Identities = 89/286 (31%), Positives = 124/286 (43%), Gaps = 35/286 (12%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGEEETDXXXXXXXXXXQR--- 705
           FRSWL PPYLY+++N           F Q + EK ++E  +                   
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSRFNQNNGEKDQMEQMQPRPKISADQQPMVEYETKS 108

Query: 704 --DSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+ +  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPA--------VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITE 402
            S+I   +    E P+P+        VK    +    +   VA  +  +TL+ TWK ITE
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKVNPEANAGGRALKVAKPKRHETLENTWKMITE 227

Query: 401 GRG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEP 246
           G+    +R LKKS+TW+                 +KKSETF D ++        S     
Sbjct: 228 GKSMPLSRHLKKSDTWENHGRDINVEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPAS 286

Query: 245 GMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           G  R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 GKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 332


>XP_010268822.1 PREDICTED: uncharacterized protein LOC104605671 [Nelumbo nucifera]
          Length = 357

 Score = 99.8 bits (247), Expect = 2e-20
 Identities = 98/304 (32%), Positives = 128/304 (42%), Gaps = 52/304 (17%)
 Frame = -2

Query: 875 SFRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIE---------------------D 759
           S  SWL+PPYLY+I+N           FQQK  E++ E                     +
Sbjct: 52  SLLSWLTPPYLYVIINGIIITIAASSRFQQKVDEQRPEMVGAPLKAEVQPDFVVTTAVYE 111

Query: 758 GEEETDXXXXXXXXXXQRDSDASSRLETTPTTTEIWSEISSL-----------TASSENP 612
                D            D  A    E TPT  E    +S             T  +  P
Sbjct: 112 DAAVLDEHVVVERPELGYDVVAMKNPEETPTEFESLEPLSGYGQGQAEAVEVKTLLTRAP 171

Query: 611 PESDEKPSALAVESSPELWSDITCLSDSFA---ESPSPAVK---KTVR-SLTANQLKSVA 453
            E DE    ++  +      D T +   ++   E P  +V+   K VR S    +   VA
Sbjct: 172 EEYDEDDFVISRSTWTPQRRDSTEVPTEYSFPTEKPLVSVRFGRKIVRASPEGGRALRVA 231

Query: 452 PSEPDDTLDATWKAITEGRG---ARQLKKSETWDXXXXXXXXXXXXXXXXE--LKKSETF 288
             + +DTL+ TWK IT GR     R LKKSETW+                   +KK+ETF
Sbjct: 232 KPKRNDTLENTWKTITAGRPMPLTRHLKKSETWETHGRHSNPSQGTQEPSPAKVKKAETF 291

Query: 287 ND----ESSSASSRGGEPGMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQ 132
            D    E+S + S GG  G  R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q  +
Sbjct: 292 RDRTNLETSLSPSPGGS-GKFRKEPSLSQDELNRRVEAFIKKFNEEMRLQRQESLNQYRE 350

Query: 131 KMNR 120
            +NR
Sbjct: 351 MINR 354


>XP_008777616.1 PREDICTED: uncharacterized protein LOC103697522 [Phoenix
           dactylifera]
          Length = 333

 Score = 98.6 bits (244), Expect = 3e-20
 Identities = 87/290 (30%), Positives = 122/290 (42%), Gaps = 41/290 (14%)
 Frame = -2

Query: 866 SWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDSDASS 687
           SWL+PPYLY ++N           FQ     K   DG++                S A+ 
Sbjct: 58  SWLTPPYLYFVINGIILSIAASSRFQ-----KPAPDGQDPA----------RSAPSMANP 102

Query: 686 RLETTPTTTEIWSEISSLTASSENPPESDEKPS-------------ALAVESSPE----- 561
            L   P       E ++  A    P      P+             A A E+  E     
Sbjct: 103 TLTQLPEYMPPAEEYAAKIAEDVQPEPEFVAPAEYEGEEVVGLEVAAAAEEAEGEEEFVI 162

Query: 560 ---LWS-------DITCLSDSFAESPSPAV------KKTVRSLTANQLKSVAPSEPDDTL 429
               WS       +    +D  A +  P V      +K V+     +   VA ++  +TL
Sbjct: 163 SRSSWSPQRRESRETATATDYLAVTEKPLVSVRFGRRKNVKPSPEGKALRVARAKRGETL 222

Query: 428 DATWKAITEGRG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESSSASSR 258
           ++TW+ IT+GR    AR LKKS+TW+                 ++KSETFND S S SS 
Sbjct: 223 ESTWRTITDGRPVPLARHLKKSDTWETHGRSLRDEQSPAPL--MRKSETFNDSSPSPSSS 280

Query: 257 GGEPGMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
               G  R+  SL  D+LNRRVEAFI+KF    RLQRQ+S+   ++ +NR
Sbjct: 281 SSSRGRLRKEPSLGQDDLNRRVEAFIKKFNEEMRLQRQESFNHYLEMINR 330


>XP_016698182.1 PREDICTED: uncharacterized protein LOC107913997 isoform X1
           [Gossypium hirsutum]
          Length = 334

 Score = 98.6 bits (244), Expect = 3e-20
 Identities = 88/286 (30%), Positives = 123/286 (43%), Gaps = 35/286 (12%)
 Frame = -2

Query: 872 FRSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEK-KIEDGEEETDXXXXXXXXXXQR--- 705
           FRSWL PPYLY+++N           F Q + EK ++E  +                   
Sbjct: 49  FRSWLKPPYLYVVINGIIITIAASSQFNQNNGEKDQMEQMQPRPKISADQQPMVEYETKS 108

Query: 704 --DSDASSRLETTPTTTEIWSEISSLTASSE-NPPESDEKPSALAVESSPELW------- 555
             DSDA    +      +   E+ +  +  E N    D++     V S  E W       
Sbjct: 109 GWDSDAVESSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSE-WIPPSRTD 167

Query: 554 -SDITCLSDSFAESPSPA--------VKKTVRSLTANQLKSVAPSEPDDTLDATWKAITE 402
            S+I   +    E P+P+        VK    +    +    A  +  +TL+ TWK ITE
Sbjct: 168 SSEIPLDALLIQEKPAPSSRFGHRKPVKANPEANAGGRALKAAKPKRHETLENTWKMITE 227

Query: 401 GRG---ARQLKKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESS-----SASSRGGEP 246
           G+    +R LKKS+TW+                 +KKSETF D ++        S     
Sbjct: 228 GKSMPLSRHLKKSDTWENHGRDINMEALTSSPL-MKKSETFRDRTNYQLPPEQVSSFPAS 286

Query: 245 GMTREMSSLSHDELNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           G  R+  SLS DELNRRVEAFI+KF    RLQRQ+S  Q M+ +NR
Sbjct: 287 GKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVNR 332


>XP_017639521.1 PREDICTED: uncharacterized protein LOC108480902 [Gossypium
           arboreum] KHG12445.1 UDP-glucuronosyltransferase 2B1
           [Gossypium arboreum]
          Length = 301

 Score = 97.1 bits (240), Expect = 8e-20
 Identities = 80/259 (30%), Positives = 112/259 (43%), Gaps = 10/259 (3%)
 Frame = -2

Query: 866 SWLSPPYLYIIVN--FXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDSDA 693
           S+L PPYLY+++N              Q+  S ++I    E               +   
Sbjct: 48  SFLRPPYLYLLINGIIISIVASSKLQAQKPESTQQINPSPEIVSQALKVPSEVFSNEYSY 107

Query: 692 SSRLETTPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSDITCLSDSFAESP 513
                 TP  T + +E    T   E      E   A     S EL   I  ++D    + 
Sbjct: 108 G-----TPAATVLVAEEIKRTVEEEQVKVVTEAAPAPLRTESMEL---INLMADKPPVAR 159

Query: 512 SPAVKKTVRSLTANQLKSVAPSEPDDTLDATWKAITEGRG---ARQLKKSETWDXXXXXX 342
               +K V++    +   V+  +  DTL+ATWK ITEGR     R LKKS+TW+      
Sbjct: 160 RFGQRKAVKAAMEGKALRVSKPKRHDTLEATWKTITEGRPMPLTRHLKKSDTWEQRTQKD 219

Query: 341 XXXXXXXXXXELKKSETFNDESSSAS-SRGGEPGMTREMSSLSHDELNRRVEAFIQKF-- 171
                      +KKS+TFN+ S     +R    G  ++  SLS DELNRRVEAFI+KF  
Sbjct: 220 HNTPPPPLPNTMKKSDTFNEHSREPPLARSSGSGKLKKDPSLSQDELNRRVEAFIKKFNE 279

Query: 170 --RLQRQDSYRQSMQKMNR 120
             RLQRQ+S  Q  + + R
Sbjct: 280 EMRLQRQESLNQYQEMIRR 298


>XP_007021270.1 PREDICTED: uncharacterized protein LOC18593823 [Theobroma cacao]
           EOY12795.1 Uncharacterized protein TCM_031316 [Theobroma
           cacao]
          Length = 318

 Score = 97.1 bits (240), Expect = 1e-19
 Identities = 78/270 (28%), Positives = 119/270 (44%), Gaps = 21/270 (7%)
 Frame = -2

Query: 866 SWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGE---EETDXXXXXXXXXXQRDSD 696
           S+L PPYLY+++N            Q K   ++    E                    SD
Sbjct: 48  SFLRPPYLYLLINCIIISIVASSKLQHKAETQQSPSPEIVLPAVKVSSEVYSSEYSYGSD 107

Query: 695 ASSRLETTPTTTEIWSEISSLTASSENPPESDEK------PSALAVESSPELWSDITCLS 534
            S+R+      + +     ++     +  E   K      P   A   S EL   +  L+
Sbjct: 108 TSARVVVAEDLSTVEESKEAVVVDGGDEEEEQVKVVMSLPPPPPARSESMELVMSL--LN 165

Query: 533 DSFAESPSPAVK----KTVRSLTANQLKSVAPSEPDDTLDATWKAITEGRG---ARQLKK 375
           +   E P  + +    K V++ +  +   V+  +  DTL++TWK ITEGR     R LKK
Sbjct: 166 EKAGEKPPVSKRFGQRKAVKAASEGKALRVSKPKRHDTLESTWKTITEGRPMPLTRHLKK 225

Query: 374 SETWDXXXXXXXXXXXXXXXXELKKSETFNDESSSAS-SRGGEPGMTREMSSLSHDELNR 198
           S+TW+                 +KKS+TFN+  +  S +R    G  ++  SLS D+LNR
Sbjct: 226 SDTWEQRAQKDPNAPPPPLPHTVKKSDTFNERPNGTSLTRSSGSGKLKKDPSLSQDDLNR 285

Query: 197 RVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           RVEAFI+KF    RLQRQ+S+ Q  + + R
Sbjct: 286 RVEAFIKKFNEEMRLQRQESWNQYQEMIRR 315


>XP_006841790.2 PREDICTED: uncharacterized protein LOC18431606 [Amborella
           trichopoda]
          Length = 311

 Score = 96.3 bits (238), Expect = 2e-19
 Identities = 87/273 (31%), Positives = 121/273 (44%), Gaps = 24/273 (8%)
 Frame = -2

Query: 869 RSWLSPPYLYIIVNFXXXXXXXXXSFQQKHSEKKIEDGEEETDXXXXXXXXXXQRDSDAS 690
           + WL+PPYLYI++N          SFQQK  E    + E   D             +  S
Sbjct: 48  KGWLTPPYLYIVINCIIITIAASSSFQQKPEEN---NQEPLLDQKRAEIRNDYGIPAIHS 104

Query: 689 SRLETTPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSD------ITCLSDS 528
              E     TEI      +TA+ ++  E +E        S  E W        +  L +S
Sbjct: 105 PEFEIP---TEIKRPSFEITATKKSSNEEEED-----FLSRGEWWPTRRSFPMVDSLENS 156

Query: 527 FAESPSPAV------KKTVRSLTANQLKSVAPSEPD--DTLDATWKAITEGRG---ARQL 381
                 P V      +K+V++        +  S P   +TL++TWK IT+GR    AR L
Sbjct: 157 CVTEEKPPVSVRFSHRKSVKASPEGGRAGLGVSRPKRHETLESTWKTITDGRAIPLARHL 216

Query: 380 KKSETWDXXXXXXXXXXXXXXXXELKKSETFNDESSSASSRGGEPGM---TREMSSLSHD 210
           KKS+TW+                 +KKS+TF++  S A      PG     R+  SLS D
Sbjct: 217 KKSDTWETHGRSRMNDSSSPL---MKKSDTFDERRSLAGESTPSPGSGKGLRKDPSLSQD 273

Query: 209 ELNRRVEAFIQKF----RLQRQDSYRQSMQKMN 123
           ELNRRVEAFI+KF    RLQR+ S++  M  +N
Sbjct: 274 ELNRRVEAFIKKFNAEIRLQRKQSFQTYMDMVN 306


>GAV58917.1 DUF761 domain-containing protein/DUF4408 domain-containing protein
           [Cephalotus follicularis]
          Length = 325

 Score = 96.3 bits (238), Expect = 2e-19
 Identities = 80/273 (29%), Positives = 121/273 (44%), Gaps = 20/273 (7%)
 Frame = -2

Query: 878 ISFRSWLSPPYLYIIVNFXXXXXXXXXSF--QQKHSEKKIEDGEEETDXXXXXXXXXXQR 705
           +S RSWLSPPY+YII+NF          F  Q  H     ED ++  +            
Sbjct: 72  VSVRSWLSPPYIYIILNFVIITIAASSIFLHQDPHQHHLKEDDDDNNNNSNKPPTTKITH 131

Query: 704 DSDASSRLETTPTTTEIWSEISSLTASSENPPESDEKPSALAVESSPELWSDITCLSDSF 525
           + +  +    +         +  +        E+ EK    +++   E  S+ +CL+ S 
Sbjct: 132 NQENQTHFLNSLHKNHNSHHLHDIWREMIYEDENKEK----SIDPLSETSSNDSCLTHSN 187

Query: 524 AESPSPAVKKTVRSLTANQLKSVAPSEPDDTLDATWKAITEGRGAR---QLKKSETWDXX 354
             +          S     ++     E  +TL+ TW+ ITEG+      QLKKS+TWD  
Sbjct: 188 ENA----------SRRQKAVQVAEEGEDKETLEDTWRLITEGKEKTRRVQLKKSDTWDTP 237

Query: 353 XXXXXXXXXXXXXXE-----------LKKSETFNDESSSASSRGGEPGMTREMSSLSHDE 207
                         +           L+KS+TFND S S         +TR+  S+SHDE
Sbjct: 238 PRVVVKGTADTGGVDDDHPVAWARRELRKSDTFNDYSVS---------LTRD-KSMSHDE 287

Query: 206 LNRRVEAFIQKF----RLQRQDSYRQSMQKMNR 120
           LN RVEAFI+KF    RL+RQ+SY++SM+ +N+
Sbjct: 288 LNCRVEAFIKKFNNDMRLERQESYQRSMEMLNQ 320


Top