BLASTX nr result

ID: Rehmannia29_contig00035097 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00035097
         (948 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011079903.1| uncharacterized protein LOC105163303 [Sesamu...   474   e-165
gb|PIN02748.1| hypothetical protein CDL12_24735 [Handroanthus im...   439   e-151
ref|XP_012832032.1| PREDICTED: uncharacterized protein LOC105952...   422   e-144
ref|XP_012832025.1| PREDICTED: uncharacterized protein LOC105952...   422   e-144
ref|XP_022899045.1| uncharacterized protein LOC111412373 [Olea e...   382   e-129
gb|KZV22530.1| hypothetical protein F511_09052 [Dorcoceras hygro...   352   e-117
ref|XP_016498116.1| PREDICTED: uncharacterized protein LOC107816...   348   e-115
ref|XP_009626097.1| PREDICTED: uncharacterized protein LOC104116...   347   e-114
gb|EOX94676.1| 2-oxoglutarate and Fe(II)-dependent oxygenase sup...   340   e-113
ref|XP_021280709.1| uncharacterized protein LOC110413984 [Herran...   340   e-112
ref|XP_017969287.1| PREDICTED: uncharacterized protein LOC186132...   340   e-112
ref|XP_022771144.1| uncharacterized protein LOC111314255 isoform...   339   e-111
gb|EOX94675.1| 2-oxoglutarate and Fe(II)-dependent oxygenase sup...   340   e-111
ref|XP_016717870.1| PREDICTED: uncharacterized protein LOC107930...   335   e-111
ref|XP_012435825.1| PREDICTED: uncharacterized protein LOC105762...   334   e-111
ref|XP_016718793.1| PREDICTED: uncharacterized protein LOC107931...   333   e-110
ref|XP_012435824.1| PREDICTED: uncharacterized protein LOC105762...   334   e-110
ref|XP_016718788.1| PREDICTED: uncharacterized protein LOC107931...   333   e-110
gb|OMP05100.1| hypothetical protein COLO4_09049 [Corchorus olito...   336   e-110
ref|XP_016717851.1| PREDICTED: uncharacterized protein LOC107930...   335   e-110

>ref|XP_011079903.1| uncharacterized protein LOC105163303 [Sesamum indicum]
          Length = 411

 Score =  474 bits (1220), Expect = e-165
 Identities = 241/300 (80%), Positives = 254/300 (84%), Gaps = 8/300 (2%)
 Frame = -3

Query: 946  KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
            KYD EFN K+K  E+   VDGGEFKDLG AF++LGFCMMKLGLCLARVCDKLIGG ELEQ
Sbjct: 112  KYDQEFNVKVKSNEMDSAVDGGEFKDLGLAFKELGFCMMKLGLCLARVCDKLIGGCELEQ 171

Query: 766  SLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKVNLRENTKLSDGGNLNLWQQWH 587
            SLLQ GTAKGRLIHYHSV DN AIKEAA+ KRR RG KVNLREN  L D G+ +LWQQWH
Sbjct: 172  SLLQCGTAKGRLIHYHSVTDNAAIKEAASRKRRGRGGKVNLRENVPLGDDGDADLWQQWH 231

Query: 586  YDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVGE 407
            YDYGIFTILTAPMFMLS+G+D QECESP  HTYLQVFHPE NRVLMVKA K SFIVQVGE
Sbjct: 232  YDYGIFTILTAPMFMLSDGSDSQECESPRSHTYLQVFHPETNRVLMVKASKASFIVQVGE 291

Query: 406  SADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRD- 230
            SADVLSKGR+RATLHSVCR AKMDNLSRETFVVFLQPAWSKTFSLS+YP E LT  D+D 
Sbjct: 292  SADVLSKGRLRATLHSVCRHAKMDNLSRETFVVFLQPAWSKTFSLSDYPFEGLTSGDQDT 351

Query: 229  LESRNE-------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
             +S NE       ESNG TQKIHEIVPPL SRLR GMTFAEFSRETTKQYY G GLQSNR
Sbjct: 352  TKSYNEEMHGVVQESNGLTQKIHEIVPPLSSRLRDGMTFAEFSRETTKQYYSGSGLQSNR 411


>gb|PIN02748.1| hypothetical protein CDL12_24735 [Handroanthus impetiginosus]
          Length = 396

 Score =  439 bits (1129), Expect = e-151
 Identities = 225/293 (76%), Positives = 247/293 (84%), Gaps = 1/293 (0%)
 Frame = -3

Query: 946 KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
           KY  EF ++ K       VDGGEF  LGFAFQ+LGFCMM+LGL LARVCD+LIGGS+LEQ
Sbjct: 112 KYGQEFKAEFKST-----VDGGEFSGLGFAFQELGFCMMELGLSLARVCDRLIGGSQLEQ 166

Query: 766 SLLQSGTAKGRLIHYHSVADNIAIKEAA-NGKRRARGSKVNLRENTKLSDGGNLNLWQQW 590
           SLLQSG+AK RLIHYHS++DN+AIK AA NGKR  RG K NLR+N +LSD  N +LWQQW
Sbjct: 167 SLLQSGSAKVRLIHYHSISDNVAIKTAAANGKRHGRGCKANLRDNFRLSDDLNSDLWQQW 226

Query: 589 HYDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVG 410
           HYDYGIFTILT PMFMLSN ND QECESPSGHTYLQVFHPE NRVLMVKA KGSFIVQVG
Sbjct: 227 HYDYGIFTILTTPMFMLSNENDVQECESPSGHTYLQVFHPEKNRVLMVKASKGSFIVQVG 286

Query: 409 ESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRD 230
           ESADVLSKGR+RATLHSVCR  KM NLSRETFVVFLQPAWSKTFSLS+YP+  LTL +++
Sbjct: 287 ESADVLSKGRLRATLHSVCRQTKMKNLSRETFVVFLQPAWSKTFSLSDYPVGCLTLDNQN 346

Query: 229 LESRNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
            ES N+   G ++KI EIVPPLFSRLR GMTFAEFSRETTKQYYG  GLQSNR
Sbjct: 347 SESCND---GLSRKIREIVPPLFSRLRDGMTFAEFSRETTKQYYGDSGLQSNR 396


>ref|XP_012832032.1| PREDICTED: uncharacterized protein LOC105952964 isoform X2
           [Erythranthe guttata]
 gb|EYU46546.1| hypothetical protein MIMGU_mgv1a007792mg [Erythranthe guttata]
          Length = 395

 Score =  422 bits (1084), Expect = e-144
 Identities = 218/292 (74%), Positives = 243/292 (83%)
 Frame = -3

Query: 946 KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
           KYD EF+ K KG++ A +  GGEFKDLGF FQQLGF MM+LGL +ARVCD+ IGG ELEQ
Sbjct: 112 KYDQEFSYKPKGDQTATEFSGGEFKDLGFKFQQLGFLMMELGLRIARVCDESIGGCELEQ 171

Query: 766 SLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKVNLRENTKLSDGGNLNLWQQWH 587
           SLLQ GTAKGRLIHYHSV D  AIK AA   + A G KVN + +    DG + NLWQQWH
Sbjct: 172 SLLQCGTAKGRLIHYHSVTDISAIKTAA---KHAGGGKVNPKGDISSVDGDS-NLWQQWH 227

Query: 586 YDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVGE 407
           YDYGIFTILTAP+FMLS    EQECESPSGHTYLQVFHPE N  LMVKA KGSFIVQVGE
Sbjct: 228 YDYGIFTILTAPVFMLS----EQECESPSGHTYLQVFHPEMNCTLMVKASKGSFIVQVGE 283

Query: 406 SADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRDL 227
           SADVLSKG++RATLHSV RPAKM++LSRETFVVFLQPAW+K FSLS+YP++ LTL D+D 
Sbjct: 284 SADVLSKGKLRATLHSVVRPAKMEHLSRETFVVFLQPAWTKRFSLSDYPVDCLTLRDQDS 343

Query: 226 ESRNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
           ES N E+NG ++KIHEIVPPLFSRLR GMTFAEFSRETTK+YYGG GLQSNR
Sbjct: 344 ESCNGETNGLSRKIHEIVPPLFSRLRDGMTFAEFSRETTKRYYGGQGLQSNR 395


>ref|XP_012832025.1| PREDICTED: uncharacterized protein LOC105952964 isoform X1
           [Erythranthe guttata]
          Length = 397

 Score =  422 bits (1084), Expect = e-144
 Identities = 218/292 (74%), Positives = 243/292 (83%)
 Frame = -3

Query: 946 KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
           KYD EF+ K KG++ A +  GGEFKDLGF FQQLGF MM+LGL +ARVCD+ IGG ELEQ
Sbjct: 114 KYDQEFSYKPKGDQTATEFSGGEFKDLGFKFQQLGFLMMELGLRIARVCDESIGGCELEQ 173

Query: 766 SLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKVNLRENTKLSDGGNLNLWQQWH 587
           SLLQ GTAKGRLIHYHSV D  AIK AA   + A G KVN + +    DG + NLWQQWH
Sbjct: 174 SLLQCGTAKGRLIHYHSVTDISAIKTAA---KHAGGGKVNPKGDISSVDGDS-NLWQQWH 229

Query: 586 YDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVGE 407
           YDYGIFTILTAP+FMLS    EQECESPSGHTYLQVFHPE N  LMVKA KGSFIVQVGE
Sbjct: 230 YDYGIFTILTAPVFMLS----EQECESPSGHTYLQVFHPEMNCTLMVKASKGSFIVQVGE 285

Query: 406 SADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRDL 227
           SADVLSKG++RATLHSV RPAKM++LSRETFVVFLQPAW+K FSLS+YP++ LTL D+D 
Sbjct: 286 SADVLSKGKLRATLHSVVRPAKMEHLSRETFVVFLQPAWTKRFSLSDYPVDCLTLRDQDS 345

Query: 226 ESRNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
           ES N E+NG ++KIHEIVPPLFSRLR GMTFAEFSRETTK+YYGG GLQSNR
Sbjct: 346 ESCNGETNGLSRKIHEIVPPLFSRLRDGMTFAEFSRETTKRYYGGQGLQSNR 397


>ref|XP_022899045.1| uncharacterized protein LOC111412373 [Olea europaea var. sylvestris]
 ref|XP_022899046.1| uncharacterized protein LOC111412373 [Olea europaea var. sylvestris]
          Length = 412

 Score =  382 bits (982), Expect = e-129
 Identities = 196/299 (65%), Positives = 227/299 (75%), Gaps = 7/299 (2%)
 Frame = -3

Query: 946  KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
            KY+      +  E++  + +  EFK+LG  F++LGFCMM+LGL +AR+CD++IGG ELEQ
Sbjct: 115  KYEQRTCDGVNNEKMEGEFEDEEFKNLGSVFKELGFCMMELGLRVARICDRVIGGHELEQ 174

Query: 766  SLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKVNLRENTKLSDGGNLNLWQQWH 587
            SLL SGTAKGRLIHYHS  DN  IKEAA  K R R S+VNL     + D   L+LWQQWH
Sbjct: 175  SLLLSGTAKGRLIHYHSRIDNFVIKEAAKRKGR-RISQVNLGNEVTVFDEKKLDLWQQWH 233

Query: 586  YDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVGE 407
            YDYGIFTIL APMFMLSN N E ECE P+ HTYLQ+  PE NRVLM+KA   SFIVQVGE
Sbjct: 234  YDYGIFTILAAPMFMLSNANFELECEYPTDHTYLQILDPEKNRVLMMKASMESFIVQVGE 293

Query: 406  SADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRDL 227
            SADVLSKG++RA LH VCRP K +NLSRETFVVFLQPAWSKTF LS+YP+ERLT   +D 
Sbjct: 294  SADVLSKGKLRAALHCVCRPKKTENLSRETFVVFLQPAWSKTFHLSDYPMERLTSSSQDS 353

Query: 226  ES-------RNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
            ES         ++SNG TQKIH++VPPL SRL+ GMTFAEFSRETTKQYYG  GLQ NR
Sbjct: 354  ESYLEGTHGARKDSNGLTQKIHQLVPPLHSRLKDGMTFAEFSRETTKQYYGSSGLQPNR 412


>gb|KZV22530.1| hypothetical protein F511_09052 [Dorcoceras hygrometricum]
          Length = 403

 Score =  352 bits (902), Expect = e-117
 Identities = 181/299 (60%), Positives = 221/299 (73%), Gaps = 7/299 (2%)
 Frame = -3

Query: 946 KYDHEFNSKIKGEEIARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQ 767
           KYD EF+ K   E +A + DG EF +LG AF +LG CMM+LGL L+RVCDK + G +LEQ
Sbjct: 112 KYDQEFDGKFC-ESMAAEADGSEFSNLGLAFHELGSCMMELGLLLSRVCDKKLVGCDLEQ 170

Query: 766 SLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKVNLRENTKLSDGGNLNLWQQWH 587
            LL+ GTAK RLIHYHS+ DN+ IKE   GK R+       R N  + DG  LNLWQ+WH
Sbjct: 171 CLLECGTAKARLIHYHSILDNMIIKEKCWGKGRSAID----RRNFGIDDG--LNLWQKWH 224

Query: 586 YDYGIFTILTAPMFMLSNGNDEQECESPSGHTYLQVFHPEANRVLMVKAPKGSFIVQVGE 407
           YDYGIFTILTAPMFMLS+G+D +EC SP  HTYLQ+ HP    VL VKA +GSF+VQVGE
Sbjct: 225 YDYGIFTILTAPMFMLSDGSDVKECNSPMCHTYLQILHPRKGCVLSVKAAEGSFVVQVGE 284

Query: 406 SADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLTLVDRDL 227
           SADVLS+G++RATLH V RPAKMDN+SRETF +FLQPAW+KTFSL  YPIE L   D+++
Sbjct: 285 SADVLSQGKLRATLHCVSRPAKMDNVSRETFAMFLQPAWNKTFSLKEYPIEHLNRGDQNV 344

Query: 226 ESRNE-------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
           ES  E       ES+   + I  +VPPL SRL+ GMTFA+F++ETT++YYGG G QS+R
Sbjct: 345 ESECEGTRDAENESDEMIRNISRMVPPLSSRLKDGMTFADFAKETTRKYYGGSGSQSSR 403


>ref|XP_016498116.1| PREDICTED: uncharacterized protein LOC107816885 [Nicotiana tabacum]
          Length = 445

 Score =  348 bits (893), Expect = e-115
 Identities = 180/305 (59%), Positives = 217/305 (71%), Gaps = 31/305 (10%)
 Frame = -3

Query: 892  VDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHSV 713
            VD  EFK LG  F++LG+CMM +GL LA++CDK+IGG EL+QSLL+SGTAKGRLIHYHS 
Sbjct: 141  VDQDEFKKLGCTFKELGYCMMDIGLRLAQICDKIIGGQELQQSLLESGTAKGRLIHYHSA 200

Query: 712  ADNIAIKEAA--NGKRRARGSKVNLRENTKLSDGG---------NLNLWQQWHYDYGIFT 566
             DN  I+EAA  NG  ++R  KVN  E       G         +  LWQQWHYDYGIFT
Sbjct: 201  VDNDIIREAAKRNGYVKSRNGKVNKNEQASTKQQGTDLSKDQSNDYGLWQQWHYDYGIFT 260

Query: 565  ILTAPMFMLSNGND-------------EQECESPSGHTYLQVFHPEANRVLMVKAPKGSF 425
            +LT PMF+LS+  +             E E  SPSGHTYLQ+F P+ N++ MVKAP  S 
Sbjct: 261  LLTVPMFLLSSRQETPAAVNTGSPISSEHEFPSPSGHTYLQIFDPKKNQIFMVKAPSESL 320

Query: 424  IVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERLT 245
            I+QVGE+AD+LSKG++RATLH VCRP K++NLSRETFVVFLQPAWSK FSLS+YP+ERL 
Sbjct: 321  ILQVGEAADILSKGKLRATLHCVCRPPKIENLSRETFVVFLQPAWSKQFSLSDYPLERLN 380

Query: 244  LVDRD-------LESRNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHG 86
            L  +        +E   + S   +Q+I +IVPPL SRL+ GMTFAEFSRETTKQYYGG G
Sbjct: 381  LSSQQCGVCIEGIEESRQVSEELSQEIQKIVPPLLSRLKDGMTFAEFSRETTKQYYGGKG 440

Query: 85   LQSNR 71
            LQSNR
Sbjct: 441  LQSNR 445


>ref|XP_009626097.1| PREDICTED: uncharacterized protein LOC104116856 [Nicotiana
            tomentosiformis]
          Length = 446

 Score =  347 bits (891), Expect = e-114
 Identities = 180/306 (58%), Positives = 217/306 (70%), Gaps = 32/306 (10%)
 Frame = -3

Query: 892  VDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHSV 713
            VD  EFK LG  F++LG+CMM +GL LA++CDK+IGG EL+QSLL+SGTAKGRLIHYHS 
Sbjct: 141  VDQDEFKKLGCTFKELGYCMMDIGLRLAQICDKVIGGQELQQSLLESGTAKGRLIHYHSA 200

Query: 712  ADNIAIKEAA--NGKRRARGSKVNLRENTKLSDGGNLNL----------WQQWHYDYGIF 569
             DN  I+EAA  NG  ++R  KVN  E       G  +L          WQQWHYDYGIF
Sbjct: 201  VDNDIIREAAKRNGYVKSRNGKVNKNEQASTKQQGTTDLSKDQSNDYGLWQQWHYDYGIF 260

Query: 568  TILTAPMFMLSNGND-------------EQECESPSGHTYLQVFHPEANRVLMVKAPKGS 428
            T+LT PMF+LS+  +             E E  SPSGHTYLQ+F P+ N++ MVKAP  S
Sbjct: 261  TLLTVPMFLLSSRQETLAAVNTGSPISSEHEFPSPSGHTYLQIFDPKKNQIFMVKAPSES 320

Query: 427  FIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERL 248
             I+QVGE+AD+LSKG++RATLH VCRP K++NLSRETFVVFLQPAWSK FSLS+YP+ERL
Sbjct: 321  LILQVGEAADILSKGKLRATLHCVCRPPKIENLSRETFVVFLQPAWSKQFSLSDYPLERL 380

Query: 247  TLVDRD-------LESRNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGH 89
             L  +        +E   + S   +Q+I +IVPPL SRL+ GMTFAEFSRETTKQYYGG 
Sbjct: 381  NLSSQQCGVCIEGIEQSRQVSEELSQEIQKIVPPLLSRLKDGMTFAEFSRETTKQYYGGK 440

Query: 88   GLQSNR 71
            GLQSNR
Sbjct: 441  GLQSNR 446


>gb|EOX94676.1| 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein,
           putative isoform 2 [Theobroma cacao]
          Length = 341

 Score =  340 bits (872), Expect = e-113
 Identities = 178/316 (56%), Positives = 222/316 (70%), Gaps = 33/316 (10%)
 Frame = -3

Query: 919 IKGEEIAR--DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGT 746
           ++ E I R  D +  EF DL   F+ LGFCMM+LGLCLAR+CD+ IGG+ELEQSLL+S  
Sbjct: 26  LENENICRISDFEDDEFDDLENMFKALGFCMMELGLCLARICDRAIGGNELEQSLLESCA 85

Query: 745 AKGRLIHYHSVADNIAIKEAANGKRRARGSKVNL-RENTKLSDGGNL------------- 608
           AKGRLIHYHS+ D++ ++EA   K  ++    N  R   +LS   NL             
Sbjct: 86  AKGRLIHYHSIVDSLVLREAGRRKGSSKRHANNYSRSEQRLSKVANLDTNVNEVRSYDMQ 145

Query: 607 -NLWQQWHYDYGIFTILTAPMFMLSN----GNDE------QECESPSGHTYLQVFHPEAN 461
            NLWQQWHYDYGIFT+LT PMF+L++     N+E      QEC SPSGH+YLQ+FHP  +
Sbjct: 146 ANLWQQWHYDYGIFTVLTDPMFLLASQPTTANNEFSISRYQECASPSGHSYLQIFHPNKS 205

Query: 460 RVLMVKAPKGSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKT 281
           +VL VK+   S I+QVGESAD+LSKG++R+TLH VCRPA++DN+ RETFVVFLQPAWSKT
Sbjct: 206 KVLTVKSSPESLIIQVGESADILSKGKLRSTLHCVCRPARLDNICRETFVVFLQPAWSKT 265

Query: 280 FSLSNYPIERLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSR 119
           FS+S+YP+E    V + LE   E      + N  TQ+I +IVPPL +R + GMTFAEFSR
Sbjct: 266 FSISDYPMEHYNPVCQPLEQAEERNVADQDQNALTQEIQKIVPPLSARFKDGMTFAEFSR 325

Query: 118 ETTKQYYGGHGLQSNR 71
           ETTKQYYGG GLQSNR
Sbjct: 326 ETTKQYYGGSGLQSNR 341


>ref|XP_021280709.1| uncharacterized protein LOC110413984 [Herrania umbratica]
          Length = 445

 Score =  340 bits (873), Expect = e-112
 Identities = 180/316 (56%), Positives = 223/316 (70%), Gaps = 33/316 (10%)
 Frame = -3

Query: 919  IKGEEIAR--DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGT 746
            ++ E+I R  D +  EF DL   F+ LG CMM+LGLCLAR+CD+ IGG+ELEQSLL+S  
Sbjct: 130  LENEKICRISDFEDDEFDDLENMFKALGLCMMELGLCLARICDRAIGGNELEQSLLKSCA 189

Query: 745  AKGRLIHYHSVADNIAIKEAANGKRRARGSKVNL-RENTKLSDGGNL------------- 608
            AKGRLIHYHS+ D++ ++EA   K  ++    N  R   +LS   NL             
Sbjct: 190  AKGRLIHYHSIVDSLVLREAGRRKGSSKRHANNYARSEQRLSKVANLDTNVNEVRSYDTQ 249

Query: 607  -NLWQQWHYDYGIFTILTAPMFMLSN----GNDE------QECESPSGHTYLQVFHPEAN 461
             NLWQQWHYDYGIFT+LT PMF+LS+     +DE      QEC SPSGH+YLQ+FHP  N
Sbjct: 250  PNLWQQWHYDYGIFTVLTDPMFLLSSQPTTASDEFSISRYQECASPSGHSYLQIFHPNKN 309

Query: 460  RVLMVKAPKGSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKT 281
            +VLMVK+   S IVQVGESADVLSKG++R+TLH VCRPA++DN+ RETFVVFLQPAWSKT
Sbjct: 310  KVLMVKSSPESLIVQVGESADVLSKGKLRSTLHCVCRPARLDNICRETFVVFLQPAWSKT 369

Query: 280  FSLSNYPIERLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSR 119
            FS+S+ P+E    V + LE   E      + N  TQ+I ++VPPL +RL+ GMTFAEFSR
Sbjct: 370  FSISDCPMEHYNAVCQPLEQAEESNVADQDQNALTQEIQKMVPPLSARLKDGMTFAEFSR 429

Query: 118  ETTKQYYGGHGLQSNR 71
            ETTKQYYGG GLQSN+
Sbjct: 430  ETTKQYYGGSGLQSNK 445


>ref|XP_017969287.1| PREDICTED: uncharacterized protein LOC18613289 [Theobroma cacao]
          Length = 441

 Score =  340 bits (872), Expect = e-112
 Identities = 178/316 (56%), Positives = 222/316 (70%), Gaps = 33/316 (10%)
 Frame = -3

Query: 919  IKGEEIAR--DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGT 746
            ++ E I R  D +  EF DL   F+ LGFCMM+LGLCLAR+CD+ IGG+ELEQSLL+S  
Sbjct: 126  LENENICRISDFEDDEFDDLENMFKALGFCMMELGLCLARICDRAIGGNELEQSLLESCA 185

Query: 745  AKGRLIHYHSVADNIAIKEAANGKRRARGSKVNL-RENTKLSDGGNL------------- 608
            AKGRLIHYHS+ D++ ++EA   K  ++    N  R   +LS   NL             
Sbjct: 186  AKGRLIHYHSIVDSLVLREAGRRKGSSKRHANNYSRSEQRLSKVANLDTNVNEVRSYDMQ 245

Query: 607  -NLWQQWHYDYGIFTILTAPMFMLSN----GNDE------QECESPSGHTYLQVFHPEAN 461
             NLWQQWHYDYGIFT+LT PMF+L++     N+E      QEC SPSGH+YLQ+FHP  +
Sbjct: 246  ANLWQQWHYDYGIFTVLTDPMFLLASQPTTANNEFSISRYQECASPSGHSYLQIFHPNKS 305

Query: 460  RVLMVKAPKGSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKT 281
            +VL VK+   S ++QVGESAD+LSKG++R+TLH VCRPA++DNL RETFVVFLQPAWSKT
Sbjct: 306  KVLTVKSSPESLLIQVGESADILSKGKLRSTLHCVCRPARLDNLCRETFVVFLQPAWSKT 365

Query: 280  FSLSNYPIERLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSR 119
            FS+S+YP+E    V + LE   E      + N  TQ+I +IVPPL +R + GMTFAEFSR
Sbjct: 366  FSISDYPMEHYNPVCQPLEQAEERNVADQDQNALTQEIQKIVPPLSARFKDGMTFAEFSR 425

Query: 118  ETTKQYYGGHGLQSNR 71
            ETTKQYYGG GLQSNR
Sbjct: 426  ETTKQYYGGSGLQSNR 441


>ref|XP_022771144.1| uncharacterized protein LOC111314255 isoform X1 [Durio zibethinus]
          Length = 443

 Score =  339 bits (870), Expect = e-111
 Identities = 180/329 (54%), Positives = 224/329 (68%), Gaps = 37/329 (11%)
 Frame = -3

Query: 946  KYDHEFNSKIKGEEI----ARDVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGS 779
            K  H  +S +  E +      D     F DL   F+ LGFCMM+LGLCLAR+CD+ IGG+
Sbjct: 117  KPSHRVDSHLNLENVNIGSTSDFQDDGFDDLENMFKALGFCMMELGLCLARICDRAIGGN 176

Query: 778  ELEQSLLQSGTAKGRLIHYHSVADNIAIKEAANGKRRARGSKV---NLRENTKLSDGGNL 608
            ELEQSLL S  AKGRLIHYHS+ D++ ++E  +G+R+    K    + R   KLS G NL
Sbjct: 177  ELEQSLLDSCAAKGRLIHYHSMVDSLVLRE--DGRRKGSSKKHANNHARSEQKLSKGANL 234

Query: 607  --------------NLWQQWHYDYGIFTILTAPMFMLSNG----------NDEQECESPS 500
                          NLWQQWHYDYGIFT+LT PMF+LS+           + +QEC SPS
Sbjct: 235  DTNGNEVRPCEIHPNLWQQWHYDYGIFTVLTDPMFLLSSQQTTANTEFPISSDQECASPS 294

Query: 499  GHTYLQVFHPEANRVLMVKAPKGSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRE 320
            GH+YLQ+FHP  N+VLMVK+   SFIVQVGESAD+LSKG++R+TLH VCRPA+++NLSRE
Sbjct: 295  GHSYLQIFHPNKNKVLMVKSSPESFIVQVGESADILSKGKLRSTLHCVCRPARLENLSRE 354

Query: 319  TFVVFLQPAWSKTFSLSNYPIERLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFS 158
            TFVVFLQPAWSKTFS+S+YP+E      + LE   E      + N  T++  + VPPL +
Sbjct: 355  TFVVFLQPAWSKTFSISDYPMEYFKPGCQPLEEAEENNFVDQDQNALTREFQKTVPPLSA 414

Query: 157  RLRHGMTFAEFSRETTKQYYGGHGLQSNR 71
            RL+ GMTFAEFSRETTKQYYGG GLQSN+
Sbjct: 415  RLKDGMTFAEFSRETTKQYYGGSGLQSNK 443


>gb|EOX94675.1| 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein,
            putative isoform 1 [Theobroma cacao]
          Length = 484

 Score =  340 bits (872), Expect = e-111
 Identities = 178/316 (56%), Positives = 222/316 (70%), Gaps = 33/316 (10%)
 Frame = -3

Query: 919  IKGEEIAR--DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGT 746
            ++ E I R  D +  EF DL   F+ LGFCMM+LGLCLAR+CD+ IGG+ELEQSLL+S  
Sbjct: 169  LENENICRISDFEDDEFDDLENMFKALGFCMMELGLCLARICDRAIGGNELEQSLLESCA 228

Query: 745  AKGRLIHYHSVADNIAIKEAANGKRRARGSKVNL-RENTKLSDGGNL------------- 608
            AKGRLIHYHS+ D++ ++EA   K  ++    N  R   +LS   NL             
Sbjct: 229  AKGRLIHYHSIVDSLVLREAGRRKGSSKRHANNYSRSEQRLSKVANLDTNVNEVRSYDMQ 288

Query: 607  -NLWQQWHYDYGIFTILTAPMFMLSN----GNDE------QECESPSGHTYLQVFHPEAN 461
             NLWQQWHYDYGIFT+LT PMF+L++     N+E      QEC SPSGH+YLQ+FHP  +
Sbjct: 289  ANLWQQWHYDYGIFTVLTDPMFLLASQPTTANNEFSISRYQECASPSGHSYLQIFHPNKS 348

Query: 460  RVLMVKAPKGSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKT 281
            +VL VK+   S I+QVGESAD+LSKG++R+TLH VCRPA++DN+ RETFVVFLQPAWSKT
Sbjct: 349  KVLTVKSSPESLIIQVGESADILSKGKLRSTLHCVCRPARLDNICRETFVVFLQPAWSKT 408

Query: 280  FSLSNYPIERLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSR 119
            FS+S+YP+E    V + LE   E      + N  TQ+I +IVPPL +R + GMTFAEFSR
Sbjct: 409  FSISDYPMEHYNPVCQPLEQAEERNVADQDQNALTQEIQKIVPPLSARFKDGMTFAEFSR 468

Query: 118  ETTKQYYGGHGLQSNR 71
            ETTKQYYGG GLQSNR
Sbjct: 469  ETTKQYYGGSGLQSNR 484


>ref|XP_016717870.1| PREDICTED: uncharacterized protein LOC107930660 isoform X3
           [Gossypium hirsutum]
          Length = 366

 Score =  335 bits (860), Expect = e-111
 Identities = 178/306 (58%), Positives = 214/306 (69%), Gaps = 31/306 (10%)
 Frame = -3

Query: 895 DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
           D+    F +L   F+ LG CMM++GLCLAR+CD  IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 61  DIKDDVFDNLENMFKALGLCMMEIGLCLARICDMAIGGNELEQSLLESCAAKGRLIHYHS 120

Query: 715 VADNIAIKEAA----NGKRRARG---SKVNLRENTKLSDGGNL--------NLWQQWHYD 581
           + D++ ++EA     + KR A     SK NL +   L   GN         NLWQQWHYD
Sbjct: 121 MVDSLVLREAGPKKGSSKRNANNHARSKENLLKGANLDTNGNEVRLCEIHPNLWQQWHYD 180

Query: 580 YGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPKG 431
           YGIFT+LT PMF+LS+           +  QEC SPSGH+YLQVFHP  N+VLMVKA   
Sbjct: 181 YGIFTLLTDPMFLLSSHRTTVKSEFSNSSGQECASPSGHSYLQVFHPNKNKVLMVKASPE 240

Query: 430 SFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIER 251
           SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E 
Sbjct: 241 SFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPMEH 300

Query: 250 LTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGH 89
                  LE   E      + N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG 
Sbjct: 301 YNPSVHHLEQAEEHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGGS 360

Query: 88  GLQSNR 71
           GLQSN+
Sbjct: 361 GLQSNK 366


>ref|XP_012435825.1| PREDICTED: uncharacterized protein LOC105762594 isoform X4
           [Gossypium raimondii]
 ref|XP_012435826.1| PREDICTED: uncharacterized protein LOC105762594 isoform X4
           [Gossypium raimondii]
 gb|KJB46943.1| hypothetical protein B456_008G002600 [Gossypium raimondii]
          Length = 341

 Score =  334 bits (857), Expect = e-111
 Identities = 176/306 (57%), Positives = 216/306 (70%), Gaps = 31/306 (10%)
 Frame = -3

Query: 895 DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
           D++   F +L   F+ LG CMM++GLCLAR+CD  IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 36  DIEDDVFDNLENMFKALGLCMMEIGLCLARICDMAIGGNELEQSLLESCAAKGRLIHYHS 95

Query: 715 VADNIAIKEAA----NGKRRARG---SKVNLRENTKLSDGGNL--------NLWQQWHYD 581
           + D++ ++EA     + KR A     SK NL +   L   GN         NLWQQWH+D
Sbjct: 96  MVDSLVLREAGPKKGSSKRNANNHARSKENLLKGANLDTNGNEVRLREIHPNLWQQWHFD 155

Query: 580 YGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPKG 431
           YGIFT+LT PMF+LS+           +  QEC SPSGH+YLQVFHP  N+VLMVKA   
Sbjct: 156 YGIFTLLTDPMFLLSSHRTTVKSEFSNSSGQECASPSGHSYLQVFHPNKNKVLMVKASPE 215

Query: 430 SFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIER 251
           SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E 
Sbjct: 216 SFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPMEH 275

Query: 250 LTLVDRDLES------RNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGH 89
                  LE        +++ N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG 
Sbjct: 276 YNPSVHHLEQAEDHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGGS 335

Query: 88  GLQSNR 71
           GLQSN+
Sbjct: 336 GLQSNK 341


>ref|XP_016718793.1| PREDICTED: uncharacterized protein LOC107931406 isoform X4
           [Gossypium hirsutum]
 ref|XP_016718799.1| PREDICTED: uncharacterized protein LOC107931406 isoform X4
           [Gossypium hirsutum]
          Length = 341

 Score =  333 bits (855), Expect = e-110
 Identities = 177/307 (57%), Positives = 214/307 (69%), Gaps = 32/307 (10%)
 Frame = -3

Query: 895 DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
           D +   F +L   F+ LG CMM++GLCLAR+CD+ IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 36  DFEDDVFDNLENMFKALGLCMMEIGLCLARICDRAIGGNELEQSLLESCAAKGRLIHYHS 95

Query: 715 VADNIAIKEAA--------NGKRRARGSKVNLRENTKLSDGGNL--------NLWQQWHY 584
           + D++ ++EA         N    AR SK NL +   L   GN         NLWQQWHY
Sbjct: 96  MVDSLVLREAGPKKGSSKRNPNNHAR-SKENLMKGANLDTIGNEVRLCEIHPNLWQQWHY 154

Query: 583 DYGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPK 434
           DYGIFT+LT PMF+LS+           + +QEC SPSGH+YLQVFHP  N+V MVKA  
Sbjct: 155 DYGIFTLLTDPMFLLSSHRTTAKSEFSNSSDQECASPSGHSYLQVFHPNKNKVFMVKASP 214

Query: 433 GSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIE 254
            SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E
Sbjct: 215 ESFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPME 274

Query: 253 RLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGG 92
                   LE   E      + N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG
Sbjct: 275 HYNPSVHHLEQAEEHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGG 334

Query: 91  HGLQSNR 71
            GLQSN+
Sbjct: 335 SGLQSNK 341


>ref|XP_012435824.1| PREDICTED: uncharacterized protein LOC105762594 isoform X3
           [Gossypium raimondii]
 gb|KJB46944.1| hypothetical protein B456_008G002600 [Gossypium raimondii]
          Length = 366

 Score =  334 bits (857), Expect = e-110
 Identities = 176/306 (57%), Positives = 216/306 (70%), Gaps = 31/306 (10%)
 Frame = -3

Query: 895 DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
           D++   F +L   F+ LG CMM++GLCLAR+CD  IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 61  DIEDDVFDNLENMFKALGLCMMEIGLCLARICDMAIGGNELEQSLLESCAAKGRLIHYHS 120

Query: 715 VADNIAIKEAA----NGKRRARG---SKVNLRENTKLSDGGNL--------NLWQQWHYD 581
           + D++ ++EA     + KR A     SK NL +   L   GN         NLWQQWH+D
Sbjct: 121 MVDSLVLREAGPKKGSSKRNANNHARSKENLLKGANLDTNGNEVRLREIHPNLWQQWHFD 180

Query: 580 YGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPKG 431
           YGIFT+LT PMF+LS+           +  QEC SPSGH+YLQVFHP  N+VLMVKA   
Sbjct: 181 YGIFTLLTDPMFLLSSHRTTVKSEFSNSSGQECASPSGHSYLQVFHPNKNKVLMVKASPE 240

Query: 430 SFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIER 251
           SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E 
Sbjct: 241 SFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPMEH 300

Query: 250 LTLVDRDLES------RNEESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGH 89
                  LE        +++ N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG 
Sbjct: 301 YNPSVHHLEQAEDHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGGS 360

Query: 88  GLQSNR 71
           GLQSN+
Sbjct: 361 GLQSNK 366


>ref|XP_016718788.1| PREDICTED: uncharacterized protein LOC107931406 isoform X3
           [Gossypium hirsutum]
          Length = 366

 Score =  333 bits (855), Expect = e-110
 Identities = 177/307 (57%), Positives = 214/307 (69%), Gaps = 32/307 (10%)
 Frame = -3

Query: 895 DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
           D +   F +L   F+ LG CMM++GLCLAR+CD+ IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 61  DFEDDVFDNLENMFKALGLCMMEIGLCLARICDRAIGGNELEQSLLESCAAKGRLIHYHS 120

Query: 715 VADNIAIKEAA--------NGKRRARGSKVNLRENTKLSDGGNL--------NLWQQWHY 584
           + D++ ++EA         N    AR SK NL +   L   GN         NLWQQWHY
Sbjct: 121 MVDSLVLREAGPKKGSSKRNPNNHAR-SKENLMKGANLDTIGNEVRLCEIHPNLWQQWHY 179

Query: 583 DYGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPK 434
           DYGIFT+LT PMF+LS+           + +QEC SPSGH+YLQVFHP  N+V MVKA  
Sbjct: 180 DYGIFTLLTDPMFLLSSHRTTAKSEFSNSSDQECASPSGHSYLQVFHPNKNKVFMVKASP 239

Query: 433 GSFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIE 254
            SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E
Sbjct: 240 ESFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPME 299

Query: 253 RLTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGG 92
                   LE   E      + N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG
Sbjct: 300 HYNPSVHHLEQAEEHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGG 359

Query: 91  HGLQSNR 71
            GLQSN+
Sbjct: 360 SGLQSNK 366


>gb|OMP05100.1| hypothetical protein COLO4_09049 [Corchorus olitorius]
          Length = 442

 Score =  336 bits (861), Expect = e-110
 Identities = 174/305 (57%), Positives = 217/305 (71%), Gaps = 30/305 (9%)
 Frame = -3

Query: 895  DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
            D +  +F  L   F+ LGFCMM++GLC+AR+CD++IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 138  DFEDDDFDGLESLFKDLGFCMMEIGLCVARICDRVIGGNELEQSLLESCAAKGRLIHYHS 197

Query: 715  VADNIAIKEAANGKRRA-RGSKVNLRENTKLSDGGNL--------------NLWQQWHYD 581
            +AD++ ++EA   K    R +  + R   +LS G NL              NLWQQWHYD
Sbjct: 198  LADSLVLREAGRRKGSTKRHANNHARSEQRLSKGANLDTNGNQVSSCEIHHNLWQQWHYD 257

Query: 580  YGIFTILTAPMFMLSNGNDE---------QECESPSGHTYLQVFHPEANRVLMVKAPKGS 428
            YGIFT+LT PMF+LS+   E         QEC SPSG++YLQ++ P+ N+VLMVK+   S
Sbjct: 258  YGIFTVLTDPMFLLSSQPTEINEVSNSSYQECASPSGNSYLQIYDPDKNKVLMVKSSAES 317

Query: 427  FIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIERL 248
            FIVQVGESAD+LSKGR+R+TLH VCRPA++DNLSRETFVVFLQPAWSKTFS S+Y +E  
Sbjct: 318  FIVQVGESADILSKGRLRSTLHCVCRPARLDNLSRETFVVFLQPAWSKTFSFSDYSLENY 377

Query: 247  TLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGHG 86
                + LE   E      E N   ++I +IVPPL +RL+ GMTFAEFSRETTKQYYGG G
Sbjct: 378  NSGCQSLEKAEENTNADQEQNSLGREIQKIVPPLSARLKDGMTFAEFSRETTKQYYGGSG 437

Query: 85   LQSNR 71
            LQSN+
Sbjct: 438  LQSNK 442


>ref|XP_016717851.1| PREDICTED: uncharacterized protein LOC107930660 isoform X1 [Gossypium
            hirsutum]
          Length = 445

 Score =  335 bits (860), Expect = e-110
 Identities = 178/306 (58%), Positives = 214/306 (69%), Gaps = 31/306 (10%)
 Frame = -3

Query: 895  DVDGGEFKDLGFAFQQLGFCMMKLGLCLARVCDKLIGGSELEQSLLQSGTAKGRLIHYHS 716
            D+    F +L   F+ LG CMM++GLCLAR+CD  IGG+ELEQSLL+S  AKGRLIHYHS
Sbjct: 140  DIKDDVFDNLENMFKALGLCMMEIGLCLARICDMAIGGNELEQSLLESCAAKGRLIHYHS 199

Query: 715  VADNIAIKEAA----NGKRRARG---SKVNLRENTKLSDGGNL--------NLWQQWHYD 581
            + D++ ++EA     + KR A     SK NL +   L   GN         NLWQQWHYD
Sbjct: 200  MVDSLVLREAGPKKGSSKRNANNHARSKENLLKGANLDTNGNEVRLCEIHPNLWQQWHYD 259

Query: 580  YGIFTILTAPMFMLSN----------GNDEQECESPSGHTYLQVFHPEANRVLMVKAPKG 431
            YGIFT+LT PMF+LS+           +  QEC SPSGH+YLQVFHP  N+VLMVKA   
Sbjct: 260  YGIFTLLTDPMFLLSSHRTTVKSEFSNSSGQECASPSGHSYLQVFHPNKNKVLMVKASPE 319

Query: 430  SFIVQVGESADVLSKGRVRATLHSVCRPAKMDNLSRETFVVFLQPAWSKTFSLSNYPIER 251
            SFIVQVGESAD+LSKG++R+TLH V RPA+ +NLSRETFVVFLQPAWSKTFS+S+YP+E 
Sbjct: 320  SFIVQVGESADILSKGKLRSTLHCVRRPARFENLSRETFVVFLQPAWSKTFSISDYPMEH 379

Query: 250  LTLVDRDLESRNE------ESNGSTQKIHEIVPPLFSRLRHGMTFAEFSRETTKQYYGGH 89
                   LE   E      + N  TQ+I +IVPPL +RL+ GMTFAEFSRETTKQYYGG 
Sbjct: 380  YNPSVHHLEQAEEHYFADQDQNALTQEIQKIVPPLSARLKDGMTFAEFSRETTKQYYGGS 439

Query: 88   GLQSNR 71
            GLQSN+
Sbjct: 440  GLQSNK 445

