BLASTX nr result

ID: Glycyrrhiza32_contig00009190 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00009190
         (1904 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus...   632   0.0  
XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [...   629   0.0  
XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [...   626   0.0  
XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [...   624   0.0  
XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [...   620   0.0  
XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [...   579   0.0  
KHN26153.1 Putative ribonuclease H protein [Glycine soja]             566   0.0  
OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifo...   530   0.0  
GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium ...   489   e-167
XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [...   450   e-150
KHN39964.1 HVA22-like protein a [Glycine soja]                        449   e-149
XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [...   428   e-141
XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 i...   426   e-141
XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 i...   425   e-141
XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [...   423   e-140
EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma c...   423   e-140
XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 i...   422   e-139
XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 is...   421   e-139
ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]       421   e-138
XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 i...   408   e-134

>XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus vulgaris]
            ESW13277.1 hypothetical protein PHAVU_008G183100g
            [Phaseolus vulgaris]
          Length = 439

 Score =  632 bits (1631), Expect = 0.0
 Identities = 311/441 (70%), Positives = 350/441 (79%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MGWEEA YT+LT S T+L+WPPFS LCPL+ S+RA+ESDSRSSNQRCLAFWVLFSLS+IM
Sbjct: 1    MGWEEAFYTLLTNSFTLLSWPPFSFLCPLFVSVRAMESDSRSSNQRCLAFWVLFSLSMIM 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            E ELSVL +  PWWPH+K++ATILLLIPY G AP +YKFLI+ Y  W +  +T NIF+ +
Sbjct: 61   ERELSVLLNCPPWWPHLKSIATILLLIPYVGAAPCVYKFLIRPYCPWRLFTKTSNIFSEK 120

Query: 546  STHSKSDEDNKTVLXXXXXXXXXXXXG--------QTIITSHLQEEKLLVYQGRDDLAGC 701
             TH +SDED K               G        QTI  S +QE+KL  YQ  DD AGC
Sbjct: 121  GTHVESDEDGKLFDVSDQTITTSQIQGKELVDFSDQTITPSQIQEKKLEAYQ--DDSAGC 178

Query: 702  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 851
            D T SSYTRLT K+ VQKEWSCALC +STTSENCL  H++GKKHK   KE          
Sbjct: 179  DMTGSSYTRLTSKKLVQKEWSCALCQVSTTSENCLREHLKGKKHKDKEKELRVECHETNS 238

Query: 852  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1028
             YLLSSTQKRIKGMVL+RNLN+IANILNPVSRS+RWCEW KP+FGWTKLNTDGSI R+ A
Sbjct: 239  TYLLSSTQKRIKGMVLIRNLNKIANILNPVSRSVRWCEWTKPEFGWTKLNTDGSINRDVA 298

Query: 1029 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1208
             FGGLLRD++GEP+C FVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYRGEPMCGFVSKVPQGDVFLVELWAIWRGLVLCGGLGIKAIWVESDSMSVVK 358

Query: 1209 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            T+NRKQ CPKA  YLKQIWKLLKKFDKY+ISHSWRETNRAADHL+KMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKAYGYLKQIWKLLKKFDKYQISHSWRETNRAADHLSKMVVWGNDVVLWPVD 418

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +LC+IIKDDA+G KYLRR
Sbjct: 419  FPPTLCSIIKDDARGMKYLRR 439


>XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [Vigna angularis]
            KOM47518.1 hypothetical protein LR48_Vigan07g122200
            [Vigna angularis] BAT81689.1 hypothetical protein
            VIGAN_03147900 [Vigna angularis var. angularis]
          Length = 439

 Score =  629 bits (1623), Expect = 0.0
 Identities = 310/441 (70%), Positives = 351/441 (79%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MGWEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSS QRCLAFWVLFSLS I+
Sbjct: 1    MGWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSKQRCLAFWVLFSLSTIV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            EWELSVLF+ LPWWPH+K++AT+LLL+PY G AP +Y+FLI+ Y +W +  +  NI + +
Sbjct: 61   EWELSVLFNRLPWWPHLKSIATVLLLMPYVGAAPCVYRFLIRPYCSWTLFTKISNIVSEK 120

Query: 546  STHSKSDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGC 701
             T S+SDE        D                  QTI  S +QE+KL   Q  DD AGC
Sbjct: 121  GTDSESDEGAKLFDVSDQTITSSQIQEKELVDFSYQTITPSQIQEKKLEACQ--DDSAGC 178

Query: 702  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 851
            D+TESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK    E          
Sbjct: 179  DRTESSYARITRKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKETELRVEFHETNS 238

Query: 852  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1028
             YLLSSTQKRIKGMVL+RNLNQIANILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KYLLSSTQKRIKGMVLIRNLNQIANILNPVSRSIRWCEWTKPKFGWTKLNTDGSINRDSA 298

Query: 1029 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1208
             FGGLLRD+ GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1209 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            T+NRKQ CPKA+SYLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADSYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +LC+II+DDA+GKKYLRR
Sbjct: 419  FPPTLCSIIEDDARGKKYLRR 439


>XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [Cicer arietinum]
          Length = 409

 Score =  626 bits (1615), Expect = 0.0
 Identities = 310/429 (72%), Positives = 341/429 (79%), Gaps = 11/429 (2%)
 Frame = +3

Query: 198  EALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWEL 377
            E  +TILTKSLT+L WPP SLLCPLY SIRALESD RSSNQRCLAFWVLF LS+IME E 
Sbjct: 4    EIFHTILTKSLTVLVWPPISLLCPLYVSIRALESDCRSSNQRCLAFWVLFYLSMIMECEF 63

Query: 378  SVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHS 557
            +VLF+W PWWPH KAMAT LLLIP FG A YIYKFLIKHY TWNICA TLNIF  + T  
Sbjct: 64   AVLFTWPPWWPHAKAMATFLLLIPNFGAALYIYKFLIKHYCTWNICAWTLNIFYQKITRF 123

Query: 558  KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTG 737
            +SDED++ +                      QEEK LVYQGRDD A CDKT+SSY     
Sbjct: 124  ESDEDSEKLSE--------------------QEEKNLVYQGRDDHADCDKTKSSYA---S 160

Query: 738  KEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQKRIK 884
            K+QVQKEWSCALC ISTTSENCL  H+QGK+HK   KE           Y+LS TQ+RIK
Sbjct: 161  KKQVQKEWSCALCQISTTSENCLVEHLQGKQHKAKKKELRVGLRLINSPYMLSFTQERIK 220

Query: 885  GMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGE 1064
            GM LL+NLNQIANIL+PVS S  WCEW+KP+FGWTKLNTDGS+ +E A FGGLLRDH+GE
Sbjct: 221  GMTLLKNLNQIANILSPVSTSTIWCEWKKPEFGWTKLNTDGSVNKETAAFGGLLRDHRGE 280

Query: 1065 PICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAE 1244
            PIC FVSKAPQGD+FLVELWAIWRGLVLS GLGIK IWVESDSMSVV TIN+ Q CPKAE
Sbjct: 281  PICGFVSKAPQGDIFLVELWAIWRGLVLSFGLGIKSIWVESDSMSVVKTINKMQPCPKAE 340

Query: 1245 SYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDD 1424
            S L++IWKLL KF+KYRISHSWRETNRAADHLAKM LLGNDVVLWP+DFPHSLCNII++D
Sbjct: 341  SCLEKIWKLLSKFEKYRISHSWRETNRAADHLAKMALLGNDVVLWPIDFPHSLCNIIQED 400

Query: 1425 AKGKKYLRR 1451
            AKG KYLRR
Sbjct: 401  AKGTKYLRR 409


>XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [Vigna radiata var.
            radiata]
          Length = 439

 Score =  624 bits (1609), Expect = 0.0
 Identities = 309/441 (70%), Positives = 348/441 (78%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            M WEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSSNQRCLAFWVLFSLS I+
Sbjct: 1    MSWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSNQRCLAFWVLFSLSTIV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            EWELS+LF+ LPWWPH+K++AT+LLL+PY G A  IYKFLI+ Y +W +  +  NIF+ +
Sbjct: 61   EWELSLLFNCLPWWPHLKSIATVLLLMPYVGAAQCIYKFLIRPYSSWTLFTKISNIFSEK 120

Query: 546  STHSKSDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGC 701
             T  +SDE        D                  QTI  S +QE+KL   Q  D  AGC
Sbjct: 121  GTDFESDEGAKLFDVSDQTITSSQIQEKERVDFSDQTITPSQIQEKKLEACQ--DHSAGC 178

Query: 702  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--------- 854
            DKTESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK   K           
Sbjct: 179  DKTESSYARITTKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKEKNLRVEFHETNS 238

Query: 855  --LLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1028
              LLSSTQKRIKGMVL+RNLNQIA+ILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KSLLSSTQKRIKGMVLIRNLNQIASILNPVSRSIRWCEWTKPKFGWTKLNTDGSIYRDSA 298

Query: 1029 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1208
             FGGLLRDH GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDHTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1209 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            T+NRKQ CPKA++YLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADNYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +LC+II+DDAKGKKYLRR
Sbjct: 419  FPPTLCSIIEDDAKGKKYLRR 439


>XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [Glycine max]
            KRH72478.1 hypothetical protein GLYMA_02G215900 [Glycine
            max]
          Length = 470

 Score =  620 bits (1600), Expect = 0.0
 Identities = 312/470 (66%), Positives = 350/470 (74%), Gaps = 48/470 (10%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MGW EA+YT+LTKS T+L+WPP S LCPL  S+RA+ESDSRSSNQRCLAFWVLFSL +I+
Sbjct: 1    MGWIEAIYTLLTKSFTVLSWPPVSFLCPLLVSVRAMESDSRSSNQRCLAFWVLFSLCMIV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFL +HY TW++  RT NI++ +
Sbjct: 61   EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLTRHYCTWSLFTRTSNIYSEK 120

Query: 546  STHSKSDE--------DNKTV--------------------------------------L 587
            STH +SDE        D KT                                       L
Sbjct: 121  STHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILVSSFYRSIFMPLQQITVNFSEKSRHL 180

Query: 588  XXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTGKEQVQKEWSC 767
                         QTI TS ++E+KL  YQ  DD  GC KTESSYT LT K  +QKEWSC
Sbjct: 181  ESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDTTGCGKTESSYTSLTSKNLIQKEWSC 240

Query: 768  ALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSSTQKRIKGMVLLRNLNQIANILNPVS 941
            ALC ISTT+EN L AH++G+KHK    E    LSSTQKRIKGMVLL NLNQIANIL+PVS
Sbjct: 241  ALCQISTTNENFLRAHLKGRKHKDKENELRVELSSTQKRIKGMVLLTNLNQIANILDPVS 300

Query: 942  RSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVEL 1121
            RSIRWCEW KP+FGWTKLNTDGSI    A FGGLLRD++GEPICAFVSKAPQGD+FL EL
Sbjct: 301  RSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLLRDYRGEPICAFVSKAPQGDIFLAEL 360

Query: 1122 WAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAESYLKQIWKLLKKFDKYRIS 1301
            WA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  YLKQIWKLLKKFDKY+IS
Sbjct: 361  WAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQFCPKAVGYLKQIWKLLKKFDKYQIS 420

Query: 1302 HSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1451
            H+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC+IIKDDAKG KYLRR
Sbjct: 421  HTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLCSIIKDDAKGTKYLRR 470


>XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [Lupinus
            angustifolius]
          Length = 416

 Score =  579 bits (1492), Expect = 0.0
 Identities = 297/423 (70%), Positives = 333/423 (78%), Gaps = 7/423 (1%)
 Frame = +3

Query: 204  LYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWELSV 383
            L TILTKSLT L WPP SL+CPLYASIRA++SDSR SNQ+CLAFWVLFS S+IME E +V
Sbjct: 6    LLTILTKSLTFLTWPPLSLICPLYASIRAMKSDSRFSNQQCLAFWVLFSFSMIMEREFAV 65

Query: 384  LFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHSKS 563
            LF+ LPWWP+VK+MATILLLIPYFG + +IYK+LIKHY TWNIC + LNI N  STH   
Sbjct: 66   LFNRLPWWPNVKSMATILLLIPYFGGSLHIYKYLIKHYCTWNICGKDLNIVNQNSTHVVF 125

Query: 564  DEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTGKE 743
            +ED+K +              QTII S +QE+KL V QGR ++A       +YTR T   
Sbjct: 126  EEDSKLI----------HVTEQTIIPSQIQEKKLTVNQGRYEVA------VNYTRPTFTM 169

Query: 744  QVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--LLSSTQKRI----KGMVLLRN 905
            QVQKEWSCALC ISTTSENCL AH+QGKKHKT   E   +L  T  +     KG+VLLRN
Sbjct: 170  QVQKEWSCALCQISTTSENCLLAHLQGKKHKTKESEVRNMLHLTDNKYLPSSKGIVLLRN 229

Query: 906  LNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVS 1085
            LNQIA ILNPVSRSIR CEW KP FGW KLNTDGS+  E AGFGGLLRDH GEPICA+VS
Sbjct: 230  LNQIAKILNPVSRSIRLCEWIKPNFGWMKLNTDGSLNNEIAGFGGLLRDHMGEPICAYVS 289

Query: 1086 KAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ-TCPKAESYLKQI 1262
            KAPQGDVFLVELWAIWRGLVLSL LGI  +WVESDSMSVV TIN++Q +CPKA   L+QI
Sbjct: 290  KAPQGDVFLVELWAIWRGLVLSLSLGITALWVESDSMSVVKTINKEQPSCPKAYGCLEQI 349

Query: 1263 WKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKY 1442
            WKLL KFDKY ISHSWRETNRAADHLAKMV+LGNDV+LWP+DFP SL NII DDAKGKKY
Sbjct: 350  WKLLSKFDKYHISHSWRETNRAADHLAKMVVLGNDVILWPIDFPCSLRNIIDDDAKGKKY 409

Query: 1443 LRR 1451
            +RR
Sbjct: 410  IRR 412


>KHN26153.1 Putative ribonuclease H protein [Glycine soja]
          Length = 435

 Score =  566 bits (1458), Expect = 0.0
 Identities = 288/435 (66%), Positives = 321/435 (73%), Gaps = 48/435 (11%)
 Frame = +3

Query: 291  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 470
            +ESDSRSSNQRCLAFWVLFSL +I+E ELSVLF+ LPWWPHVK++ATILLLIPY G APY
Sbjct: 1    MESDSRSSNQRCLAFWVLFSLCMIVEGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPY 60

Query: 471  IYKFLIKHYFTWNICARTLNIFNGESTHSKSDE--------DNKTV-------------- 584
            +YKFL +HY TW++  RT NI++ +STH +SDE        D KT               
Sbjct: 61   VYKFLTRHYCTWSLFTRTSNIYSEKSTHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILV 120

Query: 585  ------------------------LXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDL 692
                                    L             QTI TS ++E+KL  YQ  DD 
Sbjct: 121  SSFYRSIFMPLQQITVNFSEKSRHLESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDT 180

Query: 693  AGCDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSS 866
             GC KTESSYT LT K  +QKEWSCALC ISTT+EN L AH++G+KHK    E    LSS
Sbjct: 181  TGCGKTESSYTSLTSKNLIQKEWSCALCQISTTNENFLRAHLKGRKHKDKENELRVELSS 240

Query: 867  TQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLL 1046
            TQKRIKGMVLL NLNQIANIL+PVSRSIRWCEW KP+FGWTKLNTDGSI    A FGGLL
Sbjct: 241  TQKRIKGMVLLTNLNQIANILDPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLL 300

Query: 1047 RDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ 1226
            RD++GEPICAFVSKAPQGD+FL ELWA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ
Sbjct: 301  RDYRGEPICAFVSKAPQGDIFLAELWAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQ 360

Query: 1227 TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLC 1406
             CPKA  YLKQIWKLLKKFDKY+ISH+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC
Sbjct: 361  FCPKAVGYLKQIWKLLKKFDKYQISHTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLC 420

Query: 1407 NIIKDDAKGKKYLRR 1451
            +IIKDDAKG KYLRR
Sbjct: 421  SIIKDDAKGTKYLRR 435


>OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifolius]
          Length = 382

 Score =  530 bits (1366), Expect = 0.0
 Identities = 273/394 (69%), Positives = 308/394 (78%), Gaps = 7/394 (1%)
 Frame = +3

Query: 291  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 470
            ++SDSR SNQ+CLAFWVLFS S+IME E +VLF+ LPWWP+VK+MATILLLIPYFG + +
Sbjct: 1    MKSDSRFSNQQCLAFWVLFSFSMIMEREFAVLFNRLPWWPNVKSMATILLLIPYFGGSLH 60

Query: 471  IYKFLIKHYFTWNICARTLNIFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHL 650
            IYK+LIKHY TWNIC + LNI N  STH   +ED+K +              QTII S +
Sbjct: 61   IYKYLIKHYCTWNICGKDLNIVNQNSTHVVFEEDSKLI----------HVTEQTIIPSQI 110

Query: 651  QEEKLLVYQGRDDLAGCDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKK 830
            QE+KL V QGR ++A       +YTR T   QVQKEWSCALC ISTTSENCL AH+QGKK
Sbjct: 111  QEKKLTVNQGRYEVA------VNYTRPTFTMQVQKEWSCALCQISTTSENCLLAHLQGKK 164

Query: 831  HKTMVKEY--LLSSTQKRI----KGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTK 992
            HKT   E   +L  T  +     KG+VLLRNLNQIA ILNPVSRSIR CEW KP FGW K
Sbjct: 165  HKTKESEVRNMLHLTDNKYLPSSKGIVLLRNLNQIAKILNPVSRSIRLCEWIKPNFGWMK 224

Query: 993  LNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKE 1172
            LNTDGS+  E AGFGGLLRDH GEPICA+VSKAPQGDVFLVELWAIWRGLVLSL LGI  
Sbjct: 225  LNTDGSLNNEIAGFGGLLRDHMGEPICAYVSKAPQGDVFLVELWAIWRGLVLSLSLGITA 284

Query: 1173 IWVESDSMSVVNTINRKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKM 1349
            +WVESDSMSVV TIN++Q +CPKA   L+QIWKLL KFDKY ISHSWRETNRAADHLAKM
Sbjct: 285  LWVESDSMSVVKTINKEQPSCPKAYGCLEQIWKLLSKFDKYHISHSWRETNRAADHLAKM 344

Query: 1350 VLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1451
            V+LGNDV+LWP+DFP SL NII DDAKGKKY+RR
Sbjct: 345  VVLGNDVILWPIDFPCSLRNIIDDDAKGKKYIRR 378


>GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium subterraneum]
          Length = 352

 Score =  489 bits (1258), Expect = e-167
 Identities = 242/344 (70%), Positives = 269/344 (78%), Gaps = 21/344 (6%)
 Frame = +3

Query: 399  PWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTH-------- 554
            PWWPHVKA ATILLLIPYFG A YIYKFLIKHYF+WNIC  TLNIF+ + TH        
Sbjct: 8    PWWPHVKATATILLLIPYFGAATYIYKFLIKHYFSWNICGWTLNIFHQKITHFDLDNDSK 67

Query: 555  --SKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTR 728
              S SDE  +  L            GQTIIT+HLQEEKLLVYQG+DD+A CDKT + YT 
Sbjct: 68   ILSDSDEGRQVFLESDDDSKSVEVSGQTIITNHLQEEKLLVYQGKDDIADCDKTNTGYT- 126

Query: 729  LTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQK 875
               K++VQKEWSCALC ISTTSENCLG+H+QGK+HK   KE           Y+LS TQ+
Sbjct: 127  --SKKKVQKEWSCALCQISTTSENCLGSHLQGKQHKAKEKELRVGLHATNIPYVLSFTQE 184

Query: 876  RIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDH 1055
            R+KGMVLLRN NQIA IL+PVSR I WCEW+KPKFGWTKLNTDGS+ +  AGFGGLLRD+
Sbjct: 185  RMKGMVLLRNFNQIAKILSPVSRPIIWCEWKKPKFGWTKLNTDGSVNKVTAGFGGLLRDY 244

Query: 1056 KGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCP 1235
            +GEPICAFVSKAPQGD FLVELWAIWRGLVLS+GLGIK IWVESDSMSVV TIN+ Q CP
Sbjct: 245  RGEPICAFVSKAPQGDTFLVELWAIWRGLVLSIGLGIKSIWVESDSMSVVKTINKVQHCP 304

Query: 1236 KAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGND 1367
            KAE+ L QIWKLL K D+YRISHSWRETNRAADHLAKM L GN+
Sbjct: 305  KAETCLIQIWKLLSKVDEYRISHSWRETNRAADHLAKMALCGNE 348


>XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [Glycine max]
            KRH16868.1 hypothetical protein GLYMA_14G182900 [Glycine
            max]
          Length = 470

 Score =  450 bits (1158), Expect = e-150
 Identities = 228/308 (74%), Positives = 248/308 (80%), Gaps = 2/308 (0%)
 Frame = +3

Query: 534  FNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTE 713
            F+ +S + +SDED+K                QTI TS ++E+KL  YQ  DD+AGCDKTE
Sbjct: 173  FSEKSRNLESDEDSKLF----------NVSDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 714  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 887
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 888  MVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1067
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1068 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1247
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1248 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1427
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1428 KGKKYLRR 1451
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  217 bits (553), Expect = 4e-60
 Identities = 103/164 (62%), Positives = 127/164 (77%)
 Frame = +3

Query: 186 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 366 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 546 STHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 677
           STH +SDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>KHN39964.1 HVA22-like protein a [Glycine soja]
          Length = 470

 Score =  449 bits (1154), Expect = e-149
 Identities = 227/308 (73%), Positives = 248/308 (80%), Gaps = 2/308 (0%)
 Frame = +3

Query: 534  FNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTE 713
            F+ +S + +SD+D+K                QTI TS ++E+KL  YQ  DD+AGCDKTE
Sbjct: 173  FSEKSRNLESDKDSKLF----------NVTDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 714  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 887
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 888  MVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1067
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1068 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1247
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1248 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1427
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1428 KGKKYLRR 1451
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  217 bits (553), Expect = 4e-60
 Identities = 103/164 (62%), Positives = 127/164 (77%)
 Frame = +3

Query: 186 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 366 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 546 STHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 677
           STH +SDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [Prunus mume]
          Length = 456

 Score =  428 bits (1100), Expect = e-141
 Identities = 238/467 (50%), Positives = 296/467 (63%), Gaps = 46/467 (9%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAKLFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            E  L  LF WLP WP+ K   TILL++PYFG A Y+YK  I+ Y + N      NI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITILLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 546  STHS-KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES-S 719
              +   S EDN   +               I T   + E  ++++G    A   +TE   
Sbjct: 121  RINGFSSGEDNYPDVVDK----------NVIRTEPQKSEGAVIFKGTP--ASSSETEGRE 168

Query: 720  YTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSS 866
            YT  +  +++Q+EW+CALCLISTTS  CL  H++GKKHKT V+            Y LS 
Sbjct: 169  YTSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHKTQVEALRTYKQGPNSGYELSL 228

Query: 867  TQKRIKGMVL------------------------LRNLNQIAN--------ILNPVSRSI 950
              KR  GM+                         + NLNQIA         IL+PV+R I
Sbjct: 229  KLKRTNGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGILSPVARPI 288

Query: 951  RWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAI 1130
            R C W+KP+ GWTKLNTDGS+ RE AG+GGLLRD+KGEPICAFVSKA   D+FLVELWAI
Sbjct: 289  RTCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGEPICAFVSKALGDDIFLVELWAI 348

Query: 1131 WRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHS 1307
            WRGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LLKKFDK+++SHS
Sbjct: 349  WRGLVLALSLGIKVIWVESDSESVVQTINRDRPYGQKASSCLKHIWELLKKFDKHQVSHS 408

Query: 1308 WRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1448
            WRETNRAAD L+KMVLLG+DVV WPVDFP SLCNIIK+DA+G+ Y R
Sbjct: 409  WRETNRAADLLSKMVLLGSDVVFWPVDFPDSLCNIIKEDAEGRIYFR 455


>XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 isoform X1 [Gossypium
            raimondii] KJB64102.1 hypothetical protein
            B456_010G033200 [Gossypium raimondii]
          Length = 418

 Score =  426 bits (1096), Expect = e-141
 Identities = 228/441 (51%), Positives = 295/441 (66%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG+ EA   +L    T++ WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 536
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 537  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 716
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 717  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 857
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 217

Query: 858  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1031
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+     G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 277

Query: 1032 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1211
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1212 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAQGKTYLRR 418


>XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 isoform X1 [Gossypium
            arboreum] KHF99723.1 HVA22-like protein a [Gossypium
            arboreum]
          Length = 418

 Score =  425 bits (1093), Expect = e-141
 Identities = 225/443 (50%), Positives = 293/443 (66%), Gaps = 21/443 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 530
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 531  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 710
                E+  +  D D  T                  +T+  + EKL   QG  +++    T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 711  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 854
            E   T+    ++VQKEWSC LCLIST+SE CL  H+QGKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLQGKKHKT--KEYELRVGALPLKET 215

Query: 855  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREA 1025
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1026 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1205
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1206 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1382
             TINR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1383 VDFPHSLCNIIKDDAKGKKYLRR 1451
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [Gossypium hirsutum]
          Length = 418

 Score =  423 bits (1087), Expect = e-140
 Identities = 227/441 (51%), Positives = 293/441 (66%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFITMV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 536
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 537  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 716
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 717  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 857
              T+    ++VQKEWSC LCLIST+SE CL  H+ GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLWGKKHKT--KEYELRVGALPLKETCM 217

Query: 858  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1031
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    +G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNSG 277

Query: 1032 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1211
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWA+WRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAVWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1212 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAEGKTYLRR 418


>EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma cacao]
          Length = 420

 Score =  423 bits (1087), Expect = e-140
 Identities = 229/439 (52%), Positives = 285/439 (64%), Gaps = 17/439 (3%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  + 
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMG 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 530
            E  L    +W P+WP VK +ATILL+ PYFG A Y++K LI+ YF+   WNI    +  +
Sbjct: 61   ELTLGKFLNWFPFWPCVKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNILFFPKKKD 120

Query: 531  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 710
            I +        D D   +                       EE ++  +G  D +  D  
Sbjct: 121  IVSEAQNGILDDADTNRLKNGPKL-----------------EELIINGEGNFDRSS-DNK 162

Query: 711  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL----------- 857
            E + T LT  ++VQKEWSC LCLIS +SE CL  H+QGKKHKT   E             
Sbjct: 163  EVNSTWLTHPKRVQKEWSCVLCLISASSEKCLKKHLQGKKHKTKEDELRADALALRATCK 222

Query: 858  LSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFG 1037
            LSS  K+   +VLLRNLN I ++LNPV+ SI WC W+KP+ G  KLNTDGS+  E AGFG
Sbjct: 223  LSSVPKKAGRVVLLRNLN-IESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFG 281

Query: 1038 GLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTIN 1217
            GLLRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TIN
Sbjct: 282  GLLRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTIN 341

Query: 1218 RKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFP 1394
            R+Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP
Sbjct: 342  REQFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFP 401

Query: 1395 HSLCNIIKDDAKGKKYLRR 1451
             SL NII+DDA+GK Y RR
Sbjct: 402  DSLNNIIQDDARGKIYFRR 420


>XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 isoform X1 [Gossypium
            hirsutum]
          Length = 418

 Score =  422 bits (1085), Expect = e-139
 Identities = 223/443 (50%), Positives = 293/443 (66%), Gaps = 21/443 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 530
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 531  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 710
                E+  +  D D  T                  +T+  + EKL   QG  +++    T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 711  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 854
            E   T+    ++VQKEWSC LCLIST+SE CL  H++GKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLRGKKHKT--KEYELRVGALPLKET 215

Query: 855  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREA 1025
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1026 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1205
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1206 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1382
             TIN++Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINQEQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1383 VDFPHSLCNIIKDDAKGKKYLRR 1451
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 isoform X1 [Theobroma
            cacao]
          Length = 420

 Score =  421 bits (1082), Expect = e-139
 Identities = 226/439 (51%), Positives = 284/439 (64%), Gaps = 17/439 (3%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  ++
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMV 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 530
            E  L    +W P+WP  K +ATILL+ PYFG A Y++K LI+ YF+   WNI    +  +
Sbjct: 61   ELTLGKFLNWFPFWPCAKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNILFFPKKKD 120

Query: 531  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 710
            I +        D D   +                       EE ++  +G  D +  D  
Sbjct: 121  IVSEAQNGILDDADTNRLKNGPKL-----------------EELIINGEGNFDRSS-DNK 162

Query: 711  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL----------- 857
            E + T LT  ++VQKEWSC LCL+S +SE CL  H+QGKKHKT   E             
Sbjct: 163  EVNSTWLTHPKRVQKEWSCVLCLVSASSEKCLKKHLQGKKHKTKEDELRADALALRATCK 222

Query: 858  LSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFG 1037
            LSS  K+   +VLLRNLN   ++LNPV+ SI WC W+KP+ G  KLNTDGS+  E AGFG
Sbjct: 223  LSSVPKKAGRVVLLRNLN-FESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFG 281

Query: 1038 GLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTIN 1217
            GLLRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TIN
Sbjct: 282  GLLRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTIN 341

Query: 1218 RKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFP 1394
            R+Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP
Sbjct: 342  REQFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFP 401

Query: 1395 HSLCNIIKDDAKGKKYLRR 1451
             SL NII+DDA+GK Y RR
Sbjct: 402  DSLNNIIQDDARGKIYFRR 420


>ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]
          Length = 456

 Score =  421 bits (1081), Expect = e-138
 Identities = 231/466 (49%), Positives = 292/466 (62%), Gaps = 45/466 (9%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAELFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 545
            E  L  LF WLP WP+ K   T+LL++PYFG A Y+YK  I+ Y + N      NI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITVLLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 546  STHS-KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSY 722
              +   S EDN   +               I T   + E  ++++G    +  +K    Y
Sbjct: 121  RINGFNSGEDNYPDVVDK----------NVIRTEPQKSEGAVIFKGTP-ASSSEKEGREY 169

Query: 723  TRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSST 869
            T  +  +++Q+EW+CALCLISTTS  CL  H++GKKH+T V             Y  S  
Sbjct: 170  TSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHETQVAALRTYKQGPISGYKSSLK 229

Query: 870  QKRIKGMVL------------------------LRNLNQIAN--------ILNPVSRSIR 953
             KR  GM+                         + NLNQIA         IL+PV+R IR
Sbjct: 230  LKRTDGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGILSPVARPIR 289

Query: 954  WCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIW 1133
             C W+KP+ GWTKLNTDGS+ RE AG+GGLLRD+KG+PICAFVSKA   D+FLVELWAIW
Sbjct: 290  MCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGDPICAFVSKALGDDIFLVELWAIW 349

Query: 1134 RGLVLSLGLGIKEIWVESDSMSVVNTINR-KQTCPKAESYLKQIWKLLKKFDKYRISHSW 1310
            RGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LL KFDK+++SHSW
Sbjct: 350  RGLVLALSLGIKVIWVESDSESVVQTINRDRPYSQKASSCLKHIWELLNKFDKHQVSHSW 409

Query: 1311 RETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1448
            RETNRAADHL+KMVLLG+DVV WPVDFP SL NIIK+DA+G+ Y R
Sbjct: 410  RETNRAADHLSKMVLLGSDVVFWPVDFPDSLHNIIKEDAEGRIYFR 455


>XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 isoform X2 [Gossypium
            raimondii]
          Length = 409

 Score =  408 bits (1048), Expect = e-134
 Identities = 223/441 (50%), Positives = 288/441 (65%), Gaps = 19/441 (4%)
 Frame = +3

Query: 186  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 365
            MG+ EA   +L    T++ W         + SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICW---------FVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 51

Query: 366  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 536
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 52   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 105

Query: 537  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 716
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 106  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 154

Query: 717  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 857
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 155  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 208

Query: 858  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1031
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+     G
Sbjct: 209  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 268

Query: 1032 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1211
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 269  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 328

Query: 1212 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1388
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 329  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 388

Query: 1389 FPHSLCNIIKDDAKGKKYLRR 1451
            FP +L +IIKDDA+GK YLRR
Sbjct: 389  FPDTLSSIIKDDAQGKTYLRR 409

