BLASTX nr result

ID: Glycyrrhiza28_contig00016196 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00016196
         (2010 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus...   630   0.0  
XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [...   627   0.0  
XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [...   625   0.0  
XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [...   621   0.0  
XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [...   618   0.0  
XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [...   579   0.0  
KHN26153.1 Putative ribonuclease H protein [Glycine soja]             563   0.0  
OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifo...   530   0.0  
GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium ...   487   e-165
XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [...   447   e-148
KHN39964.1 HVA22-like protein a [Glycine soja]                        446   e-147
XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 i...   431   e-143
XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 i...   430   e-142
XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [...   428   e-141
XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [...   429   e-141
XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 i...   427   e-141
EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma c...   424   e-140
XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 is...   422   e-139
ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]       422   e-139
XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 i...   413   e-135

>XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus vulgaris]
            ESW13277.1 hypothetical protein PHAVU_008G183100g
            [Phaseolus vulgaris]
          Length = 439

 Score =  630 bits (1624), Expect = 0.0
 Identities = 311/441 (70%), Positives = 349/441 (79%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MGWEEA YT+LT S T+L+WPPFS LCPL+ S+RA+ESDSRSSNQRCLAFWVLFSLS+IM
Sbjct: 1    MGWEEAFYTLLTNSFTLLSWPPFSFLCPLFVSVRAMESDSRSSNQRCLAFWVLFSLSMIM 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
            E ELSVL +  PWWPH+K++ATILLLIPY G AP +YKFLI+ Y  W +  +T NIF+ +
Sbjct: 61   ERELSVLLNCPPWWPHLKSIATILLLIPYVGAAPCVYKFLIRPYCPWRLFTKTSNIFSEK 120

Query: 526  STHSESDEDNKTVLXXXXXXXXXXXXG--------QTIITSHLQEEKLLVYQGRDDLAGY 681
             TH ESDED K               G        QTI  S +QE+KL  YQ  DD AG 
Sbjct: 121  GTHVESDEDGKLFDVSDQTITTSQIQGKELVDFSDQTITPSQIQEKKLEAYQ--DDSAGC 178

Query: 682  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 831
            D T SSYTRLT K+ VQKEWSCALC +STTSENCL  H++GKKHK   KE          
Sbjct: 179  DMTGSSYTRLTSKKLVQKEWSCALCQVSTTSENCLREHLKGKKHKDKEKELRVECHETNS 238

Query: 832  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAA 1008
             YLLSSTQKRIKGMVL+RNLN+IANILNPVSRS+RWCEW KP+FGWTKLNTDGSI R+ A
Sbjct: 239  TYLLSSTQKRIKGMVLIRNLNKIANILNPVSRSVRWCEWTKPEFGWTKLNTDGSINRDVA 298

Query: 1009 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1188
             FGGLLRD++GEP+C FVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYRGEPMCGFVSKVPQGDVFLVELWAIWRGLVLCGGLGIKAIWVESDSMSVVK 358

Query: 1189 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            T+NRKQ CPKA  YLKQIWKLLKKFDKY+ISHSWRETNRAADHL+KMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKAYGYLKQIWKLLKKFDKYQISHSWRETNRAADHLSKMVVWGNDVVLWPVD 418

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +LC+IIKDDA+G KYLRR
Sbjct: 419  FPPTLCSIIKDDARGMKYLRR 439


>XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [Vigna angularis]
            KOM47518.1 hypothetical protein LR48_Vigan07g122200
            [Vigna angularis] BAT81689.1 hypothetical protein
            VIGAN_03147900 [Vigna angularis var. angularis]
          Length = 439

 Score =  627 bits (1616), Expect = 0.0
 Identities = 310/441 (70%), Positives = 350/441 (79%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MGWEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSS QRCLAFWVLFSLS I+
Sbjct: 1    MGWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSKQRCLAFWVLFSLSTIV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
            EWELSVLF+ LPWWPH+K++AT+LLL+PY G AP +Y+FLI+ Y +W +  +  NI + +
Sbjct: 61   EWELSVLFNRLPWWPHLKSIATVLLLMPYVGAAPCVYRFLIRPYCSWTLFTKISNIVSEK 120

Query: 526  STHSESDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGY 681
             T SESDE        D                  QTI  S +QE+KL   Q  DD AG 
Sbjct: 121  GTDSESDEGAKLFDVSDQTITSSQIQEKELVDFSYQTITPSQIQEKKLEACQ--DDSAGC 178

Query: 682  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 831
            D+TESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK    E          
Sbjct: 179  DRTESSYARITRKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKETELRVEFHETNS 238

Query: 832  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAA 1008
             YLLSSTQKRIKGMVL+RNLNQIANILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KYLLSSTQKRIKGMVLIRNLNQIANILNPVSRSIRWCEWTKPKFGWTKLNTDGSINRDSA 298

Query: 1009 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1188
             FGGLLRD+ GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1189 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            T+NRKQ CPKA+SYLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADSYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +LC+II+DDA+GKKYLRR
Sbjct: 419  FPPTLCSIIEDDARGKKYLRR 439


>XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [Cicer arietinum]
          Length = 409

 Score =  625 bits (1612), Expect = 0.0
 Identities = 311/429 (72%), Positives = 340/429 (79%), Gaps = 11/429 (2%)
 Frame = +1

Query: 178  EALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWEL 357
            E  +TILTKSLT+L WPP SLLCPLY SIRALESD RSSNQRCLAFWVLF LS+IME E 
Sbjct: 4    EIFHTILTKSLTVLVWPPISLLCPLYVSIRALESDCRSSNQRCLAFWVLFYLSMIMECEF 63

Query: 358  SVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHS 537
            +VLF+W PWWPH KAMAT LLLIP FG A YIYKFLIKHY TWNICA TLNIF  + T  
Sbjct: 64   AVLFTWPPWWPHAKAMATFLLLIPNFGAALYIYKFLIKHYCTWNICAWTLNIFYQKITRF 123

Query: 538  ESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTESSYTRLTG 717
            ESDED++ +                      QEEK LVYQGRDD A  DKT+SSY     
Sbjct: 124  ESDEDSEKLSE--------------------QEEKNLVYQGRDDHADCDKTKSSYA---S 160

Query: 718  KEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQKRIK 864
            K+QVQKEWSCALC ISTTSENCL  H+QGK+HK   KE           Y+LS TQ+RIK
Sbjct: 161  KKQVQKEWSCALCQISTTSENCLVEHLQGKQHKAKKKELRVGLRLINSPYMLSFTQERIK 220

Query: 865  GMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGE 1044
            GM LL+NLNQIANIL+PVS S  WCEWKKP+FGWTKLNTDGS+ +E A FGGLLRDH+GE
Sbjct: 221  GMTLLKNLNQIANILSPVSTSTIWCEWKKPEFGWTKLNTDGSVNKETAAFGGLLRDHRGE 280

Query: 1045 PICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAE 1224
            PIC FVSKAPQGD+FLVELWAIWRGLVLS GLGIK IWVESDSMSVV TIN+ Q CPKAE
Sbjct: 281  PICGFVSKAPQGDIFLVELWAIWRGLVLSFGLGIKSIWVESDSMSVVKTINKMQPCPKAE 340

Query: 1225 SYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDD 1404
            S L++IWKLL KF+KYRISHSWRETNRAADHLAKM LLGNDVVLWP+DFPHSLCNII++D
Sbjct: 341  SCLEKIWKLLSKFEKYRISHSWRETNRAADHLAKMALLGNDVVLWPIDFPHSLCNIIQED 400

Query: 1405 AKGKKYLRR 1431
            AKG KYLRR
Sbjct: 401  AKGTKYLRR 409


>XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [Vigna radiata var.
            radiata]
          Length = 439

 Score =  621 bits (1602), Expect = 0.0
 Identities = 309/441 (70%), Positives = 347/441 (78%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            M WEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSSNQRCLAFWVLFSLS I+
Sbjct: 1    MSWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSNQRCLAFWVLFSLSTIV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
            EWELS+LF+ LPWWPH+K++AT+LLL+PY G A  IYKFLI+ Y +W +  +  NIF+ +
Sbjct: 61   EWELSLLFNCLPWWPHLKSIATVLLLMPYVGAAQCIYKFLIRPYSSWTLFTKISNIFSEK 120

Query: 526  STHSESDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGY 681
             T  ESDE        D                  QTI  S +QE+KL   Q  D  AG 
Sbjct: 121  GTDFESDEGAKLFDVSDQTITSSQIQEKERVDFSDQTITPSQIQEKKLEACQ--DHSAGC 178

Query: 682  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--------- 834
            DKTESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK   K           
Sbjct: 179  DKTESSYARITTKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKEKNLRVEFHETNS 238

Query: 835  --LLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAA 1008
              LLSSTQKRIKGMVL+RNLNQIA+ILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KSLLSSTQKRIKGMVLIRNLNQIASILNPVSRSIRWCEWTKPKFGWTKLNTDGSIYRDSA 298

Query: 1009 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1188
             FGGLLRDH GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDHTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1189 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            T+NRKQ CPKA++YLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADNYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +LC+II+DDAKGKKYLRR
Sbjct: 419  FPPTLCSIIEDDAKGKKYLRR 439


>XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [Glycine max]
            KRH72478.1 hypothetical protein GLYMA_02G215900 [Glycine
            max]
          Length = 470

 Score =  618 bits (1593), Expect = 0.0
 Identities = 312/470 (66%), Positives = 349/470 (74%), Gaps = 48/470 (10%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MGW EA+YT+LTKS T+L+WPP S LCPL  S+RA+ESDSRSSNQRCLAFWVLFSL +I+
Sbjct: 1    MGWIEAIYTLLTKSFTVLSWPPVSFLCPLLVSVRAMESDSRSSNQRCLAFWVLFSLCMIV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
            E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFL +HY TW++  RT NI++ +
Sbjct: 61   EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLTRHYCTWSLFTRTSNIYSEK 120

Query: 526  STHSESDE--------DNKTV--------------------------------------L 567
            STH ESDE        D KT                                       L
Sbjct: 121  STHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILVSSFYRSIFMPLQQITVNFSEKSRHL 180

Query: 568  XXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTESSYTRLTGKEQVQKEWSC 747
                         QTI TS ++E+KL  YQ  DD  G  KTESSYT LT K  +QKEWSC
Sbjct: 181  ESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDTTGCGKTESSYTSLTSKNLIQKEWSC 240

Query: 748  ALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSSTQKRIKGMVLLRNLNQIANILNPVS 921
            ALC ISTT+EN L AH++G+KHK    E    LSSTQKRIKGMVLL NLNQIANIL+PVS
Sbjct: 241  ALCQISTTNENFLRAHLKGRKHKDKENELRVELSSTQKRIKGMVLLTNLNQIANILDPVS 300

Query: 922  RSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVEL 1101
            RSIRWCEW KP+FGWTKLNTDGSI    A FGGLLRD++GEPICAFVSKAPQGD+FL EL
Sbjct: 301  RSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLLRDYRGEPICAFVSKAPQGDIFLAEL 360

Query: 1102 WAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAESYLKQIWKLLKKFDKYRIS 1281
            WA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  YLKQIWKLLKKFDKY+IS
Sbjct: 361  WAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQFCPKAVGYLKQIWKLLKKFDKYQIS 420

Query: 1282 HSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1431
            H+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC+IIKDDAKG KYLRR
Sbjct: 421  HTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLCSIIKDDAKGTKYLRR 470


>XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [Lupinus
            angustifolius]
          Length = 416

 Score =  579 bits (1492), Expect = 0.0
 Identities = 297/423 (70%), Positives = 333/423 (78%), Gaps = 7/423 (1%)
 Frame = +1

Query: 184  LYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWELSV 363
            L TILTKSLT L WPP SL+CPLYASIRA++SDSR SNQ+CLAFWVLFS S+IME E +V
Sbjct: 6    LLTILTKSLTFLTWPPLSLICPLYASIRAMKSDSRFSNQQCLAFWVLFSFSMIMEREFAV 65

Query: 364  LFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHSES 543
            LF+ LPWWP+VK+MATILLLIPYFG + +IYK+LIKHY TWNIC + LNI N  STH   
Sbjct: 66   LFNRLPWWPNVKSMATILLLIPYFGGSLHIYKYLIKHYCTWNICGKDLNIVNQNSTHVVF 125

Query: 544  DEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTESSYTRLTGKE 723
            +ED+K +              QTII S +QE+KL V QGR ++A       +YTR T   
Sbjct: 126  EEDSKLI----------HVTEQTIIPSQIQEKKLTVNQGRYEVA------VNYTRPTFTM 169

Query: 724  QVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--LLSSTQKRI----KGMVLLRN 885
            QVQKEWSCALC ISTTSENCL AH+QGKKHKT   E   +L  T  +     KG+VLLRN
Sbjct: 170  QVQKEWSCALCQISTTSENCLLAHLQGKKHKTKESEVRNMLHLTDNKYLPSSKGIVLLRN 229

Query: 886  LNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVS 1065
            LNQIA ILNPVSRSIR CEW KP FGW KLNTDGS+  E AGFGGLLRDH GEPICA+VS
Sbjct: 230  LNQIAKILNPVSRSIRLCEWIKPNFGWMKLNTDGSLNNEIAGFGGLLRDHMGEPICAYVS 289

Query: 1066 KAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ-TCPKAESYLKQI 1242
            KAPQGDVFLVELWAIWRGLVLSL LGI  +WVESDSMSVV TIN++Q +CPKA   L+QI
Sbjct: 290  KAPQGDVFLVELWAIWRGLVLSLSLGITALWVESDSMSVVKTINKEQPSCPKAYGCLEQI 349

Query: 1243 WKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKY 1422
            WKLL KFDKY ISHSWRETNRAADHLAKMV+LGNDV+LWP+DFP SL NII DDAKGKKY
Sbjct: 350  WKLLSKFDKYHISHSWRETNRAADHLAKMVVLGNDVILWPIDFPCSLRNIIDDDAKGKKY 409

Query: 1423 LRR 1431
            +RR
Sbjct: 410  IRR 412


>KHN26153.1 Putative ribonuclease H protein [Glycine soja]
          Length = 435

 Score =  563 bits (1451), Expect = 0.0
 Identities = 288/435 (66%), Positives = 320/435 (73%), Gaps = 48/435 (11%)
 Frame = +1

Query: 271  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 450
            +ESDSRSSNQRCLAFWVLFSL +I+E ELSVLF+ LPWWPHVK++ATILLLIPY G APY
Sbjct: 1    MESDSRSSNQRCLAFWVLFSLCMIVEGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPY 60

Query: 451  IYKFLIKHYFTWNICARTLNIFNGESTHSESDE--------DNKTV-------------- 564
            +YKFL +HY TW++  RT NI++ +STH ESDE        D KT               
Sbjct: 61   VYKFLTRHYCTWSLFTRTSNIYSEKSTHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILV 120

Query: 565  ------------------------LXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDL 672
                                    L             QTI TS ++E+KL  YQ  DD 
Sbjct: 121  SSFYRSIFMPLQQITVNFSEKSRHLESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDT 180

Query: 673  AGYDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSS 846
             G  KTESSYT LT K  +QKEWSCALC ISTT+EN L AH++G+KHK    E    LSS
Sbjct: 181  TGCGKTESSYTSLTSKNLIQKEWSCALCQISTTNENFLRAHLKGRKHKDKENELRVELSS 240

Query: 847  TQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLL 1026
            TQKRIKGMVLL NLNQIANIL+PVSRSIRWCEW KP+FGWTKLNTDGSI    A FGGLL
Sbjct: 241  TQKRIKGMVLLTNLNQIANILDPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLL 300

Query: 1027 RDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ 1206
            RD++GEPICAFVSKAPQGD+FL ELWA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ
Sbjct: 301  RDYRGEPICAFVSKAPQGDIFLAELWAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQ 360

Query: 1207 TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLC 1386
             CPKA  YLKQIWKLLKKFDKY+ISH+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC
Sbjct: 361  FCPKAVGYLKQIWKLLKKFDKYQISHTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLC 420

Query: 1387 NIIKDDAKGKKYLRR 1431
            +IIKDDAKG KYLRR
Sbjct: 421  SIIKDDAKGTKYLRR 435


>OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifolius]
          Length = 382

 Score =  530 bits (1366), Expect = 0.0
 Identities = 273/394 (69%), Positives = 308/394 (78%), Gaps = 7/394 (1%)
 Frame = +1

Query: 271  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 450
            ++SDSR SNQ+CLAFWVLFS S+IME E +VLF+ LPWWP+VK+MATILLLIPYFG + +
Sbjct: 1    MKSDSRFSNQQCLAFWVLFSFSMIMEREFAVLFNRLPWWPNVKSMATILLLIPYFGGSLH 60

Query: 451  IYKFLIKHYFTWNICARTLNIFNGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHL 630
            IYK+LIKHY TWNIC + LNI N  STH   +ED+K +              QTII S +
Sbjct: 61   IYKYLIKHYCTWNICGKDLNIVNQNSTHVVFEEDSKLI----------HVTEQTIIPSQI 110

Query: 631  QEEKLLVYQGRDDLAGYDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKK 810
            QE+KL V QGR ++A       +YTR T   QVQKEWSCALC ISTTSENCL AH+QGKK
Sbjct: 111  QEKKLTVNQGRYEVA------VNYTRPTFTMQVQKEWSCALCQISTTSENCLLAHLQGKK 164

Query: 811  HKTMVKEY--LLSSTQKRI----KGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTK 972
            HKT   E   +L  T  +     KG+VLLRNLNQIA ILNPVSRSIR CEW KP FGW K
Sbjct: 165  HKTKESEVRNMLHLTDNKYLPSSKGIVLLRNLNQIAKILNPVSRSIRLCEWIKPNFGWMK 224

Query: 973  LNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKE 1152
            LNTDGS+  E AGFGGLLRDH GEPICA+VSKAPQGDVFLVELWAIWRGLVLSL LGI  
Sbjct: 225  LNTDGSLNNEIAGFGGLLRDHMGEPICAYVSKAPQGDVFLVELWAIWRGLVLSLSLGITA 284

Query: 1153 IWVESDSMSVVNTINRKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKM 1329
            +WVESDSMSVV TIN++Q +CPKA   L+QIWKLL KFDKY ISHSWRETNRAADHLAKM
Sbjct: 285  LWVESDSMSVVKTINKEQPSCPKAYGCLEQIWKLLSKFDKYHISHSWRETNRAADHLAKM 344

Query: 1330 VLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1431
            V+LGNDV+LWP+DFP SL NII DDAKGKKY+RR
Sbjct: 345  VVLGNDVILWPIDFPCSLRNIIDDDAKGKKYIRR 378


>GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium subterraneum]
          Length = 352

 Score =  487 bits (1254), Expect = e-165
 Identities = 242/344 (70%), Positives = 269/344 (78%), Gaps = 21/344 (6%)
 Frame = +1

Query: 379  PWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTH-------- 534
            PWWPHVKA ATILLLIPYFG A YIYKFLIKHYF+WNIC  TLNIF+ + TH        
Sbjct: 8    PWWPHVKATATILLLIPYFGAATYIYKFLIKHYFSWNICGWTLNIFHQKITHFDLDNDSK 67

Query: 535  --SESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTESSYTR 708
              S+SDE  +  L            GQTIIT+HLQEEKLLVYQG+DD+A  DKT + YT 
Sbjct: 68   ILSDSDEGRQVFLESDDDSKSVEVSGQTIITNHLQEEKLLVYQGKDDIADCDKTNTGYT- 126

Query: 709  LTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQK 855
               K++VQKEWSCALC ISTTSENCLG+H+QGK+HK   KE           Y+LS TQ+
Sbjct: 127  --SKKKVQKEWSCALCQISTTSENCLGSHLQGKQHKAKEKELRVGLHATNIPYVLSFTQE 184

Query: 856  RIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDH 1035
            R+KGMVLLRN NQIA IL+PVSR I WCEWKKPKFGWTKLNTDGS+ +  AGFGGLLRD+
Sbjct: 185  RMKGMVLLRNFNQIAKILSPVSRPIIWCEWKKPKFGWTKLNTDGSVNKVTAGFGGLLRDY 244

Query: 1036 KGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCP 1215
            +GEPICAFVSKAPQGD FLVELWAIWRGLVLS+GLGIK IWVESDSMSVV TIN+ Q CP
Sbjct: 245  RGEPICAFVSKAPQGDTFLVELWAIWRGLVLSIGLGIKSIWVESDSMSVVKTINKVQHCP 304

Query: 1216 KAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGND 1347
            KAE+ L QIWKLL K D+YRISHSWRETNRAADHLAKM L GN+
Sbjct: 305  KAETCLIQIWKLLSKVDEYRISHSWRETNRAADHLAKMALCGNE 348


>XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [Glycine max]
            KRH16868.1 hypothetical protein GLYMA_14G182900 [Glycine
            max]
          Length = 470

 Score =  447 bits (1151), Expect = e-148
 Identities = 228/308 (74%), Positives = 247/308 (80%), Gaps = 2/308 (0%)
 Frame = +1

Query: 514  FNGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTE 693
            F+ +S + ESDED+K                QTI TS ++E+KL  YQ  DD+AG DKTE
Sbjct: 173  FSEKSRNLESDEDSKLF----------NVSDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 694  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 867
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 868  MVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1047
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1048 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1227
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1228 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1407
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1408 KGKKYLRR 1431
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  219 bits (557), Expect = 2e-60
 Identities = 104/164 (63%), Positives = 127/164 (77%)
 Frame = +1

Query: 166 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 346 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 526 STHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 657
           STH ESDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>KHN39964.1 HVA22-like protein a [Glycine soja]
          Length = 470

 Score =  446 bits (1147), Expect = e-147
 Identities = 227/308 (73%), Positives = 247/308 (80%), Gaps = 2/308 (0%)
 Frame = +1

Query: 514  FNGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTE 693
            F+ +S + ESD+D+K                QTI TS ++E+KL  YQ  DD+AG DKTE
Sbjct: 173  FSEKSRNLESDKDSKLF----------NVTDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 694  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 867
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 868  MVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1047
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1048 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1227
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1228 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1407
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1408 KGKKYLRR 1431
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  219 bits (557), Expect = 2e-60
 Identities = 104/164 (63%), Positives = 127/164 (77%)
 Frame = +1

Query: 166 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 346 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 526 STHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 657
           STH ESDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 isoform X1 [Gossypium
            raimondii] KJB64102.1 hypothetical protein
            B456_010G033200 [Gossypium raimondii]
          Length = 418

 Score =  431 bits (1109), Expect = e-143
 Identities = 230/441 (52%), Positives = 296/441 (67%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG+ EA   +L    T++ WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 516
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 517  NGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES 696
              +       E N TV               +++T   + EKL   QG  +++ Y  TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 697  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 837
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 217

Query: 838  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAG 1011
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+WKKP+ G  KLNTDGS+     G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 277

Query: 1012 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1191
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1192 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAQGKTYLRR 418


>XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 isoform X1 [Gossypium
            arboreum] KHF99723.1 HVA22-like protein a [Gossypium
            arboreum]
          Length = 418

 Score =  430 bits (1106), Expect = e-142
 Identities = 227/443 (51%), Positives = 294/443 (66%), Gaps = 21/443 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 510
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 511  IFNGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKT 690
                E+  +  D D  T                  +T+  + EKL   QG  +++ Y  T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 691  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 834
            E   T+    ++VQKEWSC LCLIST+SE CL  H+QGKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLQGKKHKT--KEYELRVGALPLKET 215

Query: 835  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREA 1005
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+WKKP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1006 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1185
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1186 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1362
             TINR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1363 VDFPHSLCNIIKDDAKGKKYLRR 1431
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [Gossypium hirsutum]
          Length = 418

 Score =  428 bits (1100), Expect = e-141
 Identities = 229/441 (51%), Positives = 294/441 (66%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFITMV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 516
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 517  NGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES 696
              +       E N TV               +++T   + EKL   QG  +++ Y  TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 697  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 837
              T+    ++VQKEWSC LCLIST+SE CL  H+ GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLWGKKHKT--KEYELRVGALPLKETCM 217

Query: 838  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAG 1011
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+WKKP+ G  KLNTDGS+    +G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNSG 277

Query: 1012 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1191
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWA+WRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAVWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1192 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAEGKTYLRR 418


>XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [Prunus mume]
          Length = 456

 Score =  429 bits (1103), Expect = e-141
 Identities = 239/467 (51%), Positives = 296/467 (63%), Gaps = 46/467 (9%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAKLFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 525
            E  L  LF WLP WP+ K   TILL++PYFG A Y+YK  I+ Y + N      NI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITILLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 526  STHS-ESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES-S 699
              +   S EDN   +               I T   + E  ++++G    A   +TE   
Sbjct: 121  RINGFSSGEDNYPDVVDK----------NVIRTEPQKSEGAVIFKGTP--ASSSETEGRE 168

Query: 700  YTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSS 846
            YT  +  +++Q+EW+CALCLISTTS  CL  H++GKKHKT V+            Y LS 
Sbjct: 169  YTSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHKTQVEALRTYKQGPNSGYELSL 228

Query: 847  TQKRIKGMVL------------------------LRNLNQIAN--------ILNPVSRSI 930
              KR  GM+                         + NLNQIA         IL+PV+R I
Sbjct: 229  KLKRTNGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGILSPVARPI 288

Query: 931  RWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAI 1110
            R C WKKP+ GWTKLNTDGS+ RE AG+GGLLRD+KGEPICAFVSKA   D+FLVELWAI
Sbjct: 289  RTCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGEPICAFVSKALGDDIFLVELWAI 348

Query: 1111 WRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHS 1287
            WRGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LLKKFDK+++SHS
Sbjct: 349  WRGLVLALSLGIKVIWVESDSESVVQTINRDRPYGQKASSCLKHIWELLKKFDKHQVSHS 408

Query: 1288 WRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1428
            WRETNRAAD L+KMVLLG+DVV WPVDFP SLCNIIK+DA+G+ Y R
Sbjct: 409  WRETNRAADLLSKMVLLGSDVVFWPVDFPDSLCNIIKEDAEGRIYFR 455


>XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 isoform X1 [Gossypium
            hirsutum]
          Length = 418

 Score =  427 bits (1098), Expect = e-141
 Identities = 225/443 (50%), Positives = 294/443 (66%), Gaps = 21/443 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 510
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 511  IFNGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKT 690
                E+  +  D D  T                  +T+  + EKL   QG  +++ Y  T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 691  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 834
            E   T+    ++VQKEWSC LCLIST+SE CL  H++GKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLRGKKHKT--KEYELRVGALPLKET 215

Query: 835  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREA 1005
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+WKKP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1006 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1185
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1186 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1362
             TIN++Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINQEQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1363 VDFPHSLCNIIKDDAKGKKYLRR 1431
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma cacao]
          Length = 420

 Score =  424 bits (1091), Expect = e-140
 Identities = 230/437 (52%), Positives = 286/437 (65%), Gaps = 15/437 (3%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  + 
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMG 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 516
            E  L    +W P+WP VK +ATILL+ PYFG A Y++K LI+ YF+   WNI      +F
Sbjct: 61   ELTLGKFLNWFPFWPCVKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNI------LF 114

Query: 517  NGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES 696
              +     S+  N  +                +      EE ++  +G  D +  D  E 
Sbjct: 115  FPKKKDIVSEAQNGIL---------DDADTNRLKNGPKLEELIINGEGNFDRSS-DNKEV 164

Query: 697  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL-----------LS 843
            + T LT  ++VQKEWSC LCLIS +SE CL  H+QGKKHKT   E             LS
Sbjct: 165  NSTWLTHPKRVQKEWSCVLCLISASSEKCLKKHLQGKKHKTKEDELRADALALRATCKLS 224

Query: 844  STQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGL 1023
            S  K+   +VLLRNLN I ++LNPV+ SI WC WKKP+ G  KLNTDGS+  E AGFGGL
Sbjct: 225  SVPKKAGRVVLLRNLN-IESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFGGL 283

Query: 1024 LRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRK 1203
            LRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TINR+
Sbjct: 284  LRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTINRE 343

Query: 1204 Q-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHS 1380
            Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP S
Sbjct: 344  QFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFPDS 403

Query: 1381 LCNIIKDDAKGKKYLRR 1431
            L NII+DDA+GK Y RR
Sbjct: 404  LNNIIQDDARGKIYFRR 420


>XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 isoform X1 [Theobroma
            cacao]
          Length = 420

 Score =  422 bits (1086), Expect = e-139
 Identities = 227/437 (51%), Positives = 285/437 (65%), Gaps = 15/437 (3%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  ++
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMV 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 516
            E  L    +W P+WP  K +ATILL+ PYFG A Y++K LI+ YF+   WNI      +F
Sbjct: 61   ELTLGKFLNWFPFWPCAKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNI------LF 114

Query: 517  NGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES 696
              +     S+  N  +                +      EE ++  +G  D +  D  E 
Sbjct: 115  FPKKKDIVSEAQNGIL---------DDADTNRLKNGPKLEELIINGEGNFDRSS-DNKEV 164

Query: 697  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL-----------LS 843
            + T LT  ++VQKEWSC LCL+S +SE CL  H+QGKKHKT   E             LS
Sbjct: 165  NSTWLTHPKRVQKEWSCVLCLVSASSEKCLKKHLQGKKHKTKEDELRADALALRATCKLS 224

Query: 844  STQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGL 1023
            S  K+   +VLLRNLN   ++LNPV+ SI WC WKKP+ G  KLNTDGS+  E AGFGGL
Sbjct: 225  SVPKKAGRVVLLRNLN-FESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFGGL 283

Query: 1024 LRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRK 1203
            LRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TINR+
Sbjct: 284  LRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTINRE 343

Query: 1204 Q-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHS 1380
            Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP S
Sbjct: 344  QFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFPDS 403

Query: 1381 LCNIIKDDAKGKKYLRR 1431
            L NII+DDA+GK Y RR
Sbjct: 404  LNNIIQDDARGKIYFRR 420


>ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]
          Length = 456

 Score =  422 bits (1086), Expect = e-139
 Identities = 234/474 (49%), Positives = 296/474 (62%), Gaps = 53/474 (11%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAELFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT-------WNICA-R 501
            E  L  LF WLP WP+ K   T+LL++PYFG A Y+YK  I+ Y +       WNI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITVLLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 502  TLNIFN-GESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAG 678
             +N FN GE  + +  + N                   I T   + E  ++++G    + 
Sbjct: 121  RINGFNSGEDNYPDVVDKN------------------VIRTEPQKSEGAVIFKGTP-ASS 161

Query: 679  YDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE--------- 831
             +K    YT  +  +++Q+EW+CALCLISTTS  CL  H++GKKH+T V           
Sbjct: 162  SEKEGREYTSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHETQVAALRTYKQGPI 221

Query: 832  --YLLSSTQKRIKGMVL------------------------LRNLNQIAN--------IL 909
              Y  S   KR  GM+                         + NLNQIA         IL
Sbjct: 222  SGYKSSLKLKRTDGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGIL 281

Query: 910  NPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVF 1089
            +PV+R IR C WKKP+ GWTKLNTDGS+ RE AG+GGLLRD+KG+PICAFVSKA   D+F
Sbjct: 282  SPVARPIRMCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGDPICAFVSKALGDDIF 341

Query: 1090 LVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINR-KQTCPKAESYLKQIWKLLKKFD 1266
            LVELWAIWRGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LL KFD
Sbjct: 342  LVELWAIWRGLVLALSLGIKVIWVESDSESVVQTINRDRPYSQKASSCLKHIWELLNKFD 401

Query: 1267 KYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1428
            K+++SHSWRETNRAADHL+KMVLLG+DVV WPVDFP SL NIIK+DA+G+ Y R
Sbjct: 402  KHQVSHSWRETNRAADHLSKMVLLGSDVVFWPVDFPDSLHNIIKEDAEGRIYFR 455


>XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 isoform X2 [Gossypium
            raimondii]
          Length = 409

 Score =  413 bits (1061), Expect = e-135
 Identities = 225/441 (51%), Positives = 289/441 (65%), Gaps = 19/441 (4%)
 Frame = +1

Query: 166  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 345
            MG+ EA   +L    T++ W         + SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICW---------FVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 51

Query: 346  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 516
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 52   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 105

Query: 517  NGESTHSESDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGYDKTES 696
              +       E N TV               +++T   + EKL   QG  +++ Y  TE 
Sbjct: 106  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 154

Query: 697  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 837
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 155  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 208

Query: 838  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWKKPKFGWTKLNTDGSIQREAAG 1011
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+WKKP+ G  KLNTDGS+     G
Sbjct: 209  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 268

Query: 1012 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1191
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 269  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 328

Query: 1192 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1368
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 329  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 388

Query: 1369 FPHSLCNIIKDDAKGKKYLRR 1431
            FP +L +IIKDDA+GK YLRR
Sbjct: 389  FPDTLSSIIKDDAQGKTYLRR 409


Top