BLASTX nr result

ID: Glycyrrhiza34_contig00014939 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00014939
         (1762 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus...   632   0.0  
XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [...   629   0.0  
XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [...   626   0.0  
XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [...   624   0.0  
XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [...   620   0.0  
XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [...   579   0.0  
KHN26153.1 Putative ribonuclease H protein [Glycine soja]             566   0.0  
OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifo...   530   0.0  
GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium ...   489   e-167
XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [...   450   e-150
KHN39964.1 HVA22-like protein a [Glycine soja]                        449   e-150
XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [...   428   e-142
XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 i...   426   e-142
XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 i...   425   e-141
XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [...   423   e-140
EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma c...   423   e-140
XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 i...   422   e-140
XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 is...   421   e-140
ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]       421   e-139
XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 i...   408   e-135

>XP_007141283.1 hypothetical protein PHAVU_008G183100g [Phaseolus vulgaris]
            ESW13277.1 hypothetical protein PHAVU_008G183100g
            [Phaseolus vulgaris]
          Length = 439

 Score =  632 bits (1631), Expect = 0.0
 Identities = 311/441 (70%), Positives = 350/441 (79%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MGWEEA YT+LT S T+L+WPPFS LCPL+ S+RA+ESDSRSSNQRCLAFWVLFSLS+IM
Sbjct: 1    MGWEEAFYTLLTNSFTLLSWPPFSFLCPLFVSVRAMESDSRSSNQRCLAFWVLFSLSMIM 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            E ELSVL +  PWWPH+K++ATILLLIPY G AP +YKFLI+ Y  W +  +T NIF+ +
Sbjct: 61   ERELSVLLNCPPWWPHLKSIATILLLIPYVGAAPCVYKFLIRPYCPWRLFTKTSNIFSEK 120

Query: 535  STHSKSDEDNKTVLXXXXXXXXXXXXG--------QTIITSHLQEEKLLVYQGRDDLAGC 690
             TH +SDED K               G        QTI  S +QE+KL  YQ  DD AGC
Sbjct: 121  GTHVESDEDGKLFDVSDQTITTSQIQGKELVDFSDQTITPSQIQEKKLEAYQ--DDSAGC 178

Query: 691  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 840
            D T SSYTRLT K+ VQKEWSCALC +STTSENCL  H++GKKHK   KE          
Sbjct: 179  DMTGSSYTRLTSKKLVQKEWSCALCQVSTTSENCLREHLKGKKHKDKEKELRVECHETNS 238

Query: 841  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1017
             YLLSSTQKRIKGMVL+RNLN+IANILNPVSRS+RWCEW KP+FGWTKLNTDGSI R+ A
Sbjct: 239  TYLLSSTQKRIKGMVLIRNLNKIANILNPVSRSVRWCEWTKPEFGWTKLNTDGSINRDVA 298

Query: 1018 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1197
             FGGLLRD++GEP+C FVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYRGEPMCGFVSKVPQGDVFLVELWAIWRGLVLCGGLGIKAIWVESDSMSVVK 358

Query: 1198 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            T+NRKQ CPKA  YLKQIWKLLKKFDKY+ISHSWRETNRAADHL+KMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKAYGYLKQIWKLLKKFDKYQISHSWRETNRAADHLSKMVVWGNDVVLWPVD 418

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +LC+IIKDDA+G KYLRR
Sbjct: 419  FPPTLCSIIKDDARGMKYLRR 439


>XP_017429168.1 PREDICTED: uncharacterized protein LOC108337207 [Vigna angularis]
            KOM47518.1 hypothetical protein LR48_Vigan07g122200
            [Vigna angularis] BAT81689.1 hypothetical protein
            VIGAN_03147900 [Vigna angularis var. angularis]
          Length = 439

 Score =  629 bits (1623), Expect = 0.0
 Identities = 310/441 (70%), Positives = 351/441 (79%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MGWEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSS QRCLAFWVLFSLS I+
Sbjct: 1    MGWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSKQRCLAFWVLFSLSTIV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            EWELSVLF+ LPWWPH+K++AT+LLL+PY G AP +Y+FLI+ Y +W +  +  NI + +
Sbjct: 61   EWELSVLFNRLPWWPHLKSIATVLLLMPYVGAAPCVYRFLIRPYCSWTLFTKISNIVSEK 120

Query: 535  STHSKSDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGC 690
             T S+SDE        D                  QTI  S +QE+KL   Q  DD AGC
Sbjct: 121  GTDSESDEGAKLFDVSDQTITSSQIQEKELVDFSYQTITPSQIQEKKLEACQ--DDSAGC 178

Query: 691  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE---------- 840
            D+TESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK    E          
Sbjct: 179  DRTESSYARITRKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKETELRVEFHETNS 238

Query: 841  -YLLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1017
             YLLSSTQKRIKGMVL+RNLNQIANILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KYLLSSTQKRIKGMVLIRNLNQIANILNPVSRSIRWCEWTKPKFGWTKLNTDGSINRDSA 298

Query: 1018 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1197
             FGGLLRD+ GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDYTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1198 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            T+NRKQ CPKA+SYLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADSYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +LC+II+DDA+GKKYLRR
Sbjct: 419  FPPTLCSIIEDDARGKKYLRR 439


>XP_004490492.2 PREDICTED: uncharacterized protein LOC101492718 [Cicer arietinum]
          Length = 409

 Score =  626 bits (1615), Expect = 0.0
 Identities = 310/429 (72%), Positives = 341/429 (79%), Gaps = 11/429 (2%)
 Frame = +1

Query: 187  EALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWEL 366
            E  +TILTKSLT+L WPP SLLCPLY SIRALESD RSSNQRCLAFWVLF LS+IME E 
Sbjct: 4    EIFHTILTKSLTVLVWPPISLLCPLYVSIRALESDCRSSNQRCLAFWVLFYLSMIMECEF 63

Query: 367  SVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHS 546
            +VLF+W PWWPH KAMAT LLLIP FG A YIYKFLIKHY TWNICA TLNIF  + T  
Sbjct: 64   AVLFTWPPWWPHAKAMATFLLLIPNFGAALYIYKFLIKHYCTWNICAWTLNIFYQKITRF 123

Query: 547  KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTG 726
            +SDED++ +                      QEEK LVYQGRDD A CDKT+SSY     
Sbjct: 124  ESDEDSEKLSE--------------------QEEKNLVYQGRDDHADCDKTKSSYA---S 160

Query: 727  KEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQKRIK 873
            K+QVQKEWSCALC ISTTSENCL  H+QGK+HK   KE           Y+LS TQ+RIK
Sbjct: 161  KKQVQKEWSCALCQISTTSENCLVEHLQGKQHKAKKKELRVGLRLINSPYMLSFTQERIK 220

Query: 874  GMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGE 1053
            GM LL+NLNQIANIL+PVS S  WCEW+KP+FGWTKLNTDGS+ +E A FGGLLRDH+GE
Sbjct: 221  GMTLLKNLNQIANILSPVSTSTIWCEWKKPEFGWTKLNTDGSVNKETAAFGGLLRDHRGE 280

Query: 1054 PICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAE 1233
            PIC FVSKAPQGD+FLVELWAIWRGLVLS GLGIK IWVESDSMSVV TIN+ Q CPKAE
Sbjct: 281  PICGFVSKAPQGDIFLVELWAIWRGLVLSFGLGIKSIWVESDSMSVVKTINKMQPCPKAE 340

Query: 1234 SYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDD 1413
            S L++IWKLL KF+KYRISHSWRETNRAADHLAKM LLGNDVVLWP+DFPHSLCNII++D
Sbjct: 341  SCLEKIWKLLSKFEKYRISHSWRETNRAADHLAKMALLGNDVVLWPIDFPHSLCNIIQED 400

Query: 1414 AKGKKYLRR 1440
            AKG KYLRR
Sbjct: 401  AKGTKYLRR 409


>XP_014522429.1 PREDICTED: uncharacterized protein LOC106778935 [Vigna radiata var.
            radiata]
          Length = 439

 Score =  624 bits (1609), Expect = 0.0
 Identities = 309/441 (70%), Positives = 348/441 (78%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            M WEE  YT+LT S T+L+WPPFSLLCPL+ S+ A+ESDSRSSNQRCLAFWVLFSLS I+
Sbjct: 1    MSWEEPFYTLLTNSFTLLSWPPFSLLCPLFVSVCAMESDSRSSNQRCLAFWVLFSLSTIV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            EWELS+LF+ LPWWPH+K++AT+LLL+PY G A  IYKFLI+ Y +W +  +  NIF+ +
Sbjct: 61   EWELSLLFNCLPWWPHLKSIATVLLLMPYVGAAQCIYKFLIRPYSSWTLFTKISNIFSEK 120

Query: 535  STHSKSDE--------DNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGC 690
             T  +SDE        D                  QTI  S +QE+KL   Q  D  AGC
Sbjct: 121  GTDFESDEGAKLFDVSDQTITSSQIQEKERVDFSDQTITPSQIQEKKLEACQ--DHSAGC 178

Query: 691  DKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--------- 843
            DKTESSY R+T K+ VQKEWSCALC ISTTSENCL AH++GKKHK   K           
Sbjct: 179  DKTESSYARITTKKLVQKEWSCALCQISTTSENCLRAHLKGKKHKDKEKNLRVEFHETNS 238

Query: 844  --LLSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAA 1017
              LLSSTQKRIKGMVL+RNLNQIA+ILNPVSRSIRWCEW KPKFGWTKLNTDGSI R++A
Sbjct: 239  KSLLSSTQKRIKGMVLIRNLNQIASILNPVSRSIRWCEWTKPKFGWTKLNTDGSIYRDSA 298

Query: 1018 GFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVN 1197
             FGGLLRDH GEPICAFVSK PQGDVFLVELWAIWRGLVL  GLGIK IWVESDSMSVV 
Sbjct: 299  SFGGLLRDHTGEPICAFVSKVPQGDVFLVELWAIWRGLVLCWGLGIKAIWVESDSMSVVK 358

Query: 1198 TINRKQTCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            T+NRKQ CPKA++YLKQIWKLLKKFDKY+ISHSWRETNRAADHLAKMV+ GNDVVLWPVD
Sbjct: 359  TVNRKQHCPKADNYLKQIWKLLKKFDKYQISHSWRETNRAADHLAKMVVWGNDVVLWPVD 418

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +LC+II+DDAKGKKYLRR
Sbjct: 419  FPPTLCSIIEDDAKGKKYLRR 439


>XP_006575357.1 PREDICTED: uncharacterized protein LOC102661917 [Glycine max]
            KRH72478.1 hypothetical protein GLYMA_02G215900 [Glycine
            max]
          Length = 470

 Score =  620 bits (1600), Expect = 0.0
 Identities = 312/470 (66%), Positives = 350/470 (74%), Gaps = 48/470 (10%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MGW EA+YT+LTKS T+L+WPP S LCPL  S+RA+ESDSRSSNQRCLAFWVLFSL +I+
Sbjct: 1    MGWIEAIYTLLTKSFTVLSWPPVSFLCPLLVSVRAMESDSRSSNQRCLAFWVLFSLCMIV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFL +HY TW++  RT NI++ +
Sbjct: 61   EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLTRHYCTWSLFTRTSNIYSEK 120

Query: 535  STHSKSDE--------DNKTV--------------------------------------L 576
            STH +SDE        D KT                                       L
Sbjct: 121  STHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILVSSFYRSIFMPLQQITVNFSEKSRHL 180

Query: 577  XXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTGKEQVQKEWSC 756
                         QTI TS ++E+KL  YQ  DD  GC KTESSYT LT K  +QKEWSC
Sbjct: 181  ESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDTTGCGKTESSYTSLTSKNLIQKEWSC 240

Query: 757  ALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSSTQKRIKGMVLLRNLNQIANILNPVS 930
            ALC ISTT+EN L AH++G+KHK    E    LSSTQKRIKGMVLL NLNQIANIL+PVS
Sbjct: 241  ALCQISTTNENFLRAHLKGRKHKDKENELRVELSSTQKRIKGMVLLTNLNQIANILDPVS 300

Query: 931  RSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVEL 1110
            RSIRWCEW KP+FGWTKLNTDGSI    A FGGLLRD++GEPICAFVSKAPQGD+FL EL
Sbjct: 301  RSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLLRDYRGEPICAFVSKAPQGDIFLAEL 360

Query: 1111 WAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAESYLKQIWKLLKKFDKYRIS 1290
            WA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  YLKQIWKLLKKFDKY+IS
Sbjct: 361  WAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQFCPKAVGYLKQIWKLLKKFDKYQIS 420

Query: 1291 HSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1440
            H+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC+IIKDDAKG KYLRR
Sbjct: 421  HTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLCSIIKDDAKGTKYLRR 470


>XP_019459325.1 PREDICTED: uncharacterized protein LOC109359207 [Lupinus
            angustifolius]
          Length = 416

 Score =  579 bits (1492), Expect = 0.0
 Identities = 297/423 (70%), Positives = 333/423 (78%), Gaps = 7/423 (1%)
 Frame = +1

Query: 193  LYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIMEWELSV 372
            L TILTKSLT L WPP SL+CPLYASIRA++SDSR SNQ+CLAFWVLFS S+IME E +V
Sbjct: 6    LLTILTKSLTFLTWPPLSLICPLYASIRAMKSDSRFSNQQCLAFWVLFSFSMIMEREFAV 65

Query: 373  LFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTHSKS 552
            LF+ LPWWP+VK+MATILLLIPYFG + +IYK+LIKHY TWNIC + LNI N  STH   
Sbjct: 66   LFNRLPWWPNVKSMATILLLIPYFGGSLHIYKYLIKHYCTWNICGKDLNIVNQNSTHVVF 125

Query: 553  DEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTRLTGKE 732
            +ED+K +              QTII S +QE+KL V QGR ++A       +YTR T   
Sbjct: 126  EEDSKLI----------HVTEQTIIPSQIQEKKLTVNQGRYEVA------VNYTRPTFTM 169

Query: 733  QVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY--LLSSTQKRI----KGMVLLRN 894
            QVQKEWSCALC ISTTSENCL AH+QGKKHKT   E   +L  T  +     KG+VLLRN
Sbjct: 170  QVQKEWSCALCQISTTSENCLLAHLQGKKHKTKESEVRNMLHLTDNKYLPSSKGIVLLRN 229

Query: 895  LNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVS 1074
            LNQIA ILNPVSRSIR CEW KP FGW KLNTDGS+  E AGFGGLLRDH GEPICA+VS
Sbjct: 230  LNQIAKILNPVSRSIRLCEWIKPNFGWMKLNTDGSLNNEIAGFGGLLRDHMGEPICAYVS 289

Query: 1075 KAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ-TCPKAESYLKQI 1251
            KAPQGDVFLVELWAIWRGLVLSL LGI  +WVESDSMSVV TIN++Q +CPKA   L+QI
Sbjct: 290  KAPQGDVFLVELWAIWRGLVLSLSLGITALWVESDSMSVVKTINKEQPSCPKAYGCLEQI 349

Query: 1252 WKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKY 1431
            WKLL KFDKY ISHSWRETNRAADHLAKMV+LGNDV+LWP+DFP SL NII DDAKGKKY
Sbjct: 350  WKLLSKFDKYHISHSWRETNRAADHLAKMVVLGNDVILWPIDFPCSLRNIIDDDAKGKKY 409

Query: 1432 LRR 1440
            +RR
Sbjct: 410  IRR 412


>KHN26153.1 Putative ribonuclease H protein [Glycine soja]
          Length = 435

 Score =  566 bits (1458), Expect = 0.0
 Identities = 288/435 (66%), Positives = 321/435 (73%), Gaps = 48/435 (11%)
 Frame = +1

Query: 280  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 459
            +ESDSRSSNQRCLAFWVLFSL +I+E ELSVLF+ LPWWPHVK++ATILLLIPY G APY
Sbjct: 1    MESDSRSSNQRCLAFWVLFSLCMIVEGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPY 60

Query: 460  IYKFLIKHYFTWNICARTLNIFNGESTHSKSDE--------DNKTV-------------- 573
            +YKFL +HY TW++  RT NI++ +STH +SDE        D KT               
Sbjct: 61   VYKFLTRHYCTWSLFTRTSNIYSEKSTHLESDEDSKLVDVSDQKTSQIQEKKLEAYQILV 120

Query: 574  ------------------------LXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDL 681
                                    L             QTI TS ++E+KL  YQ  DD 
Sbjct: 121  SSFYRSIFMPLQQITVNFSEKSRHLESDEDSKLFDVSDQTITTSQIEEKKLEAYQETDDT 180

Query: 682  AGCDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL--LSS 855
             GC KTESSYT LT K  +QKEWSCALC ISTT+EN L AH++G+KHK    E    LSS
Sbjct: 181  TGCGKTESSYTSLTSKNLIQKEWSCALCQISTTNENFLRAHLKGRKHKDKENELRVELSS 240

Query: 856  TQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLL 1035
            TQKRIKGMVLL NLNQIANIL+PVSRSIRWCEW KP+FGWTKLNTDGSI    A FGGLL
Sbjct: 241  TQKRIKGMVLLTNLNQIANILDPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTASFGGLL 300

Query: 1036 RDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQ 1215
            RD++GEPICAFVSKAPQGD+FL ELWA+WRGLVLSLGLGIK IWVESDSMSVV T+NRKQ
Sbjct: 301  RDYRGEPICAFVSKAPQGDIFLAELWAMWRGLVLSLGLGIKAIWVESDSMSVVKTVNRKQ 360

Query: 1216 TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLC 1395
             CPKA  YLKQIWKLLKKFDKY+ISH+WR+TNRAADHLAKM LL NDVVLWPVDFP SLC
Sbjct: 361  FCPKAVGYLKQIWKLLKKFDKYQISHTWRQTNRAADHLAKMDLLANDVVLWPVDFPPSLC 420

Query: 1396 NIIKDDAKGKKYLRR 1440
            +IIKDDAKG KYLRR
Sbjct: 421  SIIKDDAKGTKYLRR 435


>OIW02536.1 hypothetical protein TanjilG_12850 [Lupinus angustifolius]
          Length = 382

 Score =  530 bits (1366), Expect = 0.0
 Identities = 273/394 (69%), Positives = 308/394 (78%), Gaps = 7/394 (1%)
 Frame = +1

Query: 280  LESDSRSSNQRCLAFWVLFSLSLIMEWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPY 459
            ++SDSR SNQ+CLAFWVLFS S+IME E +VLF+ LPWWP+VK+MATILLLIPYFG + +
Sbjct: 1    MKSDSRFSNQQCLAFWVLFSFSMIMEREFAVLFNRLPWWPNVKSMATILLLIPYFGGSLH 60

Query: 460  IYKFLIKHYFTWNICARTLNIFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHL 639
            IYK+LIKHY TWNIC + LNI N  STH   +ED+K +              QTII S +
Sbjct: 61   IYKYLIKHYCTWNICGKDLNIVNQNSTHVVFEEDSKLI----------HVTEQTIIPSQI 110

Query: 640  QEEKLLVYQGRDDLAGCDKTESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKK 819
            QE+KL V QGR ++A       +YTR T   QVQKEWSCALC ISTTSENCL AH+QGKK
Sbjct: 111  QEKKLTVNQGRYEVA------VNYTRPTFTMQVQKEWSCALCQISTTSENCLLAHLQGKK 164

Query: 820  HKTMVKEY--LLSSTQKRI----KGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTK 981
            HKT   E   +L  T  +     KG+VLLRNLNQIA ILNPVSRSIR CEW KP FGW K
Sbjct: 165  HKTKESEVRNMLHLTDNKYLPSSKGIVLLRNLNQIAKILNPVSRSIRLCEWIKPNFGWMK 224

Query: 982  LNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKE 1161
            LNTDGS+  E AGFGGLLRDH GEPICA+VSKAPQGDVFLVELWAIWRGLVLSL LGI  
Sbjct: 225  LNTDGSLNNEIAGFGGLLRDHMGEPICAYVSKAPQGDVFLVELWAIWRGLVLSLSLGITA 284

Query: 1162 IWVESDSMSVVNTINRKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKM 1338
            +WVESDSMSVV TIN++Q +CPKA   L+QIWKLL KFDKY ISHSWRETNRAADHLAKM
Sbjct: 285  LWVESDSMSVVKTINKEQPSCPKAYGCLEQIWKLLSKFDKYHISHSWRETNRAADHLAKM 344

Query: 1339 VLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLRR 1440
            V+LGNDV+LWP+DFP SL NII DDAKGKKY+RR
Sbjct: 345  VVLGNDVILWPIDFPCSLRNIIDDDAKGKKYIRR 378


>GAU49057.1 hypothetical protein TSUD_244480, partial [Trifolium subterraneum]
          Length = 352

 Score =  489 bits (1258), Expect = e-167
 Identities = 242/344 (70%), Positives = 269/344 (78%), Gaps = 21/344 (6%)
 Frame = +1

Query: 388  PWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGESTH-------- 543
            PWWPHVKA ATILLLIPYFG A YIYKFLIKHYF+WNIC  TLNIF+ + TH        
Sbjct: 8    PWWPHVKATATILLLIPYFGAATYIYKFLIKHYFSWNICGWTLNIFHQKITHFDLDNDSK 67

Query: 544  --SKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSYTR 717
              S SDE  +  L            GQTIIT+HLQEEKLLVYQG+DD+A CDKT + YT 
Sbjct: 68   ILSDSDEGRQVFLESDDDSKSVEVSGQTIITNHLQEEKLLVYQGKDDIADCDKTNTGYT- 126

Query: 718  LTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSSTQK 864
               K++VQKEWSCALC ISTTSENCLG+H+QGK+HK   KE           Y+LS TQ+
Sbjct: 127  --SKKKVQKEWSCALCQISTTSENCLGSHLQGKQHKAKEKELRVGLHATNIPYVLSFTQE 184

Query: 865  RIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDH 1044
            R+KGMVLLRN NQIA IL+PVSR I WCEW+KPKFGWTKLNTDGS+ +  AGFGGLLRD+
Sbjct: 185  RMKGMVLLRNFNQIAKILSPVSRPIIWCEWKKPKFGWTKLNTDGSVNKVTAGFGGLLRDY 244

Query: 1045 KGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCP 1224
            +GEPICAFVSKAPQGD FLVELWAIWRGLVLS+GLGIK IWVESDSMSVV TIN+ Q CP
Sbjct: 245  RGEPICAFVSKAPQGDTFLVELWAIWRGLVLSIGLGIKSIWVESDSMSVVKTINKVQHCP 304

Query: 1225 KAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGND 1356
            KAE+ L QIWKLL K D+YRISHSWRETNRAADHLAKM L GN+
Sbjct: 305  KAETCLIQIWKLLSKVDEYRISHSWRETNRAADHLAKMALCGNE 348


>XP_003544251.1 PREDICTED: uncharacterized protein LOC100787629 [Glycine max]
            KRH16868.1 hypothetical protein GLYMA_14G182900 [Glycine
            max]
          Length = 470

 Score =  450 bits (1158), Expect = e-150
 Identities = 228/308 (74%), Positives = 248/308 (80%), Gaps = 2/308 (0%)
 Frame = +1

Query: 523  FNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTE 702
            F+ +S + +SDED+K                QTI TS ++E+KL  YQ  DD+AGCDKTE
Sbjct: 173  FSEKSRNLESDEDSKLF----------NVSDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 703  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 876
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 877  MVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1056
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1057 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1236
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1237 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1416
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1417 KGKKYLRR 1440
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  217 bits (553), Expect = 2e-60
 Identities = 103/164 (62%), Positives = 127/164 (77%)
 Frame = +1

Query: 175 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 355 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 535 STHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 666
           STH +SDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>KHN39964.1 HVA22-like protein a [Glycine soja]
          Length = 470

 Score =  449 bits (1154), Expect = e-150
 Identities = 227/308 (73%), Positives = 248/308 (80%), Gaps = 2/308 (0%)
 Frame = +1

Query: 523  FNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTE 702
            F+ +S + +SD+D+K                QTI TS ++E+KL  YQ  DD+AGCDKTE
Sbjct: 173  FSEKSRNLESDKDSKLF----------NVTDQTITTSQIEEKKLEAYQETDDIAGCDKTE 222

Query: 703  SSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYLL--SSTQKRIKG 876
            SSYTRLT K  VQKEWSCALC ISTTSENCL AH++G+KHK    E  +  SSTQKRIKG
Sbjct: 223  SSYTRLTIKNLVQKEWSCALCQISTTSENCLRAHLKGRKHKDKENELRVEFSSTQKRIKG 282

Query: 877  MVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEP 1056
            MVLLRNLNQIA+ILNPVSRSIRWCEW KP+FGWTKLNTDGSI      FGGLLRD++GEP
Sbjct: 283  MVLLRNLNQIASILNPVSRSIRWCEWTKPEFGWTKLNTDGSIHSNTVSFGGLLRDYRGEP 342

Query: 1057 ICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTCPKAES 1236
            ICAFVSKAPQGDVFL ELWAIWRGLVLSLGLGIK IWVESDSMSVV T+NRKQ CPKA  
Sbjct: 343  ICAFVSKAPQGDVFLAELWAIWRGLVLSLGLGIKAIWVESDSMSVVRTVNRKQLCPKAVG 402

Query: 1237 YLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDA 1416
            YL QIWKLLKKFDKY+ISHSWRETNRAADHLAKM LL NDVVL PVDFP SL  II+DDA
Sbjct: 403  YLNQIWKLLKKFDKYQISHSWRETNRAADHLAKMDLLANDVVLSPVDFPPSLSRIIEDDA 462

Query: 1417 KGKKYLRR 1440
            KG KY RR
Sbjct: 463  KGTKYRRR 470



 Score =  217 bits (553), Expect = 2e-60
 Identities = 103/164 (62%), Positives = 127/164 (77%)
 Frame = +1

Query: 175 MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
           MGW EA+YT+LTKS T+L+WPP S LCPL+ S+R ++SDSRSSNQRCLAFWVL+SL +IM
Sbjct: 1   MGWIEAIYTLLTKSFTVLSWPPVSFLCPLFVSVRVMKSDSRSSNQRCLAFWVLYSLCMIM 60

Query: 355 EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
           E ELSVLF+ LPWWPHVK++ATILLLIPY G APY+YKFLI+HYFTW++  RT NIF+ +
Sbjct: 61  EGELSVLFNCLPWWPHVKSIATILLLIPYVGAAPYVYKFLIRHYFTWSLFTRTSNIFSEK 120

Query: 535 STHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQ 666
           STH +SDED+K V                 +TS +QE+KL  YQ
Sbjct: 121 STHLESDEDSKLV------------DVSDQMTSQIQEKKLEAYQ 152


>XP_016647353.1 PREDICTED: uncharacterized protein LOC103319467 [Prunus mume]
          Length = 456

 Score =  428 bits (1100), Expect = e-142
 Identities = 238/467 (50%), Positives = 296/467 (63%), Gaps = 46/467 (9%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAKLFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            E  L  LF WLP WP+ K   TILL++PYFG A Y+YK  I+ Y + N      NI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITILLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 535  STHS-KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES-S 708
              +   S EDN   +               I T   + E  ++++G    A   +TE   
Sbjct: 121  RINGFSSGEDNYPDVVDK----------NVIRTEPQKSEGAVIFKGTP--ASSSETEGRE 168

Query: 709  YTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSS 855
            YT  +  +++Q+EW+CALCLISTTS  CL  H++GKKHKT V+            Y LS 
Sbjct: 169  YTSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHKTQVEALRTYKQGPNSGYELSL 228

Query: 856  TQKRIKGMVL------------------------LRNLNQIAN--------ILNPVSRSI 939
              KR  GM+                         + NLNQIA         IL+PV+R I
Sbjct: 229  KLKRTNGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGILSPVARPI 288

Query: 940  RWCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAI 1119
            R C W+KP+ GWTKLNTDGS+ RE AG+GGLLRD+KGEPICAFVSKA   D+FLVELWAI
Sbjct: 289  RTCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGEPICAFVSKALGDDIFLVELWAI 348

Query: 1120 WRGLVLSLGLGIKEIWVESDSMSVVNTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHS 1296
            WRGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LLKKFDK+++SHS
Sbjct: 349  WRGLVLALSLGIKVIWVESDSESVVQTINRDRPYGQKASSCLKHIWELLKKFDKHQVSHS 408

Query: 1297 WRETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1437
            WRETNRAAD L+KMVLLG+DVV WPVDFP SLCNIIK+DA+G+ Y R
Sbjct: 409  WRETNRAADLLSKMVLLGSDVVFWPVDFPDSLCNIIKEDAEGRIYFR 455


>XP_012449204.1 PREDICTED: uncharacterized protein LOC105772486 isoform X1 [Gossypium
            raimondii] KJB64102.1 hypothetical protein
            B456_010G033200 [Gossypium raimondii]
          Length = 418

 Score =  426 bits (1096), Expect = e-142
 Identities = 228/441 (51%), Positives = 295/441 (66%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG+ EA   +L    T++ WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 525
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 526  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 705
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 706  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 846
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 217

Query: 847  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1020
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+     G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 277

Query: 1021 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1200
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1201 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAQGKTYLRR 418


>XP_017643483.1 PREDICTED: uncharacterized protein LOC108484269 isoform X1 [Gossypium
            arboreum] KHF99723.1 HVA22-like protein a [Gossypium
            arboreum]
          Length = 418

 Score =  425 bits (1093), Expect = e-141
 Identities = 225/443 (50%), Positives = 293/443 (66%), Gaps = 21/443 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 519
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 520  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 699
                E+  +  D D  T                  +T+  + EKL   QG  +++    T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 700  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 843
            E   T+    ++VQKEWSC LCLIST+SE CL  H+QGKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLQGKKHKT--KEYELRVGALPLKET 215

Query: 844  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREA 1014
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1015 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1194
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1195 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1371
             TINR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1372 VDFPHSLCNIIKDDAKGKKYLRR 1440
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>XP_016731529.1 PREDICTED: uncharacterized protein LOC107942387 [Gossypium hirsutum]
          Length = 418

 Score =  423 bits (1087), Expect = e-140
 Identities = 227/441 (51%), Positives = 293/441 (66%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIVETNSSLKNQQCLTYWVLFAFITMV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 525
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 114

Query: 526  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 705
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 115  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 163

Query: 706  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 846
              T+    ++VQKEWSC LCLIST+SE CL  H+ GKKHKT  KEY             +
Sbjct: 164  ISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLWGKKHKT--KEYELRVGALPLKETCM 217

Query: 847  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1020
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    +G
Sbjct: 218  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNSG 277

Query: 1021 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1200
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWA+WRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 278  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAVWRGLVLASGLGIKVIWVESDSKSVVKT 337

Query: 1201 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 338  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 397

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +L +IIKDDA+GK YLRR
Sbjct: 398  FPDTLSSIIKDDAEGKTYLRR 418


>EOY15502.1 HVA22-like protein a, putative isoform 1 [Theobroma cacao]
          Length = 420

 Score =  423 bits (1087), Expect = e-140
 Identities = 229/439 (52%), Positives = 285/439 (64%), Gaps = 17/439 (3%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  + 
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMG 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 519
            E  L    +W P+WP VK +ATILL+ PYFG A Y++K LI+ YF+   WNI    +  +
Sbjct: 61   ELTLGKFLNWFPFWPCVKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNILFFPKKKD 120

Query: 520  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 699
            I +        D D   +                       EE ++  +G  D +  D  
Sbjct: 121  IVSEAQNGILDDADTNRLKNGPKL-----------------EELIINGEGNFDRSS-DNK 162

Query: 700  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL----------- 846
            E + T LT  ++VQKEWSC LCLIS +SE CL  H+QGKKHKT   E             
Sbjct: 163  EVNSTWLTHPKRVQKEWSCVLCLISASSEKCLKKHLQGKKHKTKEDELRADALALRATCK 222

Query: 847  LSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFG 1026
            LSS  K+   +VLLRNLN I ++LNPV+ SI WC W+KP+ G  KLNTDGS+  E AGFG
Sbjct: 223  LSSVPKKAGRVVLLRNLN-IESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFG 281

Query: 1027 GLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTIN 1206
            GLLRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TIN
Sbjct: 282  GLLRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTIN 341

Query: 1207 RKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFP 1383
            R+Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP
Sbjct: 342  REQFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFP 401

Query: 1384 HSLCNIIKDDAKGKKYLRR 1440
             SL NII+DDA+GK Y RR
Sbjct: 402  DSLNNIIQDDARGKIYFRR 420


>XP_016730591.1 PREDICTED: uncharacterized protein LOC107941552 isoform X1 [Gossypium
            hirsutum]
          Length = 418

 Score =  422 bits (1085), Expect = e-140
 Identities = 223/443 (50%), Positives = 293/443 (66%), Gaps = 21/443 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG+ EA   +L    T+L WP F+L+ PL+ SIR +E++S   NQ+CL +WVLF+   ++
Sbjct: 1    MGFWEAFVELLANIFTVLCWPSFTLIYPLFVSIRIMETNSSLKNQQCLTYWVLFAFITMV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 519
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I    +   
Sbjct: 61   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDIMFFPKKKG 120

Query: 520  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 699
                E+  +  D D  T                  +T+  + EKL   QG  +++    T
Sbjct: 121  FVLHEANGTVGDADTST------------------LTNGPKSEKLTTDQGNVNIS-YGNT 161

Query: 700  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY------------ 843
            E   T+    ++VQKEWSC LCLIST+SE CL  H++GKKHKT  KEY            
Sbjct: 162  EVISTQ----KRVQKEWSCVLCLISTSSEYCLKEHLRGKKHKT--KEYELRVGALPLKET 215

Query: 844  -LLSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREA 1014
             +LSS  K+++ +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+    
Sbjct: 216  CMLSSMPKKVEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCVKLNTDGSVDAGN 275

Query: 1015 AGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVV 1194
            +GFGGLLRD++GEP+CAFV KAPQGD FLVELW IWRGLVL+ GLG+K IWVESDS SVV
Sbjct: 276  SGFGGLLRDYRGEPLCAFVCKAPQGDTFLVELWPIWRGLVLASGLGVKVIWVESDSKSVV 335

Query: 1195 NTINRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWP 1371
             TIN++Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP
Sbjct: 336  KTINQEQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWP 395

Query: 1372 VDFPHSLCNIIKDDAKGKKYLRR 1440
             DFP +L +IIKDDA+GK YLRR
Sbjct: 396  ADFPDTLNSIIKDDAQGKTYLRR 418


>XP_007018277.2 PREDICTED: uncharacterized protein LOC18591832 isoform X1 [Theobroma
            cacao]
          Length = 420

 Score =  421 bits (1082), Expect = e-140
 Identities = 226/439 (51%), Positives = 284/439 (64%), Gaps = 17/439 (3%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG  +    +L   LT+L WP ++L+ PLY SIR +E++S   NQ+CL +WVLF+L  ++
Sbjct: 1    MGICKVFLELLASILTVLCWPSYALIYPLYVSIRTVENNSSFKNQQCLTYWVLFALITMV 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNIC--ARTLN 519
            E  L    +W P+WP  K +ATILL+ PYFG A Y++K LI+ YF+   WNI    +  +
Sbjct: 61   ELTLGKFLNWFPFWPCAKGVATILLVTPYFGGASYVFKHLIRPYFSEKIWNILFFPKKKD 120

Query: 520  IFNGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKT 699
            I +        D D   +                       EE ++  +G  D +  D  
Sbjct: 121  IVSEAQNGILDDADTNRLKNGPKL-----------------EELIINGEGNFDRSS-DNK 162

Query: 700  ESSYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEYL----------- 846
            E + T LT  ++VQKEWSC LCL+S +SE CL  H+QGKKHKT   E             
Sbjct: 163  EVNSTWLTHPKRVQKEWSCVLCLVSASSEKCLKKHLQGKKHKTKEDELRADALALRATCK 222

Query: 847  LSSTQKRIKGMVLLRNLNQIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAGFG 1026
            LSS  K+   +VLLRNLN   ++LNPV+ SI WC W+KP+ G  KLNTDGS+  E AGFG
Sbjct: 223  LSSVPKKAGRVVLLRNLN-FESLLNPVTSSITWCRWKKPEIGCIKLNTDGSVVPENAGFG 281

Query: 1027 GLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNTIN 1206
            GLLRD+KG+P+CAFVSKAPQ D+FLVELWAIWRGLVL+ GLGIK IWVESDSMSVV TIN
Sbjct: 282  GLLRDYKGDPLCAFVSKAPQDDIFLVELWAIWRGLVLASGLGIKVIWVESDSMSVVRTIN 341

Query: 1207 RKQ-TCPKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVDFP 1383
            R+Q    K    LKQIWKLL  FD YR++HSWRETN+AADHL++MVL  +D VLWPVDFP
Sbjct: 342  REQFHGAKCSRCLKQIWKLLTMFDNYRVTHSWRETNKAADHLSRMVLRESDAVLWPVDFP 401

Query: 1384 HSLCNIIKDDAKGKKYLRR 1440
             SL NII+DDA+GK Y RR
Sbjct: 402  DSLNNIIQDDARGKIYFRR 420


>ONI35422.1 hypothetical protein PRUPE_1G535200 [Prunus persica]
          Length = 456

 Score =  421 bits (1081), Expect = e-139
 Identities = 231/466 (49%), Positives = 292/466 (62%), Gaps = 45/466 (9%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MGWE  L  +    L +L+WP F+L+  LYASI+A+ESDS S NQ+CL++WV+F+L  I 
Sbjct: 1    MGWEAFLQVLAELFLGVLSWPSFNLVYTLYASIQAIESDSHSRNQQCLSYWVMFALYKIS 60

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFTWNICARTLNIFNGE 534
            E  L  LF WLP WP+ K   T+LL++PYFG A Y+YK  I+ Y + N      NI + +
Sbjct: 61   EEALGKLFYWLPVWPYTKGAITVLLVLPYFGGASYLYKHFIRPYISENSVIWKWNILSIQ 120

Query: 535  STHS-KSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTESSY 711
              +   S EDN   +               I T   + E  ++++G    +  +K    Y
Sbjct: 121  RINGFNSGEDNYPDVVDK----------NVIRTEPQKSEGAVIFKGTP-ASSSEKEGREY 169

Query: 712  TRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKE-----------YLLSST 858
            T  +  +++Q+EW+CALCLISTTS  CL  H++GKKH+T V             Y  S  
Sbjct: 170  TSPSSPKKIQREWTCALCLISTTSGKCLKKHLRGKKHETQVAALRTYKQGPISGYKSSLK 229

Query: 859  QKRIKGMVL------------------------LRNLNQIAN--------ILNPVSRSIR 942
             KR  GM+                         + NLNQIA         IL+PV+R IR
Sbjct: 230  LKRTDGMIFNLNQMARANGKIFNLNQMARANGKIFNLNQIARANLEKWSGILSPVARPIR 289

Query: 943  WCEWEKPKFGWTKLNTDGSIQREAAGFGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIW 1122
             C W+KP+ GWTKLNTDGS+ RE AG+GGLLRD+KG+PICAFVSKA   D+FLVELWAIW
Sbjct: 290  MCIWKKPELGWTKLNTDGSVDRENAGYGGLLRDYKGDPICAFVSKALGDDIFLVELWAIW 349

Query: 1123 RGLVLSLGLGIKEIWVESDSMSVVNTINR-KQTCPKAESYLKQIWKLLKKFDKYRISHSW 1299
            RGLVL+L LGIK IWVESDS SVV TINR +    KA S LK IW+LL KFDK+++SHSW
Sbjct: 350  RGLVLALSLGIKVIWVESDSESVVQTINRDRPYSQKASSCLKHIWELLNKFDKHQVSHSW 409

Query: 1300 RETNRAADHLAKMVLLGNDVVLWPVDFPHSLCNIIKDDAKGKKYLR 1437
            RETNRAADHL+KMVLLG+DVV WPVDFP SL NIIK+DA+G+ Y R
Sbjct: 410  RETNRAADHLSKMVLLGSDVVFWPVDFPDSLHNIIKEDAEGRIYFR 455


>XP_012449205.1 PREDICTED: uncharacterized protein LOC105772486 isoform X2 [Gossypium
            raimondii]
          Length = 409

 Score =  408 bits (1048), Expect = e-135
 Identities = 223/441 (50%), Positives = 288/441 (65%), Gaps = 19/441 (4%)
 Frame = +1

Query: 175  MGWEEALYTILTKSLTILAWPPFSLLCPLYASIRALESDSRSSNQRCLAFWVLFSLSLIM 354
            MG+ EA   +L    T++ W         + SIR +E++S   NQ+CL +WVLF+ S ++
Sbjct: 1    MGFWEAFVELLANIFTVICW---------FVSIRIVETNSSLKNQQCLTYWVLFAFSTMV 51

Query: 355  EWELSVLFSWLPWWPHVKAMATILLLIPYFGIAPYIYKFLIKHYFT---WNICARTLNIF 525
            E  L  +  W P+WP+ K +ATILL+ PYFG A Y++  LI+ YF    W+I      +F
Sbjct: 52   ELTLGNILKWFPFWPYAKGVATILLVTPYFGGASYVFSLLIRPYFIEKRWDI------MF 105

Query: 526  NGESTHSKSDEDNKTVLXXXXXXXXXXXXGQTIITSHLQEEKLLVYQGRDDLAGCDKTES 705
              +       E N TV               +++T   + EKL   QG  +++    TE 
Sbjct: 106  FPKKKGFVLHEANGTV----------GDADTSMLTYGPKSEKLTTDQGNVNIS-YGNTEV 154

Query: 706  SYTRLTGKEQVQKEWSCALCLISTTSENCLGAHIQGKKHKTMVKEY-------------L 846
              T+    ++VQKEWSC LCLIST+SE+CL  H++GKKHKT  KEY             +
Sbjct: 155  ISTQ----KRVQKEWSCVLCLISTSSEDCLKEHLRGKKHKT--KEYELRVGALPLMETCM 208

Query: 847  LSSTQKRIKGMVLLRNLN--QIANILNPVSRSIRWCEWEKPKFGWTKLNTDGSIQREAAG 1020
            LSS  K+ + +VL RNLN    + +L+PV+RSIRWC+W+KP+ G  KLNTDGS+     G
Sbjct: 209  LSSMPKKAEKVVLFRNLNIETWSGLLHPVTRSIRWCKWKKPEIGCLKLNTDGSVDAGNLG 268

Query: 1021 FGGLLRDHKGEPICAFVSKAPQGDVFLVELWAIWRGLVLSLGLGIKEIWVESDSMSVVNT 1200
            FGGLLRD+KGEP+CAFV KAPQGD FLVELWAIWRGLVL+ GLGIK IWVESDS SVV T
Sbjct: 269  FGGLLRDYKGEPLCAFVCKAPQGDTFLVELWAIWRGLVLASGLGIKVIWVESDSKSVVKT 328

Query: 1201 INRKQTC-PKAESYLKQIWKLLKKFDKYRISHSWRETNRAADHLAKMVLLGNDVVLWPVD 1377
            INR+Q   PK+   L+QIWKLL KF+ YR++HSWRETN+AADHL++MVL  NDVVLWP D
Sbjct: 329  INREQPYGPKSSQCLRQIWKLLTKFENYRVTHSWRETNKAADHLSRMVLRENDVVLWPAD 388

Query: 1378 FPHSLCNIIKDDAKGKKYLRR 1440
            FP +L +IIKDDA+GK YLRR
Sbjct: 389  FPDTLSSIIKDDAQGKTYLRR 409


Top