BLASTX nr result

ID: Papaver31_contig00005364 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00005364
         (2191 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010261035.1| PREDICTED: uncharacterized protein LOC104599...  1008   0.0  
ref|XP_010652875.1| PREDICTED: uncharacterized protein LOC100247...   944   0.0  
ref|XP_010652876.1| PREDICTED: uncharacterized protein LOC100247...   939   0.0  
ref|XP_010652873.1| PREDICTED: uncharacterized protein LOC100247...   939   0.0  
emb|CAN77758.1| hypothetical protein VITISV_035945 [Vitis vinifera]   938   0.0  
ref|XP_010925343.1| PREDICTED: uncharacterized protein LOC105047...   899   0.0  
ref|XP_008786555.1| PREDICTED: uncharacterized protein LOC103704...   888   0.0  
ref|XP_008786547.1| PREDICTED: uncharacterized protein LOC103704...   888   0.0  
ref|XP_007048161.1| Uncharacterized protein isoform 1 [Theobroma...   887   0.0  
gb|KRH43338.1| hypothetical protein GLYMA_08G143200 [Glycine max]     881   0.0  
gb|KHN29127.1| Hypothetical protein glysoja_008462 [Glycine soja]     881   0.0  
ref|XP_003532852.1| PREDICTED: uncharacterized protein LOC100800...   881   0.0  
gb|KDO50473.1| hypothetical protein CISIN_1g000037mg [Citrus sin...   877   0.0  
ref|XP_006464509.1| PREDICTED: uncharacterized protein LOC102626...   877   0.0  
ref|XP_004289254.1| PREDICTED: uncharacterized protein LOC101305...   875   0.0  
ref|XP_012437402.1| PREDICTED: uncharacterized protein LOC105763...   874   0.0  
gb|KJB46751.1| hypothetical protein B456_008G100800 [Gossypium r...   874   0.0  
gb|KJB46750.1| hypothetical protein B456_008G100800 [Gossypium r...   874   0.0  
ref|XP_012437401.1| PREDICTED: uncharacterized protein LOC105763...   874   0.0  
ref|XP_011021957.1| PREDICTED: uncharacterized protein LOC105123...   872   0.0  

>ref|XP_010261035.1| PREDICTED: uncharacterized protein LOC104599968 [Nelumbo nucifera]
            gi|720016065|ref|XP_010261036.1| PREDICTED:
            uncharacterized protein LOC104599968 [Nelumbo nucifera]
          Length = 3276

 Score = 1008 bits (2607), Expect = 0.0
 Identities = 523/734 (71%), Positives = 606/734 (82%), Gaps = 5/734 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  +SSAQCTLENLRPALQRFPTLWRTL+A+CF  DA+G S+  + K+VFGNS LSDY
Sbjct: 1593 VNRNCNSSAQCTLENLRPALQRFPTLWRTLVASCFHQDADGSSMAHNTKNVFGNSTLSDY 1652

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L WRE+IFSS G DT LVQMLPCWFSK IRRLIQLFVQGP GWQSLAG+P GESFLHR+I
Sbjct: 1653 LYWRENIFSSTGRDTPLVQMLPCWFSKSIRRLIQLFVQGPLGWQSLAGIPAGESFLHREI 1712

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              FINAHE+  +SA+SWEA+IQK+VE+EL+ASS+EET FGVEH+LHRGRALAAFNHLLG+
Sbjct: 1713 GIFINAHESAGLSAISWEASIQKNVEEELYASSVEETGFGVEHHLHRGRALAAFNHLLGM 1772

Query: 1649 RGQMLNE-NSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLV 1473
            R Q L   N  ++QSGAS   AN+QSDVQ+LLAP+T NEESLLS+V+PLAI HFEDS+LV
Sbjct: 1773 RVQKLKSTNILQEQSGAS---ANVQSDVQILLAPLTHNEESLLSSVVPLAIVHFEDSMLV 1829

Query: 1472 ASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDIT 1293
            ASCAFLLELCGLSASMLR+DVAALRRISSFY SSEYNEH +H SPKG+AFHA   EG IT
Sbjct: 1830 ASCAFLLELCGLSASMLRVDVAALRRISSFYMSSEYNEHSKHLSPKGTAFHAVNHEGAIT 1889

Query: 1292 VSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGET 1113
            +SLAQALADDY+H  +      +E SN   ++SK+ SRA++AVL  LEKAS+PLM EG T
Sbjct: 1890 ISLAQALADDYLHHYNDSVIKPKETSNRD-SSSKQPSRALMAVLLQLEKASLPLMVEGRT 1948

Query: 1112 CGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTE 933
            CGSWLL+G+GDGAEFRS QKAASQHW+LVT FC+MHQIPLSTKYL+VLAKDNDWVGFL E
Sbjct: 1949 CGSWLLNGTGDGAEFRSQQKAASQHWNLVTDFCKMHQIPLSTKYLAVLAKDNDWVGFLAE 2008

Query: 932  AQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLS 753
            AQV G+PFDA IQVASK+FSDPRL+IHILTVL+S+ STRKK  S  N+ P  K NE+  S
Sbjct: 2009 AQVGGYPFDAIIQVASKEFSDPRLRIHILTVLKSIQSTRKKSSSYSNSAPMEKNNEMPFS 2068

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
            ++ N ++P+ELF LLAECEK+K+PG+ALL++AKD+RWSLLAMIASCF+DVSPLSCLTVWL
Sbjct: 2069 TDTNLLIPLELFRLLAECEKEKNPGKALLIKAKDLRWSLLAMIASCFADVSPLSCLTVWL 2128

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL- 396
            EITAARETSSIKV+DIASQIANNVGAAVE TNL P G++ LTF YNR+N KRR L+E   
Sbjct: 2129 EITAARETSSIKVDDIASQIANNVGAAVEMTNLLPVGSRALTFRYNRRNPKRRRLMEQTS 2188

Query: 395  ---SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSV 225
               S  T+S  S +  V++ S   ++S            +I +LSD DE   SLSKMV+V
Sbjct: 2189 GDPSTTTSSKVSTDINVIRNSAIQDISAEEDKRQEADEQNI-ILSDSDEVHVSLSKMVAV 2247

Query: 224  LCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHAR 45
            LCEQ LFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRL+EASAHLASFSARIKEE  H +
Sbjct: 2248 LCEQHLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLTEASAHLASFSARIKEEAPHVQ 2307

Query: 44   ANIGREGQVGAPWI 3
             +IGRE  +G  WI
Sbjct: 2308 TSIGREKLIGTSWI 2321


>ref|XP_010652875.1| PREDICTED: uncharacterized protein LOC100247348 isoform X2 [Vitis
            vinifera]
          Length = 3261

 Score =  944 bits (2441), Expect = 0.0
 Identities = 499/733 (68%), Positives = 576/733 (78%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R +SSSAQCTLENLRP LQRFPTLWRTL+AA FGHDA    L P AK+VFGNS+LSDY
Sbjct: 1595 VNRHYSSSAQCTLENLRPTLQRFPTLWRTLVAASFGHDATSNFLSPKAKNVFGNSSLSDY 1654

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L+WR++IF S  HDTSL+QMLPCWFSK IRRLIQL+VQGP GWQSL      ESF  RD+
Sbjct: 1655 LSWRDNIFFSTAHDTSLLQMLPCWFSKAIRRLIQLYVQGPLGWQSL------ESFPPRDV 1708

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              F+N++++ ++SA+SWEAAIQK VE+EL+ASSL E+  G+E +LHRGRALAAFNHLLG+
Sbjct: 1709 DLFVNSNDHADISAISWEAAIQKHVEEELYASSLRESGLGLEQHLHRGRALAAFNHLLGV 1768

Query: 1649 RGQMLNENSHKKQSGAS-SGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLV 1473
            R Q L   + K QS AS +GQ N+QSDVQMLL+P+TQ+EESLLS+V PLAI HFEDSVLV
Sbjct: 1769 RVQKLKLENTKGQSSASVNGQTNVQSDVQMLLSPITQSEESLLSSVTPLAIIHFEDSVLV 1828

Query: 1472 ASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDIT 1293
            ASCAFLLELCGLSASMLRID+AALRRISSFYKSSEY EH++  SPKGSA HA   E DIT
Sbjct: 1829 ASCAFLLELCGLSASMLRIDIAALRRISSFYKSSEYTEHYRQLSPKGSALHAVSHEVDIT 1888

Query: 1292 VSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGET 1113
             SLAQALADDY+  D S    Q+   N     SKR SRA++ VLQHLEK S+PLMA+G++
Sbjct: 1889 NSLAQALADDYVGHDGSSIVKQKGTPNS--VTSKRPSRALMLVLQHLEKVSLPLMADGKS 1946

Query: 1112 CGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTE 933
            CGSWL SG+GDGAE RS QKAASQHW+LVT FCQMHQIPLSTKYL +LA+DNDWVGFL+E
Sbjct: 1947 CGSWLFSGNGDGAELRSQQKAASQHWNLVTVFCQMHQIPLSTKYLGLLARDNDWVGFLSE 2006

Query: 932  AQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLS 753
            AQV G+PF+  IQVAS++FSDPRLKIHI+TVL+ + S RKK  SS N     K NE S  
Sbjct: 2007 AQVGGYPFEKVIQVASREFSDPRLKIHIVTVLKGLLS-RKKVSSSSNLDTSEKRNETSFV 2065

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
             EN+  +PVELFG+LAECEK K+PGEALL++AK++ WS+LAMIASCF DVSPLSCLTVWL
Sbjct: 2066 DENS-FIPVELFGILAECEKGKNPGEALLVKAKELCWSILAMIASCFPDVSPLSCLTVWL 2124

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLS 393
            EITAARETSSIKVNDIAS+IAN+VGAAVEATN  P G + L FHYNR+N KRR L+E +S
Sbjct: 2125 EITAARETSSIKVNDIASKIANSVGAAVEATNSLPVGGRPLQFHYNRRNPKRRRLMEPIS 2184

Query: 392  AVTASNDSGNPGVVKKSV---PTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVL 222
                +  + +   V  S      +                KV  + D+G  SLSKMV+VL
Sbjct: 2185 LEHLAATTSDVSCVSDSAKIFSVQGFVAEVERKSDAGELTKVSVNSDDGPNSLSKMVAVL 2244

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQRLFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRLSEASAHL SFSARIKEEP     
Sbjct: 2245 CEQRLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPI---- 2300

Query: 41   NIGREGQVGAPWI 3
             IGREGQ+G  WI
Sbjct: 2301 -IGREGQIGTSWI 2312


>ref|XP_010652876.1| PREDICTED: uncharacterized protein LOC100247348 isoform X3 [Vitis
            vinifera]
          Length = 2452

 Score =  939 bits (2428), Expect = 0.0
 Identities = 499/735 (67%), Positives = 576/735 (78%), Gaps = 6/735 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R +SSSAQCTLENLRP LQRFPTLWRTL+AA FGHDA    L P AK+VFGNS+LSDY
Sbjct: 784  VNRHYSSSAQCTLENLRPTLQRFPTLWRTLVAASFGHDATSNFLSPKAKNVFGNSSLSDY 843

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L+WR++IF S  HDTSL+QMLPCWFSK IRRLIQL+VQGP GWQSL      ESF  RD+
Sbjct: 844  LSWRDNIFFSTAHDTSLLQMLPCWFSKAIRRLIQLYVQGPLGWQSL------ESFPPRDV 897

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLE--ETTFGVEHYLHRGRALAAFNHLL 1656
              F+N++++ ++SA+SWEAAIQK VE+EL+ASSL   E+  G+E +LHRGRALAAFNHLL
Sbjct: 898  DLFVNSNDHADISAISWEAAIQKHVEEELYASSLRVVESGLGLEQHLHRGRALAAFNHLL 957

Query: 1655 GLRGQMLNENSHKKQSGAS-SGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSV 1479
            G+R Q L   + K QS AS +GQ N+QSDVQMLL+P+TQ+EESLLS+V PLAI HFEDSV
Sbjct: 958  GVRVQKLKLENTKGQSSASVNGQTNVQSDVQMLLSPITQSEESLLSSVTPLAIIHFEDSV 1017

Query: 1478 LVASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGD 1299
            LVASCAFLLELCGLSASMLRID+AALRRISSFYKSSEY EH++  SPKGSA HA   E D
Sbjct: 1018 LVASCAFLLELCGLSASMLRIDIAALRRISSFYKSSEYTEHYRQLSPKGSALHAVSHEVD 1077

Query: 1298 ITVSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEG 1119
            IT SLAQALADDY+  D S    Q+   N     SKR SRA++ VLQHLEK S+PLMA+G
Sbjct: 1078 ITNSLAQALADDYVGHDGSSIVKQKGTPNS--VTSKRPSRALMLVLQHLEKVSLPLMADG 1135

Query: 1118 ETCGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFL 939
            ++CGSWL SG+GDGAE RS QKAASQHW+LVT FCQMHQIPLSTKYL +LA+DNDWVGFL
Sbjct: 1136 KSCGSWLFSGNGDGAELRSQQKAASQHWNLVTVFCQMHQIPLSTKYLGLLARDNDWVGFL 1195

Query: 938  TEAQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEIS 759
            +EAQV G+PF+  IQVAS++FSDPRLKIHI+TVL+ + S RKK  SS N     K NE S
Sbjct: 1196 SEAQVGGYPFEKVIQVASREFSDPRLKIHIVTVLKGLLS-RKKVSSSSNLDTSEKRNETS 1254

Query: 758  LSSENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTV 579
               EN+  +PVELFG+LAECEK K+PGEALL++AK++ WS+LAMIASCF DVSPLSCLTV
Sbjct: 1255 FVDENS-FIPVELFGILAECEKGKNPGEALLVKAKELCWSILAMIASCFPDVSPLSCLTV 1313

Query: 578  WLEITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIES 399
            WLEITAARETSSIKVNDIAS+IAN+VGAAVEATN  P G + L FHYNR+N KRR L+E 
Sbjct: 1314 WLEITAARETSSIKVNDIASKIANSVGAAVEATNSLPVGGRPLQFHYNRRNPKRRRLMEP 1373

Query: 398  LSAVTASNDSGNPGVVKKSV---PTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVS 228
            +S    +  + +   V  S      +                KV  + D+G  SLSKMV+
Sbjct: 1374 ISLEHLAATTSDVSCVSDSAKIFSVQGFVAEVERKSDAGELTKVSVNSDDGPNSLSKMVA 1433

Query: 227  VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHA 48
            VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRLSEASAHL SFSARIKEEP   
Sbjct: 1434 VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPI-- 1491

Query: 47   RANIGREGQVGAPWI 3
               IGREGQ+G  WI
Sbjct: 1492 ---IGREGQIGTSWI 1503


>ref|XP_010652873.1| PREDICTED: uncharacterized protein LOC100247348 isoform X1 [Vitis
            vinifera]
          Length = 3263

 Score =  939 bits (2428), Expect = 0.0
 Identities = 499/735 (67%), Positives = 576/735 (78%), Gaps = 6/735 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R +SSSAQCTLENLRP LQRFPTLWRTL+AA FGHDA    L P AK+VFGNS+LSDY
Sbjct: 1595 VNRHYSSSAQCTLENLRPTLQRFPTLWRTLVAASFGHDATSNFLSPKAKNVFGNSSLSDY 1654

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L+WR++IF S  HDTSL+QMLPCWFSK IRRLIQL+VQGP GWQSL      ESF  RD+
Sbjct: 1655 LSWRDNIFFSTAHDTSLLQMLPCWFSKAIRRLIQLYVQGPLGWQSL------ESFPPRDV 1708

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLE--ETTFGVEHYLHRGRALAAFNHLL 1656
              F+N++++ ++SA+SWEAAIQK VE+EL+ASSL   E+  G+E +LHRGRALAAFNHLL
Sbjct: 1709 DLFVNSNDHADISAISWEAAIQKHVEEELYASSLRVVESGLGLEQHLHRGRALAAFNHLL 1768

Query: 1655 GLRGQMLNENSHKKQSGAS-SGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSV 1479
            G+R Q L   + K QS AS +GQ N+QSDVQMLL+P+TQ+EESLLS+V PLAI HFEDSV
Sbjct: 1769 GVRVQKLKLENTKGQSSASVNGQTNVQSDVQMLLSPITQSEESLLSSVTPLAIIHFEDSV 1828

Query: 1478 LVASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGD 1299
            LVASCAFLLELCGLSASMLRID+AALRRISSFYKSSEY EH++  SPKGSA HA   E D
Sbjct: 1829 LVASCAFLLELCGLSASMLRIDIAALRRISSFYKSSEYTEHYRQLSPKGSALHAVSHEVD 1888

Query: 1298 ITVSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEG 1119
            IT SLAQALADDY+  D S    Q+   N     SKR SRA++ VLQHLEK S+PLMA+G
Sbjct: 1889 ITNSLAQALADDYVGHDGSSIVKQKGTPNS--VTSKRPSRALMLVLQHLEKVSLPLMADG 1946

Query: 1118 ETCGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFL 939
            ++CGSWL SG+GDGAE RS QKAASQHW+LVT FCQMHQIPLSTKYL +LA+DNDWVGFL
Sbjct: 1947 KSCGSWLFSGNGDGAELRSQQKAASQHWNLVTVFCQMHQIPLSTKYLGLLARDNDWVGFL 2006

Query: 938  TEAQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEIS 759
            +EAQV G+PF+  IQVAS++FSDPRLKIHI+TVL+ + S RKK  SS N     K NE S
Sbjct: 2007 SEAQVGGYPFEKVIQVASREFSDPRLKIHIVTVLKGLLS-RKKVSSSSNLDTSEKRNETS 2065

Query: 758  LSSENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTV 579
               EN+  +PVELFG+LAECEK K+PGEALL++AK++ WS+LAMIASCF DVSPLSCLTV
Sbjct: 2066 FVDENS-FIPVELFGILAECEKGKNPGEALLVKAKELCWSILAMIASCFPDVSPLSCLTV 2124

Query: 578  WLEITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIES 399
            WLEITAARETSSIKVNDIAS+IAN+VGAAVEATN  P G + L FHYNR+N KRR L+E 
Sbjct: 2125 WLEITAARETSSIKVNDIASKIANSVGAAVEATNSLPVGGRPLQFHYNRRNPKRRRLMEP 2184

Query: 398  LSAVTASNDSGNPGVVKKSV---PTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVS 228
            +S    +  + +   V  S      +                KV  + D+G  SLSKMV+
Sbjct: 2185 ISLEHLAATTSDVSCVSDSAKIFSVQGFVAEVERKSDAGELTKVSVNSDDGPNSLSKMVA 2244

Query: 227  VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHA 48
            VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRLSEASAHL SFSARIKEEP   
Sbjct: 2245 VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPI-- 2302

Query: 47   RANIGREGQVGAPWI 3
               IGREGQ+G  WI
Sbjct: 2303 ---IGREGQIGTSWI 2314


>emb|CAN77758.1| hypothetical protein VITISV_035945 [Vitis vinifera]
          Length = 1859

 Score =  938 bits (2425), Expect = 0.0
 Identities = 499/735 (67%), Positives = 574/735 (78%), Gaps = 6/735 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R +SSSAQCTLENLRP LQRFPTLWRTL+AA FGHDA    L P AK+VFGNS+LSDY
Sbjct: 1015 VNRHYSSSAQCTLENLRPTLQRFPTLWRTLVAASFGHDATSNFLSPKAKNVFGNSSLSDY 1074

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L+WR++IF S  HDTSL+QMLPCWFSK IRRLIQL+VQGP GWQSL      ESF  RD+
Sbjct: 1075 LSWRDNIFFSTAHDTSLLQMLPCWFSKAIRRLIQLYVQGPLGWQSL------ESFPPRDV 1128

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              F+N++++ ++SA+SWEAAIQK VE+EL+ASSL E+  G+E +LHRGRALAAFNHLLG+
Sbjct: 1129 DLFVNSNDHADISAISWEAAIQKHVEEELYASSLRESGLGLEQHLHRGRALAAFNHLLGV 1188

Query: 1649 RGQMLNENSHKKQSGAS-SGQANIQSDVQMLLAPVTQNEESLLS--TVMPLAISHFEDSV 1479
            R Q L   + K QS AS +GQ N+QSDVQMLL+P+TQ+EE LLS  TV PLAI HFEDSV
Sbjct: 1189 RVQKLKLENTKGQSSASVNGQTNVQSDVQMLLSPITQSEEXLLSSVTVTPLAIIHFEDSV 1248

Query: 1478 LVASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGD 1299
            LVASCAFLLELCGLSASMLRID+AALRRISSFYKSSEY EH++  SPKGSA HA   E D
Sbjct: 1249 LVASCAFLLELCGLSASMLRIDIAALRRISSFYKSSEYTEHYRQLSPKGSALHAVSHEVD 1308

Query: 1298 ITVSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEG 1119
            IT SLAQALADDY+  D S    Q+   N     SKR SRA++ VLQHLEK S+PLMA+G
Sbjct: 1309 ITNSLAQALADDYVGHDGSSIVKQKGTPNS--VTSKRPSRALMLVLQHLEKVSLPLMADG 1366

Query: 1118 ETCGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFL 939
            ++CGSWL SG+GDGAE RS QKAASQHW+LVT FCQMHQIPLSTKYL  LA+DNDWVGFL
Sbjct: 1367 KSCGSWLFSGNGDGAELRSQQKAASQHWNLVTVFCQMHQIPLSTKYLGFLARDNDWVGFL 1426

Query: 938  TEAQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEIS 759
            +EAQV G+PF+  IQVAS++FSDPRLKIHI+TVL+ + S RKK  SS N     K NE S
Sbjct: 1427 SEAQVGGYPFEKVIQVASREFSDPRLKIHIVTVLKGLLS-RKKVSSSSNLDTSEKRNETS 1485

Query: 758  LSSENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTV 579
               EN+  +PVELFG+LAECEK K+PGEALL++AK++ WS+LAMIASCF DVSPLSCLTV
Sbjct: 1486 FVDENS-FIPVELFGILAECEKGKNPGEALLVKAKELCWSILAMIASCFPDVSPLSCLTV 1544

Query: 578  WLEITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIES 399
            WLEITAARETSSIKVNDIAS+IAN+VGAAVEATN  P G + L FHYNR+N KRR L+E 
Sbjct: 1545 WLEITAARETSSIKVNDIASKIANSVGAAVEATNSLPVGGRPLQFHYNRRNPKRRRLMEP 1604

Query: 398  LSAVTASNDSGNPGVVKKSV---PTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVS 228
            +S    +  + +   V  S      +                KV  + D+G  SLSKMV+
Sbjct: 1605 ISLEHLAATTSDVSCVSDSAKIFSVQGFVAEVERKSDAGELTKVSVNSDDGPNSLSKMVA 1664

Query: 227  VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHA 48
            VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRLSEASAHL SFSARIKEEP   
Sbjct: 1665 VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPI-- 1722

Query: 47   RANIGREGQVGAPWI 3
               IGREGQ+G  WI
Sbjct: 1723 ---IGREGQIGTSWI 1734


>ref|XP_010925343.1| PREDICTED: uncharacterized protein LOC105047910 [Elaeis guineensis]
          Length = 3256

 Score =  899 bits (2323), Expect = 0.0
 Identities = 472/734 (64%), Positives = 563/734 (76%), Gaps = 5/734 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SSS+QCTLENLRP LQ FPTLWRTL+A+CFG DAN  SL P A +VFG SA SDY
Sbjct: 1580 VNRHCSSSSQCTLENLRPGLQHFPTLWRTLVASCFGQDANDYSLSPTASNVFGKSAFSDY 1639

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAG-VPTGESFLHRD 1833
            L+WR SIFSSAG D SL+QMLPCWF K IRRLI+LFVQG  GWQSL G V TGESFL+RD
Sbjct: 1640 LSWRNSIFSSAGGDASLIQMLPCWFPKSIRRLIKLFVQGSLGWQSLLGAVTTGESFLYRD 1699

Query: 1832 ISYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLG 1653
             SY ++A+ NG VSA+SWEA+IQKS+EKEL  SSLEE  FGVEH+LHRGRALAAFNHLLG
Sbjct: 1700 NSYVVSANRNGGVSAISWEASIQKSIEKEL-CSSLEENGFGVEHHLHRGRALAAFNHLLG 1758

Query: 1652 LRGQMLNE-NSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVL 1476
             R   L   N+H++ SG    Q NIQ+D+Q +LAP+TQ+E S+LS+V+PLA+ HFEDSVL
Sbjct: 1759 ARALKLKSVNAHQELSG----QPNIQADMQTILAPLTQSEGSILSSVVPLAVIHFEDSVL 1814

Query: 1475 VASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDI 1296
            VASCAF LELCGLSASMLR+D+AALRRISS+Y S E+N H++H SP+GS  HA   EGD+
Sbjct: 1815 VASCAFFLELCGLSASMLRVDIAALRRISSYYNSVEHNVHYEHVSPRGSVVHAVSHEGDL 1874

Query: 1295 TVSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGE 1116
            T SLA+ALADDY+H D     +++++ +    +  + S+ +++VL HLEKAS+P   E +
Sbjct: 1875 TASLARALADDYIHHDHLNILEKKDVPS--EVSKGKPSQPLMSVLHHLEKASLPPTDESK 1932

Query: 1115 TCGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLT 936
            T G+WLLSG GDG+EFRS QK AS+HW+LVTAFCQMH +PLSTKYL++LA DNDWVGFLT
Sbjct: 1933 TSGTWLLSGIGDGSEFRSRQKDASRHWNLVTAFCQMHHLPLSTKYLALLANDNDWVGFLT 1992

Query: 935  EAQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISL 756
            EAQ+ G P D  IQVA+K+FSDPRLK H+LT+LRSM S RKK     NT   G ++EISL
Sbjct: 1993 EAQLGGFPVDVIIQVAAKEFSDPRLKTHVLTILRSMQSARKKTSPLTNTSSSG-SSEISL 2051

Query: 755  SSENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVW 576
             ++N+    +ELFG+LAECEKQK+PGEALL +AKD+RWSLLAMIASCF DVSPL+CLTVW
Sbjct: 2052 DTDNS--TTLELFGILAECEKQKNPGEALLRKAKDLRWSLLAMIASCFPDVSPLACLTVW 2109

Query: 575  LEITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL 396
            LEITAARETSSIKV+D++S+IAN+VGAAVE TN  P G++ L F YNR+N KRR L+E  
Sbjct: 2110 LEITAARETSSIKVDDLSSKIANSVGAAVEVTNTLPIGSRTLAFRYNRRNSKRRRLMEPT 2169

Query: 395  SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDI---KVLSDPDEGLTSLSKMVSV 225
            S  +    S N      S    ++             I   K  +D DEGL SLS MV+V
Sbjct: 2170 SRNSTMGSSFNVPSTSTSTIASIAQEIVNEEERKRMVIEQPKSSNDVDEGLASLSNMVAV 2229

Query: 224  LCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHAR 45
            LCEQ LFLPLLRAFEMFLPSCSLLPFIR LQ F QMRL EASAHLASFSARIKEEPF  +
Sbjct: 2230 LCEQHLFLPLLRAFEMFLPSCSLLPFIRFLQAFFQMRLPEASAHLASFSARIKEEPFLIQ 2289

Query: 44   ANIGREGQVGAPWI 3
             N  R+G +   WI
Sbjct: 2290 MNSARDGLLKTAWI 2303


>ref|XP_008786555.1| PREDICTED: uncharacterized protein LOC103704848 isoform X2 [Phoenix
            dactylifera]
          Length = 2356

 Score =  888 bits (2295), Expect = 0.0
 Identities = 470/733 (64%), Positives = 560/733 (76%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SSS+QCTLENLRP LQ FPTLWRTL+A+CFG +AN  SL   A +VFG SA SDY
Sbjct: 681  VNRHCSSSSQCTLENLRPGLQHFPTLWRTLVASCFGQEANDYSLSSTASNVFGKSAFSDY 740

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAG-VPTGESFLHRD 1833
            LNWR SIFSSAG D SL+QMLPCWF K IRRLI+LFVQGP GWQSL G V TGESFL+RD
Sbjct: 741  LNWRNSIFSSAGGDASLIQMLPCWFPKSIRRLIKLFVQGPLGWQSLLGAVTTGESFLYRD 800

Query: 1832 ISYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLG 1653
             +Y +NA+ NG  SA+SWEA+IQKS+EKEL  SSLEE  FGVEH+LHRGRALAAFNHLLG
Sbjct: 801  NNYVVNANRNGGASAISWEASIQKSIEKEL-CSSLEENRFGVEHHLHRGRALAAFNHLLG 859

Query: 1652 LRGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLV 1473
             R   L   + +++    SGQ NIQ+DVQ +LAP+TQ+E S+LS+V+PLAI HFEDSVLV
Sbjct: 860  ARALNLKSANARQEL---SGQPNIQADVQAILAPLTQSEGSILSSVVPLAIMHFEDSVLV 916

Query: 1472 ASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDIT 1293
            ASCAF LELCGLSAS+LR+D+AALRRIS++Y S+E+N H++H SP+GS  HA   EGD+T
Sbjct: 917  ASCAFFLELCGLSASILRVDIAALRRISAYYNSAEHNVHYEHVSPRGSVLHAVSHEGDLT 976

Query: 1292 VSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGET 1113
             SLA+ALADDY+H D     ++++       +  + S+ +++VL HLEKAS+P + E ET
Sbjct: 977  ASLARALADDYIHHDHLNILEKKD--GPSEVSKDKPSQPLMSVLHHLEKASLPPIDESET 1034

Query: 1112 CGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTE 933
             G+WLLSG GDG+EFRS QK AS+ W+LVTAFCQMH +PLSTKYL++LA DNDWVGFLTE
Sbjct: 1035 SGTWLLSGIGDGSEFRSRQKDASRCWNLVTAFCQMHHLPLSTKYLALLANDNDWVGFLTE 1094

Query: 932  AQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLS 753
            AQ+ G P D  IQVA+K+FSDPRLK HILTVLRSM S RKK  S  NT   G ++EIS  
Sbjct: 1095 AQMGGFPVDVIIQVAAKEFSDPRLKTHILTVLRSMQS-RKKTSSLTNTSSSG-SSEISFD 1152

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
            ++++    +ELFG+LAECEKQK+PGEALL +AKD+RWSLLAMIASCF DVSPL+CLTVWL
Sbjct: 1153 TDSS--TTLELFGILAECEKQKNPGEALLRKAKDLRWSLLAMIASCFPDVSPLACLTVWL 1210

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLS 393
            EITAARETSSIKV+DI+S+IAN+VGAAVE TN  P G++ L F YNR+N KRR L+   S
Sbjct: 1211 EITAARETSSIKVDDISSKIANSVGAAVEVTNTLPIGSRMLAFRYNRRNSKRRRLMVPTS 1270

Query: 392  AVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDI---KVLSDPDEGLTSLSKMVSVL 222
              +    S N      S    ++             +   K  +D DEGL SLS MV+VL
Sbjct: 1271 GNSTMGSSFNVPSTSTSTIASIAQEIVSEEESRRMVMEQPKSSNDLDEGLASLSNMVAVL 1330

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQ LFLPLLRAFEMFLPSCSLLPFIR LQ FSQMRL EASAHLASFSARIKEEPF  + 
Sbjct: 1331 CEQHLFLPLLRAFEMFLPSCSLLPFIRFLQAFSQMRLPEASAHLASFSARIKEEPFLGQI 1390

Query: 41   NIGREGQVGAPWI 3
            N  R+G +   WI
Sbjct: 1391 NSARDGLLKTAWI 1403


>ref|XP_008786547.1| PREDICTED: uncharacterized protein LOC103704848 isoform X1 [Phoenix
            dactylifera]
          Length = 3252

 Score =  888 bits (2295), Expect = 0.0
 Identities = 470/733 (64%), Positives = 560/733 (76%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SSS+QCTLENLRP LQ FPTLWRTL+A+CFG +AN  SL   A +VFG SA SDY
Sbjct: 1577 VNRHCSSSSQCTLENLRPGLQHFPTLWRTLVASCFGQEANDYSLSSTASNVFGKSAFSDY 1636

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAG-VPTGESFLHRD 1833
            LNWR SIFSSAG D SL+QMLPCWF K IRRLI+LFVQGP GWQSL G V TGESFL+RD
Sbjct: 1637 LNWRNSIFSSAGGDASLIQMLPCWFPKSIRRLIKLFVQGPLGWQSLLGAVTTGESFLYRD 1696

Query: 1832 ISYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLG 1653
             +Y +NA+ NG  SA+SWEA+IQKS+EKEL  SSLEE  FGVEH+LHRGRALAAFNHLLG
Sbjct: 1697 NNYVVNANRNGGASAISWEASIQKSIEKEL-CSSLEENRFGVEHHLHRGRALAAFNHLLG 1755

Query: 1652 LRGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLV 1473
             R   L   + +++    SGQ NIQ+DVQ +LAP+TQ+E S+LS+V+PLAI HFEDSVLV
Sbjct: 1756 ARALNLKSANARQEL---SGQPNIQADVQAILAPLTQSEGSILSSVVPLAIMHFEDSVLV 1812

Query: 1472 ASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDIT 1293
            ASCAF LELCGLSAS+LR+D+AALRRIS++Y S+E+N H++H SP+GS  HA   EGD+T
Sbjct: 1813 ASCAFFLELCGLSASILRVDIAALRRISAYYNSAEHNVHYEHVSPRGSVLHAVSHEGDLT 1872

Query: 1292 VSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGET 1113
             SLA+ALADDY+H D     ++++       +  + S+ +++VL HLEKAS+P + E ET
Sbjct: 1873 ASLARALADDYIHHDHLNILEKKD--GPSEVSKDKPSQPLMSVLHHLEKASLPPIDESET 1930

Query: 1112 CGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTE 933
             G+WLLSG GDG+EFRS QK AS+ W+LVTAFCQMH +PLSTKYL++LA DNDWVGFLTE
Sbjct: 1931 SGTWLLSGIGDGSEFRSRQKDASRCWNLVTAFCQMHHLPLSTKYLALLANDNDWVGFLTE 1990

Query: 932  AQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLS 753
            AQ+ G P D  IQVA+K+FSDPRLK HILTVLRSM S RKK  S  NT   G ++EIS  
Sbjct: 1991 AQMGGFPVDVIIQVAAKEFSDPRLKTHILTVLRSMQS-RKKTSSLTNTSSSG-SSEISFD 2048

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
            ++++    +ELFG+LAECEKQK+PGEALL +AKD+RWSLLAMIASCF DVSPL+CLTVWL
Sbjct: 2049 TDSS--TTLELFGILAECEKQKNPGEALLRKAKDLRWSLLAMIASCFPDVSPLACLTVWL 2106

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLS 393
            EITAARETSSIKV+DI+S+IAN+VGAAVE TN  P G++ L F YNR+N KRR L+   S
Sbjct: 2107 EITAARETSSIKVDDISSKIANSVGAAVEVTNTLPIGSRMLAFRYNRRNSKRRRLMVPTS 2166

Query: 392  AVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDI---KVLSDPDEGLTSLSKMVSVL 222
              +    S N      S    ++             +   K  +D DEGL SLS MV+VL
Sbjct: 2167 GNSTMGSSFNVPSTSTSTIASIAQEIVSEEESRRMVMEQPKSSNDLDEGLASLSNMVAVL 2226

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQ LFLPLLRAFEMFLPSCSLLPFIR LQ FSQMRL EASAHLASFSARIKEEPF  + 
Sbjct: 2227 CEQHLFLPLLRAFEMFLPSCSLLPFIRFLQAFSQMRLPEASAHLASFSARIKEEPFLGQI 2286

Query: 41   NIGREGQVGAPWI 3
            N  R+G +   WI
Sbjct: 2287 NSARDGLLKTAWI 2299


>ref|XP_007048161.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590708028|ref|XP_007048162.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590708031|ref|XP_007048163.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508700422|gb|EOX92318.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700423|gb|EOX92319.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508700424|gb|EOX92320.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 3218

 Score =  887 bits (2291), Expect = 0.0
 Identities = 467/729 (64%), Positives = 559/729 (76%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SS+AQCTLENLRP LQ +PTLWRTL++  FG D          K+     AL+DY
Sbjct: 1563 VNRHNSSTAQCTLENLRPTLQHYPTLWRTLVSG-FGQDTTFSYFSTRVKN-----ALADY 1616

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR++IF S G DTSL+QMLPCWF K +RRLIQL+VQGP GWQ+L+G+PTGES L RDI
Sbjct: 1617 LNWRDNIFFSTGRDTSLLQMLPCWFPKAVRRLIQLYVQGPLGWQTLSGLPTGESLLDRDI 1676

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             ++IN+ E  E++A+SWEA IQK VE+EL+ SSLE+T  G+EH+LHRGRALAAFNHLL  
Sbjct: 1677 DFYINSDEQTEINAISWEATIQKHVEEELYHSSLEDTGLGLEHHLHRGRALAAFNHLLTS 1736

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R + L  +       ++S Q N+QSDVQ LLAP++++EESLLS+VMP AI+HFED+VLVA
Sbjct: 1737 RVEKLKRDGRS----SASAQTNVQSDVQTLLAPISESEESLLSSVMPFAITHFEDTVLVA 1792

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            S  FLLELCG SASMLR+DVAALRRIS FYKS E  E F   SPKGSAFHAA  + ++  
Sbjct: 1793 SSVFLLELCGSSASMLRVDVAALRRISFFYKSIENREKFTQLSPKGSAFHAASHDDNVME 1852

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+ MH DSS  + Q+   ++   +SK+ SRA++ VLQHLEKAS+PL+ EG+TC
Sbjct: 1853 SLARALADECMHGDSSRNSKQK--GSLISVSSKQPSRALVLVLQHLEKASLPLLVEGKTC 1910

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLL+G+GDG E RS QKAASQ+WSLVT FCQMHQ+PLSTKYL+VLA+DNDWVGFL+EA
Sbjct: 1911 GSWLLTGNGDGTELRSQQKAASQYWSLVTVFCQMHQLPLSTKYLAVLARDNDWVGFLSEA 1970

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+ FD   QVASK+FSDPRLKIHILTVL+SM S  KK  SS + +   + +  S  +
Sbjct: 1971 QIGGYSFDTVFQVASKEFSDPRLKIHILTVLKSMQS--KKKASSQSYLDTSEKSSESPFT 2028

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E N  +PVELF +LA+CEKQK+PGE+LLL+AKD  WS+LAMIASCF DVSPLSCLTVWLE
Sbjct: 2029 EENVYIPVELFRVLADCEKQKNPGESLLLKAKDFSWSILAMIASCFPDVSPLSCLTVWLE 2088

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA 390
            ITAARET SIKVNDIASQIA+NV AAVEATN  P  ++ L+FHYNR++ KRR L+ES+S 
Sbjct: 2089 ITAARETKSIKVNDIASQIADNVAAAVEATNSLPAVSRALSFHYNRQSPKRRRLLESISR 2148

Query: 389  VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVLCEQR 210
               S  S +     +    E S             I V SD +EG  SL+KMV+VLCEQR
Sbjct: 2149 TPLSETSDS---ATRIFSDEGSIAGEDRNVELGEQINVSSDLNEGPASLTKMVAVLCEQR 2205

Query: 209  LFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARANIGR 30
            LFLPLLRAFEMFLPSCSLLPFIRALQ FSQMRLSEASAHL SFSARIKEEP H + NIGR
Sbjct: 2206 LFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPSHLQKNIGR 2265

Query: 29   EGQVGAPWI 3
            E Q+G  WI
Sbjct: 2266 ECQIGISWI 2274


>gb|KRH43338.1| hypothetical protein GLYMA_08G143200 [Glycine max]
          Length = 2853

 Score =  881 bits (2277), Expect = 0.0
 Identities = 462/733 (63%), Positives = 562/733 (76%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  +SSAQCTLENLRP LQ+FPTLWRTLI AC G D   + LVP AK+     ALSDY
Sbjct: 1557 VNRHSNSSAQCTLENLRPTLQKFPTLWRTLIGACLGQDTMAL-LVPKAKT-----ALSDY 1610

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR+ IF S  HDTSL+QMLPCWF K IRRLIQL+VQGP G QS +G PTGE+ LHRDI
Sbjct: 1611 LNWRDDIFFSTSHDTSLLQMLPCWFPKPIRRLIQLYVQGPLGCQSFSGFPTGETLLHRDI 1670

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              FINA  + E++A+SWEA +Q+ +E+EL+   LEE  FG+EH LHRGRALAAFN +LG 
Sbjct: 1671 DLFINADVHAEINAISWEATVQRHIEEELYGPLLEENGFGLEHLLHRGRALAAFNQILGH 1730

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R Q  N  S ++ S ++ GQ NIQSDVQ LL+ V Q+EE+LLS+V+P+AI HFEDS+LVA
Sbjct: 1731 RVQ--NLKSEEESSTSAHGQTNIQSDVQTLLSAVEQSEETLLSSVLPVAIMHFEDSMLVA 1788

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSA+ +RID+A L+RIS FYKSSE NE+    SPKGS FHA   EGD+T 
Sbjct: 1789 SCAFLLELCGLSANKMRIDIAVLKRISLFYKSSENNENLWQLSPKGSVFHAISHEGDVTE 1848

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+Y+H DS  TA +        T SK++SRA++ VL HLEKAS+P + +G+T 
Sbjct: 1849 SLARALADEYLHKDSPATATE--------TVSKQASRALILVLHHLEKASLPQLVDGKTY 1900

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLLSG+GDG E RS +KAASQHW+LVT FC++HQ+PLSTKYL+ LA+DNDW+ FL+EA
Sbjct: 1901 GSWLLSGNGDGNELRSQRKAASQHWTLVTNFCRLHQLPLSTKYLAALARDNDWIEFLSEA 1960

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVS-SPNTVPRGKTNEISLS 753
            Q+ G+ FD  +QVASK+FSDPRL++H+LTVLR M S +K   +   +T+ +G  +E +  
Sbjct: 1961 QIGGYSFDTVVQVASKEFSDPRLRLHMLTVLRGMQSKKKASTALFLDTLEKG--SETTFP 2018

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
             EN   VPVELF +LAECEKQK PGEALL +AK++ WS+LAM+ASCF DVSPLSCLTVWL
Sbjct: 2019 DENM-CVPVELFQILAECEKQKCPGEALLRKAKELSWSILAMVASCFLDVSPLSCLTVWL 2077

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL- 396
            EITAARETSSIKVNDIASQIA+NVGAAV ATN  P G++ LTFHYNR++ KRR LI  + 
Sbjct: 2078 EITAARETSSIKVNDIASQIADNVGAAVNATNALPVGDRVLTFHYNRQSPKRRRLITLVS 2137

Query: 395  --SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVL 222
              S+ +A +D  +  + ++   ++               I V SD  EG  SLSKMV+VL
Sbjct: 2138 LDSSASAISDICSSSISEEIFDSKGKTMENDRKIEHFGCINVPSDSHEGPASLSKMVAVL 2197

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQ+LFLPLLRAFEMFLPSC LLPFIRALQ FSQMRLSEASAHL SFSARIKEEPF+ +A
Sbjct: 2198 CEQQLFLPLLRAFEMFLPSCPLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPFYLQA 2257

Query: 41   NIGREGQVGAPWI 3
            N+GRE Q+GA WI
Sbjct: 2258 NVGREAQIGASWI 2270


>gb|KHN29127.1| Hypothetical protein glysoja_008462 [Glycine soja]
          Length = 3217

 Score =  881 bits (2277), Expect = 0.0
 Identities = 462/733 (63%), Positives = 562/733 (76%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  +SSAQCTLENLRP LQ+FPTLWRTLI AC G D   + LVP AK+     ALSDY
Sbjct: 1557 VNRHSNSSAQCTLENLRPTLQKFPTLWRTLIGACLGQDTMAL-LVPKAKT-----ALSDY 1610

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR+ IF S  HDTSL+QMLPCWF K IRRLIQL+VQGP G QS +G PTGE+ LHRDI
Sbjct: 1611 LNWRDDIFFSTSHDTSLLQMLPCWFPKPIRRLIQLYVQGPLGCQSFSGFPTGETLLHRDI 1670

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              FINA  + E++A+SWEA +Q+ +E+EL+   LEE  FG+EH LHRGRALAAFN +LG 
Sbjct: 1671 DLFINADVHAEINAISWEATVQRHIEEELYGPLLEENGFGLEHLLHRGRALAAFNQILGH 1730

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R Q  N  S ++ S ++ GQ NIQSDVQ LL+ V Q+EE+LLS+V+P+AI HFEDS+LVA
Sbjct: 1731 RVQ--NLKSEEESSTSAHGQTNIQSDVQTLLSAVEQSEETLLSSVLPVAIMHFEDSMLVA 1788

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSA+ +RID+A L+RIS FYKSSE NE+    SPKGS FHA   EGD+T 
Sbjct: 1789 SCAFLLELCGLSANKMRIDIAVLKRISLFYKSSENNENLWQLSPKGSVFHAISHEGDVTE 1848

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+Y+H DS  TA +        T SK++SRA++ VL HLEKAS+P + +G+T 
Sbjct: 1849 SLARALADEYLHKDSPATATE--------TVSKQASRALILVLHHLEKASLPQLVDGKTY 1900

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLLSG+GDG E RS +KAASQHW+LVT FC++HQ+PLSTKYL+ LA+DNDW+ FL+EA
Sbjct: 1901 GSWLLSGNGDGNELRSQRKAASQHWTLVTNFCRLHQLPLSTKYLAALARDNDWIEFLSEA 1960

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVS-SPNTVPRGKTNEISLS 753
            Q+ G+ FD  +QVASK+FSDPRL++H+LTVLR M S +K   +   +T+ +G  +E +  
Sbjct: 1961 QIGGYSFDTVVQVASKEFSDPRLRLHMLTVLRGMQSKKKASTALFLDTLEKG--SETTFP 2018

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
             EN   VPVELF +LAECEKQK PGEALL +AK++ WS+LAM+ASCF DVSPLSCLTVWL
Sbjct: 2019 DENM-CVPVELFQILAECEKQKCPGEALLRKAKELSWSILAMVASCFLDVSPLSCLTVWL 2077

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL- 396
            EITAARETSSIKVNDIASQIA+NVGAAV ATN  P G++ LTFHYNR++ KRR LI  + 
Sbjct: 2078 EITAARETSSIKVNDIASQIADNVGAAVNATNALPVGDRVLTFHYNRQSPKRRRLITLVS 2137

Query: 395  --SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVL 222
              S+ +A +D  +  + ++   ++               I V SD  EG  SLSKMV+VL
Sbjct: 2138 LDSSASAISDICSSSISEEIFDSKGKTMENDRKIEHFGCINVPSDSHEGPASLSKMVAVL 2197

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQ+LFLPLLRAFEMFLPSC LLPFIRALQ FSQMRLSEASAHL SFSARIKEEPF+ +A
Sbjct: 2198 CEQQLFLPLLRAFEMFLPSCPLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPFYLQA 2257

Query: 41   NIGREGQVGAPWI 3
            N+GRE Q+GA WI
Sbjct: 2258 NVGREAQIGASWI 2270


>ref|XP_003532852.1| PREDICTED: uncharacterized protein LOC100800361 isoform X1 [Glycine
            max] gi|571471443|ref|XP_006585313.1| PREDICTED:
            uncharacterized protein LOC100800361 isoform X2 [Glycine
            max] gi|947094750|gb|KRH43335.1| hypothetical protein
            GLYMA_08G143200 [Glycine max] gi|947094751|gb|KRH43336.1|
            hypothetical protein GLYMA_08G143200 [Glycine max]
            gi|947094752|gb|KRH43337.1| hypothetical protein
            GLYMA_08G143200 [Glycine max]
          Length = 3217

 Score =  881 bits (2277), Expect = 0.0
 Identities = 462/733 (63%), Positives = 562/733 (76%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  +SSAQCTLENLRP LQ+FPTLWRTLI AC G D   + LVP AK+     ALSDY
Sbjct: 1557 VNRHSNSSAQCTLENLRPTLQKFPTLWRTLIGACLGQDTMAL-LVPKAKT-----ALSDY 1610

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR+ IF S  HDTSL+QMLPCWF K IRRLIQL+VQGP G QS +G PTGE+ LHRDI
Sbjct: 1611 LNWRDDIFFSTSHDTSLLQMLPCWFPKPIRRLIQLYVQGPLGCQSFSGFPTGETLLHRDI 1670

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
              FINA  + E++A+SWEA +Q+ +E+EL+   LEE  FG+EH LHRGRALAAFN +LG 
Sbjct: 1671 DLFINADVHAEINAISWEATVQRHIEEELYGPLLEENGFGLEHLLHRGRALAAFNQILGH 1730

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R Q  N  S ++ S ++ GQ NIQSDVQ LL+ V Q+EE+LLS+V+P+AI HFEDS+LVA
Sbjct: 1731 RVQ--NLKSEEESSTSAHGQTNIQSDVQTLLSAVEQSEETLLSSVLPVAIMHFEDSMLVA 1788

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSA+ +RID+A L+RIS FYKSSE NE+    SPKGS FHA   EGD+T 
Sbjct: 1789 SCAFLLELCGLSANKMRIDIAVLKRISLFYKSSENNENLWQLSPKGSVFHAISHEGDVTE 1848

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+Y+H DS  TA +        T SK++SRA++ VL HLEKAS+P + +G+T 
Sbjct: 1849 SLARALADEYLHKDSPATATE--------TVSKQASRALILVLHHLEKASLPQLVDGKTY 1900

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLLSG+GDG E RS +KAASQHW+LVT FC++HQ+PLSTKYL+ LA+DNDW+ FL+EA
Sbjct: 1901 GSWLLSGNGDGNELRSQRKAASQHWTLVTNFCRLHQLPLSTKYLAALARDNDWIEFLSEA 1960

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVS-SPNTVPRGKTNEISLS 753
            Q+ G+ FD  +QVASK+FSDPRL++H+LTVLR M S +K   +   +T+ +G  +E +  
Sbjct: 1961 QIGGYSFDTVVQVASKEFSDPRLRLHMLTVLRGMQSKKKASTALFLDTLEKG--SETTFP 2018

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
             EN   VPVELF +LAECEKQK PGEALL +AK++ WS+LAM+ASCF DVSPLSCLTVWL
Sbjct: 2019 DENM-CVPVELFQILAECEKQKCPGEALLRKAKELSWSILAMVASCFLDVSPLSCLTVWL 2077

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL- 396
            EITAARETSSIKVNDIASQIA+NVGAAV ATN  P G++ LTFHYNR++ KRR LI  + 
Sbjct: 2078 EITAARETSSIKVNDIASQIADNVGAAVNATNALPVGDRVLTFHYNRQSPKRRRLITLVS 2137

Query: 395  --SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVL 222
              S+ +A +D  +  + ++   ++               I V SD  EG  SLSKMV+VL
Sbjct: 2138 LDSSASAISDICSSSISEEIFDSKGKTMENDRKIEHFGCINVPSDSHEGPASLSKMVAVL 2197

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQ+LFLPLLRAFEMFLPSC LLPFIRALQ FSQMRLSEASAHL SFSARIKEEPF+ +A
Sbjct: 2198 CEQQLFLPLLRAFEMFLPSCPLLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPFYLQA 2257

Query: 41   NIGREGQVGAPWI 3
            N+GRE Q+GA WI
Sbjct: 2258 NVGREAQIGASWI 2270


>gb|KDO50473.1| hypothetical protein CISIN_1g000037mg [Citrus sinensis]
          Length = 2867

 Score =  877 bits (2265), Expect = 0.0
 Identities = 471/735 (64%), Positives = 561/735 (76%), Gaps = 8/735 (1%)
 Frame = -3

Query: 2183 RRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDYLN 2004
            R  SSSAQCTLENLRP LQRFPTLWRTL+AACFG +     L P AK+      LSDYLN
Sbjct: 1534 RHSSSSAQCTLENLRPTLQRFPTLWRTLVAACFGEEPRCNFLGPKAKN-----DLSDYLN 1588

Query: 2003 WRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDISY 1824
            WR+SIF S+G DTSL Q+LPCWF K +RRLIQL+VQGP GWQS +G+PT E+ L  D+ +
Sbjct: 1589 WRDSIFFSSGRDTSLSQILPCWFPKAVRRLIQLYVQGPLGWQSPSGLPT-ETLLQGDVDF 1647

Query: 1823 FINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGLRG 1644
            F  A  + EVSA+SWEA IQK +E+EL+ +SL+ET  G+EH+LHRGRALAAFN LLG+R 
Sbjct: 1648 FTFADGDAEVSAISWEATIQKHIEEELYDASLKETGIGLEHHLHRGRALAAFNQLLGVRI 1707

Query: 1643 QMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVASC 1464
            + +   S  + S ++ G AN+QSDVQ LLAP+ +NEE LLS+VMPLAISHFEDSVLVASC
Sbjct: 1708 EKMK--SEGRSSSSALGLANVQSDVQTLLAPIIKNEEFLLSSVMPLAISHFEDSVLVASC 1765

Query: 1463 AFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITVSL 1284
             F LELCGLSAS+LR+DV+ALRRISSFYKSSE  E ++  SPK SAF+A P EGDIT SL
Sbjct: 1766 TFFLELCGLSASLLRVDVSALRRISSFYKSSENAESYKQLSPKSSAFYALPHEGDITKSL 1825

Query: 1283 AQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETCGS 1104
            A+ALAD+Y+   S+  A Q+   +    AS R SRA+L VLQHLEKAS+P++ +G+TCGS
Sbjct: 1826 ARALADEYLQEGSATKAKQK--GSPSSVASARPSRALLLVLQHLEKASLPVLLDGKTCGS 1883

Query: 1103 WLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEAQV 924
            WLL+G+GDG E RS QKAASQHW LVT FCQMHQ+PLSTKYL+VLA+DNDWVGFL EAQV
Sbjct: 1884 WLLTGNGDGTELRSQQKAASQHWDLVTVFCQMHQLPLSTKYLAVLAQDNDWVGFLYEAQV 1943

Query: 923  VGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSSEN 744
             G+PF+  +QVASK+FSDPRLKIHILTVLRS+ S RKK  SS N+    +++E S+  EN
Sbjct: 1944 GGYPFEIVVQVASKEFSDPRLKIHILTVLRSLQS-RKKASSSLNS-GATESSESSVLDEN 2001

Query: 743  NGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLEIT 564
               +PVELF +LA+CEKQKSPG+ALL++AK++ WS+LAMIASC+ DV+PLSCLTVWLEIT
Sbjct: 2002 L-YIPVELFRILADCEKQKSPGQALLIKAKELSWSVLAMIASCYPDVTPLSCLTVWLEIT 2060

Query: 563  AARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA-- 390
            AARETSSIKVNDIASQIA+NV AAV+ATN  P   + LTFHYNR++ KRR LIE +SA  
Sbjct: 2061 AARETSSIKVNDIASQIADNVAAAVKATNAIPADGRALTFHYNRQSPKRRRLIEPISADP 2120

Query: 389  ------VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVS 228
                  V+ S  S    + + S   E               +   SD  EG  SLSKMV+
Sbjct: 2121 LVVSSDVSISYPSSTVVIAQGSTGEE-------GKKKVNQCLNFQSDSVEGSASLSKMVA 2173

Query: 227  VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHA 48
            VLCEQ LFLPLLRAFEMFLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEE    
Sbjct: 2174 VLCEQHLFLPLLRAFEMFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEESSQL 2233

Query: 47   RANIGREGQVGAPWI 3
             A  G+EGQ+G  W+
Sbjct: 2234 PAYTGKEGQIGTSWV 2248


>ref|XP_006464509.1| PREDICTED: uncharacterized protein LOC102626916 [Citrus sinensis]
          Length = 3224

 Score =  877 bits (2265), Expect = 0.0
 Identities = 471/735 (64%), Positives = 561/735 (76%), Gaps = 8/735 (1%)
 Frame = -3

Query: 2183 RRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDYLN 2004
            R  SSSAQCTLENLRP LQRFPTLWRTL+AACFG +     L P AK+      LSDYLN
Sbjct: 1567 RHSSSSAQCTLENLRPTLQRFPTLWRTLVAACFGEEPRCNFLGPKAKN-----DLSDYLN 1621

Query: 2003 WRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDISY 1824
            WR+SIF S+G DTSL Q+LPCWF K +RRLIQL+VQGP GWQS +G+PT E+ L  D+ +
Sbjct: 1622 WRDSIFFSSGRDTSLSQILPCWFPKAVRRLIQLYVQGPLGWQSPSGLPT-ETLLQGDVDF 1680

Query: 1823 FINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGLRG 1644
            F  A  + EVSA+SWEA IQK +E+EL+ +SL+ET  G+EH+LHRGRALAAFN LLG+R 
Sbjct: 1681 FTFADGDAEVSAISWEATIQKHIEEELYDASLKETGIGLEHHLHRGRALAAFNQLLGVRI 1740

Query: 1643 QMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVASC 1464
            + +   S  + S ++ G AN+QSDVQ LLAP+ +NEE LLS+VMPLAISHFEDSVLVASC
Sbjct: 1741 EKMK--SEGRSSSSALGLANVQSDVQTLLAPIIKNEEFLLSSVMPLAISHFEDSVLVASC 1798

Query: 1463 AFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITVSL 1284
             F LELCGLSAS+LR+DV+ALRRISSFYKSSE  E ++  SPK SAF+A P EGDIT SL
Sbjct: 1799 TFFLELCGLSASLLRVDVSALRRISSFYKSSENAESYKQLSPKSSAFYALPHEGDITKSL 1858

Query: 1283 AQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETCGS 1104
            A+ALAD+Y+   S+  A Q+   +    AS R SRA+L VLQHLEKAS+P++ +G+TCGS
Sbjct: 1859 ARALADEYLQEGSATKAKQK--GSPSSVASARPSRALLLVLQHLEKASLPVLLDGKTCGS 1916

Query: 1103 WLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEAQV 924
            WLL+G+GDG E RS QKAASQHW LVT FCQMHQ+PLSTKYL+VLA+DNDWVGFL EAQV
Sbjct: 1917 WLLTGNGDGTELRSQQKAASQHWDLVTVFCQMHQLPLSTKYLAVLAQDNDWVGFLYEAQV 1976

Query: 923  VGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSSEN 744
             G+PF+  +QVASK+FSDPRLKIHILTVLRS+ S RKK  SS N+    +++E S+  EN
Sbjct: 1977 GGYPFEIVVQVASKEFSDPRLKIHILTVLRSLQS-RKKASSSLNS-GATESSESSVLDEN 2034

Query: 743  NGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLEIT 564
               +PVELF +LA+CEKQKSPG+ALL++AK++ WS+LAMIASC+ DV+PLSCLTVWLEIT
Sbjct: 2035 L-YIPVELFRILADCEKQKSPGQALLIKAKELSWSVLAMIASCYPDVTPLSCLTVWLEIT 2093

Query: 563  AARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA-- 390
            AARETSSIKVNDIASQIA+NV AAV+ATN  P   + LTFHYNR++ KRR LIE +SA  
Sbjct: 2094 AARETSSIKVNDIASQIADNVAAAVKATNAIPADGRALTFHYNRQSPKRRRLIEPISADP 2153

Query: 389  ------VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVS 228
                  V+ S  S    + + S   E               +   SD  EG  SLSKMV+
Sbjct: 2154 LVVSSDVSISYPSSTVVIAQGSTGEE-------GKKKVNQCLNFQSDSVEGSASLSKMVA 2206

Query: 227  VLCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHA 48
            VLCEQ LFLPLLRAFEMFLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEE    
Sbjct: 2207 VLCEQHLFLPLLRAFEMFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEESSQL 2266

Query: 47   RANIGREGQVGAPWI 3
             A  G+EGQ+G  W+
Sbjct: 2267 PAYTGKEGQIGTSWV 2281


>ref|XP_004289254.1| PREDICTED: uncharacterized protein LOC101305114 [Fragaria vesca
            subsp. vesca]
          Length = 3230

 Score =  875 bits (2262), Expect = 0.0
 Identities = 461/734 (62%), Positives = 551/734 (75%), Gaps = 5/734 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            VKR  S+SAQCTLENLRP LQRFPTLW T ++ACFG D     + P AK+      LSDY
Sbjct: 1568 VKRHSSTSAQCTLENLRPTLQRFPTLWHTFVSACFGQDTTSNLVGPKAKN-----GLSDY 1622

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            L+WR+ IF S+G DTSL+QMLPCWF K +RRLIQL+ QGP GWQS+ G+P GES LHRDI
Sbjct: 1623 LSWRDDIFFSSGRDTSLLQMLPCWFPKAVRRLIQLYAQGPLGWQSIPGLPVGESLLHRDI 1682

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             + +N  ++ E+SA+SWEA IQK +E+EL++S+LE    G+EH+LHRGRALAAFNH LGL
Sbjct: 1683 DFVLNTDDDVEISALSWEATIQKHIEEELYSSALEGNALGLEHHLHRGRALAAFNHFLGL 1742

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R Q L      K  G    QAN+Q+DVQ LL P+T++EESLLS+VMPLAI HFEDSVLVA
Sbjct: 1743 RVQKL------KSEGKGQIQANVQADVQTLLEPITESEESLLSSVMPLAIMHFEDSVLVA 1796

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLEL G SASMLRID+AAL+R+S FYKSSE  ++ +    KGSAFHA   E DI  
Sbjct: 1797 SCAFLLELFGYSASMLRIDIAALKRMSYFYKSSENTDNLRKILTKGSAFHAVGHESDIME 1856

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+Y+  DS+    Q+   ++ V   K+ SRA++  L+ LEKAS+P M +G TC
Sbjct: 1857 SLARALADEYLQQDSARMTKQKGTPSLAVV--KQPSRALMLFLEFLEKASLPSMVDGRTC 1914

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLLSG GDG E RS QKAAS  W+LVT FCQMH +PLST+YLSVLA+DNDWVGFL+EA
Sbjct: 1915 GSWLLSGDGDGIELRSQQKAASHRWNLVTIFCQMHHLPLSTRYLSVLARDNDWVGFLSEA 1974

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+PFD  +QVASKDF DPRLKIHI TVL++M S RK   S+  T+   K +E S + 
Sbjct: 1975 QIGGYPFDTVVQVASKDFCDPRLKIHISTVLKAMQSRRKASSSTTETIE--KRSEASFTD 2032

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E+   VPVELF +LAECEKQK+PGEA+L++AK++ WS+LAMIASCFSDVS +SCLTVWLE
Sbjct: 2033 ESI-CVPVELFRILAECEKQKNPGEAILMKAKELSWSILAMIASCFSDVSAISCLTVWLE 2091

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATN-LSPGGNKDLTFHYNRKNVKRRCLIE--- 402
            ITAARETSSIKVNDIAS+IANNVGAAVEATN L  GG+K LTFHY+R+N KRR L+E   
Sbjct: 2092 ITAARETSSIKVNDIASRIANNVGAAVEATNALQAGGSKSLTFHYSRQNAKRRRLLEPNL 2151

Query: 401  -SLSAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSV 225
               SA T S   G+P  VK      +S             +   +D DE   SLSKMVSV
Sbjct: 2152 GEPSATTMSGILGSPVGVKIFDQGTISEDERNIELGGNMILS--TDSDEASVSLSKMVSV 2209

Query: 224  LCEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHAR 45
            LCEQ LFLPLLRAFEMFLPSCSL+PFIRALQ FSQMRLSEASAHL SFSARIKE+    +
Sbjct: 2210 LCEQHLFLPLLRAFEMFLPSCSLVPFIRALQAFSQMRLSEASAHLGSFSARIKEDSTRLQ 2269

Query: 44   ANIGREGQVGAPWI 3
             N+GR+  +GA WI
Sbjct: 2270 TNVGRDMHIGASWI 2283


>ref|XP_012437402.1| PREDICTED: uncharacterized protein LOC105763656 isoform X2 [Gossypium
            raimondii]
          Length = 3213

 Score =  874 bits (2257), Expect = 0.0
 Identities = 456/729 (62%), Positives = 551/729 (75%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SS+AQCTLENLRP LQ +PTLWRTL++ CFG D +       AK+     AL+DY
Sbjct: 1561 VNRHNSSTAQCTLENLRPTLQHYPTLWRTLVSGCFGQDTSFGFFHTGAKN-----ALADY 1615

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR++IF S G DTSL+QMLPCWF K +RRL+QL+VQGP GWQSL+G+PTGES L RD+
Sbjct: 1616 LNWRDNIFFSTGRDTSLLQMLPCWFPKAVRRLVQLYVQGPLGWQSLSGLPTGESLLDRDV 1675

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             ++INA E  E++A+SWEA IQK VE+EL+ SSL+ET  G+EH+LHRGRALAAFNHLL  
Sbjct: 1676 DFYINADEQAEINAISWEATIQKHVEEELYHSSLKETGLGLEHHLHRGRALAAFNHLLIS 1735

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R + L           +SGQ N+QSDVQ LLAP+++ EE LLS++MP AI+HFED+VLVA
Sbjct: 1736 RVEKLKIEGRTN----ASGQTNVQSDVQTLLAPISEKEECLLSSIMPFAITHFEDNVLVA 1791

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSASMLR+DVA+LRRIS FYKS +  ++ +  S KGSAF  A  +  I  
Sbjct: 1792 SCAFLLELCGLSASMLRVDVASLRRISLFYKSIQNKDNSRQLSSKGSAFQPATHDDSIME 1851

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+ MH D+S  + Q    ++     K+ SRA++ VLQHLEKAS+P + EG+TC
Sbjct: 1852 SLARALADECMHGDNSRNSKQR--GSLISVYGKQPSRALMLVLQHLEKASLPQLVEGKTC 1909

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLL+G+GDG E RS QKAASQ+WSLVT FCQ+HQ+PLSTKYL+VLA+DNDWVGFL EA
Sbjct: 1910 GSWLLTGNGDGTELRSQQKAASQYWSLVTVFCQIHQLPLSTKYLAVLARDNDWVGFLCEA 1969

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+ FD   QVASK+FSDPRLKIHILTVL+S+ S  KK  SS + + +   +      
Sbjct: 1970 QIGGYSFDTVFQVASKEFSDPRLKIHILTVLKSIQS--KKKASSQSYLDKKSESPF---L 2024

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E N  +PVELF +LA+CEKQK+PGEALLL+AKD  WS+LAMIASCF DVSPLSCLTVWLE
Sbjct: 2025 EENVYMPVELFRVLADCEKQKNPGEALLLKAKDFSWSILAMIASCFPDVSPLSCLTVWLE 2084

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA 390
            ITAARET SIKVNDIA+Q+A+NV AAVEATN  PGG++ L+FHYNR+N KRR L+++   
Sbjct: 2085 ITAARETKSIKVNDIATQMADNVAAAVEATNSLPGGSRSLSFHYNRRNPKRRWLLDTSCR 2144

Query: 389  VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVLCEQR 210
               S  S +     +    E S             I V SD +EG  SL+KMV+VLCEQ 
Sbjct: 2145 APLSEASDSS---TRIFSAEGSTAGEEKKVELSEQINVSSDFNEGPASLAKMVAVLCEQH 2201

Query: 209  LFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARANIGR 30
            LFLPLLRAFE+FLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEEP H + NIGR
Sbjct: 2202 LFLPLLRAFELFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPSHLQTNIGR 2261

Query: 29   EGQVGAPWI 3
            +GQVG  WI
Sbjct: 2262 DGQVGMSWI 2270


>gb|KJB46751.1| hypothetical protein B456_008G100800 [Gossypium raimondii]
          Length = 2607

 Score =  874 bits (2257), Expect = 0.0
 Identities = 456/729 (62%), Positives = 551/729 (75%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SS+AQCTLENLRP LQ +PTLWRTL++ CFG D +       AK+     AL+DY
Sbjct: 955  VNRHNSSTAQCTLENLRPTLQHYPTLWRTLVSGCFGQDTSFGFFHTGAKN-----ALADY 1009

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR++IF S G DTSL+QMLPCWF K +RRL+QL+VQGP GWQSL+G+PTGES L RD+
Sbjct: 1010 LNWRDNIFFSTGRDTSLLQMLPCWFPKAVRRLVQLYVQGPLGWQSLSGLPTGESLLDRDV 1069

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             ++INA E  E++A+SWEA IQK VE+EL+ SSL+ET  G+EH+LHRGRALAAFNHLL  
Sbjct: 1070 DFYINADEQAEINAISWEATIQKHVEEELYHSSLKETGLGLEHHLHRGRALAAFNHLLIS 1129

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R + L           +SGQ N+QSDVQ LLAP+++ EE LLS++MP AI+HFED+VLVA
Sbjct: 1130 RVEKLKIEGRTN----ASGQTNVQSDVQTLLAPISEKEECLLSSIMPFAITHFEDNVLVA 1185

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSASMLR+DVA+LRRIS FYKS +  ++ +  S KGSAF  A  +  I  
Sbjct: 1186 SCAFLLELCGLSASMLRVDVASLRRISLFYKSIQNKDNSRQLSSKGSAFQPATHDDSIME 1245

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+ MH D+S  + Q    ++     K+ SRA++ VLQHLEKAS+P + EG+TC
Sbjct: 1246 SLARALADECMHGDNSRNSKQR--GSLISVYGKQPSRALMLVLQHLEKASLPQLVEGKTC 1303

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLL+G+GDG E RS QKAASQ+WSLVT FCQ+HQ+PLSTKYL+VLA+DNDWVGFL EA
Sbjct: 1304 GSWLLTGNGDGTELRSQQKAASQYWSLVTVFCQIHQLPLSTKYLAVLARDNDWVGFLCEA 1363

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+ FD   QVASK+FSDPRLKIHILTVL+S+ S  KK  SS + + +   +      
Sbjct: 1364 QIGGYSFDTVFQVASKEFSDPRLKIHILTVLKSIQS--KKKASSQSYLDKKSESPF---L 1418

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E N  +PVELF +LA+CEKQK+PGEALLL+AKD  WS+LAMIASCF DVSPLSCLTVWLE
Sbjct: 1419 EENVYMPVELFRVLADCEKQKNPGEALLLKAKDFSWSILAMIASCFPDVSPLSCLTVWLE 1478

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA 390
            ITAARET SIKVNDIA+Q+A+NV AAVEATN  PGG++ L+FHYNR+N KRR L+++   
Sbjct: 1479 ITAARETKSIKVNDIATQMADNVAAAVEATNSLPGGSRSLSFHYNRRNPKRRWLLDTSCR 1538

Query: 389  VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVLCEQR 210
               S  S +     +    E S             I V SD +EG  SL+KMV+VLCEQ 
Sbjct: 1539 APLSEASDSS---TRIFSAEGSTAGEEKKVELSEQINVSSDFNEGPASLAKMVAVLCEQH 1595

Query: 209  LFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARANIGR 30
            LFLPLLRAFE+FLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEEP H + NIGR
Sbjct: 1596 LFLPLLRAFELFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPSHLQTNIGR 1655

Query: 29   EGQVGAPWI 3
            +GQVG  WI
Sbjct: 1656 DGQVGMSWI 1664


>gb|KJB46750.1| hypothetical protein B456_008G100800 [Gossypium raimondii]
          Length = 3209

 Score =  874 bits (2257), Expect = 0.0
 Identities = 456/729 (62%), Positives = 551/729 (75%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SS+AQCTLENLRP LQ +PTLWRTL++ CFG D +       AK+     AL+DY
Sbjct: 1557 VNRHNSSTAQCTLENLRPTLQHYPTLWRTLVSGCFGQDTSFGFFHTGAKN-----ALADY 1611

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR++IF S G DTSL+QMLPCWF K +RRL+QL+VQGP GWQSL+G+PTGES L RD+
Sbjct: 1612 LNWRDNIFFSTGRDTSLLQMLPCWFPKAVRRLVQLYVQGPLGWQSLSGLPTGESLLDRDV 1671

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             ++INA E  E++A+SWEA IQK VE+EL+ SSL+ET  G+EH+LHRGRALAAFNHLL  
Sbjct: 1672 DFYINADEQAEINAISWEATIQKHVEEELYHSSLKETGLGLEHHLHRGRALAAFNHLLIS 1731

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R + L           +SGQ N+QSDVQ LLAP+++ EE LLS++MP AI+HFED+VLVA
Sbjct: 1732 RVEKLKIEGRTN----ASGQTNVQSDVQTLLAPISEKEECLLSSIMPFAITHFEDNVLVA 1787

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSASMLR+DVA+LRRIS FYKS +  ++ +  S KGSAF  A  +  I  
Sbjct: 1788 SCAFLLELCGLSASMLRVDVASLRRISLFYKSIQNKDNSRQLSSKGSAFQPATHDDSIME 1847

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+ MH D+S  + Q    ++     K+ SRA++ VLQHLEKAS+P + EG+TC
Sbjct: 1848 SLARALADECMHGDNSRNSKQR--GSLISVYGKQPSRALMLVLQHLEKASLPQLVEGKTC 1905

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLL+G+GDG E RS QKAASQ+WSLVT FCQ+HQ+PLSTKYL+VLA+DNDWVGFL EA
Sbjct: 1906 GSWLLTGNGDGTELRSQQKAASQYWSLVTVFCQIHQLPLSTKYLAVLARDNDWVGFLCEA 1965

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+ FD   QVASK+FSDPRLKIHILTVL+S+ S  KK  SS + + +   +      
Sbjct: 1966 QIGGYSFDTVFQVASKEFSDPRLKIHILTVLKSIQS--KKKASSQSYLDKKSESPF---L 2020

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E N  +PVELF +LA+CEKQK+PGEALLL+AKD  WS+LAMIASCF DVSPLSCLTVWLE
Sbjct: 2021 EENVYMPVELFRVLADCEKQKNPGEALLLKAKDFSWSILAMIASCFPDVSPLSCLTVWLE 2080

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA 390
            ITAARET SIKVNDIA+Q+A+NV AAVEATN  PGG++ L+FHYNR+N KRR L+++   
Sbjct: 2081 ITAARETKSIKVNDIATQMADNVAAAVEATNSLPGGSRSLSFHYNRRNPKRRWLLDTSCR 2140

Query: 389  VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVLCEQR 210
               S  S +     +    E S             I V SD +EG  SL+KMV+VLCEQ 
Sbjct: 2141 APLSEASDSS---TRIFSAEGSTAGEEKKVELSEQINVSSDFNEGPASLAKMVAVLCEQH 2197

Query: 209  LFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARANIGR 30
            LFLPLLRAFE+FLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEEP H + NIGR
Sbjct: 2198 LFLPLLRAFELFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPSHLQTNIGR 2257

Query: 29   EGQVGAPWI 3
            +GQVG  WI
Sbjct: 2258 DGQVGMSWI 2266


>ref|XP_012437401.1| PREDICTED: uncharacterized protein LOC105763656 isoform X1 [Gossypium
            raimondii] gi|763779678|gb|KJB46749.1| hypothetical
            protein B456_008G100800 [Gossypium raimondii]
          Length = 3225

 Score =  874 bits (2257), Expect = 0.0
 Identities = 456/729 (62%), Positives = 551/729 (75%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            V R  SS+AQCTLENLRP LQ +PTLWRTL++ CFG D +       AK+     AL+DY
Sbjct: 1573 VNRHNSSTAQCTLENLRPTLQHYPTLWRTLVSGCFGQDTSFGFFHTGAKN-----ALADY 1627

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNWR++IF S G DTSL+QMLPCWF K +RRL+QL+VQGP GWQSL+G+PTGES L RD+
Sbjct: 1628 LNWRDNIFFSTGRDTSLLQMLPCWFPKAVRRLVQLYVQGPLGWQSLSGLPTGESLLDRDV 1687

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             ++INA E  E++A+SWEA IQK VE+EL+ SSL+ET  G+EH+LHRGRALAAFNHLL  
Sbjct: 1688 DFYINADEQAEINAISWEATIQKHVEEELYHSSLKETGLGLEHHLHRGRALAAFNHLLIS 1747

Query: 1649 RGQMLNENSHKKQSGASSGQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLVA 1470
            R + L           +SGQ N+QSDVQ LLAP+++ EE LLS++MP AI+HFED+VLVA
Sbjct: 1748 RVEKLKIEGRTN----ASGQTNVQSDVQTLLAPISEKEECLLSSIMPFAITHFEDNVLVA 1803

Query: 1469 SCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDITV 1290
            SCAFLLELCGLSASMLR+DVA+LRRIS FYKS +  ++ +  S KGSAF  A  +  I  
Sbjct: 1804 SCAFLLELCGLSASMLRVDVASLRRISLFYKSIQNKDNSRQLSSKGSAFQPATHDDSIME 1863

Query: 1289 SLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGETC 1110
            SLA+ALAD+ MH D+S  + Q    ++     K+ SRA++ VLQHLEKAS+P + EG+TC
Sbjct: 1864 SLARALADECMHGDNSRNSKQR--GSLISVYGKQPSRALMLVLQHLEKASLPQLVEGKTC 1921

Query: 1109 GSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTEA 930
            GSWLL+G+GDG E RS QKAASQ+WSLVT FCQ+HQ+PLSTKYL+VLA+DNDWVGFL EA
Sbjct: 1922 GSWLLTGNGDGTELRSQQKAASQYWSLVTVFCQIHQLPLSTKYLAVLARDNDWVGFLCEA 1981

Query: 929  QVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLSS 750
            Q+ G+ FD   QVASK+FSDPRLKIHILTVL+S+ S  KK  SS + + +   +      
Sbjct: 1982 QIGGYSFDTVFQVASKEFSDPRLKIHILTVLKSIQS--KKKASSQSYLDKKSESPF---L 2036

Query: 749  ENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWLE 570
            E N  +PVELF +LA+CEKQK+PGEALLL+AKD  WS+LAMIASCF DVSPLSCLTVWLE
Sbjct: 2037 EENVYMPVELFRVLADCEKQKNPGEALLLKAKDFSWSILAMIASCFPDVSPLSCLTVWLE 2096

Query: 569  ITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESLSA 390
            ITAARET SIKVNDIA+Q+A+NV AAVEATN  PGG++ L+FHYNR+N KRR L+++   
Sbjct: 2097 ITAARETKSIKVNDIATQMADNVAAAVEATNSLPGGSRSLSFHYNRRNPKRRWLLDTSCR 2156

Query: 389  VTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVLCEQR 210
               S  S +     +    E S             I V SD +EG  SL+KMV+VLCEQ 
Sbjct: 2157 APLSEASDSS---TRIFSAEGSTAGEEKKVELSEQINVSSDFNEGPASLAKMVAVLCEQH 2213

Query: 209  LFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARANIGR 30
            LFLPLLRAFE+FLPSCS LPFIRALQ FSQMRLSEASAHL SFSARIKEEP H + NIGR
Sbjct: 2214 LFLPLLRAFELFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSARIKEEPSHLQTNIGR 2273

Query: 29   EGQVGAPWI 3
            +GQVG  WI
Sbjct: 2274 DGQVGMSWI 2282


>ref|XP_011021957.1| PREDICTED: uncharacterized protein LOC105123888 isoform X3 [Populus
            euphratica]
          Length = 3235

 Score =  872 bits (2252), Expect = 0.0
 Identities = 458/733 (62%), Positives = 554/733 (75%), Gaps = 4/733 (0%)
 Frame = -3

Query: 2189 VKRRFSSSAQCTLENLRPALQRFPTLWRTLIAACFGHDANGISLVPDAKSVFGNSALSDY 2010
            VKR  SSSAQCTLENLRP LQ+FPTLWRTL+AA FGHD     L P   +    +AL++Y
Sbjct: 1570 VKRHGSSSAQCTLENLRPTLQQFPTLWRTLVAASFGHDTASNFLGPKGNT----NALANY 1625

Query: 2009 LNWRESIFSSAGHDTSLVQMLPCWFSKGIRRLIQLFVQGPFGWQSLAGVPTGESFLHRDI 1830
            LNW ++IF S   DTSL+QMLPCWF K +RRLIQL +QGP GWQS++G+P GE+ L RD 
Sbjct: 1626 LNWHDNIFFSTTRDTSLLQMLPCWFPKAVRRLIQLHIQGPLGWQSVSGLPAGETLLCRDF 1685

Query: 1829 SYFINAHENGEVSAMSWEAAIQKSVEKELFASSLEETTFGVEHYLHRGRALAAFNHLLGL 1650
             +F++A E+ E++ + WEA IQK V++EL+ SSLEET  G+EH+LHRGRALAAFNH+LG+
Sbjct: 1686 DFFMHAEEHTEINGVYWEATIQKHVQEELYNSSLEETKLGLEHHLHRGRALAAFNHILGV 1745

Query: 1649 RGQMLNENSHKKQSGASS-GQANIQSDVQMLLAPVTQNEESLLSTVMPLAISHFEDSVLV 1473
            R Q L       QSGASS GQ N+QSDVQ LLAP+TQ+EE+ LS+V+PLAI+HF DSVLV
Sbjct: 1746 RAQKLKLEG---QSGASSHGQRNVQSDVQALLAPLTQSEEAALSSVIPLAIAHFMDSVLV 1802

Query: 1472 ASCAFLLELCGLSASMLRIDVAALRRISSFYKSSEYNEHFQHFSPKGSAFHAAPREGDIT 1293
            +SCAFLLELCGLSASML +DV+ALRRISSFYK SE NE +   SP+GSAF +    G++ 
Sbjct: 1803 SSCAFLLELCGLSASMLHVDVSALRRISSFYKLSENNEKYSQISPQGSAFQSISHGGNVV 1862

Query: 1292 VSLAQALADDYMHCDSSGTADQEEISNIGVTASKRSSRAVLAVLQHLEKASVPLMAEGET 1113
             SLA++LAD+Y+H D    +  +  SN    A K+SSRA++ VLQHLEKAS+PLM +G+T
Sbjct: 1863 ESLARSLADEYLHKDRVTNSKLKGTSNS--FAGKQSSRALMLVLQHLEKASLPLMMDGKT 1920

Query: 1112 CGSWLLSGSGDGAEFRSHQKAASQHWSLVTAFCQMHQIPLSTKYLSVLAKDNDWVGFLTE 933
            CGSWLL+G GDG E R  QK ASQHW+LVT FCQMHQ+PLSTKYL+VLA+DNDWVGFL+E
Sbjct: 1921 CGSWLLTGIGDGTELRDQQKVASQHWNLVTLFCQMHQLPLSTKYLTVLARDNDWVGFLSE 1980

Query: 932  AQVVGHPFDATIQVASKDFSDPRLKIHILTVLRSMYSTRKKPVSSPNTVPRGKTNEISLS 753
            AQ+ G+PFD+ +QVA+K+FSDPRLKIHILTVL+ M S +K    SP     GK+   +  
Sbjct: 1981 AQIGGYPFDSVVQVATKEFSDPRLKIHILTVLKGMQSRKKS--GSPAYTYTGKSGSETHC 2038

Query: 752  SENNGMVPVELFGLLAECEKQKSPGEALLLRAKDMRWSLLAMIASCFSDVSPLSCLTVWL 573
             + + ++P ELF +LA+CEKQK+PGE+LL +AK+M WS+LAMIASCF D SPLSCLTVWL
Sbjct: 2039 FQEDMLIPAELFRILADCEKQKNPGESLLKKAKEMSWSILAMIASCFPDASPLSCLTVWL 2098

Query: 572  EITAARETSSIKVNDIASQIANNVGAAVEATNLSPGGNKDLTFHYNRKNVKRRCLIESL- 396
            EITAARETSSIKVNDIASQIA+NV AAV+ATN  P G++ LT HYNR+N KRR L+E + 
Sbjct: 2099 EITAARETSSIKVNDIASQIADNVEAAVQATNSLPAGSRVLTVHYNRQNAKRRRLMEPMY 2158

Query: 395  --SAVTASNDSGNPGVVKKSVPTELSPXXXXXXXXXXXDIKVLSDPDEGLTSLSKMVSVL 222
              S V   + S   G   +  P                +  V SD DEG  SLSKMV+VL
Sbjct: 2159 VDSLVAIDDVSTTYGGATR--PASQGAVAEEERKVDFGEKNVSSDSDEGPVSLSKMVAVL 2216

Query: 221  CEQRLFLPLLRAFEMFLPSCSLLPFIRALQTFSQMRLSEASAHLASFSARIKEEPFHARA 42
            CEQRLFLPLLRAFEMFLPSCS LPFIRALQ FSQMRLSEASAHL SFS RIK+E    +A
Sbjct: 2217 CEQRLFLPLLRAFEMFLPSCSFLPFIRALQAFSQMRLSEASAHLGSFSVRIKDEQTSMQA 2276

Query: 41   NIGREGQVGAPWI 3
            NI  EGQV   WI
Sbjct: 2277 NIVIEGQVRTSWI 2289

