BLASTX nr result

ID: Aconitum23_contig00001225 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00001225
         (2072 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010271173.1| PREDICTED: uncharacterized protein LOC104607...   655   0.0  
ref|XP_010241461.1| PREDICTED: uncharacterized protein LOC104586...   601   e-169
ref|XP_010648423.1| PREDICTED: uncharacterized protein LOC100252...   585   e-164
ref|XP_010648352.1| PREDICTED: uncharacterized protein LOC100252...   580   e-162
ref|XP_010648465.1| PREDICTED: uncharacterized protein LOC100252...   578   e-162
ref|XP_010105545.1| hypothetical protein L484_019288 [Morus nota...   577   e-161
ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320...   576   e-161
ref|XP_009351588.1| PREDICTED: uncharacterized protein LOC103943...   570   e-159
ref|XP_012072373.1| PREDICTED: uncharacterized protein LOC105634...   563   e-157
ref|XP_008381778.1| PREDICTED: uncharacterized protein LOC103444...   561   e-157
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              551   e-154
ref|XP_008360022.1| PREDICTED: uncharacterized protein LOC103423...   550   e-153
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   549   e-153
gb|KHN25979.1| hypothetical protein glysoja_018833 [Glycine soja]     542   e-151
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   541   e-151
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   541   e-150
ref|XP_008365818.1| PREDICTED: uncharacterized protein LOC103429...   540   e-150
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   540   e-150
gb|KHN28877.1| hypothetical protein glysoja_025671 [Glycine soja]     538   e-150
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   537   e-149

>ref|XP_010271173.1| PREDICTED: uncharacterized protein LOC104607267 [Nelumbo nucifera]
          Length = 698

 Score =  655 bits (1689), Expect = 0.0
 Identities = 363/696 (52%), Positives = 439/696 (63%), Gaps = 7/696 (1%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG--EIHNRQWFVDDRDRFISWLRGEFAAANAIIDSLC 1897
            MAMP GNVVISDKMQFPS GG  G  E+H+RQWF D+RD FISWLRGEFAAANAIIDSLC
Sbjct: 1    MAMPSGNVVISDKMQFPSGGGGAGSGEVHHRQWFPDERDGFISWLRGEFAAANAIIDSLC 60

Query: 1896 HHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRHXXXX 1717
            HHLRS+GEP EYDVV   IQQRRC+WN VLHMQQYFSIAEV  ALQQ AW+KQ RH    
Sbjct: 61   HHLRSIGEPREYDVVISCIQQRRCNWNPVLHMQQYFSIAEVMYALQQVAWRKQQRHFDQM 120

Query: 1716 XXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXXXXXX 1540
                    K+G QG+ S+Q  R EN KEN  S++E       +S Q VNM+S        
Sbjct: 121  KITEKDFKKNGPQGIGSRQGHRAENVKENHKSNSETHYLDANTSPQPVNMESEKTEEEPE 180

Query: 1539 XXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPCDDTKAGAAN 1360
                  Q A+  +SD+  S   +E+E       SH   GLK+S N E    ++ +    +
Sbjct: 181  KGEAVKQGAKVERSDDKGSALGEEREGGDSVEKSHSGSGLKNSENPERSEHENLEIEVVD 240

Query: 1359 DDTLLNLKDSIQKQDKEENLDV-VPKTFVATEILDGKAVNVVEGLKLYGETFDSLKISNL 1183
            D  +     +  ++   + + V +PKTFV TEI DG  VNVVEGLKLY + FD  +IS L
Sbjct: 241  DGCISKGTSNALQKGATDTIQVPIPKTFVGTEIFDGNVVNVVEGLKLYEDLFDGSEISKL 300

Query: 1182 VQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNSGSCEDGK 1003
            + L NELR+AGR+GQ +GQ+F+V KRPMKG GRE+IQLG+PIADAP EDE+ +GS +D K
Sbjct: 301  LLLVNELRTAGRKGQFQGQTFVVLKRPMKGHGREMIQLGLPIADAPPEDESTAGSSKDKK 360

Query: 1002 MEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVCTLFLTEC 823
            ME IP LLQ+VI+ L+ LQV T K DSCIIDFFNEGDHSQPH  PPWFGRPV  LFLTEC
Sbjct: 361  MEPIPGLLQDVIDNLVHLQVMTTKADSCIIDFFNEGDHSQPHTFPPWFGRPVSVLFLTEC 420

Query: 822  DVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRILITFTKSQ 643
            ++TFGRVIGV HPG+Y              +MQGKSADFAKHAI SIRKQRIL+TFTKSQ
Sbjct: 421  NMTFGRVIGVDHPGDYRGSLNLSLAAGSVLTMQGKSADFAKHAIPSIRKQRILVTFTKSQ 480

Query: 642  PKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYGAATTTGVLXXXXX 463
            PKK   S++    P++A  P  WG                  PKHYGA  TTGVL     
Sbjct: 481  PKK-STSNESLRAPSTAGPPSPWG--PPPSRPLGHHVRHPAGPKHYGAVPTTGVLPAPPI 537

Query: 462  XXXXXXXXXXXXXIFITSSLAPAAMPYPT-PVPLPPASSGWAAVXXXXXXXXXXXXPGTG 286
                         +F+T+ +A A +PYPT PVPLPPAS+GW AV            PGTG
Sbjct: 538  RAQHLPPPNGMQPLFVTAPVA-APVPYPTAPVPLPPASAGWPAVPPPRHPPPRLPVPGTG 596

Query: 285  VFLPPQGSGPA--PQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASPNSELDGNT 112
            VFLPP GSGP+  PQ    A+A   +   ETP+ ++N     +SN N +ASP S+LDG  
Sbjct: 597  VFLPPPGSGPSPPPQAQQPATATESSIAVETPTQVENENGLEKSNGNSNASPKSKLDGKG 656

Query: 111  KRQDCNGNVGIDLGGTVAGEEQSDDLENGASKAVDR 4
             RQ+CNGN+  + G  V G+E+     N   K   +
Sbjct: 657  PRQECNGNISSNSGARVVGKEEHQQSANIKKKVASK 692


>ref|XP_010241461.1| PREDICTED: uncharacterized protein LOC104586031 [Nelumbo nucifera]
            gi|719962706|ref|XP_010241536.1| PREDICTED:
            uncharacterized protein LOC104586031 [Nelumbo nucifera]
            gi|719962709|ref|XP_010241609.1| PREDICTED:
            uncharacterized protein LOC104586031 [Nelumbo nucifera]
          Length = 696

 Score =  601 bits (1549), Expect = e-169
 Identities = 350/695 (50%), Positives = 420/695 (60%), Gaps = 20/695 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG--EIHNRQWFVDDRDRFISWLRGEFAAANAIIDSLC 1897
            MA P GNVVISDKMQFPS GG GG  E+H+RQWF D+RD FISWLRGEFAAANAIIDSLC
Sbjct: 1    MATPSGNVVISDKMQFPSGGGGGGGGEVHHRQWFPDERDGFISWLRGEFAAANAIIDSLC 60

Query: 1896 HHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRHXXXX 1717
            HHLRS+GEPGEYDVV G IQQRRC+WN VLHMQQYFS+AEV   LQQAAW++Q RH    
Sbjct: 61   HHLRSIGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVMFQLQQAAWRRQQRHFDQM 120

Query: 1716 XXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXXXXXX 1540
                    K+G QGV S+     EN KE+   ++E  +    +SA+ VN +         
Sbjct: 121  KITEKDFKKNGPQGVGSRPGHWAENVKESHKVNSEIHHHDANTSARSVNTEPDKPEEPEK 180

Query: 1539 XXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPCDDTKAGAAN 1360
                  Q+A   +S+   S   +EKE +     SH D  LK S N   I  D+ +  A +
Sbjct: 181  GEVSK-QRANVERSNNKSSALGEEKEGLKNMERSHADSSLKGSENAVAIERDNPELEAMD 239

Query: 1359 DDTLLNLKDSIQKQDKEENLDV-VPKTFVATEILDGKAVNVVEGLKLYGETFDSLKISNL 1183
            D        S  +    + +   VPKTFV  EI DG  VNVVEGLK Y E F S +IS L
Sbjct: 240  DGCSSKGTSSAPQMAAADTIQTPVPKTFVGIEIFDGNTVNVVEGLKFYEELFGSSEISKL 299

Query: 1182 VQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNSGSCEDGK 1003
            + L NELR+AGR+GQ +GQ+F VSKRPMKG GRE+IQLG+PIADAP E+ + +G+ +D K
Sbjct: 300  LSLVNELRAAGRKGQFQGQTFAVSKRPMKGHGREMIQLGIPIADAPPEEGSATGTFKDCK 359

Query: 1002 MEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVCTLFLTEC 823
            ME IP LLQ+VI+ L+ LQV T+KPDSCIIDFFNEGDHSQPH+ PPWFGRPVC LFLTEC
Sbjct: 360  MEPIPGLLQDVIDHLVHLQVMTMKPDSCIIDFFNEGDHSQPHMFPPWFGRPVCILFLTEC 419

Query: 822  DVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRILITFTKSQ 643
             +TFGRVI V HPG+Y              +MQGKSADFA+HAI S+RKQRI++TFTKSQ
Sbjct: 420  IMTFGRVIVVDHPGDYRGSLKLSLAAGTLLTMQGKSADFARHAIPSVRKQRIVVTFTKSQ 479

Query: 642  PKKVMLSSDGQPLPTSAAA---PHCWGXXXXXXXXXXXXXXXXXXPKHYGA--ATTTGVL 478
            PKK M  SD    P+S++A   P  WG                   KHYG     TTGVL
Sbjct: 480  PKKTM-PSDSSRGPSSSSAGGSPSPWG--PSPGRPLGNVRHPAGPNKHYGGVPTPTTGVL 536

Query: 477  --XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPT-PVPLPPASSGWAAVXXXXXXXXX 307
                                +F+T+ +AP  +PYPT PVP+P AS+GW AV         
Sbjct: 537  PAPPIRPQHLPPPNGIGMQPLFVTAPVAP-PVPYPTAPVPIPSASTGWPAVPPPRHPPPR 595

Query: 306  XXXPGTGVFLPPQGSGPAP--QQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASPN 133
               PGTGVFLPP GSG +P  QQ +S S    +   E P           SN N S SP 
Sbjct: 596  FPVPGTGVFLPPPGSGHSPPSQQPISGSVTETSFAVEIPQ-------HPESNSNNSTSPK 648

Query: 132  SELDGNTKRQDCNGNVG------IDLGGTVAGEEQ 46
             + DG  + Q+CNG+V          GG V  EEQ
Sbjct: 649  GKSDGKGQSQECNGSVSGTSPSTTTTGGGVGKEEQ 683


>ref|XP_010648423.1| PREDICTED: uncharacterized protein LOC100252594 isoform X2 [Vitis
            vinifera]
          Length = 704

 Score =  585 bits (1507), Expect = e-164
 Identities = 351/717 (48%), Positives = 429/717 (59%), Gaps = 27/717 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG-----EIHN-RQWFVDDRDRFISWLRGEFAAANAII 1909
            MAMP GNVVISDKMQFP  GG GG     EIH+ RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLC+HLR +GEPGEYD V G IQQRR +W+ VLHMQQYFS+AEV  ALQQ  W++Q RH
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXX 1552
                        +    GV  +Q  R E  K++ NS+ E+ +    SS     ++     
Sbjct: 121  LDPVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSG---TLEKGERV 174

Query: 1551 XXXXXXXEAIQKARA-GKSDENKSPNAQEKEVVLGS-ADSHGDVGLKDSGNVEGIPC--D 1384
                   +   K    GK ++     A+EK+    + A  + +   K S N EG  C   
Sbjct: 175  SEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGIS 234

Query: 1383 DTKAGAANDDTLLNLKDS-----------IQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
            +T+A   +D   LN K S           +Q Q+++ N    PKTFV TEI DGKAVNVV
Sbjct: 235  ETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 294

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPI 1057
            +GLKLY E FD  ++S  V L N+LR+AG+RGQL+GQ+F+VSKRPMKG GRE+IQLGVPI
Sbjct: 295  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPI 354

Query: 1056 ADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPH 877
            ADAP EDE+  G+ +D + E+IPSLLQ+VI  L+  QV TVKPD+CIIDF+NEGDHSQPH
Sbjct: 355  ADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPH 414

Query: 876  VCPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKH 697
            + P WFGRPVC LFLTECD+TFGRVIG  HPG+Y               MQGKSADFAKH
Sbjct: 415  IWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKH 474

Query: 696  AISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXX 517
            AI S+RKQRIL+TFTKSQPKK M +SDGQ L   AA    W                   
Sbjct: 475  AIPSLRKQRILVTFTKSQPKKTM-ASDGQRLLPPAAQSSHW---VPPPSRSPNHMRHPMG 530

Query: 516  PKHYGAATTTGVL-XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWA 340
            PKHYGA  TTGVL                   +F+T+++AP AMP+P PVPLP  S GW 
Sbjct: 531  PKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAP-AMPFPAPVPLPTGSPGWP 589

Query: 339  AVXXXXXXXXXXXXPGTGVFLPPQGSG-PAPQQTVSASAATVNSTTETPSTLDNTKWSVR 163
            A             PGTGVFLPP GSG  +  Q +S  A + +  T  P+  +N      
Sbjct: 590  AA-PPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSS 648

Query: 162  SNCNGSASPNSELDGNTKRQDCNGNV---GIDLGGTVAGEEQSDDLENGASKAVDRV 1
            SN N + SP  +LDG   RQ+CNG++   G+D       E+Q +D    ASK    V
Sbjct: 649  SNSN-TVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGAV 704


>ref|XP_010648352.1| PREDICTED: uncharacterized protein LOC100252594 isoform X1 [Vitis
            vinifera] gi|731369021|ref|XP_010648386.1| PREDICTED:
            uncharacterized protein LOC100252594 isoform X1 [Vitis
            vinifera]
          Length = 705

 Score =  580 bits (1495), Expect = e-162
 Identities = 351/718 (48%), Positives = 429/718 (59%), Gaps = 28/718 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG-----EIHN-RQWFVDDRDRFISWLRGEFAAANAII 1909
            MAMP GNVVISDKMQFP  GG GG     EIH+ RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLC+HLR +GEPGEYD V G IQQRR +W+ VLHMQQYFS+AEV  ALQQ  W++Q RH
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXX 1552
                        +    GV  +Q  R E  K++ NS+ E+ +    SS     ++     
Sbjct: 121  LDPVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSG---TLEKGERV 174

Query: 1551 XXXXXXXEAIQKARA-GKSDENKSPNAQEKEVVLGS-ADSHGDVGLKDSGNVEGIPC--D 1384
                   +   K    GK ++     A+EK+    + A  + +   K S N EG  C   
Sbjct: 175  SEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGIS 234

Query: 1383 DTKAGAANDDTLLNLKDS-----------IQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
            +T+A   +D   LN K S           +Q Q+++ N    PKTFV TEI DGKAVNVV
Sbjct: 235  ETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 294

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLK-GQSFIVSKRPMKGRGREVIQLGVP 1060
            +GLKLY E FD  ++S  V L N+LR+AG+RGQL+ GQ+F+VSKRPMKG GRE+IQLGVP
Sbjct: 295  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVP 354

Query: 1059 IADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQP 880
            IADAP EDE+  G+ +D + E+IPSLLQ+VI  L+  QV TVKPD+CIIDF+NEGDHSQP
Sbjct: 355  IADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQP 414

Query: 879  HVCPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAK 700
            H+ P WFGRPVC LFLTECD+TFGRVIG  HPG+Y               MQGKSADFAK
Sbjct: 415  HIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAK 474

Query: 699  HAISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXX 520
            HAI S+RKQRIL+TFTKSQPKK M +SDGQ L   AA    W                  
Sbjct: 475  HAIPSLRKQRILVTFTKSQPKKTM-ASDGQRLLPPAAQSSHW---VPPPSRSPNHMRHPM 530

Query: 519  XPKHYGAATTTGVL-XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGW 343
             PKHYGA  TTGVL                   +F+T+++AP AMP+P PVPLP  S GW
Sbjct: 531  GPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAP-AMPFPAPVPLPTGSPGW 589

Query: 342  AAVXXXXXXXXXXXXPGTGVFLPPQGSG-PAPQQTVSASAATVNSTTETPSTLDNTKWSV 166
             A             PGTGVFLPP GSG  +  Q +S  A + +  T  P+  +N     
Sbjct: 590  PAA-PPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKS 648

Query: 165  RSNCNGSASPNSELDGNTKRQDCNGNV---GIDLGGTVAGEEQSDDLENGASKAVDRV 1
             SN N + SP  +LDG   RQ+CNG++   G+D       E+Q +D    ASK    V
Sbjct: 649  SSNSN-TVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGAV 705


>ref|XP_010648465.1| PREDICTED: uncharacterized protein LOC100252594 isoform X3 [Vitis
            vinifera]
          Length = 699

 Score =  578 bits (1489), Expect = e-162
 Identities = 348/712 (48%), Positives = 427/712 (59%), Gaps = 22/712 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG-----EIHN-RQWFVDDRDRFISWLRGEFAAANAII 1909
            MAMP GNVVISDKMQFP  GG GG     EIH+ RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLC+HLR +GEPGEYD V G IQQRR +W+ VLHMQQYFS+AEV  ALQQ  W++Q RH
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXX 1552
                        +    GV  +Q  R E  K++ NS+ E+ +    SS     ++     
Sbjct: 121  LDPVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSG---TLEKGERV 174

Query: 1551 XXXXXXXEAIQKARA-GKSDENKSPNAQEKEVVLGS-ADSHGDVGLKDSGNVEGIPC--D 1384
                   +   K    GK ++     A+EK+    + A  + +   K S N EG  C   
Sbjct: 175  SEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGIS 234

Query: 1383 DTKAGAANDDTLLNL-----KDSIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLY 1219
            +T+A   +D    N+        +Q Q+++ N    PKTFV TEI DGKAVNVV+GLKLY
Sbjct: 235  ETEANDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLY 294

Query: 1218 GETFDSLKISNLVQLANELRSAGRRGQLK-GQSFIVSKRPMKGRGREVIQLGVPIADAPQ 1042
             E FD  ++S  V L N+LR+AG+RGQL+ GQ+F+VSKRPMKG GRE+IQLGVPIADAP 
Sbjct: 295  EELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIADAPL 354

Query: 1041 EDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPW 862
            EDE+  G+ +D + E+IPSLLQ+VI  L+  QV TVKPD+CIIDF+NEGDHSQPH+ P W
Sbjct: 355  EDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTW 414

Query: 861  FGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSI 682
            FGRPVC LFLTECD+TFGRVIG  HPG+Y               MQGKSADFAKHAI S+
Sbjct: 415  FGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSL 474

Query: 681  RKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYG 502
            RKQRIL+TFTKSQPKK M +SDGQ L   AA    W                   PKHYG
Sbjct: 475  RKQRILVTFTKSQPKKTM-ASDGQRLLPPAAQSSHW---VPPPSRSPNHMRHPMGPKHYG 530

Query: 501  AATTTGVL-XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXX 325
            A  TTGVL                   +F+T+++AP AMP+P PVPLP  S GW A    
Sbjct: 531  AVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAP-AMPFPAPVPLPTGSPGWPAA-PP 588

Query: 324  XXXXXXXXXPGTGVFLPPQGSG-PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNG 148
                     PGTGVFLPP GSG  +  Q +S  A + +  T  P+  +N      SN N 
Sbjct: 589  RHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSN- 647

Query: 147  SASPNSELDGNTKRQDCNGNV---GIDLGGTVAGEEQSDDLENGASKAVDRV 1
            + SP  +LDG   RQ+CNG++   G+D       E+Q +D    ASK    V
Sbjct: 648  TVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGAV 699


>ref|XP_010105545.1| hypothetical protein L484_019288 [Morus notabilis]
            gi|587917472|gb|EXC05040.1| hypothetical protein
            L484_019288 [Morus notabilis]
          Length = 681

 Score =  577 bits (1486), Expect = e-161
 Identities = 342/698 (48%), Positives = 419/698 (60%), Gaps = 14/698 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGGEI---HNRQWFVDDRDRFISWLRGEFAAANAIIDSL 1900
            MAMP GNVV SDKMQFPS     GEI   +NRQWF D+RD FISWLRGEFAAANA+IDSL
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMIDSL 60

Query: 1899 CHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRHXXX 1720
            CHHLR+VGEPGEYD V   IQ RRC+WN VLHMQQYFS+AEV  ALQQ AW++Q R    
Sbjct: 61   CHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFYDP 120

Query: 1719 XXXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXXXXX 1543
                     +SG   V  KQW R ++ K+ +NS+AE  +  +  ++ F N  S       
Sbjct: 121  VKMGNKEFKRSG---VGFKQWQRNDSFKDGRNSAAE--SHCLDGNSSFGNAASEKGGSDK 175

Query: 1542 XXXXEAIQKARAGKSDENKS-PNAQEKEVVLGSADSHGDVGLKDSGNVEGIPCDDTKAGA 1366
                        G SD+  S P A+EK     +A S  D  +K  GN EG+         
Sbjct: 176  SGD-------EVGNSDDRGSMPAAKEKND--SAAKSQEDGNVKSLGNFEGVVSGSEPEVH 226

Query: 1365 ANDDTLL-----NLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYGETFDS 1201
            A DD        N   S  KQ++  NL  VPKTF   E+ DGK VNVVEGLKLY E    
Sbjct: 227  AVDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCAD 286

Query: 1200 LKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNSG 1021
             ++S LV L N+LRSAG RG  + Q+++VSKRPMKG GRE IQLG+PIADAP EDE ++G
Sbjct: 287  TEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISAG 346

Query: 1020 SCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVCT 841
            + +D + EAIP LLQ+V ERL+S+QV TVKPDSCIIDF+NEGDHSQPH+ P WFGRPVC 
Sbjct: 347  TLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVCV 406

Query: 840  LFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRILI 661
            LFLTECD+TFGRV  + HPG+Y              +MQGKSADFAKHAI S+R+QRIL+
Sbjct: 407  LFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRILV 466

Query: 660  TFTKSQPKKVMLSSDGQPLPTSAAAPHC-WGXXXXXXXXXXXXXXXXXXPKHYGAATTTG 484
            TFTKSQPKK M  SDGQ +P+   AP   WG                  PKHY    TTG
Sbjct: 467  TFTKSQPKKSM-PSDGQRMPSPGVAPSSHWG----PQPSRSPNHIRHPGPKHYAPVPTTG 521

Query: 483  VLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXXXXXXXX 304
            VL                  +F+T+ +AP AMP+P PVP+PP+SSGW+A           
Sbjct: 522  VL-QASPVRPQIPPPNGIQPLFVTAPVAP-AMPFPAPVPIPPSSSGWSAA-PPRHPPPRL 578

Query: 303  XXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASPNSEL 124
              PGTGVFLPP GSG     +        N T ET +  +    S + N   +ASP  ++
Sbjct: 579  PVPGTGVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKV 638

Query: 123  DGNTKRQDCNGNVGIDLGGTVAG---EEQSDDLENGAS 19
            D  T++Q+CNG+  +D  G+V     EE+    +N A+
Sbjct: 639  DSKTQKQECNGS--LDGSGSVISVTKEERQQSSDNTAT 674


>ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320980 [Prunus mume]
          Length = 691

 Score =  576 bits (1485), Expect = e-161
 Identities = 339/705 (48%), Positives = 424/705 (60%), Gaps = 21/705 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSG----GEI--HNRQWFVDDRDRFISWLRGEFAAANAII 1909
            M MP GNVV+SDKMQFPS GG G    GEI  H+RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIPQHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLCHHLR+VGEPGEYDVV G IQQRRC+WN VLHMQQYFS+AEV  ALQ  AW++Q R+
Sbjct: 61   DSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRY 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAE-HQN----RSVVSSAQFVNMDS 1564
                        +SG  G    Q      KE  NS+ E H N      VV+  +F     
Sbjct: 121  YDPVKAGAKEFKRSGV-GFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFERGSE 179

Query: 1563 XXXXXXXXXXXEAIQKARAGK-SDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPC 1387
                               GK +D+  +P  ++K+ +    +   D  L+  GN +G   
Sbjct: 180  VGEEVEPG--------GEVGKLNDKGLAPAGEKKDALTKPQE---DSNLRSFGNSQGTIS 228

Query: 1386 DDTKAGAANDD-----TLLNLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKL 1222
            ++++      D     + +N   SIQ Q++++NL +VPKTF+  E  DGK VN V+GLKL
Sbjct: 229  ENSEPEVVEVDGCTPSSKVNESHSIQIQNQKQNLSIVPKTFIGNETSDGKTVNAVDGLKL 288

Query: 1221 YGETFDSLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQ 1042
            Y +     ++S L+ L N+LR+AG+R QL+GQ+++VSKRPMKG GRE+IQLG+PIADAP 
Sbjct: 289  YEDFLGDTEVSKLLSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPP 348

Query: 1041 EDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPW 862
            EDE ++G+ +D K+E IPSLLQ+VI+RL+ + V TVKPDSCIID +NEGDHSQPH  P W
Sbjct: 349  EDEISAGTSKDRKIEPIPSLLQDVIDRLVGMHVVTVKPDSCIIDVYNEGDHSQPHTWPSW 408

Query: 861  FGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSI 682
            FGRPVC L+LTECD+TFGRV+ + HPG+Y               MQGKSADFAKHAI SI
Sbjct: 409  FGRPVCALYLTECDMTFGRVLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSI 468

Query: 681  RKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHC-WGXXXXXXXXXXXXXXXXXXPKHY 505
            RKQRIL+TFTKSQPKK   +SDGQ  P  A A    WG                  PKHY
Sbjct: 469  RKQRILVTFTKSQPKK-STTSDGQRFPAPAPAQSSYWG---PPPSRSPNHIRHPTGPKHY 524

Query: 504  GAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXX 325
             A  TTGVL                  +F+ + + P A+P+   VP+PP S+GW A    
Sbjct: 525  AAVPTTGVL-PAPPIRSQLPPQNGIQPLFVPAPVGP-AIPFAAAVPIPPGSAGWPAA--P 580

Query: 324  XXXXXXXXXPGTGVFLPPQGSG--PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCN 151
                     PGTGVFLPP GSG   APQQ +  +A  ++ T ETPS  D    S +SN +
Sbjct: 581  RHPPPRIPLPGTGVFLPPPGSGNSSAPQQ-LPGTATEMSPTVETPSPRDKDNGSGKSNHS 639

Query: 150  GSASPNSELDGNTKRQDCNGNV-GIDLGGTVAGEEQSDDLENGAS 19
             SASP  + DG   RQDCNG+  G   G T   EE+    +  A+
Sbjct: 640  TSASPKGKSDGKAHRQDCNGSAEGTGSGRTAVKEEEQQTSDKTAA 684


>ref|XP_009351588.1| PREDICTED: uncharacterized protein LOC103943111 [Pyrus x
            bretschneideri] gi|694320826|ref|XP_009351589.1|
            PREDICTED: uncharacterized protein LOC103943111 [Pyrus x
            bretschneideri]
          Length = 690

 Score =  570 bits (1470), Expect = e-159
 Identities = 340/693 (49%), Positives = 413/693 (59%), Gaps = 16/693 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGS----GGEIHN--RQWFVDDRDRFISWLRGEFAAANAII 1909
            M MP GNVV+SDKMQFPS GG     GGEIH   RQWF D+RD FISWLRGEFAAAN II
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGAAVGGGEIHQHPRQWFPDERDGFISWLRGEFAAANTII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLCHHLR+VG+PGEYDVV G IQQRRC+WN VLHMQQYFS+AEV  ALQ  AW++Q   
Sbjct: 61   DSLCHHLRAVGDPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQMQ 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXXXX 1549
                        +S S G    Q      KE  N   E  +    SS       S     
Sbjct: 121  YDPVKVGTKEYKRSAS-GFNKDQQRAEHFKEGHNFRTEVHSYDGNSSGLVA---SEKVER 176

Query: 1548 XXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEG-IPCDDTKA 1372
                  E   +   GK D+N    A EK+  L       D  L+ SGN +  I C+    
Sbjct: 177  GSDVAEEVKPRGEVGKLDDNGLAPAGEKKDALTKPQE--DSRLRSSGNSQQTIYCNLEPE 234

Query: 1371 GAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYGETFD 1204
             A  D    + K+    SIQ Q+ ++NL VVPKTFV  E++DGK VNVV+GLKL+     
Sbjct: 235  VAVGDGCTSSSKENESHSIQIQNAKQNLPVVPKTFVGNELIDGKTVNVVDGLKLFEGLLG 294

Query: 1203 SLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNS 1024
              ++S LV LAN+LR AG+RGQL+GQ+++VSKRPM+G GRE+IQLG+P+ DAP EDE ++
Sbjct: 295  DTEVSKLVSLANDLRVAGKRGQLQGQTYVVSKRPMRGHGREMIQLGLPVTDAPSEDEISA 354

Query: 1023 GSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVC 844
            G+ +D ++EAIPSLLQ+VI+RL+ +QVTTVKPDSCIIDF+NEGDHS PH  PPWFGRPVC
Sbjct: 355  GTSKDRRIEAIPSLLQDVIDRLVGMQVTTVKPDSCIIDFYNEGDHSHPHTWPPWFGRPVC 414

Query: 843  TLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRIL 664
             L LTECD+TFGRV+   HPG+Y               +QGKS DFAKHAI SIRKQRIL
Sbjct: 415  ILLLTECDMTFGRVLVSDHPGDYRGSLKLSLTPGSLLLLQGKSTDFAKHAIPSIRKQRIL 474

Query: 663  ITFTKSQPKKVMLSSDGQ--PLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYGAATT 490
            +TFTKSQPKK M+ SDGQ  P PT A + H WG                  PKHY A  T
Sbjct: 475  VTFTKSQPKKSMM-SDGQRFPGPTPAQSSH-WG---PASGRSPSHIRHPAGPKHYAAVPT 529

Query: 489  TGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXXXXXX 310
            TGVL                  +F+ + + P A+P+ T VP+PP S+GWAA         
Sbjct: 530  TGVL-PAPPIRSQLPPPNGIQPLFVPAPVGP-AIPFATAVPMPPVSAGWAAA--PRHPPP 585

Query: 309  XXXXPGTGVFLPPQGSG--PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASP 136
                PGTGVFLPP GSG   APQQ +   A   +   E P  ++    S +SN + + SP
Sbjct: 586  RIPLPGTGVFLPPPGSGNSSAPQQ-LPYIATQKSPAVEIPPQIEKENGSAKSNHSTTPSP 644

Query: 135  NSELDGNTKRQDCNGNV-GIDLGGTVAGEEQSD 40
              + DG  +R +CNG   G   G  V  EE  D
Sbjct: 645  RGKSDGKAERHECNGRADGTGSGRAVVEEEHQD 677


>ref|XP_012072373.1| PREDICTED: uncharacterized protein LOC105634169 [Jatropha curcas]
            gi|643730738|gb|KDP38170.1| hypothetical protein
            JCGZ_04813 [Jatropha curcas]
          Length = 691

 Score =  563 bits (1451), Expect = e-157
 Identities = 338/707 (47%), Positives = 422/707 (59%), Gaps = 24/707 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGS------GGEIHNR-----QWF-VDDRDRFISWLRGEFA 1927
            MAMP GNVVISDKMQFP+ GG       G EIH +     QWF VD+RD FISWLRGEFA
Sbjct: 1    MAMPPGNVVISDKMQFPAGGGGVGGGGVGNEIHQQHHHRQQWFPVDERDGFISWLRGEFA 60

Query: 1926 AANAIIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAW 1747
            AANAIIDSLCHHLR+VGEPGEYD+V G IQQRRC+WN VLHMQQYFS+ EV  ALQQ A 
Sbjct: 61   AANAIIDSLCHHLRAVGEPGEYDLVVGCIQQRRCNWNAVLHMQQYFSVGEVILALQQVAL 120

Query: 1746 KKQHRHXXXXXXXXXXXXKSGSQGVTSKQWFRVE----NKENQNSSAEHQNRSVVSSAQF 1579
            +KQ +                   V  K++ R      NK  +    E    +V S  + 
Sbjct: 121  RKQQQQQQQRYYYD-------QNKVGGKEFKRFSGAGFNKGQKGGGGEVVKEAVNSRVES 173

Query: 1578 VNMDSXXXXXXXXXXXEAIQK-ARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNV 1402
             + D            E I+  A  GK ++     A++K+    +A  H D  LK SGN 
Sbjct: 174  HSFDGNSSGNGGSEKFEEIKSGADGGKLEDKSVALAEDKKDA--AAKPHVDNPLKTSGNS 231

Query: 1401 EGIPCDDTKAGAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVE 1234
            E     + +A A   D   +LK+    S   Q  ++ L + PKTFV  EI+DGK VNVV+
Sbjct: 232  EETLSGNLEADAEAVDEQSSLKENDSHSSHNQSVKQTLAITPKTFVGGEIVDGKMVNVVD 291

Query: 1233 GLKLYGETFDSLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIA 1054
            GLKLY +  D +++S LV L N+LR++GRRGQ  GQ+++VSKRPMKG GRE+IQLG+PIA
Sbjct: 292  GLKLYEQLLDDVEVSKLVSLVNDLRASGRRGQFSGQTYVVSKRPMKGHGREMIQLGLPIA 351

Query: 1053 DAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHV 874
            DAP EDEN +G+ +D ++E+IP+LLQ+VIER +++Q+  VKPDSCIID +NEGDHSQP++
Sbjct: 352  DAPAEDENAAGTSKDRRVESIPTLLQDVIERFVNMQIMAVKPDSCIIDLYNEGDHSQPNM 411

Query: 873  CPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHA 694
             PPWFG+P+  LFLTECD+TFGRVI    PG+Y               MQGKS D+AKHA
Sbjct: 412  WPPWFGKPISVLFLTECDLTFGRVITADQPGDYKGSLKLPLAPGSLLVMQGKSTDYAKHA 471

Query: 693  ISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPH-CWGXXXXXXXXXXXXXXXXXX 517
            I +IRKQR+++TFTKSQPKK    SDGQ L +SAAAP   WG                  
Sbjct: 472  IPAIRKQRMIVTFTKSQPKK-YAQSDGQRLVSSAAAPSPHWG----PAPSRSPNHIRHPV 526

Query: 516  PKHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGW-A 340
            PKHY A  TTGVL                  +F+T+++A A MP+P PVP+PP S+GW A
Sbjct: 527  PKHYPAVPTTGVL-PAPAIRPQIPPPNGVQPLFVTATVA-APMPFPAPVPIPPVSTGWPA 584

Query: 339  AVXXXXXXXXXXXXPGTGVFLPPQGSGPAPQQ-TVSASAATVNSTTETPSTLDNTKWSVR 163
            A             PGTGVFLPP GSG A     +S +A   N   E  S  D       
Sbjct: 585  AAPRHPPNRLPVPVPGTGVFLPPPGSGNASSSPQISTAAIEANFPVEAVSLTDKENGPGI 644

Query: 162  SNCNGSASPNSELDGNTKRQDCNGNVGIDLGGTVAGEEQSDDLENGA 22
            SN    ASP  +LDG T+RQDCN   GI  G  V  EEQ  ++++ A
Sbjct: 645  SNHVSCASPKEKLDGKTQRQDCN---GIADGRAVTEEEQHQNVDHSA 688


>ref|XP_008381778.1| PREDICTED: uncharacterized protein LOC103444603 [Malus domestica]
          Length = 690

 Score =  561 bits (1447), Expect = e-157
 Identities = 339/703 (48%), Positives = 414/703 (58%), Gaps = 17/703 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGS----GGEIHN--RQWFVDDRDRFISWLRGEFAAANAII 1909
            M MP GNVV+SDKMQFPS GG     GGEIH   RQW  D+RD FISWLRGEFAAAN II
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGAAVGGGEIHQHPRQWLPDERDGFISWLRGEFAAANTII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLCHHLR+VG+PGEYDVV G IQQRRC+WN VLHMQQYFS+AEV  ALQ  AW++QH  
Sbjct: 61   DSLCHHLRAVGDPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQHMQ 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXXXX 1549
                        +S S G    Q      KE  N   E  +    SS       S     
Sbjct: 121  YDPVKVGTKEYKRSAS-GFNKDQQRAEHFKEGHNFRTEVHSYDGNSSGLVA---SEKVER 176

Query: 1548 XXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEG-IPCDDTKA 1372
                  E       GK D+     A EK+  L       D  L+ SGN +  I C+    
Sbjct: 177  GSDVAEEVKPHGEVGKLDDKGLAPAGEKKDALTKPQE--DSRLRSSGNSQQTIYCNLEPE 234

Query: 1371 GAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYGETFD 1204
             A  D      K+    SIQ Q  ++NL VVPKTFV  E++DGK VNVV+GLKL+     
Sbjct: 235  VAVGDGCTSISKENESHSIQIQIAQQNLPVVPKTFVGNELIDGKTVNVVDGLKLFEGLLG 294

Query: 1203 SLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNS 1024
              ++S LV LAN+LR AG+RGQ +GQ+++VSKRPM+G GRE+IQLG+P+ DAP EDE ++
Sbjct: 295  DTEVSKLVSLANDLRVAGKRGQFQGQTYVVSKRPMRGHGREMIQLGLPVTDAPSEDEISA 354

Query: 1023 GSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVC 844
            G+ +D ++EAIPSLLQ+VI+RL+ +QVTTVKPDSCIIDF+NEGDHS PH+ PPWFGRPVC
Sbjct: 355  GTSKDRRIEAIPSLLQDVIDRLVGMQVTTVKPDSCIIDFYNEGDHSHPHIWPPWFGRPVC 414

Query: 843  TLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRIL 664
             L LTECD+TFGRV+   HPG+Y               +QGKS DFAKHAI SIRKQRIL
Sbjct: 415  VLLLTECDMTFGRVLVSDHPGDYRGALKLSLTPGSLLLLQGKSTDFAKHAIPSIRKQRIL 474

Query: 663  ITFTKSQPKKVMLSSDGQ--PLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYGAATT 490
            +TFTKSQPKK  + SDGQ  P PT A + H WG                  P HY A  T
Sbjct: 475  VTFTKSQPKKSTM-SDGQRFPGPTPAQSSH-WG---PASGRSPSHIRHPAGPNHYAAVPT 529

Query: 489  TGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXXXXXX 310
            TGVL                  +F+ + + P A+P+ T VP+PP S+GWAA         
Sbjct: 530  TGVL-PAPSIRSQLPPPNGIQPLFVPAPVGP-AIPFATAVPMPPVSAGWAAA--PRHPPP 585

Query: 309  XXXXPGTGVFLPPQGSG--PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASP 136
                PGTGVFLPP GSG   APQQ +  SA   +   E P  ++    S +SN +   SP
Sbjct: 586  RIPLPGTGVFLPPPGSGNSSAPQQ-LPYSATQKSPAVEIPPQIEKESGSAKSNHSPMPSP 644

Query: 135  NSELDGNTKRQDCNGNV-GIDLGGTVAGEE-QSDDLENGASKA 13
              + DG  +R +CNG+  G   G  V  EE Q+ D    +++A
Sbjct: 645  RGKSDGKAERHECNGSADGTGSGRAVVEEEDQNSDSMTASNQA 687


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  551 bits (1420), Expect = e-154
 Identities = 329/655 (50%), Positives = 399/655 (60%), Gaps = 25/655 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGSGG-----EIHN-RQWFVDDRDRFISWLRGEFAAANAII 1909
            MAMP GNVVISDKMQFP  GG GG     EIH+ RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLC+HLR +GEPGEYD V G IQQRR +W+ VLHMQQYFS+AEV  ALQQ  W++Q RH
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVEN-KENQNSSAEHQNRSVVSSAQFVNMDSXXXX 1552
                        +    GV  +Q  R E  K++ NS+ E+ +    SS     ++     
Sbjct: 121  LDPVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSG---TLEKGERV 174

Query: 1551 XXXXXXXEAIQKARA-GKSDENKSPNAQEKEVVLGS-ADSHGDVGLKDSGNVEGIPC--D 1384
                   +   K    GK ++     A+EK+    + A  + +   K S N EG  C   
Sbjct: 175  SEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGIS 234

Query: 1383 DTKAGAANDDTLLNLKDS-----------IQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
            +T+A   +D   LN K S           +Q Q+++ N    PKTFV TEI DGKAVNVV
Sbjct: 235  ETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 294

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLK-GQSFIVSKRPMKGRGREVIQLGVP 1060
            +GLKLY E FD  ++S  V L N+LR+AG+RGQL+ GQ+F+VSKRPMKG GRE+IQLGVP
Sbjct: 295  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVP 354

Query: 1059 IADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQP 880
            IADAP EDE+  G+ +D + E+IPSLLQ+VI  L+  QV TVKPD+CIIDF+NEGDHSQP
Sbjct: 355  IADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQP 414

Query: 879  HVCPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAK 700
            H+ P WFGRPVC LFLTECD+TFGRVIG  HPG+Y               MQGKSADFAK
Sbjct: 415  HIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAK 474

Query: 699  HAISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXX 520
            HAI S+RKQRIL+TFTKSQPKK M +SDGQ L   AA    W                  
Sbjct: 475  HAIPSLRKQRILVTFTKSQPKKTM-ASDGQRLLPPAAQSSHW---VPPPSRSPNHMRHPM 530

Query: 519  XPKHYGAATTTGVL-XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGW 343
             PKHYGA  TTGVL                   +F+T+++AP AMP+P PVPLP  S GW
Sbjct: 531  GPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAP-AMPFPAPVPLPTGSPGW 589

Query: 342  AAVXXXXXXXXXXXXPGTGVFLPPQGSG-PAPQQTVSASAATVNSTTETPSTLDN 181
             A             PGTGVFLPP GSG  +  Q +S  A + +  T  P+  +N
Sbjct: 590  PAA-PPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKEN 643


>ref|XP_008360022.1| PREDICTED: uncharacterized protein LOC103423718 [Malus domestica]
          Length = 687

 Score =  550 bits (1416), Expect = e-153
 Identities = 328/696 (47%), Positives = 407/696 (58%), Gaps = 19/696 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGS----GGEIHN--RQWFVDDRDRFISWLRGEFAAANAII 1909
            M MP GNVV+SDKMQFPS GG+    GGEI    RQWF D+RD FISWLRGEFAAANAII
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGAAAVGGGEIPQLPRQWFPDERDGFISWLRGEFAAANAII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLCHHLR VGEPGEYD V   IQQRRC+WN VLHMQQYFS+AEV  ALQ  AW++Q R 
Sbjct: 61   DSLCHHLRVVGEPGEYDGVISCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRQ 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQ----NRSVVSSAQFVNMDSX 1561
                        +SGS G    Q      KE  N S E      N S + +++ V   S 
Sbjct: 121  YDHVKVGAKEYKRSGS-GFNKGQHRAEHFKEGHNFSTEVHSYDGNSSGLXASEKVERGSE 179

Query: 1560 XXXXXXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPCDD 1381
                              GK D N    A EK       +   D  L+ S N +     +
Sbjct: 180  VAEELKPG-------GEVGKLDGNGLAAAGEK------TEPQEDSRLRSSENSQLTIYGN 226

Query: 1380 TKAGAANDDTLL-----NLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYG 1216
            ++   A  D        N   SIQ Q+ ++NL +VPKTFV  E+LDGK VNVV+GLKLY 
Sbjct: 227  SEPEVAVGDGCTSSSKENESHSIQIQNAKQNLSIVPKTFVGNELLDGKTVNVVDGLKLYE 286

Query: 1215 ETFDSLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQED 1036
                  ++S LV LAN+LR AG+RGQ +GQ+++VSKRPM+G GRE+IQLG+P+ D P ED
Sbjct: 287  GLLGDTEVSKLVSLANDLRVAGKRGQFQGQTYVVSKRPMRGHGREMIQLGLPVIDXPSED 346

Query: 1035 ENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFG 856
            E +SG+ +D ++EAIPSLLQ+VI+R+  +QVTTVKPDSCIIDF+NEGDHS PH  PPWFG
Sbjct: 347  EISSGTSKDRRIEAIPSLLQDVIDRIAGMQVTTVKPDSCIIDFYNEGDHSHPHTWPPWFG 406

Query: 855  RPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRK 676
            RP+C LFLTECD+TFGRV+   HPG+Y               +QGKS DFAKHAI SIRK
Sbjct: 407  RPICVLFLTECDMTFGRVLVSDHPGDYRGPLKLSLTPGSLLLLQGKSTDFAKHAIPSIRK 466

Query: 675  QRILITFTKSQPKKVMLSSDGQ--PLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYG 502
            QR+L+TFTKSQPKK  + SDGQ  P PT A + + WG                  PKHY 
Sbjct: 467  QRVLVTFTKSQPKKNTM-SDGQRFPAPTPAQSSY-WG---QPSGRSPSHIRHPAGPKHYA 521

Query: 501  AATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXX 322
            A  TTGVL                  +F+   + P A+P+   V +PP S+GWAA     
Sbjct: 522  AVPTTGVL-PAPPIRSQLPPPNGIQPLFVPPPVGPPAIPFAGAVSIPPVSAGWAAA--PR 578

Query: 321  XXXXXXXXPGTGVFLPPQGSG--PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNG 148
                    PGTGVFLPP GSG   APQQ +  +A  ++ + E P   +    S +SN + 
Sbjct: 579  HPPPRIPPPGTGVFLPPPGSGNSSAPQQ-LPTTATQMSPSVEIPPQTERESGSAKSN-HS 636

Query: 147  SASPNSELDGNTKRQDCNGNVGIDLGGTVAGEEQSD 40
            +  P  + DG  +  +CNG++     G  A +E+ D
Sbjct: 637  TTPPKGKSDGKAQSHECNGSLDGTGSGRAAVKEEED 672


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max] gi|947110281|gb|KRH58607.1| hypothetical protein
            GLYMA_05G138600 [Glycine max]
          Length = 681

 Score =  549 bits (1415), Expect = e-153
 Identities = 328/706 (46%), Positives = 412/706 (58%), Gaps = 23/706 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNG----GSGGEIHN----RQWFVDDRDRFISWLRGEFAAANA 1915
            MAMP GNVVI DKMQFPS G    G+GGEIH     +QWFVD+RD  I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 1914 IIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQH 1735
            IIDSLCHHLR VG+PGEYD+V G IQQRRC+WN VL MQQYFS+A+V  ALQQ AW++Q 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 1734 RHXXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXX 1555
            R             KSGS G    Q F    KE  NSS E  N+           D+   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPV-KEGYNSSVESYNQ----------YDANVT 168

Query: 1554 XXXXXXXXEAIQKARAGKSDENKSPNAQEK--EVVLGSADSHGDV--------GLKDSGN 1405
                      + +    KS+E+KS    EK  +  L SA+   D          LK + +
Sbjct: 169  VTGGTEKGTPVVE----KSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRS 224

Query: 1404 VEGIPCDDTKAGAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
             EG   +       ND+ + N K     S+Q Q + ++L    KTF+  E+ DGK VNVV
Sbjct: 225  TEGSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVV 284

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLKG-QSFIVSKRPMKGRGREVIQLGVP 1060
            +GLKLY + FDS +I+NLV L N+LR +G++GQL+G Q++IVS+RPMKG GRE+IQLGVP
Sbjct: 285  DGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVP 344

Query: 1059 IADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQP 880
            IADAP E EN +G+ +D  +E IPSL Q++IER++S QV TVKPD CI+DF+NEGDHSQP
Sbjct: 345  IADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQP 404

Query: 879  HVCPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAK 700
            H  P W+GRPV  LFLTEC++TFGRVI   HPG+Y               M+GKS+DFAK
Sbjct: 405  HSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAK 464

Query: 699  HAISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXX 520
            HA+ S+RKQRIL+TFTKSQP+K  LSSD Q L ++A + H WG                 
Sbjct: 465  HALPSVRKQRILVTFTKSQPRK-SLSSDAQRLASTATSSH-WG---PLPSRSPNHVRHHV 519

Query: 519  XPKHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWA 340
              KHY    TTGVL                  +F+T+ + P  MP+P PV  PP S+GW 
Sbjct: 520  GSKHYATLPTTGVL-PSPPIRPQMAAPVGMQPLFVTAPVVP-PMPFPAPVAFPPGSTGWT 577

Query: 339  AVXXXXXXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSVRS 160
                          PGTGVFLPP GSG + QQ  + + A VN +TETP+ L+        
Sbjct: 578  GAPPPRHPPPRVPAPGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNH 637

Query: 159  NCNGSASPNSELDGNTKRQDCNGNVGIDLGGTVAGEEQSDDLENGA 22
            N + SASP     G  ++Q+CNG+         A E + D  +  A
Sbjct: 638  N-STSASPK----GKVQKQECNGHAADGTQVEPALETRQDSNDKAA 678


>gb|KHN25979.1| hypothetical protein glysoja_018833 [Glycine soja]
          Length = 685

 Score =  542 bits (1396), Expect = e-151
 Identities = 329/711 (46%), Positives = 413/711 (58%), Gaps = 28/711 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNG----GSGGEIHN----RQWFVDDRDRFISWLRGEFAAANA 1915
            MAMP GNVVI DKMQFPS G    G+GGEIH     +QWFVD+RD  I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 1914 IIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQH 1735
            IIDSLCHHLR VG+PGEYD+V G IQQRRC+WN VL MQQYFS+A+V  ALQQ AW++Q 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 1734 RHXXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXX 1555
            R             KSGS G    Q F    KE  NSS E  N+           D+   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPV-KEGYNSSVESYNQ----------YDANVT 168

Query: 1554 XXXXXXXXEAIQKARAGKSDENKSPNAQEK--EVVLGSADSHGDV--------GLKDSGN 1405
                      + +    KS+E+KS    EK  +  L SA+   D          LK + +
Sbjct: 169  VTGGTEKGTPVVE----KSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRS 224

Query: 1404 VEGIPCDDTKAGAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
             EG   +       ND+ + N K     S+Q Q + ++L    KTF+  E+ DGK VNVV
Sbjct: 225  TEGSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVV 284

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLKG-QSFIVSKRPMKGRGREVIQLGVP 1060
            +GLKLY + FDS +I+NLV L N+LR +G++GQL+G Q++IVS+RPMKG GRE+IQLGVP
Sbjct: 285  DGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVP 344

Query: 1059 IADAPQEDENNSGSCEDGKM-----EAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEG 895
            IADAP E EN +G+ + GK+     E IPSL Q++IER++S QV TVKPD CI+DF+NEG
Sbjct: 345  IADAPAEGENMTGASK-GKLYYMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEG 403

Query: 894  DHSQPHVCPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKS 715
            DHSQPH  P W+GRPV  LFLTEC++TFGRVI   HPG+Y               M+GKS
Sbjct: 404  DHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKS 463

Query: 714  ADFAKHAISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXX 535
            +DFAKHA+ S+RKQRIL+TFTKSQP+K  LSSD Q L ++A + H WG            
Sbjct: 464  SDFAKHALPSVRKQRILVTFTKSQPRK-SLSSDAQRLASTATSSH-WG---PLPSRSPNH 518

Query: 534  XXXXXXPKHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPA 355
                   KHY    TTGVL                  +F+T+ + P  MP+P PV  PP 
Sbjct: 519  VRHHVGSKHYATLPTTGVL-PSPPIRPQMAAPVGMQPLFVTAPVVP-PMPFPAPVAFPPG 576

Query: 354  SSGWAAVXXXXXXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTK 175
            S+GW               PGTGVFLPP GSG + QQ  + + A VN +TETP+ L+   
Sbjct: 577  STGWTGAPPPRHPPPRVPAPGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKEN 636

Query: 174  WSVRSNCNGSASPNSELDGNTKRQDCNGNVGIDLGGTVAGEEQSDDLENGA 22
                 N + SASP     G  ++Q+CNG+         A E + D  +  A
Sbjct: 637  GKTNHN-STSASPK----GKVQKQECNGHAADGTQVEPALETRQDSNDKAA 682


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  541 bits (1395), Expect = e-151
 Identities = 324/701 (46%), Positives = 416/701 (59%), Gaps = 26/701 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSN-----------------GGSGGEIH---NRQWFVDDRDRFI 1951
            MAMP GNVV+SDKMQFP+                  GG GGEIH   +RQW  D+RD FI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 1950 SWLRGEFAAANAIIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVT 1771
             WLRGEFAA+NAIIDSLCHHLR VGE GEY+ V   IQQRRC+WN VLHMQQYFS+AEV+
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 1770 CALQQAAWKKQHRHXXXXXXXXXXXXKSGSQGVTSKQWFRVE-NKENQNSSAEHQNRSVV 1594
             ALQQ AW+++ RH            +SG  G   +   R+E  KE QNS  +    S V
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFKGQ---RMEVAKEGQNSGVDSDGNSTV 176

Query: 1593 SSAQFVNMDSXXXXXXXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKD 1414
            ++    N                      GK ++  S   ++K+          D G K 
Sbjct: 177  TAVSERNERGSEKREEVKSC------GEVGKVEDKCSTFTEDKK----------DTGSKP 220

Query: 1413 -SGNVEGIPCDDTKAGAANDDTLLNLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
             +G+ E +  +D   G  +     +L  SIQ Q++++NL   PKTFV  E+ DGK VNVV
Sbjct: 221  HAGDAESVT-EDVNGGCTSSYKENDLC-SIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVV 278

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPI 1057
            +GLKLY E FD  ++ +LV L N+LR+AG+RGQL+GQ+++ +KRPMKG GRE+IQLG+PI
Sbjct: 279  DGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPI 338

Query: 1056 ADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPH 877
            ADAP +DEN +G+ +D ++E IP LLQ+ IERL++LQV TVKPDSCIID +NEGDHSQP 
Sbjct: 339  ADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPR 398

Query: 876  VCPPWFGRPVCTLFLTECDVTFGRVIGVG-HPGEYNXXXXXXXXXXXXXSMQGKSADFAK 700
            + PPWFG+PVC +FLTECD+TFGRV+ V  HPG+Y               MQGKSADFAK
Sbjct: 399  MWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAK 458

Query: 699  HAISSIRKQRILITFTK-SQPKKVMLSSDGQPLPT-SAAAPHCWGXXXXXXXXXXXXXXX 526
            HA+ S+RKQRIL+TFTK  QPKK   ++D Q L + S +    WG               
Sbjct: 459  HALPSVRKQRILVTFTKYCQPKK--STTDNQRLSSPSVSQSSQWG---PPPSRSPNRIRH 513

Query: 525  XXXPKHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSG 346
               PKHY    TTGVL                  +F+ +++AP A+ +P PVP+PP S+G
Sbjct: 514  SAGPKHYAVIPTTGVL-PAPPIRPQIPPSSGVQPLFVPTAVAP-AISFPAPVPIPPGSTG 571

Query: 345  WAAVXXXXXXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSV 166
            W A             PGTGVFLPP GSG +  Q +S +A  +N   ET S  +    SV
Sbjct: 572  WPAA--PRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSV 629

Query: 165  RSNCNGSASPNSELDGNTKRQDCNGNV-GIDLGGTVAGEEQ 46
            + N + + SP   LDG + +QDCNG+V G   G  +  EEQ
Sbjct: 630  KPN-HHTTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQ 669


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max] gi|947110280|gb|KRH58606.1| hypothetical protein
            GLYMA_05G138600 [Glycine max]
          Length = 641

 Score =  541 bits (1393), Expect = e-150
 Identities = 323/692 (46%), Positives = 404/692 (58%), Gaps = 9/692 (1%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNG----GSGGEIHN----RQWFVDDRDRFISWLRGEFAAANA 1915
            MAMP GNVVI DKMQFPS G    G+GGEIH     +QWFVD+RD  I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 1914 IIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQH 1735
            IIDSLCHHLR VG+PGEYD+V G IQQRRC+WN VL MQQYFS+A+V  ALQQ AW++Q 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 1734 RHXXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXX 1555
            R             KSGS G    Q F    KE  NSS E  N+           D+   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPV-KEGYNSSVESYNQ----------YDANVT 168

Query: 1554 XXXXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPCDDTK 1375
                      + +    KS+E+KS    EK          GD GL  + + +G       
Sbjct: 169  VTGGTEKGTPVVE----KSEEHKSGGKVEKV---------GDKGLASAEDKKG------- 208

Query: 1374 AGAANDDTLLNLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYGETFDSLK 1195
                 DD+      S+Q Q + ++L    KTF+  E+ DGK VNVV+GLKLY + FDS +
Sbjct: 209  -----DDS-----HSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTE 258

Query: 1194 ISNLVQLANELRSAGRRGQLKG-QSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNSGS 1018
            I+NLV L N+LR +G++GQL+G Q++IVS+RPMKG GRE+IQLGVPIADAP E EN +G+
Sbjct: 259  IANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENMTGA 318

Query: 1017 CEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVCTL 838
             +D  +E IPSL Q++IER++S QV TVKPD CI+DF+NEGDHSQPH  P W+GRPV  L
Sbjct: 319  SKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYIL 378

Query: 837  FLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRILIT 658
            FLTEC++TFGRVI   HPG+Y               M+GKS+DFAKHA+ S+RKQRIL+T
Sbjct: 379  FLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRILVT 438

Query: 657  FTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYGAATTTGVL 478
            FTKSQP+K  LSSD Q L ++A + H WG                   KHY    TTGVL
Sbjct: 439  FTKSQPRK-SLSSDAQRLASTATSSH-WG---PLPSRSPNHVRHHVGSKHYATLPTTGVL 493

Query: 477  XXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXXXXXXXXXX 298
                              +F+T+ + P  MP+P PV  PP S+GW               
Sbjct: 494  -PSPPIRPQMAAPVGMQPLFVTAPVVP-PMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPA 551

Query: 297  PGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSASPNSELDG 118
            PGTGVFLPP GSG + QQ  + + A VN +TETP+ L+        N + SASP     G
Sbjct: 552  PGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHN-STSASPK----G 606

Query: 117  NTKRQDCNGNVGIDLGGTVAGEEQSDDLENGA 22
              ++Q+CNG+         A E + D  +  A
Sbjct: 607  KVQKQECNGHAADGTQVEPALETRQDSNDKAA 638


>ref|XP_008365818.1| PREDICTED: uncharacterized protein LOC103429447, partial [Malus
            domestica]
          Length = 640

 Score =  540 bits (1392), Expect = e-150
 Identities = 323/653 (49%), Positives = 389/653 (59%), Gaps = 15/653 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNGGS----GGEIHN--RQWFVDDRDRFISWLRGEFAAANAII 1909
            M MP GNVV+SDKMQFPS GG     GGEIH   RQW  D+RD FISWLRGEFAAAN II
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGAAVGGGEIHQHPRQWLPDERDGFISWLRGEFAAANTII 60

Query: 1908 DSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQHRH 1729
            DSLCHHLR+VG+PGEYDVV G IQQRRC+WN VLHMQQYFS+AEV  ALQ  AW++QH  
Sbjct: 61   DSLCHHLRAVGDPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQHMQ 120

Query: 1728 XXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXXXX 1549
                        +S S G    Q      KE  N   E  +    SS       S     
Sbjct: 121  YDPVKVGTKEYKRSAS-GFNKDQQRAEHFKEGHNFRTEVHSYDGNSSGLVA---SEKVER 176

Query: 1548 XXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEG-IPCDDTKA 1372
                  E       GK D+     A EK+  L       D  L+ SGN +  I C+    
Sbjct: 177  GSDVAEEVKPHGEVGKLDDKGLAPAGEKKDALTKPQE--DSRLRSSGNSQQTIYCNLEPE 234

Query: 1371 GAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLYGETFD 1204
             A  D      K+    SIQ Q  ++NL VVPKTFV  E++DGK VNVV+GLKL+     
Sbjct: 235  VAVGDGCTSISKENESHSIQIQIAQQNLPVVPKTFVGNELIDGKTVNVVDGLKLFEGLLG 294

Query: 1203 SLKISNLVQLANELRSAGRRGQLKGQSFIVSKRPMKGRGREVIQLGVPIADAPQEDENNS 1024
              ++S LV LAN+LR AG+RGQ +GQ+++VSKRPM+G GRE+IQLG+P+ DAP EDE ++
Sbjct: 295  DTEVSKLVSLANDLRVAGKRGQFQGQTYVVSKRPMRGHGREMIQLGLPVTDAPSEDEISA 354

Query: 1023 GSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPWFGRPVC 844
            G+ +D ++EAIPSLLQ+VI+RL+ +QVTTVKPDSCIIDF+NEGDHS PH+ PPWFGRPVC
Sbjct: 355  GTSKDRRIEAIPSLLQDVIDRLVGMQVTTVKPDSCIIDFYNEGDHSHPHIWPPWFGRPVC 414

Query: 843  TLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSIRKQRIL 664
             L LTECD+TFGRV+   HPG+Y               +QGKS DFAKHAI SIRKQRIL
Sbjct: 415  VLLLTECDMTFGRVLVSDHPGDYRGALKLSLTPGSLLLLQGKSTDFAKHAIPSIRKQRIL 474

Query: 663  ITFTKSQPKKVMLSSDGQ--PLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYGAATT 490
            +TFTKSQPKK  + SDGQ  P PT A + H WG                  P HY A  T
Sbjct: 475  VTFTKSQPKKSTM-SDGQRFPGPTPAQSSH-WG---PASGRSPSHIRHPAGPNHYAAVPT 529

Query: 489  TGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXXXXXX 310
            TGVL                  +F+ + + P A+P+ T VP+PP S+GWAA         
Sbjct: 530  TGVL-PAPSIRSQLPPPNGIQPLFVPAPVGP-AIPFATAVPMPPVSAGWAAA--PRHPPP 585

Query: 309  XXXXPGTGVFLPPQGSG--PAPQQTVSASAATVNSTTETPSTLDNTKWSVRSN 157
                PGTGVFLPP GSG   APQQ +  SA   +   E P  ++    S +SN
Sbjct: 586  RIPLPGTGVFLPPPGSGNSSAPQQ-LPYSATQKSPAVEIPPQIEKESGSAKSN 637


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
            gi|947093927|gb|KRH42512.1| hypothetical protein
            GLYMA_08G093800 [Glycine max]
          Length = 683

 Score =  540 bits (1390), Expect = e-150
 Identities = 317/682 (46%), Positives = 405/682 (59%), Gaps = 21/682 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSN-------GGSGGEIHNR-----QWFVDDRDRFISWLRGEFA 1927
            MAMP GNVVI DKMQFPS        GG+GGEIH       QWFVD+RD  I WLR EFA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSEFA 60

Query: 1926 AANAIIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAW 1747
            AANAIIDSLCHHLR VG+PGEYD+V G IQQRRC+WN VL MQQYFS+A+V  ALQQ AW
Sbjct: 61   AANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAW 120

Query: 1746 KKQHRHXXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMD 1567
            ++Q R             KSGS G    Q F         S  E  N SV S +   N+ 
Sbjct: 121  RRQQRPLDPMKVGAKEVRKSGS-GYRHGQRFE--------SVKEGYNSSVESYSHDANVA 171

Query: 1566 SXXXXXXXXXXXEAIQKARAG----KSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVE 1399
                        E  ++ ++G    K  +    + +EK+  + +  S G   LK + + E
Sbjct: 172  VTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGS--LKSARSTE 229

Query: 1398 GIPCDDTKAGAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVEG 1231
            G   +       ND  + N K     S+Q Q + ++L  + KTF+  E+ DGK VNVV+G
Sbjct: 230  GSLSNLESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDG 289

Query: 1230 LKLYGETFDSLKISNLVQLANELRSAGRRGQLKG-QSFIVSKRPMKGRGREVIQLGVPIA 1054
            LKLY + FDS +++NLV L N+LR +G++GQL+G Q++IVS+RPMKG GRE+IQLGV IA
Sbjct: 290  LKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIA 349

Query: 1053 DAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHV 874
            DAP E EN +G+ +D  +E+IPSL Q++IER++S QV TVKPD CI+DF+NEGDHSQPH 
Sbjct: 350  DAPAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHS 409

Query: 873  CPPWFGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHA 694
             P W+GRPV  LFLTEC++TFGRVI   HPG+Y               MQGKS+DFAKHA
Sbjct: 410  WPSWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHA 469

Query: 693  ISSIRKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXP 514
            + S RKQRIL+TFTKSQP+K  LSSD Q L ++ A+ H WG                  P
Sbjct: 470  LPSTRKQRILVTFTKSQPRK-SLSSDAQQLASAVASSH-WG---PPPSRSPNHVRHHVGP 524

Query: 513  KHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAV 334
            KHY    TTGVL                  +F+ + + P  MP+  PVP+P  S+GW A 
Sbjct: 525  KHYATLPTTGVL-PAPPIRPQMAAPVGMQPLFVAAPVVP-PMPFSAPVPIPAGSTGWTAA 582

Query: 333  XXXXXXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNC 154
                        PGTGVFLPP GSG + QQ  +++ A VN +TETP+  +     +  N 
Sbjct: 583  PPPRHPPPRVPAPGTGVFLPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKENGKINHN- 641

Query: 153  NGSASPNSELDGNTKRQDCNGN 88
            + SASP     G  ++Q+CNG+
Sbjct: 642  STSASPK----GKVQKQECNGH 659


>gb|KHN28877.1| hypothetical protein glysoja_025671 [Glycine soja]
          Length = 679

 Score =  538 bits (1386), Expect = e-150
 Identities = 316/678 (46%), Positives = 405/678 (59%), Gaps = 17/678 (2%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSNG----GSGGEIHN----RQWFVDDRDRFISWLRGEFAAANA 1915
            MAMP GNVVI DKMQFPS G    G+ GEIH     +QWFVD+RD  I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAVGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 1914 IIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVTCALQQAAWKKQH 1735
            IIDSLCHHLR VG+PGEYD+V G IQQRRC+WN VL MQQYFS+A+V  ALQQ AW++Q 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAWRRQQ 120

Query: 1734 RHXXXXXXXXXXXXKSGSQGVTSKQWFRVENKENQNSSAEHQNRSVVSSAQFVNMDSXXX 1555
            R             KSGS G    Q F         S  E  N SV S +   N+     
Sbjct: 121  RPLDPMKVGAKEVRKSGS-GYRHGQRFE--------SVKEGYNSSVESYSHDANVAVTGG 171

Query: 1554 XXXXXXXXEAIQKARAG----KSDENKSPNAQEKEVVLGSADSHGDVGLKDSGNVEGIPC 1387
                    E  ++ ++G    K  +    + +EK+  + +  S G   LK + + EG   
Sbjct: 172  TEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSDGS--LKSARSTEGSLS 229

Query: 1386 DDTKAGAANDDTLLNLKD----SIQKQDKEENLDVVPKTFVATEILDGKAVNVVEGLKLY 1219
            +       ND  + N K     S+Q Q + ++L  + KTF+  E+ DGK VNVV+GLKLY
Sbjct: 230  NLESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDGLKLY 289

Query: 1218 GETFDSLKISNLVQLANELRSAGRRGQLKG-QSFIVSKRPMKGRGREVIQLGVPIADAPQ 1042
             + FDS +++NLV L N+LR +G++GQL+G Q++IVS+RPMKG GRE+IQLGV IADAP 
Sbjct: 290  DDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIADAPA 349

Query: 1041 EDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQPHVCPPW 862
            E EN +G+ +D  +E+IPSL Q++IER++S QV TVKPD CI+DF+NEGDHSQPH  P W
Sbjct: 350  EGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSW 409

Query: 861  FGRPVCTLFLTECDVTFGRVIGVGHPGEYNXXXXXXXXXXXXXSMQGKSADFAKHAISSI 682
            +GRPV  LFLTEC++TFGRVI   HPG+Y               MQGKS+DFAKHA+ S 
Sbjct: 410  YGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHALPST 469

Query: 681  RKQRILITFTKSQPKKVMLSSDGQPLPTSAAAPHCWGXXXXXXXXXXXXXXXXXXPKHYG 502
            RKQRIL+TFTKSQP+K  LSSD Q L ++ A+ H WG                  PKHY 
Sbjct: 470  RKQRILVTFTKSQPRK-SLSSDAQQLASAVASSH-WG---PPPSRSPNHVRHHVGPKHYA 524

Query: 501  AATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASSGWAAVXXXX 322
               TTGVL                  +F+ + + P  MP+  PVP+P  S+GW A     
Sbjct: 525  TLPTTGVL-PAPPIRPQMAAPVGMQPLFVAAPVVP-PMPFSAPVPIPAGSTGWTAAPPPR 582

Query: 321  XXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWSVRSNCNGSA 142
                    PGTGVFLPP GSG + QQ  +++ A VN +TETP+  +     +  N + SA
Sbjct: 583  HPPPRVPAPGTGVFLPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKENGKINHN-STSA 641

Query: 141  SPNSELDGNTKRQDCNGN 88
            SP     G  ++Q+CNG+
Sbjct: 642  SPK----GKVQKQECNGH 655


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  537 bits (1383), Expect = e-149
 Identities = 324/702 (46%), Positives = 416/702 (59%), Gaps = 27/702 (3%)
 Frame = -3

Query: 2070 MAMPQGNVVISDKMQFPSN-----------------GGSGGEIH---NRQWFVDDRDRFI 1951
            MAMP GNVV+SDKMQFP+                  GG GGEIH   +RQW  D+RD FI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 1950 SWLRGEFAAANAIIDSLCHHLRSVGEPGEYDVVTGYIQQRRCSWNHVLHMQQYFSIAEVT 1771
             WLRGEFAA+NAIIDSLCHHLR VGE GEY+ V   IQQRRC+WN VLHMQQYFS+AEV+
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 1770 CALQQAAWKKQHRHXXXXXXXXXXXXKSGSQGVTSKQWFRVE-NKENQNSSAEHQNRSVV 1594
             ALQQ AW+++ RH            +SG  G   +   R+E  KE QNS  +    S V
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFKGQ---RMEVAKEGQNSGVDSDGNSTV 176

Query: 1593 SSAQFVNMDSXXXXXXXXXXXEAIQKARAGKSDENKSPNAQEKEVVLGSADSHGDVGLKD 1414
            ++    N                      GK ++  S   ++K+          D G K 
Sbjct: 177  TAVSERNERGSEKREEVKSC------GEVGKVEDKCSTFTEDKK----------DTGSKP 220

Query: 1413 -SGNVEGIPCDDTKAGAANDDTLLNLKDSIQKQDKEENLDVVPKTFVATEILDGKAVNVV 1237
             +G+ E +  +D   G  +     +L  SIQ Q++++NL   PKTFV  E+ DGK VNVV
Sbjct: 221  HAGDAESVT-EDVNGGCTSSYKENDLC-SIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVV 278

Query: 1236 EGLKLYGETFDSLKISNLVQLANELRSAGRRGQLK-GQSFIVSKRPMKGRGREVIQLGVP 1060
            +GLKLY E FD  ++ +LV L N+LR+AG+RGQL+ GQ+++ +KRPMKG GRE+IQLG+P
Sbjct: 279  DGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLP 338

Query: 1059 IADAPQEDENNSGSCEDGKMEAIPSLLQEVIERLLSLQVTTVKPDSCIIDFFNEGDHSQP 880
            IADAP +DEN +G+ +D ++E IP LLQ+ IERL++LQV TVKPDSCIID +NEGDHSQP
Sbjct: 339  IADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQP 398

Query: 879  HVCPPWFGRPVCTLFLTECDVTFGRVIGVG-HPGEYNXXXXXXXXXXXXXSMQGKSADFA 703
             + PPWFG+PVC +FLTECD+TFGRV+ V  HPG+Y               MQGKSADFA
Sbjct: 399  RMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFA 458

Query: 702  KHAISSIRKQRILITFTK-SQPKKVMLSSDGQPLPT-SAAAPHCWGXXXXXXXXXXXXXX 529
            KHA+ S+RKQRIL+TFTK  QPKK   ++D Q L + S +    WG              
Sbjct: 459  KHALPSVRKQRILVTFTKYCQPKK--STTDNQRLSSPSVSQSSQWG---PPPSRSPNRIR 513

Query: 528  XXXXPKHYGAATTTGVLXXXXXXXXXXXXXXXXXXIFITSSLAPAAMPYPTPVPLPPASS 349
                PKHY    TTGVL                  +F+ +++AP A+ +P PVP+PP S+
Sbjct: 514  HSAGPKHYAVIPTTGVL-PAPPIRPQIPPSSGVQPLFVPTAVAP-AISFPAPVPIPPGST 571

Query: 348  GWAAVXXXXXXXXXXXXPGTGVFLPPQGSGPAPQQTVSASAATVNSTTETPSTLDNTKWS 169
            GW A             PGTGVFLPP GSG +  Q +S +A  +N   ET S  +    S
Sbjct: 572  GWPAA--PRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGS 629

Query: 168  VRSNCNGSASPNSELDGNTKRQDCNGNV-GIDLGGTVAGEEQ 46
            V+ N + + SP   LDG + +QDCNG+V G   G  +  EEQ
Sbjct: 630  VKPN-HHTTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQ 670

