BLASTX nr result

ID: Paeonia25_contig00001844 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00001844
         (2960 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]              660   0.0  
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   655   0.0  
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   618   e-174
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     603   e-169
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   603   e-169
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   589   e-165
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   588   e-165
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   585   e-164
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   582   e-163
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   569   e-159
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   569   e-159
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...   560   e-156
ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phas...   558   e-156
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...   550   e-153
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   547   e-153
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   547   e-152
gb|ABK95394.1| unknown [Populus trichocarpa]                          547   e-152
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   544   e-151
ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phas...   522   e-145
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   516   e-143

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  660 bits (1703), Expect = 0.0
 Identities = 380/677 (56%), Positives = 431/677 (63%), Gaps = 44/677 (6%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAII 2182
            MAMPSGNVVISDKMQF  GGG G      EIHH RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 2181 DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 2002
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 2001 FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNEN 1822
             D +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1821 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1684
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1683 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1504
              ++ D    N KGSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 297

Query: 1503 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIVD 1330
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ Q GQTFVVSKRPMKGHGREMIQ G+PI D
Sbjct: 298  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIAD 357

Query: 1329 APPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1150
            AP EDE++ GTSKD + ESIP+ LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ 
Sbjct: 358  APLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIW 417

Query: 1149 PPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAI 970
            P WFG+PVCILFLTECDMTFGR IG D PGDYR            L +QGKS DFAKHAI
Sbjct: 418  PTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAI 477

Query: 969  PSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHY 790
            PS+RKQR+LVTF K+ PKKT+A                     P+R+PNH+RHP GPKHY
Sbjct: 478  PSLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHY 535

Query: 789  -----AXXXXXXXXXXXXXXXPSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXX 634
                                 P NGMQP+FVTT                           
Sbjct: 536  GAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPR 595

Query: 633  XXXPRLPVPGTGVFXXXXXXXXXXXLQQVT--------ETNFSEEKENGS-----VVKEE 493
               PRLPVPGTGVF            Q ++        ET    EKENGS     V KEE
Sbjct: 596  HPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSTVTKEE 655

Query: 492  CNGGVMIKEEENKPAGA 442
                  +K   +KPAGA
Sbjct: 656  QQHNDELK-VASKPAGA 671


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  655 bits (1689), Expect = 0.0
 Identities = 381/708 (53%), Positives = 436/708 (61%), Gaps = 75/708 (10%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAII 2182
            MAMPSGNVVISDKMQF  GGG G      EIHH RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 2181 DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 2002
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 2001 FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNEN 1822
             D +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1821 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1684
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1683 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1504
              ++ D       GSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDG------GSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 291

Query: 1503 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDA 1327
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI DA
Sbjct: 292  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADA 351

Query: 1326 PPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLP 1147
            P EDE++ GTSKD + ESIP+ LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ P
Sbjct: 352  PLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWP 411

Query: 1146 PWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIP 967
             WFG+PVCILFLTECDMTFGR IG D PGDYR            L +QGKS DFAKHAIP
Sbjct: 412  TWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIP 471

Query: 966  SIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHY- 790
            S+RKQR+LVTF K+ PKKT+A                     P+R+PNH+RHP GPKHY 
Sbjct: 472  SLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYG 529

Query: 789  ----AXXXXXXXXXXXXXXXPSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXXX 631
                                P NGMQP+FVTT                            
Sbjct: 530  AVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPRH 589

Query: 630  XXPRLPVPGTGVFXXXXXXXXXXXLQQVT--------ETNFSEEKENGS----------- 508
              PRLPVPGTGVF            Q ++        ET    EKENGS           
Sbjct: 590  PPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSNTVS 649

Query: 507  --------VVKEECNGGV---------MIKEEE---------NKPAGA 442
                    V ++ECNG +         + KEE+         +KPAGA
Sbjct: 650  PKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGA 697


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  618 bits (1594), Expect = e-174
 Identities = 363/685 (52%), Positives = 420/685 (61%), Gaps = 66/685 (9%)
 Frame = -3

Query: 2334 MPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAIIDS 2176
            MPSGNVVISDKMQF  GGGGG      EIHH RQWFPDERDGFISWLRGEFAAANAIIDS
Sbjct: 1    MPSGNVVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60

Query: 2175 LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 1996
            LC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+ D
Sbjct: 61   LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120

Query: 1995 QMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNENVQ 1816
             +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E   
Sbjct: 121  PVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEIYD 177

Query: 1815 DAKHGRE---IGESDHKVPPLADEKKD--GFLKSSGNSDGIMCGNSGLEVKEVG------ 1669
            D K G +   +G+ + K    A EKK+   F+        ++     + V+ V       
Sbjct: 178  DVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVIFGQLEQMLLQNPMQIAVRRVQKTQKDP 237

Query: 1668 ----DKCIPNS----KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVV 1513
                 +  P +      SCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVV
Sbjct: 238  DVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 297

Query: 1512 DGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPI 1336
            DGLKLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI
Sbjct: 298  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPI 357

Query: 1335 VDAPPEDENLSGTSK----DSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDH 1168
             DAP EDE++ GTSK    + + ESIP+ LQDV  +LVG QV+T KPD+CIIDFYNEGDH
Sbjct: 358  ADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVIGQLVGSQVLTVKPDACIIDFYNEGDH 417

Query: 1167 SQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLD 988
            SQPH+ P WFG+PVCILFLTECDMTFGR IG D PGDYR            L +QGKS D
Sbjct: 418  SQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSAD 477

Query: 987  FAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHP 808
            FAKHAIPS+RKQR+LVTF K+ PKKT A                     P+R+PNH+RHP
Sbjct: 478  FAKHAIPSLRKQRILVTFTKSQPKKTTAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHP 535

Query: 807  SGPKHY-----AXXXXXXXXXXXXXXXPSNGMQPIFVTT---XXXXXXXXXXXXXXXXXX 652
             GPKHY                     P NGMQP+FVTT                     
Sbjct: 536  MGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPXPLPTGSPGW 595

Query: 651  XXXXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQQVT--------ETNFSEEKENGS---- 508
                     PRLPVPGTGVF            Q ++        ET    EKENGS    
Sbjct: 596  PAAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSS 655

Query: 507  ---------------VVKEECNGGV 478
                           V ++ECNG +
Sbjct: 656  SNSNTVSPKGKLDGKVHRQECNGSM 680


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  603 bits (1556), Expect = e-169
 Identities = 351/668 (52%), Positives = 406/668 (60%), Gaps = 47/668 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSG-GGGGEIHH---RQWFPDERDGFISWLRGEFAAANAIIDSL 2173
            MAMPSGNVV SDKMQF SG  G GEI H   RQWFPDERDGFISWLRGEFAAANA+IDSL
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMIDSL 60

Query: 2172 CYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQ 1993
            C+HLR+ GEPGEYD VI  IQ RRCNWNPVLHMQQYFS++EV++ALQQVAWRRQ R++D 
Sbjct: 61   CHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFYDP 120

Query: 1992 MKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNENVQD 1813
            +K   KEFKR+G VG++Q QR ++ K+G   + ESH                  +E    
Sbjct: 121  VKMGNKEFKRSG-VGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNA------ASEKGGS 173

Query: 1812 AKHGREIGESDHKVP-PLADEK--------KDGFLKSSGNSDGIMCGNSGLEVKEVGDKC 1660
             K G E+G SD +   P A EK        +DG +KS GN +G++ G+   EV  V D C
Sbjct: 174  DKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEP-EVHAVDDGC 232

Query: 1659 IPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-D 1483
              +SK       ENDS+S   QNE  NL  +PKTF G E+FDGK VNVV+GLKLYE F  
Sbjct: 233  TSSSK-------ENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCA 285

Query: 1482 NSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLS 1303
            ++EVSKL +LVNDLR AG RG FQ QT+VVSKRPMKGHGRE IQ G+PI DAP EDE  +
Sbjct: 286  DTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISA 345

Query: 1302 GTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVC 1123
            GT KD + E+IP  LQDVAERLV MQV T KPDSCIIDFYNEGDHSQPH+ P WFG+PVC
Sbjct: 346  GTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVC 405

Query: 1122 ILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQRVL 943
            +LFLTECDMTFGR   ID PGDYR            LA+QGKS DFAKHAIPS+R+QR+L
Sbjct: 406  VLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRIL 465

Query: 942  VTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYA---XXXXX 772
            VTF K+ PKK++                      P+R+PNHIRHP GPKHYA        
Sbjct: 466  VTFTKSQPKKSMPS-DGQRMPSPGVAPSSHWGPQPSRSPNHIRHP-GPKHYAPVPTTGVL 523

Query: 771  XXXXXXXXXXPSNGMQPIFVT---TXXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGT 601
                      P NG+QP+FVT                               PRLPVPGT
Sbjct: 524  QASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPVPGT 583

Query: 600  GVFXXXXXXXXXXXLQQ---------VTETNFSEEKENGS------------------VV 502
            GVF             Q           ET    EKENGS                    
Sbjct: 584  GVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKVDSKTQ 643

Query: 501  KEECNGGV 478
            K+ECNG +
Sbjct: 644  KQECNGSL 651


>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  603 bits (1556), Expect = e-169
 Identities = 344/635 (54%), Positives = 397/635 (62%), Gaps = 24/635 (3%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGGGEI-------HHRQWFPDERDGFISWLRGEFAAANAII 2182
            M MPSGNVV+SDKMQF SGGGGG +       HHRQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIAQHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 2181 DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 2002
            DSLC+HLR+ GEPGEYDVVIG IQQRRCNWNPVLHMQQYFS++EV+YALQ VAWRRQ RY
Sbjct: 61   DSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRY 120

Query: 2001 FDQMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNE 1825
            +D +K   KEFKR+G VG+ +GQ R EA KEGH  ++ESH            EK  R   
Sbjct: 121  YDPVKAGAKEFKRSG-VGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFER--- 176

Query: 1824 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1645
                   G E+GE   +V P                        G EV ++ DK +  + 
Sbjct: 177  -------GSEVGE---EVEP------------------------GGEVGKLNDKGLAPA- 201

Query: 1644 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-DNSEVS 1468
            G   V   N+S+S+Q QN+KQNL+ +PKTF+G E+ DGK VNVVDGLKLYE F  ++EVS
Sbjct: 202  GEKKV---NESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVS 258

Query: 1467 KLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSKD 1288
            KL SLVNDLR AG+R Q QGQT+VVSKRPMKGHGREMIQ GIPI DAPPEDE  +GTSKD
Sbjct: 259  KLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKD 318

Query: 1287 SKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFLT 1108
             KIE IP+ LQDV +RLVGM VMT KPDSCIID YNEGDHSQPH  P WFG+PVC L+LT
Sbjct: 319  RKIEPIPSLLQDVIDRLVGMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLT 378

Query: 1107 ECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQRVLVTFAK 928
            ECDMTFGR + +D PGDYR            L +QGKS DFAKHAIPSIRKQR+LVT  K
Sbjct: 379  ECDMTFGRLLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTK 438

Query: 927  AIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYA---XXXXXXXXXX 757
            + PKK+                       P+R+PNHIRHP+GPKHYA             
Sbjct: 439  SQPKKSTTS-DGQRFPAPAPAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPI 497

Query: 756  XXXXXPSNGMQPIFV--TTXXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGTGVFXXX 583
                 P NG+QP+FV                               PR+P+PGTGVF   
Sbjct: 498  RSQLPPQNGIQPLFVPAPVGPAIPFAAAVPIPPGSAGWPAAPRHPPPRIPLPGTGVFLPP 557

Query: 582  XXXXXXXXLQQV----------TETNFSEEKENGS 508
                     QQ+           ET    +K+NGS
Sbjct: 558  PGSGNSSAPQQLPGTATEMSPTVETPSPRDKDNGS 592


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  589 bits (1519), Expect = e-165
 Identities = 347/692 (50%), Positives = 413/692 (59%), Gaps = 64/692 (9%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSS------------------GGGGGEIH---HRQWFPDERDGFI 2224
            MAMPSGNVV+SDKMQF +                  GGGGGEIH   HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 2223 SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 2044
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 2043 YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXX 1864
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V+S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1863 XXXXSEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1687
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1686 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1507
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1506 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVD 1330
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ QGQT+V +KRPMKGHGREMIQ G+PI D
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIAD 340

Query: 1329 APPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1150
            AP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M 
Sbjct: 341  APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 400

Query: 1149 PPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHA 973
            PPWFGKPVCI+FLTECD+TFGR + + D PGDYR            L +QGKS DFAKHA
Sbjct: 401  PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 460

Query: 972  IPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPK 796
            +PS+RKQR+LVTF K   PKK+  D                    P+R+PN IRH +GPK
Sbjct: 461  LPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPK 518

Query: 795  HYA---XXXXXXXXXXXXXXXPSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXXX 631
            HYA                  PS+G+QP+FV T                           
Sbjct: 519  HYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPRH 578

Query: 630  XXPRLPVPGTGVFXXXXXXXXXXXLQQVT---------ETNFSEEKENGSV--------- 505
              PRLPVPGTGVF            Q  T         ET    EKENGSV         
Sbjct: 579  PPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSP 638

Query: 504  --------VKEECNGGV--------MIKEEEN 457
                     K++CNG V        ++KEE++
Sbjct: 639  RGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 670


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  588 bits (1516), Expect = e-165
 Identities = 340/669 (50%), Positives = 404/669 (60%), Gaps = 48/669 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGG----GGEIHH--RQWFPDERDGFISWLRGEFAAANAIID 2179
            M MPSGNVV+SDKMQ+ S  G    GGEIH   RQWFPDERDGFISWLRGEFAAANAIID
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAIID 60

Query: 2178 SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 1999
            SLC+HLR+ GEP EYD+VIG +QQRRCNW PVLHMQQYFS++EV+YALQQVAWRRQ RY+
Sbjct: 61   SLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYY 120

Query: 1998 DQMKGTGKEFKRAG-GVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNEN 1822
            + +K   K++KR+  GVG++   R E VKE HT SVE             SE        
Sbjct: 121  EPVKMGNKDYKRSNSGVGFKP--RNEPVKEWHTASVEYRSYDGSGLEKVGSEMR------ 172

Query: 1821 VQDAKHGREIGESDHKVPPLADEKKDGFLK--------SSGNSDGIMCGNSGLEVKEVGD 1666
             ++ K G E G+ D K        K    K        SS NS G + GNS  E   V +
Sbjct: 173  -EEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSESEDAVVNE 231

Query: 1665 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF 1486
             C  + K       EN+S S+Q QNEKQNL+ +PKTFVG E FDGK VNVVDGLKLYE F
Sbjct: 232  GCTSSIK-------ENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTVNVVDGLKLYEEF 284

Query: 1485 -DNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDEN 1309
              ++EVSKL SLVNDLR  GRRGQ QGQT+V+SKRPMKGHGREMIQ GIPI D P EDE 
Sbjct: 285  LGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEI 344

Query: 1308 LSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKP 1129
             +G SKD ++E+IP+ LQDV +RL+G QV+T KPDSCIIDF+NEGDHS PHM PPWFG+P
Sbjct: 345  SAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHSHPHMWPPWFGRP 404

Query: 1128 VCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQR 949
            V +LFLTECD+TFG+ +G+D PGDYR            L +QGKS D+AKHAIPSIRKQR
Sbjct: 405  VSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADYAKHAIPSIRKQR 464

Query: 948  VLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYA---XXX 778
            +LVTF K+ P+K+                         R+PNHIRHP+GPKHYA      
Sbjct: 465  ILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPG-RSPNHIRHPAGPKHYAAVPTTG 523

Query: 777  XXXXXXXXXXXXPSNGMQPIFVT--TXXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPG 604
                        P+NG+QP+FV                               PR+P+PG
Sbjct: 524  VLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPVVIPPGSPGWVAAPRHPPPRMPLPG 583

Query: 603  TGVFXXXXXXXXXXXLQQ-----VTETN-----FSEEKENGS-----------------V 505
            TGVF             Q      TE N      S EK+NG+                  
Sbjct: 584  TGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSSHAIASPKAKLDVKA 643

Query: 504  VKEECNGGV 478
             +++CNG V
Sbjct: 644  QRQDCNGSV 652


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  585 bits (1507), Expect = e-164
 Identities = 347/693 (50%), Positives = 413/693 (59%), Gaps = 65/693 (9%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSS------------------GGGGGEIH---HRQWFPDERDGFI 2224
            MAMPSGNVV+SDKMQF +                  GGGGGEIH   HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 2223 SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 2044
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 2043 YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXX 1864
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V+S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1863 XXXXSEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1687
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1686 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1507
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1506 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIV 1333
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ Q GQT+V +KRPMKGHGREMIQ G+PI 
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIA 340

Query: 1332 DAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1153
            DAP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M
Sbjct: 341  DAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRM 400

Query: 1152 LPPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKH 976
             PPWFGKPVCI+FLTECD+TFGR + + D PGDYR            L +QGKS DFAKH
Sbjct: 401  WPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKH 460

Query: 975  AIPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGP 799
            A+PS+RKQR+LVTF K   PKK+  D                    P+R+PN IRH +GP
Sbjct: 461  ALPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGP 518

Query: 798  KHYA---XXXXXXXXXXXXXXXPSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXX 634
            KHYA                  PS+G+QP+FV T                          
Sbjct: 519  KHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPR 578

Query: 633  XXXPRLPVPGTGVFXXXXXXXXXXXLQQVT---------ETNFSEEKENGSV-------- 505
               PRLPVPGTGVF            Q  T         ET    EKENGSV        
Sbjct: 579  HPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTS 638

Query: 504  ---------VKEECNGGV--------MIKEEEN 457
                      K++CNG V        ++KEE++
Sbjct: 639  PRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 671


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  582 bits (1501), Expect = e-163
 Identities = 348/664 (52%), Positives = 402/664 (60%), Gaps = 54/664 (8%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGG----------GEI-----HHR-QWFP-DERDGFISWLR 2212
            MAMP GNVVISDK+QF +GGGG           EI     HHR QWFP DERDGFISWLR
Sbjct: 1    MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60

Query: 2211 GEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQ 2032
            GEFAAANAIIDSLC+HLR+AGEPGEYDVVIG IQQRRCNWNPVLHMQQYFS+ EV+ ALQ
Sbjct: 61   GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120

Query: 2031 QVAWRRQ-----------HRYF-DQMKGTGKEFKRAGGVGYRQGQRV--EAVKEGHTFSV 1894
            QVA R+Q           HRY+ DQ K  GK+FKR   +G+ +G R   E VKE + +  
Sbjct: 121  QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVN-YGA 179

Query: 1893 ESHXXXXXXXXXXXSEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGF-------L 1735
            ESH            +     NE   + K G + G  ++K    A++KKD         L
Sbjct: 180  ESHGL----------DGNTSGNEKFNEIKSGGDSGRLENKSLATAEDKKDAASKPHVDNL 229

Query: 1734 KSSGNSDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTF 1555
            KSSGNS+G + GN   E + V ++  P          E+DS+ +QNQ  K NLTT PKTF
Sbjct: 230  KSSGNSEGSLSGNLETEAEAVHEQSSPK---------EHDSHFIQNQIVKLNLTTTPKTF 280

Query: 1554 VGTELFDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPM 1378
            VG E+ DGK+VNVVDGLKLYE L D+ EVSKL SLVNDLR AGR+GQFQGQ +VVSKRPM
Sbjct: 281  VGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPM 340

Query: 1377 KGHGREMIQFGIPIVDAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSC 1198
            KGHGREMIQ G+PI DAP E+EN +GTSKD KIESIP  LQ+V ER V MQ+MT KPDSC
Sbjct: 341  KGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSC 400

Query: 1197 IIDFYNEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXX 1018
            IID YNEGDHSQPHM PPWFGKP+ +LFLTECD+TFGR I  D PGDYR           
Sbjct: 401  IIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGS 460

Query: 1017 XLAVQGKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXP 838
             L +QGK+ DFAKHAIP+IRKQRVL+TF K+ PKK                        P
Sbjct: 461  LLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKK-FVQSDGQRLTSPAASPSSHWGPPP 519

Query: 837  TRAPNHIRHPSGPKHYA---XXXXXXXXXXXXXXXPSNGMQPIFVT----TXXXXXXXXX 679
            +R+PNHIRHP   KHYA                  P NG+QP+FVT              
Sbjct: 520  SRSPNHIRHPVS-KHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVP 578

Query: 678  XXXXXXXXXXXXXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQ--QVTETNFS------EE 523
                                +PVPGTGVF            Q    TE NF       ++
Sbjct: 579  MPPVSTGWPAAPRHPPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINFPAETASLQD 638

Query: 522  KENG 511
            KENG
Sbjct: 639  KENG 642


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  569 bits (1466), Expect = e-159
 Identities = 331/674 (49%), Positives = 404/674 (59%), Gaps = 55/674 (8%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGGG--------EIHHR-----QWFPDERDGFISWLRGEFA 2200
            MAMPSGNVVI DKMQF SG GGG        EIH       QWF DERDG I WLR EFA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSEFA 60

Query: 2199 AANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAW 2020
            AANAIIDSLC+HLR  G+PGEYD+V+G+IQQRRCNWN VL MQQYFS+++V YALQQVAW
Sbjct: 61   AANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAW 120

Query: 2019 RRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKE 1840
            RRQ R  D MK   KE +++G  GYR GQR E+VKEG+  SVES+           +EK 
Sbjct: 121  RRQQRPLDPMKVGAKEVRKSGS-GYRHGQRFESVKEGYNSSVESYSHDANVAVTGGTEKG 179

Query: 1839 PRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLE 1684
              V E  ++ K G ++ +   K     +EKKD        G LKS+ +++G +   S LE
Sbjct: 180  TPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGSLKSARSTEGSL---SNLE 236

Query: 1683 VKEV-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1507
             + V  D CI NSKG       ND +S+QNQ++ Q+L+ + KTF+G E+FDGK VNVVDG
Sbjct: 237  SEAVVNDGCISNSKG-------NDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDG 289

Query: 1506 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIV 1333
            LKLY+ LFD++EV+ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+ I 
Sbjct: 290  LKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIA 349

Query: 1332 DAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1153
            DAP E EN++G SKD  +ESIP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH 
Sbjct: 350  DAPAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHS 409

Query: 1152 LPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHA 973
             P W+G+PV +LFLTEC+MTFGR I  + PGDYR            L +QGKS DFAKHA
Sbjct: 410  WPSWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHA 469

Query: 972  IPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKH 793
            +PS RKQR+LVTF K+ P+K+L+                     P+R+PNH+RH  GPKH
Sbjct: 470  LPSTRKQRILVTFTKSQPRKSLSS---DAQQLASAVASSHWGPPPSRSPNHVRHHVGPKH 526

Query: 792  YAXXXXXXXXXXXXXXXPSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXX 634
            YA                     GMQP+FV                              
Sbjct: 527  YATLPTTGVLPAPPIRPQMAAPVGMQPLFVAAPVVPPMPFSAPVPIPAGSTGWTAAPPPR 586

Query: 633  XXXPRLPVPGTGVFXXXXXXXXXXXLQQV-----------TETNFSEEKEN--------- 514
               PR+P PGTGVF            QQ+           TET    EKEN         
Sbjct: 587  HPPPRVPAPGTGVF--LPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKENGKINHNSTS 644

Query: 513  ----GSVVKEECNG 484
                G V K+ECNG
Sbjct: 645  ASPKGKVQKQECNG 658


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  569 bits (1466), Expect = e-159
 Identities = 332/671 (49%), Positives = 408/671 (60%), Gaps = 52/671 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGG-----GEIHH----RQWFPDERDGFISWLRGEFAAANA 2188
            MAMPSGNVVI DKMQF SGG G     GEIH     +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 2187 IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 2008
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 2007 RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXS-EKEPRV 1831
            R  D +K   KEF+++G  GYR GQR E VKEG+  SVES+             EK   V
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESYNQYDANVTVTGGTEKGTPV 179

Query: 1830 NENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKE 1675
             E  ++ K G ++ +   K    A++KKD        G LKS+ +++G +   S LE + 
Sbjct: 180  VEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRSTEGSL---SNLESEA 236

Query: 1674 V-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKL 1498
            V  D+CI NSKG       +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKL
Sbjct: 237  VVNDECISNSKG-------DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKL 289

Query: 1497 YE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAP 1324
            YE LFD++E++ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP
Sbjct: 290  YEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAP 349

Query: 1323 PEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPP 1144
             E EN++G SKD  +E IP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P 
Sbjct: 350  AEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPS 409

Query: 1143 WFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPS 964
            W+G+PV ILFLTEC+MTFGR I  + PGDYR            L ++GKS DFAKHA+PS
Sbjct: 410  WYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPS 469

Query: 963  IRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYAX 784
            +RKQR+LVTF K+ P+K+L+                     P+R+PNH+RH  G KHYA 
Sbjct: 470  VRKQRILVTFTKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYAT 526

Query: 783  XXXXXXXXXXXXXXPSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXX 625
                                GMQP+FVT                                
Sbjct: 527  LPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPP 586

Query: 624  PRLPVPGTGVFXXXXXXXXXXXLQQV-----------TETNFSEEKEN------------ 514
            PR+P PGTGVF            QQ+           TET    EKEN            
Sbjct: 587  PRVPAPGTGVF--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASP 644

Query: 513  -GSVVKEECNG 484
             G V K+ECNG
Sbjct: 645  KGKVQKQECNG 655


>ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max]
          Length = 626

 Score =  560 bits (1444), Expect = e-156
 Identities = 326/656 (49%), Positives = 392/656 (59%), Gaps = 23/656 (3%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGGGEIHHRQ-WFPDERDGFISWLRGEFAAANAIIDSLCYH 2164
            MAMPSGN V+ +K+QF  GGGG EIH+RQ WF DERDGFI WLR EFAAANAIIDSLC+H
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSLCHH 60

Query: 2163 LRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQMKG 1984
            LR  GEPGEYD+V+G+IQQRRCNW  VL MQQYFS+SEV+ ALQQV+WRRQ R  D  K 
Sbjct: 61   LRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVDLAKT 120

Query: 1983 TGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXS-EKEPRVNENVQDA 1810
              KEF++ G  G RQGQ R+EA K+G+  SVES              EK   + E   + 
Sbjct: 121  GAKEFRKFGS-GIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVAGGVEKGTPLTEKNGEI 179

Query: 1809 KHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGDKCIP 1654
            K G ++G  D+K     +E+KD        G LK SGNS G +   S  E   V ++C+ 
Sbjct: 180  KSGGKVGTMDNKSLASPEERKDTITNHQSDGILKGSGNSQGSL-STSECEAVGVNEECVS 238

Query: 1653 NSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNS 1477
            NSK       ENDS             T+ KTF+G E+FDGK VNVVDGLKLYE L D +
Sbjct: 239  NSK-------ENDS-------------TMGKTFIGNEMFDGKMVNVVDGLKLYEDLLDRT 278

Query: 1476 EVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSG 1300
            EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +N++G
Sbjct: 279  EVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTG 338

Query: 1299 TSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCI 1120
             SKD K+ESIP+  QD+ +RLV  QVMT KPD+CI+DF+NEG+HS P+  PPWFG+P+ I
Sbjct: 339  ISKDKKVESIPSLFQDIIKRLVASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPLYI 398

Query: 1119 LFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQRVLV 940
            LFLTECDMTFGR I  D PG++R            L +QGKS DFAKHA+PSI KQR++V
Sbjct: 399  LFLTECDMTFGRIIVSDHPGEFRGAVTLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIV 458

Query: 939  TFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYAXXXXXXXXX 760
            TF K+ P+ +L +                    P+R+PNH+RH  GPKHY          
Sbjct: 459  TFTKSQPRSSLPN----DSERLAPPAAPHWAPPPSRSPNHVRHQLGPKHY------PTVQ 508

Query: 759  XXXXXXPSNGMQPIFV-----TTXXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGTGV 595
                    NGMQP+FV                                  PR+PVPGTGV
Sbjct: 509  ATGVLPAPNGMQPLFVPVPVPVASPMSFPTPVPIPPGSIGWTSAPPRHPPPRIPVPGTGV 568

Query: 594  FXXXXXXXXXXXLQQVTETNFSEEKENGSVVKEECN-----GGVMIKEEENKPAGA 442
            F           +    ET     KENG     + N      GV  + E N    A
Sbjct: 569  FLPPPGSGTIHEVNPSVETWTVSGKENGKSNHSKTNSEAEEAGVEKEHESNDMTAA 624


>ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris]
            gi|561026542|gb|ESW25182.1| hypothetical protein
            PHAVU_003G014200g [Phaseolus vulgaris]
          Length = 691

 Score =  558 bits (1439), Expect = e-156
 Identities = 333/695 (47%), Positives = 402/695 (57%), Gaps = 61/695 (8%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGG----GGEIH--HRQWFPDERDGFISWLRGEFAAANAIID 2179
            MAMPSGN  + +K+QF  GGG    GGEI   H+QWF DERDGFI WLR EFAAANAIID
Sbjct: 1    MAMPSGNGGMPEKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAIID 60

Query: 2178 SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 1999
            SLC HLR  GEPG YD+V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQVAWRRQ R+ 
Sbjct: 61   SLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQRFV 120

Query: 1998 DQMKGTGKEFKRAGGVGYRQGQ-------------RVEAVKEGHTFSVESHXXXXXXXXX 1858
            D  K   KEF++ G  G+RQGQ             R EA KEG+   VES          
Sbjct: 121  DPAKAGSKEFRKFGS-GFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGREMNAVVV 179

Query: 1857 XXS-EKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIM 1705
                EK  RV +   +   G ++G  D+      +E KD        G L  SGN  G +
Sbjct: 180  TGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGILNGSGNFQGSL 239

Query: 1704 CGNSGLEVKEVGD--KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDG 1531
               S  E + VG+  +C  NSKG       NDS+S+QNQ++ QN +T+ KTF+G E+F+G
Sbjct: 240  ---SSSECEAVGENEECTSNSKG-------NDSHSVQNQHQSQNASTIGKTFIGNEMFEG 289

Query: 1530 KAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREM 1357
            K VNVVDGLKLYE L D++EVSKL SLVND+RVAG+RGQFQG QTFVVSKRP+KG GREM
Sbjct: 290  KMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGRGREM 349

Query: 1356 IQFGIPIVDAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNE 1177
            IQ G+PI DAPP+ +N++G SKD K+ESIP+  +D+ ERL   QVMT KPD+CI+DF+NE
Sbjct: 350  IQLGVPIADAPPDVDNVTGLSKDKKVESIPSLFEDIIERLAASQVMTVKPDACIVDFFNE 409

Query: 1176 GDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGK 997
            GDHSQP+  PPWFG+PV +LFLTECD+TFGR I  D PGDYR            L +QGK
Sbjct: 410  GDHSQPNSCPPWFGRPVYMLFLTECDITFGRTIVSDHPGDYRGAVKLSLVPGSLLVMQGK 469

Query: 996  SLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHI 817
            S D AKHA+PSI KQR+LVTF K+ PK +L +                      R PNH+
Sbjct: 470  STDLAKHALPSIHKQRILVTFTKSQPKTSLPNDSQRLSPAVTSHWAPPQG----RTPNHM 525

Query: 816  RHPSGPKHYAXXXXXXXXXXXXXXXPSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXX 646
            RH  GPKHY                P NGMQ +FV T                       
Sbjct: 526  RHQLGPKHYPTIPATGVLPAPSIRAPPNGMQTLFVPTPVAPPISFASPVPIPLGSTGWAS 585

Query: 645  XXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQ---QVTETNFSEE---------------- 523
                   PR+PVPGTGVF                 V+E N S E                
Sbjct: 586  APQRHPPPRMPVPGTGVFLPPPGSGTTSSQHLPGVVSEVNLSGETTSTGKESLKSNHNTI 645

Query: 522  ------KENGSVV-KEECNGGVMIKEEENKPAGAD 439
                  K +G+VV ++ECNG     E E    G +
Sbjct: 646  NSSPKGKVDGNVVGRQECNGNADRSEGEEDVVGKE 680


>ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032200|gb|ESW30779.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  550 bits (1417), Expect = e-153
 Identities = 322/667 (48%), Positives = 395/667 (59%), Gaps = 48/667 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGG---GEI--HH--RQWFPDERDGFISWLRGEFAAANAII 2182
            MAMPSGNVVI DKMQF +GGGG   GEI  HH  +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 2181 DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 2002
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 2001 FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVNEN 1822
             D +K   KE ++ G  GYR G R E  KEG+  SVES+            EK     + 
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1821 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1666
             ++ K G ++ +   K     +EKKD        G LKS+G+S+G +  N   E   V D
Sbjct: 180  SEEHKSGSKVEKVGDKGLASPEEKKDAIIKHQTDGNLKSTGSSEGYL-SNLESEAVVVND 238

Query: 1665 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1489
            + I NSKG       NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +
Sbjct: 239  EFISNSKG-------NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDI 291

Query: 1488 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1312
            FD++EVS L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E E
Sbjct: 292  FDSTEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGE 351

Query: 1311 NLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1132
            N++G SK   +E IP+  +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+
Sbjct: 352  NMTGASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGR 411

Query: 1131 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQ 952
            PV  LFLTEC+MTFGR I  + PGDYR            LA+QGKS DFAKHA+PSIRKQ
Sbjct: 412  PVYTLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQ 471

Query: 951  RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYA---XX 781
            R+LVTF K+ PKK++                      P+R+PNH+RH  G KHYA     
Sbjct: 472  RILVTFTKSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTT 528

Query: 780  XXXXXXXXXXXXXPSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXPRLP 613
                            GMQP+FV                                 PR+P
Sbjct: 529  GVLPAPPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIP 588

Query: 612  VPGTGVF---XXXXXXXXXXXLQQVTETNFS-------EEKEN--------------GSV 505
             PGTGVF                 + E N S       +EKEN              G V
Sbjct: 589  APGTGVFLPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKV 648

Query: 504  VKEECNG 484
             K+ECNG
Sbjct: 649  QKQECNG 655


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  547 bits (1410), Expect = e-153
 Identities = 328/659 (49%), Positives = 382/659 (57%), Gaps = 49/659 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 2194
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 2193 NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 2014
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 2013 QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1897
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1896 VESHXXXXXXXXXXXSEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNS 1717
            VE+H           SEK        ++ K G + G+SD K    A    D    SSGN+
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGNA 233

Query: 1716 DGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELF 1537
             G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E  
Sbjct: 234  QGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEKI 282

Query: 1536 DGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGRE 1360
            DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGRE
Sbjct: 283  DGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGRE 342

Query: 1359 MIQFGIPIVDAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYN 1180
            MIQ G+PI DAP EDEN +GTSK+ ++ESIPA LQDV E  V MQVMT KPDSCIID YN
Sbjct: 343  MIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYN 402

Query: 1179 EGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQG 1000
            EGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+            L +QG
Sbjct: 403  EGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQG 462

Query: 999  KSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNH 820
            KS D AKHAIP I+KQR+LVTF K+ PKK L                      P+R+PNH
Sbjct: 463  KSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNH 521

Query: 819  IRHPSGPKHYA---XXXXXXXXXXXXXXXPSNGMQPIFVTT-----XXXXXXXXXXXXXX 664
            +RHP  PKHYA                  P NG+QP+F+TT                   
Sbjct: 522  LRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVST 580

Query: 663  XXXXXXXXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQQV----TETNF----SEEKENG 511
                           +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  GWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 639


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  547 bits (1409), Expect = e-152
 Identities = 311/603 (51%), Positives = 374/603 (62%), Gaps = 20/603 (3%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGG----GGEIHHRQ-WFPDERDGFISWLRGEFAAANAIIDS 2176
            MAMPSGN V+ +K+QF  GGG    G EIH RQ WF DERDGFI WLR EFAAANAIIDS
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGAPGGGSEIHFRQQWFVDERDGFIGWLRSEFAAANAIIDS 60

Query: 2175 LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 1996
            LC+HLR  GEPGEY++V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQV+WRRQ R  D
Sbjct: 61   LCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVSWRRQQRVVD 120

Query: 1995 QMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXS-EKEPRVNEN 1822
              K   KEF++ G +G++QGQ R EAVK+G+  SVES              EK   V E 
Sbjct: 121  PAKTGAKEFRKFG-LGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAGGVEKGACVTEK 179

Query: 1821 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1666
              + K G  +G  D+K     +E+KD        G LK S NS G +  +S  E   V +
Sbjct: 180  NGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGILKGSRNSQGSL-SSSECEAVGVNE 238

Query: 1665 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1489
            +C+ NSK       ENDS              + K F+G E+FDGK VNVVDGLKLYE L
Sbjct: 239  ECVSNSK-------ENDSI-------------MGKFFIGNEMFDGKMVNVVDGLKLYEDL 278

Query: 1488 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1312
             D++EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +
Sbjct: 279  LDSTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVD 338

Query: 1311 NLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1132
            N++G SKD K+ESIP+  QD+ ERL   QVMT KPD+CI+DF+NEG+HS P+  PPWFG+
Sbjct: 339  NVTGISKDKKVESIPSLFQDIIERLAASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGR 398

Query: 1131 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQ 952
            PV  LFLTECDMTFGR I  D PG++R            L +QGKS DFAKHA+PSI KQ
Sbjct: 399  PVYTLFLTECDMTFGRIIVSDHPGEFRGAVRLSLVPGSLLVMQGKSTDFAKHALPSIHKQ 458

Query: 951  RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYAXXXXX 772
            R+++TF K+ PK +L +                     +R+PNH+RH  GPKHY      
Sbjct: 459  RIIITFTKSQPKCSLPN----DSQRLAPPAASHWAPPQSRSPNHVRHQLGPKHYPTVPAT 514

Query: 771  XXXXXXXXXXPSNGMQPIFV---TTXXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGT 601
                      P N MQP+FV                                PR+PVPGT
Sbjct: 515  VVLPAPSIHAPPNSMQPLFVPAPVAPPMSFPTPVPIPPGSTGWTSAPSRHPPPRIPVPGT 574

Query: 600  GVF 592
            GVF
Sbjct: 575  GVF 577


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  547 bits (1409), Expect = e-152
 Identities = 328/660 (49%), Positives = 382/660 (57%), Gaps = 50/660 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 2194
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 2193 NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 2014
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 2013 QHR-----------------YFDQMKGTGKEFKRAGGVGYRQGQRV-----EAVKEGHTF 1900
            Q +                 Y+D  K  G++FKR+   G+ +G R      +AVKEG   
Sbjct: 121  QQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGDAVKEGVNS 180

Query: 1899 SVESHXXXXXXXXXXXSEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGN 1720
            SVE+H           SEK        ++ K G + G+SD K    A    D    SSGN
Sbjct: 181  SVENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGN 233

Query: 1719 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1540
            + G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEK 282

Query: 1539 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1363
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 283  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 342

Query: 1362 EMIQFGIPIVDAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFY 1183
            EMIQ G+PI DAP EDEN +GTSK+ ++ESIPA LQDV E  V MQVMT KPDSCIID Y
Sbjct: 343  EMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 402

Query: 1182 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQ 1003
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+            L +Q
Sbjct: 403  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 462

Query: 1002 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPN 823
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                      P+R+PN
Sbjct: 463  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 521

Query: 822  HIRHPSGPKHYA---XXXXXXXXXXXXXXXPSNGMQPIFVTT-----XXXXXXXXXXXXX 667
            H+RHP  PKHYA                  P NG+QP+F+TT                  
Sbjct: 522  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 580

Query: 666  XXXXXXXXXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQQV----TETNF----SEEKENG 511
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 640


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max]
          Length = 641

 Score =  544 bits (1401), Expect = e-151
 Identities = 317/661 (47%), Positives = 392/661 (59%), Gaps = 42/661 (6%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGG-----GEIHH----RQWFPDERDGFISWLRGEFAAANA 2188
            MAMPSGNVVI DKMQF SGG G     GEIH     +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 2187 IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 2008
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 2007 RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEKEPRVN 1828
            R  D +K   KEF+++G  GYR GQR E VKEG+  SVES+                   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESY------------------- 160

Query: 1827 ENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNS 1648
             N  DA     +     K  P+ ++ ++                SG +V++VGDK + ++
Sbjct: 161  -NQYDANV--TVTGGTEKGTPVVEKSEEH--------------KSGGKVEKVGDKGLASA 203

Query: 1647 KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEV 1471
            +        +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKLYE LFD++E+
Sbjct: 204  EDKKG----DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTEI 259

Query: 1470 SKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTS 1294
            + L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP E EN++G S
Sbjct: 260  ANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENMTGAS 319

Query: 1293 KDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILF 1114
            KD  +E IP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P W+G+PV ILF
Sbjct: 320  KDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYILF 379

Query: 1113 LTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQRVLVTF 934
            LTEC+MTFGR I  + PGDYR            L ++GKS DFAKHA+PS+RKQR+LVTF
Sbjct: 380  LTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRILVTF 439

Query: 933  AKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYAXXXXXXXXXXX 754
             K+ P+K+L+                     P+R+PNH+RH  G KHYA           
Sbjct: 440  TKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYATLPTTGVLPSP 496

Query: 753  XXXXPSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGTGV 595
                      GMQP+FVT                                PR+P PGTGV
Sbjct: 497  PIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPAPGTGV 556

Query: 594  FXXXXXXXXXXXLQQV-----------TETNFSEEKEN-------------GSVVKEECN 487
            F            QQ+           TET    EKEN             G V K+ECN
Sbjct: 557  F--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASPKGKVQKQECN 614

Query: 486  G 484
            G
Sbjct: 615  G 615


>ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032201|gb|ESW30780.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 630

 Score =  522 bits (1345), Expect = e-145
 Identities = 310/660 (46%), Positives = 381/660 (57%), Gaps = 41/660 (6%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSGGGG---GEI--HH--RQWFPDERDGFISWLRGEFAAANAII 2182
            MAMPSGNVVI DKMQF +GGGG   GEI  HH  +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 2181 DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 2002
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 2001 FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXSEK-EPRVNE 1825
             D +K   KE ++ G  GYR G R E  KEG+  SVES+            EK  P V++
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1824 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1645
            +             +HK                          SG +V++VGDK + + +
Sbjct: 180  S------------EEHK--------------------------SGSKVEKVGDKGLASPE 201

Query: 1644 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEVS 1468
                    NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +FD++EVS
Sbjct: 202  EKKG----NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDSTEVS 257

Query: 1467 KLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSK 1291
             L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E EN++G SK
Sbjct: 258  NLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGENMTGASK 317

Query: 1290 DSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFL 1111
               +E IP+  +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+PV  LFL
Sbjct: 318  VMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVYTLFL 377

Query: 1110 TECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQGKSLDFAKHAIPSIRKQRVLVTFA 931
            TEC+MTFGR I  + PGDYR            LA+QGKS DFAKHA+PSIRKQR+LVTF 
Sbjct: 378  TECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQRILVTFT 437

Query: 930  KAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPNHIRHPSGPKHYA---XXXXXXXXX 760
            K+ PKK++                      P+R+PNH+RH  G KHYA            
Sbjct: 438  KSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTTGVLPAPP 494

Query: 759  XXXXXXPSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXPRLPVPGTGVF 592
                     GMQP+FV                                 PR+P PGTGVF
Sbjct: 495  IRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIPAPGTGVF 554

Query: 591  ---XXXXXXXXXXXLQQVTETNFS-------EEKEN--------------GSVVKEECNG 484
                             + E N S       +EKEN              G V K+ECNG
Sbjct: 555  LPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKVQKQECNG 614


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  516 bits (1330), Expect = e-143
 Identities = 319/660 (48%), Positives = 369/660 (55%), Gaps = 50/660 (7%)
 Frame = -3

Query: 2340 MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 2194
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 2193 NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 2014
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 2013 QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1897
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1896 VESHXXXXXXXXXXXSEKEPRVNENVQDAKHGREIGESDHK-VPPLADEKKDGFLKSSGN 1720
            VE+H           SEK        ++ K G + G+SD K     A    D    SSGN
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKADATAKSHTDNHKNSSGN 233

Query: 1719 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1540
            + G   GNS                                 NEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNSEAVA-----------------------------NEKQNLAITPKTFVAEEK 264

Query: 1539 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1363
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 265  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 324

Query: 1362 EMIQFGIPIVDAPPEDENLSGTSKDSKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFY 1183
            EMIQ G+PI DAP EDEN +GTSK + +ESIPA LQDV E  V MQVMT KPDSCIID Y
Sbjct: 325  EMIQLGLPIADAPAEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 383

Query: 1182 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXLAVQ 1003
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+            L +Q
Sbjct: 384  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 443

Query: 1002 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXPTRAPN 823
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                      P+R+PN
Sbjct: 444  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 502

Query: 822  HIRHPSGPKHYA---XXXXXXXXXXXXXXXPSNGMQPIFVTT-----XXXXXXXXXXXXX 667
            H+RHP  PKHYA                  P NG+QP+F+TT                  
Sbjct: 503  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 561

Query: 666  XXXXXXXXXXXXXXPRLPVPGTGVFXXXXXXXXXXXLQQV----TETNF----SEEKENG 511
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 562  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 621

