BLASTX nr result

ID: Paeonia22_contig00005984 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00005984
         (2932 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]              659   0.0  
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   654   0.0  
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   617   e-174
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     603   e-169
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   603   e-169
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   588   e-165
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   587   e-165
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   584   e-164
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   582   e-163
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   569   e-159
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   569   e-159
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...   559   e-156
ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phas...   557   e-156
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...   550   e-153
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   546   e-152
gb|ABK95394.1| unknown [Populus trichocarpa]                          546   e-152
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   546   e-152
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   544   e-151
ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phas...   522   e-145
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   516   e-143

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  659 bits (1701), Expect = 0.0
 Identities = 376/677 (55%), Positives = 427/677 (63%), Gaps = 44/677 (6%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAII 768
            MAMPSGNVVISDKMQF  GGG G      EIHH RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 769  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 948
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 949  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNEN 1128
             D +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1129 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1266
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1267 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1446
              ++ D    N KGSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 297

Query: 1447 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIVD 1620
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ Q GQTFVVSKRPMKGHGREMIQ G+PI D
Sbjct: 298  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIAD 357

Query: 1621 APPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1800
            AP EDE++ GTSKD + ESIP+ LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ 
Sbjct: 358  APLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIW 417

Query: 1801 PPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAI 1980
            P WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS DFAKHAI
Sbjct: 418  PTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAI 477

Query: 1981 PSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHY 2160
            PS+RKQR+LVTF K+ PKKT+A                      +R+PNH+RHP GPKHY
Sbjct: 478  PSLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHY 535

Query: 2161 -----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXX 2316
                                   NGMQP+FVTT                           
Sbjct: 536  GAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPR 595

Query: 2317 XXXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS-----VVKEE 2457
                RLPVPGTGVF            Q ++        ET    EKENGS     V KEE
Sbjct: 596  HPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSTVTKEE 655

Query: 2458 CNGGVMIKEEENKPAGA 2508
                  +K   +KPAGA
Sbjct: 656  QQHNDELK-VASKPAGA 671


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  654 bits (1687), Expect = 0.0
 Identities = 377/708 (53%), Positives = 432/708 (61%), Gaps = 75/708 (10%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAII 768
            MAMPSGNVVISDKMQF  GGG G      EIHH RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 769  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 948
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 949  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNEN 1128
             D +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1129 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1266
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1267 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1446
              ++ D       GSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDG------GSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 291

Query: 1447 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDA 1623
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI DA
Sbjct: 292  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADA 351

Query: 1624 PPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLP 1803
            P EDE++ GTSKD + ESIP+ LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ P
Sbjct: 352  PLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWP 411

Query: 1804 PWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIP 1983
             WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS DFAKHAIP
Sbjct: 412  TWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIP 471

Query: 1984 SIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHY- 2160
            S+RKQR+LVTF K+ PKKT+A                      +R+PNH+RHP GPKHY 
Sbjct: 472  SLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYG 529

Query: 2161 ----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXXX 2319
                                  NGMQP+FVTT                            
Sbjct: 530  AVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPRH 589

Query: 2320 XXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS----------- 2442
               RLPVPGTGVF            Q ++        ET    EKENGS           
Sbjct: 590  PPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSNTVS 649

Query: 2443 --------VVKEECNGGV---------MIKEEE---------NKPAGA 2508
                    V ++ECNG +         + KEE+         +KPAGA
Sbjct: 650  PKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGA 697


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  617 bits (1592), Expect = e-174
 Identities = 359/685 (52%), Positives = 415/685 (60%), Gaps = 66/685 (9%)
 Frame = +1

Query: 616  MPSGNVVISDKMQFSSGGGGG------EIHH-RQWFPDERDGFISWLRGEFAAANAIIDS 774
            MPSGNVVISDKMQF  GGGGG      EIHH RQWFPDERDGFISWLRGEFAAANAIIDS
Sbjct: 1    MPSGNVVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60

Query: 775  LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 954
            LC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+ D
Sbjct: 61   LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120

Query: 955  QMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNENVQ 1134
             +KG GKE+KR G V YRQGQR E  K+ H  + E+H            EK  RV+E   
Sbjct: 121  PVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEIYD 177

Query: 1135 DAKHGRE---IGESDHKVPPLADEKKD--GFLKSSGNSDGIMCGNSGLEVKEVG------ 1281
            D K G +   +G+ + K    A EKK+   F+        ++     + V+ V       
Sbjct: 178  DVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVIFGQLEQMLLQNPMQIAVRRVQKTQKDP 237

Query: 1282 ----DKCIPNS----KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVV 1437
                 +  P +      SCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVV
Sbjct: 238  DVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 297

Query: 1438 DGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPI 1614
            DGLKLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI
Sbjct: 298  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPI 357

Query: 1615 VDAPPEDENLSGTSKDC----KIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDH 1782
             DAP EDE++ GTSK      + ESIP+ LQDV  +LVG QV+T KPD+CIIDFYNEGDH
Sbjct: 358  ADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVIGQLVGSQVLTVKPDACIIDFYNEGDH 417

Query: 1783 SQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLD 1962
            SQPH+ P WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS D
Sbjct: 418  SQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSAD 477

Query: 1963 FAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHP 2142
            FAKHAIPS+RKQR+LVTF K+ PKKT A                      +R+PNH+RHP
Sbjct: 478  FAKHAIPSLRKQRILVTFTKSQPKKTTAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHP 535

Query: 2143 SGPKHY-----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXX 2298
             GPKHY                       NGMQP+FVTT                     
Sbjct: 536  MGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPXPLPTGSPGW 595

Query: 2299 XXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS---- 2442
                      RLPVPGTGVF            Q ++        ET    EKENGS    
Sbjct: 596  PAAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSS 655

Query: 2443 ---------------VVKEECNGGV 2472
                           V ++ECNG +
Sbjct: 656  SNSNTVSPKGKLDGKVHRQECNGSM 680


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  603 bits (1554), Expect = e-169
 Identities = 347/668 (51%), Positives = 402/668 (60%), Gaps = 47/668 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSG-GGGGEIHH---RQWFPDERDGFISWLRGEFAAANAIIDSL 777
            MAMPSGNVV SDKMQF SG  G GEI H   RQWFPDERDGFISWLRGEFAAANA+IDSL
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMIDSL 60

Query: 778  CYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQ 957
            C+HLR+ GEPGEYD VI  IQ RRCNWNPVLHMQQYFS++EV++ALQQVAWRRQ R++D 
Sbjct: 61   CHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFYDP 120

Query: 958  MKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNENVQD 1137
            +K   KEFKR+G VG++Q QR ++ K+G   + ESH                  +E    
Sbjct: 121  VKMGNKEFKRSG-VGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNA------ASEKGGS 173

Query: 1138 AKHGREIGESDHKVP-PLADEK--------KDGFLKSSGNSDGIMCGNSGLEVKEVGDKC 1290
             K G E+G SD +   P A EK        +DG +KS GN +G++ G+   EV  V D C
Sbjct: 174  DKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEP-EVHAVDDGC 232

Query: 1291 IPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-D 1467
              +SK       ENDS+S   QNE  NL  +PKTF G E+FDGK VNVV+GLKLYE F  
Sbjct: 233  TSSSK-------ENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCA 285

Query: 1468 NSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLS 1647
            ++EVSKL +LVNDLR AG RG FQ QT+VVSKRPMKGHGRE IQ G+PI DAP EDE  +
Sbjct: 286  DTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISA 345

Query: 1648 GTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVC 1827
            GT KD + E+IP  LQDVAERLV MQV T KPDSCIIDFYNEGDHSQPH+ P WFG+PVC
Sbjct: 346  GTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVC 405

Query: 1828 ILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVL 2007
            +LFLTECDMTFGR   ID PGDYR             A+QGKS DFAKHAIPS+R+QR+L
Sbjct: 406  VLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRIL 465

Query: 2008 VTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXX 2178
            VTF K+ PKK++                       +R+PNHIRHP GPKHYA        
Sbjct: 466  VTFTKSQPKKSMPS-DGQRMPSPGVAPSSHWGPQPSRSPNHIRHP-GPKHYAPVPTTGVL 523

Query: 2179 XXXXXXXXXXXSNGMQPIFVT---TXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGT 2349
                        NG+QP+FVT                                RLPVPGT
Sbjct: 524  QASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPVPGT 583

Query: 2350 GVFXXXXXXXXXXXXQQ---------VTETNFSEEKENGS------------------VV 2448
            GVF             Q           ET    EKENGS                    
Sbjct: 584  GVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKVDSKTQ 643

Query: 2449 KEECNGGV 2472
            K+ECNG +
Sbjct: 644  KQECNGSL 651


>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  603 bits (1554), Expect = e-169
 Identities = 340/635 (53%), Positives = 393/635 (61%), Gaps = 24/635 (3%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGGGEI-------HHRQWFPDERDGFISWLRGEFAAANAII 768
            M MPSGNVV+SDKMQF SGGGGG +       HHRQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIAQHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 769  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 948
            DSLC+HLR+ GEPGEYDVVIG IQQRRCNWNPVLHMQQYFS++EV+YALQ VAWRRQ RY
Sbjct: 61   DSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRY 120

Query: 949  FDQMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNE 1125
            +D +K   KEFKR+G VG+ +GQ R EA KEGH  ++ESH            EK  R   
Sbjct: 121  YDPVKAGAKEFKRSG-VGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFER--- 176

Query: 1126 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1305
                   G E+GE   +V P                        G EV ++ DK +  + 
Sbjct: 177  -------GSEVGE---EVEP------------------------GGEVGKLNDKGLAPA- 201

Query: 1306 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-DNSEVS 1482
            G   V   N+S+S+Q QN+KQNL+ +PKTF+G E+ DGK VNVVDGLKLYE F  ++EVS
Sbjct: 202  GEKKV---NESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVS 258

Query: 1483 KLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSKD 1662
            KL SLVNDLR AG+R Q QGQT+VVSKRPMKGHGREMIQ GIPI DAPPEDE  +GTSKD
Sbjct: 259  KLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKD 318

Query: 1663 CKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFLT 1842
             KIE IP+ LQDV +RLVGM VMT KPDSCIID YNEGDHSQPH  P WFG+PVC L+LT
Sbjct: 319  RKIEPIPSLLQDVIDRLVGMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLT 378

Query: 1843 ECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTFAK 2022
            ECDMTFGR + +D PGDYR              +QGKS DFAKHAIPSIRKQR+LVT  K
Sbjct: 379  ECDMTFGRLLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTK 438

Query: 2023 AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXXXXXXX 2193
            + PKK+                        +R+PNHIRHP+GPKHYA             
Sbjct: 439  SQPKKSTTS-DGQRFPAPAPAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPI 497

Query: 2194 XXXXXXSNGMQPIFV--TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGVFXXX 2367
                   NG+QP+FV                                R+P+PGTGVF   
Sbjct: 498  RSQLPPQNGIQPLFVPAPVGPAIPFAAAVPIPPGSAGWPAAPRHPPPRIPLPGTGVFLPP 557

Query: 2368 XXXXXXXXXQQV----------TETNFSEEKENGS 2442
                     QQ+           ET    +K+NGS
Sbjct: 558  PGSGNSSAPQQLPGTATEMSPTVETPSPRDKDNGS 592


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  588 bits (1517), Expect = e-165
 Identities = 343/692 (49%), Positives = 409/692 (59%), Gaps = 64/692 (9%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSS------------------GGGGGEIH---HRQWFPDERDGFI 726
            MAMPSGNVV+SDKMQF +                  GGGGGEIH   HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 727  SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 906
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 907  YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXX 1086
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V+S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1087 XXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1263
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1264 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1443
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1444 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVD 1620
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ QGQT+V +KRPMKGHGREMIQ G+PI D
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIAD 340

Query: 1621 APPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1800
            AP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M 
Sbjct: 341  APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 400

Query: 1801 PPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHA 1977
            PPWFGKPVCI+FLTECD+TFGR + + D PGDYR              +QGKS DFAKHA
Sbjct: 401  PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 460

Query: 1978 IPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPK 2154
            +PS+RKQR+LVTF K   PKK+  D                     +R+PN IRH +GPK
Sbjct: 461  LPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPK 518

Query: 2155 HYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXXX 2319
            HYA                   S+G+QP+FV T                           
Sbjct: 519  HYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPRH 578

Query: 2320 XXXRLPVPGTGVFXXXXXXXXXXXXQQVT---------ETNFSEEKENGSV--------- 2445
               RLPVPGTGVF            Q  T         ET    EKENGSV         
Sbjct: 579  PPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSP 638

Query: 2446 --------VKEECNGGV--------MIKEEEN 2493
                     K++CNG V        ++KEE++
Sbjct: 639  RGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 670


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  587 bits (1514), Expect = e-165
 Identities = 336/669 (50%), Positives = 400/669 (59%), Gaps = 48/669 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGG----GGEIHH--RQWFPDERDGFISWLRGEFAAANAIID 771
            M MPSGNVV+SDKMQ+ S  G    GGEIH   RQWFPDERDGFISWLRGEFAAANAIID
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAIID 60

Query: 772  SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 951
            SLC+HLR+ GEP EYD+VIG +QQRRCNW PVLHMQQYFS++EV+YALQQVAWRRQ RY+
Sbjct: 61   SLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYY 120

Query: 952  DQMKGTGKEFKRAG-GVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNEN 1128
            + +K   K++KR+  GVG++   R E VKE HT SVE              E        
Sbjct: 121  EPVKMGNKDYKRSNSGVGFKP--RNEPVKEWHTASVEYRSYDGSGLEKVGSEMR------ 172

Query: 1129 VQDAKHGREIGESDHKVPPLADEKKDGFLK--------SSGNSDGIMCGNSGLEVKEVGD 1284
             ++ K G E G+ D K        K    K        SS NS G + GNS  E   V +
Sbjct: 173  -EEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSESEDAVVNE 231

Query: 1285 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF 1464
             C  + K       EN+S S+Q QNEKQNL+ +PKTFVG E FDGK VNVVDGLKLYE F
Sbjct: 232  GCTSSIK-------ENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTVNVVDGLKLYEEF 284

Query: 1465 -DNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDEN 1641
              ++EVSKL SLVNDLR  GRRGQ QGQT+V+SKRPMKGHGREMIQ GIPI D P EDE 
Sbjct: 285  LGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEI 344

Query: 1642 LSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKP 1821
             +G SKD ++E+IP+ LQDV +RL+G QV+T KPDSCIIDF+NEGDHS PHM PPWFG+P
Sbjct: 345  SAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHSHPHMWPPWFGRP 404

Query: 1822 VCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQR 2001
            V +LFLTECD+TFG+ +G+D PGDYR              +QGKS D+AKHAIPSIRKQR
Sbjct: 405  VSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADYAKHAIPSIRKQR 464

Query: 2002 VLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXX 2172
            +LVTF K+ P+K+                         R+PNHIRHP+GPKHYA      
Sbjct: 465  ILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPG-RSPNHIRHPAGPKHYAAVPTTG 523

Query: 2173 XXXXXXXXXXXXXSNGMQPIFVT--TXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPG 2346
                         +NG+QP+FV                                R+P+PG
Sbjct: 524  VLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPVVIPPGSPGWVAAPRHPPPRMPLPG 583

Query: 2347 TGVFXXXXXXXXXXXXQQ-----VTETN-----FSEEKENGS-----------------V 2445
            TGVF             Q      TE N      S EK+NG+                  
Sbjct: 584  TGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSSHAIASPKAKLDVKA 643

Query: 2446 VKEECNGGV 2472
             +++CNG V
Sbjct: 644  QRQDCNGSV 652


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  584 bits (1505), Expect = e-164
 Identities = 343/693 (49%), Positives = 409/693 (59%), Gaps = 65/693 (9%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSS------------------GGGGGEIH---HRQWFPDERDGFI 726
            MAMPSGNVV+SDKMQF +                  GGGGGEIH   HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 727  SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 906
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 907  YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXX 1086
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V+S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1087 XXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1263
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1264 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1443
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1444 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIV 1617
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ Q GQT+V +KRPMKGHGREMIQ G+PI 
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIA 340

Query: 1618 DAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1797
            DAP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M
Sbjct: 341  DAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRM 400

Query: 1798 LPPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKH 1974
             PPWFGKPVCI+FLTECD+TFGR + + D PGDYR              +QGKS DFAKH
Sbjct: 401  WPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKH 460

Query: 1975 AIPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGP 2151
            A+PS+RKQR+LVTF K   PKK+  D                     +R+PN IRH +GP
Sbjct: 461  ALPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGP 518

Query: 2152 KHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXX 2316
            KHYA                   S+G+QP+FV T                          
Sbjct: 519  KHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPR 578

Query: 2317 XXXXRLPVPGTGVFXXXXXXXXXXXXQQVT---------ETNFSEEKENGSV-------- 2445
                RLPVPGTGVF            Q  T         ET    EKENGSV        
Sbjct: 579  HPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTS 638

Query: 2446 ---------VKEECNGGV--------MIKEEEN 2493
                      K++CNG V        ++KEE++
Sbjct: 639  PRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 671


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  582 bits (1499), Expect = e-163
 Identities = 345/664 (51%), Positives = 399/664 (60%), Gaps = 54/664 (8%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGG----------GEI-----HHR-QWFP-DERDGFISWLR 738
            MAMP GNVVISDK+QF +GGGG           EI     HHR QWFP DERDGFISWLR
Sbjct: 1    MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60

Query: 739  GEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQ 918
            GEFAAANAIIDSLC+HLR+AGEPGEYDVVIG IQQRRCNWNPVLHMQQYFS+ EV+ ALQ
Sbjct: 61   GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120

Query: 919  QVAWRRQ-----------HRYF-DQMKGTGKEFKRAGGVGYRQGQRV--EAVKEGHTFSV 1056
            QVA R+Q           HRY+ DQ K  GK+FKR   +G+ +G R   E VKE + +  
Sbjct: 121  QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVN-YGA 179

Query: 1057 ESHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGF-------L 1215
            ESH            +     NE   + K G + G  ++K    A++KKD         L
Sbjct: 180  ESHGL----------DGNTSGNEKFNEIKSGGDSGRLENKSLATAEDKKDAASKPHVDNL 229

Query: 1216 KSSGNSDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTF 1395
            KSSGNS+G + GN   E + V ++  P          E+DS+ +QNQ  K NLTT PKTF
Sbjct: 230  KSSGNSEGSLSGNLETEAEAVHEQSSPK---------EHDSHFIQNQIVKLNLTTTPKTF 280

Query: 1396 VGTELFDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPM 1572
            VG E+ DGK+VNVVDGLKLYE L D+ EVSKL SLVNDLR AGR+GQFQGQ +VVSKRPM
Sbjct: 281  VGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPM 340

Query: 1573 KGHGREMIQFGIPIVDAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSC 1752
            KGHGREMIQ G+PI DAP E+EN +GTSKD KIESIP  LQ+V ER V MQ+MT KPDSC
Sbjct: 341  KGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSC 400

Query: 1753 IIDFYNEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXX 1932
            IID YNEGDHSQPHM PPWFGKP+ +LFLTECD+TFGR I  D PGDYR           
Sbjct: 401  IIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGS 460

Query: 1933 XXAVQGKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXX 2112
               +QGK+ DFAKHAIP+IRKQRVL+TF K+ PKK                         
Sbjct: 461  LLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKK-FVQSDGQRLTSPAASPSSHWGPPP 519

Query: 2113 TRAPNHIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVT----TXXXXXXXXX 2271
            +R+PNHIRHP   KHYA                    NG+QP+FVT              
Sbjct: 520  SRSPNHIRHPVS-KHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVP 578

Query: 2272 XXXXXXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQ--QVTETNFS------EE 2427
                                +PVPGTGVF            Q    TE NF       ++
Sbjct: 579  MPPVSTGWPAAPRHPPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINFPAETASLQD 638

Query: 2428 KENG 2439
            KENG
Sbjct: 639  KENG 642


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  569 bits (1466), Expect = e-159
 Identities = 328/674 (48%), Positives = 400/674 (59%), Gaps = 55/674 (8%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGGG--------EIHHR-----QWFPDERDGFISWLRGEFA 750
            MAMPSGNVVI DKMQF SG GGG        EIH       QWF DERDG I WLR EFA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSEFA 60

Query: 751  AANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAW 930
            AANAIIDSLC+HLR  G+PGEYD+V+G+IQQRRCNWN VL MQQYFS+++V YALQQVAW
Sbjct: 61   AANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAW 120

Query: 931  RRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKE 1110
            RRQ R  D MK   KE +++G  GYR GQR E+VKEG+  SVES+            EK 
Sbjct: 121  RRQQRPLDPMKVGAKEVRKSGS-GYRHGQRFESVKEGYNSSVESYSHDANVAVTGGTEKG 179

Query: 1111 PRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLE 1266
              V E  ++ K G ++ +   K     +EKKD        G LKS+ +++G +   S LE
Sbjct: 180  TPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGSLKSARSTEGSL---SNLE 236

Query: 1267 VKEV-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1443
             + V  D CI NSKG       ND +S+QNQ++ Q+L+ + KTF+G E+FDGK VNVVDG
Sbjct: 237  SEAVVNDGCISNSKG-------NDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDG 289

Query: 1444 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIV 1617
            LKLY+ LFD++EV+ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+ I 
Sbjct: 290  LKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIA 349

Query: 1618 DAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1797
            DAP E EN++G SKD  +ESIP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH 
Sbjct: 350  DAPAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHS 409

Query: 1798 LPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHA 1977
             P W+G+PV +LFLTEC+MTFGR I  + PGDYR              +QGKS DFAKHA
Sbjct: 410  WPSWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHA 469

Query: 1978 IPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKH 2157
            +PS RKQR+LVTF K+ P+K+L+                      +R+PNH+RH  GPKH
Sbjct: 470  LPSTRKQRILVTFTKSQPRKSLSS---DAQQLASAVASSHWGPPPSRSPNHVRHHVGPKH 526

Query: 2158 YAXXXXXXXXXXXXXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXX 2316
            YA                     GMQP+FV                              
Sbjct: 527  YATLPTTGVLPAPPIRPQMAAPVGMQPLFVAAPVVPPMPFSAPVPIPAGSTGWTAAPPPR 586

Query: 2317 XXXXRLPVPGTGVFXXXXXXXXXXXXQQV-----------TETNFSEEKEN--------- 2436
                R+P PGTGVF            QQ+           TET    EKEN         
Sbjct: 587  HPPPRVPAPGTGVF--LPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKENGKINHNSTS 644

Query: 2437 ----GSVVKEECNG 2466
                G V K+ECNG
Sbjct: 645  ASPKGKVQKQECNG 658


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  569 bits (1466), Expect = e-159
 Identities = 329/671 (49%), Positives = 405/671 (60%), Gaps = 52/671 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGG-----GEIHH----RQWFPDERDGFISWLRGEFAAANA 762
            MAMPSGNVVI DKMQF SGG G     GEIH     +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 763  IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 942
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 943  RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXX-EKEPRV 1119
            R  D +K   KEF+++G  GYR GQR E VKEG+  SVES+             EK   V
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESYNQYDANVTVTGGTEKGTPV 179

Query: 1120 NENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKE 1275
             E  ++ K G ++ +   K    A++KKD        G LKS+ +++G +   S LE + 
Sbjct: 180  VEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRSTEGSL---SNLESEA 236

Query: 1276 V-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKL 1452
            V  D+CI NSKG       +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKL
Sbjct: 237  VVNDECISNSKG-------DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKL 289

Query: 1453 YE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAP 1626
            YE LFD++E++ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP
Sbjct: 290  YEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAP 349

Query: 1627 PEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPP 1806
             E EN++G SKD  +E IP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P 
Sbjct: 350  AEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPS 409

Query: 1807 WFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPS 1986
            W+G+PV ILFLTEC+MTFGR I  + PGDYR              ++GKS DFAKHA+PS
Sbjct: 410  WYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPS 469

Query: 1987 IRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAX 2166
            +RKQR+LVTF K+ P+K+L+                      +R+PNH+RH  G KHYA 
Sbjct: 470  VRKQRILVTFTKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYAT 526

Query: 2167 XXXXXXXXXXXXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXX 2325
                                GMQP+FVT                                
Sbjct: 527  LPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPP 586

Query: 2326 XRLPVPGTGVFXXXXXXXXXXXXQQV-----------TETNFSEEKEN------------ 2436
             R+P PGTGVF            QQ+           TET    EKEN            
Sbjct: 587  PRVPAPGTGVF--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASP 644

Query: 2437 -GSVVKEECNG 2466
             G V K+ECNG
Sbjct: 645  KGKVQKQECNG 655


>ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max]
          Length = 626

 Score =  559 bits (1441), Expect = e-156
 Identities = 323/656 (49%), Positives = 388/656 (59%), Gaps = 23/656 (3%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGGGEIHHRQ-WFPDERDGFISWLRGEFAAANAIIDSLCYH 786
            MAMPSGN V+ +K+QF  GGGG EIH+RQ WF DERDGFI WLR EFAAANAIIDSLC+H
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSLCHH 60

Query: 787  LRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQMKG 966
            LR  GEPGEYD+V+G+IQQRRCNW  VL MQQYFS+SEV+ ALQQV+WRRQ R  D  K 
Sbjct: 61   LRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVDLAKT 120

Query: 967  TGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXX-EKEPRVNENVQDA 1140
              KEF++ G  G RQGQ R+EA K+G+  SVES              EK   + E   + 
Sbjct: 121  GAKEFRKFGS-GIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVAGGVEKGTPLTEKNGEI 179

Query: 1141 KHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGDKCIP 1296
            K G ++G  D+K     +E+KD        G LK SGNS G +   S  E   V ++C+ 
Sbjct: 180  KSGGKVGTMDNKSLASPEERKDTITNHQSDGILKGSGNSQGSL-STSECEAVGVNEECVS 238

Query: 1297 NSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNS 1473
            NSK       ENDS             T+ KTF+G E+FDGK VNVVDGLKLYE L D +
Sbjct: 239  NSK-------ENDS-------------TMGKTFIGNEMFDGKMVNVVDGLKLYEDLLDRT 278

Query: 1474 EVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSG 1650
            EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +N++G
Sbjct: 279  EVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTG 338

Query: 1651 TSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCI 1830
             SKD K+ESIP+  QD+ +RLV  QVMT KPD+CI+DF+NEG+HS P+  PPWFG+P+ I
Sbjct: 339  ISKDKKVESIPSLFQDIIKRLVASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPLYI 398

Query: 1831 LFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLV 2010
            LFLTECDMTFGR I  D PG++R              +QGKS DFAKHA+PSI KQR++V
Sbjct: 399  LFLTECDMTFGRIIVSDHPGEFRGAVTLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIV 458

Query: 2011 TFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXXXXXX 2190
            TF K+ P+ +L +                     +R+PNH+RH  GPKHY          
Sbjct: 459  TFTKSQPRSSLPN----DSERLAPPAAPHWAPPPSRSPNHVRHQLGPKHY------PTVQ 508

Query: 2191 XXXXXXXSNGMQPIFV-----TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGV 2355
                    NGMQP+FV                                   R+PVPGTGV
Sbjct: 509  ATGVLPAPNGMQPLFVPVPVPVASPMSFPTPVPIPPGSIGWTSAPPRHPPPRIPVPGTGV 568

Query: 2356 FXXXXXXXXXXXXQQVTETNFSEEKENGSVVKEECN-----GGVMIKEEENKPAGA 2508
            F                ET     KENG     + N      GV  + E N    A
Sbjct: 569  FLPPPGSGTIHEVNPSVETWTVSGKENGKSNHSKTNSEAEEAGVEKEHESNDMTAA 624


>ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris]
            gi|561026542|gb|ESW25182.1| hypothetical protein
            PHAVU_003G014200g [Phaseolus vulgaris]
          Length = 691

 Score =  557 bits (1436), Expect = e-156
 Identities = 330/695 (47%), Positives = 399/695 (57%), Gaps = 61/695 (8%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGG----GGEIH--HRQWFPDERDGFISWLRGEFAAANAIID 771
            MAMPSGN  + +K+QF  GGG    GGEI   H+QWF DERDGFI WLR EFAAANAIID
Sbjct: 1    MAMPSGNGGMPEKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAIID 60

Query: 772  SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 951
            SLC HLR  GEPG YD+V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQVAWRRQ R+ 
Sbjct: 61   SLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQRFV 120

Query: 952  DQMKGTGKEFKRAGGVGYRQGQ-------------RVEAVKEGHTFSVESHXXXXXXXXX 1092
            D  K   KEF++ G  G+RQGQ             R EA KEG+   VES          
Sbjct: 121  DPAKAGSKEFRKFGS-GFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGREMNAVVV 179

Query: 1093 XXX-EKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIM 1245
                EK  RV +   +   G ++G  D+      +E KD        G L  SGN  G +
Sbjct: 180  TGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGILNGSGNFQGSL 239

Query: 1246 CGNSGLEVKEVGD--KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDG 1419
               S  E + VG+  +C  NSKG       NDS+S+QNQ++ QN +T+ KTF+G E+F+G
Sbjct: 240  ---SSSECEAVGENEECTSNSKG-------NDSHSVQNQHQSQNASTIGKTFIGNEMFEG 289

Query: 1420 KAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREM 1593
            K VNVVDGLKLYE L D++EVSKL SLVND+RVAG+RGQFQG QTFVVSKRP+KG GREM
Sbjct: 290  KMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGRGREM 349

Query: 1594 IQFGIPIVDAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNE 1773
            IQ G+PI DAPP+ +N++G SKD K+ESIP+  +D+ ERL   QVMT KPD+CI+DF+NE
Sbjct: 350  IQLGVPIADAPPDVDNVTGLSKDKKVESIPSLFEDIIERLAASQVMTVKPDACIVDFFNE 409

Query: 1774 GDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGK 1953
            GDHSQP+  PPWFG+PV +LFLTECD+TFGR I  D PGDYR              +QGK
Sbjct: 410  GDHSQPNSCPPWFGRPVYMLFLTECDITFGRTIVSDHPGDYRGAVKLSLVPGSLLVMQGK 469

Query: 1954 SLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHI 2133
            S D AKHA+PSI KQR+LVTF K+ PK +L +                      R PNH+
Sbjct: 470  STDLAKHALPSIHKQRILVTFTKSQPKTSLPNDSQRLSPAVTSHWAPPQG----RTPNHM 525

Query: 2134 RHPSGPKHYAXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXX 2304
            RH  GPKHY                  NGMQ +FV T                       
Sbjct: 526  RHQLGPKHYPTIPATGVLPAPSIRAPPNGMQTLFVPTPVAPPISFASPVPIPLGSTGWAS 585

Query: 2305 XXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQ---QVTETNFSEE---------------- 2427
                    R+PVPGTGVF                 V+E N S E                
Sbjct: 586  APQRHPPPRMPVPGTGVFLPPPGSGTTSSQHLPGVVSEVNLSGETTSTGKESLKSNHNTI 645

Query: 2428 ------KENGSVV-KEECNGGVMIKEEENKPAGAD 2511
                  K +G+VV ++ECNG     E E    G +
Sbjct: 646  NSSPKGKVDGNVVGRQECNGNADRSEGEEDVVGKE 680


>ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032200|gb|ESW30779.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  550 bits (1417), Expect = e-153
 Identities = 319/667 (47%), Positives = 392/667 (58%), Gaps = 48/667 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGG---GEI--HH--RQWFPDERDGFISWLRGEFAAANAII 768
            MAMPSGNVVI DKMQF +GGGG   GEI  HH  +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 769  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 948
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 949  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVNEN 1128
             D +K   KE ++ G  GYR G R E  KEG+  SVES+            EK     + 
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1129 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1284
             ++ K G ++ +   K     +EKKD        G LKS+G+S+G +  N   E   V D
Sbjct: 180  SEEHKSGSKVEKVGDKGLASPEEKKDAIIKHQTDGNLKSTGSSEGYL-SNLESEAVVVND 238

Query: 1285 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1461
            + I NSKG       NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +
Sbjct: 239  EFISNSKG-------NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDI 291

Query: 1462 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1638
            FD++EVS L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E E
Sbjct: 292  FDSTEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGE 351

Query: 1639 NLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1818
            N++G SK   +E IP+  +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+
Sbjct: 352  NMTGASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGR 411

Query: 1819 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQ 1998
            PV  LFLTEC+MTFGR I  + PGDYR             A+QGKS DFAKHA+PSIRKQ
Sbjct: 412  PVYTLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQ 471

Query: 1999 RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XX 2169
            R+LVTF K+ PKK++                       +R+PNH+RH  G KHYA     
Sbjct: 472  RILVTFTKSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTT 528

Query: 2170 XXXXXXXXXXXXXXSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLP 2337
                            GMQP+FV                                  R+P
Sbjct: 529  GVLPAPPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIP 588

Query: 2338 VPGTGVF---XXXXXXXXXXXXQQVTETNFS-------EEKEN--------------GSV 2445
             PGTGVF                 + E N S       +EKEN              G V
Sbjct: 589  APGTGVFLPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKV 648

Query: 2446 VKEECNG 2466
             K+ECNG
Sbjct: 649  QKQECNG 655


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  546 bits (1408), Expect = e-152
 Identities = 324/659 (49%), Positives = 378/659 (57%), Gaps = 49/659 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 756
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 757  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 936
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 937  QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1053
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1054 VESHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNS 1233
            VE+H            EK        ++ K G + G+SD K    A    D    SSGN+
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGNA 233

Query: 1234 DGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELF 1413
             G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E  
Sbjct: 234  QGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEKI 282

Query: 1414 DGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGRE 1590
            DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGRE
Sbjct: 283  DGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGRE 342

Query: 1591 MIQFGIPIVDAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYN 1770
            MIQ G+PI DAP EDEN +GTSK+ ++ESIPA LQDV E  V MQVMT KPDSCIID YN
Sbjct: 343  MIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYN 402

Query: 1771 EGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQG 1950
            EGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +QG
Sbjct: 403  EGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQG 462

Query: 1951 KSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNH 2130
            KS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PNH
Sbjct: 463  KSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNH 521

Query: 2131 IRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXXX 2286
            +RHP  PKHYA                    NG+QP+F+TT                   
Sbjct: 522  LRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVST 580

Query: 2287 XXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2439
                           +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  GWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 639


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  546 bits (1407), Expect = e-152
 Identities = 324/660 (49%), Positives = 378/660 (57%), Gaps = 50/660 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 756
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 757  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 936
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 937  QHR-----------------YFDQMKGTGKEFKRAGGVGYRQGQRV-----EAVKEGHTF 1050
            Q +                 Y+D  K  G++FKR+   G+ +G R      +AVKEG   
Sbjct: 121  QQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGDAVKEGVNS 180

Query: 1051 SVESHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGN 1230
            SVE+H            EK        ++ K G + G+SD K    A    D    SSGN
Sbjct: 181  SVENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGN 233

Query: 1231 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1410
            + G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEK 282

Query: 1411 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1587
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 283  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 342

Query: 1588 EMIQFGIPIVDAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFY 1767
            EMIQ G+PI DAP EDEN +GTSK+ ++ESIPA LQDV E  V MQVMT KPDSCIID Y
Sbjct: 343  EMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 402

Query: 1768 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQ 1947
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +Q
Sbjct: 403  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 462

Query: 1948 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPN 2127
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PN
Sbjct: 463  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 521

Query: 2128 HIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXX 2283
            H+RHP  PKHYA                    NG+QP+F+TT                  
Sbjct: 522  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 580

Query: 2284 XXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2439
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 640


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  546 bits (1406), Expect = e-152
 Identities = 308/603 (51%), Positives = 371/603 (61%), Gaps = 20/603 (3%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGG----GGEIHHRQ-WFPDERDGFISWLRGEFAAANAIIDS 774
            MAMPSGN V+ +K+QF  GGG    G EIH RQ WF DERDGFI WLR EFAAANAIIDS
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGAPGGGSEIHFRQQWFVDERDGFIGWLRSEFAAANAIIDS 60

Query: 775  LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 954
            LC+HLR  GEPGEY++V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQV+WRRQ R  D
Sbjct: 61   LCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVSWRRQQRVVD 120

Query: 955  QMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVESHXXXXXXXXXXXX-EKEPRVNEN 1128
              K   KEF++ G +G++QGQ R EAVK+G+  SVES              EK   V E 
Sbjct: 121  PAKTGAKEFRKFG-LGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAGGVEKGACVTEK 179

Query: 1129 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1284
              + K G  +G  D+K     +E+KD        G LK S NS G +  +S  E   V +
Sbjct: 180  NGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGILKGSRNSQGSL-SSSECEAVGVNE 238

Query: 1285 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1461
            +C+ NSK       ENDS              + K F+G E+FDGK VNVVDGLKLYE L
Sbjct: 239  ECVSNSK-------ENDSI-------------MGKFFIGNEMFDGKMVNVVDGLKLYEDL 278

Query: 1462 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1638
             D++EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +
Sbjct: 279  LDSTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVD 338

Query: 1639 NLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1818
            N++G SKD K+ESIP+  QD+ ERL   QVMT KPD+CI+DF+NEG+HS P+  PPWFG+
Sbjct: 339  NVTGISKDKKVESIPSLFQDIIERLAASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGR 398

Query: 1819 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQ 1998
            PV  LFLTECDMTFGR I  D PG++R              +QGKS DFAKHA+PSI KQ
Sbjct: 399  PVYTLFLTECDMTFGRIIVSDHPGEFRGAVRLSLVPGSLLVMQGKSTDFAKHALPSIHKQ 458

Query: 1999 RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXX 2178
            R+++TF K+ PK +L +                     +R+PNH+RH  GPKHY      
Sbjct: 459  RIIITFTKSQPKCSLPN----DSQRLAPPAASHWAPPQSRSPNHVRHQLGPKHYPTVPAT 514

Query: 2179 XXXXXXXXXXXSNGMQPIFV---TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGT 2349
                        N MQP+FV                                 R+PVPGT
Sbjct: 515  VVLPAPSIHAPPNSMQPLFVPAPVAPPMSFPTPVPIPPGSTGWTSAPSRHPPPRIPVPGT 574

Query: 2350 GVF 2358
            GVF
Sbjct: 575  GVF 577


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max]
          Length = 641

 Score =  544 bits (1401), Expect = e-151
 Identities = 314/661 (47%), Positives = 389/661 (58%), Gaps = 42/661 (6%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGG-----GEIHH----RQWFPDERDGFISWLRGEFAAANA 762
            MAMPSGNVVI DKMQF SGG G     GEIH     +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 763  IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 942
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 943  RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEKEPRVN 1122
            R  D +K   KEF+++G  GYR GQR E VKEG+  SVES+                   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESY------------------- 160

Query: 1123 ENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNS 1302
             N  DA     +     K  P+ ++ ++                SG +V++VGDK + ++
Sbjct: 161  -NQYDANV--TVTGGTEKGTPVVEKSEEH--------------KSGGKVEKVGDKGLASA 203

Query: 1303 KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEV 1479
            +        +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKLYE LFD++E+
Sbjct: 204  EDKKG----DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTEI 259

Query: 1480 SKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTS 1656
            + L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP E EN++G S
Sbjct: 260  ANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENMTGAS 319

Query: 1657 KDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILF 1836
            KD  +E IP+  QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P W+G+PV ILF
Sbjct: 320  KDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYILF 379

Query: 1837 LTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTF 2016
            LTEC+MTFGR I  + PGDYR              ++GKS DFAKHA+PS+RKQR+LVTF
Sbjct: 380  LTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRILVTF 439

Query: 2017 AKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXXXXXXXX 2196
             K+ P+K+L+                      +R+PNH+RH  G KHYA           
Sbjct: 440  TKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYATLPTTGVLPSP 496

Query: 2197 XXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGV 2355
                      GMQP+FVT                                 R+P PGTGV
Sbjct: 497  PIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPAPGTGV 556

Query: 2356 FXXXXXXXXXXXXQQV-----------TETNFSEEKEN-------------GSVVKEECN 2463
            F            QQ+           TET    EKEN             G V K+ECN
Sbjct: 557  F--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASPKGKVQKQECN 614

Query: 2464 G 2466
            G
Sbjct: 615  G 615


>ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032201|gb|ESW30780.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 630

 Score =  522 bits (1345), Expect = e-145
 Identities = 307/660 (46%), Positives = 378/660 (57%), Gaps = 41/660 (6%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSGGGG---GEI--HH--RQWFPDERDGFISWLRGEFAAANAII 768
            MAMPSGNVVI DKMQF +GGGG   GEI  HH  +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 769  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 948
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 949  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVESHXXXXXXXXXXXXEK-EPRVNE 1125
             D +K   KE ++ G  GYR G R E  KEG+  SVES+            EK  P V++
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1126 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1305
            +             +HK                          SG +V++VGDK + + +
Sbjct: 180  S------------EEHK--------------------------SGSKVEKVGDKGLASPE 201

Query: 1306 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEVS 1482
                    NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +FD++EVS
Sbjct: 202  EKKG----NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDSTEVS 257

Query: 1483 KLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSK 1659
             L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E EN++G SK
Sbjct: 258  NLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGENMTGASK 317

Query: 1660 DCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFL 1839
               +E IP+  +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+PV  LFL
Sbjct: 318  VMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVYTLFL 377

Query: 1840 TECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTFA 2019
            TEC+MTFGR I  + PGDYR             A+QGKS DFAKHA+PSIRKQR+LVTF 
Sbjct: 378  TECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQRILVTFT 437

Query: 2020 KAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXXXXXX 2190
            K+ PKK++                       +R+PNH+RH  G KHYA            
Sbjct: 438  KSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTTGVLPAPP 494

Query: 2191 XXXXXXXSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGVF 2358
                     GMQP+FV                                  R+P PGTGVF
Sbjct: 495  IRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIPAPGTGVF 554

Query: 2359 ---XXXXXXXXXXXXQQVTETNFS-------EEKEN--------------GSVVKEECNG 2466
                             + E N S       +EKEN              G V K+ECNG
Sbjct: 555  LPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKVQKQECNG 614


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  516 bits (1328), Expect = e-143
 Identities = 315/660 (47%), Positives = 364/660 (55%), Gaps = 50/660 (7%)
 Frame = +1

Query: 610  MAMPSGNVVISDKMQFSSG-----GGGGEIHHRQ-----WFP-DERDGFISWLRGEFAAA 756
            MAMP GNVVI DK+QF +G     GGG EIH  Q     WFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 757  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 936
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 937  QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1053
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1054 VESHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHK-VPPLADEKKDGFLKSSGN 1230
            VE+H            EK        ++ K G + G+SD K     A    D    SSGN
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKADATAKSHTDNHKNSSGN 233

Query: 1231 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1410
            + G   GNS                                 NEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNSEAVA-----------------------------NEKQNLAITPKTFVAEEK 264

Query: 1411 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1587
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 265  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 324

Query: 1588 EMIQFGIPIVDAPPEDENLSGTSKDCKIESIPAFLQDVAERLVGMQVMTTKPDSCIIDFY 1767
            EMIQ G+PI DAP EDEN +GTSK   +ESIPA LQDV E  V MQVMT KPDSCIID Y
Sbjct: 325  EMIQLGLPIADAPAEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 383

Query: 1768 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQ 1947
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +Q
Sbjct: 384  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 443

Query: 1948 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPN 2127
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PN
Sbjct: 444  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 502

Query: 2128 HIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXX 2283
            H+RHP  PKHYA                    NG+QP+F+TT                  
Sbjct: 503  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 561

Query: 2284 XXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2439
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 562  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 621

