BLASTX nr result

ID: Paeonia23_contig00004174 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00004174
         (3006 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]              654   0.0  
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   649   0.0  
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   612   e-172
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     602   e-169
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   597   e-168
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   584   e-164
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   583   e-163
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   580   e-162
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   579   e-162
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   563   e-157
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   563   e-157
ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phas...   557   e-155
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...   554   e-154
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...   548   e-153
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   544   e-151
gb|ABK95394.1| unknown [Populus trichocarpa]                          543   e-151
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   540   e-150
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   538   e-150
ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phas...   520   e-144
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   513   e-142

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  654 bits (1687), Expect = 0.0
 Identities = 374/677 (55%), Positives = 425/677 (62%), Gaps = 44/677 (6%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGGG------EIQH-RQWFPDERDGFISWLRGEFAAANAII 772
            MAMPSGNVVISDKMQF  GGG G      EI H RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 773  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 952
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 953  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNEN 1132
             D +KG GKE+KR G V YRQGQR E  K+ H  + ++H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1133 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1270
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1271 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1450
              ++ D    N KGSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 297

Query: 1451 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIVD 1624
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ Q GQTFVVSKRPMKGHGREMIQ G+PI D
Sbjct: 298  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIAD 357

Query: 1625 APPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1804
            AP EDE++ GTSKD + ESIP  LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ 
Sbjct: 358  APLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIW 417

Query: 1805 PPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAI 1984
            P WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS DFAKHAI
Sbjct: 418  PTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAI 477

Query: 1985 PSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHY 2164
            PS+RKQR+LVTF K+ PKKT+A                      +R+PNH+RHP GPKHY
Sbjct: 478  PSLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHY 535

Query: 2165 -----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXX 2320
                                   NGMQP+FVTT                           
Sbjct: 536  GAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPR 595

Query: 2321 XXXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS-----VVKEE 2461
                RLPVPGTGVF            Q ++        ET    EKENGS     V KEE
Sbjct: 596  HPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSTVTKEE 655

Query: 2462 CNGGVMIKEEENKPAGA 2512
                  +K   +KPAGA
Sbjct: 656  QQHNDELK-VASKPAGA 671


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  649 bits (1673), Expect = 0.0
 Identities = 375/708 (52%), Positives = 430/708 (60%), Gaps = 75/708 (10%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGGG------EIQH-RQWFPDERDGFISWLRGEFAAANAII 772
            MAMPSGNVVISDKMQF  GGG G      EI H RQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 773  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 952
            DSLC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 953  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNEN 1132
             D +KG GKE+KR G V YRQGQR E  K+ H  + ++H            EK  RV+E 
Sbjct: 121  LDPVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEI 177

Query: 1133 VQDAKHGRE---IGESDHKVPPLADEKKDGF-----------LKSSGNSDGIMCGNSGLE 1270
              D K G +   +G+ + K    A+EKK G             KSS NS+G  CG S  E
Sbjct: 178  YDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETE 237

Query: 1271 VKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGL 1450
              ++ D       GSCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVVDGL
Sbjct: 238  ANDMDDG------GSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGL 291

Query: 1451 KLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDA 1627
            KLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI DA
Sbjct: 292  KLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADA 351

Query: 1628 PPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLP 1807
            P EDE++ GTSKD + ESIP  LQDV   LVG QV+T KPD+CIIDFYNEGDHSQPH+ P
Sbjct: 352  PLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWP 411

Query: 1808 PWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIP 1987
             WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS DFAKHAIP
Sbjct: 412  TWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIP 471

Query: 1988 SIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHY- 2164
            S+RKQR+LVTF K+ PKKT+A                      +R+PNH+RHP GPKHY 
Sbjct: 472  SLRKQRILVTFTKSQPKKTMAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYG 529

Query: 2165 ----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXXXXXXX 2323
                                  NGMQP+FVTT                            
Sbjct: 530  AVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPRH 589

Query: 2324 XXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS----------- 2446
               RLPVPGTGVF            Q ++        ET    EKENGS           
Sbjct: 590  PPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSNTVS 649

Query: 2447 --------VVKEECNGGV---------MIKEEE---------NKPAGA 2512
                    V ++ECNG +         + KEE+         +KPAGA
Sbjct: 650  PKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHNDELKVASKPAGA 697


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  612 bits (1578), Expect = e-172
 Identities = 357/685 (52%), Positives = 413/685 (60%), Gaps = 66/685 (9%)
 Frame = +2

Query: 620  MPSGNVVISDKMQFSSGGGGG------EIQH-RQWFPDERDGFISWLRGEFAAANAIIDS 778
            MPSGNVVISDKMQF  GGGGG      EI H RQWFPDERDGFISWLRGEFAAANAIIDS
Sbjct: 1    MPSGNVVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60

Query: 779  LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 958
            LC HLR  GEPGEYD VIG IQQRR NW+ VLHMQQYFS++EV+YALQQV WRRQ R+ D
Sbjct: 61   LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120

Query: 959  QMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNENVQ 1138
             +KG GKE+KR G V YRQGQR E  K+ H  + ++H            EK  RV+E   
Sbjct: 121  PVKGAGKEYKRYG-VAYRQGQRGETAKDSHNSNFENHSHDANSSGTL--EKGERVSEIYD 177

Query: 1139 DAKHGRE---IGESDHKVPPLADEKKD--GFLKSSGNSDGIMCGNSGLEVKEVG------ 1285
            D K G +   +G+ + K    A EKK+   F+        ++     + V+ V       
Sbjct: 178  DVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVIFGQLEQMLLQNPMQIAVRRVQKTQKDP 237

Query: 1286 ----DKCIPNS----KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVV 1441
                 +  P +      SCN+++EN+++ +QNQNEK N TT PKTFVGTE+FDGKAVNVV
Sbjct: 238  DVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVV 297

Query: 1442 DGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPI 1618
            DGLKLYE LFD+SEVSK  SLVNDLR AG+RGQ QGQTFVVSKRPMKGHGREMIQ G+PI
Sbjct: 298  DGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPI 357

Query: 1619 VDAPPEDENLSGTSKDC----KIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDH 1786
             DAP EDE++ GTSK      + ESIP  LQDV  +LVG QV+T KPD+CIIDFYNEGDH
Sbjct: 358  ADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVIGQLVGSQVLTVKPDACIIDFYNEGDH 417

Query: 1787 SQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLD 1966
            SQPH+ P WFG+PVCILFLTECDMTFGR IG D PGDYR              +QGKS D
Sbjct: 418  SQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSAD 477

Query: 1967 FAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHP 2146
            FAKHAIPS+RKQR+LVTF K+ PKKT A                      +R+PNH+RHP
Sbjct: 478  FAKHAIPSLRKQRILVTFTKSQPKKTTAS--DGQRLLPPAAQSSHWVPPPSRSPNHMRHP 535

Query: 2147 SGPKHY-----AXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXX 2302
             GPKHY                       NGMQP+FVTT                     
Sbjct: 536  MGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPXPLPTGSPGW 595

Query: 2303 XXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQVT--------ETNFSEEKENGS---- 2446
                      RLPVPGTGVF            Q ++        ET    EKENGS    
Sbjct: 596  PAAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSS 655

Query: 2447 ---------------VVKEECNGGV 2476
                           V ++ECNG +
Sbjct: 656  SNSNTVSPKGKLDGKVHRQECNGSM 680


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  602 bits (1551), Expect = e-169
 Identities = 346/668 (51%), Positives = 402/668 (60%), Gaps = 47/668 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSG-GGGGEIQH---RQWFPDERDGFISWLRGEFAAANAIIDSL 781
            MAMPSGNVV SDKMQF SG  G GEI H   RQWFPDERDGFISWLRGEFAAANA+IDSL
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMIDSL 60

Query: 782  CYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQ 961
            C+HLR+ GEPGEYD VI  IQ RRCNWNPVLHMQQYFS++EV++ALQQVAWRRQ R++D 
Sbjct: 61   CHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFYDP 120

Query: 962  MKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNENVQD 1141
            +K   KEFKR+G VG++Q QR ++ K+G   + +SH                  +E    
Sbjct: 121  VKMGNKEFKRSG-VGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNA------ASEKGGS 173

Query: 1142 AKHGREIGESDHKVP-PLADEK--------KDGFLKSSGNSDGIMCGNSGLEVKEVGDKC 1294
             K G E+G SD +   P A EK        +DG +KS GN +G++ G+   EV  V D C
Sbjct: 174  DKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEP-EVHAVDDGC 232

Query: 1295 IPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-D 1471
              +SK       ENDS+S   QNE  NL  +PKTF G E+FDGK VNVV+GLKLYE F  
Sbjct: 233  TSSSK-------ENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCA 285

Query: 1472 NSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLS 1651
            ++EVSKL +LVNDLR AG RG FQ QT+VVSKRPMKGHGRE IQ G+PI DAP EDE  +
Sbjct: 286  DTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISA 345

Query: 1652 GTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVC 1831
            GT KD + E+IP  LQDVAERLV MQV T KPDSCIIDFYNEGDHSQPH+ P WFG+PVC
Sbjct: 346  GTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVC 405

Query: 1832 ILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVL 2011
            +LFLTECDMTFGR   ID PGDYR             A+QGKS DFAKHAIPS+R+QR+L
Sbjct: 406  VLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRIL 465

Query: 2012 VTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXX 2182
            VTF K+ PKK++                       +R+PNHIRHP GPKHYA        
Sbjct: 466  VTFTKSQPKKSMPS-DGQRMPSPGVAPSSHWGPQPSRSPNHIRHP-GPKHYAPVPTTGVL 523

Query: 2183 XXXXXXXXXXXSNGMQPIFVT---TXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGT 2353
                        NG+QP+FVT                                RLPVPGT
Sbjct: 524  QASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPVPGT 583

Query: 2354 GVFXXXXXXXXXXXXQQ---------VTETNFSEEKENGS------------------VV 2452
            GVF             Q           ET    EKENGS                    
Sbjct: 584  GVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKVDSKTQ 643

Query: 2453 KEECNGGV 2476
            K+ECNG +
Sbjct: 644  KQECNGSL 651


>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  597 bits (1540), Expect = e-168
 Identities = 338/635 (53%), Positives = 391/635 (61%), Gaps = 24/635 (3%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGGGEI-------QHRQWFPDERDGFISWLRGEFAAANAII 772
            M MPSGNVV+SDKMQF SGGGGG +        HRQWFPDERDGFISWLRGEFAAANAII
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIAQHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 773  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 952
            DSLC+HLR+ GEPGEYDVVIG IQQRRCNWNPVLHMQQYFS++EV+YALQ VAWRRQ RY
Sbjct: 61   DSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRY 120

Query: 953  FDQMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNE 1129
            +D +K   KEFKR+G VG+ +GQ R EA KEGH  +++SH            EK  R   
Sbjct: 121  YDPVKAGAKEFKRSG-VGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFER--- 176

Query: 1130 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1309
                   G E+GE   +V P                        G EV ++ DK +  + 
Sbjct: 177  -------GSEVGE---EVEP------------------------GGEVGKLNDKGLAPA- 201

Query: 1310 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF-DNSEVS 1486
            G   V   N+S+S+Q QN+KQNL+ +PKTF+G E+ DGK VNVVDGLKLYE F  ++EVS
Sbjct: 202  GEKKV---NESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVS 258

Query: 1487 KLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSKD 1666
            KL SLVNDLR AG+R Q QGQT+VVSKRPMKGHGREMIQ GIPI DAPPEDE  +GTSKD
Sbjct: 259  KLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKD 318

Query: 1667 CKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFLT 1846
             KIE IP  LQDV +RLVGM VMT KPDSCIID YNEGDHSQPH  P WFG+PVC L+LT
Sbjct: 319  RKIEPIPSLLQDVIDRLVGMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLT 378

Query: 1847 ECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTFAK 2026
            ECDMTFGR + +D PGDYR              +QGKS DFAKHAIPSIRKQR+LVT  K
Sbjct: 379  ECDMTFGRLLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTK 438

Query: 2027 AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXXXXXXX 2197
            + PKK+                        +R+PNHIRHP+GPKHYA             
Sbjct: 439  SQPKKSTTS-DGQRFPAPAPAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPI 497

Query: 2198 XXXXXXSNGMQPIFV--TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGVFXXX 2371
                   NG+QP+FV                                R+P+PGTGVF   
Sbjct: 498  RSQLPPQNGIQPLFVPAPVGPAIPFAAAVPIPPGSAGWPAAPRHPPPRIPLPGTGVFLPP 557

Query: 2372 XXXXXXXXXQQV----------TETNFSEEKENGS 2446
                     QQ+           ET    +K+NGS
Sbjct: 558  PGSGNSSAPQQLPGTATEMSPTVETPSPRDKDNGS 592


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  584 bits (1506), Expect = e-164
 Identities = 342/692 (49%), Positives = 407/692 (58%), Gaps = 64/692 (9%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSS------------------GGGGGEIQ---HRQWFPDERDGFI 730
            MAMPSGNVV+SDKMQF +                  GGGGGEI    HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 731  SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 910
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 911  YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXX 1090
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1091 XXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1267
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1268 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1447
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1448 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVD 1624
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ QGQT+V +KRPMKGHGREMIQ G+PI D
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIAD 340

Query: 1625 APPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHML 1804
            AP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M 
Sbjct: 341  APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 400

Query: 1805 PPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHA 1981
            PPWFGKPVCI+FLTECD+TFGR + + D PGDYR              +QGKS DFAKHA
Sbjct: 401  PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 460

Query: 1982 IPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPK 2158
            +PS+RKQR+LVTF K   PKK+  D                     +R+PN IRH +GPK
Sbjct: 461  LPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPK 518

Query: 2159 HYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXXX 2323
            HYA                   S+G+QP+FV T                           
Sbjct: 519  HYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPRH 578

Query: 2324 XXXRLPVPGTGVFXXXXXXXXXXXXQQVT---------ETNFSEEKENGSV--------- 2449
               RLPVPGTGVF            Q  T         ET    EKENGSV         
Sbjct: 579  PPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSP 638

Query: 2450 --------VKEECNGGV--------MIKEEEN 2497
                     K++CNG V        ++KEE++
Sbjct: 639  RGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 670


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  583 bits (1503), Expect = e-163
 Identities = 335/669 (50%), Positives = 399/669 (59%), Gaps = 48/669 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGG----GGEI--QHRQWFPDERDGFISWLRGEFAAANAIID 775
            M MPSGNVV+SDKMQ+ S  G    GGEI  Q RQWFPDERDGFISWLRGEFAAANAIID
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAIID 60

Query: 776  SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 955
            SLC+HLR+ GEP EYD+VIG +QQRRCNW PVLHMQQYFS++EV+YALQQVAWRRQ RY+
Sbjct: 61   SLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYY 120

Query: 956  DQMKGTGKEFKRAG-GVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNEN 1132
            + +K   K++KR+  GVG++   R E VKE HT SV+              E        
Sbjct: 121  EPVKMGNKDYKRSNSGVGFKP--RNEPVKEWHTASVEYRSYDGSGLEKVGSEMR------ 172

Query: 1133 VQDAKHGREIGESDHKVPPLADEKKDGFLK--------SSGNSDGIMCGNSGLEVKEVGD 1288
             ++ K G E G+ D K        K    K        SS NS G + GNS  E   V +
Sbjct: 173  -EEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSESEDAVVNE 231

Query: 1289 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYELF 1468
             C  + K       EN+S S+Q QNEKQNL+ +PKTFVG E FDGK VNVVDGLKLYE F
Sbjct: 232  GCTSSIK-------ENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTVNVVDGLKLYEEF 284

Query: 1469 -DNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGREMIQFGIPIVDAPPEDEN 1645
              ++EVSKL SLVNDLR  GRRGQ QGQT+V+SKRPMKGHGREMIQ GIPI D P EDE 
Sbjct: 285  LGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEI 344

Query: 1646 LSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKP 1825
             +G SKD ++E+IP  LQDV +RL+G QV+T KPDSCIIDF+NEGDHS PHM PPWFG+P
Sbjct: 345  SAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHSHPHMWPPWFGRP 404

Query: 1826 VCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQR 2005
            V +LFLTECD+TFG+ +G+D PGDYR              +QGKS D+AKHAIPSIRKQR
Sbjct: 405  VSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADYAKHAIPSIRKQR 464

Query: 2006 VLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXX 2176
            +LVTF K+ P+K+                         R+PNHIRHP+GPKHYA      
Sbjct: 465  ILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPG-RSPNHIRHPAGPKHYAAVPTTG 523

Query: 2177 XXXXXXXXXXXXXSNGMQPIFVT--TXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPG 2350
                         +NG+QP+FV                                R+P+PG
Sbjct: 524  VLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPVVIPPGSPGWVAAPRHPPPRMPLPG 583

Query: 2351 TGVFXXXXXXXXXXXXQQ-----VTETN-----FSEEKENGS-----------------V 2449
            TGVF             Q      TE N      S EK+NG+                  
Sbjct: 584  TGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSSHAIASPKAKLDVKA 643

Query: 2450 VKEECNGGV 2476
             +++CNG V
Sbjct: 644  QRQDCNGSV 652


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  580 bits (1494), Expect = e-162
 Identities = 342/693 (49%), Positives = 407/693 (58%), Gaps = 65/693 (9%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSS------------------GGGGGEIQ---HRQWFPDERDGFI 730
            MAMPSGNVV+SDKMQF +                  GGGGGEI    HRQW PDERDGFI
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 731  SWLRGEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVL 910
             WLRGEFAA+NAIIDSLC+HLR  GE GEY+ VI  IQQRRCNWNPVLHMQQYFS++EV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 911  YALQQVAWRRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXX 1090
            YALQQVAWRR+ R+++  K  GKEFKR+G +G++ GQR+E  KEG    V S        
Sbjct: 121  YALQQVAWRRRQRHYESGKVGGKEFKRSG-MGFK-GQRMEVAKEGQNSGVDSDGNSTVTA 178

Query: 1091 XXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD-GFLKSSGNSDGIMCGNSGL 1267
                 E+  R +E  ++ K   E+G+ + K     ++KKD G    +G+++ +       
Sbjct: 179  VS---ERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESV------- 228

Query: 1268 EVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1447
              ++V   C  + K       END  S+QNQNEKQNL   PKTFVG E+FDGK VNVVDG
Sbjct: 229  -TEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1448 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQ-GQTFVVSKRPMKGHGREMIQFGIPIV 1621
            LKLYE LFD+ EV  L SLVNDLR AG+RGQ Q GQT+V +KRPMKGHGREMIQ G+PI 
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIA 340

Query: 1622 DAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1801
            DAP +DEN +GTSKD +IE IP  LQD  ERLV +QVMT KPDSCIID YNEGDHSQP M
Sbjct: 341  DAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRM 400

Query: 1802 LPPWFGKPVCILFLTECDMTFGRAIGI-DRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKH 1978
             PPWFGKPVCI+FLTECD+TFGR + + D PGDYR              +QGKS DFAKH
Sbjct: 401  WPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKH 460

Query: 1979 AIPSIRKQRVLVTFAK-AIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGP 2155
            A+PS+RKQR+LVTF K   PKK+  D                     +R+PN IRH +GP
Sbjct: 461  ALPSVRKQRILVTFTKYCQPKKSTTD--NQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGP 518

Query: 2156 KHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT--XXXXXXXXXXXXXXXXXXXXXXXX 2320
            KHYA                   S+G+QP+FV T                          
Sbjct: 519  KHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAPR 578

Query: 2321 XXXXRLPVPGTGVFXXXXXXXXXXXXQQVT---------ETNFSEEKENGSV-------- 2449
                RLPVPGTGVF            Q  T         ET    EKENGSV        
Sbjct: 579  HPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTS 638

Query: 2450 ---------VKEECNGGV--------MIKEEEN 2497
                      K++CNG V        ++KEE++
Sbjct: 639  PRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQH 671


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  579 bits (1493), Expect = e-162
 Identities = 344/664 (51%), Positives = 399/664 (60%), Gaps = 54/664 (8%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGG----------GEIQ-----HR-QWFP-DERDGFISWLR 742
            MAMP GNVVISDK+QF +GGGG           EIQ     HR QWFP DERDGFISWLR
Sbjct: 1    MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60

Query: 743  GEFAAANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQ 922
            GEFAAANAIIDSLC+HLR+AGEPGEYDVVIG IQQRRCNWNPVLHMQQYFS+ EV+ ALQ
Sbjct: 61   GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120

Query: 923  QVAWRRQ-----------HRYF-DQMKGTGKEFKRAGGVGYRQGQRV--EAVKEGHTFSV 1060
            QVA R+Q           HRY+ DQ K  GK+FKR   +G+ +G R   E VKE + +  
Sbjct: 121  QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVN-YGA 179

Query: 1061 QSHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGF-------L 1219
            +SH            +     NE   + K G + G  ++K    A++KKD         L
Sbjct: 180  ESHGL----------DGNTSGNEKFNEIKSGGDSGRLENKSLATAEDKKDAASKPHVDNL 229

Query: 1220 KSSGNSDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTF 1399
            KSSGNS+G + GN   E + V ++  P          E+DS+ +QNQ  K NLTT PKTF
Sbjct: 230  KSSGNSEGSLSGNLETEAEAVHEQSSPK---------EHDSHFIQNQIVKLNLTTTPKTF 280

Query: 1400 VGTELFDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPM 1576
            VG E+ DGK+VNVVDGLKLYE L D+ EVSKL SLVNDLR AGR+GQFQGQ +VVSKRPM
Sbjct: 281  VGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPM 340

Query: 1577 KGHGREMIQFGIPIVDAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSC 1756
            KGHGREMIQ G+PI DAP E+EN +GTSKD KIESIP  LQ+V ER V MQ+MT KPDSC
Sbjct: 341  KGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSC 400

Query: 1757 IIDFYNEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXX 1936
            IID YNEGDHSQPHM PPWFGKP+ +LFLTECD+TFGR I  D PGDYR           
Sbjct: 401  IIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGS 460

Query: 1937 XXAVQGKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXX 2116
               +QGK+ DFAKHAIP+IRKQRVL+TF K+ PKK                         
Sbjct: 461  LLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKK-FVQSDGQRLTSPAASPSSHWGPPP 519

Query: 2117 TRAPNHIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVT----TXXXXXXXXX 2275
            +R+PNHIRHP   KHYA                    NG+QP+FVT              
Sbjct: 520  SRSPNHIRHPVS-KHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVP 578

Query: 2276 XXXXXXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQ--QVTETNFS------EE 2431
                                +PVPGTGVF            Q    TE NF       ++
Sbjct: 579  MPPVSTGWPAAPRHPPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINFPAETASLQD 638

Query: 2432 KENG 2443
            KENG
Sbjct: 639  KENG 642


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  563 bits (1452), Expect = e-157
 Identities = 326/674 (48%), Positives = 398/674 (59%), Gaps = 55/674 (8%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGGG--------EIQHR-----QWFPDERDGFISWLRGEFA 754
            MAMPSGNVVI DKMQF SG GGG        EI        QWF DERDG I WLR EFA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSEFA 60

Query: 755  AANAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAW 934
            AANAIIDSLC+HLR  G+PGEYD+V+G+IQQRRCNWN VL MQQYFS+++V YALQQVAW
Sbjct: 61   AANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAW 120

Query: 935  RRQHRYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKE 1114
            RRQ R  D MK   KE +++G  GYR GQR E+VKEG+  SV+S+            EK 
Sbjct: 121  RRQQRPLDPMKVGAKEVRKSGS-GYRHGQRFESVKEGYNSSVESYSHDANVAVTGGTEKG 179

Query: 1115 PRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLE 1270
              V E  ++ K G ++ +   K     +EKKD        G LKS+ +++G +   S LE
Sbjct: 180  TPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGSLKSARSTEGSL---SNLE 236

Query: 1271 VKEV-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDG 1447
             + V  D CI NSKG       ND +S+QNQ++ Q+L+ + KTF+G E+FDGK VNVVDG
Sbjct: 237  SEAVVNDGCISNSKG-------NDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDG 289

Query: 1448 LKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIV 1621
            LKLY+ LFD++EV+ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+ I 
Sbjct: 290  LKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIA 349

Query: 1622 DAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHM 1801
            DAP E EN++G SKD  +ESIP   QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH 
Sbjct: 350  DAPAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHS 409

Query: 1802 LPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHA 1981
             P W+G+PV +LFLTEC+MTFGR I  + PGDYR              +QGKS DFAKHA
Sbjct: 410  WPSWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHA 469

Query: 1982 IPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKH 2161
            +PS RKQR+LVTF K+ P+K+L+                      +R+PNH+RH  GPKH
Sbjct: 470  LPSTRKQRILVTFTKSQPRKSLSS---DAQQLASAVASSHWGPPPSRSPNHVRHHVGPKH 526

Query: 2162 YAXXXXXXXXXXXXXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXX 2320
            YA                     GMQP+FV                              
Sbjct: 527  YATLPTTGVLPAPPIRPQMAAPVGMQPLFVAAPVVPPMPFSAPVPIPAGSTGWTAAPPPR 586

Query: 2321 XXXXRLPVPGTGVFXXXXXXXXXXXXQQV-----------TETNFSEEKEN--------- 2440
                R+P PGTGVF            QQ+           TET    EKEN         
Sbjct: 587  HPPPRVPAPGTGVF--LPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKENGKINHNSTS 644

Query: 2441 ----GSVVKEECNG 2470
                G V K+ECNG
Sbjct: 645  ASPKGKVQKQECNG 658


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  563 bits (1452), Expect = e-157
 Identities = 327/671 (48%), Positives = 403/671 (60%), Gaps = 52/671 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGG-----GEIQH----RQWFPDERDGFISWLRGEFAAANA 766
            MAMPSGNVVI DKMQF SGG G     GEI      +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 767  IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 946
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 947  RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXX-EKEPRV 1123
            R  D +K   KEF+++G  GYR GQR E VKEG+  SV+S+             EK   V
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESYNQYDANVTVTGGTEKGTPV 179

Query: 1124 NENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKE 1279
             E  ++ K G ++ +   K    A++KKD        G LKS+ +++G +   S LE + 
Sbjct: 180  VEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRSTEGSL---SNLESEA 236

Query: 1280 V-GDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKL 1456
            V  D+CI NSKG       +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKL
Sbjct: 237  VVNDECISNSKG-------DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKL 289

Query: 1457 YE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAP 1630
            YE LFD++E++ L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP
Sbjct: 290  YEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAP 349

Query: 1631 PEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPP 1810
             E EN++G SKD  +E IP   QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P 
Sbjct: 350  AEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPS 409

Query: 1811 WFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPS 1990
            W+G+PV ILFLTEC+MTFGR I  + PGDYR              ++GKS DFAKHA+PS
Sbjct: 410  WYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPS 469

Query: 1991 IRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAX 2170
            +RKQR+LVTF K+ P+K+L+                      +R+PNH+RH  G KHYA 
Sbjct: 470  VRKQRILVTFTKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYAT 526

Query: 2171 XXXXXXXXXXXXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXX 2329
                                GMQP+FVT                                
Sbjct: 527  LPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPP 586

Query: 2330 XRLPVPGTGVFXXXXXXXXXXXXQQV-----------TETNFSEEKEN------------ 2440
             R+P PGTGVF            QQ+           TET    EKEN            
Sbjct: 587  PRVPAPGTGVF--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASP 644

Query: 2441 -GSVVKEECNG 2470
             G V K+ECNG
Sbjct: 645  KGKVQKQECNG 655


>ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris]
            gi|561026542|gb|ESW25182.1| hypothetical protein
            PHAVU_003G014200g [Phaseolus vulgaris]
          Length = 691

 Score =  557 bits (1435), Expect = e-155
 Identities = 330/695 (47%), Positives = 399/695 (57%), Gaps = 61/695 (8%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGG----GGEIQ--HRQWFPDERDGFISWLRGEFAAANAIID 775
            MAMPSGN  + +K+QF  GGG    GGEIQ  H+QWF DERDGFI WLR EFAAANAIID
Sbjct: 1    MAMPSGNGGMPEKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAIID 60

Query: 776  SLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYF 955
            SLC HLR  GEPG YD+V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQVAWRRQ R+ 
Sbjct: 61   SLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQRFV 120

Query: 956  DQMKGTGKEFKRAGGVGYRQGQ-------------RVEAVKEGHTFSVQSHXXXXXXXXX 1096
            D  K   KEF++ G  G+RQGQ             R EA KEG+   V+S          
Sbjct: 121  DPAKAGSKEFRKFGS-GFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGREMNAVVV 179

Query: 1097 XXX-EKEPRVNENVQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIM 1249
                EK  RV +   +   G ++G  D+      +E KD        G L  SGN  G +
Sbjct: 180  TGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGILNGSGNFQGSL 239

Query: 1250 CGNSGLEVKEVGD--KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDG 1423
               S  E + VG+  +C  NSKG       NDS+S+QNQ++ QN +T+ KTF+G E+F+G
Sbjct: 240  ---SSSECEAVGENEECTSNSKG-------NDSHSVQNQHQSQNASTIGKTFIGNEMFEG 289

Query: 1424 KAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREM 1597
            K VNVVDGLKLYE L D++EVSKL SLVND+RVAG+RGQFQG QTFVVSKRP+KG GREM
Sbjct: 290  KMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGRGREM 349

Query: 1598 IQFGIPIVDAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNE 1777
            IQ G+PI DAPP+ +N++G SKD K+ESIP   +D+ ERL   QVMT KPD+CI+DF+NE
Sbjct: 350  IQLGVPIADAPPDVDNVTGLSKDKKVESIPSLFEDIIERLAASQVMTVKPDACIVDFFNE 409

Query: 1778 GDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGK 1957
            GDHSQP+  PPWFG+PV +LFLTECD+TFGR I  D PGDYR              +QGK
Sbjct: 410  GDHSQPNSCPPWFGRPVYMLFLTECDITFGRTIVSDHPGDYRGAVKLSLVPGSLLVMQGK 469

Query: 1958 SLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHI 2137
            S D AKHA+PSI KQR+LVTF K+ PK +L +                      R PNH+
Sbjct: 470  STDLAKHALPSIHKQRILVTFTKSQPKTSLPNDSQRLSPAVTSHWAPPQG----RTPNHM 525

Query: 2138 RHPSGPKHYAXXXXXXXXXXXXXXXXSNGMQPIFVTT---XXXXXXXXXXXXXXXXXXXX 2308
            RH  GPKHY                  NGMQ +FV T                       
Sbjct: 526  RHQLGPKHYPTIPATGVLPAPSIRAPPNGMQTLFVPTPVAPPISFASPVPIPLGSTGWAS 585

Query: 2309 XXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQ---QVTETNFSEE---------------- 2431
                    R+PVPGTGVF                 V+E N S E                
Sbjct: 586  APQRHPPPRMPVPGTGVFLPPPGSGTTSSQHLPGVVSEVNLSGETTSTGKESLKSNHNTI 645

Query: 2432 ------KENGSVV-KEECNGGVMIKEEENKPAGAD 2515
                  K +G+VV ++ECNG     E E    G +
Sbjct: 646  NSSPKGKVDGNVVGRQECNGNADRSEGEEDVVGKE 680


>ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max]
          Length = 626

 Score =  554 bits (1427), Expect = e-154
 Identities = 321/656 (48%), Positives = 386/656 (58%), Gaps = 23/656 (3%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGGGEIQHRQ-WFPDERDGFISWLRGEFAAANAIIDSLCYH 790
            MAMPSGN V+ +K+QF  GGGG EI +RQ WF DERDGFI WLR EFAAANAIIDSLC+H
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSLCHH 60

Query: 791  LRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFDQMKG 970
            LR  GEPGEYD+V+G+IQQRRCNW  VL MQQYFS+SEV+ ALQQV+WRRQ R  D  K 
Sbjct: 61   LRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVDLAKT 120

Query: 971  TGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVQSHXXXXXXXXXXXX-EKEPRVNENVQDA 1144
              KEF++ G  G RQGQ R+EA K+G+  SV+S              EK   + E   + 
Sbjct: 121  GAKEFRKFGS-GIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVAGGVEKGTPLTEKNGEI 179

Query: 1145 KHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGDKCIP 1300
            K G ++G  D+K     +E+KD        G LK SGNS G +   S  E   V ++C+ 
Sbjct: 180  KSGGKVGTMDNKSLASPEERKDTITNHQSDGILKGSGNSQGSL-STSECEAVGVNEECVS 238

Query: 1301 NSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNS 1477
            NSK       ENDS             T+ KTF+G E+FDGK VNVVDGLKLYE L D +
Sbjct: 239  NSK-------ENDS-------------TMGKTFIGNEMFDGKMVNVVDGLKLYEDLLDRT 278

Query: 1478 EVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSG 1654
            EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +N++G
Sbjct: 279  EVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTG 338

Query: 1655 TSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCI 1834
             SKD K+ESIP   QD+ +RLV  QVMT KPD+CI+DF+NEG+HS P+  PPWFG+P+ I
Sbjct: 339  ISKDKKVESIPSLFQDIIKRLVASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPLYI 398

Query: 1835 LFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLV 2014
            LFLTECDMTFGR I  D PG++R              +QGKS DFAKHA+PSI KQR++V
Sbjct: 399  LFLTECDMTFGRIIVSDHPGEFRGAVTLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIV 458

Query: 2015 TFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXXXXXX 2194
            TF K+ P+ +L +                     +R+PNH+RH  GPKHY          
Sbjct: 459  TFTKSQPRSSLPN----DSERLAPPAAPHWAPPPSRSPNHVRHQLGPKHY------PTVQ 508

Query: 2195 XXXXXXXSNGMQPIFV-----TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGV 2359
                    NGMQP+FV                                   R+PVPGTGV
Sbjct: 509  ATGVLPAPNGMQPLFVPVPVPVASPMSFPTPVPIPPGSIGWTSAPPRHPPPRIPVPGTGV 568

Query: 2360 FXXXXXXXXXXXXQQVTETNFSEEKENGSVVKEECN-----GGVMIKEEENKPAGA 2512
            F                ET     KENG     + N      GV  + E N    A
Sbjct: 569  FLPPPGSGTIHEVNPSVETWTVSGKENGKSNHSKTNSEAEEAGVEKEHESNDMTAA 624


>ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032200|gb|ESW30779.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  548 bits (1411), Expect = e-153
 Identities = 317/667 (47%), Positives = 390/667 (58%), Gaps = 48/667 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGG---GEIQH----RQWFPDERDGFISWLRGEFAAANAII 772
            MAMPSGNVVI DKMQF +GGGG   GEIQ     +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 773  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 952
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 953  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVNEN 1132
             D +K   KE ++ G  GYR G R E  KEG+  SV+S+            EK     + 
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1133 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1288
             ++ K G ++ +   K     +EKKD        G LKS+G+S+G +  N   E   V D
Sbjct: 180  SEEHKSGSKVEKVGDKGLASPEEKKDAIIKHQTDGNLKSTGSSEGYL-SNLESEAVVVND 238

Query: 1289 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1465
            + I NSKG       NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +
Sbjct: 239  EFISNSKG-------NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDI 291

Query: 1466 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1642
            FD++EVS L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E E
Sbjct: 292  FDSTEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGE 351

Query: 1643 NLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1822
            N++G SK   +E IP   +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+
Sbjct: 352  NMTGASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGR 411

Query: 1823 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQ 2002
            PV  LFLTEC+MTFGR I  + PGDYR             A+QGKS DFAKHA+PSIRKQ
Sbjct: 412  PVYTLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQ 471

Query: 2003 RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XX 2173
            R+LVTF K+ PKK++                       +R+PNH+RH  G KHYA     
Sbjct: 472  RILVTFTKSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTT 528

Query: 2174 XXXXXXXXXXXXXXSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLP 2341
                            GMQP+FV                                  R+P
Sbjct: 529  GVLPAPPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIP 588

Query: 2342 VPGTGVF---XXXXXXXXXXXXQQVTETNFS-------EEKEN--------------GSV 2449
             PGTGVF                 + E N S       +EKEN              G V
Sbjct: 589  APGTGVFLPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKV 648

Query: 2450 VKEECNG 2470
             K+ECNG
Sbjct: 649  QKQECNG 655


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  544 bits (1401), Expect = e-151
 Identities = 321/659 (48%), Positives = 379/659 (57%), Gaps = 49/659 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQF---SSGGGGG-------EIQHRQWFP-DERDGFISWLRGEFAAA 760
            MAMP GNVVI DK+QF   ++GGGGG       ++Q  QWFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 761  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 940
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 941  QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1057
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1058 VQSHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNS 1237
            V++H            EK        ++ K G + G+SD K    A    D    SSGN+
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGNA 233

Query: 1238 DGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELF 1417
             G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E  
Sbjct: 234  QGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEKI 282

Query: 1418 DGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGRE 1594
            DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGRE
Sbjct: 283  DGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGRE 342

Query: 1595 MIQFGIPIVDAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYN 1774
            MIQ G+PI DAP EDEN +GTSK+ ++ESIP  LQDV E  V MQVMT KPDSCIID YN
Sbjct: 343  MIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYN 402

Query: 1775 EGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQG 1954
            EGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +QG
Sbjct: 403  EGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQG 462

Query: 1955 KSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNH 2134
            KS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PNH
Sbjct: 463  KSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNH 521

Query: 2135 IRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXXX 2290
            +RHP  PKHYA                    NG+QP+F+TT                   
Sbjct: 522  LRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVST 580

Query: 2291 XXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2443
                           +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  GWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 639


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  543 bits (1400), Expect = e-151
 Identities = 321/660 (48%), Positives = 379/660 (57%), Gaps = 50/660 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQF---SSGGGGG-------EIQHRQWFP-DERDGFISWLRGEFAAA 760
            MAMP GNVVI DK+QF   ++GGGGG       ++Q  QWFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 761  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 940
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 941  QHR-----------------YFDQMKGTGKEFKRAGGVGYRQGQRV-----EAVKEGHTF 1054
            Q +                 Y+D  K  G++FKR+   G+ +G R      +AVKEG   
Sbjct: 121  QQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGDAVKEGVNS 180

Query: 1055 SVQSHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGN 1234
            SV++H            EK        ++ K G + G+SD K    A    D    SSGN
Sbjct: 181  SVENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGN 233

Query: 1235 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1414
            + G   GNS  E   V D+  P          E+DS+   NQNEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNS--EAVAVDDRSSPE---------ESDSHPSNNQNEKQNLAITPKTFVAEEK 282

Query: 1415 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1591
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 283  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 342

Query: 1592 EMIQFGIPIVDAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFY 1771
            EMIQ G+PI DAP EDEN +GTSK+ ++ESIP  LQDV E  V MQVMT KPDSCIID Y
Sbjct: 343  EMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 402

Query: 1772 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQ 1951
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +Q
Sbjct: 403  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 462

Query: 1952 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPN 2131
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PN
Sbjct: 463  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 521

Query: 2132 HIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXX 2287
            H+RHP  PKHYA                    NG+QP+F+TT                  
Sbjct: 522  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 580

Query: 2288 XXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2443
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 581  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 640


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  540 bits (1392), Expect = e-150
 Identities = 306/603 (50%), Positives = 369/603 (61%), Gaps = 20/603 (3%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGG----GGEIQHRQ-WFPDERDGFISWLRGEFAAANAIIDS 778
            MAMPSGN V+ +K+QF  GGG    G EI  RQ WF DERDGFI WLR EFAAANAIIDS
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGAPGGGSEIHFRQQWFVDERDGFIGWLRSEFAAANAIIDS 60

Query: 779  LCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRYFD 958
            LC+HLR  GEPGEY++V+G+IQQRRCNW  VL MQQYFS+SEV+YALQQV+WRRQ R  D
Sbjct: 61   LCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVSWRRQQRVVD 120

Query: 959  QMKGTGKEFKRAGGVGYRQGQ-RVEAVKEGHTFSVQSHXXXXXXXXXXXX-EKEPRVNEN 1132
              K   KEF++ G +G++QGQ R EAVK+G+  SV+S              EK   V E 
Sbjct: 121  PAKTGAKEFRKFG-LGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAGGVEKGACVTEK 179

Query: 1133 VQDAKHGREIGESDHKVPPLADEKKD--------GFLKSSGNSDGIMCGNSGLEVKEVGD 1288
              + K G  +G  D+K     +E+KD        G LK S NS G +  +S  E   V +
Sbjct: 180  NGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGILKGSRNSQGSL-SSSECEAVGVNE 238

Query: 1289 KCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-L 1465
            +C+ NSK       ENDS              + K F+G E+FDGK VNVVDGLKLYE L
Sbjct: 239  ECVSNSK-------ENDSI-------------MGKFFIGNEMFDGKMVNVVDGLKLYEDL 278

Query: 1466 FDNSEVSKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDE 1642
             D++EVSKL SLVNDLRVAG+RGQFQG QTFVVSKRPMKGHGREMIQ G+PI DAPP+ +
Sbjct: 279  LDSTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVD 338

Query: 1643 NLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGK 1822
            N++G SKD K+ESIP   QD+ ERL   QVMT KPD+CI+DF+NEG+HS P+  PPWFG+
Sbjct: 339  NVTGISKDKKVESIPSLFQDIIERLAASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGR 398

Query: 1823 PVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQ 2002
            PV  LFLTECDMTFGR I  D PG++R              +QGKS DFAKHA+PSI KQ
Sbjct: 399  PVYTLFLTECDMTFGRIIVSDHPGEFRGAVRLSLVPGSLLVMQGKSTDFAKHALPSIHKQ 458

Query: 2003 RVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXX 2182
            R+++TF K+ PK +L +                     +R+PNH+RH  GPKHY      
Sbjct: 459  RIIITFTKSQPKCSLPN----DSQRLAPPAASHWAPPQSRSPNHVRHQLGPKHYPTVPAT 514

Query: 2183 XXXXXXXXXXXSNGMQPIFV---TTXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGT 2353
                        N MQP+FV                                 R+PVPGT
Sbjct: 515  VVLPAPSIHAPPNSMQPLFVPAPVAPPMSFPTPVPIPPGSTGWTSAPSRHPPPRIPVPGT 574

Query: 2354 GVF 2362
            GVF
Sbjct: 575  GVF 577


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max]
          Length = 641

 Score =  538 bits (1387), Expect = e-150
 Identities = 312/661 (47%), Positives = 387/661 (58%), Gaps = 42/661 (6%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGG-----GEIQH----RQWFPDERDGFISWLRGEFAAANA 766
            MAMPSGNVVI DKMQF SGG G     GEI      +QWF DERDG I WLR EFAAANA
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60

Query: 767  IIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQH 946
            IIDSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V +ALQQVAWRRQ 
Sbjct: 61   IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120

Query: 947  RYFDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEKEPRVN 1126
            R  D +K   KEF+++G  GYR GQR E VKEG+  SV+S+                   
Sbjct: 121  RPLDPVKVGAKEFRKSGS-GYRHGQRFEPVKEGYNSSVESY------------------- 160

Query: 1127 ENVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNS 1306
             N  DA     +     K  P+ ++ ++                SG +V++VGDK + ++
Sbjct: 161  -NQYDANV--TVTGGTEKGTPVVEKSEEH--------------KSGGKVEKVGDKGLASA 203

Query: 1307 KGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEV 1483
            +        +DS+S+QNQ++ Q+L+T  KTF+G E+FDGK VNVVDGLKLYE LFD++E+
Sbjct: 204  EDKKG----DDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTEI 259

Query: 1484 SKLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTS 1660
            + L SLVNDLRV+G++GQ QG Q ++VS+RPMKGHGREMIQ G+PI DAP E EN++G S
Sbjct: 260  ANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENMTGAS 319

Query: 1661 KDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILF 1840
            KD  +E IP   QD+ ER+V  QVMT KPD CI+DFYNEGDHSQPH  P W+G+PV ILF
Sbjct: 320  KDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYILF 379

Query: 1841 LTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTF 2020
            LTEC+MTFGR I  + PGDYR              ++GKS DFAKHA+PS+RKQR+LVTF
Sbjct: 380  LTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRILVTF 439

Query: 2021 AKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYAXXXXXXXXXXX 2200
             K+ P+K+L+                      +R+PNH+RH  G KHYA           
Sbjct: 440  TKSQPRKSLSS---DAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYATLPTTGVLPSP 496

Query: 2201 XXXXXSN---GMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGV 2359
                      GMQP+FVT                                 R+P PGTGV
Sbjct: 497  PIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPAPGTGV 556

Query: 2360 FXXXXXXXXXXXXQQV-----------TETNFSEEKEN-------------GSVVKEECN 2467
            F            QQ+           TET    EKEN             G V K+ECN
Sbjct: 557  F--LPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKTNHNSTSASPKGKVQKQECN 614

Query: 2468 G 2470
            G
Sbjct: 615  G 615


>ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032201|gb|ESW30780.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 630

 Score =  520 bits (1339), Expect = e-144
 Identities = 305/660 (46%), Positives = 376/660 (56%), Gaps = 41/660 (6%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQFSSGGGG---GEIQH----RQWFPDERDGFISWLRGEFAAANAII 772
            MAMPSGNVVI DKMQF +GGGG   GEIQ     +QWF DERDG I WLR EFAAANAII
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60

Query: 773  DSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRRQHRY 952
            DSLC+HLR  G+PGEYD+VIG+IQQRRCNWN VL MQQYFS+++V Y LQQVAWR+Q R 
Sbjct: 61   DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120

Query: 953  FDQMKGTGKEFKRAGGVGYRQGQRVEAVKEGHTFSVQSHXXXXXXXXXXXXEK-EPRVNE 1129
             D +K   KE ++ G  GYR G R E  KEG+  SV+S+            EK  P V++
Sbjct: 121  LDPVKVGAKEVRKPGP-GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGTPTVDK 179

Query: 1130 NVQDAKHGREIGESDHKVPPLADEKKDGFLKSSGNSDGIMCGNSGLEVKEVGDKCIPNSK 1309
            +             +HK                          SG +V++VGDK + + +
Sbjct: 180  S------------EEHK--------------------------SGSKVEKVGDKGLASPE 201

Query: 1310 GSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTELFDGKAVNVVDGLKLYE-LFDNSEVS 1486
                    NDS S+++Q++ Q+ +T+ KTF+G E+ DGK VN+ DGLKLYE +FD++EVS
Sbjct: 202  EKKG----NDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDSTEVS 257

Query: 1487 KLASLVNDLRVAGRRGQFQG-QTFVVSKRPMKGHGREMIQFGIPIVDAPPEDENLSGTSK 1663
             L SLVNDLR++G++GQ QG Q +VVS+RPMKGHGREMIQ G+PI DAP E EN++G SK
Sbjct: 258  NLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGENMTGASK 317

Query: 1664 DCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFYNEGDHSQPHMLPPWFGKPVCILFL 1843
               +E IP   +D+ ER+V  QVMTTKPD CI+DFYNEGDHSQPH  P WFG+PV  LFL
Sbjct: 318  VMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVYTLFL 377

Query: 1844 TECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQGKSLDFAKHAIPSIRKQRVLVTFA 2023
            TEC+MTFGR I  + PGDYR             A+QGKS DFAKHA+PSIRKQR+LVTF 
Sbjct: 378  TECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQRILVTFT 437

Query: 2024 KAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPNHIRHPSGPKHYA---XXXXXXXXX 2194
            K+ PKK++                       +R+PNH+RH  G KHYA            
Sbjct: 438  KSQPKKSVPS---DAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTTGVLPAPP 494

Query: 2195 XXXXXXXSNGMQPIFVTT----XXXXXXXXXXXXXXXXXXXXXXXXXXXXRLPVPGTGVF 2362
                     GMQP+FV                                  R+P PGTGVF
Sbjct: 495  IRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIPAPGTGVF 554

Query: 2363 ---XXXXXXXXXXXXQQVTETNFS-------EEKEN--------------GSVVKEECNG 2470
                             + E N S       +EKEN              G V K+ECNG
Sbjct: 555  LPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKVQKQECNG 614


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  513 bits (1321), Expect = e-142
 Identities = 312/660 (47%), Positives = 365/660 (55%), Gaps = 50/660 (7%)
 Frame = +2

Query: 614  MAMPSGNVVISDKMQF---SSGGGGG-------EIQHRQWFP-DERDGFISWLRGEFAAA 760
            MAMP GNVVI DK+QF   ++GGGGG       ++Q  QWFP DERDGFISWLRGEFAAA
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60

Query: 761  NAIIDSLCYHLRSAGEPGEYDVVIGSIQQRRCNWNPVLHMQQYFSISEVLYALQQVAWRR 940
            NAIIDSLC+HLR+ GE GEYD+V+G IQQRR NWN VLHMQQYFS+ EV+ ALQQV  RR
Sbjct: 61   NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120

Query: 941  QHR--------------YFDQMKGTGKEFKRAGGVGYRQGQRV-------EAVKEGHTFS 1057
            Q +              Y+D  K  G++FKR+   G+ +G R        +AVKEG   S
Sbjct: 121  QQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGGGDAVKEGVNSS 180

Query: 1058 VQSHXXXXXXXXXXXXEKEPRVNENVQDAKHGREIGESDHK-VPPLADEKKDGFLKSSGN 1234
            V++H            EK        ++ K G + G+SD K     A    D    SSGN
Sbjct: 181  VENHSFNGNSSENIRSEK-------FEEVKSGGDGGKSDDKKADATAKSHTDNHKNSSGN 233

Query: 1235 SDGIMCGNSGLEVKEVGDKCIPNSKGSCNVLLENDSYSLQNQNEKQNLTTLPKTFVGTEL 1414
            + G   GNS                                 NEKQNL   PKTFV  E 
Sbjct: 234  AQGTFSGNSEAVA-----------------------------NEKQNLAITPKTFVAEEK 264

Query: 1415 FDGKAVNVVDGLKLYE-LFDNSEVSKLASLVNDLRVAGRRGQFQGQTFVVSKRPMKGHGR 1591
             DG+ VNVVDGLKLYE L D  EVSKL SLVN+LR  GRRGQ QGQT+++SKRPMKGHGR
Sbjct: 265  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 324

Query: 1592 EMIQFGIPIVDAPPEDENLSGTSKDCKIESIPVFLQDVAERLVGMQVMTTKPDSCIIDFY 1771
            EMIQ G+PI DAP EDEN +GTSK   +ESIP  LQDV E  V MQVMT KPDSCIID Y
Sbjct: 325  EMIQLGLPIADAPAEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 383

Query: 1772 NEGDHSQPHMLPPWFGKPVCILFLTECDMTFGRAIGIDRPGDYRXXXXXXXXXXXXXAVQ 1951
            NEGDHSQPHM PPWFGKPV +LFLTEC++TFG+ I     GDY+              +Q
Sbjct: 384  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 443

Query: 1952 GKSLDFAKHAIPSIRKQRVLVTFAKAIPKKTLADXXXXXXXXXXXXXXXXXXXXXTRAPN 2131
            GKS D AKHAIP I+KQR+LVTF K+ PKK L                       +R+PN
Sbjct: 444  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPN 502

Query: 2132 HIRHPSGPKHYA---XXXXXXXXXXXXXXXXSNGMQPIFVTT-----XXXXXXXXXXXXX 2287
            H+RHP  PKHYA                    NG+QP+F+TT                  
Sbjct: 503  HLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVS 561

Query: 2288 XXXXXXXXXXXXXXXRLPVPGTGVFXXXXXXXXXXXXQQV----TETNF----SEEKENG 2443
                            +P+PGTGVF             Q+    TE NF     +EKENG
Sbjct: 562  TGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENG 621


Top