BLASTX nr result

ID: Sinomenium21_contig00029605

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00029605
         (753 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   184   4e-44
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   184   4e-44
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   181   2e-43
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   174   2e-41
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   174   2e-41
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     172   1e-40
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   171   3e-40
ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phas...   169   9e-40
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...   169   9e-40
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              167   5e-39
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   161   2e-37
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   159   1e-36
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   154   2e-35
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   154   2e-35
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   154   2e-35
gb|ABK95394.1| unknown [Populus trichocarpa]                          154   2e-35
ref|XP_002315841.2| hydroxyproline-rich glycoprotein [Populus tr...   154   3e-35
ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family prot...   150   3e-34
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   150   3e-34
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   150   3e-34

>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  184 bits (466), Expect = 4e-44
 Identities = 106/205 (51%), Positives = 122/205 (59%), Gaps = 4/205 (1%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            S+L+MQGKSADFAKHAI SIRKQRILVT TKSQPKKS  SDGQ  P  A A +  WGP P
Sbjct: 408  SILLMQGKSADFAKHAIPSIRKQRILVTLTKSQPKKSTTSDGQRFPAPAPAQSSYWGPPP 467

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPVP----HLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP G KHY AVPTTGVLP P     LP  N +QPLFV               
Sbjct: 468  SRSPNHIRHPTGPKHYAAVPTTGVLPAPPIRSQLPPQNGIQPLFVPAPVGPAIPFAAAVP 527

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 +GW A             PGTGVFLPP GSG+      LP  T T+ S  VETP 
Sbjct: 528  IPPGSAGWPA--APRHPPPRIPLPGTGVFLPPPGSGNSSAPQQLP-GTATEMSPTVETPS 584

Query: 80   FSENENGPERVNCNSNASPKGKLDG 6
              + +NG  + N +++ASPKGK DG
Sbjct: 585  PRDKDNGSGKSNHSTSASPKGKSDG 609


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  184 bits (466), Expect = 4e-44
 Identities = 111/208 (53%), Positives = 125/208 (60%), Gaps = 7/208 (3%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+M SDGQ L L   A +  W P P
Sbjct: 454  SLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRL-LPPAAQSSHWVPPP 512

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV------PHLPSPNNMQPLFVTXXXXXXXXXXXX 267
            SR  +++RHP G KHYGAVPTTGVLP       P LP PN MQPLFVT            
Sbjct: 513  SRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAP 572

Query: 266  XXXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVET 87
                    GW A             PGTGVFLPP GSG+     +   ++   TS+ VET
Sbjct: 573  VPLPTGSPGWPA-APPRHPPPRLPVPGTGVFLPPPGSGNSSSPQH---ISTEATSTSVET 628

Query: 86   PVFSENENGPERVNCNSN-ASPKGKLDG 6
               +E ENG  + + NSN  SPKGKLDG
Sbjct: 629  AAPTEKENGSGKSSSNSNTVSPKGKLDG 656


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  181 bits (460), Expect = 2e-43
 Identities = 111/208 (53%), Positives = 125/208 (60%), Gaps = 7/208 (3%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+  SDGQ L L   A +  W P P
Sbjct: 467  SLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRL-LPPAAQSSHWVPPP 525

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV------PHLPSPNNMQPLFVTXXXXXXXXXXXX 267
            SR  +++RHP G KHYGAVPTTGVLP       P LP PN MQPLFVT            
Sbjct: 526  SRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAP 585

Query: 266  XXXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVET 87
                    GW A             PGTGVFLPP GSG+     +   ++   TS+ VET
Sbjct: 586  XPLPTGSPGWPA-APPRHPPPRLPVPGTGVFLPPPGSGNSSSPQH---ISTEATSTSVET 641

Query: 86   PVFSENENGPERVNCNSN-ASPKGKLDG 6
               +E ENG  + + NSN  SPKGKLDG
Sbjct: 642  AAPTEKENGSGKSSSNSNTVSPKGKLDG 669


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max]
          Length = 641

 Score =  174 bits (442), Expect = 2e-41
 Identities = 101/203 (49%), Positives = 122/203 (60%), Gaps = 4/203 (1%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVM+GKS+DFAKHA+ S+RKQRILVTFTKSQP+KS+ SD Q L  +AT+S   WGPLP
Sbjct: 411  SLLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSS--HWGPLP 468

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++VRH  G KHY  +PTTGVLP     P + +P  MQPLFVT              
Sbjct: 469  SRSPNHVRHHVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVA 528

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 +GW               PGTGVFLPP GSG+   S  LP  TL + +   ETP 
Sbjct: 529  FPPGSTGWTGAPPPRHPPPRVPAPGTGVFLPPPGSGN--SSQQLPAGTLAEVNPSTETPT 586

Query: 80   FSENENGPERVNCNSNASPKGKL 12
              E ENG    N +++ASPKGK+
Sbjct: 587  MLEKENGKTNHN-STSASPKGKV 608


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  174 bits (442), Expect = 2e-41
 Identities = 101/203 (49%), Positives = 122/203 (60%), Gaps = 4/203 (1%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVM+GKS+DFAKHA+ S+RKQRILVTFTKSQP+KS+ SD Q L  +AT+S   WGPLP
Sbjct: 451  SLLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSS--HWGPLP 508

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++VRH  G KHY  +PTTGVLP     P + +P  MQPLFVT              
Sbjct: 509  SRSPNHVRHHVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVA 568

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 +GW               PGTGVFLPP GSG+   S  LP  TL + +   ETP 
Sbjct: 569  FPPGSTGWTGAPPPRHPPPRVPAPGTGVFLPPPGSGN--SSQQLPAGTLAEVNPSTETPT 626

Query: 80   FSENENGPERVNCNSNASPKGKL 12
              E ENG    N +++ASPKGK+
Sbjct: 627  MLEKENGKTNHN-STSASPKGKV 648


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  172 bits (436), Expect = 1e-40
 Identities = 106/204 (51%), Positives = 121/204 (59%), Gaps = 4/204 (1%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLL MQGKSADFAKHAI S+R+QRILVTFTKSQPKKSM SDGQ +P    A +  WGP P
Sbjct: 440  SLLAMQGKSADFAKHAIPSLRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGPQP 499

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVL---PV-PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP G KHY  VPTTGVL   PV P +P PN +QPLFVT              
Sbjct: 500  SRSPNHIRHP-GPKHYAPVPTTGVLQASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVP 558

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 SGW+A             PGTGVFLPP GSG    S+    V    T+  VET  
Sbjct: 559  IPPSSSGWSA-APPRHPPPRLPVPGTGVFLPPPGSG--GNSSGSQQVLGNDTNHTVETAA 615

Query: 80   FSENENGPERVNCNSNASPKGKLD 9
              E ENG  ++N    ASPKGK+D
Sbjct: 616  PPEKENGSGKLNHGMTASPKGKVD 639


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  171 bits (432), Expect = 3e-40
 Identities = 104/204 (50%), Positives = 122/204 (59%), Gaps = 5/204 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS+DFAKHA+ S RKQRILVTFTKSQP+KS+ SD Q L  SA AS+  WGP P
Sbjct: 454  SLLVMQGKSSDFAKHALPSTRKQRILVTFTKSQPRKSLSSDAQQL-ASAVASS-HWGPPP 511

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++VRH  G KHY  +PTTGVLP     P + +P  MQPLFV               
Sbjct: 512  SRSPNHVRHHVGPKHYATLPTTGVLPAPPIRPQMAAPVGMQPLFVAAPVVPPMPFSAPVP 571

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 +GW A             PGTGVFLPP GSG+   S  LP  TL + +   ETP 
Sbjct: 572  IPAGSTGWTAAPPPRHPPPRVPAPGTGVFLPPSGSGN--SSQQLPASTLAEVNPSTETPT 629

Query: 80   FSENENGPERVNCNS-NASPKGKL 12
              E ENG  ++N NS +ASPKGK+
Sbjct: 630  MPEKENG--KINHNSTSASPKGKV 651


>ref|XP_007158786.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032201|gb|ESW30780.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 630

 Score =  169 bits (428), Expect = 9e-40
 Identities = 100/204 (49%), Positives = 117/204 (57%), Gaps = 5/204 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLL MQGKS DFAKHA+ SIRKQRILVTFTKSQPKKS+ SD Q L L A +S   WGP P
Sbjct: 408  SLLAMQGKSCDFAKHALPSIRKQRILVTFTKSQPKKSVPSDAQRLYLPAASS--QWGPPP 465

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++VRH  G KHY A+PTTGVLP     P +P+   MQPLFV               
Sbjct: 466  SRSPNHVRHSVGSKHYAALPTTGVLPAPPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVS 525

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP- 84
                 +GW               PGTGVFLPP GSG+      LP  TL + +  +ETP 
Sbjct: 526  IPPGSAGWTTAPPPRHPPPRIPAPGTGVFLPPPGSGN--SQQQLPAGTLAEVNPSIETPT 583

Query: 83   VFSENENGPERVNCNSNASPKGKL 12
               E ENG    + +S+ SPKGK+
Sbjct: 584  TMQEKENGKSNDDNSSSTSPKGKV 607


>ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032200|gb|ESW30779.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  169 bits (428), Expect = 9e-40
 Identities = 100/204 (49%), Positives = 117/204 (57%), Gaps = 5/204 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLL MQGKS DFAKHA+ SIRKQRILVTFTKSQPKKS+ SD Q L L A +S   WGP P
Sbjct: 449  SLLAMQGKSCDFAKHALPSIRKQRILVTFTKSQPKKSVPSDAQRLYLPAASS--QWGPPP 506

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++VRH  G KHY A+PTTGVLP     P +P+   MQPLFV               
Sbjct: 507  SRSPNHVRHSVGSKHYAALPTTGVLPAPPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVS 566

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP- 84
                 +GW               PGTGVFLPP GSG+      LP  TL + +  +ETP 
Sbjct: 567  IPPGSAGWTTAPPPRHPPPRIPAPGTGVFLPPPGSGN--SQQQLPAGTLAEVNPSIETPT 624

Query: 83   VFSENENGPERVNCNSNASPKGKL 12
               E ENG    + +S+ SPKGK+
Sbjct: 625  TMQEKENGKSNDDNSSSTSPKGKV 648


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  167 bits (422), Expect = 5e-39
 Identities = 100/189 (52%), Positives = 112/189 (59%), Gaps = 6/189 (3%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+M SDGQ L L   A +  W P P
Sbjct: 461  SLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRL-LPPAAQSSHWVPPP 519

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV------PHLPSPNNMQPLFVTXXXXXXXXXXXX 267
            SR  +++RHP G KHYGAVPTTGVLP       P LP PN MQPLFVT            
Sbjct: 520  SRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAP 579

Query: 266  XXXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVET 87
                    GW A             PGTGVFLPP GSG+     +   ++   TS+ VET
Sbjct: 580  VPLPTGSPGWPA-APPRHPPPRLPVPGTGVFLPPPGSGNSSSPQH---ISTEATSTSVET 635

Query: 86   PVFSENENG 60
               +E ENG
Sbjct: 636  AAPTEKENG 644


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  161 bits (408), Expect = 2e-37
 Identities = 99/206 (48%), Positives = 119/206 (57%), Gaps = 6/206 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLL++QGKSAD+AKHAI SIRKQRILVTFTKSQP+KS  +DGQ LP    + +  W P P
Sbjct: 441  SLLLLQGKSADYAKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPP 500

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
             R  +++RHPAG KHY AVPTTGVLP     P LP  N +QPLFV               
Sbjct: 501  GRSPNHIRHPAGPKHYAAVPTTGVLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPVV 560

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSG--HPPPSNYLPLVTLTQTSSVVET 87
                  GW A             PGTGVFLPP GSG    PP  +    T T+ +  VET
Sbjct: 561  IPPGSPGWVA--APRHPPPRMPLPGTGVFLPPPGSGSSSAPPQQFPS--TATEMNPSVET 616

Query: 86   PVFSENENGPERVNCNSNASPKGKLD 9
               +E +NG  + + ++ ASPK KLD
Sbjct: 617  -ASTEKDNGTAK-SSHAIASPKAKLD 640


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  159 bits (402), Expect = 1e-36
 Identities = 94/204 (46%), Positives = 116/204 (56%), Gaps = 4/204 (1%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGK+ DFAKHAI +IRKQR+L+TFTKSQPKK + SDGQ L   A + +  WGP P
Sbjct: 460  SLLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKKFVQSDGQRLTSPAASPSSHWGPPP 519

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP   KHY  +PTTGVLP     P +  PN +QPLFVT              
Sbjct: 520  SRSPNHIRHPVS-KHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVP 578

Query: 260  XXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPV 81
                 +GW A             PGTGVFLPP GSG+   S  +P    T+ +   ET  
Sbjct: 579  MPPVSTGWPAAPRHPPNRLPVPVPGTGVFLPPPGSGN-ASSPQIP--NATEINFPAETAS 635

Query: 80   FSENENGPERVNCNSNASPKGKLD 9
              + ENG  + N  + ASPK KL+
Sbjct: 636  LQDKENGLGKSNHGTCASPKEKLE 659


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  154 bits (390), Expect = 2e-35
 Identities = 99/204 (48%), Positives = 118/204 (57%), Gaps = 2/204 (0%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS DFAKHA+ SI KQRI++TFTKSQPK S+ +D Q L   A  +A  W P  
Sbjct: 436  SLLVMQGKSTDFAKHALPSIHKQRIIITFTKSQPKCSLPNDSQRL---APPAASHWAPPQ 492

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPVP--HLPSPNNMQPLFVTXXXXXXXXXXXXXXXX 255
            SR  ++VRH  G KHY  VP T VLP P  H P PN+MQPLFV                 
Sbjct: 493  SRSPNHVRHQLGPKHYPTVPATVVLPAPSIHAP-PNSMQPLFVPAPVAPPMSFPTPVPIP 551

Query: 254  XXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETPVFS 75
               +GW +             PGTGVFLPP GSG    S +LP  T+ + +  VET   S
Sbjct: 552  PGSTGWTS-APSRHPPPRIPVPGTGVFLPPPGSG--TSSQHLP-CTVPEVNPSVETLTVS 607

Query: 74   ENENGPERVNCNSNASPKGKLDGN 3
              ENG  + N N+N+SPKGK+DGN
Sbjct: 608  GKENG--KSNHNTNSSPKGKMDGN 629


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  154 bits (390), Expect = 2e-35
 Identities = 94/203 (46%), Positives = 115/203 (56%), Gaps = 5/203 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS+D AKHAI  I+KQR+LVTFTKSQPKK   +DG  LP  A A +  WGP P
Sbjct: 456  SLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPP 515

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP   KHY A+PTTGVL V    P +P PN +QPLF+T              
Sbjct: 516  SRSPNHLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVP 574

Query: 260  XXXXXSGW-AAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                 +GW  +             PGTGVFLPP GSG+   +  L   + T T     T 
Sbjct: 575  IPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQL---SATATEMNFPTE 631

Query: 83   VFSENENGPERVNCNSNASPKGK 15
               E ENGP + N +++ASPK K
Sbjct: 632  TEKEKENGPGKSNHDTSASPKEK 654


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  154 bits (390), Expect = 2e-35
 Identities = 94/203 (46%), Positives = 115/203 (56%), Gaps = 5/203 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS+D AKHAI  I+KQR+LVTFTKSQPKK   +DG  LP  A A +  WGP P
Sbjct: 438  SLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPP 497

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP   KHY A+PTTGVL V    P +P PN +QPLF+T              
Sbjct: 498  SRSPNHLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVP 556

Query: 260  XXXXXSGW-AAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                 +GW  +             PGTGVFLPP GSG+   +  L   + T T     T 
Sbjct: 557  IPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQL---SATATEMNFPTE 613

Query: 83   VFSENENGPERVNCNSNASPKGK 15
               E ENGP + N +++ASPK K
Sbjct: 614  TEKEKENGPGKSNHDTSASPKEK 636


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  154 bits (390), Expect = 2e-35
 Identities = 94/203 (46%), Positives = 115/203 (56%), Gaps = 5/203 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS+D AKHAI  I+KQR+LVTFTKSQPKK   +DG  LP  A A +  WGP P
Sbjct: 457  SLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPP 516

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  +++RHP   KHY A+PTTGVL V    P +P PN +QPLF+T              
Sbjct: 517  SRSPNHLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVP 575

Query: 260  XXXXXSGW-AAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                 +GW  +             PGTGVFLPP GSG+   +  L   + T T     T 
Sbjct: 576  IPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQL---SATATEMNFPTE 632

Query: 83   VFSENENGPERVNCNSNASPKGK 15
               E ENGP + N +++ASPK K
Sbjct: 633  TEKEKENGPGKSNHDTSASPKEK 655


>ref|XP_002315841.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550329565|gb|EEF02012.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 699

 Score =  154 bits (389), Expect = 3e-35
 Identities = 98/205 (47%), Positives = 118/205 (57%), Gaps = 6/205 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSMLSDGQHLPLSATASALPWGPLP 429
            SLLVMQGKS+D AKHAI  IRKQR+L+TFTKSQPKK   +DG  LP  A A +  WGP  
Sbjct: 464  SLLVMQGKSSDVAKHAIPMIRKQRMLITFTKSQPKKFSSTDGSRLPSHAVAPSSHWGPSL 523

Query: 428  SRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXXX 261
            SR  ++ RHP   KHY A+PT GVLPV    P +P PN +QP+FVT              
Sbjct: 524  SRSPNHPRHPV-PKHYAAIPTAGVLPVPPIRPQIPPPNGVQPIFVT----TTVPFPAPVP 578

Query: 260  XXXXXSGW-AAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVT-LTQTSSVVET 87
                 +GW  A             PGTGVFLPP GSG+   S+ L L T  T+ +   ET
Sbjct: 579  IPPVSTGWLTASPRHPSARLPVPIPGTGVFLPPPGSGN--ASSPLQLSTAATEMNFHTET 636

Query: 86   PVFSENENGPERVNCNSNASPKGKL 12
                E ENG  + NC+++ASPK KL
Sbjct: 637  ASLPEKENGLGKSNCDTSASPKEKL 661


>ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
           [Theobroma cacao] gi|508709406|gb|EOY01303.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 5 [Theobroma cacao]
          Length = 572

 Score =  150 bits (380), Expect = 3e-34
 Identities = 95/206 (46%), Positives = 117/206 (56%), Gaps = 5/206 (2%)
 Frame = -2

Query: 608 SLLVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSMLSDGQHLPLSATASALPWGPL 432
           SLLVMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKS  +D Q L   + + +  WGP 
Sbjct: 337 SLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKS-TTDNQRLSSPSVSQSSQWGPP 395

Query: 431 PSRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXX 264
           PSR  + +RH AG KHY  +PTTGVLP     P +P  + +QPLFV              
Sbjct: 396 PSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV 455

Query: 263 XXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                 +GW A             PGTGVFLPP GSG+   S+     T T+ + +VET 
Sbjct: 456 PIPPGSTGWPA--APRHPPPRLPVPGTGVFLPPPGSGN--SSSQQLSTTATELNILVETT 511

Query: 83  VFSENENGPERVNCNSNASPKGKLDG 6
              E ENG  + N +   SP+G+LDG
Sbjct: 512 SPREKENGSVKPN-HHTTSPRGRLDG 536


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  150 bits (380), Expect = 3e-34
 Identities = 95/206 (46%), Positives = 117/206 (56%), Gaps = 5/206 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSMLSDGQHLPLSATASALPWGPL 432
            SLLVMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKS  +D Q L   + + +  WGP 
Sbjct: 445  SLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKS-TTDNQRLSSPSVSQSSQWGPP 503

Query: 431  PSRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXX 264
            PSR  + +RH AG KHY  +PTTGVLP     P +P  + +QPLFV              
Sbjct: 504  PSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV 563

Query: 263  XXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                  +GW A             PGTGVFLPP GSG+   S+     T T+ + +VET 
Sbjct: 564  PIPPGSTGWPA--APRHPPPRLPVPGTGVFLPPPGSGN--SSSQQLSTTATELNILVETT 619

Query: 83   VFSENENGPERVNCNSNASPKGKLDG 6
               E ENG  + N +   SP+G+LDG
Sbjct: 620  SPREKENGSVKPN-HHTTSPRGRLDG 644


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  150 bits (380), Expect = 3e-34
 Identities = 95/206 (46%), Positives = 117/206 (56%), Gaps = 5/206 (2%)
 Frame = -2

Query: 608  SLLVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSMLSDGQHLPLSATASALPWGPL 432
            SLLVMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKS  +D Q L   + + +  WGP 
Sbjct: 446  SLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKS-TTDNQRLSSPSVSQSSQWGPP 504

Query: 431  PSRPTSYVRHPAGHKHYGAVPTTGVLPV----PHLPSPNNMQPLFVTXXXXXXXXXXXXX 264
            PSR  + +RH AG KHY  +PTTGVLP     P +P  + +QPLFV              
Sbjct: 505  PSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV 564

Query: 263  XXXXXXSGWAAXXXXXXXXXXXXXPGTGVFLPPQGSGHPPPSNYLPLVTLTQTSSVVETP 84
                  +GW A             PGTGVFLPP GSG+   S+     T T+ + +VET 
Sbjct: 565  PIPPGSTGWPA--APRHPPPRLPVPGTGVFLPPPGSGN--SSSQQLSTTATELNILVETT 620

Query: 83   VFSENENGPERVNCNSNASPKGKLDG 6
               E ENG  + N +   SP+G+LDG
Sbjct: 621  SPREKENGSVKPN-HHTTSPRGRLDG 645

