BLASTX nr result

ID: Paeonia22_contig00007095 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00007095
         (1786 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   393   e-106
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   391   e-106
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     365   3e-98
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   362   3e-97
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   361   7e-97
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   360   1e-96
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              355   4e-95
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   352   2e-94
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   340   1e-90
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   338   6e-90
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   307   1e-80
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   297   9e-78
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   297   1e-77
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   293   1e-76
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   292   4e-76
gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   289   2e-75

>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  393 bits (1009), Expect = e-106
 Identities = 238/481 (49%), Positives = 269/481 (55%), Gaps = 88/481 (18%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 1320
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAVL PE+   G+    +E ++TQ P+                      ATQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L              P
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 797  FPDPESVSHG-PHFLEFRTGGPPQL--LNKLNTHDWGSRLGSGSLTPDA----------- 660
            FPD + V  G   FLEFR GGPP+L  L+KL+ H+WGSR+GSGS+TPDA           
Sbjct: 239  FPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVL 298

Query: 659  -----------------------------------PSNEIVVDHRVSFELTPENIVRCVE 585
                                               P+NEI+VDHRVSFELT E++VRCVE
Sbjct: 299  DRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVE 358

Query: 584  KEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXXXXXXXXNRQHH 444
            K+   L +AVSASLQN  T ++ + S                               Q H
Sbjct: 359  KDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPH 418

Query: 443  QKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGV 321
             K RSITLGS KEFNFDNADGG SDK                     KNWS F MMQP V
Sbjct: 419  HKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSV 478

Query: 320  S 318
            S
Sbjct: 479  S 479


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  391 bits (1004), Expect = e-106
 Identities = 238/459 (51%), Positives = 261/459 (56%), Gaps = 66/459 (14%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1320
            MRR+NGESR  N+                 R P    QKRRWGS WS+YWCFG  +  KR
Sbjct: 1    MRRVNGESRTGNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKR 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAVLVPET   G D   +E  + Q PS                      ATQSPAG  
Sbjct: 61   IGHAVLVPETTDRGGDAPRAENPI-QTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFF 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            SLT   A+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLT---ASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 176

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQLLDP+ R      RFP S YEFQSYQLYPGSPVGQL              P
Sbjct: 177  SSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSP 236

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGSRLGSGSLTPDAP----------- 657
            FPD E  + G HFLEFRTG PP+LLN   L+T DWGSRLGSGS+TPD             
Sbjct: 237  FPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLK 296

Query: 656  -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHE- 531
                             +N+I ++HRVSFEL+ E ++RCVEK+P  LA AVS SL++ E 
Sbjct: 297  PQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEK 356

Query: 530  ------TGKVTKESL----XXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADG 381
                    KV   S+                     Q H K RSITLGSVKEFNFDN DG
Sbjct: 357  AQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDG 416

Query: 380  GCSD---------------KEG---KNWSFFPMMQPGVS 318
            G S                KE    KNWSFFPMMQPGVS
Sbjct: 417  GDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  365 bits (938), Expect = 3e-98
 Identities = 224/449 (49%), Positives = 254/449 (56%), Gaps = 61/449 (13%)
 Frame = -3

Query: 1481 GESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKRIGHAV 1305
            G+SR +N+                 R P    +KRRWG C SIYWCFG+ K   RIGH V
Sbjct: 8    GDSRTMNNALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGV 67

Query: 1304 LVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLLSLTSI 1125
            LVPET   G     +E S TQ  +                      ATQSPAGLLSLTS+
Sbjct: 68   LVPETAQPGNSAPRAENS-TQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSV 126

Query: 1124 SANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 945
            SA+MYSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV
Sbjct: 127  SASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 186

Query: 944  PFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPE 783
            PFAQLLDPN       +RFP    EFQSY   PGSP+GQL              PFPDPE
Sbjct: 187  PFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPE 246

Query: 782  SVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPD----------AP------ 657
              + GPHFLEFRTG PP+LLN  KL+  DWGSR GSGSLTPD          AP      
Sbjct: 247  FAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPISTFEVAPHLKPNG 306

Query: 656  ---SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASL---------QNHETGKVTK 513
               + E V D RVSF+++ E+++R VEK+   LA A+  SL         +N ++ KV +
Sbjct: 307  RCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEE 366

Query: 512  ESLXXXXXXXXXXXXXXXNRQ-----HHQKHRSITLGSVKEFNFDNADGG---------- 378
                                       HQKHRSITLGS KEFNFDNAD G          
Sbjct: 367  IGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSD 426

Query: 377  ------CSDKEG---KNWSFFPMMQPGVS 318
                   + KEG   +NWSFFPM+QPGVS
Sbjct: 427  WWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  362 bits (929), Expect = 3e-97
 Identities = 227/463 (49%), Positives = 255/463 (55%), Gaps = 70/463 (15%)
 Frame = -3

Query: 1496 MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXTR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 1323
            MR +NG +SRA+N+                 R      QKRRWG CWSI WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRK 60

Query: 1322 RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGL 1143
            RIGHAVLVPE PT          + TQ  +                      ATQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 1142 LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 963
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 962  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 801
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 800  PFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 651
            PFPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQ 299

Query: 650  -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 528
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 527  GKVTKESLXXXXXXXXXXXXXXXNRQ-------------HHQKHRSITLGSVKEFNFDNA 387
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 386  DG------------------GCSDKEGKNWSFFPMMQ--PGVS 318
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  361 bits (926), Expect = 7e-97
 Identities = 226/463 (48%), Positives = 255/463 (55%), Gaps = 70/463 (15%)
 Frame = -3

Query: 1496 MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXTR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 1323
            MR +NG +SRA+N+                 R      QKRRWG CW+I WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRK 60

Query: 1322 RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGL 1143
            RIGHAVLVPE PT          + TQ  +                      ATQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 1142 LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 963
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 962  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 801
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 800  PFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 651
            PFPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQ 299

Query: 650  -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 528
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 527  GKVTKESLXXXXXXXXXXXXXXXNRQ-------------HHQKHRSITLGSVKEFNFDNA 387
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 386  DG------------------GCSDKEGKNWSFFPMMQ--PGVS 318
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  360 bits (924), Expect = 1e-96
 Identities = 213/457 (46%), Positives = 252/457 (55%), Gaps = 65/457 (14%)
 Frame = -3

Query: 1496 MRRLNG--ESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQT 1326
            MR +NG  +SR  N+                 R P    QKRRWGSCWS+YWCFG H+  
Sbjct: 2    MRNVNGGADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRHR 61

Query: 1325 KRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAG 1146
            KRIGHAVLVPE    G D+  +E   TQ P+                      A+QSPAG
Sbjct: 62   KRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAG 121

Query: 1145 LLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLT 966
            +LSLTS+SA+MYSP GP SIFAIGPYAHETQLVSPP FSTFTTEPSTAPFTPPPESV LT
Sbjct: 122  ILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLT 181

Query: 965  TPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXX 804
            TPSSPEVPFAQLL+P++R      RFP+S YEFQSYQ YPGSPVGQL             
Sbjct: 182  TPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTS 241

Query: 803  XPFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP--------- 657
             PFPD E  + GP FLEF+   PP+LLN  KL+ H+ GSR GSG+LTPDA          
Sbjct: 242  SPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSFPL 301

Query: 656  -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNH-E 531
                              ++ V D RVSF+L+ E+ +R  E +P    + +  S++N   
Sbjct: 302  DRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIA 361

Query: 530  TGKVTKESLXXXXXXXXXXXXXXXNRQ----------HHQKHRSITLGSVKEFNFDNADG 381
              KV K S                  +           HQKHR++TLG+ KEFNFDNADG
Sbjct: 362  AEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG 421

Query: 380  -----------------GCSDKEGKNWSFFPMMQPGV 321
                             G  D   KNWSFFP+MQP +
Sbjct: 422  VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  355 bits (911), Expect = 4e-95
 Identities = 221/432 (51%), Positives = 245/432 (56%), Gaps = 39/432 (9%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 1320
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAVL PE+   G+    +E ++TQ P+                      ATQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L              P
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPSNEIVVDHRVSFE 618
            FPD  S S  P  L     GPP            SR GS       P+NEI+VDHRVSFE
Sbjct: 239  FPD-RSGSITPDAL-----GPP------------SRDGSVLDHSGCPNNEIMVDHRVSFE 280

Query: 617  LTPENIVRCVEKEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXX 477
            LT E++VRCVEK+   L +AVSASLQN  T ++ + S                       
Sbjct: 281  LTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAP 340

Query: 476  XXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKN 354
                    Q H K RSITLGS KEFNFDNADGG SDK                     KN
Sbjct: 341  EDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKN 400

Query: 353  WSFFPMMQPGVS 318
            WS F MMQP VS
Sbjct: 401  WSIFHMMQPSVS 412


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  352 bits (904), Expect = 2e-94
 Identities = 225/460 (48%), Positives = 253/460 (55%), Gaps = 68/460 (14%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1320
            MR  NGES A+N+                 R P    QKRRWG CWSIYWCFGS+KQ KR
Sbjct: 1    MRGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKR 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IG AVL  ET  +G +   +E   TQ P+                      ATQSPAGL+
Sbjct: 61   IGPAVLTSETSFSGANVPAAENP-TQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLV 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            SLTSISA+MYSPG P SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 959  SSPEVPFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQLL PN       +RFP S YEFQSYQL+PGSPVGQL              P
Sbjct: 179  SSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSP 238

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSNEIVVD 636
            F D E  +   HF EFR G PP+LLN  K ++ +WGS  GSG+LTPDA    P N  ++D
Sbjct: 239  FRDGEFAA-SLHFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLD 297

Query: 635  -------------------------HRVSFELTPENIVRCVEKEPEGLARAVSASLQ--- 540
                                     HRVSFELT E +VR +E E    + AVS SLQ   
Sbjct: 298  HQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEA 357

Query: 539  -----NHETGKVTKESLXXXXXXXXXXXXXXXNRQ---HHQKHRSITLGSVKEFNFDNAD 384
                  H+T  V                    +R+    H KH+SITLGS KEFNFDN D
Sbjct: 358  TRESEEHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVD 417

Query: 383  GGCSDKE-------------------GKNWSFFPMMQPGV 321
            GG + K                     +NWSFFPMMQPGV
Sbjct: 418  GGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  352 bits (903), Expect = 3e-94
 Identities = 222/456 (48%), Positives = 246/456 (53%), Gaps = 63/456 (13%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQYQKRRWGSCWSIYWCFGSHKQTKRI 1317
            MR  NGESRA N+                 R P    +RRWGSCWSIY CFG  K  K+I
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQRRWGSCWSIYLCFGYQKHKKQI 60

Query: 1316 GHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLLS 1137
            GHAVL PE    G     SE   TQ P+                       TQSPAGL+S
Sbjct: 61   GHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVS 119

Query: 1136 LTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 957
            LTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS
Sbjct: 120  LTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 179

Query: 956  SPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPF 795
            SPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL              PF
Sbjct: 180  SPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPF 236

Query: 794  PDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP--------------- 666
            PD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP               
Sbjct: 237  PDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHRQ 296

Query: 665  --DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGKV 519
              D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K 
Sbjct: 297  FSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAKE 356

Query: 518  TKES-----------LXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCS 372
             K S                               H+K +SITLGSVKEFNFDNAD G S
Sbjct: 357  EKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDS 416

Query: 371  ---------------DKEG---KNWSFFPMMQPGVS 318
                            KEG   KNWSFFPM+Q GVS
Sbjct: 417  RKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  352 bits (903), Expect = 3e-94
 Identities = 224/457 (49%), Positives = 247/457 (54%), Gaps = 64/457 (14%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1320
            MR  NGESRA N+                 R P    QKRRWGSCWSIY CFG  K  K+
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQ 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAVL PE    G     SE   TQ P+                       TQSPAGL+
Sbjct: 61   IGHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            SLTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL              P
Sbjct: 180  SSPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSP 236

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP-------------- 666
            FPD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP              
Sbjct: 237  FPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHR 296

Query: 665  ---DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 522
               D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K
Sbjct: 297  QFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAK 356

Query: 521  VTKES-----------LXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGC 375
              K S                               H+K +SITLGSVKEFNFDNAD G 
Sbjct: 357  EEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGD 416

Query: 374  S---------------DKEG---KNWSFFPMMQPGVS 318
            S                KEG   KNWSFFPM+Q GVS
Sbjct: 417  SRKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  340 bits (872), Expect = 1e-90
 Identities = 212/456 (46%), Positives = 241/456 (52%), Gaps = 63/456 (13%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1320
            M R+NGE R V+S                 R P    QKRRWG CWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETISAAATAIASVENRVPQASIQKRRWGGCWSMYWCFGSQKQTKR 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAV +PET  +G D   S  S +Q PS                      AT SP G  
Sbjct: 61   IGHAVFIPETTASGADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L              P
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 654
            F D E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLDREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQ 287

Query: 653  ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 522
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 521  VTKESLXXXXXXXXXXXXXXXNR------------QHHQKHRSITLGSVKEFNFDNADGG 378
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 377  CSDKE--GKNW--------------SFFPMMQPGVS 318
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  338 bits (866), Expect = 6e-90
 Identities = 211/456 (46%), Positives = 241/456 (52%), Gaps = 63/456 (13%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1320
            M R+NGE R V+S                 R P    QKRRWGSCWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETINAAATAIASVENRVPQASIQKRRWGSCWSMYWCFGSQKQTKR 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAV +PET  +  D   S  S +Q PS                      AT SP G  
Sbjct: 61   IGHAVFIPETTASAADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L              P
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 654
            F + E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLEREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQ 287

Query: 653  ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 522
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 521  VTKESLXXXXXXXXXXXXXXXNR------------QHHQKHRSITLGSVKEFNFDNADGG 378
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 377  CSDKE--GKNW--------------SFFPMMQPGVS 318
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  329 bits (844), Expect = 2e-87
 Identities = 207/428 (48%), Positives = 233/428 (54%), Gaps = 71/428 (16%)
 Frame = -3

Query: 1388 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1209
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 60

Query: 1208 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1029
                          A QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 61   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 116

Query: 1028 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 867
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 117  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 176

Query: 866  PGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 693
            PGSPVGQL              PF D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 177  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 236

Query: 692  RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 597
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 237  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 596  RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 465
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 297  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 354

Query: 464  XXNRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 342
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 355  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 414

Query: 341  PMMQPGVS 318
            PM+QPG+S
Sbjct: 415  PMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  329 bits (844), Expect = 2e-87
 Identities = 207/428 (48%), Positives = 233/428 (54%), Gaps = 71/428 (16%)
 Frame = -3

Query: 1388 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1209
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 39   QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 97

Query: 1208 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1029
                          A QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 98   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 153

Query: 1028 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 867
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 154  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 213

Query: 866  PGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 693
            PGSPVGQL              PF D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 214  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 273

Query: 692  RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 597
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 274  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 333

Query: 596  RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 465
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 334  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 391

Query: 464  XXNRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 342
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 392  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 451

Query: 341  PMMQPGVS 318
            PM+QPG+S
Sbjct: 452  PMIQPGMS 459


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  307 bits (786), Expect = 1e-80
 Identities = 195/436 (44%), Positives = 229/436 (52%), Gaps = 46/436 (10%)
 Frame = -3

Query: 1496 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1320
            MR +NGESRA N+                 R P    QK+RW S WSIYWCFG  K  ++
Sbjct: 1    MRDVNGESRAANNTLETINAAATAIASAENRVPQAMVQKQRWRSHWSIYWCFGYQKSKRQ 60

Query: 1319 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1140
            IGHAVL PE+   G+    +E S  Q P                        TQSPAGL+
Sbjct: 61   IGHAVLFPESSAPGSGAPAAENS-AQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 1139 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 960
            S TSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 959  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 798
            SSPEVPFAQL+DP  R      RFP+   +FQSYQ +PGS VGQL              P
Sbjct: 180  SSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHPGSSVGQLISPSSGISGSGTSSP 236

Query: 797  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP----------- 657
            FPD E    GPH  EFR G  P+LLN  KL+T +WGS   SG+LTPD+            
Sbjct: 237  FPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSYQDSGALTPDSVRHGSPNFLLHR 294

Query: 656  ---------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 522
                            ++ VV+HR SFEL+ ++  RCVE++P    + V   ++N    K
Sbjct: 295  QFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRCVEEKPACSIKTVPEYVENGTKAK 354

Query: 521  VT----------KESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCS 372
                        +                      H+K + ITLGSV EFNFDNAD G S
Sbjct: 355  EEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQHRKQQPITLGSVNEFNFDNADEGDS 414

Query: 371  -DKEGKNWSFFPMMQP 327
             +    NW   P   P
Sbjct: 415  HNPSSSNWVKQPRTGP 430


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  297 bits (761), Expect = 9e-78
 Identities = 188/435 (43%), Positives = 220/435 (50%), Gaps = 74/435 (17%)
 Frame = -3

Query: 1400 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1221
            P   QKRRWGSC S+YWCFGSH+ +KRIGHAVLVPE    G     SE ++    S    
Sbjct: 27   PTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASE-NLNLSTSIVLP 85

Query: 1220 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 1041
                              +TQSPAG LSLT++S N YSP GP S+FAIGPYAHETQLVSP
Sbjct: 86   FIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSP 145

Query: 1040 PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 891
            PVFSTF TEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++   S Y
Sbjct: 146  PVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNY 205

Query: 890  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN 711
            EFQ YQLYP SPVG L              PFPD   +   P  L F            +
Sbjct: 206  EFQPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPIVEAPKLLGF---------EHFS 253

Query: 710  THDWGSRLGSGSLTPD----------------------------APSNEIVVDHRVSFEL 615
            T  WGSRLGSGSLTPD                            + + E V+DHRVSFEL
Sbjct: 254  TRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFEL 313

Query: 614  TPENIVRCVEKEPEGLARAVSASLQN-HETGKVTKES---------------LXXXXXXX 483
              E++  CVEK+P   A  V  +LQ+  E G++ +E                        
Sbjct: 314  AGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAAS 373

Query: 482  XXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCSDKE--------------GK---- 357
                      Q H+KH  I  GS+KEFNFDN  G  S K               GK    
Sbjct: 374  EKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGP 433

Query: 356  --NWSFFPMMQPGVS 318
              NW+FFP++QPG+S
Sbjct: 434  QTNWTFFPLLQPGIS 448


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  297 bits (760), Expect = 1e-77
 Identities = 196/476 (41%), Positives = 230/476 (48%), Gaps = 115/476 (24%)
 Frame = -3

Query: 1400 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1221
            P   QK+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++        
Sbjct: 27   PTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILP 85

Query: 1220 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 1041
                              ATQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+P
Sbjct: 86   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTP 145

Query: 1040 PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 891
            PVFS  TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 146  PVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHY 205

Query: 890  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN 711
            EFQSYQ+YPGSP G L              PFPD   +      LEFR G  P+LL   N
Sbjct: 206  EFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFEN 259

Query: 710  --THDWGSRLGSGSL--------------------------------TPDA--------- 660
              T  WGSRLGSGSL                                TPD          
Sbjct: 260  FTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 319

Query: 659  --------------PSN-----EIVVDHRVSFELTPENIVRCVE---------------- 585
                          P+N     E +VDHRVSFEL+ E++  C+E                
Sbjct: 320  LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD 379

Query: 584  ------KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRS 429
                  KE +G+ + + +S +    ET   T E                     +QKHRS
Sbjct: 380  LVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRS 429

Query: 428  ITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 318
            +TLGS+KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 430  VTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  293 bits (751), Expect = 1e-76
 Identities = 194/471 (41%), Positives = 228/471 (48%), Gaps = 115/471 (24%)
 Frame = -3

Query: 1385 KRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXX 1206
            K+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++             
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILPFIAPP 94

Query: 1205 XXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFST 1026
                         ATQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+PPVFS 
Sbjct: 95   SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSA 154

Query: 1025 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQYEFQSY 876
             TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S YEFQSY
Sbjct: 155  LTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSY 214

Query: 875  QLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN--THD 702
            Q+YPGSP G L              PFPD   +      LEFR G  P+LL   N  T  
Sbjct: 215  QIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFENFTTRK 268

Query: 701  WGSRLGSGSL--------------------------------TPDA-------------- 660
            WGSRLGSGSL                                TPD               
Sbjct: 269  WGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQ 328

Query: 659  ---------PSN-----EIVVDHRVSFELTPENIVRCVE--------------------- 585
                     P+N     E +VDHRVSFEL+ E++  C+E                     
Sbjct: 329  ISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEG 388

Query: 584  -KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGS 414
             KE +G+ + + +S +    ET   T E                     +QKHRS+TLGS
Sbjct: 389  RKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRSVTLGS 438

Query: 413  VKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 318
            +KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 439  IKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  292 bits (747), Expect = 4e-76
 Identities = 196/478 (41%), Positives = 229/478 (47%), Gaps = 117/478 (24%)
 Frame = -3

Query: 1400 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1221
            P   QKRRWG CWS+YWCFGSHK TKRIGHAVL PE    G     S  + +Q  +    
Sbjct: 41   PTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGA-VVTSAENQSQSTAITVP 98

Query: 1220 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 1041
                              ATQSPAGLLSLTS+S N YSPGGP SIFAIGPYAHETQLV+P
Sbjct: 99   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTP 158

Query: 1040 PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 891
            P FS FTTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 159  PAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHY 218

Query: 890  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLN--K 717
            EFQSY LYPGSP GQL              PFPD   +      LEFR G  P+LL    
Sbjct: 219  EFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPI------LEFRMGEAPKLLGFEH 272

Query: 716  LNTHDWGSRLGS------------------------------------------------ 681
              T  WGSRLGS                                                
Sbjct: 273  FTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGS 332

Query: 680  GSLTPDA----------------------------PSNEIVVDHRVSFELTPENIVRCVE 585
            GSLTPDA                             ++E +VDHRVSFEL+ E + RC+E
Sbjct: 333  GSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLE 392

Query: 584  KEPEGLARAVSASLQNH------ETGKV--TKESLXXXXXXXXXXXXXXXNRQH---HQK 438
             +     RA S    +       ++GK+  T E+L                 +    ++K
Sbjct: 393  SKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRK 452

Query: 437  HRSITLGSVKEFNFDNAD------------------GGCSDKEGKNWSFFPMMQPGVS 318
            HRSITLGS+KEFNFDN+                    G   +   NW+FFP++QP VS
Sbjct: 453  HRSITLGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  289 bits (740), Expect = 2e-75
 Identities = 188/406 (46%), Positives = 218/406 (53%), Gaps = 49/406 (12%)
 Frame = -3

Query: 1388 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1209
            QKRRW S WS+YWCF  +   KRIGHAVLV ET ++ T    +     Q PS        
Sbjct: 36   QKRRWRSFWSLYWCFRPNNN-KRIGHAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAP 94

Query: 1208 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1029
                          +TQSP GLLSL+S S N+YSP GP SIFAIGPYAHETQLVSPPVFS
Sbjct: 95   PSSPASFIPSEPPSSTQSPTGLLSLSSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFS 154

Query: 1028 TFTTEPSTAPFTPPPE-SVHLTTPSSPEVPFAQLLDPNHRRFPYSQYEFQSYQLYPGSPV 852
            TFTTEPSTAP+TPPPE S HLTTPSSPEVPFA+LL+PN +R+P SQYEFQSYQL PGSPV
Sbjct: 155  TFTTEPSTAPYTPPPEFSAHLTTPSSPEVPFARLLEPN-QRYPLSQYEFQSYQLQPGSPV 213

Query: 851  GQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSL 672
              L              PF D +  +  P FLEF  G PP+         W S   SG +
Sbjct: 214  SHLISPCSGISGSGASSPFLDRDFAAVHPFFLEFGGGNPPR------RDQWESCQESGVV 267

Query: 671  TP-DA----------------------PSN-------EIVVDHRVSFELTPENIVRCVEK 582
            TP DA                      P N          +DHRVSFE+T E ++RCVEK
Sbjct: 268  TPTDAVGPRSRDSCVLLNRQNSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEK 327

Query: 581  EPEGLARAVSASLQNHETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEF 402
            +        S        GK   E +               N + HQK+R+ITLGS KEF
Sbjct: 328  K--------SLETAQESVGKKPIELI-----NREEDQTEIVNEKRHQKNRTITLGSTKEF 374

Query: 401  NFD--NADGGCSD------------KEG----KNWSFFPMMQPGVS 318
            NF+  N D  C D            KEG    +NWSFFP++QPGVS
Sbjct: 375  NFEGGNCDEPCVDSSEWWVNEKKVPKEGGGSSENWSFFPILQPGVS 420

