BLASTX nr result

ID: Paeonia23_contig00010079 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00010079
         (1736 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   393   e-106
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   391   e-106
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     365   3e-98
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   362   3e-97
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   361   6e-97
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   360   1e-96
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              355   4e-95
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   352   2e-94
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   340   1e-90
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   338   6e-90
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   307   1e-80
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   297   9e-78
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   297   1e-77
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   293   1e-76
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   292   4e-76
gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   289   2e-75

>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  393 bits (1009), Expect = e-106
 Identities = 238/481 (49%), Positives = 269/481 (55%), Gaps = 88/481 (18%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 1269
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAVL PE+   G+    +E ++TQ P+                      ATQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L              P
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 746  FPDPESVSHG-PHFLEFRTGGPPQL--LNKLNTHDWGSRLGSGSLTPDA----------- 609
            FPD + V  G   FLEFR GGPP+L  L+KL+ H+WGSR+GSGS+TPDA           
Sbjct: 239  FPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVL 298

Query: 608  -----------------------------------PSNEIVVDHRVSFELTPENIVRCVE 534
                                               P+NEI+VDHRVSFELT E++VRCVE
Sbjct: 299  DRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVE 358

Query: 533  KEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXXXXXXXXNRQHH 393
            K+   L +AVSASLQN  T ++ + S                               Q H
Sbjct: 359  KDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPH 418

Query: 392  QKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGV 270
             K RSITLGS KEFNFDNADGG SDK                     KNWS F MMQP V
Sbjct: 419  HKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSV 478

Query: 269  S 267
            S
Sbjct: 479  S 479


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  391 bits (1004), Expect = e-106
 Identities = 238/459 (51%), Positives = 261/459 (56%), Gaps = 66/459 (14%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1269
            MRR+NGESR  N+                 R P    QKRRWGS WS+YWCFG  +  KR
Sbjct: 1    MRRVNGESRTGNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKR 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAVLVPET   G D   +E  + Q PS                      ATQSPAG  
Sbjct: 61   IGHAVLVPETTDRGGDAPRAENPI-QTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFF 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            SLT   A+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLT---ASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 176

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQLLDP+ R      RFP S YEFQSYQLYPGSPVGQL              P
Sbjct: 177  SSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSP 236

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGSRLGSGSLTPDAP----------- 606
            FPD E  + G HFLEFRTG PP+LLN   L+T DWGSRLGSGS+TPD             
Sbjct: 237  FPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLK 296

Query: 605  -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHE- 480
                             +N+I ++HRVSFEL+ E ++RCVEK+P  LA AVS SL++ E 
Sbjct: 297  PQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEK 356

Query: 479  ------TGKVTKESL----XXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADG 330
                    KV   S+                     Q H K RSITLGSVKEFNFDN DG
Sbjct: 357  AQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDG 416

Query: 329  GCSD---------------KEG---KNWSFFPMMQPGVS 267
            G S                KE    KNWSFFPMMQPGVS
Sbjct: 417  GDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  365 bits (938), Expect = 3e-98
 Identities = 224/449 (49%), Positives = 254/449 (56%), Gaps = 61/449 (13%)
 Frame = -1

Query: 1430 GESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKRIGHAV 1254
            G+SR +N+                 R P    +KRRWG C SIYWCFG+ K   RIGH V
Sbjct: 8    GDSRTMNNALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGV 67

Query: 1253 LVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLLSLTSI 1074
            LVPET   G     +E S TQ  +                      ATQSPAGLLSLTS+
Sbjct: 68   LVPETAQPGNSAPRAENS-TQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSV 126

Query: 1073 SANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 894
            SA+MYSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV
Sbjct: 127  SASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 186

Query: 893  PFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPE 732
            PFAQLLDPN       +RFP    EFQSY   PGSP+GQL              PFPDPE
Sbjct: 187  PFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPE 246

Query: 731  SVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPD----------AP------ 606
              + GPHFLEFRTG PP+LLN  KL+  DWGSR GSGSLTPD          AP      
Sbjct: 247  FAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPISTFEVAPHLKPNG 306

Query: 605  ---SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASL---------QNHETGKVTK 462
               + E V D RVSF+++ E+++R VEK+   LA A+  SL         +N ++ KV +
Sbjct: 307  RCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEE 366

Query: 461  ESLXXXXXXXXXXXXXXXNRQ-----HHQKHRSITLGSVKEFNFDNADGG---------- 327
                                       HQKHRSITLGS KEFNFDNAD G          
Sbjct: 367  IGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSD 426

Query: 326  ------CSDKEG---KNWSFFPMMQPGVS 267
                   + KEG   +NWSFFPM+QPGVS
Sbjct: 427  WWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  362 bits (929), Expect = 3e-97
 Identities = 227/463 (49%), Positives = 255/463 (55%), Gaps = 70/463 (15%)
 Frame = -1

Query: 1445 MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXTR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 1272
            MR +NG +SRA+N+                 R      QKRRWG CWSI WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRK 60

Query: 1271 RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGL 1092
            RIGHAVLVPE PT          + TQ  +                      ATQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 1091 LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 912
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 911  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 750
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 749  PFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 600
            PFPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQ 299

Query: 599  -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 477
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 476  GKVTKESLXXXXXXXXXXXXXXXNRQ-------------HHQKHRSITLGSVKEFNFDNA 336
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 335  DG------------------GCSDKEGKNWSFFPMMQ--PGVS 267
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  361 bits (926), Expect = 6e-97
 Identities = 226/463 (48%), Positives = 255/463 (55%), Gaps = 70/463 (15%)
 Frame = -1

Query: 1445 MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXTR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 1272
            MR +NG +SRA+N+                 R      QKRRWG CW+I WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRK 60

Query: 1271 RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGL 1092
            RIGHAVLVPE PT          + TQ  +                      ATQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 1091 LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 912
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 911  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 750
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 749  PFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 600
            PFPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQ 299

Query: 599  -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 477
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 476  GKVTKESLXXXXXXXXXXXXXXXNRQ-------------HHQKHRSITLGSVKEFNFDNA 336
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 335  DG------------------GCSDKEGKNWSFFPMMQ--PGVS 267
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  360 bits (924), Expect = 1e-96
 Identities = 213/457 (46%), Positives = 252/457 (55%), Gaps = 65/457 (14%)
 Frame = -1

Query: 1445 MRRLNG--ESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQT 1275
            MR +NG  +SR  N+                 R P    QKRRWGSCWS+YWCFG H+  
Sbjct: 2    MRNVNGGADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRHR 61

Query: 1274 KRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAG 1095
            KRIGHAVLVPE    G D+  +E   TQ P+                      A+QSPAG
Sbjct: 62   KRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAG 121

Query: 1094 LLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLT 915
            +LSLTS+SA+MYSP GP SIFAIGPYAHETQLVSPP FSTFTTEPSTAPFTPPPESV LT
Sbjct: 122  ILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLT 181

Query: 914  TPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXX 753
            TPSSPEVPFAQLL+P++R      RFP+S YEFQSYQ YPGSPVGQL             
Sbjct: 182  TPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTS 241

Query: 752  XPFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP--------- 606
             PFPD E  + GP FLEF+   PP+LLN  KL+ H+ GSR GSG+LTPDA          
Sbjct: 242  SPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSFPL 301

Query: 605  -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNH-E 480
                              ++ V D RVSF+L+ E+ +R  E +P    + +  S++N   
Sbjct: 302  DRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIA 361

Query: 479  TGKVTKESLXXXXXXXXXXXXXXXNRQ----------HHQKHRSITLGSVKEFNFDNADG 330
              KV K S                  +           HQKHR++TLG+ KEFNFDNADG
Sbjct: 362  AEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG 421

Query: 329  -----------------GCSDKEGKNWSFFPMMQPGV 270
                             G  D   KNWSFFP+MQP +
Sbjct: 422  VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  355 bits (911), Expect = 4e-95
 Identities = 221/432 (51%), Positives = 245/432 (56%), Gaps = 39/432 (9%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 1269
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAVL PE+   G+    +E ++TQ P+                      ATQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L              P
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPSNEIVVDHRVSFE 567
            FPD  S S  P  L     GPP            SR GS       P+NEI+VDHRVSFE
Sbjct: 239  FPD-RSGSITPDAL-----GPP------------SRDGSVLDHSGCPNNEIMVDHRVSFE 280

Query: 566  LTPENIVRCVEKEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXX 426
            LT E++VRCVEK+   L +AVSASLQN  T ++ + S                       
Sbjct: 281  LTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAP 340

Query: 425  XXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKN 303
                    Q H K RSITLGS KEFNFDNADGG SDK                     KN
Sbjct: 341  EDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKN 400

Query: 302  WSFFPMMQPGVS 267
            WS F MMQP VS
Sbjct: 401  WSIFHMMQPSVS 412


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  352 bits (904), Expect = 2e-94
 Identities = 225/460 (48%), Positives = 253/460 (55%), Gaps = 68/460 (14%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1269
            MR  NGES A+N+                 R P    QKRRWG CWSIYWCFGS+KQ KR
Sbjct: 1    MRGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKR 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IG AVL  ET  +G +   +E   TQ P+                      ATQSPAGL+
Sbjct: 61   IGPAVLTSETSFSGANVPAAENP-TQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLV 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            SLTSISA+MYSPG P SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 908  SSPEVPFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQLL PN       +RFP S YEFQSYQL+PGSPVGQL              P
Sbjct: 179  SSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSP 238

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSNEIVVD 585
            F D E  +   HF EFR G PP+LLN  K ++ +WGS  GSG+LTPDA    P N  ++D
Sbjct: 239  FRDGEFAA-SLHFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLD 297

Query: 584  -------------------------HRVSFELTPENIVRCVEKEPEGLARAVSASLQ--- 489
                                     HRVSFELT E +VR +E E    + AVS SLQ   
Sbjct: 298  HQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEA 357

Query: 488  -----NHETGKVTKESLXXXXXXXXXXXXXXXNRQ---HHQKHRSITLGSVKEFNFDNAD 333
                  H+T  V                    +R+    H KH+SITLGS KEFNFDN D
Sbjct: 358  TRESEEHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVD 417

Query: 332  GGCSDKE-------------------GKNWSFFPMMQPGV 270
            GG + K                     +NWSFFPMMQPGV
Sbjct: 418  GGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  352 bits (903), Expect = 3e-94
 Identities = 222/456 (48%), Positives = 246/456 (53%), Gaps = 63/456 (13%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQYQKRRWGSCWSIYWCFGSHKQTKRI 1266
            MR  NGESRA N+                 R P    +RRWGSCWSIY CFG  K  K+I
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQRRWGSCWSIYLCFGYQKHKKQI 60

Query: 1265 GHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLLS 1086
            GHAVL PE    G     SE   TQ P+                       TQSPAGL+S
Sbjct: 61   GHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVS 119

Query: 1085 LTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 906
            LTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS
Sbjct: 120  LTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 179

Query: 905  SPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPF 744
            SPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL              PF
Sbjct: 180  SPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPF 236

Query: 743  PDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP--------------- 615
            PD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP               
Sbjct: 237  PDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHRQ 296

Query: 614  --DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGKV 468
              D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K 
Sbjct: 297  FSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAKE 356

Query: 467  TKES-----------LXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCS 321
             K S                               H+K +SITLGSVKEFNFDNAD G S
Sbjct: 357  EKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDS 416

Query: 320  ---------------DKEG---KNWSFFPMMQPGVS 267
                            KEG   KNWSFFPM+Q GVS
Sbjct: 417  RKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  352 bits (903), Expect = 3e-94
 Identities = 224/457 (49%), Positives = 247/457 (54%), Gaps = 64/457 (14%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 1269
            MR  NGESRA N+                 R P    QKRRWGSCWSIY CFG  K  K+
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQ 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAVL PE    G     SE   TQ P+                       TQSPAGL+
Sbjct: 61   IGHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            SLTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL              P
Sbjct: 180  SSPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSP 236

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP-------------- 615
            FPD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP              
Sbjct: 237  FPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHR 296

Query: 614  ---DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 471
               D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K
Sbjct: 297  QFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAK 356

Query: 470  VTKES-----------LXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGC 324
              K S                               H+K +SITLGSVKEFNFDNAD G 
Sbjct: 357  EEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGD 416

Query: 323  S---------------DKEG---KNWSFFPMMQPGVS 267
            S                KEG   KNWSFFPM+Q GVS
Sbjct: 417  SRKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  340 bits (872), Expect = 1e-90
 Identities = 212/456 (46%), Positives = 241/456 (52%), Gaps = 63/456 (13%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1269
            M R+NGE R V+S                 R P    QKRRWG CWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETISAAATAIASVENRVPQASIQKRRWGGCWSMYWCFGSQKQTKR 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAV +PET  +G D   S  S +Q PS                      AT SP G  
Sbjct: 61   IGHAVFIPETTASGADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L              P
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 603
            F D E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLDREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQ 287

Query: 602  ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 471
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 470  VTKESLXXXXXXXXXXXXXXXNR------------QHHQKHRSITLGSVKEFNFDNADGG 327
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 326  CSDKE--GKNW--------------SFFPMMQPGVS 267
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  338 bits (866), Expect = 6e-90
 Identities = 211/456 (46%), Positives = 241/456 (52%), Gaps = 63/456 (13%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1269
            M R+NGE R V+S                 R P    QKRRWGSCWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETINAAATAIASVENRVPQASIQKRRWGSCWSMYWCFGSQKQTKR 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAV +PET  +  D   S  S +Q PS                      AT SP G  
Sbjct: 61   IGHAVFIPETTASAADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L              P
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 603
            F + E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLEREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQ 287

Query: 602  ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 471
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 470  VTKESLXXXXXXXXXXXXXXXNR------------QHHQKHRSITLGSVKEFNFDNADGG 327
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 326  CSDKE--GKNW--------------SFFPMMQPGVS 267
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  329 bits (844), Expect = 2e-87
 Identities = 207/428 (48%), Positives = 233/428 (54%), Gaps = 71/428 (16%)
 Frame = -1

Query: 1337 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1158
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 60

Query: 1157 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 978
                          A QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 61   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 116

Query: 977  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 816
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 117  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 176

Query: 815  PGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 642
            PGSPVGQL              PF D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 177  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 236

Query: 641  RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 546
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 237  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 545  RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 414
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 297  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 354

Query: 413  XXNRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 291
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 355  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 414

Query: 290  PMMQPGVS 267
            PM+QPG+S
Sbjct: 415  PMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  329 bits (844), Expect = 2e-87
 Identities = 207/428 (48%), Positives = 233/428 (54%), Gaps = 71/428 (16%)
 Frame = -1

Query: 1337 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1158
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 39   QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 97

Query: 1157 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 978
                          A QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 98   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 153

Query: 977  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 816
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 154  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 213

Query: 815  PGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 642
            PGSPVGQL              PF D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 214  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 273

Query: 641  RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 546
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 274  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 333

Query: 545  RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 414
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 334  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 391

Query: 413  XXNRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 291
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 392  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 451

Query: 290  PMMQPGVS 267
            PM+QPG+S
Sbjct: 452  PMIQPGMS 459


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  307 bits (786), Expect = 1e-80
 Identities = 195/436 (44%), Positives = 229/436 (52%), Gaps = 46/436 (10%)
 Frame = -1

Query: 1445 MRRLNGESRAVNSXXXXXXXXXXXXXXXXTRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 1269
            MR +NGESRA N+                 R P    QK+RW S WSIYWCFG  K  ++
Sbjct: 1    MRDVNGESRAANNTLETINAAATAIASAENRVPQAMVQKQRWRSHWSIYWCFGYQKSKRQ 60

Query: 1268 IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXATQSPAGLL 1089
            IGHAVL PE+   G+    +E S  Q P                        TQSPAGL+
Sbjct: 61   IGHAVLFPESSAPGSGAPAAENS-AQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 1088 SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 909
            S TSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 908  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXP 747
            SSPEVPFAQL+DP  R      RFP+   +FQSYQ +PGS VGQL              P
Sbjct: 180  SSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHPGSSVGQLISPSSGISGSGTSSP 236

Query: 746  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP----------- 606
            FPD E    GPH  EFR G  P+LLN  KL+T +WGS   SG+LTPD+            
Sbjct: 237  FPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSYQDSGALTPDSVRHGSPNFLLHR 294

Query: 605  ---------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 471
                            ++ VV+HR SFEL+ ++  RCVE++P    + V   ++N    K
Sbjct: 295  QFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRCVEEKPACSIKTVPEYVENGTKAK 354

Query: 470  VT----------KESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCS 321
                        +                      H+K + ITLGSV EFNFDNAD G S
Sbjct: 355  EEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQHRKQQPITLGSVNEFNFDNADEGDS 414

Query: 320  -DKEGKNWSFFPMMQP 276
             +    NW   P   P
Sbjct: 415  HNPSSSNWVKQPRTGP 430


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  297 bits (761), Expect = 9e-78
 Identities = 188/435 (43%), Positives = 220/435 (50%), Gaps = 74/435 (17%)
 Frame = -1

Query: 1349 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1170
            P   QKRRWGSC S+YWCFGSH+ +KRIGHAVLVPE    G     SE ++    S    
Sbjct: 27   PTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASE-NLNLSTSIVLP 85

Query: 1169 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 990
                              +TQSPAG LSLT++S N YSP GP S+FAIGPYAHETQLVSP
Sbjct: 86   FIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSP 145

Query: 989  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 840
            PVFSTF TEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++   S Y
Sbjct: 146  PVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNY 205

Query: 839  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN 660
            EFQ YQLYP SPVG L              PFPD   +   P  L F            +
Sbjct: 206  EFQPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPIVEAPKLLGF---------EHFS 253

Query: 659  THDWGSRLGSGSLTPD----------------------------APSNEIVVDHRVSFEL 564
            T  WGSRLGSGSLTPD                            + + E V+DHRVSFEL
Sbjct: 254  TRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFEL 313

Query: 563  TPENIVRCVEKEPEGLARAVSASLQN-HETGKVTKES---------------LXXXXXXX 432
              E++  CVEK+P   A  V  +LQ+  E G++ +E                        
Sbjct: 314  AGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAAS 373

Query: 431  XXXXXXXXNRQHHQKHRSITLGSVKEFNFDNADGGCSDKE--------------GK---- 306
                      Q H+KH  I  GS+KEFNFDN  G  S K               GK    
Sbjct: 374  EKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGP 433

Query: 305  --NWSFFPMMQPGVS 267
              NW+FFP++QPG+S
Sbjct: 434  QTNWTFFPLLQPGIS 448


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  297 bits (760), Expect = 1e-77
 Identities = 196/476 (41%), Positives = 230/476 (48%), Gaps = 115/476 (24%)
 Frame = -1

Query: 1349 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1170
            P   QK+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++        
Sbjct: 27   PTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILP 85

Query: 1169 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 990
                              ATQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+P
Sbjct: 86   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTP 145

Query: 989  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 840
            PVFS  TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 146  PVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHY 205

Query: 839  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN 660
            EFQSYQ+YPGSP G L              PFPD   +      LEFR G  P+LL   N
Sbjct: 206  EFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFEN 259

Query: 659  --THDWGSRLGSGSL--------------------------------TPDA--------- 609
              T  WGSRLGSGSL                                TPD          
Sbjct: 260  FTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 319

Query: 608  --------------PSN-----EIVVDHRVSFELTPENIVRCVE---------------- 534
                          P+N     E +VDHRVSFEL+ E++  C+E                
Sbjct: 320  LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD 379

Query: 533  ------KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRS 378
                  KE +G+ + + +S +    ET   T E                     +QKHRS
Sbjct: 380  LVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRS 429

Query: 377  ITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 267
            +TLGS+KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 430  VTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  293 bits (751), Expect = 1e-76
 Identities = 194/471 (41%), Positives = 228/471 (48%), Gaps = 115/471 (24%)
 Frame = -1

Query: 1334 KRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXX 1155
            K+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++             
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILPFIAPP 94

Query: 1154 XXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFST 975
                         ATQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+PPVFS 
Sbjct: 95   SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSA 154

Query: 974  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQYEFQSY 825
             TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S YEFQSY
Sbjct: 155  LTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSY 214

Query: 824  QLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLN--THD 651
            Q+YPGSP G L              PFPD   +      LEFR G  P+LL   N  T  
Sbjct: 215  QIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFENFTTRK 268

Query: 650  WGSRLGSGSL--------------------------------TPDA-------------- 609
            WGSRLGSGSL                                TPD               
Sbjct: 269  WGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQ 328

Query: 608  ---------PSN-----EIVVDHRVSFELTPENIVRCVE--------------------- 534
                     P+N     E +VDHRVSFEL+ E++  C+E                     
Sbjct: 329  ISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEG 388

Query: 533  -KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGS 363
             KE +G+ + + +S +    ET   T E                     +QKHRS+TLGS
Sbjct: 389  RKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRSVTLGS 438

Query: 362  VKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 267
            +KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 439  IKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  292 bits (747), Expect = 4e-76
 Identities = 196/478 (41%), Positives = 229/478 (47%), Gaps = 117/478 (24%)
 Frame = -1

Query: 1349 PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 1170
            P   QKRRWG CWS+YWCFGSHK TKRIGHAVL PE    G     S  + +Q  +    
Sbjct: 41   PTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGA-VVTSAENQSQSTAITVP 98

Query: 1169 XXXXXXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 990
                              ATQSPAGLLSLTS+S N YSPGGP SIFAIGPYAHETQLV+P
Sbjct: 99   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTP 158

Query: 989  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 840
            P FS FTTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 159  PAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHY 218

Query: 839  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLN--K 666
            EFQSY LYPGSP GQL              PFPD   +      LEFR G  P+LL    
Sbjct: 219  EFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPI------LEFRMGEAPKLLGFEH 272

Query: 665  LNTHDWGSRLGS------------------------------------------------ 630
              T  WGSRLGS                                                
Sbjct: 273  FTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGS 332

Query: 629  GSLTPDA----------------------------PSNEIVVDHRVSFELTPENIVRCVE 534
            GSLTPDA                             ++E +VDHRVSFEL+ E + RC+E
Sbjct: 333  GSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLE 392

Query: 533  KEPEGLARAVSASLQNH------ETGKV--TKESLXXXXXXXXXXXXXXXNRQH---HQK 387
             +     RA S    +       ++GK+  T E+L                 +    ++K
Sbjct: 393  SKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRK 452

Query: 386  HRSITLGSVKEFNFDNAD------------------GGCSDKEGKNWSFFPMMQPGVS 267
            HRSITLGS+KEFNFDN+                    G   +   NW+FFP++QP VS
Sbjct: 453  HRSITLGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  289 bits (740), Expect = 2e-75
 Identities = 188/406 (46%), Positives = 218/406 (53%), Gaps = 49/406 (12%)
 Frame = -1

Query: 1337 QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 1158
            QKRRW S WS+YWCF  +   KRIGHAVLV ET ++ T    +     Q PS        
Sbjct: 36   QKRRWRSFWSLYWCFRPNNN-KRIGHAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAP 94

Query: 1157 XXXXXXXXXXXXXXATQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 978
                          +TQSP GLLSL+S S N+YSP GP SIFAIGPYAHETQLVSPPVFS
Sbjct: 95   PSSPASFIPSEPPSSTQSPTGLLSLSSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFS 154

Query: 977  TFTTEPSTAPFTPPPE-SVHLTTPSSPEVPFAQLLDPNHRRFPYSQYEFQSYQLYPGSPV 801
            TFTTEPSTAP+TPPPE S HLTTPSSPEVPFA+LL+PN +R+P SQYEFQSYQL PGSPV
Sbjct: 155  TFTTEPSTAPYTPPPEFSAHLTTPSSPEVPFARLLEPN-QRYPLSQYEFQSYQLQPGSPV 213

Query: 800  GQLXXXXXXXXXXXXXXPFPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSL 621
              L              PF D +  +  P FLEF  G PP+         W S   SG +
Sbjct: 214  SHLISPCSGISGSGASSPFLDRDFAAVHPFFLEFGGGNPPR------RDQWESCQESGVV 267

Query: 620  TP-DA----------------------PSN-------EIVVDHRVSFELTPENIVRCVEK 531
            TP DA                      P N          +DHRVSFE+T E ++RCVEK
Sbjct: 268  TPTDAVGPRSRDSCVLLNRQNSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEK 327

Query: 530  EPEGLARAVSASLQNHETGKVTKESLXXXXXXXXXXXXXXXNRQHHQKHRSITLGSVKEF 351
            +        S        GK   E +               N + HQK+R+ITLGS KEF
Sbjct: 328  K--------SLETAQESVGKKPIELI-----NREEDQTEIVNEKRHQKNRTITLGSTKEF 374

Query: 350  NFD--NADGGCSD------------KEG----KNWSFFPMMQPGVS 267
            NF+  N D  C D            KEG    +NWSFFP++QPGVS
Sbjct: 375  NFEGGNCDEPCVDSSEWWVNEKKVPKEGGGSSENWSFFPILQPGVS 420


Top