BLASTX nr result

ID: Paeonia25_contig00026120 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00026120
         (1539 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   393   e-106
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   391   e-106
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     365   2e-98
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   362   2e-97
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   361   6e-97
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   360   9e-97
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              355   3e-95
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   352   2e-94
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   352   3e-94
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   340   1e-90
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   338   5e-90
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   329   2e-87
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   307   9e-81
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   297   8e-78
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   297   1e-77
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   293   1e-76
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   292   3e-76
gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   289   2e-75

>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  393 bits (1009), Expect = e-106
 Identities = 236/481 (49%), Positives = 267/481 (55%), Gaps = 88/481 (18%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 431
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAVL PE+   G+    +E ++TQ P+                       TQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L               
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 954  FPDPESVSHG-PHFLEFRTGGPPQL--LNKLNTHDWGSRLGSGSLTPDA----------- 1091
            FPD + V  G   FLEFR GGPP+L  L+KL+ H+WGSR+GSGS+TPDA           
Sbjct: 239  FPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVL 298

Query: 1092 -----------------------------------PSNEIVVDHRVSFELTPENIVRCVE 1166
                                               P+NEI+VDHRVSFELT E++VRCVE
Sbjct: 299  DRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVE 358

Query: 1167 KEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXXXXXXXXXRQHH 1307
            K+   L +AVSASLQN  T ++ + S                               Q H
Sbjct: 359  KDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPH 418

Query: 1308 QKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGV 1430
             K RSITLGS KEFNFDNADGG SDK                     KNWS F MMQP V
Sbjct: 419  HKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSV 478

Query: 1431 S 1433
            S
Sbjct: 479  S 479


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  391 bits (1004), Expect = e-106
 Identities = 236/459 (51%), Positives = 259/459 (56%), Gaps = 66/459 (14%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 431
            MRR+NGESR  N+                 R P    QKRRWGS WS+YWCFG  +  KR
Sbjct: 1    MRRVNGESRTGNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKR 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAVLVPET   G D   +E  + Q PS                       TQSPAG  
Sbjct: 61   IGHAVLVPETTDRGGDAPRAENPI-QTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFF 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            SLT   A+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLT---ASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 176

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQLLDP+ R      RFP S YEFQSYQLYPGSPVGQL               
Sbjct: 177  SSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSP 236

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGSRLGSGSLTPDAP----------- 1094
            FPD E  + G HFLEFRTG PP+LLN   L+T DWGSRLGSGS+TPD             
Sbjct: 237  FPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLK 296

Query: 1095 -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHE- 1220
                             +N+I ++HRVSFEL+ E ++RCVEK+P  LA AVS SL++ E 
Sbjct: 297  PQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEK 356

Query: 1221 ------TGKVTKESL----XXXXXXXXXXXXXXXXRQHHQKHRSITLGSVKEFNFDNADG 1370
                    KV   S+                     Q H K RSITLGSVKEFNFDN DG
Sbjct: 357  AQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDG 416

Query: 1371 GCSD---------------KEG---KNWSFFPMMQPGVS 1433
            G S                KE    KNWSFFPMMQPGVS
Sbjct: 417  GDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  365 bits (938), Expect = 2e-98
 Identities = 222/449 (49%), Positives = 252/449 (56%), Gaps = 61/449 (13%)
 Frame = +3

Query: 270  GESRAVNSXXXXXXXXXXXXXXXXXRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKRIGHAV 446
            G+SR +N+                 R P    +KRRWG C SIYWCFG+ K   RIGH V
Sbjct: 8    GDSRTMNNALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGV 67

Query: 447  LVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLLSLTSI 626
            LVPET   G     +E S TQ  +                       TQSPAGLLSLTS+
Sbjct: 68   LVPETAQPGNSAPRAENS-TQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSV 126

Query: 627  SANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 806
            SA+MYSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV
Sbjct: 127  SASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEV 186

Query: 807  PFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXXFPDPE 968
            PFAQLLDPN       +RFP    EFQSY   PGSP+GQL               FPDPE
Sbjct: 187  PFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPE 246

Query: 969  SVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPD----------AP------ 1094
              + GPHFLEFRTG PP+LLN  KL+  DWGSR GSGSLTPD          AP      
Sbjct: 247  FAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPISTFEVAPHLKPNG 306

Query: 1095 ---SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASL---------QNHETGKVTK 1238
               + E V D RVSF+++ E+++R VEK+   LA A+  SL         +N ++ KV +
Sbjct: 307  RCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEE 366

Query: 1239 ESLXXXXXXXXXXXXXXXXRQ-----HHQKHRSITLGSVKEFNFDNADGG---------- 1373
                                       HQKHRSITLGS KEFNFDNAD G          
Sbjct: 367  IGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSD 426

Query: 1374 ------CSDKEG---KNWSFFPMMQPGVS 1433
                   + KEG   +NWSFFPM+QPGVS
Sbjct: 427  WWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  362 bits (929), Expect = 2e-97
 Identities = 225/463 (48%), Positives = 253/463 (54%), Gaps = 70/463 (15%)
 Frame = +3

Query: 255  MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXXR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 428
            MR +NG +SRA+N+                 R      QKRRWG CWSI WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRK 60

Query: 429  RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGL 608
            RIGHAVLVPE PT          + TQ  +                       TQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 609  LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 788
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 789  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 950
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 951  XFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 1100
             FPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQ 299

Query: 1101 -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 1223
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 1224 GKVTKESLXXXXXXXXXXXXXXXXRQ-------------HHQKHRSITLGSVKEFNFDNA 1364
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 1365 DG------------------GCSDKEGKNWSFFPMMQ--PGVS 1433
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  361 bits (926), Expect = 6e-97
 Identities = 224/463 (48%), Positives = 253/463 (54%), Gaps = 70/463 (15%)
 Frame = +3

Query: 255  MRRLNG-ESRAVNSXXXXXXXXXXXXXXXXXR-GPPQYQKRRWGSCWSIYWCFGSHKQTK 428
            MR +NG +SRA+N+                 R      QKRRWG CW+I WCFG  K  K
Sbjct: 1    MRGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRK 60

Query: 429  RIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGL 608
            RIGHAVLVPE PT          + TQ  +                       TQSPAGL
Sbjct: 61   RIGHAVLVPE-PTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGL 119

Query: 609  LSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 788
            +SL SIS NMYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT
Sbjct: 120  VSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 179

Query: 789  PSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXX 950
            PSSPEVPFAQLLDP+ R      +FP+S YEFQSY L+PGSPVG L              
Sbjct: 180  PSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSS 239

Query: 951  XFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSN---- 1100
             FPD E  + GP F +F  G PP+LLN  KL+  +WGSR GSG+LTPDA    P N    
Sbjct: 240  PFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQ 299

Query: 1101 -------------------EIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHET 1223
                               + +VDHRVSFELT E++VRCVEK+P  LA AVS SLQN  T
Sbjct: 300  NRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTT 359

Query: 1224 GKVTKESLXXXXXXXXXXXXXXXXRQ-------------HHQKHRSITLGSVKEFNFDNA 1364
              V KE                                  HQK +SITLGS KEFNFD+A
Sbjct: 360  --VEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSA 417

Query: 1365 DG------------------GCSDKEGKNWSFFPMMQ--PGVS 1433
            DG                  G      KNW+FFP++Q  PGVS
Sbjct: 418  DGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  360 bits (924), Expect = 9e-97
 Identities = 211/457 (46%), Positives = 250/457 (54%), Gaps = 65/457 (14%)
 Frame = +3

Query: 255  MRRLNG--ESRAVNSXXXXXXXXXXXXXXXXXRGPPQ-YQKRRWGSCWSIYWCFGSHKQT 425
            MR +NG  +SR  N+                 R P    QKRRWGSCWS+YWCFG H+  
Sbjct: 2    MRNVNGGADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRHR 61

Query: 426  KRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAG 605
            KRIGHAVLVPE    G D+  +E   TQ P+                       +QSPAG
Sbjct: 62   KRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAG 121

Query: 606  LLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLT 785
            +LSLTS+SA+MYSP GP SIFAIGPYAHETQLVSPP FSTFTTEPSTAPFTPPPESV LT
Sbjct: 122  ILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLT 181

Query: 786  TPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXX 947
            TPSSPEVPFAQLL+P++R      RFP+S YEFQSYQ YPGSPVGQL             
Sbjct: 182  TPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTS 241

Query: 948  XXFPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP--------- 1094
              FPD E  + GP FLEF+   PP+LLN  KL+ H+ GSR GSG+LTPDA          
Sbjct: 242  SPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSFPL 301

Query: 1095 -----------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNH-E 1220
                              ++ V D RVSF+L+ E+ +R  E +P    + +  S++N   
Sbjct: 302  DRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIA 361

Query: 1221 TGKVTKESLXXXXXXXXXXXXXXXXRQ----------HHQKHRSITLGSVKEFNFDNADG 1370
              KV K S                  +           HQKHR++TLG+ KEFNFDNADG
Sbjct: 362  AEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG 421

Query: 1371 -----------------GCSDKEGKNWSFFPMMQPGV 1430
                             G  D   KNWSFFP+MQP +
Sbjct: 422  VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  355 bits (911), Expect = 3e-95
 Identities = 219/432 (50%), Positives = 243/432 (56%), Gaps = 39/432 (9%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGP-PQYQKRRWGSCWSIYWCFGSHKQTKR 431
            MR +NG++R++NS                 R P P  QKRRWGSCW  YWCF S K  KR
Sbjct: 1    MRSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KR 59

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAVL PE+   G+    +E ++TQ P+                       TQSP+GLL
Sbjct: 60   IGHAVLAPESRAPGSGVPAAE-NLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLL 118

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            SLTSI+AN+YSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 119  SLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQL DPN+R      RF  SQYEFQSYQLYPGSPVG L               
Sbjct: 179  SSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSP 238

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPSNEIVVDHRVSFE 1133
            FPD  S S  P  L     GPP            SR GS       P+NEI+VDHRVSFE
Sbjct: 239  FPD-RSGSITPDAL-----GPP------------SRDGSVLDHSGCPNNEIMVDHRVSFE 280

Query: 1134 LTPENIVRCVEKEPEGLARAVSASLQNHETGKVTKES-------------LXXXXXXXXX 1274
            LT E++VRCVEK+   L +AVSASLQN  T ++ + S                       
Sbjct: 281  LTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAP 340

Query: 1275 XXXXXXXRQHHQKHRSITLGSVKEFNFDNADGGCSDK-------------------EGKN 1397
                    Q H K RSITLGS KEFNFDNADGG SDK                     KN
Sbjct: 341  EDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKN 400

Query: 1398 WSFFPMMQPGVS 1433
            WS F MMQP VS
Sbjct: 401  WSIFHMMQPSVS 412


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  352 bits (904), Expect = 2e-94
 Identities = 223/460 (48%), Positives = 250/460 (54%), Gaps = 68/460 (14%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 431
            MR  NGES A+N+                 R P    QKRRWG CWSIYWCFGS+KQ KR
Sbjct: 1    MRGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKR 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IG AVL  ET  +G +   +E   TQ P+                       TQSPAGL+
Sbjct: 61   IGPAVLTSETSFSGANVPAAENP-TQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLV 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            SLTSISA+MYSPG P SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 178

Query: 792  SSPEVPFAQLLDPN------HRRFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQLL PN       +RFP S YEFQSYQL+PGSPVGQL               
Sbjct: 179  SSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSP 238

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDA----PSNEIVVD 1115
            F D E  +   HF EFR G PP+LLN  K ++ +WGS  GSG+LTPDA    P N  ++D
Sbjct: 239  FRDGEFAA-SLHFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLD 297

Query: 1116 -------------------------HRVSFELTPENIVRCVEKEPEGLARAVSASLQ--- 1211
                                     HRVSFELT E +VR +E E    + AVS SLQ   
Sbjct: 298  HQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEA 357

Query: 1212 -----NHETGKVTKESLXXXXXXXXXXXXXXXXRQ---HHQKHRSITLGSVKEFNFDNAD 1367
                  H+T  V                     R+    H KH+SITLGS KEFNFDN D
Sbjct: 358  TRESEEHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVD 417

Query: 1368 GGCSDKE-------------------GKNWSFFPMMQPGV 1430
            GG + K                     +NWSFFPMMQPGV
Sbjct: 418  GGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  352 bits (903), Expect = 3e-94
 Identities = 221/456 (48%), Positives = 245/456 (53%), Gaps = 63/456 (13%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPPQYQKRRWGSCWSIYWCFGSHKQTKRI 434
            MR  NGESRA N+                 R P    +RRWGSCWSIY CFG  K  K+I
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQRRWGSCWSIYLCFGYQKHKKQI 60

Query: 435  GHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLLS 614
            GHAVL PE    G     SE   TQ P+                       TQSPAGL+S
Sbjct: 61   GHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVS 119

Query: 615  LTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 794
            LTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS
Sbjct: 120  LTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 179

Query: 795  SPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXXF 956
            SPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL               F
Sbjct: 180  SPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPF 236

Query: 957  PDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP--------------- 1085
            PD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP               
Sbjct: 237  PDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHRQ 296

Query: 1086 --DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGKV 1232
              D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K 
Sbjct: 297  FSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAKE 356

Query: 1233 TKES-----------LXXXXXXXXXXXXXXXXRQHHQKHRSITLGSVKEFNFDNADGGCS 1379
             K S                               H+K +SITLGSVKEFNFDNAD G S
Sbjct: 357  EKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDS 416

Query: 1380 ---------------DKEG---KNWSFFPMMQPGVS 1433
                            KEG   KNWSFFPM+Q GVS
Sbjct: 417  RKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  352 bits (903), Expect = 3e-94
 Identities = 223/457 (48%), Positives = 246/457 (53%), Gaps = 64/457 (14%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPPQ-YQKRRWGSCWSIYWCFGSHKQTKR 431
            MR  NGESRA N+                 R P    QKRRWGSCWSIY CFG  K  K+
Sbjct: 1    MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQ 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAVL PE    G     SE   TQ P+                       TQSPAGL+
Sbjct: 61   IGHAVLFPEPSAPGNGAPASENP-TQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            SLTSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQ LDP+ R      RFP   ++FQSYQ +PGSPVGQL               
Sbjct: 180  SSPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSP 236

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTP-------------- 1085
            FPD E    G HF EFR G PP+LLN  KL+T +WGS  GSG+LTP              
Sbjct: 237  FPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNFLLHR 296

Query: 1086 ---DAPS---------NEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 1229
               D PS         N  VV+HRVSFELT E+  RCVE++P    + V   ++N    K
Sbjct: 297  QFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAK 356

Query: 1230 VTKES-----------LXXXXXXXXXXXXXXXXRQHHQKHRSITLGSVKEFNFDNADGGC 1376
              K S                               H+K +SITLGSVKEFNFDNAD G 
Sbjct: 357  EEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGD 416

Query: 1377 S---------------DKEG---KNWSFFPMMQPGVS 1433
            S                KEG   KNWSFFPM+Q GVS
Sbjct: 417  SRKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  340 bits (872), Expect = 1e-90
 Identities = 210/456 (46%), Positives = 239/456 (52%), Gaps = 63/456 (13%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 431
            M R+NGE R V+S                 R P    QKRRWG CWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETISAAATAIASVENRVPQASIQKRRWGGCWSMYWCFGSQKQTKR 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAV +PET  +G D   S  S +Q PS                       T SP G  
Sbjct: 61   IGHAVFIPETTASGADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L               
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 1097
            F D E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLDREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQ 287

Query: 1098 ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 1229
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 1230 VTKESLXXXXXXXXXXXXXXXXR------------QHHQKHRSITLGSVKEFNFDNADGG 1373
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 1374 CSDKE--GKNW--------------SFFPMMQPGVS 1433
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  338 bits (866), Expect = 5e-90
 Identities = 209/456 (45%), Positives = 239/456 (52%), Gaps = 63/456 (13%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 431
            M R+NGE R V+S                 R P    QKRRWGSCWS+YWCFGS KQTKR
Sbjct: 1    MNRVNGEQRGVDSTLETINAAATAIASVENRVPQASIQKRRWGSCWSMYWCFGSQKQTKR 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAV +PET  +  D   S  S +Q PS                       T SP G  
Sbjct: 61   IGHAVFIPETTASAADRPSSNTS-SQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSK 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
             L   S + YSP GP SIFAIGPYAHETQLVSPPVFS FTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  CL---SMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTP 176

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFA+LLDPN++      R+P++QYEFQSYQL PGSPV  L               
Sbjct: 177  SSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP 236

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSLTPDAPS------------ 1097
            F + E     P FL          L K+  H+WGSR GSG+LTP+A +            
Sbjct: 237  FLEREYTPGRPQFLN---------LEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQ 287

Query: 1098 ---------------NEI-VVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 1229
                           N++ VVDHRVSFE+T E++VRCVEK+P  + R  S SLQ+ E   
Sbjct: 288  NTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 1230 VTKESLXXXXXXXXXXXXXXXXR------------QHHQKHRSITLGSVKEFNFDNADGG 1373
              +E+L                             Q  QKHRSITLGS KEFNFDN DGG
Sbjct: 348  KRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 1374 CSDKE--GKNW--------------SFFPMMQPGVS 1433
              DK   G +W                FPMMQPGVS
Sbjct: 408  YPDKATIGSDWWANEKVLGKEPCNNWIFPMMQPGVS 443


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  329 bits (844), Expect = 2e-87
 Identities = 205/428 (47%), Positives = 231/428 (53%), Gaps = 71/428 (16%)
 Frame = +3

Query: 363  QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 542
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 60

Query: 543  XXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 722
                            QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 61   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 116

Query: 723  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 884
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 117  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 176

Query: 885  PGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 1058
            PGSPVGQL               F D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 177  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 236

Query: 1059 RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 1154
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 237  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 1155 RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 1286
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 297  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 354

Query: 1287 XXXRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 1409
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 355  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 414

Query: 1410 PMMQPGVS 1433
            PM+QPG+S
Sbjct: 415  PMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  329 bits (844), Expect = 2e-87
 Identities = 205/428 (47%), Positives = 231/428 (53%), Gaps = 71/428 (16%)
 Frame = +3

Query: 363  QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 542
            QKRRW   W +YWCFG  +  KRIGHAV++PET + G +   +E ++TQ  S        
Sbjct: 39   QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAE-NLTQASSIVLPFAAP 97

Query: 543  XXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 722
                            QSP    SL   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFS
Sbjct: 98   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 153

Query: 723  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLY 884
            TFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD N R      R+P S YEFQSYQ Y
Sbjct: 154  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 213

Query: 885  PGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNK--LNTHDWGS 1058
            PGSPVGQL               F D E  S G HFLEFRTG  P++LN   L T DWGS
Sbjct: 214  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 273

Query: 1059 RLGSGSLTPDAPSNE----------------------------IVVDHRVSFELTPENIV 1154
            RL SGS+TPDA  +                               + HRVSFEL+ E +V
Sbjct: 274  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 333

Query: 1155 RCVEKEPEGLARAVSASLQNHETGKVTKE----------------SLXXXXXXXXXXXXX 1286
            RCVEK+P  LA AVS SLQ+ E  K  +E                               
Sbjct: 334  RCVEKKPVALAEAVSTSLQSAE--KAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 391

Query: 1287 XXXRQHHQKHRSITLGSVKEFNFDNADGGCS-------------------DKEGKNWSFF 1409
                  +QK RSITLGS KEFNFDNADGG S                   + E KNWSFF
Sbjct: 392  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFF 451

Query: 1410 PMMQPGVS 1433
            PM+QPG+S
Sbjct: 452  PMIQPGMS 459


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  307 bits (786), Expect = 9e-81
 Identities = 194/436 (44%), Positives = 228/436 (52%), Gaps = 46/436 (10%)
 Frame = +3

Query: 255  MRRLNGESRAVNSXXXXXXXXXXXXXXXXXRGPP-QYQKRRWGSCWSIYWCFGSHKQTKR 431
            MR +NGESRA N+                 R P    QK+RW S WSIYWCFG  K  ++
Sbjct: 1    MRDVNGESRAANNTLETINAAATAIASAENRVPQAMVQKQRWRSHWSIYWCFGYQKSKRQ 60

Query: 432  IGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXXXXXXXXXXXXXXXXTQSPAGLL 611
            IGHAVL PE+   G+    +E S  Q P                        TQSPAGL+
Sbjct: 61   IGHAVLFPESSAPGSGAPAAENS-AQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGLV 119

Query: 612  SLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 791
            S TSISA+MYSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP
Sbjct: 120  SRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 179

Query: 792  SSPEVPFAQLLDPNHR------RFPYSQYEFQSYQLYPGSPVGQLXXXXXXXXXXXXXXX 953
            SSPEVPFAQL+DP  R      RFP+   +FQSYQ +PGS VGQL               
Sbjct: 180  SSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHPGSSVGQLISPSSGISGSGTSSP 236

Query: 954  FPDPESVSHGPHFLEFRTGGPPQLLN--KLNTHDWGSRLGSGSLTPDAP----------- 1094
            FPD E    GPH  EFR G  P+LLN  KL+T +WGS   SG+LTPD+            
Sbjct: 237  FPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSYQDSGALTPDSVRHGSPNFLLHR 294

Query: 1095 ---------------SNEIVVDHRVSFELTPENIVRCVEKEPEGLARAVSASLQNHETGK 1229
                            ++ VV+HR SFEL+ ++  RCVE++P    + V   ++N    K
Sbjct: 295  QFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRCVEEKPACSIKTVPEYVENGTKAK 354

Query: 1230 VT----------KESLXXXXXXXXXXXXXXXXRQHHQKHRSITLGSVKEFNFDNADGGCS 1379
                        +                      H+K + ITLGSV EFNFDNAD G S
Sbjct: 355  EEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQHRKQQPITLGSVNEFNFDNADEGDS 414

Query: 1380 -DKEGKNWSFFPMMQP 1424
             +    NW   P   P
Sbjct: 415  HNPSSSNWVKQPRTGP 430


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  297 bits (761), Expect = 8e-78
 Identities = 187/435 (42%), Positives = 218/435 (50%), Gaps = 74/435 (17%)
 Frame = +3

Query: 351  PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 530
            P   QKRRWGSC S+YWCFGSH+ +KRIGHAVLVPE    G     SE ++    S    
Sbjct: 27   PTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASE-NLNLSTSIVLP 85

Query: 531  XXXXXXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 710
                               TQSPAG LSLT++S N YSP GP S+FAIGPYAHETQLVSP
Sbjct: 86   FIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSP 145

Query: 711  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 860
            PVFSTF TEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++   S Y
Sbjct: 146  PVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNY 205

Query: 861  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNKLN 1040
            EFQ YQLYP SPVG L               FPD   +   P  L F            +
Sbjct: 206  EFQPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPIVEAPKLLGF---------EHFS 253

Query: 1041 THDWGSRLGSGSLTPD----------------------------APSNEIVVDHRVSFEL 1136
            T  WGSRLGSGSLTPD                            + + E V+DHRVSFEL
Sbjct: 254  TRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFEL 313

Query: 1137 TPENIVRCVEKEPEGLARAVSASLQN-HETGKVTKES---------------LXXXXXXX 1268
              E++  CVEK+P   A  V  +LQ+  E G++ +E                        
Sbjct: 314  AGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAAS 373

Query: 1269 XXXXXXXXXRQHHQKHRSITLGSVKEFNFDNADGGCSDKE--------------GK---- 1394
                      Q H+KH  I  GS+KEFNFDN  G  S K               GK    
Sbjct: 374  EKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGP 433

Query: 1395 --NWSFFPMMQPGVS 1433
              NW+FFP++QPG+S
Sbjct: 434  QTNWTFFPLLQPGIS 448


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  297 bits (760), Expect = 1e-77
 Identities = 194/476 (40%), Positives = 228/476 (47%), Gaps = 115/476 (24%)
 Frame = +3

Query: 351  PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 530
            P   QK+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++        
Sbjct: 27   PTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILP 85

Query: 531  XXXXXXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 710
                               TQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+P
Sbjct: 86   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTP 145

Query: 711  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 860
            PVFS  TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 146  PVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHY 205

Query: 861  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNKLN 1040
            EFQSYQ+YPGSP G L               FPD   +      LEFR G  P+LL   N
Sbjct: 206  EFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFEN 259

Query: 1041 --THDWGSRLGSGSL--------------------------------TPDA--------- 1091
              T  WGSRLGSGSL                                TPD          
Sbjct: 260  FTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 319

Query: 1092 --------------PSN-----EIVVDHRVSFELTPENIVRCVE---------------- 1166
                          P+N     E +VDHRVSFEL+ E++  C+E                
Sbjct: 320  LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD 379

Query: 1167 ------KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXXRQHHQKHRS 1322
                  KE +G+ + + +S +    ET   T E                     +QKHRS
Sbjct: 380  LVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRS 429

Query: 1323 ITLGSVKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 1433
            +TLGS+KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 430  VTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  293 bits (751), Expect = 1e-76
 Identities = 192/471 (40%), Positives = 226/471 (47%), Gaps = 115/471 (24%)
 Frame = +3

Query: 366  KRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXXX 545
            K+RWGSCW +YWCFGS K +KRIGHAVLVPE    G     +E +++             
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAE-NVSNPTGIILPFIAPP 94

Query: 546  XXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFST 725
                          TQSPAGLLSLTS+S N YSP GP SIFAIGPYAHETQLV+PPVFS 
Sbjct: 95   SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSA 154

Query: 726  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQYEFQSY 875
             TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S YEFQSY
Sbjct: 155  LTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSY 214

Query: 876  QLYPGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNKLN--THD 1049
            Q+YPGSP G L               FPD   +      LEFR G  P+LL   N  T  
Sbjct: 215  QIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI------LEFRMGEAPKLLGFENFTTRK 268

Query: 1050 WGSRLGSGSL--------------------------------TPDA-------------- 1091
            WGSRLGSGSL                                TPD               
Sbjct: 269  WGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQ 328

Query: 1092 ---------PSN-----EIVVDHRVSFELTPENIVRCVE--------------------- 1166
                     P+N     E +VDHRVSFEL+ E++  C+E                     
Sbjct: 329  ISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEG 388

Query: 1167 -KEPEGLARAVSASLQN--HETGKVTKESLXXXXXXXXXXXXXXXXRQHHQKHRSITLGS 1337
             KE +G+ + + +S +    ET   T E                     +QKHRS+TLGS
Sbjct: 389  RKERDGIKKDLESSCELFIRETSNETVEKASGEAEE----------EHSYQKHRSVTLGS 438

Query: 1338 VKEFNFDNADGGCSDK-------------------EGKNWSFFPMMQPGVS 1433
            +KEFNFDN  G  SDK                    G +W+FFPM+QP VS
Sbjct: 439  IKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  292 bits (747), Expect = 3e-76
 Identities = 194/478 (40%), Positives = 227/478 (47%), Gaps = 117/478 (24%)
 Frame = +3

Query: 351  PPQYQKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXX 530
            P   QKRRWG CWS+YWCFGSHK TKRIGHAVL PE    G     S  + +Q  +    
Sbjct: 41   PTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGA-VVTSAENQSQSTAITVP 98

Query: 531  XXXXXXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSP 710
                               TQSPAGLLSLTS+S N YSPGGP SIFAIGPYAHETQLV+P
Sbjct: 99   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTP 158

Query: 711  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPN----------HRRFPYSQY 860
            P FS FTTEPSTAPFTPPPESV LTTPSSPEVPFAQLL  +          +++F  S Y
Sbjct: 159  PAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHY 218

Query: 861  EFQSYQLYPGSPVGQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLN--K 1034
            EFQSY LYPGSP GQL               FPD   +      LEFR G  P+LL    
Sbjct: 219  EFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPI------LEFRMGEAPKLLGFEH 272

Query: 1035 LNTHDWGSRLGS------------------------------------------------ 1070
              T  WGSRLGS                                                
Sbjct: 273  FTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGS 332

Query: 1071 GSLTPDA----------------------------PSNEIVVDHRVSFELTPENIVRCVE 1166
            GSLTPDA                             ++E +VDHRVSFEL+ E + RC+E
Sbjct: 333  GSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLE 392

Query: 1167 KEPEGLARAVSASLQNH------ETGKV--TKESLXXXXXXXXXXXXXXXXRQH---HQK 1313
             +     RA S    +       ++GK+  T E+L                 +    ++K
Sbjct: 393  SKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRK 452

Query: 1314 HRSITLGSVKEFNFDNAD------------------GGCSDKEGKNWSFFPMMQPGVS 1433
            HRSITLGS+KEFNFDN+                    G   +   NW+FFP++QP VS
Sbjct: 453  HRSITLGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  289 bits (740), Expect = 2e-75
 Identities = 186/406 (45%), Positives = 215/406 (52%), Gaps = 49/406 (12%)
 Frame = +3

Query: 363  QKRRWGSCWSIYWCFGSHKQTKRIGHAVLVPETPTTGTDTQVSERSMTQQPSRXXXXXXX 542
            QKRRW S WS+YWCF  +   KRIGHAVLV ET ++ T    +     Q PS        
Sbjct: 36   QKRRWRSFWSLYWCFRPNNN-KRIGHAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAP 94

Query: 543  XXXXXXXXXXXXXXXTQSPAGLLSLTSISANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 722
                           TQSP GLLSL+S S N+YSP GP SIFAIGPYAHETQLVSPPVFS
Sbjct: 95   PSSPASFIPSEPPSSTQSPTGLLSLSSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFS 154

Query: 723  TFTTEPSTAPFTPPPE-SVHLTTPSSPEVPFAQLLDPNHRRFPYSQYEFQSYQLYPGSPV 899
            TFTTEPSTAP+TPPPE S HLTTPSSPEVPFA+LL+PN +R+P SQYEFQSYQL PGSPV
Sbjct: 155  TFTTEPSTAPYTPPPEFSAHLTTPSSPEVPFARLLEPN-QRYPLSQYEFQSYQLQPGSPV 213

Query: 900  GQLXXXXXXXXXXXXXXXFPDPESVSHGPHFLEFRTGGPPQLLNKLNTHDWGSRLGSGSL 1079
              L               F D +  +  P FLEF  G PP+         W S   SG +
Sbjct: 214  SHLISPCSGISGSGASSPFLDRDFAAVHPFFLEFGGGNPPR------RDQWESCQESGVV 267

Query: 1080 TP-DA----------------------PSN-------EIVVDHRVSFELTPENIVRCVEK 1169
            TP DA                      P N          +DHRVSFE+T E ++RCVEK
Sbjct: 268  TPTDAVGPRSRDSCVLLNRQNSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEK 327

Query: 1170 EPEGLARAVSASLQNHETGKVTKESLXXXXXXXXXXXXXXXXRQHHQKHRSITLGSVKEF 1349
            +        S        GK   E +                 + HQK+R+ITLGS KEF
Sbjct: 328  K--------SLETAQESVGKKPIELI-----NREEDQTEIVNEKRHQKNRTITLGSTKEF 374

Query: 1350 NFD--NADGGCSD------------KEG----KNWSFFPMMQPGVS 1433
            NF+  N D  C D            KEG    +NWSFFP++QPGVS
Sbjct: 375  NFEGGNCDEPCVDSSEWWVNEKKVPKEGGGSSENWSFFPILQPGVS 420


Top