BLASTX nr result

ID: Rheum21_contig00010219 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00010219
         (1113 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   348   2e-93
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   348   2e-93
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   335   3e-89
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   333   1e-88
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   325   2e-86
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   322   1e-85
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   315   3e-83
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   313   8e-83
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     305   2e-80
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   305   2e-80
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   301   4e-79
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   296   8e-78
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   295   2e-77
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              286   1e-74
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   284   4e-74
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   273   7e-71
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   272   2e-70
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   271   3e-70
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   268   3e-69
ref|XP_002333412.1| predicted protein [Populus trichocarpa]           265   2e-68

>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  348 bits (894), Expect = 2e-93
 Identities = 192/376 (51%), Positives = 228/376 (60%), Gaps = 5/376 (1%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            TS+KRRWG  WSIS CFG Q + KRIGHAV V              +             
Sbjct: 37   TSQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVA 96

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                       SIS N+YSPGGP SIFA+GPYAHETQLVSPP+F
Sbjct: 97   PPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVF 156

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDP L+ G+ GQ+F  S YEFQ Y  
Sbjct: 157  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHL 216

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            +P SPVG LISP S ISGSGTSSPFPD E+ T G  + D   G PPKLLNL  L  R+WG
Sbjct: 217  HPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWG 276

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDV 900
            S QGSGTLTPDA+    R+ +  +RQ S ++  P   +G    + ++D RVSFELT EDV
Sbjct: 277  SRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLR-KDQIVDHRVSFELTTEDV 335

Query: 901  VRCVEKGPKSLPKALQIVGSEGGDV---STNILADNIDKPPSEDGTGKSP--SEADAEEG 1065
            VRCVEK P +L +A+      G  V    ++  A+N+    + +     P  +  D EE 
Sbjct: 336  VRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEA 395

Query: 1066 QRCQKQRSVSLGSIKE 1113
             R QKQ+S++LGS KE
Sbjct: 396  PRHQKQQSITLGSTKE 411


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  348 bits (894), Expect = 2e-93
 Identities = 191/376 (50%), Positives = 229/376 (60%), Gaps = 5/376 (1%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            TS+KRRWG  W+IS CFG Q + KRIGHAV V              +             
Sbjct: 37   TSQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLPFVA 96

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                       SIS N+YSPGGP SIFA+GPYAHETQLVSPP+F
Sbjct: 97   PPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVF 156

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDP L+ G+ GQ+F  S YEFQ Y  
Sbjct: 157  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHL 216

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            +P SPVG LISP S ISGSGTSSPFPD E+ T G  + D   G PPKLLNL  L  R+WG
Sbjct: 217  HPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWG 276

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDV 900
            S QGSGTLTPDA++   R+ +  +RQ S ++  P   +G    + ++D RVSFELT EDV
Sbjct: 277  SRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLR-KDQIVDHRVSFELTTEDV 335

Query: 901  VRCVEKGPKSLPKALQIVGSEGGDV---STNILADNIDKPPSEDGTGKSP--SEADAEEG 1065
            VRCVEK P +L +A+      G  V    ++  A+N+    + +     P  +  D EE 
Sbjct: 336  VRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEA 395

Query: 1066 QRCQKQRSVSLGSIKE 1113
             R QKQ+S++LGS KE
Sbjct: 396  PRHQKQQSITLGSTKE 411


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  335 bits (858), Expect = 3e-89
 Identities = 195/408 (47%), Positives = 231/408 (56%), Gaps = 37/408 (9%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWGS W    CF S   +KRIGHAV                +L            
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPK-DKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVA 94

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TSI+AN+YSPGGP SIFA+GPYAHETQLVSPP+F
Sbjct: 95   PPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVF 154

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL DP  ++G+ G RF  SQYEFQ YQ 
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQL 214

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYY-TGGLHYRDIRLGVPPKLLNLQSLPSRDW 717
            YP SPVG LISP S ISGSGTSSPFPD ++  +G   + + R G PPKLL L  L + +W
Sbjct: 215  YPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEW 274

Query: 718  GSHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLAD------------------GHH 843
            GS  GSG++TPDA+ P SRD  +LDRQ S +   PS  D                  G  
Sbjct: 275  GSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCP 334

Query: 844  YHEAVIDQRVSFELTVEDVVRCVEKGPKSLPKALQ------------------IVGSEGG 969
             +E ++D RVSFELT EDVVRCVEK   +L KA+                   +V SEG 
Sbjct: 335  NNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGR 394

Query: 970  DVSTNILADNIDKPPSEDGTGKSPSEADAEEGQRCQKQRSVSLGSIKE 1113
                  + +  + PP      K+P +A+ EEGQ   KQRS++LGS KE
Sbjct: 395  ------VGETANNPPE-----KAPEDANGEEGQPHHKQRSITLGSAKE 431


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  333 bits (853), Expect = 1e-88
 Identities = 180/376 (47%), Positives = 226/376 (60%), Gaps = 5/376 (1%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWGS+WS+  CFG Q + KRIGHAV V              +             
Sbjct: 36   TVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAEN---PIQTPSIVLP 92

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                       S++A++YSP GP SIFA+GPYAHETQLVSPP+F
Sbjct: 93   FVAPPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVF 152

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDP  ++G+ GQRF  S YEFQ YQ 
Sbjct: 153  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQL 212

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            YP SPVGQLISP S ISGSGTSSPFPD E+   G H+ + R G PPKLLNL  L +RDWG
Sbjct: 213  YPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWG 272

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDV 900
            S  GSG++TPD  +  S D ++L  Q   +   P   +    ++  I+ RVSFEL+ E+V
Sbjct: 273  SRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEV 332

Query: 901  VRCVEKGPKSLPKAL-----QIVGSEGGDVSTNILADNIDKPPSEDGTGKSPSEADAEEG 1065
            +RCVEK P +L +A+         ++  +  + +++ +I             + AD EE 
Sbjct: 333  IRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEA 392

Query: 1066 QRCQKQRSVSLGSIKE 1113
            Q   KQRS++LGS+KE
Sbjct: 393  QLHPKQRSITLGSVKE 408


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  325 bits (834), Expect = 2e-86
 Identities = 180/379 (47%), Positives = 232/379 (61%), Gaps = 8/379 (2%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWGS WSI  CFG Q + K+IGHAV                +             
Sbjct: 36   TVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAA 95

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TSISA++YSP GP SIFA+GPYAHETQLVSPP+F
Sbjct: 96   PPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVF 155

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQ LDP L++GD G RF    ++FQ YQF
Sbjct: 156  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRF---PFDFQSYQF 212

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            +P SPVGQLISP S ISGSGTSSPFPD E+  GG H+ + R+G PPKLLNL  L + +WG
Sbjct: 213  HPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWG 272

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDV 900
            S+QGSG LTP++++ +   +++L RQ S +   P   +GH  +  V++ RVSFELT ED 
Sbjct: 273  SYQGSGALTPESVR-RGSPNFLLHRQFSDVPSRPRSGNGHK-NGQVVNHRVSFELTAEDA 330

Query: 901  VRCVEKGP----KSLPKALQIVGSEGGDVSTNILADNIDKPPSEDGTGKSPS----EADA 1056
             RCVE+ P    K++P+ ++  G++  +   +   ++I       G   + S      D 
Sbjct: 331  SRCVEEKPAFSIKTVPEYVE-NGTQAKEEKNS--GESIQSFECRVGVTSNDSPEMASTDG 387

Query: 1057 EEGQRCQKQRSVSLGSIKE 1113
            E   + +KQ+S++LGS+KE
Sbjct: 388  EAAPQHRKQQSITLGSVKE 406


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  322 bits (826), Expect = 1e-85
 Identities = 178/376 (47%), Positives = 230/376 (61%), Gaps = 8/376 (2%)
 Frame = +1

Query: 10   KRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXXX 189
            +RRWGS WSI  CFG Q + K+IGHAV                +                
Sbjct: 38   QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97

Query: 190  XXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFSTF 369
                                   TSISA++YSP GP SIFA+GPYAHETQLVSPP+FSTF
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 370  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYPR 549
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQ LDP L++GD G RF    ++FQ YQF+P 
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRF---PFDFQSYQFHPG 214

Query: 550  SPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSHQ 729
            SPVGQLISP S ISGSGTSSPFPD E+  GG H+ + R+G PPKLLNL  L + +WGS+Q
Sbjct: 215  SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQ 274

Query: 730  GSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVRC 909
            GSG LTP++++ +   +++L RQ S +   P   +GH  +  V++ RVSFELT ED  RC
Sbjct: 275  GSGALTPESVR-RGSPNFLLHRQFSDVPSRPRSGNGHK-NGQVVNHRVSFELTAEDASRC 332

Query: 910  VEKGP----KSLPKALQIVGSEGGDVSTNILADNIDKPPSEDGTGKSPS----EADAEEG 1065
            VE+ P    K++P+ ++  G++  +   +   ++I       G   + S      D E  
Sbjct: 333  VEEKPAFSIKTVPEYVE-NGTQAKEEKNS--GESIQSFECRVGVTSNDSPEMASTDGEAA 389

Query: 1066 QRCQKQRSVSLGSIKE 1113
             + +KQ+S++LGS+KE
Sbjct: 390  PQHRKQQSITLGSVKE 405


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  315 bits (806), Expect = 3e-83
 Identities = 178/376 (47%), Positives = 216/376 (57%), Gaps = 7/376 (1%)
 Frame = +1

Query: 7    KKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXX 186
            +KRRWGS WS+  CFGSQ   KRIGHAVF+              S               
Sbjct: 38   QKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRP---SSNTSSQAPSIVLPFI 94

Query: 187  XXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFST 366
                                      +S + YSP GP SIFA+GPYAHETQLVSPP+FS 
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSA 154

Query: 367  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYP 546
            FTTEPSTAPFTPPPESVHLTTPSSPEVPFA+LLDP  Q+   G R+  +QYEFQ YQ  P
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQP 214

Query: 547  RSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSH 726
             SPV  LISPGSAIS SGTSSPF + EY  G            P+ LNL+ +   +WGS 
Sbjct: 215  GSPVSNLISPGSAISVSGTSSPFLEREYTPG-----------RPQFLNLEKIAPHEWGSR 263

Query: 727  QGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVR 906
            QGSGTLTP+A+ PK  DS++L+ Q++ +  LP   +G      V+D RVSFE+T EDVVR
Sbjct: 264  QGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVR 323

Query: 907  CVEKGPKSLPKALQIVGSEGGDVSTNILADNIDKPPSEDGTGKSP-------SEADAEEG 1065
            CVEK P  + +    V  +  + ST    +  +   + D +G  P       S  D E+G
Sbjct: 324  CVEKKPTMMMRT-GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDG 382

Query: 1066 QRCQKQRSVSLGSIKE 1113
            QR QK RS++LGS KE
Sbjct: 383  QRQQKHRSITLGSSKE 398


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  313 bits (802), Expect = 8e-83
 Identities = 178/376 (47%), Positives = 213/376 (56%), Gaps = 7/376 (1%)
 Frame = +1

Query: 7    KKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXX 186
            +KRRWG  WS+  CFGSQ   KRIGHAVF+              S               
Sbjct: 38   QKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRP---SSNTSSQAPSIVLPFI 94

Query: 187  XXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFST 366
                                      +S + YSP GP SIFA+GPYAHETQLVSPP+FS 
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSA 154

Query: 367  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYP 546
            FTTEPSTAPFTPPPESVHLTTPSSPEVPFA+LLDP  Q+   G R+  +QYEFQ YQ  P
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQP 214

Query: 547  RSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSH 726
             SPV  LISPGSAIS SGTSSPF D EY  G            P+ LNL+ +   +WGS 
Sbjct: 215  GSPVSNLISPGSAISVSGTSSPFLDREYTPG-----------RPQFLNLEKIAPHEWGSR 263

Query: 727  QGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVR 906
            QGSGTLTP+A+ PK  D+++L+ Q+S +  LP   +G      V+D RVSFE+T EDVVR
Sbjct: 264  QGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVR 323

Query: 907  CVEKGPKSLPKALQIVGSEGGDVSTNILADNIDKPPSEDGTGKSP-------SEADAEEG 1065
            CVEK P  + +    V  +  + ST    +  +     D  G  P       S  D E+G
Sbjct: 324  CVEKKPTMMMRT-GSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDG 382

Query: 1066 QRCQKQRSVSLGSIKE 1113
            QR QK RS++LGS KE
Sbjct: 383  QRQQKHRSITLGSSKE 398


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  305 bits (782), Expect = 2e-80
 Identities = 176/381 (46%), Positives = 216/381 (56%), Gaps = 10/381 (2%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWG   SI  CFG+     RIGH V V              +             
Sbjct: 38   TVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFIA 97

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TS+SA++YSPGGP SIFA+GPYAHETQLVSPP+F
Sbjct: 98   PPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVF 157

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDP + +G+ GQRF     EFQ Y F
Sbjct: 158  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYF 217

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
             P SP+GQLISP S ISGSGTSSPFPD E+   G H+ + R G PPKLLNL  L   DWG
Sbjct: 218  QPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDWG 277

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEA--VIDQRVSFELTVE 894
            S QGSG+LTPD+++P            ST    P L        A  V D+RVSF+++ E
Sbjct: 278  SRQGSGSLTPDSVKP-----------ISTFEVAPHLKPNGRCRNAENVADRRVSFDVSTE 326

Query: 895  DVVRCVEKGPKSLPKAL--QIVGSEGGDVSTNILADNIDKPPSEDGTGKSPSE------A 1050
            DV+R VEK    L +A+   +  +  G    N  ++ +++   E+  G++ +E       
Sbjct: 327  DVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSNEEPDKAPT 386

Query: 1051 DAEEGQRCQKQRSVSLGSIKE 1113
              EE  + QK RS++LGS KE
Sbjct: 387  SGEEVLQHQKHRSITLGSSKE 407


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  305 bits (781), Expect = 2e-80
 Identities = 181/377 (48%), Positives = 219/377 (58%), Gaps = 6/377 (1%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWG  WSI  CFGS    KRIG AV                +             
Sbjct: 36   TVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFVA 95

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TSISA++YSPG P SIFA+GPYAHETQLVSPP+F
Sbjct: 96   PPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVF 154

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL P LQ G+  QRF  S YEFQ YQ 
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQL 214

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            +P SPVGQLISP S ISGSGTSSPF D E +   LH+ + R+G PPKLLNL    S +WG
Sbjct: 215  HPGSPVGQLISPSSGISGSGTSSPFRDGE-FAASLHFPEFRMGDPPKLLNLDKHSSCEWG 273

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVI-DQRVSFELTVED 897
            SH GSGTLTPDA +   R+ ++LD Q S I+  P L +    ++ V  + RVSFELT E+
Sbjct: 274  SHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEE 333

Query: 898  VVRCVE----KGPKSLPKALQIVGS-EGGDVSTNILADNIDKPPSEDGTGKSPSEADAEE 1062
            VVR +E       +++  +LQI  + E  +  T ++ D   +           + AD E 
Sbjct: 334  VVRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNERPEKALADREG 393

Query: 1063 GQRCQKQRSVSLGSIKE 1113
              +  K +S++LGS KE
Sbjct: 394  KPQHHKHQSITLGSAKE 410


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  301 bits (770), Expect = 4e-79
 Identities = 167/373 (44%), Positives = 222/373 (59%), Gaps = 4/373 (1%)
 Frame = +1

Query: 7    KKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXX 186
            +K+RW S WSI  CFG Q   ++IGHAV                +               
Sbjct: 38   QKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPP 97

Query: 187  XXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFST 366
                                    TSISA++YSP GP SIFA+GPYAHETQLVSPP+FST
Sbjct: 98   SSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 367  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYP 546
            FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL+DP L++G  G RF    ++FQ YQF+P
Sbjct: 158  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRF---PFDFQSYQFHP 214

Query: 547  RSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSH 726
             S VGQLISP S ISGSGTSSPFPD E+  GG H  + R+G  PKLLNL  L +R+WGS+
Sbjct: 215  GSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSY 272

Query: 727  QGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVR 906
            Q SG LTPD+++  S  +++L RQ S ++  P   +GH   + V++ R SFEL+V+D  R
Sbjct: 273  QDSGALTPDSVRHGS-PNFLLHRQFSDVASHPRSENGHD-DDQVVNHRFSFELSVKDASR 330

Query: 907  CVEKGP----KSLPKALQIVGSEGGDVSTNILADNIDKPPSEDGTGKSPSEADAEEGQRC 1074
            CVE+ P    K++P+ ++       + +   L  + ++   +       + +   E  + 
Sbjct: 331  CVEEKPACSIKTVPEYVENGTKAKEEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQH 390

Query: 1075 QKQRSVSLGSIKE 1113
            +KQ+ ++LGS+ E
Sbjct: 391  RKQQPITLGSVNE 403


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  296 bits (759), Expect = 8e-78
 Identities = 178/380 (46%), Positives = 223/380 (58%), Gaps = 9/380 (2%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRW   W +  CFG Q + KRIGHAV +              +L            
Sbjct: 37   TVQKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENL---TQASSIVLP 93

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                     + S+SA++YSPG P SIFA+GPYAHETQLVSPP+F
Sbjct: 94   FAAPPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPG-PSSIFAIGPYAHETQLVSPPVF 152

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD   + G+ GQR+  S YEFQ YQ+
Sbjct: 153  STFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQW 212

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            YP SPVGQLISP S ISGSGTSSPF D E+ +GG H+ + R G  PK+LNL  L +RDWG
Sbjct: 213  YPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWG 272

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHE-AVIDQRVSFELTVED 897
            S   SG++TPDA +  S + + L +  +    L + ++    ++ A I  RVSFEL+ E+
Sbjct: 273  SRLCSGSVTPDAAKSTSSEGFTL-KPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEE 331

Query: 898  VVRCVEKGPKSLPKA----LQIVGSEGGDVSTNILADNIDKPPSEDGTGKSPSEA---DA 1056
            VVRCVEK P +L +A    LQ       +   N    +  + P  D +  S  +A   DA
Sbjct: 332  VVRCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDA 391

Query: 1057 EE-GQRCQKQRSVSLGSIKE 1113
            EE   R QK+RS++LGS KE
Sbjct: 392  EELSYRYQKERSITLGSAKE 411


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  295 bits (756), Expect = 2e-77
 Identities = 177/378 (46%), Positives = 222/378 (58%), Gaps = 9/378 (2%)
 Frame = +1

Query: 7    KKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXX 186
            +KRRW   W +  CFG Q + KRIGHAV +              +L              
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENL---TQASSIVLPFA 58

Query: 187  XXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFST 366
                                   + S+SA++YSPG P SIFA+GPYAHETQLVSPP+FST
Sbjct: 59   APPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFST 117

Query: 367  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYP 546
            FTTEPSTAPFTPP ESVHLT PSSPEVPFAQLLD   + G+ GQR+  S YEFQ YQ+YP
Sbjct: 118  FTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYP 177

Query: 547  RSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSH 726
             SPVGQLISP S ISGSGTSSPF D E+ +GG H+ + R G  PK+LNL  L +RDWGS 
Sbjct: 178  GSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSR 237

Query: 727  QGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHE-AVIDQRVSFELTVEDVV 903
              SG++TPDA +  S + + L +  +    L + ++    ++ A I  RVSFEL+ E+VV
Sbjct: 238  LCSGSVTPDAAKSTSSEGFTL-KPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 904  RCVEKGPKSLPKA----LQIVGSEGGDVSTNILADNIDKPPSEDGTGKSPSEA---DAEE 1062
            RCVEK P +L +A    LQ       +   N    +  + P  D +  S  +A   DAEE
Sbjct: 297  RCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEE 356

Query: 1063 -GQRCQKQRSVSLGSIKE 1113
               R QK+RS++LGS KE
Sbjct: 357  LSYRYQKERSITLGSAKE 374


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  286 bits (732), Expect = 1e-74
 Identities = 175/389 (44%), Positives = 203/389 (52%), Gaps = 18/389 (4%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWGS W    CF S   +KRIGHAV                +L            
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPK-DKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVA 94

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TSI+AN+YSPGGP SIFA+GPYAHETQLVSPP+F
Sbjct: 95   PPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVF 154

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQF 540
            STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL DP  ++G+ G RF  SQYEFQ YQ 
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQL 214

Query: 541  YPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWG 720
            YP SPVG LISP S ISGSGTSSPFPD                                 
Sbjct: 215  YPGSPVGHLISPSSGISGSGTSSPFPDR-------------------------------- 242

Query: 721  SHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDV 900
                SG++TPDA+ P SRD  +LD              G   +E ++D RVSFELT EDV
Sbjct: 243  ----SGSITPDALGPPSRDGSVLDH------------SGCPNNEIMVDHRVSFELTAEDV 286

Query: 901  VRCVEKGPKSLPKALQ------------------IVGSEGGDVSTNILADNIDKPPSEDG 1026
            VRCVEK   +L KA+                   +V SEG       + +  + PP    
Sbjct: 287  VRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGR------VGETANNPPE--- 337

Query: 1027 TGKSPSEADAEEGQRCQKQRSVSLGSIKE 1113
              K+P +A+ EEGQ   KQRS++LGS KE
Sbjct: 338  --KAPEDANGEEGQPHHKQRSITLGSAKE 364


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  284 bits (727), Expect = 4e-74
 Identities = 159/291 (54%), Positives = 198/291 (68%), Gaps = 6/291 (2%)
 Frame = +1

Query: 259  TSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFSTFTTEPSTAPFTPPPESVHLTTPSS 438
            TS+SA++YSP GP SIFA+GPYAHETQLVSPP FSTFTTEPSTAPFTPPPESV LTTPSS
Sbjct: 126  TSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLTTPSS 185

Query: 439  PEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYPRSPVGQLISPGSAISGSGTSSPFP 618
            PEVPFAQLL+P  ++G+ G RF  S YEFQ YQFYP SPVGQLISP S ISGSGTSSPFP
Sbjct: 186  PEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFP 245

Query: 619  DHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSHQGSGTLTPDAIQPKSRDSYILDRQ 798
            D E+   G  + + ++ VPPKLLNL  L   + GS QGSGTLTPDA++  S  S+ LDRQ
Sbjct: 246  DGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATS-CSFPLDRQ 304

Query: 799  DSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVRCVEKGP----KSLPKALQ-IVGSE 963
             S I+     +D  +  + V D RVSF+L+ ED +R  E  P    K +P++++  + +E
Sbjct: 305  CSDIAS-NRHSDNENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIAAE 363

Query: 964  GGDVSTNILADNIDKPPSEDGTG-KSPSEADAEEGQRCQKQRSVSLGSIKE 1113
                S+ I   N +    E   G    +    E+  R QK R+++LG+ KE
Sbjct: 364  KVQKSSEI-RHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKE 413


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  273 bits (699), Expect = 7e-71
 Identities = 173/417 (41%), Positives = 213/417 (51%), Gaps = 46/417 (11%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +K+RWGS W +  CFGSQ  +KRIGHAV V              ++            
Sbjct: 29   TVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIA 88

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TS+S N YSP GP SIFA+GPYAHETQLV+PP+F
Sbjct: 89   PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVF 148

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQ----SGDVGQRFTPSQYEFQ 528
            S  TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL   L+    +  + Q+F  S YEFQ
Sbjct: 149  SALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQ 208

Query: 529  MYQFYPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPS 708
             YQ YP SP G LISPGSAIS SGTSSPFPD           + R+G  PKLL  ++  +
Sbjct: 209  SYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTT 262

Query: 709  RDWGSHQGS--------------------------------GTLTPDAIQPKSRDSYILD 792
            R WGS  GS                                G+LTPD + P SRD +++ 
Sbjct: 263  RKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVG 322

Query: 793  RQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVRCVE-------KGPKSLPKALQI 951
             Q S ++ L + A+G    E ++D RVSFEL+ EDV  C+E       +     PK L  
Sbjct: 323  SQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVA 382

Query: 952  VGSEGGDVSTNILADNID---KPPSEDGTGKSPSEADAEEGQRCQKQRSVSLGSIKE 1113
             G +  D     L  + +   +  S +   K+  E  AEE    QK RSV+LGSIKE
Sbjct: 383  EGRKERDGIKKDLESSCELFIRETSNETVEKASGE--AEEEHSYQKHRSVTLGSIKE 437


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  272 bits (695), Expect = 2e-70
 Identities = 172/414 (41%), Positives = 211/414 (50%), Gaps = 46/414 (11%)
 Frame = +1

Query: 10   KRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXXX 189
            K+RWGS W +  CFGSQ  +KRIGHAV V              ++               
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95

Query: 190  XXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFSTF 369
                                   TS+S N YSP GP SIFA+GPYAHETQLV+PP+FS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 370  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQ----SGDVGQRFTPSQYEFQMYQ 537
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQLL   L+    +  + Q+F  S YEFQ YQ
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 538  FYPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDW 717
             YP SP G LISPGSAIS SGTSSPFPD           + R+G  PKLL  ++  +R W
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTTRKW 269

Query: 718  GSHQGS--------------------------------GTLTPDAIQPKSRDSYILDRQD 801
            GS  GS                                G+LTPD + P SRD +++  Q 
Sbjct: 270  GSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQI 329

Query: 802  STISPLPSLADGHHYHEAVIDQRVSFELTVEDVVRCVE-------KGPKSLPKALQIVGS 960
            S ++ L + A+G    E ++D RVSFEL+ EDV  C+E       +     PK L   G 
Sbjct: 330  SEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGR 389

Query: 961  EGGDVSTNILADNID---KPPSEDGTGKSPSEADAEEGQRCQKQRSVSLGSIKE 1113
            +  D     L  + +   +  S +   K+  E  AEE    QK RSV+LGSIKE
Sbjct: 390  KERDGIKKDLESSCELFIRETSNETVEKASGE--AEEEHSYQKHRSVTLGSIKE 441


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  271 bits (694), Expect = 3e-70
 Identities = 170/396 (42%), Positives = 205/396 (51%), Gaps = 25/396 (6%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWGS  S+  CFGS  ++KRIGHAV V              +L            
Sbjct: 29   TVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFIA 88

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      T++S N YSP GP S+FA+GPYAHETQLVSPP+F
Sbjct: 89   PPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVF 148

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL----LDPRLQSGDVGQRFTPSQYEFQ 528
            STF TEPSTAPFTPPPESV LTTPSSPEVPFAQL    LD   ++    Q+ + S YEFQ
Sbjct: 149  STFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQ 208

Query: 529  MYQFYPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPS 708
             YQ YP SPVG LISP   IS SGTSSPFPD              +   PKLL  +   +
Sbjct: 209  PYQLYPESPVGHLISP---ISNSGTSSPFPDRR-----------PIVEAPKLLGFEHFST 254

Query: 709  RDWGSHQGSGTLTPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELT 888
            R WGS  GSG+LTPD   P SRDS++L+ Q S ++ L +   G    E VID RVSFEL 
Sbjct: 255  RRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELA 314

Query: 889  VEDVVRCVEKGPKSLPKALQIVGSEGGDVSTNILADNIDKPPSE---DGTGKS------- 1038
             EDV  CVEK P +  + +Q           N L D +++   E   DG  +S       
Sbjct: 315  GEDVAVCVEKKPVASAETVQ-----------NTLQDIVEEGEIERERDGISESTENCCEF 363

Query: 1039 -----------PSEADAEEGQRCQKQRSVSLGSIKE 1113
                        + A+ EE Q  +K   +  GSIKE
Sbjct: 364  CVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKE 399


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  268 bits (685), Expect = 3e-69
 Identities = 168/428 (39%), Positives = 210/428 (49%), Gaps = 57/428 (13%)
 Frame = +1

Query: 1    TSKKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXX 180
            T +KRRWG  WS+  CFGS    KRIGHAV                +             
Sbjct: 43   TVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVPFIA 101

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIF 360
                                      TS+S N YSPGGP SIFA+GPYAHETQLV+PP F
Sbjct: 102  PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAF 161

Query: 361  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQ----SGDVGQRFTPSQYEFQ 528
            S FTTEPSTAPFTPPPESV LTTPSSPEVPFAQLL   L+    +    Q+F  S YEFQ
Sbjct: 162  SAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQ 221

Query: 529  MYQFYPRSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPS 708
             Y  YP SP GQLISPGS IS SGTSSPFPD           + R+G  PKLL  +   +
Sbjct: 222  SYPLYPGSPGGQLISPGSVISNSGTSSPFPDR------YPILEFRMGEAPKLLGFEHFTT 275

Query: 709  RDWGSHQGSGT------------------------------------------------L 744
            R WGS  GSGT                                                L
Sbjct: 276  RKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSL 335

Query: 745  TPDAIQPKSRDSYILDRQDSTISPLPSLADGHHYHEAVIDQRVSFELTVEDVVRCVEKGP 924
            TPDA+ P SRD + L+ Q S ++ L +  +G    E ++D RVSFEL+ E+V RC+E   
Sbjct: 336  TPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKS 395

Query: 925  KSLPKALQIVGSEG---GDVSTNILADNIDKPPSEDGTGKSPSEADAE-EGQRC-QKQRS 1089
             +  +A      +      + +  +    +  P+ + +G++P +   E E + C +K RS
Sbjct: 396  LASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRKHRS 455

Query: 1090 VSLGSIKE 1113
            ++LGSIKE
Sbjct: 456  ITLGSIKE 463


>ref|XP_002333412.1| predicted protein [Populus trichocarpa]
          Length = 321

 Score =  265 bits (678), Expect = 2e-68
 Identities = 141/269 (52%), Positives = 170/269 (63%)
 Frame = +1

Query: 7   KKRRWGSFWSISCCFGSQNYNKRIGHAVFVXXXXXXXXXXXXXRSLGXXXXXXXXXXXXX 186
           +K+RW S WSI  CFG Q   ++IGHAV                +               
Sbjct: 38  QKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPP 97

Query: 187 XXXXXXXXXXXXXXXXXXXXXXXHTSISANLYSPGGPRSIFAVGPYAHETQLVSPPIFST 366
                                   TSISA++YSP GP SIFA+GPYAHETQLVSPP+FST
Sbjct: 98  SSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 367 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPRLQSGDVGQRFTPSQYEFQMYQFYP 546
           FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL+DP L++G  G RF    ++FQ YQF+P
Sbjct: 158 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRF---PFDFQSYQFHP 214

Query: 547 RSPVGQLISPGSAISGSGTSSPFPDHEYYTGGLHYRDIRLGVPPKLLNLQSLPSRDWGSH 726
            S VGQLISP S ISGSGTSSPFPD E+  GG H  + R+G  PKLLNL  L +R+WGS+
Sbjct: 215 GSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSY 272

Query: 727 QGSGTLTPDAIQPKSRDSYILDRQDSTIS 813
           Q SG LTPD+++  S  +++L RQ S ++
Sbjct: 273 QDSGALTPDSVRHGS-PNFLLHRQFSDVA 300


Top