BLASTX nr result

ID: Akebia26_contig00009967 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00009967
         (955 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   275   2e-71
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   275   2e-71
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   272   2e-70
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   268   2e-69
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   268   2e-69
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   264   4e-68
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   252   2e-64
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   249   1e-63
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   245   2e-62
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   244   5e-62
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   243   1e-61
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   242   1e-61
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   241   3e-61
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   241   4e-61
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   237   6e-60
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     233   6e-59
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   231   4e-58
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   231   4e-58
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   227   5e-57
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   226   1e-56

>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  275 bits (703), Expect = 2e-71
 Identities = 175/327 (53%), Positives = 195/327 (59%), Gaps = 10/327 (3%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSA---NTYSP-GPNS 170
            APA EN     +I               QS+P SSTQSP G LSL+A   N YSP GP S
Sbjct: 70   APASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPAS 129

Query: 171  IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDR 341
            +FAIGPYAHETQL                          LTTPSSPEVPFA+LLTSSLDR
Sbjct: 130  MFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDR 189

Query: 342  NCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPF 521
            + + SG  QK + S+YEFQ YQLYP SPVGHLISP   IS SGTSSPFPDR       P 
Sbjct: 190  SRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR------PI 240

Query: 522  LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANS 701
            +E      PKL  F+  STR+W    GSGSLTP D AGP SRDSFL+ENQISEVASLANS
Sbjct: 241  VE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQISEVASLANS 294

Query: 702  NNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGL 881
             +GSQN E VIDHRVSFEL  E+   CVEK+P +AS E    T  D      +  ERDG+
Sbjct: 295  ESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGEIERERDGI 353

Query: 882  SSEAENT---CVGETSSNVSGKAFGDG 953
            S   EN    CVGE     S KA  +G
Sbjct: 354  SESTENCCEFCVGEALKAASEKASAEG 380


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  275 bits (703), Expect = 2e-71
 Identities = 175/327 (53%), Positives = 195/327 (59%), Gaps = 10/327 (3%)
 Frame = +3

Query: 3   APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSA---NTYSP-GPNS 170
           APA EN     +I               QS+P SSTQSP G LSL+A   N YSP GP S
Sbjct: 7   APASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPAS 66

Query: 171 IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDR 341
           +FAIGPYAHETQL                          LTTPSSPEVPFA+LLTSSLDR
Sbjct: 67  MFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDR 126

Query: 342 NCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPF 521
           + + SG  QK + S+YEFQ YQLYP SPVGHLISP   IS SGTSSPFPDR       P 
Sbjct: 127 SRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR------PI 177

Query: 522 LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANS 701
           +E      PKL  F+  STR+W    GSGSLTP D AGP SRDSFL+ENQISEVASLANS
Sbjct: 178 VE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQISEVASLANS 231

Query: 702 NNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGL 881
            +GSQN E VIDHRVSFEL  E+   CVEK+P +AS E    T  D      +  ERDG+
Sbjct: 232 ESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGEIERERDGI 290

Query: 882 SSEAENT---CVGETSSNVSGKAFGDG 953
           S   EN    CVGE     S KA  +G
Sbjct: 291 SESTENCCEFCVGEALKAASEKASAEG 317


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  272 bits (695), Expect = 2e-70
 Identities = 169/322 (52%), Positives = 199/322 (61%), Gaps = 5/322 (1%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSANTYSP-GPNSIFA 179
            AP  EN I+ P+I               QSEP S+TQSP G  SL+A+ YSP GP SIFA
Sbjct: 77   APRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFA 136

Query: 180  IGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRNCK 350
            IGPYAHETQL                         HLTTPSSPEVPFA+LL    D + +
Sbjct: 137  IGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL----DPHFR 192

Query: 351  TSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEF 530
                 Q+F  SHYEFQSYQLYPGSPVG LISPSS IS SGTSSPFPD EF++ GH FLEF
Sbjct: 193  NGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEF 252

Query: 531  RTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNG 710
            RTG+PPKL + D LSTR W    GSGS+T PD A  TS D FL++ Q  EV     SNN 
Sbjct: 253  RTGDPPKLLNLDILSTRDWGSRLGSGSVT-PDGAKSTSSDGFLLKPQTPEVVLNPRSNNR 311

Query: 711  SQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSE 890
             +NN+I I+HRVSFEL++EE   CVEK+P +A  EA S TSL+ T  A    + D     
Sbjct: 312  GRNNDISINHRVSFELSSEEVIRCVEKKP-VALAEAVS-TSLEDTEKA--QSKEDPSKVV 367

Query: 891  AENTC-VGETSSNVSGKAFGDG 953
            + + C VGETS++ + KA  DG
Sbjct: 368  SSSICPVGETSNDAAEKAVADG 389


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  268 bits (685), Expect = 2e-69
 Identities = 166/353 (47%), Positives = 197/353 (55%), Gaps = 41/353 (11%)
 Frame = +3

Query: 15   ENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSP-GPNSIFAI 182
            EN   P  I               QS+P S+TQSP GLLSL   S N YSP GP SIFAI
Sbjct: 78   ENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAI 137

Query: 183  GPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDRNCKT 353
            GPYAHETQL                          LTTPSSPEVPFA+LLTSSL+R  + 
Sbjct: 138  GPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRN 197

Query: 354  SGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFR 533
            SG  QKF  SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR       P LEFR
Sbjct: 198  SGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR------RPILEFR 251

Query: 534  TGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------------------- 620
             GE PKL  F+  +TRKW    GSGSLTP                               
Sbjct: 252  MGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLT 311

Query: 621  PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPM 800
            PD  GP SRD FLV +QISEVA LAN  NG +N+E ++DHRVSFEL+ E+   C+E + +
Sbjct: 312  PDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL 371

Query: 801  MASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGD 950
            + S   ++V+   K  +A    ERDG+  + E++C   + ETS+    KA G+
Sbjct: 372  LPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGE 421


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  268 bits (685), Expect = 2e-69
 Identities = 166/353 (47%), Positives = 197/353 (55%), Gaps = 41/353 (11%)
 Frame = +3

Query: 15   ENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSP-GPNSIFAI 182
            EN   P  I               QS+P S+TQSP GLLSL   S N YSP GP SIFAI
Sbjct: 74   ENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAI 133

Query: 183  GPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDRNCKT 353
            GPYAHETQL                          LTTPSSPEVPFA+LLTSSL+R  + 
Sbjct: 134  GPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRN 193

Query: 354  SGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFR 533
            SG  QKF  SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR       P LEFR
Sbjct: 194  SGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR------RPILEFR 247

Query: 534  TGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------------------- 620
             GE PKL  F+  +TRKW    GSGSLTP                               
Sbjct: 248  MGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLT 307

Query: 621  PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPM 800
            PD  GP SRD FLV +QISEVA LAN  NG +N+E ++DHRVSFEL+ E+   C+E + +
Sbjct: 308  PDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL 367

Query: 801  MASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGD 950
            + S   ++V+   K  +A    ERDG+  + E++C   + ETS+    KA G+
Sbjct: 368  LPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGE 417


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  264 bits (674), Expect = 4e-68
 Identities = 164/345 (47%), Positives = 191/345 (55%), Gaps = 57/345 (16%)
 Frame = +3

Query: 90   SEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXX 257
            S+P S+TQSP G LSL   SAN YSPG P SIF+IGPYA+ETQL                
Sbjct: 98   SDPPSATQSPAGFLSLKSLSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTA 157

Query: 258  XXXXXXH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPV 428
                      LTTPSSPEVPFA+LLTSSLDRN + SG  QKF  SHYEFQ YQ YPGSP 
Sbjct: 158  PFTPPPESVQLTTPSSPEVPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPG 217

Query: 429  GHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSG 608
            G+LISP S +S SGTSSPFPDR      HP LEFR GE PKL+ FD  +TRKW    GSG
Sbjct: 218  GNLISPGSAVSNSGTSSPFPDR------HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSG 271

Query: 609  SLTP-----------------------------------------------PDPAGPTSR 647
            SLTP                                               PD  GP SR
Sbjct: 272  SLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASR 331

Query: 648  DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSV 827
            DSFL+ENQISEVASLANS +G Q  E V DHRVSFELT E+   C+  + + ++   ++ 
Sbjct: 332  DSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACCLANKAVASN---RTA 388

Query: 828  TSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGDG 953
            +   K   +    ERD LSS++ N C   V E+SS +     G+G
Sbjct: 389  SGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEG 433


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  252 bits (643), Expect = 2e-64
 Identities = 162/342 (47%), Positives = 186/342 (54%), Gaps = 54/342 (15%)
 Frame = +3

Query: 87   QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 254
            QS+P S+TQSP GLLSL   S N YSPG P SIFAIGPYAHETQL               
Sbjct: 111  QSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSAFTTEPST 170

Query: 255  XXXXXXXH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 425
                       LTTPSSPEVPFA+LLTSSL+R  + SG  QKF  SHYEFQSY LYPGSP
Sbjct: 171  APFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSYPLYPGSP 230

Query: 426  VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 605
             G LISP SVIS SGTSSPFPDR      +P LEFR GE PKL  F+  +TRKW    GS
Sbjct: 231  GGQLISPGSVISNSGTSSPFPDR------YPILEFRMGEAPKLLGFEHFTTRKWGSRLGS 284

Query: 606  GSLTP-----------------------------------------------PDPAGPTS 644
            G++TP                                               PD  GP S
Sbjct: 285  GTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDAVGPAS 344

Query: 645  RDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKS 824
            RD F +ENQISEVASLANS NGS+ +E ++DHRVSFEL+ EE   C+E +  +AS  A S
Sbjct: 345  RDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESK-SLASCRAFS 403

Query: 825  VTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGD 950
                D  ++A    +   +    EN   GETS     K  G+
Sbjct: 404  ECPPD--SMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGE 443


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  249 bits (636), Expect = 1e-63
 Identities = 164/344 (47%), Positives = 195/344 (56%), Gaps = 32/344 (9%)
 Frame = +3

Query: 6    PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLS---ANTYSPG-PNSI 173
            PA EN  + PTI               QSEP S+TQSP+GLLSL+   AN YSPG P SI
Sbjct: 77   PAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASI 136

Query: 174  FAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRN 344
            FAIGPYAHETQL                         HLTTPSSPEVPFA+L     D N
Sbjct: 137  FAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLF----DPN 192

Query: 345  CKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREF-SSGGHPF 521
             +      +F  S YEFQSYQLYPGSPVGHLISPSS IS SGTSSPFPDR+F  SG   F
Sbjct: 193  NRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQF 252

Query: 522  LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSR------------------ 647
            LEFR G PPKL + D LS  +W    GSGS+T PD  GP SR                  
Sbjct: 253  LEFRAGGPPKLLTLDKLSNHEWGSRIGSGSIT-PDALGPPSRDGSVLDRQVSDVIHPPSG 311

Query: 648  DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKE------PMMAS 809
            D  +++ QIS+VAS + S++G  NNEI++DHRVSFELTAE+   CVEK+       + AS
Sbjct: 312  DDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSAS 371

Query: 810  LEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKA 941
            L+  +   +D+ +   V      + SE     VGET++N   KA
Sbjct: 372  LQNPATVEIDENSREVV------VDSEGR---VGETANNPPEKA 406


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  245 bits (626), Expect = 2e-62
 Identities = 162/342 (47%), Positives = 186/342 (54%), Gaps = 58/342 (16%)
 Frame = +3

Query: 87   QSEPHSSTQSPTGLLSL---SANTYSP-GPNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 254
            QS+P SSTQSP GLLSL   SAN YSP GP SIFAIGPYAHETQL               
Sbjct: 104  QSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQLVTPPVFSAFTTEPST 163

Query: 255  XXXXXXXH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 425
                       LTTPSSPEVPFA+LLTSSL+R  + SGP QKF+ SHYEFQSY LYPGSP
Sbjct: 164  APFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSLSHYEFQSYHLYPGSP 223

Query: 426  VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 605
             G +ISP S IS SGTSSPFPDR      HP LEFR GE PKL  F+  STRKW    GS
Sbjct: 224  GGQIISPGSAISNSGTSSPFPDR------HPMLEFRMGEAPKLLGFEHFSTRKWGSRLGS 277

Query: 606  GSLTP---------------------------------PDPAG----------------P 638
            GSLTP                                 PD AG                P
Sbjct: 278  GSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTLTPDCFVP 337

Query: 639  TSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEA 818
             S+  FL+ENQISEVASL NS NGS+  E V+ HRVSFEL+ EE   C+E + + ++   
Sbjct: 338  ASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKSVAST--- 394

Query: 819  KSVTSLDKTTLATVTPERDGLSSEAENTCV--GETSSNVSGK 938
            ++     + T+       D L+   E  C+  GE SS +  K
Sbjct: 395  RTFPEYPQDTMPEDPVRGDRLAMNGER-CLQNGEASSEMPEK 435


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  244 bits (622), Expect = 5e-62
 Identities = 152/290 (52%), Positives = 184/290 (63%), Gaps = 10/290 (3%)
 Frame = +3

Query: 87  QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 254
           QSEP S+TQSP GL+SL   S N YSPG P+SIFAIGPYAHETQL               
Sbjct: 106 QSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPST 165

Query: 255 XXXXXXX---HLTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 425
                     HLTTPSSPEVPFA+LL  SL    +     QKF  S+YEFQSY L+PGSP
Sbjct: 166 APFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYHLHPGSP 221

Query: 426 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 605
           VG+LISPSS IS SGTSSPFPD EF++ G  F +F  G+PPKL + D LS R+W   QGS
Sbjct: 222 VGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGS 281

Query: 606 GSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCV 785
           G+LT PD  G T R+ F    QISEVA   +S NG + ++IV DHRVSFELT E+   CV
Sbjct: 282 GTLT-PDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIV-DHRVSFELTTEDVVRCV 339

Query: 786 EKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAEN---TCVGETSSN 926
           EK+P   + EA S +  + TT+     E++  S EAEN   +C GE +++
Sbjct: 340 EKKPTTLA-EAVSESLQNGTTV-----EKEESSGEAENVHHSCAGEAAND 383


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  243 bits (619), Expect = 1e-61
 Identities = 158/374 (42%), Positives = 192/374 (51%), Gaps = 57/374 (15%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSPG-PNS 170
            APA E       I               QS+P S+TQSP GLLSL   S N YSPG P S
Sbjct: 70   APAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVNAYSPGGPAS 129

Query: 171  IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDR 341
            +FAIGPYAHETQL                          LTTPSSPEVPFA+LLTSSL+R
Sbjct: 130  MFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLLTSSLER 189

Query: 342  NCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPF 521
              + SG  QK + SHY +Q YQLYPGSP G LISP SV+S SGTSSPFPDR      HP 
Sbjct: 190  ARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR------HPI 243

Query: 522  LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP--------------------------- 620
            L+F     PKL  F+  +TRKW    GSGS+TP                           
Sbjct: 244  LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303

Query: 621  --------------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDH 740
                                PD  GPTSRD F+ ENQISEVASLANS+NG++++E +IDH
Sbjct: 304  GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363

Query: 741  RVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAEN---TCVG 911
            RVSFEL+ EE   C+  +   ++   + V    +  +      RDG  +++EN    C  
Sbjct: 364  RVSFELSGEEVARCLANK---SAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPE 420

Query: 912  ETSSNVSGKAFGDG 953
            E+S+ +  K   DG
Sbjct: 421  ESSNRMPEKTMRDG 434


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  242 bits (618), Expect = 1e-61
 Identities = 158/374 (42%), Positives = 192/374 (51%), Gaps = 57/374 (15%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSPG-PNS 170
            APA E       I               QS+P S+TQSP GLLSL   S N YSPG P S
Sbjct: 70   APAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVNAYSPGGPAS 129

Query: 171  IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDR 341
            +FAIGPYAHETQL                          LTTPSSPEVPFA+LLTSSL+R
Sbjct: 130  MFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLLTSSLER 189

Query: 342  NCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPF 521
              + SG  QK + SHY +Q YQLYPGSP G LISP SV+S SGTSSPFPDR      HP 
Sbjct: 190  ARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR------HPI 243

Query: 522  LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP--------------------------- 620
            L+F     PKL  F+  +TRKW    GSGS+TP                           
Sbjct: 244  LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303

Query: 621  --------------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDH 740
                                PD  GPTSRD F+ ENQISEVASLANS+NG++++E +IDH
Sbjct: 304  GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363

Query: 741  RVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAEN---TCVG 911
            RVSFEL+ EE   C+  +   ++   + V    +  +      RDG  +++EN    C  
Sbjct: 364  RVSFELSGEEVARCLANK---SAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPE 420

Query: 912  ETSSNVSGKAFGDG 953
            E+S+ +  K   DG
Sbjct: 421  ESSNRMPEKTMRDG 434


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  241 bits (615), Expect = 3e-61
 Identities = 159/345 (46%), Positives = 186/345 (53%), Gaps = 34/345 (9%)
 Frame = +3

Query: 6    PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSA---NTYSPGPN-SI 173
            P  EN     TI                S+P S+TQSP GLLSL A   N YSPG   SI
Sbjct: 71   PVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASI 130

Query: 174  FAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRN 344
            FAIGPYAHETQL                         H+TTP SPEVPFA+LLTSSL RN
Sbjct: 131  FAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARN 190

Query: 345  CKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFL 524
             + SG   KF  S YEF  YQ  PGSP  +LISP SV+S SGTSSPFP      G  P +
Sbjct: 191  RRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP------GKCPII 243

Query: 525  EFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP---------------------------P 623
            EFR GEPPK   ++  STRKW    GSGS+TP                           P
Sbjct: 244  EFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303

Query: 624  DPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMM 803
            +   P SRDS+L+ENQISEVASLANS+NGS+  E VIDHRVSFELT E+ PSC EKEP+M
Sbjct: 304  NGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVM 363

Query: 804  ASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGK 938
            +   ++    +D + L   +  R G S   E T      ++ SG+
Sbjct: 364  S--HSQPTLPMDVSNL-LASEMRSGSSMAEEKTYGSPRKASESGE 405


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  241 bits (614), Expect = 4e-61
 Identities = 151/290 (52%), Positives = 183/290 (63%), Gaps = 10/290 (3%)
 Frame = +3

Query: 87  QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 254
           QSEP S+TQSP GL+SL   S N YSPG P+SIFAIGPYAHETQL               
Sbjct: 106 QSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPST 165

Query: 255 XXXXXXX---HLTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 425
                     HLTTPSSPEVPFA+LL  SL    +     QKF  S+YEFQSY L+PGSP
Sbjct: 166 APFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYHLHPGSP 221

Query: 426 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 605
           VG+LISPSS IS SGTSSPFPD EF++ G  F +F  G+PPKL + D LS R+W   QGS
Sbjct: 222 VGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGS 281

Query: 606 GSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCV 785
           G+LT PD    T R+ F    QISEVA   +S NG + ++IV DHRVSFELT E+   CV
Sbjct: 282 GTLT-PDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIV-DHRVSFELTTEDVVRCV 339

Query: 786 EKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAEN---TCVGETSSN 926
           EK+P   + EA S +  + TT+     E++  S EAEN   +C GE +++
Sbjct: 340 EKKPTTLA-EAVSESLQNGTTV-----EKEESSGEAENVHHSCAGEAAND 383


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  237 bits (604), Expect = 6e-60
 Identities = 158/345 (45%), Positives = 184/345 (53%), Gaps = 34/345 (9%)
 Frame = +3

Query: 6    PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSPGPN-SI 173
            P  EN     TI                S+P S+TQSP GLLSL   S N YSPG   SI
Sbjct: 71   PVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASI 130

Query: 174  FAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRN 344
            FAIGPYAHETQL                         H+TTP SPEVPFA+LLTSSL RN
Sbjct: 131  FAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARN 190

Query: 345  CKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFL 524
             + SG   KF  S YEF  YQ  PGSP  +LISP SV+S SGTSSPFP      G  P +
Sbjct: 191  RRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP------GKCPII 243

Query: 525  EFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP---------------------------P 623
            EFR GEPPK   ++  STRKW    GSGSLTP                           P
Sbjct: 244  EFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303

Query: 624  DPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMM 803
            +   P SRDS+L+E QISEVASLANS+NGS+  E VIDHRVSFELT E+ PSC EKEP+M
Sbjct: 304  NGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVM 363

Query: 804  ASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGK 938
            +   ++    +D + L      + G S   E T      ++ SG+
Sbjct: 364  S--HSQQTLPMDVSNL-LANEMKSGSSMAEEKTYGSPRKASESGE 405


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  233 bits (595), Expect = 6e-59
 Identities = 165/383 (43%), Positives = 195/383 (50%), Gaps = 76/383 (19%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSPG-PNS 170
            APAPEN      I               QS+P S+TQSP GLLSL   S N YSPG P S
Sbjct: 70   APAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSINAYSPGGPTS 129

Query: 171  IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXXH---LTTPSSPEVPFARLLTSSLDR 341
            IFAIGPYA+ETQL                          LTTPSSPEVPFA+LLTSSLDR
Sbjct: 130  IFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDR 189

Query: 342  NCK-TSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHP 518
              + +SG  QKF+ SH EFQ YQLYPGSP G+LISP SV+S SGTSSPFPD+      HP
Sbjct: 190  TRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDK------HP 243

Query: 519  FLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP---------------PDPAGPTSR-- 647
             L FR GE P+L  F+  +T KW    GSGSLTP               PD  G  SR  
Sbjct: 244  ILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLGSRLG 303

Query: 648  --------------------------------------DSFLV--------ENQISEVAS 689
                                                  D FLV        ENQISEVAS
Sbjct: 304  SGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVAS 363

Query: 690  LANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPE 869
            LANS+NG QN+  V+DHRVSFELT E+   C+  +   AS   ++ +   + + A    +
Sbjct: 364  LANSDNGCQNDGSVVDHRVSFELTGEDVARCLASK--SASSNGRTTSESLEDSPAECPTK 421

Query: 870  RDGLS-----SEAENTCVGETSS 923
            +DG+S     S  + +CV ETS+
Sbjct: 422  KDGISANNVDSPNDQSCVEETSN 444


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 422

 Score =  231 bits (588), Expect = 4e-58
 Identities = 152/317 (47%), Positives = 179/317 (56%), Gaps = 3/317 (0%)
 Frame = +3

Query: 6   PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSANTYSPGPNSIFAIG 185
           P  EN  +  +I               QSEP S+ QSP    SLSA+ YSPGP+SIFAIG
Sbjct: 42  PRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPGPSSIFAIG 101

Query: 186 PYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRNCKTS 356
           PYAHETQL                         HLT PSSPEVPFA+LL    D N +  
Sbjct: 102 PYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSNFRFG 157

Query: 357 GPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRT 536
              Q++  SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGGH FLEFRT
Sbjct: 158 EGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRT 217

Query: 537 GEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQ 716
           GE PK+ + D L TR W     SGS+T PD A  TS + F ++    E    A SN+  +
Sbjct: 218 GEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLNARSNSRRR 276

Query: 717 NNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE 896
           N+   I HRVSFEL+AEE   CVEK+P +A  EA S TSL     A      +   S + 
Sbjct: 277 NDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGPNQEVSSSH 334

Query: 897 NTCVGETSSNVSGKAFG 947
              V +TS++ S KA G
Sbjct: 335 ECPVVDTSNDSSEKAVG 351


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  231 bits (588), Expect = 4e-58
 Identities = 152/317 (47%), Positives = 179/317 (56%), Gaps = 3/317 (0%)
 Frame = +3

Query: 6    PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSLSANTYSPGPNSIFAIG 185
            P  EN  +  +I               QSEP S+ QSP    SLSA+ YSPGP+SIFAIG
Sbjct: 79   PRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPGPSSIFAIG 138

Query: 186  PYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRNCKTS 356
            PYAHETQL                         HLT PSSPEVPFA+LL    D N +  
Sbjct: 139  PYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSNFRFG 194

Query: 357  GPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRT 536
               Q++  SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGGH FLEFRT
Sbjct: 195  EGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRT 254

Query: 537  GEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQ 716
            GE PK+ + D L TR W     SGS+T PD A  TS + F ++    E    A SN+  +
Sbjct: 255  GEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLNARSNSRRR 313

Query: 717  NNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE 896
            N+   I HRVSFEL+AEE   CVEK+P +A  EA S TSL     A      +   S + 
Sbjct: 314  NDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGPNQEVSSSH 371

Query: 897  NTCVGETSSNVSGKAFG 947
               V +TS++ S KA G
Sbjct: 372  ECPVVDTSNDSSEKAVG 388


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  227 bits (579), Expect = 5e-57
 Identities = 148/322 (45%), Positives = 180/322 (55%), Gaps = 7/322 (2%)
 Frame = +3

Query: 6    PAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSPGPNSIF 176
            PA EN  + P I                SEP S+TQSP GL+SL   SA+ YSPGP SIF
Sbjct: 78   PAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPGPASIF 137

Query: 177  AIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDRNC 347
            AIGPYAHETQL                         HLTTPSSPEVPFA+LL  +L    
Sbjct: 138  AIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNL---- 193

Query: 348  KTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLE 527
            +     Q+F  SHYEFQSYQL+PGSPVG LISPSS IS SGTSSPF D EF++  H F E
Sbjct: 194  QYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLH-FPE 252

Query: 528  FRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLAN-SN 704
            FR G+PPKL + D  S+ +W  H GSG+LT PD    T R+ FL+++QISE+ S  +  N
Sbjct: 253  FRMGDPPKLLNLDKHSSCEWGSHHGSGTLT-PDATRSTPRNGFLLDHQISEITSHPHLKN 311

Query: 705  NGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLS 884
               QN+++  +HRVSFELT EE    +E E    S        ++ T     + E D   
Sbjct: 312  KEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEAT---RESEEHDTKV 368

Query: 885  SEAENTCVGETSSNVSGKAFGD 950
             +     VGETS+    KA  D
Sbjct: 369  VDDYECRVGETSNERPEKALAD 390


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  226 bits (576), Expect = 1e-56
 Identities = 150/324 (46%), Positives = 182/324 (56%), Gaps = 7/324 (2%)
 Frame = +3

Query: 3    APAPENAIRPPTIXXXXXXXXXXXXXXXQSEPHSSTQSPTGLLSL---SANTYSP-GPNS 170
            APA EN  + P +               QSEP S TQSP GL+SL   SA+ YSP GP S
Sbjct: 76   APASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPAS 135

Query: 171  IFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXX---HLTTPSSPEVPFARLLTSSLDR 341
            IFAIGPYAHETQL                         HLTTPSSPEVPFA+ L  SL R
Sbjct: 136  IFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSL-R 194

Query: 342  NCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPF 521
            N  T     +F    ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG  F
Sbjct: 195  NGDTG---LRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHF 248

Query: 522  LEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANS 701
             EFR GEPPKL + D LST +W  +QGSG+LTP          +FL+  Q S+V S   S
Sbjct: 249  PEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSRPRS 306

Query: 702  NNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGL 881
             NG +N + V++HRVSFELTAE+   CVE++P   +   K+V    +        +  G 
Sbjct: 307  GNGHKNGQ-VVNHRVSFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKNSGE 362

Query: 882  SSEAENTCVGETSSNVSGKAFGDG 953
            S ++    VG TS++    A  DG
Sbjct: 363  SIQSFECRVGVTSNDSPEMASTDG 386


Top