BLASTX nr result

ID: Akebia23_contig00019343

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00019343
         (2052 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002528710.1| DNA binding protein, putative [Ricinus commu...   436   e-119
ref|XP_006374055.1| hypothetical protein POPTR_0016s14630g [Popu...   420   e-114
gb|EXB74412.1| hypothetical protein L484_004233 [Morus notabilis]     410   e-111
ref|XP_004302521.1| PREDICTED: uncharacterized protein LOC101300...   409   e-111
ref|XP_007033486.1| Uncharacterized protein isoform 4 [Theobroma...   322   3e-85
ref|XP_007033485.1| Uncharacterized protein isoform 3 [Theobroma...   322   3e-85
ref|XP_007033484.1| Uncharacterized protein isoform 2, partial [...   322   3e-85
ref|XP_002266425.1| PREDICTED: uncharacterized protein LOC100267...   321   9e-85
ref|XP_006481235.1| PREDICTED: putative GPI-anchored protein PB1...   318   5e-84
ref|XP_006429633.1| hypothetical protein CICLE_v100114702mg, par...   318   5e-84
ref|XP_006381258.1| hypothetical protein POPTR_0006s11130g [Popu...   313   2e-82
ref|XP_007208445.1| hypothetical protein PRUPE_ppa003389mg [Prun...   312   3e-82
ref|XP_007033483.1| Uncharacterized protein isoform 1 [Theobroma...   309   4e-81
gb|EYU42049.1| hypothetical protein MIMGU_mgv1a003860mg [Mimulus...   308   8e-81
ref|XP_004170022.1| PREDICTED: uncharacterized protein LOC101225...   278   5e-72
ref|XP_004142729.1| PREDICTED: uncharacterized protein LOC101206...   278   5e-72
ref|XP_006353835.1| PREDICTED: putative GPI-anchored protein PB1...   278   7e-72
ref|XP_007140052.1| hypothetical protein PHAVU_008G080400g [Phas...   277   2e-71
ref|XP_006602722.1| PREDICTED: flocculation protein FLO11-like [...   272   4e-70
ref|XP_003534630.1| PREDICTED: flocculation protein FLO11-like [...   272   5e-70

>ref|XP_002528710.1| DNA binding protein, putative [Ricinus communis]
            gi|223531882|gb|EEF33699.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 580

 Score =  436 bits (1121), Expect = e-119
 Identities = 272/571 (47%), Positives = 326/571 (57%), Gaps = 18/571 (3%)
 Frame = -2

Query: 1967 KESLSGGRNI-------QHRRGRILTRGGMSKDR--DENLDLFSRNRRSLSITSSDES-D 1818
            ++SL G RN         HRRG  L  G  SKD   DENLDLFS+NRRSLS+ SSDES D
Sbjct: 15   RDSLIGARNFPAGTGSFSHRRGHSLN-GFSSKDTTTDENLDLFSKNRRSLSVASSDESSD 73

Query: 1817 VSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPEPXXXX 1638
            VS KLGR+SVGS K+A+SG+DDLLS+ DGGKHDYDWLLTPPGTPLFP+S+ S+ +P    
Sbjct: 74   VSMKLGRVSVGSAKVAKSGIDDLLSSTDGGKHDYDWLLTPPGTPLFPTSDGSDSQPTLVA 133

Query: 1637 XXXXXXXXXXXXXXXXXXSMTQXXXXXXXXXXXXXXXXXXXXXTMNYNTHSSISNRSPXX 1458
                              S++Q                        Y+T+SS  NRS   
Sbjct: 134  PRSRSLSRSVSTTKASRLSVSQSESQHSSRPTRSSSVTRSSISNSQYSTYSS--NRSSSI 191

Query: 1457 XXXXXXXXXXXXXXSTPGAXXXXXXXXXXXXXXXXXXXXXPLTPVRSRPTPITSSVEKTR 1278
                          S+P                         TP R RP P +S V+K R
Sbjct: 192  LNTSSASVSSYTRPSSP--ITRSPSTARPSTPSSRPTASRASTPSRVRPAPTSSLVDKNR 249

Query: 1277 ASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGRILTNG 1098
             SQ+SRPSTP SR+Q+                                    AGR+ +NG
Sbjct: 250  QSQSSRPSTPSSRAQLPANSNSTSTRSNSRPSTPTQRNPVSSVSPASGPSISAGRVPSNG 309

Query: 1097 RIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPDRAVSAGRSRPNATTT--- 927
            RI + ASR SSPGP+ RP  Q +VPPDFP+DTPPNLRTTLPDR +SAGRSRP A+TT   
Sbjct: 310  RISAPASRPSSPGPRIRPSQQPVVPPDFPLDTPPNLRTTLPDRPISAGRSRPGASTTIKG 369

Query: 926  ----PGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIAATEQQKTTPISELAMRRP 759
                 G  N V R+ SSPIV+RGRL E  GKGR H+NG     +E +K + +S+  MR+P
Sbjct: 370  SPETTGATN-VPRRHSSPIVSRGRLAEAPGKGRAHSNGHAADISEPRKVSHVSDPGMRKP 428

Query: 758  AKP-VTTTESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLSGTGLFPQSIRSGTLKGQP 582
             K  VTTT++ GFGRTISKKSLDMA+RHMD R G+GS R LS T LFPQSIR+ T K Q 
Sbjct: 429  VKSSVTTTDNNGFGRTISKKSLDMAIRHMDIRTGNGSTRALSSTTLFPQSIRTAT-KAQ- 486

Query: 581  VKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDGETEKDGYLSAKVNEADIYESSR 402
                    SV     P +N+   +L N  + +   E G    DG  SAK++E DIYESSR
Sbjct: 487  --------SVRPMNAPESNNNGGILENGHHVSRPVEYGSEVNDGRYSAKLSEVDIYESSR 538

Query: 401  YDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            YDA+LLKEDLKNTNWLHSIDDKSDQG IFD+
Sbjct: 539  YDALLLKEDLKNTNWLHSIDDKSDQGSIFDN 569


>ref|XP_006374055.1| hypothetical protein POPTR_0016s14630g [Populus trichocarpa]
            gi|550321518|gb|ERP51852.1| hypothetical protein
            POPTR_0016s14630g [Populus trichocarpa]
          Length = 596

 Score =  420 bits (1080), Expect = e-114
 Identities = 279/581 (48%), Positives = 332/581 (57%), Gaps = 20/581 (3%)
 Frame = -2

Query: 1991 FDSMNRMSKESLSGGRNIQ-----HRRGRILTRGGM-SKD---RDENLDLFSRNRRSLSI 1839
            + S+N   +ESL GGRNI      HRRG  LT GG+ SKD   +DENLDLFS+NRRSLS+
Sbjct: 14   YRSVNGSLRESLVGGRNIPVGSQYHRRGHSLTGGGVFSKDNHNKDENLDLFSKNRRSLSV 73

Query: 1838 TSSDES-DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNAS 1662
             SSDES DVS KLGR+SVGS KL RSG+DDLLS+ +GGKHDYDWLLTPPGTPLFPSS  S
Sbjct: 74   ASSDESSDVSVKLGRLSVGSAKLVRSGIDDLLSS-EGGKHDYDWLLTPPGTPLFPSSEGS 132

Query: 1661 EPEP-XXXXXXXXXXXXXXXXXXXXXXSMTQXXXXXXXXXXXXXXXXXXXXXTMNYNTHS 1485
            E +P                       S++Q                     +  Y+T+S
Sbjct: 133  ESQPTLVAPRSSSLARSASTTKAASTLSVSQSESYHSSRPARSSSVTRPSISSSQYSTYS 192

Query: 1484 SISNRSPXXXXXXXXXXXXXXXXSTPGAXXXXXXXXXXXXXXXXXXXXXPLTPVRSRPTP 1305
              SNRS                 S+P +                       +  R  PT 
Sbjct: 193  --SNRSSSILNTSSASVSSYTRPSSPVSRTPSIARPSTPSARPTPSRSSTPSRARPAPT- 249

Query: 1304 ITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1125
             +SS++KTR SQNSRPSTP SR QI                                   
Sbjct: 250  -SSSIDKTRPSQNSRPSTPSSRGQIPANLSTAPTRSNSRPSTPTRRNPAPSSSTASSPST 308

Query: 1124 XAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPDRAVSAGRSR 945
             AGR+L+N RIP   SR +SP P+ R P Q +VPPDFP+DTPPNLRTTLPDR +SAGRSR
Sbjct: 309  SAGRVLSNNRIPGPTSRPNSPSPRVR-PQQPVVPPDFPLDTPPNLRTTLPDRPLSAGRSR 367

Query: 944  PNATTT----PGPVNPV--RRQSSSPIVTRGRLPELSGKGRLHANGQDIAATEQQKTTPI 783
            PN   T    P  V  V   R+ SSPIV+RGRL E SGKGR+H+NG    A E +K + +
Sbjct: 368  PNVHATMKGNPETVGSVIAPRRHSSPIVSRGRLTEPSGKGRVHSNGHIADAPEPRKVSHV 427

Query: 782  SELAMRRPAKPVTT-TESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLSGTGLFPQSIR 606
            SEL MR+P K  +T +ESTGFGRTISKKSLDMA+RHMD R+G+GS R LS T LFPQSIR
Sbjct: 428  SELGMRKPVKSSSTASESTGFGRTISKKSLDMAIRHMDLRNGTGSTRSLSSTTLFPQSIR 487

Query: 605  SGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDGETEKDGY-LSAKVN 429
            S T K    ++     S+ +NG   N     VL N    +  +E      DG   SAK++
Sbjct: 488  SATPKTHSARSRSAPESI-NNGNLQNGD---VLENESYFSRATEIRREANDGQRYSAKLS 543

Query: 428  E-ADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            E  DI ESSRYDAILLKEDLKNT+WLH IDDKSDQG  FD+
Sbjct: 544  EVVDICESSRYDAILLKEDLKNTDWLHGIDDKSDQGPFFDN 584


>gb|EXB74412.1| hypothetical protein L484_004233 [Morus notabilis]
          Length = 574

 Score =  410 bits (1053), Expect = e-111
 Identities = 251/556 (45%), Positives = 319/556 (57%), Gaps = 13/556 (2%)
 Frame = -2

Query: 1937 QHRRGRILTRGGMSKDR---DENLDLFSRNRRSLSITSSDES-DVSTKLGRISVGSEKLA 1770
            QHRRG  L   G +KD    + NLDLFS+NRRSLS+TSSDES DVS KLGR+SVGS K++
Sbjct: 14   QHRRGHSLNLAGGNKDAVTDENNLDLFSKNRRSLSVTSSDESSDVSVKLGRLSVGSAKVS 73

Query: 1769 RSGMDDLLSTVDGGKHDYDWLLTPPGTP-LFPSSNASEPEPXXXXXXXXXXXXXXXXXXX 1593
            RSG+DDLLS+ DGGKHDYDWLLTPPGTP +FPSS  +EP+                    
Sbjct: 74   RSGIDDLLSSTDGGKHDYDWLLTPPGTPTIFPSSEGNEPQRTIAAPRSSSLARSASTTKA 133

Query: 1592 XXXSMTQXXXXXXXXXXXXXXXXXXXXXTMNYNTHSSISNRSPXXXXXXXXXXXXXXXXS 1413
               S++Q                     T  +NT+S  SNRS                 +
Sbjct: 134  SRLSVSQSETNHSSRPTRSSSVTRSSTSTSLHNTYS--SNRSSNILNTSSASVSSYTRPA 191

Query: 1412 TPGAXXXXXXXXXXXXXXXXXXXXXPLTPVRSRPTPITSSVEKTRASQNSRPSTPISRSQ 1233
            +P                         +     PT  +SS +++R  Q+SRPSTP SR Q
Sbjct: 192  SPITRSSSTARPSTPSSRPTLSRPSTPSRAHPSPT--SSSADRSRPIQSSRPSTPSSRPQ 249

Query: 1232 IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGRILTNGRIPSTASRGSSPGPK 1053
            I                                     GR+L+NGR P+++SR SSP P+
Sbjct: 250  IPANLSSPAARSNSRPSTPTRRSPVSTISPAASPSISNGRVLSNGRNPTSSSRPSSPSPR 309

Query: 1052 ARPPPQRIVPPDFPIDTPPNLRTTLPDRAVSAGRSRPNATTT------PGPVNPVRRQSS 891
             RPPPQ +VPPDFP+DTPPNLRTTLPDR +SAGRSRP +T T              R+ S
Sbjct: 310  IRPPPQPVVPPDFPLDTPPNLRTTLPDRPLSAGRSRPGSTVTMKGNSETTTTANTSRRHS 369

Query: 890  SPIVTRGRLPELSGKGRLHANGQDIAATEQQKTTPISELAMRRPAK-PVTTTESTGFGRT 714
            SPIVTRGRL E +G+GRL  NG    A E +K +   +L MR+P K  + + ++ GFGRT
Sbjct: 370  SPIVTRGRLTEPAGRGRLQGNGHYTDA-EPRKASHAPDLTMRKPVKASIASLDNGGFGRT 428

Query: 713  ISKKSLDMALRHMDFRHGSGSIRPLSGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRP 534
            ISKKSLDMA+RHMD R G G++RPL+GT LFPQSIRS + K Q +++S    S+ + G  
Sbjct: 429  ISKKSLDMAIRHMDIRSGGGNVRPLAGTTLFPQSIRSASSKTQSIRSSSAPSSIINGGLQ 488

Query: 533  SNNSKIAVLANNGNCNGR-SEDGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNW 357
            ++ +   ++++NGN   R +E+G     G   AK+NE DIYESSRYD +LLKEDLKNTNW
Sbjct: 489  TSYN--GIISDNGNAIDRPAENGIGADGGRHFAKLNEVDIYESSRYDTLLLKEDLKNTNW 546

Query: 356  LHSIDDKSDQGLIFDH 309
            LHSIDDK+D G IFD+
Sbjct: 547  LHSIDDKTDHGPIFDN 562


>ref|XP_004302521.1| PREDICTED: uncharacterized protein LOC101300547 [Fragaria vesca
            subsp. vesca]
          Length = 583

 Score =  409 bits (1051), Expect = e-111
 Identities = 261/577 (45%), Positives = 323/577 (55%), Gaps = 19/577 (3%)
 Frame = -2

Query: 1982 MNRMSKESLSGGRNIQ-HRRGRILTRGGMSK-------DRDENLDLFSRNRRSLSITSSD 1827
            MNR ++ESL GGRN Q HRRG  L    +S        D   +LDLFS++RR+LS+ SSD
Sbjct: 2    MNRNARESLIGGRNFQNHRRGGSLNLPVLSSSMKQEHHDESSSLDLFSKSRRTLSVASSD 61

Query: 1826 ES-DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPEP 1650
            ES DVS KLGR+SVGS K+ R+G+DDLLS+ DGGKHDYDWLLTPP TPLFPSS+ SE +P
Sbjct: 62   ESSDVSVKLGRLSVGSGKVGRTGIDDLLSSADGGKHDYDWLLTPPETPLFPSSDGSESQP 121

Query: 1649 XXXXXXXXXXXXXXXXXXXXXXSMTQXXXXXXXXXXXXXXXXXXXXXTMNYNTHSSISNR 1470
                                  S++Q                     +  YN +SS  NR
Sbjct: 122  TLAAARGSALIRSTSSAKPSRLSVSQSESNHSSRPARSSSVTRSSISSSQYNNYSS--NR 179

Query: 1469 SPXXXXXXXXXXXXXXXXSTPGAXXXXXXXXXXXXXXXXXXXXXPLTPVRSRPTPITSSV 1290
            +                 S+P                       P TP R+R  P +SS+
Sbjct: 180  NSNFLNTSSASVSSYSRPSSP--ITRSPSTARPSTPTSRPSLSRPSTPSRARSVPASSSI 237

Query: 1289 EKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGRI 1110
            E+ R+  +SRPSTP SR QI                                    AGR 
Sbjct: 238  ERPRSVASSRPSTPSSRPQIPANLSSPAARTPSRPSTPTRRHSLPSLSPASSPSPSAGR- 296

Query: 1109 LTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPDRAVSAGRSRPNATT 930
            L+NGR P+  SR SSP P+ RPPPQ IVP DFP+DTPPNLRTTLPDR +SAGRSRP A  
Sbjct: 297  LSNGRNPAPTSRPSSPSPRVRPPPQPIVPHDFPLDTPPNLRTTLPDRPISAGRSRPGAAV 356

Query: 929  -------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIAATEQQKTTPISELA 771
                   TP  V   RRQ SSP+V+RGRL +  G+ R+ +NG      E +K   + +L 
Sbjct: 357  VVKGKLETPAAVVVPRRQ-SSPVVSRGRLTDPPGRSRVLSNGHH-DVPELRKPQHLPDLG 414

Query: 770  MRRPAKPVTTT--ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLSGTGLFPQSIRSGT 597
            MR+P K  +TT  E+TGFGR ISKKSLDMA+RHMD ++G+G+ R LSG+ LFPQSIRSGT
Sbjct: 415  MRKPVKTSSTTAPENTGFGRNISKKSLDMAIRHMDIKNGTGNSRQLSGSTLFPQSIRSGT 474

Query: 596  LKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDG-ETEKDGYLSAKVNEAD 420
             K Q V+      SV+ NG   +     V  N  N +   ++G E    G  S+K+ + D
Sbjct: 475  PKTQTVRTLSSSASVNMNGGLQSRGNGFVYENGNNMSKPVQNGTEANGGGRYSSKLTDVD 534

Query: 419  IYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            IYESSRYDAILLKEDLKNTNWLHS+DDK D+G IFD+
Sbjct: 535  IYESSRYDAILLKEDLKNTNWLHSLDDKLDEGPIFDN 571


>ref|XP_007033486.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508712515|gb|EOY04412.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 442

 Score =  322 bits (826), Expect = 3e-85
 Identities = 186/351 (52%), Positives = 225/351 (64%), Gaps = 10/351 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP + RP+  +S ++K+R SQ+SRPSTP SR QI                          
Sbjct: 82   TPSKVRPSSTSSYIDKSRPSQSSRPSTPSSRPQIPANLNSTAVRSNSRPSTPTRRNPIPS 141

Query: 1151 XXXXXXXXXXA-GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                      + GR L+NGR  + ASR SSPGP+ RPP Q +VPPDFP+DTPPNLRTTLP
Sbjct: 142  LSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPNLRTTLP 201

Query: 974  DRAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDI 816
            DR VSAGRSRP  +        T   VN  RR SS PIVTRGRL E  G+ R+H+NG   
Sbjct: 202  DRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRHSS-PIVTRGRLTEPPGRTRVHSNGHAS 260

Query: 815  AATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPL 639
               E +KT+ +++ AMR+P K  TTT +S GFGRTISKKSLDMA+RHMD R+G+GSIR L
Sbjct: 261  DIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGTGSIRSL 320

Query: 638  SGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRS-EDGET 462
            SGT LFPQSIRS T + Q +++     SV+SNG P +       + NGN   R  ++G  
Sbjct: 321  SGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPGSLQN-GDFSENGNSISRPVQNGSD 379

Query: 461  EKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
              DG  SAK +E DIYESSRYDAILLKEDLKNTNWLHSIDDKSD G IF++
Sbjct: 380  SHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFEN 430


>ref|XP_007033485.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508712514|gb|EOY04411.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 578

 Score =  322 bits (826), Expect = 3e-85
 Identities = 186/351 (52%), Positives = 225/351 (64%), Gaps = 10/351 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP + RP+  +S ++K+R SQ+SRPSTP SR QI                          
Sbjct: 218  TPSKVRPSSTSSYIDKSRPSQSSRPSTPSSRPQIPANLNSTAVRSNSRPSTPTRRNPIPS 277

Query: 1151 XXXXXXXXXXA-GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                      + GR L+NGR  + ASR SSPGP+ RPP Q +VPPDFP+DTPPNLRTTLP
Sbjct: 278  LSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPNLRTTLP 337

Query: 974  DRAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDI 816
            DR VSAGRSRP  +        T   VN  RR SS PIVTRGRL E  G+ R+H+NG   
Sbjct: 338  DRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRHSS-PIVTRGRLTEPPGRTRVHSNGHAS 396

Query: 815  AATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPL 639
               E +KT+ +++ AMR+P K  TTT +S GFGRTISKKSLDMA+RHMD R+G+GSIR L
Sbjct: 397  DIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGTGSIRSL 456

Query: 638  SGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRS-EDGET 462
            SGT LFPQSIRS T + Q +++     SV+SNG P +       + NGN   R  ++G  
Sbjct: 457  SGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPGSLQN-GDFSENGNSISRPVQNGSD 515

Query: 461  EKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
              DG  SAK +E DIYESSRYDAILLKEDLKNTNWLHSIDDKSD G IF++
Sbjct: 516  SHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFEN 566



 Score =  140 bits (354), Expect = 2e-30
 Identities = 74/119 (62%), Positives = 87/119 (73%), Gaps = 9/119 (7%)
 Frame = -2

Query: 1982 MNRMSKESLSGG--RNIQ-------HRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSS 1830
            MNR  +ESL GG   NI        HRRG+ LT G   +D DENLDLFS+NRRSLS+ SS
Sbjct: 2    MNRNLRESLVGGGRNNINVLAASHHHRRGQSLTGGLFPRDSDENLDLFSKNRRSLSVASS 61

Query: 1829 DESDVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPE 1653
            DES    KLGR+S+GS ++ + G+DDLLS+ DGGKHDYDWLLTPPGTPLFPSS  SE +
Sbjct: 62   DESS-DVKLGRLSLGSARVGKGGLDDLLSSTDGGKHDYDWLLTPPGTPLFPSSEGSESQ 119


>ref|XP_007033484.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508712513|gb|EOY04410.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 522

 Score =  322 bits (826), Expect = 3e-85
 Identities = 186/351 (52%), Positives = 225/351 (64%), Gaps = 10/351 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP + RP+  +S ++K+R SQ+SRPSTP SR QI                          
Sbjct: 162  TPSKVRPSSTSSYIDKSRPSQSSRPSTPSSRPQIPANLNSTAVRSNSRPSTPTRRNPIPS 221

Query: 1151 XXXXXXXXXXA-GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                      + GR L+NGR  + ASR SSPGP+ RPP Q +VPPDFP+DTPPNLRTTLP
Sbjct: 222  LSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPNLRTTLP 281

Query: 974  DRAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDI 816
            DR VSAGRSRP  +        T   VN  RR SS PIVTRGRL E  G+ R+H+NG   
Sbjct: 282  DRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRHSS-PIVTRGRLTEPPGRTRVHSNGHAS 340

Query: 815  AATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPL 639
               E +KT+ +++ AMR+P K  TTT +S GFGRTISKKSLDMA+RHMD R+G+GSIR L
Sbjct: 341  DIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGTGSIRSL 400

Query: 638  SGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRS-EDGET 462
            SGT LFPQSIRS T + Q +++     SV+SNG P +       + NGN   R  ++G  
Sbjct: 401  SGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPGSLQN-GDFSENGNSISRPVQNGSD 459

Query: 461  EKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
              DG  SAK +E DIYESSRYDAILLKEDLKNTNWLHSIDDKSD G IF++
Sbjct: 460  SHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFEN 510



 Score = 95.9 bits (237), Expect = 6e-17
 Identities = 44/62 (70%), Positives = 53/62 (85%), Gaps = 1/62 (1%)
 Frame = -2

Query: 1835 SSDES-DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASE 1659
            SSDES DV+ KLGR+S+GS ++ + G+DDLLS+ DGGKHDYDWLLTPPGTPLFPSS  SE
Sbjct: 2    SSDESSDVAVKLGRLSLGSARVGKGGLDDLLSSTDGGKHDYDWLLTPPGTPLFPSSEGSE 61

Query: 1658 PE 1653
             +
Sbjct: 62   SQ 63


>ref|XP_002266425.1| PREDICTED: uncharacterized protein LOC100267210 [Vitis vinifera]
            gi|147841364|emb|CAN71240.1| hypothetical protein
            VITISV_034160 [Vitis vinifera]
            gi|296085846|emb|CBI31170.3| unnamed protein product
            [Vitis vinifera]
          Length = 570

 Score =  321 bits (822), Expect = 9e-85
 Identities = 191/349 (54%), Positives = 229/349 (65%), Gaps = 8/349 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R+RP   +SS++K R S NSRP+TP SR Q+                          
Sbjct: 215  TPSRARPGLTSSSIDKPRPSPNSRPTTPSSRPQLQANLSSPAARSNSRPSTPTRRTPAAS 274

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                        R ++NGR P+ ASR SSP P+ R PPQ IV PDFP+DTPPNLRTTLPD
Sbjct: 275  LSPTAGPSPSTARAMSNGRNPAPASRPSSPSPRVRNPPQPIVLPDFPLDTPPNLRTTLPD 334

Query: 971  RAVSAGRSRPNATTTP--GPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIAATEQQ 798
            R +SAGRSRP A  T       P RRQSS PIVTRGR+ E + +GRLH+NG    + E +
Sbjct: 335  RPLSAGRSRPGAAMTMKGNSETPTRRQSS-PIVTRGRVSEPNARGRLHSNGHVADSPESR 393

Query: 797  KTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLSGTGLF 621
            K + ++E + R+P K  TT+ ESTGFGRTISKKSLDMA+RHMD R+G+GSIRPLSGT LF
Sbjct: 394  KASHVTEPS-RKPVKTSTTSSESTGFGRTISKKSLDMAIRHMDIRNGTGSIRPLSGTTLF 452

Query: 620  PQSIRSGTLKGQPVKASGDL---VSVSSNGR-PSNNSKIAVLANNGNCNGR-SEDGETEK 456
            PQSIRS   K Q  +AS       SV+SNG  P++N+   V + NGN   R SE+G  E 
Sbjct: 453  PQSIRSAASKTQSARASSASSAPASVNSNGSLPASNN--GVPSENGNYFTRPSENGAEED 510

Query: 455  DGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            DG  SAK+N+ DIYESSRYDAILLKEDLKNTNWLHS+ DKSDQG IFD+
Sbjct: 511  DGRFSAKLNQTDIYESSRYDAILLKEDLKNTNWLHSV-DKSDQGPIFDN 558



 Score =  159 bits (403), Expect = 4e-36
 Identities = 82/115 (71%), Positives = 94/115 (81%), Gaps = 4/115 (3%)
 Frame = -2

Query: 1982 MNRMSKESLSGGRNI----QHRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSSDESDV 1815
            MNR  KES +G R I     HRRGR LT  GM +D DENLDLFSRNRR+LS+ SS+ES+V
Sbjct: 1    MNRSFKESPAGPRTIPAVSHHRRGRSLT--GMPRDADENLDLFSRNRRTLSVVSSEESEV 58

Query: 1814 STKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPEP 1650
              KLGR+SVGS KLARSGMDDLLS+V+GGKHDYDWLLTPPGTPLFPSS+ +E +P
Sbjct: 59   PLKLGRLSVGSAKLARSGMDDLLSSVEGGKHDYDWLLTPPGTPLFPSSDGNESQP 113


>ref|XP_006481235.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like isoform X1
            [Citrus sinensis] gi|568855282|ref|XP_006481236.1|
            PREDICTED: putative GPI-anchored protein PB15E9.01c-like
            isoform X2 [Citrus sinensis]
          Length = 582

 Score =  318 bits (816), Expect = 5e-84
 Identities = 190/349 (54%), Positives = 222/349 (63%), Gaps = 9/349 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R+RP+  +SS++KTR SQ SRPSTP SR QI                          
Sbjct: 230  TPSRTRPSLTSSSMDKTRTSQTSRPSTPSSRPQIPANLNSSTARSSSRPSTPTRRNPITS 289

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                      AGR+++NGR    ASR SSP P+ R   Q IVPPDFP+DTPPNLRTTLPD
Sbjct: 290  TSPAMSSSTSAGRVMSNGRSQGPASRPSSPSPRVRSQ-QPIVPPDFPLDTPPNLRTTLPD 348

Query: 971  RAVSAGRSRPNATTT-------PGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            R +SAGRSRP A  T        G VN  RR SS P+VTRGRL E  G+ R  ANG    
Sbjct: 349  RPLSAGRSRPGAALTMKSNPEATGSVNMPRRHSS-PVVTRGRLTEPPGRSRTPANGHTAD 407

Query: 812  ATEQQKTTPISELAMRRPAKPVTT-TESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
            A E ++T+ ISE + RRP K   T ++ TGFGRTISKKSLDMA+RHMD R+G+GSIR LS
Sbjct: 408  AHEYRRTSHISEQSTRRPVKSTNTASDGTGFGRTISKKSLDMAIRHMDIRNGAGSIRQLS 467

Query: 635  GTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNC-NGRSEDGETE 459
            GT LFPQSIRS T K +  +A   L SV +NG   N      ++  GN  +G +E+G   
Sbjct: 468  GTTLFPQSIRSATSKTRSARA---LESVHTNGILKNRD----ISEKGNTYSGPAENGNDA 520

Query: 458  KDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFD 312
             DG  SAK++EADIYESSRYDAILLKEDLKNTNWLHS DDKSDQG IFD
Sbjct: 521  HDGRYSAKLSEADIYESSRYDAILLKEDLKNTNWLHSFDDKSDQGAIFD 569



 Score =  144 bits (364), Expect = 1e-31
 Identities = 84/125 (67%), Positives = 97/125 (77%), Gaps = 15/125 (12%)
 Frame = -2

Query: 1982 MNRMS----KESLSGGRNI--------QHRRGRILTRGGMSKDR-DEN-LDLFSRNRRSL 1845
            MNR S    +ESL GGRNI        QHRRG+ LT  G +KD  DEN LDLFS++RRSL
Sbjct: 1    MNRYSNNHLRESLVGGRNIPVGMHLHHQHRRGQSLT--GSTKDTSDENHLDLFSKSRRSL 58

Query: 1844 SITSSDES-DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSN 1668
            S+ SSD+S DVS KLGR+SVGS KLA+SG+DDLLS+ DGGKHDYDWLLTPPGTPLFPSS+
Sbjct: 59   SVASSDDSSDVSVKLGRLSVGSAKLAKSGVDDLLSSTDGGKHDYDWLLTPPGTPLFPSSD 118

Query: 1667 ASEPE 1653
             SE +
Sbjct: 119  GSESQ 123


>ref|XP_006429633.1| hypothetical protein CICLE_v100114702mg, partial [Citrus clementina]
            gi|557531690|gb|ESR42873.1| hypothetical protein
            CICLE_v100114702mg, partial [Citrus clementina]
          Length = 368

 Score =  318 bits (816), Expect = 5e-84
 Identities = 190/349 (54%), Positives = 222/349 (63%), Gaps = 9/349 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R+RP+  +SS++KTR SQ SRPSTP SR QI                          
Sbjct: 16   TPSRTRPSLTSSSMDKTRTSQTSRPSTPSSRPQIPANLNSSTARSSSRPSTPTRRNPITS 75

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                      AGR+++NGR    ASR SSP P+ R   Q IVPPDFP+DTPPNLRTTLPD
Sbjct: 76   TSPAMSSSTSAGRVMSNGRSQGPASRPSSPSPRVRSQ-QPIVPPDFPLDTPPNLRTTLPD 134

Query: 971  RAVSAGRSRPNATTT-------PGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            R +SAGRSRP A  T        G VN  RR SS P+VTRGRL E  G+ R  ANG    
Sbjct: 135  RPLSAGRSRPGAALTMKSNPEATGSVNMPRRHSS-PVVTRGRLTEPPGRSRTPANGHTAD 193

Query: 812  ATEQQKTTPISELAMRRPAKPVTT-TESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
            A E ++T+ ISE + RRP K   T ++ TGFGRTISKKSLDMA+RHMD R+G+GSIR LS
Sbjct: 194  AHEYRRTSHISEQSTRRPVKSTNTASDGTGFGRTISKKSLDMAIRHMDIRNGAGSIRQLS 253

Query: 635  GTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNC-NGRSEDGETE 459
            GT LFPQSIRS T K +  +A   L SV +NG   N      ++  GN  +G +E+G   
Sbjct: 254  GTTLFPQSIRSATSKTRSARA---LESVHTNGILKNRD----ISEKGNTYSGPAENGNDA 306

Query: 458  KDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFD 312
             DG  SAK++EADIYESSRYDAILLKEDLKNTNWLHS DDKSDQG IFD
Sbjct: 307  HDGRYSAKLSEADIYESSRYDAILLKEDLKNTNWLHSFDDKSDQGAIFD 355


>ref|XP_006381258.1| hypothetical protein POPTR_0006s11130g [Populus trichocarpa]
            gi|550335959|gb|ERP59055.1| hypothetical protein
            POPTR_0006s11130g [Populus trichocarpa]
          Length = 597

 Score =  313 bits (801), Expect = 2e-82
 Identities = 183/349 (52%), Positives = 220/349 (63%), Gaps = 8/349 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R RP P +SSV+KT   QNSRPSTP SR Q                           
Sbjct: 243  TPSRVRPAPTSSSVDKTPPFQNSRPSTPSSRGQSPANFSAAPTRSNSRPSTPTRRNPAPS 302

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                      AGR+L+NGRIP  ASR SSP P+ RPP Q ++PPDFP+DTPPNLRTTL  
Sbjct: 303  SSAASSPSTSAGRVLSNGRIPGPASRPSSPSPRVRPPQQPVIPPDFPLDTPPNLRTTLQG 362

Query: 971  RAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            R +SAGRSR   ++       T G +N  RR  SSPIVTRGRL E SGKGR+H+NG    
Sbjct: 363  RPLSAGRSRTGVSSAMKGNPETMGSLNAPRRH-SSPIVTRGRLTEPSGKGRVHSNGHVAD 421

Query: 812  ATEQQKTTPISELAMRRPAKPVT-TTESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
              E +K + +SE+ +RRP K  +  ++STGFGRTISKKSLDMA+RHMD R+G+GS R LS
Sbjct: 422  TPEPRKVSHVSEVGIRRPVKSSSAASDSTGFGRTISKKSLDMAIRHMDIRNGTGSARSLS 481

Query: 635  GTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDGETEK 456
             T LFPQSIRS T K Q V++     S+ +NG   N     VL +  + +  +E G    
Sbjct: 482  STTLFPQSIRSTTPKSQSVRSQRTQESI-NNGNSQNGD---VLDDEIHFSRAAEIGHEAN 537

Query: 455  DGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            DG  SAK+++ DIYESSRYDAILL EDLKNTNWLHSIDDKSDQG  FD+
Sbjct: 538  DGRYSAKLSDVDIYESSRYDAILL-EDLKNTNWLHSIDDKSDQGPFFDN 585



 Score =  135 bits (339), Expect = 9e-29
 Identities = 73/121 (60%), Positives = 87/121 (71%), Gaps = 9/121 (7%)
 Frame = -2

Query: 1985 SMNRMSKESLSGGRNIQ-----HRRGRILTRGG-MSKDR---DENLDLFSRNRRSLSITS 1833
            S N   +ESL  GRNI      +RRG  +T GG  SK+    DENLDLFS+NRR LS++S
Sbjct: 19   STNGSLRESLVAGRNIPMGSQYYRRGHNVTGGGGFSKNNNGTDENLDLFSKNRRGLSVSS 78

Query: 1832 SDESDVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPE 1653
             + SDVS KL R++VGS K ARSG+DDLLS+ +GGKHDYDWLLTPPGTPL P S  SE +
Sbjct: 79   DESSDVSVKLERLAVGSAKFARSGIDDLLSSTEGGKHDYDWLLTPPGTPLSPPSEGSESK 138

Query: 1652 P 1650
            P
Sbjct: 139  P 139


>ref|XP_007208445.1| hypothetical protein PRUPE_ppa003389mg [Prunus persica]
            gi|462404087|gb|EMJ09644.1| hypothetical protein
            PRUPE_ppa003389mg [Prunus persica]
          Length = 579

 Score =  312 bits (800), Expect = 3e-82
 Identities = 184/352 (52%), Positives = 220/352 (62%), Gaps = 11/352 (3%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPIS-RSQIXXXXXXXXXXXXXXXXXXXXXXXXX 1155
            TP R R T  +SS+EK R+ Q+SRPSTP S R QI                         
Sbjct: 217  TPSRPRTTSTSSSIEKPRSVQSSRPSTPSSTRPQIPANLNSHASRPNSRPSTPTRRSSLP 276

Query: 1154 XXXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                       AGR+L+NGR  + +SR SSP P+ RPPPQ +VPPDFP+DTPPNLRTTLP
Sbjct: 277  SLSPASSPSPSAGRVLSNGRSSAPSSRPSSPSPRIRPPPQPVVPPDFPLDTPPNLRTTLP 336

Query: 974  DRAVSAGRSRPNATTT------PGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            DR +SAGRSRP A  +      P     V R+ SSPI +RGRL E  G+GR+H  G    
Sbjct: 337  DRPISAGRSRPGAVVSMKGKPEPPAAVVVPRRQSSPIASRGRLTEPPGRGRVHPTGHLPD 396

Query: 812  ATEQQKTTPISELAMRRPAKPVTT--TESTGFGRTISKKSLDMALRHMDFRHGSGSIRPL 639
              E +K T I +L MR+P K  TT  TESTGFGR ISKKSLDMA+RHMD R+G+G+ R L
Sbjct: 397  VPEPRKATLIPDLGMRKPVKTSTTTATESTGFGRNISKKSLDMAIRHMDIRNGTGNGRQL 456

Query: 638  SGTGLFPQSIR-SGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDGET 462
            SG+ LFPQSIR S T K Q V+      S  +NG     S   V++ NGN   R  D  +
Sbjct: 457  SGSTLFPQSIRSSSTPKPQSVRGLSVPASARTNGSLQTGSN-GVISENGNIMNRPVDNGS 515

Query: 461  EKD-GYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            E D G  SAK++EAD+YESSRYDAILLKEDLK+TNWLHS+DDK DQG IFD+
Sbjct: 516  EADSGRYSAKLSEADVYESSRYDAILLKEDLKSTNWLHSLDDKLDQGPIFDN 567



 Score =  145 bits (367), Expect = 5e-32
 Identities = 77/117 (65%), Positives = 95/117 (81%), Gaps = 6/117 (5%)
 Frame = -2

Query: 1982 MNRMSKESLSGGRNI----QHRRGRILTRGGMSKDRDE-NLDLFSRNRRSLSITSSDES- 1821
            MNR ++ESL GGRNI    QHRRG  L    ++K+ DE +LDLFS+NRR+LS+TSSDES 
Sbjct: 2    MNRNARESLVGGRNIPFGSQHRRGLSLN---LAKESDEGSLDLFSKNRRTLSVTSSDESS 58

Query: 1820 DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPEP 1650
            DVS KLGR+S+GS K+ R+G+DDLLS+ +GGKHDYDWLLTPP TPLFPSS+ SE +P
Sbjct: 59   DVSVKLGRLSIGSAKVGRTGIDDLLSSAEGGKHDYDWLLTPPETPLFPSSDGSESQP 115


>ref|XP_007033483.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508712512|gb|EOY04409.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 651

 Score =  309 bits (791), Expect = 4e-81
 Identities = 186/375 (49%), Positives = 225/375 (60%), Gaps = 34/375 (9%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP + RP+  +S ++K+R SQ+SRPSTP SR QI                          
Sbjct: 267  TPSKVRPSSTSSYIDKSRPSQSSRPSTPSSRPQIPANLNSTAVRSNSRPSTPTRRNPIPS 326

Query: 1151 XXXXXXXXXXA-GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                      + GR L+NGR  + ASR SSPGP+ RPP Q +VPPDFP+DTPPNLRTTLP
Sbjct: 327  LSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPNLRTTLP 386

Query: 974  DRAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDI 816
            DR VSAGRSRP  +        T   VN  RR SS PIVTRGRL E  G+ R+H+NG   
Sbjct: 387  DRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRHSS-PIVTRGRLTEPPGRTRVHSNGHAS 445

Query: 815  AATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHM------------ 675
               E +KT+ +++ AMR+P K  TTT +S GFGRTISKKSLDMA+RHM            
Sbjct: 446  DIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMSLELQWHDQWLE 505

Query: 674  ------------DFRHGSGSIRPLSGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPS 531
                        D R+G+GSIR LSGT LFPQSIRS T + Q +++     SV+SNG P 
Sbjct: 506  LVVIFYVEAYAVDIRNGTGSIRSLSGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPG 565

Query: 530  NNSKIAVLANNGNCNGRS-EDGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWL 354
            +       + NGN   R  ++G    DG  SAK +E DIYESSRYDAILLKEDLKNTNWL
Sbjct: 566  SLQN-GDFSENGNSISRPVQNGSDSHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWL 624

Query: 353  HSIDDKSDQGLIFDH 309
            HSIDDKSD G IF++
Sbjct: 625  HSIDDKSDPGSIFEN 639



 Score =  117 bits (294), Expect = 2e-23
 Identities = 75/167 (44%), Positives = 89/167 (53%), Gaps = 57/167 (34%)
 Frame = -2

Query: 1982 MNRMSKESLSGG--RNIQ-------HRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSS 1830
            MNR  +ESL GG   NI        HRRG+ LT G   +D DENLDLFS+NRRSLS+ SS
Sbjct: 2    MNRNLRESLVGGGRNNINVLAASHHHRRGQSLTGGLFPRDSDENLDLFSKNRRSLSVASS 61

Query: 1829 DES-DVSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDW------------------- 1710
            DES DV+ KLGR+S+GS ++ + G+DDLLS+ DGGKHDYD                    
Sbjct: 62   DESSDVAVKLGRLSLGSARVGKGGLDDLLSSTDGGKHDYDCYDVMDHSTRHSPFAPMPNV 121

Query: 1709 ----------------------------LLTPPGTPLFPSSNASEPE 1653
                                        LLTPPGTPLFPSS  SE +
Sbjct: 122  SKDYHFEAESVFGNLRSSQCVITTYVIRLLTPPGTPLFPSSEGSESQ 168


>gb|EYU42049.1| hypothetical protein MIMGU_mgv1a003860mg [Mimulus guttatus]
          Length = 559

 Score =  308 bits (788), Expect = 8e-81
 Identities = 187/350 (53%), Positives = 214/350 (61%), Gaps = 9/350 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R RP   TSS ++ R SQNSRPSTP SR QI                          
Sbjct: 218  TPARPRPALSTSSTDRPRPSQNSRPSTPTSRPQISSNMTSPAARTTSRPSTPTRRNPTPS 277

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                       GR LTNGR  ++ SR SSPGP+ RPPPQ IV  DFP+DTPPNLRTTLPD
Sbjct: 278  LSPTSGPSTPGGRSLTNGRSGASVSRPSSPGPRVRPPPQPIVLHDFPLDTPPNLRTTLPD 337

Query: 971  RAVSAGRSRP--------NATTTPGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDI 816
            R VSAGRSRP        NA  TPG    V R+ SSPIVTRGR+ E +G+GR HANGQ  
Sbjct: 338  RPVSAGRSRPGVSLTSKGNAEPTPGNA-AVPRRHSSPIVTRGRVAEPNGRGRTHANGQLP 396

Query: 815  AATEQQKTTPISELAMRRPAKPVTTTESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
             A + +K     EL  R+PAK   +T+STGFGRTISKKSLDMA+RHMD R+G+   RPL+
Sbjct: 397  DAMDSRK-----ELPARKPAK--ISTDSTGFGRTISKKSLDMAIRHMDIRNGNNGFRPLT 449

Query: 635  GTGLFPQSIRSGTLKGQPVKAS-GDLVSVSSNGRPSNNSKIAVLANNGNCNGRSEDGETE 459
            G+ LFPQSIRS   K Q   AS    +S++SNG          +A NG  +  SE G  E
Sbjct: 450  GSNLFPQSIRSTNQKTQQGAASPNGSLSINSNG---------AIAENG--HRFSESGSEE 498

Query: 458  KDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
                 SAK+   DIYESSRYD ILLKEDLKN NWLHSIDDKSDQG IFD+
Sbjct: 499  DKYQYSAKLTNIDIYESSRYDMILLKEDLKNANWLHSIDDKSDQGSIFDN 548



 Score =  144 bits (363), Expect = 2e-31
 Identities = 76/114 (66%), Positives = 89/114 (78%), Gaps = 4/114 (3%)
 Frame = -2

Query: 1982 MNRMSKESLSGG-RN--IQHRRGRILTRGGMSKDR-DENLDLFSRNRRSLSITSSDESDV 1815
            MNR  +ES++GG RN  + HRRG  +     SKD  DENLDLFS++RRSLS+ SSDESDV
Sbjct: 1    MNRTLRESVTGGGRNFPLNHRRGLSINGVPNSKDSTDENLDLFSKSRRSLSVASSDESDV 60

Query: 1814 STKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPE 1653
              KLGRIS+GS K  RSG+DDLLS+ DGGKHDYDWLLTPPGTPL PSSN +E +
Sbjct: 61   PVKLGRISIGSAKHGRSGLDDLLSSADGGKHDYDWLLTPPGTPLVPSSNVNESQ 114


>ref|XP_004170022.1| PREDICTED: uncharacterized protein LOC101225804, partial [Cucumis
            sativus]
          Length = 484

 Score =  278 bits (712), Expect = 5e-72
 Identities = 169/355 (47%), Positives = 206/355 (58%), Gaps = 14/355 (3%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R+RP+P + S+EK R  Q+SRPSTP SR QI                          
Sbjct: 124  TPSRARPSPNSPSIEKPRPLQSSRPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPS 183

Query: 1151 XXXXXXXXXXAGRIL-TNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                        R+L TNGR  ++ SR SSP P+ R  PQ IVPPDFP+DTPPNLRTTLP
Sbjct: 184  LSSVVGTPSSTSRVLSTNGRSSTSTSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLP 243

Query: 974  DRAVSAGRSRPN-ATTTPG-----PVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            DR +SAGRSRP  A++  G         V R+++SP +TRGR+ +  G+GRL+ NG    
Sbjct: 244  DRPISAGRSRPTPASSVRGSPETTSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSD 303

Query: 812  ATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
            + E ++ +  S+L+ RRP K  TTT ES GFGR+ISKKSLDMA+RHMD R+G GS+R  S
Sbjct: 304  SPETRRLSSSSDLSGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGS 363

Query: 635  GTGLFPQSIRSGTLKGQPVKASG------DLVSVSSNGRPSNNSKIAVLANNGNCNGRSE 474
            G  LFP SIRS T K Q +  S       D    S+N     N      A  G   G  E
Sbjct: 364  GNTLFPHSIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGE 423

Query: 473  DGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            +G        SA +N  DIYESSRYDAILLKEDLKNTNWLHS DDK+D   I D+
Sbjct: 424  NGR------FSASLNHLDIYESSRYDAILLKEDLKNTNWLHSTDDKTDLASILDN 472


>ref|XP_004142729.1| PREDICTED: uncharacterized protein LOC101206216 [Cucumis sativus]
          Length = 578

 Score =  278 bits (712), Expect = 5e-72
 Identities = 169/355 (47%), Positives = 206/355 (58%), Gaps = 14/355 (3%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP R+RP+P + S+EK R  Q+SRPSTP SR QI                          
Sbjct: 218  TPSRARPSPNSPSIEKPRPLQSSRPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPS 277

Query: 1151 XXXXXXXXXXAGRIL-TNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLP 975
                        R+L TNGR  ++ SR SSP P+ R  PQ IVPPDFP+DTPPNLRTTLP
Sbjct: 278  LSSVVGTPSSTSRVLSTNGRSSTSTSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLP 337

Query: 974  DRAVSAGRSRPN-ATTTPG-----PVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            DR +SAGRSRP  A++  G         V R+++SP +TRGR+ +  G+GRL+ NG    
Sbjct: 338  DRPISAGRSRPTPASSVRGSPETTSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSD 397

Query: 812  ATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLS 636
            + E ++ +  S+L+ RRP K  TTT ES GFGR+ISKKSLDMA+RHMD R+G GS+R  S
Sbjct: 398  SPETRRLSSSSDLSGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGS 457

Query: 635  GTGLFPQSIRSGTLKGQPVKASG------DLVSVSSNGRPSNNSKIAVLANNGNCNGRSE 474
            G  LFP SIRS T K Q +  S       D    S+N     N      A  G   G  E
Sbjct: 458  GNTLFPHSIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGE 517

Query: 473  DGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
            +G        SA +N  DIYESSRYDAILLKEDLKNTNWLHS DDK+D   I D+
Sbjct: 518  NGR------FSASLNHLDIYESSRYDAILLKEDLKNTNWLHSTDDKTDLASILDN 566



 Score =  148 bits (374), Expect = 8e-33
 Identities = 77/113 (68%), Positives = 91/113 (80%), Gaps = 5/113 (4%)
 Frame = -2

Query: 1982 MNRMSKESLSGGRNI----QHRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSSDES-D 1818
            MNR  +E LSG RN      HRRG   T  G+S+D DENLDLFS+NRR+LS+T+SD+S D
Sbjct: 1    MNRNWREPLSGSRNAPLFSHHRRGHSFT--GISRDSDENLDLFSKNRRTLSVTASDDSSD 58

Query: 1817 VSTKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASE 1659
             S KLGR+SVGS KLA+SG+DDLLS+ +GGKHDYDWLLTPPGTPLFPSS+ SE
Sbjct: 59   ASVKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE 111


>ref|XP_006353835.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Solanum
            tuberosum]
          Length = 565

 Score =  278 bits (711), Expect = 7e-72
 Identities = 168/347 (48%), Positives = 205/347 (59%), Gaps = 8/347 (2%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTRASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            TP ++R  P TS     R +Q+SRPSTP SR QI                          
Sbjct: 215  TPSKARQAPSTS-----RPTQSSRPSTPTSRPQISGNLSTPSRPTSRPSTPTRRTITPSL 269

Query: 1151 XXXXXXXXXXAGRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPPNLRTTLPD 972
                       GR +TNGR  ++ SR SSP P+ R P Q IVPPDF ++TPPNLRTTLPD
Sbjct: 270  SPASRSSTPA-GRPVTNGRTAASLSRPSSPSPQVRRPSQPIVPPDFSLETPPNLRTTLPD 328

Query: 971  RAVSAGRSRPNATTT-------PGPVNPVRRQSSSPIVTRGRLPELSGKGRLHANGQDIA 813
            R +SAGRSRPN + T       P   NP  R+ SSPIV+RGRL E SG+GR+  +GQ   
Sbjct: 329  RPLSAGRSRPNPSVTTKGNAEAPSVANP--RRQSSPIVSRGRLTEPSGRGRVLGSGQLSD 386

Query: 812  ATEQQKTTPISELAMRRPAKPVTTTESTGFGRTISKKSLDMALRHMDFRHGSGSIRPLSG 633
             ++ ++ + +SEL+ R+P K  T  ++ G GRTISKKSLD+A+RHMD R+G   +RP SG
Sbjct: 387  ISDSRRASHVSELSTRKPVK--TAADNMGLGRTISKKSLDVAIRHMDIRNGINGVRPSSG 444

Query: 632  TGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGN-CNGRSEDGETEK 456
            + LFP SIRS   KGQP   S    S + N     N     L  NGN  N  SE+G  E 
Sbjct: 445  STLFPHSIRSTNGKGQPSHGSTGASSFNENASYHYNGN---LPENGNYLNRSSENGSEEA 501

Query: 455  DGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIF 315
                SAK+ + DIYESSRYD +LLKEDLKNTNWLHSIDDKSDQ  IF
Sbjct: 502  KSQHSAKLTDIDIYESSRYDVLLLKEDLKNTNWLHSIDDKSDQETIF 548



 Score =  135 bits (341), Expect = 5e-29
 Identities = 66/115 (57%), Positives = 90/115 (78%), Gaps = 4/115 (3%)
 Frame = -2

Query: 1982 MNRMSKESLSGGRNI----QHRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSSDESDV 1815
            MNR  ++SL  G+N     QHRRG  L+  G S++ D+NLDLFS++RRS+S+ SSDE+DV
Sbjct: 1    MNRSFRDSLITGKNFPISSQHRRG--LSLNGASREPDDNLDLFSKSRRSVSVASSDETDV 58

Query: 1814 STKLGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASEPEP 1650
            + KLGR+S+GS K  +SG++DLL++ +G KHDYDWLLTPPGTPL P+S+ SE +P
Sbjct: 59   TVKLGRLSIGSVKQLKSGLEDLLASTEGEKHDYDWLLTPPGTPLVPTSDGSESKP 113


>ref|XP_007140052.1| hypothetical protein PHAVU_008G080400g [Phaseolus vulgaris]
            gi|561013185|gb|ESW12046.1| hypothetical protein
            PHAVU_008G080400g [Phaseolus vulgaris]
          Length = 586

 Score =  277 bits (708), Expect = 2e-71
 Identities = 174/365 (47%), Positives = 205/365 (56%), Gaps = 24/365 (6%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTR-ASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXX 1155
            TP + RP    S+ E+ R +SQ SRPSTP SR  I                         
Sbjct: 217  TPSKPRPVSTNSTSERHRPSSQGSRPSTPSSRPHIPANLHSPSAPSTRSLSRPSTPTRRS 276

Query: 1154 XXXXXXXXXXXA-------GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPP 996
                                R   NGR  + ASR SSP P+ RPPPQ IVPPDFP+DTPP
Sbjct: 277  SMPSLSPSPSPTPGSLSSSSRASLNGRSSAPASRPSSPSPRIRPPPQPIVPPDFPLDTPP 336

Query: 995  NLRTTLPDRAVSAGRSRPNATT-------TPGPVNPVRRQSSSPIVTRGRLPELSGKGRL 837
            NLRTTLPDR VSAGRSRP  TT       T      V R+ SSP+V+RGR+ E   K R 
Sbjct: 337  NLRTTLPDRPVSAGRSRPGGTTLKANGSETQASSVTVPRRHSSPVVSRGRMTEPLAKSRG 396

Query: 836  HANGQDIAATEQQKTTPISELAMRRPAKPVTT-TESTGFGRTISKKSLDMALRHMDFRHG 660
            +ANG    A E +K     ELA R+  K  TT T++ GFGRTISKKSLDMA++HMD R+G
Sbjct: 397  YANGHHADAPEPRKVAHTPELAARKSVKASTTATDNNGFGRTISKKSLDMAIKHMDIRNG 456

Query: 659  SGSIRPLSGTGLFPQSIRSGTLKGQ--------PVKASGDLVSVSSNGRPSNNSKIAVLA 504
            SG+IR LS T LFPQSIR+ T K           V  +G L+S S+NG  SN      ++
Sbjct: 457  SGNIRSLSSTTLFPQSIRTSTPKSHSHRVCAPASVDMNGSLLSSSNNG--SNFDMGNGIS 514

Query: 503  NNGNCNGRSEDGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQG 324
            N      R  D     +   SAKV+E DIYESSRYDA+L KEDLKNTNWLH +DDK DQG
Sbjct: 515  NRSMIRAREVD-----ERQYSAKVSEVDIYESSRYDALLFKEDLKNTNWLHGVDDKCDQG 569

Query: 323  LIFDH 309
             IFD+
Sbjct: 570  PIFDN 574



 Score =  129 bits (323), Expect = 7e-27
 Identities = 68/107 (63%), Positives = 80/107 (74%), Gaps = 1/107 (0%)
 Frame = -2

Query: 1976 RMSKESLSGGRNIQHRRGRILTRGGMSKDRDENLDLFSRNRRSLSITSSDES-DVSTKLG 1800
            R  +ESL G  N  HRRG     G  + + D+NLDLFS NRRSLS+ SSDES DVS KLG
Sbjct: 5    RNVRESLLGSLN-HHRRGHSFN-GVANNNHDDNLDLFSNNRRSLSLASSDESSDVSVKLG 62

Query: 1799 RISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASE 1659
            R+SVG+ K  RSG+DDLLS+ +GGKHDYDWLLTPPGTP+FPS   S+
Sbjct: 63   RLSVGTAKPVRSGIDDLLSSTEGGKHDYDWLLTPPGTPVFPSEGESQ 109


>ref|XP_006602722.1| PREDICTED: flocculation protein FLO11-like [Glycine max]
          Length = 585

 Score =  272 bits (696), Expect = 4e-70
 Identities = 163/357 (45%), Positives = 208/357 (58%), Gaps = 16/357 (4%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTR-ASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXX 1155
            TP + R     S+ EK R +SQ SRPSTP SR  I                         
Sbjct: 218  TPSKPRSVSTNSTAEKNRPSSQGSRPSTPSSRPHIPANLHSPSASSTRSLSRPSTPTRRS 277

Query: 1154 XXXXXXXXXXXA-------GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPP 996
                               GR+ +NGR  + ASR SSP P+ RPPPQ IVPPDFP++TPP
Sbjct: 278  SMPSLSPSPSPTTGSLTSAGRVSSNGRSSAPASRPSSPSPRVRPPPQPIVPPDFPLETPP 337

Query: 995  NLRTTLPDRAVSAGRSRPNATTTPGPVNPVR-------RQSSSPIVTRGRLPELSGKGRL 837
            NLRTTLPDR VSAGRSRP   T    V+  +       R+ SSPIV+RGR+ E + K R 
Sbjct: 338  NLRTTLPDRPVSAGRSRPGGVTMKANVSETQASPVTMPRRHSSPIVSRGRVTEPAAKTRG 397

Query: 836  HANGQDIAATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHG 660
            ++NG    A+E +K +   E+A R+  +  TT  ++TGFGRTISKKSLDMA++HMD R+ 
Sbjct: 398  YSNGHHADASEPRKVSHAPEVAARKSIRSSTTAPDNTGFGRTISKKSLDMAIKHMDIRNS 457

Query: 659  SGSIRPLSGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVLANNGNCNGR 480
            SG+IR LS T LFPQSIR+ T K   V ++   V ++ +   S N     + N  + N  
Sbjct: 458  SGNIRSLSSTTLFPQSIRTSTSKSHRVSSAPASVDMNGSMISSKNGANFDVGNGIDRNMM 517

Query: 479  SEDGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
             +  + ++  Y SAK++E DIYESSRYDA+L KEDLKNTNWLH  DDK DQG IFD+
Sbjct: 518  MKGRDADERQY-SAKLSEVDIYESSRYDALLFKEDLKNTNWLHGADDKCDQGPIFDN 573



 Score =  120 bits (300), Expect = 3e-24
 Identities = 64/109 (58%), Positives = 77/109 (70%), Gaps = 3/109 (2%)
 Frame = -2

Query: 1976 RMSKESLSGGRNIQHRRGRILTR-GGMSKDRDENLDLFSRNRRSLSITSS--DESDVSTK 1806
            R  +ESL G  N  HRRG         +   D+NLDLFS NRRSL++ +S  D SDVS K
Sbjct: 5    RNVRESLLGSLN-HHRRGHSFNGVANNNHHHDDNLDLFSDNRRSLALAASSDDSSDVSVK 63

Query: 1805 LGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASE 1659
            LGR+SVG+ K  RSG+DDLLS+ +GGKHDYDWLLTPPGTP+FPS   S+
Sbjct: 64   LGRLSVGTAKPVRSGIDDLLSSTEGGKHDYDWLLTPPGTPVFPSEGESQ 112


>ref|XP_003534630.1| PREDICTED: flocculation protein FLO11-like [Glycine max]
          Length = 586

 Score =  272 bits (695), Expect = 5e-70
 Identities = 163/358 (45%), Positives = 208/358 (58%), Gaps = 17/358 (4%)
 Frame = -2

Query: 1331 TPVRSRPTPITSSVEKTR-ASQNSRPSTPISRSQIXXXXXXXXXXXXXXXXXXXXXXXXX 1155
            TP + RP   +S+ E+ R +SQ SRPSTP SR  I                         
Sbjct: 219  TPSKPRPVSTSSTAERNRPSSQGSRPSTPSSRPHIPANLHSPSASSTRSLSRPSTPTRRS 278

Query: 1154 XXXXXXXXXXXA-------GRILTNGRIPSTASRGSSPGPKARPPPQRIVPPDFPIDTPP 996
                               GR+ +NGR  + ASR SSP P+ RPPPQ IVPPDFP++TPP
Sbjct: 279  SMPSLSPSPSPTIGSLTSAGRVSSNGRNSAPASRPSSPSPRVRPPPQPIVPPDFPLETPP 338

Query: 995  NLRTTLPDRAVSAGRSRP-------NATTTPGPVNPVRRQSSSPIVTRGRLPELSGKGRL 837
            NLRTTLPDR VSAGRSRP       N++ T      + R+ SSPIV+RGR+ E + K R 
Sbjct: 339  NLRTTLPDRPVSAGRSRPGGVTMKTNSSETQASPVTMPRRHSSPIVSRGRVTEPAAKTRG 398

Query: 836  HANGQDIAATEQQKTTPISELAMRRPAKPVTTT-ESTGFGRTISKKSLDMALRHMDFRHG 660
            ++NG  + A E +K +   E+A R+  +  +T  ++TGFGRTISKKSLDMA++HMD R+ 
Sbjct: 399  YSNGHHVDAPEPRKVSHAPEVAARKSVRSSSTAPDNTGFGRTISKKSLDMAIKHMDIRNS 458

Query: 659  SGSIRPLSGTGLFPQSIRSGTLKGQPVKASGDLVSVSSNGRPSNNSKIAVL-ANNGNCNG 483
            SG+IR L+ T LFPQSIR+ T K   V ++    SV  NG   ++   A     NG    
Sbjct: 459  SGNIRSLTSTTLFPQSIRTSTTKSHRVSSAP--ASVDMNGSMISSKNGANFDVGNGIDRN 516

Query: 482  RSEDGETEKDGYLSAKVNEADIYESSRYDAILLKEDLKNTNWLHSIDDKSDQGLIFDH 309
                G    + + SAK++E DIYESSRYDA+L KEDLKNTNWLH  DDK DQG IFD+
Sbjct: 517  MMMKGRDADERHYSAKLSEVDIYESSRYDALLFKEDLKNTNWLHGADDKCDQGPIFDN 574



 Score =  121 bits (303), Expect = 1e-24
 Identities = 63/109 (57%), Positives = 79/109 (72%), Gaps = 3/109 (2%)
 Frame = -2

Query: 1976 RMSKESLSGGRNIQHRRGRILTRGGMSKD-RDENLDLFSRNRRSLSITSS--DESDVSTK 1806
            R ++E L G  N  HRRG        + + RD+NLDLFS NRRSLS+ +S  D SDVS K
Sbjct: 5    RNAREYLLGSLN-HHRRGHSFNGVANNNNYRDDNLDLFSNNRRSLSLAASSDDSSDVSVK 63

Query: 1805 LGRISVGSEKLARSGMDDLLSTVDGGKHDYDWLLTPPGTPLFPSSNASE 1659
            LGR+S+G+ K  +SG+DDLLS+ +GGKHDYDWLLTPPGTP+FPS   S+
Sbjct: 64   LGRLSIGTAKPVKSGIDDLLSSTEGGKHDYDWLLTPPGTPVFPSEGESQ 112

