BLASTX nr result

ID: Rehmannia22_contig00021524 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00021524
         (785 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS59724.1| hypothetical protein M569_15079, partial [Genlise...   145   2e-32
ref|XP_004238743.1| PREDICTED: uncharacterized protein LOC101249...   140   5e-31
ref|XP_004135210.1| PREDICTED: uncharacterized protein LOC101206...   140   6e-31
ref|XP_006574567.1| PREDICTED: uncharacterized protein LOC100819...   138   2e-30
ref|XP_006357283.1| PREDICTED: uncharacterized protein LOC102605...   138   2e-30
ref|XP_002520174.1| conserved hypothetical protein [Ricinus comm...   137   5e-30
ref|XP_003517757.1| PREDICTED: uncharacterized protein LOC100787...   135   1e-29
ref|XP_006344293.1| PREDICTED: uncharacterized protein LOC102600...   134   5e-29
gb|EOY00247.1| CW14 protein isoform 1 [Theobroma cacao]               134   5e-29
ref|XP_003519785.1| PREDICTED: uncharacterized protein LOC100819...   134   5e-29
ref|XP_006302153.1| hypothetical protein CARUB_v10020162mg [Caps...   132   1e-28
gb|ESW29557.1| hypothetical protein PHAVU_002G079600g [Phaseolus...   132   2e-28
ref|XP_002272954.1| PREDICTED: uncharacterized protein LOC100260...   127   3e-27
ref|XP_004236975.1| PREDICTED: uncharacterized protein LOC101252...   127   4e-27
ref|XP_006483281.1| PREDICTED: uncharacterized protein LOC102624...   127   6e-27
gb|EOY00249.1| CW14 protein isoform 3 [Theobroma cacao]               126   7e-27
ref|XP_003631287.1| PREDICTED: uncharacterized protein LOC100260...   126   1e-26
gb|EOY00248.1| CW14 protein isoform 2 [Theobroma cacao]               124   5e-26
ref|XP_006438539.1| hypothetical protein CICLE_v10031173mg [Citr...   122   1e-25
ref|XP_002888194.1| hypothetical protein ARALYDRAFT_475347 [Arab...   122   1e-25

>gb|EPS59724.1| hypothetical protein M569_15079, partial [Genlisea aurea]
          Length = 490

 Score =  145 bits (365), Expect = 2e-32
 Identities = 84/187 (44%), Positives = 103/187 (55%), Gaps = 3/187 (1%)
 Frame = +2

Query: 233 RVEKNQSLPVDRPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPD---DAHSLNSFD 403
           RVEK   LP +R F+ PTF GN +EAWF+SA  L+SDWSDEDFQSIPD   ++H+    D
Sbjct: 48  RVEKLAPLPENRSFTGPTFQGNEDEAWFESAVYLESDWSDEDFQSIPDGMYESHAQTGVD 107

Query: 404 GTIVTNVASEEYSDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEISSAGET 583
           G         E+S    NS +G+ +SSL                  + V++DEIS     
Sbjct: 108 G---------EFS-SVRNSFSGSCESSL-------------ATDAKQHVYLDEISETSGG 144

Query: 584 AGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPT 763
             GDDS L+NCGIIP+NCLPCLAS VN                        F+WKEGHPT
Sbjct: 145 GDGDDSALNNCGIIPNNCLPCLASAVNNNAVDKRRSLTSSPPVKKSSLKLSFRWKEGHPT 204

Query: 764 AALFSSK 784
           AALF+SK
Sbjct: 205 AALFASK 211


>ref|XP_004238743.1| PREDICTED: uncharacterized protein LOC101249264 [Solanum
           lycopersicum]
          Length = 535

 Score =  140 bits (353), Expect = 5e-31
 Identities = 98/265 (36%), Positives = 124/265 (46%), Gaps = 32/265 (12%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MG CVSRP+ C                                      +V+K  S P+D
Sbjct: 1   MGGCVSRPDGCAGGRLGGSRRKSRKRRKAVKKRVSSHVSDRSAAD----KVDK--SFPLD 54

Query: 266 RPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFD--GTIVTNVASEEY 439
           R F+NPTF+G+ EEAWFDS+A  +SD SDEDFQS+ DD  SLN  D   T V +     +
Sbjct: 55  RSFNNPTFNGSTEEAWFDSSARFESDGSDEDFQSVADDVLSLNGSDCGRTSVASATDVHH 114

Query: 440 SDGSSN-----------------------SVNGAAKSSLPPIC------CDFKPKSEEQM 532
            D   N                       S +G+AK+S+ P         D K + +   
Sbjct: 115 GDVDVNAHHRLSSDLLRQGELSTSNPACSSDSGSAKTSINPSSMLRPKDADSKMRLDGPH 174

Query: 533 SGTKPVFVDEI-SSAGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXX 709
           S  +PVF+DEI SSA  ++  +D LLDNCGI+ +NCLPCLASTV  VEK           
Sbjct: 175 SEVQPVFLDEISSSANGSSRREDGLLDNCGILSNNCLPCLASTVAPVEKRHSLSASSPSA 234

Query: 710 XXXXXXXXXFKWKEGHPTAALFSSK 784
                    FKWKE +P AAL SSK
Sbjct: 235 KKKAAIKLPFKWKEENPVAALLSSK 259


>ref|XP_004135210.1| PREDICTED: uncharacterized protein LOC101206832 [Cucumis sativus]
           gi|449516445|ref|XP_004165257.1| PREDICTED:
           uncharacterized protein LOC101227289 [Cucumis sativus]
          Length = 536

 Score =  140 bits (352), Expect = 6e-31
 Identities = 93/266 (34%), Positives = 127/266 (47%), Gaps = 33/266 (12%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGACVS P+ CV                                         ++S P+D
Sbjct: 1   MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEG---------SHRSDPID 51

Query: 266 R-PFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYS 442
              FSNPTF G+ +EAWFD+    +SD  DED+QS+PDD  S+NS +    ++++S   +
Sbjct: 52  HCSFSNPTFQGSYDEAWFDTVGKFESD-CDEDYQSLPDDNQSINSLEAASTSSISSSGDA 110

Query: 443 DGSSNSVNG------------------AAKSSLPPICCDFKPKS------EEQMSG---- 538
           +   ++VN                   +  SS   +  D   ++      E Q+ G    
Sbjct: 111 NHGDHNVNRHSATSDQIHRPGNSARVHSVSSSESQVARDSHLQAINPDDAEPQLKGCGHS 170

Query: 539 ---TKPVFVDEISS-AGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXX 706
               +PVF+DEISS AGE++   D +LDNCGI+PSNCLPCLAST+N+VEK          
Sbjct: 171 SEANEPVFIDEISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVEKRKSLSSSPPS 230

Query: 707 XXXXXXXXXXFKWKEGHPTAALFSSK 784
                     FKWKEG+P AALFSSK
Sbjct: 231 GLKKAALKLSFKWKEGNPNAALFSSK 256


>ref|XP_006574567.1| PREDICTED: uncharacterized protein LOC100819425 isoform X4 [Glycine
           max] gi|571438438|ref|XP_006574568.1| PREDICTED:
           uncharacterized protein LOC100819425 isoform X5 [Glycine
           max]
          Length = 512

 Score =  138 bits (348), Expect = 2e-30
 Identities = 81/182 (44%), Positives = 103/182 (56%), Gaps = 8/182 (4%)
 Frame = +2

Query: 263 DRPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNV-ASEEY 439
           D  F+NPTF G++EEAWFDS AV DSD  D+D+QS+PDD  SL+  +G  V++  +S + 
Sbjct: 54  DCSFANPTFQGSIEEAWFDSIAVFDSD-CDDDYQSVPDDVVSLSGIEGGSVSSFPSSRDA 112

Query: 440 SDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGT-------KPVFVDEISSAGETAGGDD 598
           + G S       K  L     +    S+ Q  G        +PVF+DEISS    +  DD
Sbjct: 113 TRGVSTDQVQKQKELLAG--SEAARSSDVQYFGVDVIDSQREPVFLDEISSVDANSNKDD 170

Query: 599 SLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFS 778
            LLDNCGI+P+NCLPCLAST+ +VEK                    FKWKEGH  A LFS
Sbjct: 171 GLLDNCGILPNNCLPCLASTIPSVEKRRSSSSSPPNARKKVPAKLSFKWKEGHGNATLFS 230

Query: 779 SK 784
           SK
Sbjct: 231 SK 232


>ref|XP_006357283.1| PREDICTED: uncharacterized protein LOC102605449 [Solanum tuberosum]
          Length = 535

 Score =  138 bits (348), Expect = 2e-30
 Identities = 97/265 (36%), Positives = 123/265 (46%), Gaps = 32/265 (12%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MG CVSRP+ C                                      +V+K  S P+D
Sbjct: 1   MGGCVSRPDGCAGGRLGGSRRKSRKRRKAVKKRVSSHVSDRSAAD----KVDK--SFPLD 54

Query: 266 RPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFD--GTIVTNVASEEY 439
           R F+NP F+G+ EEAWFDS+A  +SD SDEDFQS+ DD  SLN  D   T V +     +
Sbjct: 55  RSFNNPAFNGSTEEAWFDSSARFESDGSDEDFQSVADDVLSLNGSDCGRTSVASATDVHH 114

Query: 440 SDGSSN-----------------------SVNGAAKSSLPPIC------CDFKPKSEEQM 532
            D   N                       S +G+AK+S+ P         D K + +   
Sbjct: 115 GDVDVNAHHRLSSDLQRQGELSTSNPACSSDSGSAKTSINPSSMLRPKDADSKMRLDGPH 174

Query: 533 SGTKPVFVDEI-SSAGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXX 709
           S  +PVF+DEI SSA  ++  +D LLDNCGI+ +NCLPCLASTV  VEK           
Sbjct: 175 SEVQPVFLDEISSSANGSSRREDGLLDNCGILSNNCLPCLASTVAPVEKRQSLSASSPSA 234

Query: 710 XXXXXXXXXFKWKEGHPTAALFSSK 784
                    FKWKE +P AAL SSK
Sbjct: 235 KKKAAIKLPFKWKEENPVAALLSSK 259


>ref|XP_002520174.1| conserved hypothetical protein [Ricinus communis]
           gi|223540666|gb|EEF42229.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 512

 Score =  137 bits (344), Expect = 5e-30
 Identities = 77/174 (44%), Positives = 104/174 (59%), Gaps = 3/174 (1%)
 Frame = +2

Query: 272 FSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYSDGS 451
           FSNPTF G++E+AWFDS A+ +SD  +ED++S+PDD  SLN  DG     +  ++     
Sbjct: 64  FSNPTFQGSIEDAWFDSVAIFESD-CEEDYESVPDDLLSLNGSDG-----LPHDQMKKAG 117

Query: 452 SNSVNGAAKSSLP--PICCDFKPKSEEQMSGTKPVFVDEI-SSAGETAGGDDSLLDNCGI 622
             S   +A++S+   P+     P +E +    +PVF+DEI SSA E AG ++ LL+NCGI
Sbjct: 118 DLSAGNSARNSVSEAPVSKFDGPSNEAK----QPVFLDEIASSADENAGKEEGLLENCGI 173

Query: 623 IPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           +P NCLPCLASTV+ VEK                    FKWKEGH   +LFSSK
Sbjct: 174 LPGNCLPCLASTVSQVEKRRSLSSSPPSARKKAALKLSFKWKEGHANNSLFSSK 227


>ref|XP_003517757.1| PREDICTED: uncharacterized protein LOC100787325 isoform X1 [Glycine
           max] gi|571434041|ref|XP_006573085.1| PREDICTED:
           uncharacterized protein LOC100787325 isoform X2 [Glycine
           max]
          Length = 512

 Score =  135 bits (341), Expect = 1e-29
 Identities = 79/194 (40%), Positives = 102/194 (52%), Gaps = 20/194 (10%)
 Frame = +2

Query: 263 DRPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYS 442
           D  F+NPTF G++EEAWFDS AV DSD  D+D+QS+PDD  SL+  +G  V++  S    
Sbjct: 54  DCSFANPTFQGSIEEAWFDSVAVFDSD-CDDDYQSVPDDVVSLSGIEGGSVSSFPSS--- 109

Query: 443 DGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTK--------------------PVFVDE 562
            G +N            +  D   K +E ++G++                    PVF+DE
Sbjct: 110 -GDANH----------GVSTDHVQKQKELLAGSEAARSSDVQYFVVDAIDSQHEPVFLDE 158

Query: 563 ISSAGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFK 742
           ISS    +  DD LLDNCGI+P+NCLPCL ST+ +VEK                    FK
Sbjct: 159 ISSVDANSNKDDGLLDNCGILPNNCLPCLVSTIPSVEKRRSTSSSPPNARKKPTTKLSFK 218

Query: 743 WKEGHPTAALFSSK 784
           WKEGH  A LFSSK
Sbjct: 219 WKEGHGNATLFSSK 232


>ref|XP_006344293.1| PREDICTED: uncharacterized protein LOC102600560 [Solanum tuberosum]
          Length = 480

 Score =  134 bits (336), Expect = 5e-29
 Identities = 92/237 (38%), Positives = 114/237 (48%), Gaps = 4/237 (1%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGACVSRP++CV                                         ++S P+D
Sbjct: 1   MGACVSRPDSCVGGKLKGSNKFRKRRGRRRRKKSSSLHKI-------------DESFPLD 47

Query: 266 RPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFD--GTIVTNVASEEY 439
               NPTF G +EEAWFDSAA+ +SD SDEDFQS+PDD  S+NSFD   T V ++    +
Sbjct: 48  N--YNPTFQGRIEEAWFDSAAIFESDCSDEDFQSVPDDVLSVNSFDCGRTSVASIKDTNH 105

Query: 440 SDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEISSAGETAGGD--DSLLDN 613
            D + N                  P SE      +PVF+DEISS+ E  G D  D LL+N
Sbjct: 106 GDVNLNHDG---------------PHSE-----VRPVFLDEISSS-ENIGSDREDGLLEN 144

Query: 614 CGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           CGI+ +NCLPCL STV  VEK                    FKWK+G+P A L SSK
Sbjct: 145 CGILSNNCLPCLTSTVVPVEK-RSLSSSPPSSRKKADLKLPFKWKDGNPCATLLSSK 200


>gb|EOY00247.1| CW14 protein isoform 1 [Theobroma cacao]
          Length = 541

 Score =  134 bits (336), Expect = 5e-29
 Identities = 93/268 (34%), Positives = 126/268 (47%), Gaps = 35/268 (13%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGAC SRPE CV                                     R+ +  S  VD
Sbjct: 1   MGACASRPEGCVSPKLRSSKKKNRKRRKSCLKKRVSS------------RLSEVSSDKVD 48

Query: 266 RP--------FSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTN 421
           RP        F+NPTF G+++E WFD  AV DSD  DE+F+S+ +D  SLN  +G  +++
Sbjct: 49  RPAPPDHHSSFTNPTFQGSIDE-WFDPVAVFDSD-CDEEFESVQEDVLSLNGLEGVSISS 106

Query: 422 VAS-------------------EEYSDGSS--NSVNGAAKSSLPPICCDFKPKSEEQMSG 538
           ++S                    + S G+S  NSV    ++S   +       S+ +  G
Sbjct: 107 ISSLKDANCGEHSSLVDQMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDG 166

Query: 539 T-----KPVFVDEI-SSAGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXX 700
                 +PVF+D+I SS  E +G ++ LLDNCGI+PSNCLPCLASTV ++EK        
Sbjct: 167 PSNKAKQPVFLDDIASSVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSP 226

Query: 701 XXXXXXXXXXXXFKWKEGHPTAALFSSK 784
                       FKW+EGHP A LFSSK
Sbjct: 227 PSARKKNALKLPFKWREGHPNATLFSSK 254


>ref|XP_003519785.1| PREDICTED: uncharacterized protein LOC100819425 isoform X1 [Glycine
           max] gi|571438431|ref|XP_006574565.1| PREDICTED:
           uncharacterized protein LOC100819425 isoform X2 [Glycine
           max] gi|571438434|ref|XP_006574566.1| PREDICTED:
           uncharacterized protein LOC100819425 isoform X3 [Glycine
           max]
          Length = 513

 Score =  134 bits (336), Expect = 5e-29
 Identities = 81/183 (44%), Positives = 103/183 (56%), Gaps = 9/183 (4%)
 Frame = +2

Query: 263 DRPFSNPTFH-GNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNV-ASEE 436
           D  F+NPTF  G++EEAWFDS AV DSD  D+D+QS+PDD  SL+  +G  V++  +S +
Sbjct: 54  DCSFANPTFQAGSIEEAWFDSIAVFDSD-CDDDYQSVPDDVVSLSGIEGGSVSSFPSSRD 112

Query: 437 YSDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGT-------KPVFVDEISSAGETAGGD 595
            + G S       K  L     +    S+ Q  G        +PVF+DEISS    +  D
Sbjct: 113 ATRGVSTDQVQKQKELLAG--SEAARSSDVQYFGVDVIDSQREPVFLDEISSVDANSNKD 170

Query: 596 DSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALF 775
           D LLDNCGI+P+NCLPCLAST+ +VEK                    FKWKEGH  A LF
Sbjct: 171 DGLLDNCGILPNNCLPCLASTIPSVEKRRSSSSSPPNARKKVPAKLSFKWKEGHGNATLF 230

Query: 776 SSK 784
           SSK
Sbjct: 231 SSK 233


>ref|XP_006302153.1| hypothetical protein CARUB_v10020162mg [Capsella rubella]
           gi|482570863|gb|EOA35051.1| hypothetical protein
           CARUB_v10020162mg [Capsella rubella]
          Length = 511

 Score =  132 bits (333), Expect = 1e-28
 Identities = 77/182 (42%), Positives = 110/182 (60%), Gaps = 8/182 (4%)
 Frame = +2

Query: 263 DRPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYS 442
           DR FSNPTF  +V+EAWFDS    ++D  D+DF S+ +D  S+N  +   V++++S + S
Sbjct: 50  DRSFSNPTFRASVDEAWFDSNLAFETD-CDDDFHSVQEDMLSVNGGERISVSSMSSVKDS 108

Query: 443 D---GSSNSVNGAAKSSLPPICCD---FKPKSEEQMSGTK-PVFVDEISS-AGETAGGDD 598
           +    + NS++   K+S      D    + KSE  ++ TK PVF+DEISS A +++  D+
Sbjct: 109 NLGGSARNSLSDGPKTSNSHSTVDDVISQSKSESTLTDTKQPVFIDEISSNADDSSRKDE 168

Query: 599 SLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFS 778
            LL+NCGI+PSNCLPCL STV ++EK                    FKW+EGHPT  LFS
Sbjct: 169 GLLENCGILPSNCLPCLHSTVPSIEKRRSLSSSPPSTRKKAALKLSFKWREGHPTGPLFS 228

Query: 779 SK 784
           +K
Sbjct: 229 TK 230


>gb|ESW29557.1| hypothetical protein PHAVU_002G079600g [Phaseolus vulgaris]
           gi|561030979|gb|ESW29558.1| hypothetical protein
           PHAVU_002G079600g [Phaseolus vulgaris]
          Length = 518

 Score =  132 bits (331), Expect = 2e-28
 Identities = 82/189 (43%), Positives = 102/189 (53%), Gaps = 15/189 (7%)
 Frame = +2

Query: 263 DRPFSNPTFH--GNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEE 436
           D  F+NPTF   G++EEAWFDS AV DSD  D+D+QS+PDD  SL+  +G  V +  S  
Sbjct: 54  DCSFANPTFQATGSIEEAWFDSVAVFDSD-CDDDYQSVPDDVVSLSGIEGGSVLSFPSSR 112

Query: 437 YSDGSSNSVNGAAKSSLPPICCDFKPKSEE------QMSGT-------KPVFVDEISSAG 577
            +     S +   K  L     +F   SE       Q SG        +PVF+DEISS  
Sbjct: 113 -NGNHGVSTDQIQKQELQ--AGNFANMSEASRSSGVQYSGVDVIDSPREPVFLDEISSVD 169

Query: 578 ETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGH 757
             +  DD LLDNCGI+P+NCLPCLAST+ ++EK                    FKWKEGH
Sbjct: 170 ANSNKDDGLLDNCGILPNNCLPCLASTIPSIEKRRSSSSSPPNARKKTPTKVSFKWKEGH 229

Query: 758 PTAALFSSK 784
             A LFSSK
Sbjct: 230 GNATLFSSK 238


>ref|XP_002272954.1| PREDICTED: uncharacterized protein LOC100260447 isoform 1 [Vitis
           vinifera] gi|296086464|emb|CBI32053.3| unnamed protein
           product [Vitis vinifera]
          Length = 510

 Score =  127 bits (320), Expect = 3e-27
 Identities = 79/235 (33%), Positives = 116/235 (49%), Gaps = 2/235 (0%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGACVS PE+CV                                             P D
Sbjct: 1   MGACVSSPESCVGGKLKYPKNKFRKRRKIKRRAVSRFADVTSFDKADRP---SRPGPPPD 57

Query: 266 RPFSNPTFH-GNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYS 442
           R F+NPTF  G+++EAWFDS  V +SD  +E+F+S+ ++  S+N F+G  V++++S   S
Sbjct: 58  RSFTNPTFRAGSLDEAWFDSIPVFESD-CEEEFESVQEEVFSVNGFEGASVSSISSLRDS 116

Query: 443 DGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEIS-SAGETAGGDDSLLDNCG 619
                +VN    S +  +    KP         +PVF+DEIS +A E+ G ++ +L+NCG
Sbjct: 117 SLWDCNVNVQHTSGMDGVDSQLKPDGPSN-EAKQPVFLDEISLTADESGGREEGMLENCG 175

Query: 620 IIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           I+P+NCLPCLAST ++ EK                    FKW+EG+  A+L SS+
Sbjct: 176 ILPNNCLPCLASTASSEEKRGSLSSSPPSSRKKGALKISFKWREGNANASLLSSR 230


>ref|XP_004236975.1| PREDICTED: uncharacterized protein LOC101252276 [Solanum
           lycopersicum]
          Length = 481

 Score =  127 bits (319), Expect = 4e-27
 Identities = 90/238 (37%), Positives = 114/238 (47%), Gaps = 5/238 (2%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGACVSRP++CV                                     R + N    +D
Sbjct: 1   MGACVSRPDSCVGGKLKGSNKFRKKRGGRRR------------------RKKSNSLHKID 42

Query: 266 RPFS----NPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASE 433
             F     NPTF G +EEAWFDSAA+ +SD SDEDFQS+PDD  S+NSFD    T+VAS 
Sbjct: 43  EAFPLDNYNPTFQGRIEEAWFDSAAIFESDCSDEDFQSVPDDVLSVNSFD-CGRTSVASI 101

Query: 434 EYSDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEISSAGETAGG-DDSLLD 610
           + ++    ++N               P SE      +PVF+DEISS+     G +D L +
Sbjct: 102 KDTNNGDVNLNPDG------------PHSE-----VRPVFLDEISSSENIGSGREDGLSE 144

Query: 611 NCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           NCGI+ +NCLP L STV  VEK                    FKWK+G+P A L SSK
Sbjct: 145 NCGILSNNCLPRLTSTVVPVEK-RSLSSSPPSSRKKADLKLPFKWKDGNPCATLLSSK 201


>ref|XP_006483281.1| PREDICTED: uncharacterized protein LOC102624792 isoform X2 [Citrus
           sinensis]
          Length = 539

 Score =  127 bits (318), Expect = 6e-27
 Identities = 83/212 (39%), Positives = 106/212 (50%), Gaps = 36/212 (16%)
 Frame = +2

Query: 257 PVDR--PFSNPTFHGNVEEAWFDSAAVLDSDWSDED-FQSIPDDAHSLNSFDGTIVT--- 418
           PVDR   F+NP   G+V+++WFDS A+ +SD  D+D F S+ DD  SLN  DG   T   
Sbjct: 48  PVDRHSSFTNPALQGSVDDSWFDSVAIFESDGEDDDDFVSVQDDVVSLNGSDGVSRTSNV 107

Query: 419 ------------NVASEEYSD------------GSSNSVNGAAKSSLPPICCDFKPKSEE 526
                       N+     +D             + NSV+   K+S   +       S+ 
Sbjct: 108 SLRDANHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQS 167

Query: 527 QMSGT-----KPVFVDEISSA-GETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXX 688
           +  G      +PVF+DEISS+  E +G D+ LLDNCGIIPSNCLPCLASTV +VEK    
Sbjct: 168 KSDGPSYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRSG 227

Query: 689 XXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
                           FKWKEGH  A L SSK
Sbjct: 228 SSSPPRPFKKTASKLSFKWKEGHANATLVSSK 259


>gb|EOY00249.1| CW14 protein isoform 3 [Theobroma cacao]
          Length = 503

 Score =  126 bits (317), Expect = 7e-27
 Identities = 89/264 (33%), Positives = 122/264 (46%), Gaps = 35/264 (13%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGAC SRPE CV                                     R+ +  S  VD
Sbjct: 1   MGACASRPEGCVSPKLRSSKKKNRKRRKSCLKKRVSS------------RLSEVSSDKVD 48

Query: 266 RP--------FSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTN 421
           RP        F+NPTF G+++E WFD  AV DSD  DE+F+S+ +D  SLN  +G  +++
Sbjct: 49  RPAPPDHHSSFTNPTFQGSIDE-WFDPVAVFDSD-CDEEFESVQEDVLSLNGLEGVSISS 106

Query: 422 VAS-------------------EEYSDGSS--NSVNGAAKSSLPPICCDFKPKSEEQMSG 538
           ++S                    + S G+S  NSV    ++S   +       S+ +  G
Sbjct: 107 ISSLKDANCGEHSSLVDQMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDG 166

Query: 539 T-----KPVFVDEI-SSAGETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXX 700
                 +PVF+D+I SS  E +G ++ LLDNCGI+PSNCLPCLASTV ++EK        
Sbjct: 167 PSNKAKQPVFLDDIASSVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSP 226

Query: 701 XXXXXXXXXXXXFKWKEGHPTAAL 772
                       FKW+EGHP A L
Sbjct: 227 PSARKKNALKLPFKWREGHPNATL 250


>ref|XP_003631287.1| PREDICTED: uncharacterized protein LOC100260447 isoform 2 [Vitis
           vinifera]
          Length = 494

 Score =  126 bits (316), Expect = 1e-26
 Identities = 76/234 (32%), Positives = 114/234 (48%), Gaps = 1/234 (0%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGACVS PE+CV                                             P D
Sbjct: 1   MGACVSSPESCVGGKLKYPKNKFRKRRKIKRRAVSRFADVTSFDKADRP---SRPGPPPD 57

Query: 266 RPFSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYSD 445
           R F+NPTF G+++EAWFDS  V +SD  +E+F+S+ ++  S+N F+G  V++++S   S+
Sbjct: 58  RSFTNPTFPGSLDEAWFDSIPVFESD-CEEEFESVQEEVFSVNGFEGASVSSISSLRDSN 116

Query: 446 GSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEIS-SAGETAGGDDSLLDNCGI 622
           G  + +     S+                   +PVF+DEIS +A E+ G ++ +L+NCGI
Sbjct: 117 GVDSQLKPDGPSN----------------EAKQPVFLDEISLTADESGGREEGMLENCGI 160

Query: 623 IPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           +P+NCLPCLAST ++ EK                    FKW+EG+  A+L SS+
Sbjct: 161 LPNNCLPCLASTASSEEKRGSLSSSPPSSRKKGALKISFKWREGNANASLLSSR 214


>gb|EOY00248.1| CW14 protein isoform 2 [Theobroma cacao]
          Length = 511

 Score =  124 bits (310), Expect = 5e-26
 Identities = 88/250 (35%), Positives = 113/250 (45%), Gaps = 17/250 (6%)
 Frame = +2

Query: 86  MGACVSRPENCVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVEKNQSLPVD 265
           MGAC SRPE CV                                     R+ +  S  VD
Sbjct: 1   MGACASRPEGCVSPKLRSSKKKNRKRRKSCLKKRVSS------------RLSEVSSDKVD 48

Query: 266 RP--------FSNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSI--------PDDAHSLNS 397
           RP        F+NPTF G+++E WFD  AV DSD  DE+F+S+        P D  + NS
Sbjct: 49  RPAPPDHHSSFTNPTFQGSIDE-WFDPVAVFDSD-CDEEFESVQEVDQMQKPGDLSAGNS 106

Query: 398 FDGTIVTNVASEEYSDGSSNSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEI-SSA 574
              ++     +      +S  VN  +KS  P                 +PVF+D+I SS 
Sbjct: 107 ACNSVGEVTRNSNSQVLNSEDVNSQSKSDGP------------SNKAKQPVFLDDIASSV 154

Query: 575 GETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEG 754
            E +G ++ LLDNCGI+PSNCLPCLASTV ++EK                    FKW+EG
Sbjct: 155 DEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSPPSARKKNALKLPFKWREG 214

Query: 755 HPTAALFSSK 784
           HP A LFSSK
Sbjct: 215 HPNATLFSSK 224


>ref|XP_006438539.1| hypothetical protein CICLE_v10031173mg [Citrus clementina]
           gi|568859507|ref|XP_006483280.1| PREDICTED:
           uncharacterized protein LOC102624792 isoform X1 [Citrus
           sinensis] gi|557540735|gb|ESR51779.1| hypothetical
           protein CICLE_v10031173mg [Citrus clementina]
          Length = 540

 Score =  122 bits (306), Expect = 1e-25
 Identities = 83/213 (38%), Positives = 106/213 (49%), Gaps = 37/213 (17%)
 Frame = +2

Query: 257 PVDR--PFSNPTFH-GNVEEAWFDSAAVLDSDWSDED-FQSIPDDAHSLNSFDGTIVT-- 418
           PVDR   F+NP    G+V+++WFDS A+ +SD  D+D F S+ DD  SLN  DG   T  
Sbjct: 48  PVDRHSSFTNPALQAGSVDDSWFDSVAIFESDGEDDDDFVSVQDDVVSLNGSDGVSRTSN 107

Query: 419 -------------NVASEEYSD------------GSSNSVNGAAKSSLPPICCDFKPKSE 523
                        N+     +D             + NSV+   K+S   +       S+
Sbjct: 108 VSLRDANHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQ 167

Query: 524 EQMSGT-----KPVFVDEISSA-GETAGGDDSLLDNCGIIPSNCLPCLASTVNTVEKXXX 685
            +  G      +PVF+DEISS+  E +G D+ LLDNCGIIPSNCLPCLASTV +VEK   
Sbjct: 168 SKSDGPSYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRS 227

Query: 686 XXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
                            FKWKEGH  A L SSK
Sbjct: 228 GSSSPPRPFKKTASKLSFKWKEGHANATLVSSK 260


>ref|XP_002888194.1| hypothetical protein ARALYDRAFT_475347 [Arabidopsis lyrata subsp.
           lyrata] gi|297334035|gb|EFH64453.1| hypothetical protein
           ARALYDRAFT_475347 [Arabidopsis lyrata subsp. lyrata]
          Length = 492

 Score =  122 bits (306), Expect = 1e-25
 Identities = 68/171 (39%), Positives = 103/171 (60%), Gaps = 1/171 (0%)
 Frame = +2

Query: 275 SNPTFHGNVEEAWFDSAAVLDSDWSDEDFQSIPDDAHSLNSFDGTIVTNVASEEYSDGSS 454
           +N TF  +V+EAWFDS    ++D  D+DF S+ +D  S+N  +   V++++S   S+   
Sbjct: 48  NNHTFRASVDEAWFDSNLAFETD-CDDDFHSVQEDILSVNGGERISVSSMSSVRDSN--- 103

Query: 455 NSVNGAAKSSLPPICCDFKPKSEEQMSGTKPVFVDEISS-AGETAGGDDSLLDNCGIIPS 631
             + G+A++SL  +    K +S   +   +PVF+DEISS AG+++  D+ LL+NCGI+PS
Sbjct: 104 --LGGSARNSLSDVISQSKAESA-LIDAKQPVFIDEISSNAGDSSRKDEGLLENCGILPS 160

Query: 632 NCLPCLASTVNTVEKXXXXXXXXXXXXXXXXXXXXFKWKEGHPTAALFSSK 784
           NCLPCL STV+++EK                    FKW+EGH T  LFS+K
Sbjct: 161 NCLPCLNSTVHSIEKRRSLSSSPPSTRKKAALKLSFKWREGHATGPLFSTK 211

