BLASTX nr result

ID: Catharanthus23_contig00012237 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00012237
         (2225 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI30576.3| unnamed protein product [Vitis vinifera]              251   1e-63
ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261...   245   7e-62
emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]   241   1e-60
ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292...   188   1e-44
gb|EOY07557.1| Uncharacterized protein isoform 2 [Theobroma cacao]    178   1e-41
gb|EOY07556.1| Uncharacterized protein isoform 1 [Theobroma cacao]    178   1e-41
gb|EOY07558.1| Uncharacterized protein isoform 3 [Theobroma cacao]    174   1e-40
gb|EOY07559.1| Uncharacterized protein isoform 4 [Theobroma cacao]    173   3e-40
gb|EMJ07008.1| hypothetical protein PRUPE_ppa010713mg [Prunus pe...   162   4e-37
gb|EOY07560.1| Uncharacterized protein isoform 5 [Theobroma cacao]    109   4e-21
ref|XP_006853660.1| hypothetical protein AMTR_s00056p00105160 [A...    61   2e-06

>emb|CBI30576.3| unnamed protein product [Vitis vinifera]
          Length = 693

 Score =  251 bits (640), Expect = 1e-63
 Identities = 195/629 (31%), Positives = 307/629 (48%), Gaps = 17/629 (2%)
 Frame = +1

Query: 49   CTADNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAV 228
            C+      +  VELEAMRK+D SWHPC VS  STG GL+V++ +   D +DII + EEA+
Sbjct: 122  CSMGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEAL 179

Query: 229  ARLRIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFT 402
            ARLRIRS PL+G  CS+++ G+RVLA  + +   L FDA V + +RVRHS RI CRCTF 
Sbjct: 180  ARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFV 239

Query: 403  VKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMD 582
            +KW+H  L+G T I+PSS +MK++T+SI +HP + AF   + T +C        V E +D
Sbjct: 240  IKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVD 299

Query: 583  CEMDIDE-LEKQIEQISNSADACRMKITK-VLSG-EVDVDERSQCNLIPASEVCDTYIHL 753
            CE+D+ + LEKQIE+ISN ADA + +I++ +L G + D+ E+  C+ +  S++  ++  +
Sbjct: 300  CEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQV 359

Query: 754  PPSQ-NSITGSTGGAQLTRPVETEFKHP-PPHSWFTEEASVEGRSRCNPIAACAALASLM 927
            P  Q N    ST  +   R V  E K P PP S   +E S E R+  +P+A+ AALAS+M
Sbjct: 360  PHEQENHFKRSTRSSSKLR-VNMEVKDPLPPDSSIQKELS-ENRAYLSPLASRAALASIM 417

Query: 928  SKSSENTPTMSFISNSSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVA 1107
            S   +       +  S    +EN        +T+     +L    K  KD  +    +  
Sbjct: 418  SNLPQK------LEFSIYHEEENGFACAPDNITNKHVTMDLLNGTKPVKDKLSSEIEAAF 471

Query: 1108 LNWE-------SKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNA 1266
            +  E       ++   S + +     + + + K    A+ S S +       +  P   +
Sbjct: 472  IPAEIFKSLITTEKGASRRPLLVEASSEIANPKSQNDASPSLSGLIEE--RELRQPAKES 529

Query: 1267 RRLTRSVIR--AIKEKQTVETKNVTEEILCSTFEQNLSVQKMDTLPDMEMVASVIDK-ES 1437
             R T S I+  A+      E K   EEI            K   L +  +  S + K E 
Sbjct: 530  -RFTSSAIQKHAVSSTSNAEMKTHAEEI------------KSVALTNKRLTRSAVHKQEE 576

Query: 1438 GLTISVKSKTQMDSKGICENNGSVMRSNGVGIQGLNNSRRLTRSAVKGKSKDSNGELPKG 1617
             L + VK +++++                      N+++ +  ++ +G     + + PK 
Sbjct: 577  NLAMEVKQRSEVN----------------------NSAQDIESNSSEGNVTIPDRKAPKK 614

Query: 1618 LEECSYPGHSKSDLEGNITSEREVPEMEKTISTSPVHETCPNPSIAEAYEKVQLSASIKT 1797
             +  S P  ++S                     SPV E        E  +K ++ ++++T
Sbjct: 615  KKPVSLPPAAQS---------------------SPVTE--------ERNKKRKMPSAVET 645

Query: 1798 TRKTEGNATDNVSLKQGVKRKSSASKNQE 1884
              KTEG  + N    +  K KS++SK QE
Sbjct: 646  ASKTEGKVSRNGGNSESQKSKSTSSKKQE 674


>ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261386 [Vitis vinifera]
          Length = 552

 Score =  245 bits (625), Expect = 7e-62
 Identities = 168/482 (34%), Positives = 260/482 (53%), Gaps = 47/482 (9%)
 Frame = +1

Query: 73   EVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSA 252
            +  VELEAMRK+D SWHPC VS  STG GL+V++ +   D +DII + EEA+ARLRIRS 
Sbjct: 7    DATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEALARLRIRSV 64

Query: 253  PLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGL 426
            PL+G  CS+++ G+RVLA  + +   L FDA V + +RVRHS RI CRCTF +KW+H  L
Sbjct: 65   PLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFVIKWLHQDL 124

Query: 427  EGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE- 603
            +G T I+PSS +MK++T+SI +HP + AF   + T +C        V E +DCE+D+ + 
Sbjct: 125  KGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVDCEVDLHKL 184

Query: 604  LEKQIEQISNSADACRMKITK-VLSG-EVDVDERSQCNLIPASEVCDTYIHLPPSQ-NSI 774
            LEKQIE+ISN ADA + +I++ +L G + D+ E+  C+ +  S++  ++  +P  Q N  
Sbjct: 185  LEKQIEEISNLADASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQVPHEQENHF 244

Query: 775  TGSTGGAQLTRPVETEFKHP-PPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951
              ST  +   R V  E K P PP S   +E S E R+  +P+A+ AALAS+MS   +   
Sbjct: 245  KRSTRSSSKLR-VNMEVKDPLPPDSSIQKELS-ENRAYLSPLASRAALASIMSNLPQKLE 302

Query: 952  -----------------------TMSFISNSSKINDENFLGTESGTVTSISTVKELFPSK 1062
                                   TM  ++ +  + D+     E+  + +      +   K
Sbjct: 303  FSIYHEEENGFACAPDNITNKHVTMDLLNGTKPVKDKLSSEIEAAFIPAEIFKSLITTEK 362

Query: 1063 KLFKDPETLNASSVALNWESKNKESSQEVASCVGNHVR-----------STKKMCLAARS 1209
               + P  + ASS   N +S+N  S           +R           + +K  +++ S
Sbjct: 363  GASRRPLLVEASSEIANPKSQNDASPSLSGLIEERELRQPAKESRFTSSAIQKHAVSSTS 422

Query: 1210 SSAMNRSPFENVEMPVTNARRLTRSVIR------AIKEKQTVETKNVTEEILCSTFEQNL 1371
            ++ M     E   + +TN +RLTRS +       A++ KQ  E  N  ++I  ++ E N+
Sbjct: 423  NAEMKTHAEEIKSVALTN-KRLTRSAVHKQEENLAMEVKQRSEVNNSAQDIESNSSEGNV 481

Query: 1372 SV 1377
            ++
Sbjct: 482  TI 483


>emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]
          Length = 1508

 Score =  241 bits (615), Expect = 1e-60
 Identities = 180/568 (31%), Positives = 278/568 (48%), Gaps = 76/568 (13%)
 Frame = +1

Query: 49   CTADNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAV 228
            C+      +  VELEAMRK+D SWHPC VS  STG GL+V++ +   D +DII + EEA+
Sbjct: 27   CSMGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEAL 84

Query: 229  ARLRIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEV------------------- 345
            ARLRIRS PL+G  CS+++ G+RVLA  + +   L FDA V                   
Sbjct: 85   ARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKEMSHEFXIECDLIDWGI 144

Query: 346  ---IEVVRVRHSKRIHCRCTFTVKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFF 516
               +  +RVRHS RI CRCTF +KW+H  L+G T I+PSS +MK++T+SI +HP + AF 
Sbjct: 145  XVNVVALRVRHSTRISCRCTFVIKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFL 204

Query: 517  NTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEKQIEQISNSADACRMKITK-VLSG-EVD 687
              + T +C        V E +DCE+D+ + LEKQIE+ISN ADA + +I++ +L G + D
Sbjct: 205  KPIKTLNCSAAPSFSTVFEDVDCEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKAD 264

Query: 688  VDERSQCNLIPASEVCDTYIHLPPSQ-NSITGSTGGAQLTRPVETEFKHP-PPHSWFTEE 861
            + E+  C+ +  S++  ++  +P  Q N    ST  +   R V  E K P PP S   EE
Sbjct: 265  IKEQMDCSPVAESKITSSHFQVPHEQENHFKRSTRSSSKLR-VNMEVKDPLPPDSSIQEE 323

Query: 862  ASVEGRSRCNPIAACAALASLMSKSSENTP-----------------------TMSFISN 972
             S E R+  +P+A+ AALAS+MS   +                          TM  ++ 
Sbjct: 324  LS-ENRAYLSPLASRAALASIMSNLPQKLEFSIXHEEENGFACAPDNITNKHVTMDLLNG 382

Query: 973  SSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVALNWESKNKESSQEVA 1152
            +  + D+     E+  + +      +   K   + P  + ASS   N +S+N  S     
Sbjct: 383  TKPVKDKLSSEIEAAFIPAEIFKSLITTEKGASRRPLLVEASSEIANPKSQNDASPSLSG 442

Query: 1153 SCVGNHVRSTKKMC----------LAARSSSAMNRSPFENVEMPVTNARRLTRSVIR--- 1293
                  +R   K              + +S+A  ++  E ++      +RLTRS +    
Sbjct: 443  LIEERELRQPAKESRFTSSAIQKHAVSSTSNAEMKTHAEEIKSVALXNKRLTRSAVHKQE 502

Query: 1294 ---AIKEKQTVETKNVTEEILCSTFEQNLSV--------QKMDTLPDMEMVASVIDKESG 1440
               A++ KQ  E  N  ++I  ++ E N+++        +K  +LP     +S + +E  
Sbjct: 503  ENLAMEVKQRSEVNNSAQDIESNSSEGNVTIPDRKAPKKKKPVSLPPAAQTSSPVTEERN 562

Query: 1441 LTISVKSKTQMDSKGICENNGSVMRSNG 1524
                + S  +  SK      G V R+ G
Sbjct: 563  KKRKMPSAVETASK----TEGKVSRNGG 586


>ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292719 [Fragaria vesca
            subsp. vesca]
          Length = 580

 Score =  188 bits (477), Expect = 1e-44
 Identities = 172/615 (27%), Positives = 296/615 (48%), Gaps = 10/615 (1%)
 Frame = +1

Query: 70   AEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRS 249
            AE   ELEA+ K+D SW+PC VS  ST   L+V++   + + +D++L+ +EA+ RLR RS
Sbjct: 7    AENATELEALCKQDSSWYPCHVSLSSTEDSLIVDFG--RQELEDMVLNKDEALMRLRFRS 64

Query: 250  APLEGVACSIVQPGDRVLARRNR--MNLFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNG 423
             PL+G  CS ++ G+ VLA       +  +DA+V +V RVRHS R++CRC+F + W+H  
Sbjct: 65   GPLQGDDCSHIE-GEHVLAIHKSPFKSYLYDAKVEKVTRVRHSTRVYCRCSFMILWLHPD 123

Query: 424  LEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE 603
             +G+   I SS +MK++++SI+ HPT+ A F ++     +    L  + E +D E D+++
Sbjct: 124  FKGQMVTITSSSIMKLASKSINSHPTVAALFKSVKQMGLYTAPLLPIMHEDIDVEFDLNK 183

Query: 604  -LEKQIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASEVCDTYIHLPPSQNSITG 780
             L KQIE+I+ SA+    +IT  +   V  D                  H      S+  
Sbjct: 184  LLGKQIEEINISANRVTNEITVDIIEGVKADSSGHVTESKIGTSKAQVSHDQDQLKSVAN 243

Query: 781  STGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTPTMS 960
             +G  ++ +  E E  HPP  S   +E   E R   +P+AA AALASL+S + ++     
Sbjct: 244  RSGNLEVNK--EDEDPHPPFLS--KQEEHSEHRCHISPLAARAALASLVSLTHKHIAI-- 297

Query: 961  FISNSSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVALNWESKNKESS 1140
                             SGT        ELF S          +++ +++   S   ES 
Sbjct: 298  -----------------SGT--------ELFKSS---------DSTDLSIKVSSDRTESP 323

Query: 1141 QEVASCVGNHVRSTKKMCLAA--RSSSAMNRSPFENVEMPVTNARRLTRSVIRAIKEKQT 1314
            +   + +G+  R+T+   L    + +S ++ S        VTN   LTRS ++  K+  +
Sbjct: 324  KNGNANLGSGARTTRSRGLKGFEKQNSDLHDSAEAIKLRAVTNRGWLTRSAVKEEKDISS 383

Query: 1315 VETKNVTEEILCSTFEQNLSVQKMDTLPDMEMVASVIDKESGLT-ISVKSKTQMDSKGIC 1491
            V +K+ +EE   +   ++ S    D +   +    V+ K++G++  +V S    +S G  
Sbjct: 384  VASKHGSEESESAQSTESYSSDGTDIVHGNK----VLTKKNGISKKAVSSPLHSESNGHK 439

Query: 1492 ENNGSVMRSNGVGIQGLNNSRRLTRSAVKGKSKDSNGELPKGLEECSYPGHSKSDLEGN- 1668
            EN    + S  +G+  + ++   T++     +KD+N  +   L   +    S+   + N 
Sbjct: 440  EN----LTSGDLGV--IQDAYVQTKTC----AKDTNSSVSTNLRRLT---RSRVSCQDNL 486

Query: 1669 ITSEREVPEMEKTISTSPVHETCPNPSIAEAYE---KVQLSASIKTTRKTEGNATDNVSL 1839
            I  E    E E   S      +  + + + + E   +   S  ++ +R+TEG  + +   
Sbjct: 487  IVPECHAVEKENRESKKKKAGSASSQNYSTSGEDGNRQHNSGVVRNSRQTEGKMSGSGDN 546

Query: 1840 KQGVKRKSSASKNQE 1884
             QG KRKS++S  QE
Sbjct: 547  SQGRKRKSNSSSRQE 561


>gb|EOY07557.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 611

 Score =  178 bits (451), Expect = 1e-41
 Identities = 140/419 (33%), Positives = 207/419 (49%), Gaps = 22/419 (5%)
 Frame = +1

Query: 82   VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261
            VELEA RKED SWHPC V   S+G  L+V +   + + DD++L  EE +  LR RS PL+
Sbjct: 11   VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68

Query: 262  GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435
               C  ++ G+RVLA R      LF DA V++V RVRHSKR  CRCTF +KW+   LEG+
Sbjct: 69   VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127

Query: 436  TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612
            T  +PSS +MK++T+SI  HP I               SPL  + EG D E+D+++ L+K
Sbjct: 128  TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187

Query: 613  QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771
            QIEQISN ADA +  I + +        + Q    P +E       V D + HL      
Sbjct: 188  QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243

Query: 772  ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951
             T ST   Q    +  E ++   H+   +EA ++ RS  +P+A+ AALAS +  + +   
Sbjct: 244  TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 296

Query: 952  TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101
             +    +SS        G +S  + ++S         E+ P      D    P+    SS
Sbjct: 297  CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 356

Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272
                  WE++NK +S E+         S  K+   + +S           E+P++ A++
Sbjct: 357  CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 410


>gb|EOY07556.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 567

 Score =  178 bits (451), Expect = 1e-41
 Identities = 140/419 (33%), Positives = 207/419 (49%), Gaps = 22/419 (5%)
 Frame = +1

Query: 82   VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261
            VELEA RKED SWHPC V   S+G  L+V +   + + DD++L  EE +  LR RS PL+
Sbjct: 11   VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68

Query: 262  GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435
               C  ++ G+RVLA R      LF DA V++V RVRHSKR  CRCTF +KW+   LEG+
Sbjct: 69   VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127

Query: 436  TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612
            T  +PSS +MK++T+SI  HP I               SPL  + EG D E+D+++ L+K
Sbjct: 128  TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187

Query: 613  QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771
            QIEQISN ADA +  I + +        + Q    P +E       V D + HL      
Sbjct: 188  QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243

Query: 772  ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951
             T ST   Q    +  E ++   H+   +EA ++ RS  +P+A+ AALAS +  + +   
Sbjct: 244  TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 296

Query: 952  TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101
             +    +SS        G +S  + ++S         E+ P      D    P+    SS
Sbjct: 297  CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 356

Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272
                  WE++NK +S E+         S  K+   + +S           E+P++ A++
Sbjct: 357  CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 410


>gb|EOY07558.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 409

 Score =  174 bits (442), Expect = 1e-40
 Identities = 117/290 (40%), Positives = 161/290 (55%), Gaps = 10/290 (3%)
 Frame = +1

Query: 82  VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261
           VELEA RKED SWHPC V   S+G  L+V +   + + DD++L  EE +  LR RS PL+
Sbjct: 11  VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68

Query: 262 GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435
              C  ++ G+RVLA R      LF DA V++V RVRHSKR  CRCTF +KW+   LEG+
Sbjct: 69  VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127

Query: 436 TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612
           T  +PSS +MK++T+SI  HP I               SPL  + EG D E+D+++ L+K
Sbjct: 128 TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187

Query: 613 QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771
           QIEQISN ADA +  I + +        + Q    P +E       V D + HL      
Sbjct: 188 QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243

Query: 772 ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALAS 921
            T ST   Q    +  E ++   H+   +EA ++ RS  +P+A+ AALAS
Sbjct: 244 TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALAS 289


>gb|EOY07559.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 565

 Score =  173 bits (438), Expect = 3e-40
 Identities = 139/419 (33%), Positives = 206/419 (49%), Gaps = 22/419 (5%)
 Frame = +1

Query: 82   VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261
            VELEA RKED SWHPC V   S+G  L+V +   + + DD++L  EE +  LR RS PL+
Sbjct: 11   VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68

Query: 262  GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435
               C  ++ G+RVLA R      LF DA V++  RVRHSKR  CRCTF +KW+   LEG+
Sbjct: 69   VDDCFHIEEGERVLADRKSQFKILFHDAVVVD--RVRHSKR-GCRCTFMIKWLDQDLEGQ 125

Query: 436  TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612
            T  +PSS +MK++T+SI  HP I               SPL  + EG D E+D+++ L+K
Sbjct: 126  TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 185

Query: 613  QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771
            QIEQISN ADA +  I + +        + Q    P +E       V D + HL      
Sbjct: 186  QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 241

Query: 772  ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951
             T ST   Q    +  E ++   H+   +EA ++ RS  +P+A+ AALAS +  + +   
Sbjct: 242  TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 294

Query: 952  TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101
             +    +SS        G +S  + ++S         E+ P      D    P+    SS
Sbjct: 295  CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 354

Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272
                  WE++NK +S E+         S  K+   + +S           E+P++ A++
Sbjct: 355  CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 408


>gb|EMJ07008.1| hypothetical protein PRUPE_ppa010713mg [Prunus persica]
          Length = 238

 Score =  162 bits (411), Expect = 4e-37
 Identities = 90/207 (43%), Positives = 134/207 (64%), Gaps = 5/207 (2%)
 Frame = +1

Query: 58  DNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARL 237
           D   AE   ELEAM KED SWHPC VS  ST   L+V++   + +  D++L+T+EA+ RL
Sbjct: 3   DTSEAENVTELEAMCKEDSSWHPCQVSLSSTKDSLIVDFGGQELE--DMVLNTDEALTRL 60

Query: 238 RIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFTVKW 411
           R R APL+G  C+ ++ G+ VLA  +    + FFDA+V +V+RVRHS R++CRCTF +KW
Sbjct: 61  RFRCAPLQGDDCTRIE-GEHVLAINKSQSKSHFFDAKVEKVLRVRHSTRVYCRCTFMIKW 119

Query: 412 IHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVS--PLRAVAEGMDC 585
           +H  L+G+   +PSS +MK++ ++I++HPT+ AF  ++        S  P+    E    
Sbjct: 120 LHQDLKGQMVTVPSSSIMKLTGKNINVHPTVSAFLKSVKQMGLDSASSVPVMLEVEDFAV 179

Query: 586 EMDIDE-LEKQIEQISNSADACRMKIT 663
           E+D+++ LEKQIE I+ SA+  R  IT
Sbjct: 180 ELDLNKFLEKQIEDITVSANEFRKAIT 206


>gb|EOY07560.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 468

 Score =  109 bits (273), Expect = 4e-21
 Identities = 98/327 (29%), Positives = 151/327 (46%), Gaps = 20/327 (6%)
 Frame = +1

Query: 352  VVRVRHSKRIHCRCTFTVKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLT 531
            V RVRHSKR  CRCTF +KW+   LEG+T  +PSS +MK++T+SI  HP I         
Sbjct: 2    VDRVRHSKR-GCRCTFMIKWLDQDLEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKH 60

Query: 532  SSCFDVSPLRAVAEGMDCEMDIDE-LEKQIEQISNSADACRMKITKVLSGEVDVDERSQC 708
                  SPL  + EG D E+D+++ L+KQIEQISN ADA +  I + +        + Q 
Sbjct: 61   RGLSYSSPLLTILEGTDSEIDLNKLLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 120

Query: 709  NLIPASE-------VCDTYIHLPPSQNSITGSTGGAQLTRPVETEFKHPPPHSWFTEEAS 867
               P +E       V D + HL       T ST   Q    +  E ++   H+   +EA 
Sbjct: 121  PHKPTAESNACVPAVADHHNHL----KRTTRSTRKLQ----INIEAENQSGHTISMKEAF 172

Query: 868  VEGRSRCNPIAACAALASLMSKSSENTPTMSFISNSSKINDENFLGTESGTVTSIS---- 1035
            ++ RS  +P+A+ AALAS +  + +    +    +SS        G +S  + ++S    
Sbjct: 173  IQSRSHLSPLASRAALASSLLTAKK---CLDMDLSSSMTASMFMKGKDSSDILAVSIPLV 229

Query: 1036 --TVKELFPSKKLFKD----PETLNASSV--ALNWESKNKESSQEVASCVGNHVRSTKKM 1191
                 E+ P      D    P+    SS      WE++NK +S E+         S  K+
Sbjct: 230  SEASHEISPHISTQGDASCEPQPTKPSSCIPTKGWENENK-TSDEINCTAEQRTYSPVKI 288

Query: 1192 CLAARSSSAMNRSPFENVEMPVTNARR 1272
               + +S           E+P++ A++
Sbjct: 289  TAESVTSGVAT----STAELPISRAKK 311


>ref|XP_006853660.1| hypothetical protein AMTR_s00056p00105160 [Amborella trichopoda]
           gi|548857321|gb|ERN15127.1| hypothetical protein
           AMTR_s00056p00105160 [Amborella trichopoda]
          Length = 228

 Score = 61.2 bits (147), Expect = 2e-06
 Identities = 46/124 (37%), Positives = 64/124 (51%), Gaps = 7/124 (5%)
 Frame = +1

Query: 79  EVELEAMRKEDFSWHPCTVSPC-----STGVGLMVEYSNDKSDPDDIILSTEEAVARLRI 243
           E+E EA   +D +W+   +        S    + V ++   ++ D+ + + + AV R   
Sbjct: 97  ELEFEARSAKDGAWYDVALFLTHRILHSGEPEVRVRFTGFGAEEDEWV-NVKRAVRR--- 152

Query: 244 RSAPLEGVACSIVQPGDRVLARRNRMNL--FFDAEVIEVVRVRHSKRIHCRCTFTVKWIH 417
           RS PLE   C  V PGD VL  R   NL  +FDA VIEV R RH  R  CRCTF V++ H
Sbjct: 153 RSIPLESSECGKVMPGDLVLCFREGENLATYFDAHVIEVQRRRHDLR-GCRCTFLVRYDH 211

Query: 418 NGLE 429
           +  E
Sbjct: 212 DQAE 215


Top