BLASTX nr result

ID: Gardenia21_contig00030171 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Gardenia21_contig00030171
         (559 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP20635.1| unnamed protein product [Coffea canephora]            131   2e-28
emb|CDP17945.1| unnamed protein product [Coffea canephora]            124   3e-26
emb|CDP11040.1| unnamed protein product [Coffea canephora]            101   2e-19
ref|XP_004247264.1| PREDICTED: uncharacterized protein LOC101265...    92   2e-16
emb|CDP20087.1| unnamed protein product [Coffea canephora]             84   6e-14
ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601...    84   6e-14
ref|XP_009618989.1| PREDICTED: uncharacterized protein LOC104111...    78   3e-12
ref|XP_009777052.1| PREDICTED: uncharacterized protein LOC104226...    77   5e-12
ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167...    75   2e-11
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    72   2e-10
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    71   3e-10
ref|XP_009630874.1| PREDICTED: uncharacterized protein LOC104120...    71   4e-10
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    69   1e-09
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    69   2e-09
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    68   3e-09
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    67   4e-09
ref|XP_009621360.1| PREDICTED: uncharacterized protein LOC104113...    67   6e-09
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    67   6e-09
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    66   1e-08
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    66   1e-08

>emb|CDP20635.1| unnamed protein product [Coffea canephora]
          Length = 529

 Score =  131 bits (329), Expect = 2e-28
 Identities = 75/197 (38%), Positives = 105/197 (53%), Gaps = 34/197 (17%)
 Frame = -3

Query: 557 TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHVVCDFVQDGVWN 378
           TW+RM+ IQ  AE++I W L +G V+FWHDN +G GPLC QVE F E  V D+V+ G WN
Sbjct: 107 TWKRMMAIQGKAEQHILWQLSDGSVNFWHDNWLGQGPLCHQVETFQECAVYDYVEQGRWN 166

Query: 377 VQKLTQWVPADCVRKILAL---LHR------------------------------AWSWK 297
           V KL   +P+  V +IL +    H                               +W + 
Sbjct: 167 VHKLNDVLPSWLVGRILKVDPPCHTFPDSMVWAPSTSGDFSISTAYKCVQGSGNISWLYS 226

Query: 296 IRW-YGLRAPQELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFC 120
             W  GL  P  +S   +  R+L +R+PV++ L   G+ GPS C CCS+P  E++DH+FC
Sbjct: 227 SVWIQGL--PVNISFFMM--RLLRARLPVMDRLHHLGILGPSRCFCCSSPCSESIDHIFC 282

Query: 119 TGELARGLWGFFGANMG 69
            GE+A  +W FF   +G
Sbjct: 283 NGEVASKIWHFFEVVVG 299


>emb|CDP17945.1| unnamed protein product [Coffea canephora]
          Length = 463

 Score =  124 bits (311), Expect = 3e-26
 Identities = 76/213 (35%), Positives = 108/213 (50%), Gaps = 37/213 (17%)
 Frame = -3

Query: 545 MVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHVVCDFVQDGVWNVQKL 366
           MV +Q   E+NI+WV+  G +DFWHDN MG G LC +VE+F +H V DFV    WNV  L
Sbjct: 1   MVTMQRFGEDNISWVVREGALDFWHDNWMGSGALCDKVEVFHDHSVVDFVDQRAWNVDML 60

Query: 365 TQWVPADCVRKILAL------LHRAWSWKI---------------------RWYGLRAPQ 267
            Q++  + V ++L +       +    W +                      W   R  Q
Sbjct: 61  HQFLDGELVTQVLEIDPPTDRGNDTMVWALTNSGVFSTASAYSLIRQSNDNSWLFGRIWQ 120

Query: 266 ELSQCRLHF---RVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELARGL 96
           +    ++ F   R+L  R+P+++ L+RFGV GPS C CC NP  E ++HVFC+GE AR +
Sbjct: 121 QGLPVKVSFFMLRLLQGRLPLMDRLKRFGVCGPSRCLCCQNPQEEDLNHVFCSGEGARLV 180

Query: 95  WGFFGANMG-YLGNH------GSCILYR*VGER 18
           W  F +  G + G H       SC L R   +R
Sbjct: 181 WRHFESTAGEFSGVHTVRHMVWSCWLRRGTNDR 213


>emb|CDP11040.1| unnamed protein product [Coffea canephora]
          Length = 522

 Score =  101 bits (252), Expect = 2e-19
 Identities = 64/182 (35%), Positives = 89/182 (48%), Gaps = 30/182 (16%)
 Frame = -3

Query: 548 RMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHVVCDFVQDGVWNVQK 369
           R++ I+ VAE+N+ W +  G+ +FW DN MG GPLC +++   +H+V DFV +G WN Q 
Sbjct: 150 RLLQIREVAEQNLWWEMRAGQCNFWFDNWMGSGPLCQRLQSVSDHLVRDFVLNGRWNQQL 209

Query: 368 LTQWVPADCVRKILALLHRAWSWKIR--W-------YGLRAPQEL--SQCRLHF------ 240
           L  WVP D V +I+  +    S   R  W       + + +  EL  SQ    F      
Sbjct: 210 LRLWVPDDIVSEIVTKVAPVGSADDRAVWALTESGDFSISSTYELLGSQTPSSFMFERVW 269

Query: 239 -------------RVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELARG 99
                        R+L  R+P+   L R  VYGPS C CC     ET+DHVF    +  G
Sbjct: 270 HPVIPIKISFFMVRLLRDRLPLASSLGRLQVYGPSKCFCCLASQSETLDHVFADEFIGVG 329

Query: 98  LW 93
            W
Sbjct: 330 SW 331


>ref|XP_004247264.1| PREDICTED: uncharacterized protein LOC101265276 [Solanum
           lycopersicum]
          Length = 376

 Score = 91.7 bits (226), Expect = 2e-16
 Identities = 57/203 (28%), Positives = 87/203 (42%), Gaps = 34/203 (16%)
 Frame = -3

Query: 554 WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPL---CAQVEIFGEHVVCDFVQDGV 384
           W  M+  +A+AE  I W + +G   FW D+ +GD PL   C  +       +  F+ +G 
Sbjct: 145 WNFMMKNKAIAESQIQWRINSGISSFWWDDWLGDDPLTPQCNHITSLNNTTISPFLINGS 204

Query: 383 WNVQKLTQWVPADCVRKILALLHR-----------------AWSWKIRWYGLRAPQELSQ 255
           WN   L QWVP   + K+L +  +                  +S    W  +R  +E + 
Sbjct: 205 WNETLLRQWVPPLLISKVLNIDIKFQEHMLDERVRKTSEKGQFSCATAWEHIRHRKEKNN 264

Query: 254 CR--------------LHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCT 117
           C               L +R L  ++P  E + +FG      CSCC +PG +T+DH+F T
Sbjct: 265 CSKNIWHKLILFKISLLVWRTLRCKLPTNERIIKFGREAAQ-CSCCYSPGEDTIDHIFVT 323

Query: 116 GELARGLWGFFGANMGYLGNHGS 48
           G     +W FF A  G    H +
Sbjct: 324 GHFDNNIWRFFSAAAGLQNEHST 346


>emb|CDP20087.1| unnamed protein product [Coffea canephora]
          Length = 167

 Score = 83.6 bits (205), Expect = 6e-14
 Identities = 33/61 (54%), Positives = 43/61 (70%)
 Frame = -3

Query: 557 TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHVVCDFVQDGVWN 378
           TW+RMV IQ +A++ ++WVL  G + FWHDN +G GPLC QV+ F E  V DFV  G WN
Sbjct: 107 TWKRMVSIQGIAKQKVSWVLARGDLSFWHDNWLGTGPLCPQVDTFQECAVSDFVDQGCWN 166

Query: 377 V 375
           +
Sbjct: 167 I 167


>ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601483 [Solanum tuberosum]
          Length = 2019

 Score = 83.6 bits (205), Expect = 6e-14
 Identities = 48/170 (28%), Positives = 85/170 (50%), Gaps = 18/170 (10%)
 Frame = -3

Query: 524  AEENITWVLGNGKVDFWHDN*MGDGPLC---AQVEIFGEHVVCDFVQDGVWNVQKLTQWV 354
            A  NI W + +G   FW DN +G GPL    +    F    V +F+++G WN+ K+ +  
Sbjct: 799  AGSNIQWRIRSGSCSFWWDNWLGVGPLAHYTSNSNRFNNDSVSEFIEEGHWNIPKVLRVA 858

Query: 353  P-ADCVRKILALLHRAWSWKIRWYGLRAPQELSQCR--------------LHFRVLVSRV 219
            P +  V K+ +     +S    W  +R  +E+++                L +R +  ++
Sbjct: 859  PPSQAVWKLNS--SGLFSVSSAWNSIREKREITKINKYTWHPKIPFKCSFLLWRAIRGKL 916

Query: 218  PVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELARGLWGFFGANMG 69
            P  E L  FG+  PS C CC +PG++T++H   +G+ A+ +W +F  ++G
Sbjct: 917  PTNEKLLSFGIE-PSDCHCCHSPGIDTIEHTLNSGDFAKNVWKYFAISLG 965



 Score = 81.3 bits (199), Expect = 3e-13
 Identities = 56/204 (27%), Positives = 83/204 (40%), Gaps = 34/204 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLC---AQVEIFGEHVVCDFVQDG 387
            TW+ ++  +   EE+I W L +G   FW DN +G GPL              V +F+ +G
Sbjct: 473  TWKHLMHNKHKVEEHIQWKLNSGSCSFWWDNWLGVGPLARFSTDSNRLNNTTVAEFLVEG 532

Query: 386  VWNVQKLTQWVPADCVRKILAL-------LHRAWSWKIR----------WYGLRAPQ--- 267
             WNV KL Q  P D +  IL+        +     WK+           W  +R  +   
Sbjct: 533  QWNVNKLIQQAPNDYLANILSTKFYVQQEIPNQAIWKLNSDGNFTYSSAWNAIREKRTKT 592

Query: 266  -----------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFC 120
                             L +R L  ++P  E L  FG    +   CC+  G++T++H F 
Sbjct: 593  IFNTFIWHKSIPFKTSFLLWRTLRGKLPTNEKLISFGNEPANCFCCCNRSGLDTIEHTFN 652

Query: 119  TGELARGLWGFFGANMGYLGNHGS 48
             G+ A  LW  F A  G +  H S
Sbjct: 653  KGQFATYLWKSFAAAAGIITYHNS 676


>ref|XP_009618989.1| PREDICTED: uncharacterized protein LOC104111091 [Nicotiana
            tomentosiformis]
          Length = 775

 Score = 77.8 bits (190), Expect = 3e-12
 Identities = 52/196 (26%), Positives = 86/196 (43%), Gaps = 33/196 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEI---FGEHVVCDFVQDG 387
            +W +++ I+  AE+N+TW +  G  +   DN  G G L   +     F +  V D++Q G
Sbjct: 494  SWGKLMQIKRKAEQNLTWKIQMGNNNLRWDNWTGKGALAELIPCQAKFAKDKVEDYIQGG 553

Query: 386  VWNVQKLTQWVPADCVRKILALL--------HRAW--------SWKIRWYGLRAPQELSQ 255
             WNV KL + +P   + +I  L+        +  W        S K   +  R P++  Q
Sbjct: 554  TWNVMKLAKIIPVHIISQITKLMIGDKNTQDYAIWDPAADGQFSTKSTHHLTRTPRQKDQ 613

Query: 254  CRLHF--------------RVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCT 117
             R  F              R++  ++P  + ++ FG    S C CC++   ET  HVF  
Sbjct: 614  FRYKFWHPNLPFKISFLAWRLIQRKLPFDDTMRIFGGNVVSECVCCNDTKRETFQHVFLE 673

Query: 116  GELARGLWGFFGANMG 69
            G+    +W  FG  +G
Sbjct: 674  GDAGNRMWKKFGGPLG 689


>ref|XP_009777052.1| PREDICTED: uncharacterized protein LOC104226713 [Nicotiana
           sylvestris]
          Length = 345

 Score = 77.0 bits (188), Expect = 5e-12
 Identities = 50/169 (29%), Positives = 86/169 (50%), Gaps = 7/169 (4%)
 Frame = -3

Query: 554 WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEI-FGE----HVVCDFVQD 390
           WR+M++ + + E +ITW    G   FW DN  G G L   V + FG     H V D V++
Sbjct: 61  WRKMLECRDIIEHHITWHPKMGSSLFWFDNWTGLGALYFLVPLDFGIDENIHNVYDMVEE 120

Query: 389 GVWNVQKLTQWVPADCVRKILALLHRAWSWKIRWYGLRAPQELSQCRLHFRVLV--SRVP 216
           GVWNV +L + +P +    I+  +     + +    +  P   + C     +LV  +++P
Sbjct: 121 GVWNVDRLLEVLPEEFALHIVKKIRPPVIYNV----IVLP---TGCLKPGAILVWKAKLP 173

Query: 215 VLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELARGLWGFFGANMG 69
           + + ++R G + PS C CC+ P  E++ H+F T  +   +W +F +  G
Sbjct: 174 LDDFMRRLGYFMPSRCWCCAEPKQESLVHLFFTSNVVVTVWKYFLSRAG 222


>ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167219 [Sesamum indicum]
          Length = 1203

 Score = 75.1 bits (183), Expect = 2e-11
 Identities = 56/188 (29%), Positives = 87/188 (46%), Gaps = 31/188 (16%)
 Frame = -3

Query: 554  WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGVW 381
            W+RM   +  A+  I W LG G + FW DN +G+ PL   +  F  +   V ++ ++  W
Sbjct: 870  WKRMCRHRKEADRQIFWSLGKGHISFWFDNWIGEKPLFEIMPDFEWNTTPVNNYWENNSW 929

Query: 380  NVQKLTQWVPADCVRKILALLHRA---------------WSWKIRWYGL---RAPQEL-- 261
            NV KL + + AD V +I  +                   +S K  W  L   RA Q+L  
Sbjct: 930  NVAKLREVLTADMVHQICQIPFDVDTSDTPLWKLSGDGIFSMKATWNSLRQTRATQQLVK 989

Query: 260  ---------SQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGEL 108
                     +     +R++  ++PV E LQ+ G+   S CSCC++  VE++ HVF  G  
Sbjct: 990  EIWSPFVTPTMSVFMWRLINDKLPVDEKLQKKGIQLASKCSCCNH--VESLQHVFIEGNG 1047

Query: 107  ARGLWGFF 84
             R +W  F
Sbjct: 1048 IRCVWEHF 1055


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 72.0 bits (175), Expect = 2e-10
 Identities = 51/189 (26%), Positives = 86/189 (45%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+RMV I ++ E+NI W +G+G++ FWHD  MG+ PL  + + F   +  V DF  +  
Sbjct: 1752 TWKRMVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNRNQAFASSMAQVSDFFLNNS 1811

Query: 383  WNVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQ------ 267
            WNV+KL   +  + V +I+ +                +  +S K  W  +R  +      
Sbjct: 1812 WNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVF 1871

Query: 266  --------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
                     L+     +R+L   +PV   ++  G    S C CC +   E++ HV     
Sbjct: 1872 NFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSE--ESLMHVMWKNP 1929

Query: 110  LARGLWGFF 84
            +A  +W +F
Sbjct: 1930 VANQVWSYF 1938


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 71.2 bits (173), Expect = 3e-10
 Identities = 52/189 (27%), Positives = 83/189 (43%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+RMV I ++ E+NI W +G+GK+ FWHD  MG+ PL  + + F   +  V DF  +  
Sbjct: 3040 TWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPLVIRNQEFASSMAQVSDFFLNNS 3099

Query: 383  WNVQKLTQWVPADCVRKILALLHRA---------------WSWKIRW------------- 288
            W+++KL   +  + V +I  +   A               +S K  W             
Sbjct: 3100 WDIEKLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTY 3159

Query: 287  -YGLRAPQELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
             Y       L+     +R+L   VPV   ++  G    S C CC +   E++ HV     
Sbjct: 3160 NYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSE--ESLMHVMWDNP 3217

Query: 110  LARGLWGFF 84
            +A  +W +F
Sbjct: 3218 VANQVWSYF 3226



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 50/188 (26%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
 Frame = -3

Query: 554  WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGVW 381
            W+RM+  + VA +NI W +G G++ FWHD  MGD PL      F   +  V  F     W
Sbjct: 1247 WKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFPSFHNDMSHVHKFYNGDEW 1306

Query: 380  NVQKLTQWVPADCVRKILALLHRAWSWKIRWYGL------------------RAPQELSQ 255
            ++ KL  ++P   V +IL +        + ++ L                  + P  L  
Sbjct: 1307 DIVKLNSYLPTSLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLS 1366

Query: 254  CRLH-----------FRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGEL 108
               H           +RVL + +PV   ++  G++  S C CC +   E++ HV     +
Sbjct: 1367 FNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE--ESLIHVLWENPV 1424

Query: 107  ARGLWGFF 84
            A+ +W FF
Sbjct: 1425 AKQVWNFF 1432


>ref|XP_009630874.1| PREDICTED: uncharacterized protein LOC104120752 [Nicotiana
           tomentosiformis]
          Length = 283

 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 54/180 (30%), Positives = 78/180 (43%), Gaps = 32/180 (17%)
 Frame = -3

Query: 512 ITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV---VCDFVQDGVWNVQKLTQWVPADC 342
           I W +G+G   FW DN    GPL    +     +   V DF+ +G W    L+  +PA  
Sbjct: 39  ILWRIGDGDSSFWWDNWTSLGPLSKLTDGRPTPMNIKVKDFISNGRWRWDLLSLLLPASV 98

Query: 341 VRKI----LALLHRAWSW-----------KIRWYGLRAPQE-------LSQCRLHF---- 240
           V KI    +A   + WS+           K  W  LR  +        L   R+ F    
Sbjct: 99  VNKIQRVEVAQSRKDWSYWTLENSGSFSTKSTWQQLRTSRSETFIEKSLWHKRMPFNFSF 158

Query: 239 ---RVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELARGLWGFFGANMG 69
              R+L ++V   E +QR G+  PS CSCC+    ET  H+F   ++AR +W +F    G
Sbjct: 159 LMMRLLKNKVSTDERIQRVGIMIPSRCSCCTRHQHETSSHLFGDSDIARQVWNYFSGTYG 218


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+RM+   A  E+++ W +G G + FWHD  MGD PL +  + F   +  VCDF  +  
Sbjct: 448  TWKRMLTSSATTEQHMRWRVGQGNLFFWHDCWMGDAPLISSNQEFTSSMVQVCDFFMNNS 507

Query: 383  WNVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQ------ 267
            WNV+KL   +  + V +I  +                +  +S K  W  +R  +      
Sbjct: 508  WNVEKLKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVF 567

Query: 266  --------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
                     L+     +R+L   +PV   ++  G+   S C CC +   E++ HV     
Sbjct: 568  NFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE--ESIMHVMWDNP 625

Query: 110  LARGLWGFF 84
            +A  +W +F
Sbjct: 626  VAMQVWNYF 634


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 31/188 (16%)
 Frame = -3

Query: 554  WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGVW 381
            W+RMV  + VA +N  W +G G + FWHD  MGD PL      F   +  V +F     W
Sbjct: 1493 WKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFPHFRNDMSTVHNFFNGHNW 1552

Query: 380  NVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLR---APQELSQ 255
            +V KL  ++P + V +IL +                +  +S +  W  +R   +P  L  
Sbjct: 1553 DVDKLNLYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCS 1612

Query: 254  CRLH-----------FRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGEL 108
               H           +RV  + +PV   L+  G +  S C CC++   E++ HV     +
Sbjct: 1613 LLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCNSE--ESLIHVLWDNPI 1670

Query: 107  ARGLWGFF 84
            A+ +W FF
Sbjct: 1671 AKQVWNFF 1678


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 48/189 (25%), Positives = 83/189 (43%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+RM+    + E+++ W +G G V FWHD  MG+ PL +  + F   +  VCDF  +  
Sbjct: 1789 TWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTSSMVQVCDFFTNNS 1848

Query: 383  WNVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQ------ 267
            WN++KL   +  + V +I  +                +  +S K  W  +R  +      
Sbjct: 1849 WNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVF 1908

Query: 266  --------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
                     L+     +R+L   +PV   ++  G+   S C CC +   E++ HV     
Sbjct: 1909 NFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE--ESIMHVMWDNP 1966

Query: 110  LARGLWGFF 84
            +A  +W +F
Sbjct: 1967 VAMQVWNYF 1975


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 67.4 bits (163), Expect = 4e-09
 Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 31/188 (16%)
 Frame = -3

Query: 554  WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGVW 381
            W+RM+  + VA +NI W +G G++ FWHD  MGD PL      F   +  V  F    VW
Sbjct: 1490 WKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFHNDMSHVHKFYNGDVW 1549

Query: 380  NVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQ------- 267
            +++KL+  +P   V +IL +                +  +S    W  +R  Q       
Sbjct: 1550 DIEKLSSCLPTSLVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQRQTPNALFS 1609

Query: 266  -------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGEL 108
                    LS     +RVL + +PV   ++  G++  S C CC +   E++ HV     +
Sbjct: 1610 LIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE--ESLIHVLWENPV 1667

Query: 107  ARGLWGFF 84
            A  +W FF
Sbjct: 1668 ATQVWFFF 1675


>ref|XP_009621360.1| PREDICTED: uncharacterized protein LOC104113001 [Nicotiana
           tomentosiformis]
          Length = 289

 Score = 67.0 bits (162), Expect = 6e-09
 Identities = 46/157 (29%), Positives = 68/157 (43%), Gaps = 6/157 (3%)
 Frame = -3

Query: 554 WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEH------VVCDFVQ 393
           W+ M+  +  AE  + W +  G   FW DN   +GPL  +V    +       +V  F+ 
Sbjct: 50  WKSMIQARKKAEPRMIWQVNLGNSSFWWDNWTCEGPLFYKVNEAAKSAKRAKTMVSSFIT 109

Query: 392 DGVWNVQKLTQWVPADCVRKILALLHRAWSWKIRWYGLRAPQELSQCRLHFRVLVSRVPV 213
           +G WN  KL + +P   V++IL                  P  + +     +V+      
Sbjct: 110 EGNWNTCKLREVLPNHLVQQIL------------------PIHIGKQDRKDQVIWDLTDT 151

Query: 212 LEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGELAR 102
             +L RFG    S CSCC NP  E+M HVF  GE A+
Sbjct: 152 --ILNRFGRQNISTCSCCINPVNESMKHVFVEGEAAK 186


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 67.0 bits (162), Expect = 6e-09
 Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+RMV   A+ E+N+ W +G GK+ FWHD  MG+ PL +  +     +  VCDF  +  
Sbjct: 1787 TWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTSSNQELSLSMVQVCDFFMNNS 1846

Query: 383  WNVQKLTQWVPADCVRKILALLHRA---------------WSWKIRWYGLRAPQ------ 267
            W+++KL   +  + V +I  +   A               +S K  W  +R  +      
Sbjct: 1847 WDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVF 1906

Query: 266  --------ELSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
                     L+     +R+L   +PV   ++  G    S C CC +   E++ HV     
Sbjct: 1907 NFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKSE--ESIMHVMWDNP 1964

Query: 110  LARGLWGFF 84
            +A  +W +F
Sbjct: 1965 VATQVWNYF 1973


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 66.2 bits (160), Expect = 1e-08
 Identities = 51/189 (26%), Positives = 84/189 (44%), Gaps = 31/189 (16%)
 Frame = -3

Query: 557  TWRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHV--VCDFVQDGV 384
            TW+ ++  +A A + I W +G G + FWHD  MGD PL      F + +  V  F  D  
Sbjct: 873  TWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQSMMKVNYFFNDDA 932

Query: 383  WNVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQELS--- 258
            W+V KL  ++P   V +IL +                +  +S K  W  LR  ++++   
Sbjct: 933  WDVDKLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVG 992

Query: 257  QCRLH-----------FRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGE 111
            Q   H           +R L + +PV   ++  G+   S C CC +   E++ HV     
Sbjct: 993  QLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSE--ESLLHVLWESP 1050

Query: 110  LARGLWGFF 84
            +A+ +W +F
Sbjct: 1051 VAQQVWNYF 1059


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 65.9 bits (159), Expect = 1e-08
 Identities = 50/188 (26%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
 Frame = -3

Query: 554  WRRMVDIQAVAEENITWVLGNGKVDFWHDN*MGDGPLCAQVEIFGEHVV--CDFVQDGVW 381
            W+RM+  + +A +NI W +G G + FWHD  MGD PL A    F   +     F     W
Sbjct: 1667 WKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQNDMSHGYHFYNGDTW 1726

Query: 380  NVQKLTQWVPADCVRKILAL---------------LHRAWSWKIRWYGLRAPQE------ 264
            +V KL  ++P   V +IL +                +  +S +  W  +R  Q       
Sbjct: 1727 DVDKLRSFLPTILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCS 1786

Query: 263  --------LSQCRLHFRVLVSRVPVLEVLQRFGVYGPSICSCCSNPGVETMDHVFCTGEL 108
                    LS     ++ L + +PV   ++  G+   S C CC++   E++ HV     +
Sbjct: 1787 FIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSE--ESLIHVLWENPV 1844

Query: 107  ARGLWGFF 84
            A+ +W FF
Sbjct: 1845 AKQVWNFF 1852


Top