BLASTX nr result

ID: Rauwolfia21_contig00042237 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00042237
         (575 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    122   5e-26
gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus pe...   117   2e-24
gb|EMJ28586.1| hypothetical protein PRUPE_ppb016975mg [Prunus pe...   117   3e-24
ref|XP_004499744.1| PREDICTED: uncharacterized protein LOC101507...   114   1e-23
gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus pe...   114   2e-23
ref|XP_004516417.1| PREDICTED: uncharacterized protein LOC101507...   113   4e-23
ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502...   112   7e-23
ref|XP_004516035.1| PREDICTED: uncharacterized protein LOC101492...   111   1e-22
ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...   110   2e-22
emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]   110   3e-22
ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210...   109   4e-22
dbj|BAL46523.1| hypothetical protein [Gentiana scabra x Gentiana...   107   3e-21
ref|XP_002312663.1| predicted protein [Populus trichocarpa]           106   4e-21
gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobrom...   106   5e-21
gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo s...   106   5e-21
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    105   8e-21
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   105   1e-20
gb|EOY26421.1| DNA/RNA polymerases superfamily protein [Theobrom...   104   2e-20
gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]         104   2e-20
gb|EOX99717.1| Uncharacterized protein TCM_008533 [Theobroma cacao]   104   2e-20

>gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]
          Length = 871

 Score =  122 bits (307), Expect = 5e-26
 Identities = 76/188 (40%), Positives = 104/188 (55%), Gaps = 6/188 (3%)
 Frame = -2

Query: 562  PTRPAENRQSPNA------RVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFV 401
            P RP    Q+  A      RVFA    +AE+A  V+TGTL V    A VLFD G+SHSF+
Sbjct: 592  PLRPTGIAQNQGAGAPLQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFI 651

Query: 400  ASNFVKTHKLKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFD 221
            +S FV   +L+  PL+    VS  S + ++S+   K  ++ I   ++   LI + M DFD
Sbjct: 652  SSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFD 711

Query: 220  IILGMDWLSKNHVYVDCRGKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCG 41
            +ILGMDWL+ NH  +DC  K V F  P    F F+GG   KS   VIS+ +A + L +  
Sbjct: 712  VILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGG-GSKSLPQVISAIRASKLLSQGT 770

Query: 40   IGLLACAV 17
             G+LA  V
Sbjct: 771  WGILASVV 778


>gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score =  117 bits (294), Expect = 2e-24
 Identities = 68/180 (37%), Positives = 105/180 (58%), Gaps = 2/180 (1%)
 Frame = -2

Query: 559 TRPAENRQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKT 380
           +R    R +  ARVF++T+Q+A    DVITG + +    A VL D GA+HSFVA NF+  
Sbjct: 337 SRGQPGRSTTQARVFSMTQQEAYATPDVITGMIPIFGYLARVLIDPGATHSFVAHNFIPY 396

Query: 379 HKLKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDW 200
             ++ TP+   + +S+ + ++L + R F+   +++ +  L A+LI + + D DIILGMDW
Sbjct: 397 ISIRPTPITGSFSISLPTGEVLYADRVFRNCFVQVDDAWLEANLIPLDLVDLDIILGMDW 456

Query: 199 LSKNHVYVDCRGKRVRFVIPGKKKFIFQGGLKRKSFIP--VISSTQAFRDLRKCGIGLLA 26
           L K+H  VDC  K V    PG+ K  F+G    +  +P  +IS+  A + L+K   G LA
Sbjct: 457 LEKHHASVDCFRKEVTLRSPGQPKVTFRG---ERRVLPTCLISAITAKKLLKKGYEGYLA 513


>gb|EMJ28586.1| hypothetical protein PRUPE_ppb016975mg [Prunus persica]
          Length = 650

 Score =  117 bits (292), Expect = 3e-24
 Identities = 68/180 (37%), Positives = 106/180 (58%), Gaps = 2/180 (1%)
 Frame = -2

Query: 559 TRPAENRQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKT 380
           +R    R +  A VF++++Q+A    DVITG + + S  A VL D GA+HSFVA NF+  
Sbjct: 419 SRGEPGRSTTQAHVFSMSQQEAYATPDVITGMIPIFSYLARVLIDPGATHSFVAHNFIPY 478

Query: 379 HKLKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDW 200
             ++ TP+   + +S+ + ++L + R F+   +++ +  L A+LI + + D DIILGMDW
Sbjct: 479 VSIRPTPMTWSFSISLPTGEVLYADRVFRNCFVQVDDAWLEANLIPLDLVDLDIILGMDW 538

Query: 199 LSKNHVYVDCRGKRVRFVIPGKKKFIFQGGLKRKSFIP--VISSTQAFRDLRKCGIGLLA 26
           L K+H  VDC  K V F  PG+ K  F+G    +  +P  +IS+  A + L+K   G LA
Sbjct: 539 LEKHHASVDCYRKEVTFRSPGQPKVTFRG---ERRVLPTCLISAITAKKLLQKGCEGYLA 595


>ref|XP_004499744.1| PREDICTED: uncharacterized protein LOC101507718 [Cicer arietinum]
          Length = 679

 Score =  114 bits (286), Expect = 1e-23
 Identities = 61/140 (43%), Positives = 90/140 (64%)
 Frame = -2

Query: 529 NARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLND 350
           +ARVFALT QDA  +  V+TG L + S  A VLFD GA+HSFV+S F       S+ L++
Sbjct: 234 HARVFALTRQDAHTSNAVVTGILSICSRDAHVLFDPGATHSFVSSWFATRLGKCSSSLDE 293

Query: 349 PYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDC 170
           P  V+      L+++  ++  ++ I  ++LS DL+ I + +FD+ILG+DWL+ +H  +DC
Sbjct: 294 PLVVATPVGGNLIAKSVYRSCDITIDGKVLSVDLVVIDLIEFDVILGIDWLALHHATLDC 353

Query: 169 RGKRVRFVIPGKKKFIFQGG 110
             K V+F IPG+  F FQGG
Sbjct: 354 HNKVVKFEIPGQPVFSFQGG 373


>gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus persica]
          Length = 505

 Score =  114 bits (285), Expect = 2e-23
 Identities = 68/178 (38%), Positives = 101/178 (56%)
 Frame = -2

Query: 559 TRPAENRQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKT 380
           +R    R +   RVF++T+Q+A    DVITG + +    A VL D GA+HSFVA NF   
Sbjct: 259 SRGQPGRSTTQGRVFSMTQQEAHATPDVITGMIPIFGYLARVLIDPGATHSFVAHNFAPY 318

Query: 379 HKLKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDW 200
             ++ TP+   + +S+ + ++L   R F+   +++ +  L A+L  + + D DIILGMDW
Sbjct: 319 INVRPTPMIGSFSISLPTGEVLYVDRVFRNCFVQVDDAWLEANLTPLDLVDLDIILGMDW 378

Query: 199 LSKNHVYVDCRGKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
           L K+H  VDC  K+V    PG+ K  F GG +R     +IS+  A R L+K   G LA
Sbjct: 379 LEKHHASVDCFRKKVTLRSPGQPKVTF-GGERRVLPTCLISAITAKRLLKKGCEGYLA 435


>ref|XP_004516417.1| PREDICTED: uncharacterized protein LOC101507909 [Cicer arietinum]
          Length = 730

 Score =  113 bits (282), Expect = 4e-23
 Identities = 61/139 (43%), Positives = 88/139 (63%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT QDA+ +  V TG L + S  A VLFD GA+HSFV+S F       S+ L +P
Sbjct: 318 ARVFALTRQDAQTSNAVFTGILSICSRDAHVLFDPGATHSFVSSWFATQLGKCSSSLEEP 377

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             V+      L ++  ++  ++ I +++L  +L+ I + DFD+ILGMDWL+ +H  +DC 
Sbjct: 378 LVVATPVGGNLFAKSVYRSCDVIIDDKVLPVNLVVIDLIDFDVILGMDWLALHHATLDCH 437

Query: 166 GKRVRFVIPGKKKFIFQGG 110
            K V+F IPG+  F+FQGG
Sbjct: 438 NKVVKFEIPGQPVFLFQGG 456


>ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502180 [Cicer arietinum]
          Length = 1235

 Score =  112 bits (280), Expect = 7e-23
 Identities = 64/147 (43%), Positives = 90/147 (61%)
 Frame = -2

Query: 553 PAENRQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHK 374
           PAE  Q   A+VFALT QDA+    V+TG L + S  A VLFD GA+HSFV+S F     
Sbjct: 538 PAERGQ---AQVFALTRQDAQTCNAVVTGILSICSRDAHVLFDLGATHSFVSSWFATRLG 594

Query: 373 LKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLS 194
             S+ L +P  V+      L+++  ++  ++ I  ++ S DL+ I + DFD+ILGMDWL+
Sbjct: 595 KCSSSLEEPLVVATPVGGNLLAKSVYRCCDITIDGKVFSVDLVVIDLIDFDVILGMDWLA 654

Query: 193 KNHVYVDCRGKRVRFVIPGKKKFIFQG 113
            +H  +DC  K V+F IPG+  F FQG
Sbjct: 655 FHHATLDCHDKVVKFEIPGQSVFSFQG 681


>ref|XP_004516035.1| PREDICTED: uncharacterized protein LOC101492305, partial [Cicer
            arietinum]
          Length = 1216

 Score =  111 bits (278), Expect = 1e-22
 Identities = 64/148 (43%), Positives = 90/148 (60%)
 Frame = -2

Query: 553  PAENRQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHK 374
            P E  Q   ARVFALT QDA+ +  V+TG L + S  A VLFD  A+HSFV+S F     
Sbjct: 1004 PTERGQ---ARVFALTRQDAQTSNAVVTGILSICSRDAHVLFDPRATHSFVSSWFATQLG 1060

Query: 373  LKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLS 194
               + L +P  V+      L+++  ++  ++ I +++L  DL+ I + DFD+ILGMDWL+
Sbjct: 1061 KCPSSLEEPLVVATPVGGNLLAKSVYRSCDVIIDDKVLPVDLVVIDLIDFDVILGMDWLA 1120

Query: 193  KNHVYVDCRGKRVRFVIPGKKKFIFQGG 110
             +H   DCR K V+F IPG+  F FQGG
Sbjct: 1121 LHHATSDCRNKVVKFEIPGQPVFSFQGG 1148


>ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus]
          Length = 655

 Score =  110 bits (276), Expect = 2e-22
 Identities = 68/175 (38%), Positives = 95/175 (54%)
 Frame = -2

Query: 541 RQSPNARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKST 362
           R      +FA    +AE+A  V+TGTL V    A  LFD G+SHSF++S FV    L+  
Sbjct: 394 RPPQRGTIFATNRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVE 453

Query: 361 PLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHV 182
           PL+    VS  S +I++S+   K  E+ I   +L   L+ + MRDFD+ILGMDWL+ NH 
Sbjct: 454 PLDYVLSVSTPSGEIMLSKEKIKACEIEIAGRVLDVTLLVLDMRDFDVILGMDWLATNHA 513

Query: 181 YVDCRGKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLACAV 17
            +DC  K V F  P    F F+ G+       VIS+ +A + L +    +LA  V
Sbjct: 514 SIDCSRKEVVFSPPTASSFKFK-GVGTVVLPKVISAMKASKLLNQGTWSILASVV 567


>emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]
          Length = 1797

 Score =  110 bits (275), Expect = 3e-22
 Identities = 59/153 (38%), Positives = 92/153 (60%), Gaps = 2/153 (1%)
 Frame = -2

Query: 565  PPTRPAENRQSPNAR--VFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASN 392
            P     +++Q P A+  VFA+T +DA+  +DV+TGTL ++++ A VL D G++HSFV+ +
Sbjct: 758  PKEENKKDKQKPKAQGWVFAMTHRDAQATSDVVTGTLRIHTLFARVLIDLGSTHSFVSVS 817

Query: 391  FVKTHKLKSTPLNDPYRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIIL 212
            F     L    ++    V+      +V+ R  +   + IG   +  DL+ + ++DFD+IL
Sbjct: 818  FAGLLGLPVASMDFDLIVATPVGDSVVASRMLRNCIVMIGYREMPIDLVLLDLQDFDVIL 877

Query: 211  GMDWLSKNHVYVDCRGKRVRFVIPGKKKFIFQG 113
            GMDWL+  H  VDC  KRV F IPG+ KF F+G
Sbjct: 878  GMDWLASYHASVDCFEKRVTFSIPGQPKFSFEG 910



 Score = 90.5 bits (223), Expect = 3e-16
 Identities = 56/168 (33%), Positives = 91/168 (54%)
 Frame = -2

Query: 505 EQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDPYRVSISS 326
           +Q     +D   GTL ++++ A VL D G++HSFV+ +F     L    ++    V+   
Sbjct: 74  QQRKRNRSDGAHGTLRIHTLFARVLIDPGSTHSFVSVSFAGLLGLPVASMDFDLIVATPV 133

Query: 325 KKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCRGKRVRFV 146
              +V+ R  +   + IG   +  DL+ + ++DFD+ILGMDWL+  H  +DC  KRV F 
Sbjct: 134 GDFVVASRMLRNCIVMIGYREMLVDLVLLDLQDFDVILGMDWLTSYHASIDCFEKRVTFS 193

Query: 145 IPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLACAVKVEKE 2
           IPG+ KF F+G    +  + +IS+ +A   L+K   G LA  +  E +
Sbjct: 194 IPGQPKFSFEGKHVDRP-LRMISALRASSLLKKGCQGFLASVMSNESD 240


>ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210300 [Cucumis sativus]
          Length = 623

 Score =  109 bits (273), Expect = 4e-22
 Identities = 66/166 (39%), Positives = 95/166 (57%)
 Frame = -2

Query: 499 DAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDPYRVSISSKK 320
           +AE+A  V+TGTL V    A  LFD G+SHSF++S FV    L+  PL+    VS  S +
Sbjct: 366 EAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEPLDYVLSVSTPSGE 425

Query: 319 ILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCRGKRVRFVIP 140
           I++S+   K  E+ I   +L   L+ + MRDFD+ILGMDWL+ NH  +DC  K V F  P
Sbjct: 426 IMLSKEKIKACEIEIAGRVLDVTLLVLDMRDFDVILGMDWLATNHASIDCSRKEVVFSPP 485

Query: 139 GKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLACAVKVEKE 2
            +  F F+ G+ R     VIS+ +A + L +    +LA  V   ++
Sbjct: 486 TESSFKFK-GVGRVVLPKVISAMKASKLLNQGTWSILASVVDTRED 530


>dbj|BAL46523.1| hypothetical protein [Gentiana scabra x Gentiana triflora]
          Length = 1152

 Score =  107 bits (266), Expect = 3e-21
 Identities = 57/159 (35%), Positives = 91/159 (57%), Gaps = 2/159 (1%)
 Frame = -2

Query: 475 ITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDPYRVSISSKKILVSRRNF 296
           + GT++++++    LFD GAS +FV+S   +   L    L  P  ++    ++    +  
Sbjct: 12  LLGTIIIDNLAVSALFDTGASGTFVSSKIARKLNLPLQELEKPLSITTPLGRVTKVAQVL 71

Query: 295 KGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCRGKRVRFVIPGKKKFIFQ 116
             V++R+G     +DL  +   +FD+ILGMDWLSKN V+VDCRGK+V F +PGK    FQ
Sbjct: 72  PQVDVRVGAYRCKSDLTVLDFTNFDVILGMDWLSKNFVHVDCRGKKVIFRVPGKSDKTFQ 131

Query: 115 GGLKR--KSFIPVISSTQAFRDLRKCGIGLLACAVKVEK 5
           G + +  K   P+IS+ +A + L+K   G +  A+  EK
Sbjct: 132 GNVYKASKKKYPIISAVRAMKALQKGCEGYVLYAMDTEK 170


>ref|XP_002312663.1| predicted protein [Populus trichocarpa]
          Length = 610

 Score =  106 bits (265), Expect = 4e-21
 Identities = 61/147 (41%), Positives = 87/147 (59%), Gaps = 1/147 (0%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFA+T+QDA+ +  V++GTLL+ S  A VLFD GA+HSFV+S F      +   L  P
Sbjct: 420 ARVFAITQQDAQTSNIVVSGTLLICSFEARVLFDTGATHSFVSSYFALRFTKQPILLESP 479

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             V+I S +++     +   E+++    L  DL+ + +  FD+ILGMDWLS++H  VDC 
Sbjct: 480 LCVAIPSDEVMFGEYVYVDCEVQVQGRNLLGDLVILEIVGFDVILGMDWLSRHHASVDCW 539

Query: 166 GKRVRFVIPGKKKFIFQG-GLKRKSFI 89
            K V F    + +F F G GL   S I
Sbjct: 540 NKTVIFKPDEETEFAFHGDGLSSPSSI 566


>gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1401

 Score =  106 bits (264), Expect = 5e-21
 Identities = 66/167 (39%), Positives = 95/167 (56%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF++  F            + 
Sbjct: 432 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQ 491

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I V+   ++   +R+ ++  S +L+ +   DFD+ILGMDWLS  H  VDC 
Sbjct: 492 LMVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCY 551

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG  +  +   +IS   A R LR+  IG LA
Sbjct: 552 HKLVRFDFPGEPLFSIQGD-RSNAPTNLISVISARRLLRQGCIGYLA 597


>gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo]
          Length = 1359

 Score =  106 bits (264), Expect = 5e-21
 Identities = 64/169 (37%), Positives = 94/169 (55%)
 Frame = -2

Query: 523 RVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDPY 344
           +VFA  + +AE A+ V+TGTL V    A VLFD G SHSF++S FV   +L+  PL+   
Sbjct: 266 KVFATNKTEAERASTVVTGTLPVLGHYALVLFDSGFSHSFISSAFVLHARLEVEPLHHVL 325

Query: 343 RVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCRG 164
            VS    + ++S+   K  ++ I   ++   L+ + M DFD+ILGMDWL+ NH  +DC  
Sbjct: 326 SVSTPFGECMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSR 385

Query: 163 KRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLACAV 17
           K + F  P    F F+    R S   VIS+ +A + L +    +LA  V
Sbjct: 386 KEIAFNPPSMANFKFKEEGSR-SLPKVISAMRASKLLSQGIWSILASVV 433


>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score =  105 bits (262), Expect = 8e-21
 Identities = 65/167 (38%), Positives = 94/167 (56%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF++  F            + 
Sbjct: 460 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQ 519

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I ++   ++   +R+ ++  S +L+ +   DFD+ILGMDWLS  H  VDC 
Sbjct: 520 LVVSTLLKEIFMAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCY 579

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG +       +IS   A R LR+  IG LA
Sbjct: 580 HKLVRFDFPGEPSFSIQGDMSNAP-TNLISVISARRLLRQGCIGYLA 625


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  105 bits (261), Expect = 1e-20
 Identities = 65/167 (38%), Positives = 96/167 (57%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF+++ F            + 
Sbjct: 331 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISTCFASRLGRGRVRREEQ 390

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I V+   ++   +R+ ++  S +L+ +   DFD+ILGM+WLS  H  VDC 
Sbjct: 391 LVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 450

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG  +  +   +IS   A R LR+  IG LA
Sbjct: 451 HKLVRFDFPGEPSFSIQGD-RSNAPTNLISVISARRLLRQGCIGYLA 496


>gb|EOY26421.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 334

 Score =  104 bits (259), Expect = 2e-20
 Identities = 65/167 (38%), Positives = 95/167 (56%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF++  F            + 
Sbjct: 108 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQ 167

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I V+   ++   +R+ ++  S +L+ +   DFD+ILGM+WLS  H  VDC 
Sbjct: 168 LVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 227

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG  +  +   +IS   A R LR+  IG LA
Sbjct: 228 HKLVRFDFPGEPSFSIQGD-RSNAPTNLISVISARRLLRQGCIGYLA 273


>gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]
          Length = 654

 Score =  104 bits (259), Expect = 2e-20
 Identities = 65/167 (38%), Positives = 95/167 (56%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF++  F            + 
Sbjct: 312 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQ 371

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I V+   ++   +R+ ++  S +L+ +   DFD+ILGM+WLS  H  VDC 
Sbjct: 372 LVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 431

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG  +  +   +IS   A R LR+  IG LA
Sbjct: 432 HKLVRFDFPGEPSFSIQGD-RSNAPTNLISVISARRLLRQGCIGYLA 477


>gb|EOX99717.1| Uncharacterized protein TCM_008533 [Theobroma cacao]
          Length = 563

 Score =  104 bits (259), Expect = 2e-20
 Identities = 65/167 (38%), Positives = 95/167 (56%)
 Frame = -2

Query: 526 ARVFALTEQDAEEATDVITGTLLVNSVPACVLFDCGASHSFVASNFVKTHKLKSTPLNDP 347
           ARVFALT+Q+A+ +  V++G L V ++ A VLFD GA+HSF++  F            + 
Sbjct: 165 ARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQ 224

Query: 346 YRVSISSKKILVSRRNFKGVELRIGEEMLSADLIQISMRDFDIILGMDWLSKNHVYVDCR 167
             VS   K+I V+   ++   +R+ ++  S +L+ +   DFD+ILGM+WLS  H  VDC 
Sbjct: 225 LVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 284

Query: 166 GKRVRFVIPGKKKFIFQGGLKRKSFIPVISSTQAFRDLRKCGIGLLA 26
            K VRF  PG+  F  QG  +  +   +IS   A R LR+  IG LA
Sbjct: 285 HKLVRFDFPGEPSFSIQGD-RSNAPTNLISVISARRLLRQGCIGYLA 330

