BLASTX nr result

ID: Rehmannia22_contig00000742 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00000742
         (691 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   218   1e-54
gb|ABN09154.1| RNA-directed DNA polymerase (Reverse transcriptas...   216   5e-54
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...   213   4e-53
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   189   5e-46
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   189   5e-46
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   144   3e-32
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...   140   4e-31
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   140   4e-31
gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao]   140   5e-31
gb|EMJ15225.1| hypothetical protein PRUPE_ppa016668mg, partial [...   138   2e-30
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   137   2e-30
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   137   4e-30
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   137   4e-30
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   136   5e-30
emb|CAB75484.1| putative protein [Arabidopsis thaliana]               136   5e-30
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   135   9e-30
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   135   2e-29
emb|CAN60702.1| hypothetical protein VITISV_015869 [Vitis vinifera]   134   3e-29
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   134   4e-29
gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]   134   4e-29

>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 872

 Score =  218 bits (555), Expect = 1e-54
 Identities = 115/247 (46%), Positives = 151/247 (61%), Gaps = 17/247 (6%)
 Frame = -2

Query: 690 QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
           Q  F+ GR I DC+ + SEC N LD + YGGN+AIK DI KAFDT+SWDFLL V++AFGF
Sbjct: 47  QHAFVVGRNISDCILVTSECFNLLDSKCYGGNVAIKTDITKAFDTLSWDFLLHVLQAFGF 106

Query: 510 SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
              F   +  +L SA+LS+L+NG   GYF C QGVRQGDPLSPLLFC+AEEVLS  IS  
Sbjct: 107 HESFV-QVRVLLLSARLSLLINGRTYGYFSCGQGVRQGDPLSPLLFCLAEEVLSRGISML 165

Query: 330 VQSRAISSIRAGHG--------------ISXXXXXXTISRILS---DYEQLSGQYANRDK 202
           V S  +  I +  G              +        + R++S   +Y  +SGQ  N+DK
Sbjct: 166 VSSGQVKRIHSPRGTLSPSYVLFAGDVIVFCRGNRQNLLRVMSFFYEYGSVSGQIINKDK 225

Query: 201 STIYFGKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNWK 22
           S ++ GK  R++ SI   + +  G+ PF YLG P+F G PR  H + I+D++  K S+W 
Sbjct: 226 SQVFIGKHNRRRHSISDCLGIPLGTAPFMYLGAPIFHGKPRVAHFQAIVDKVRLKLSSWV 285

Query: 21  GHSLSMA 1
           G  LSMA
Sbjct: 286 GSFLSMA 292


>gb|ABN09154.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
           truncatula]
          Length = 528

 Score =  216 bits (550), Expect = 5e-54
 Identities = 115/230 (50%), Positives = 148/230 (64%)
 Frame = -2

Query: 690 QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
           Q GFI GR I DCV LASE IN LD++++GGN+A KVDI KAFDT++W FLL V++ FGF
Sbjct: 225 QRGFIQGRNIKDCVCLASEAINMLDQKSFGGNLAFKVDISKAFDTLNWKFLLKVLKQFGF 284

Query: 510 SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
           S  FC WI +ILQSAKLSI +NGS  GYF CS+GVRQGDPLSPLLFC+AE+VLS  +++ 
Sbjct: 285 SETFCNWIDAILQSAKLSICINGSQQGYFSCSRGVRQGDPLSPLLFCLAEDVLSRSLTKL 344

Query: 330 VQSRAISSIRAGHGISXXXXXXTISRILSDYEQLSGQYANRDKSTIYFGKFVRQKRSILR 151
           V+   +  +R              S IL         YA+        G    + + ++ 
Sbjct: 345 VEQGKLKQMRGTRN------CLVPSHIL---------YADDIMIFCNGGISDARLQQLIN 389

Query: 150 SIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNWKGHSLSMA 1
            I   +GS PF YLGVP+FKG P+   L+PI+D+I +K SNWK   LS+A
Sbjct: 390 VIGFNKGSFPFNYLGVPIFKGKPKARFLQPIVDKIKTKLSNWKASILSIA 439


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 642

 Score =  213 bits (543), Expect = 4e-53
 Identities = 109/248 (43%), Positives = 149/248 (60%), Gaps = 18/248 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I DC++L SE IN LD +++GGN+A+K+D+ KAFDT++WDFLL V++ FGF
Sbjct: 286  QRGFVQGRNIRDCIALTSEAINVLDNKSFGGNLALKIDVTKAFDTLNWDFLLLVLKTFGF 345

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
            +  FC WI +IL S+K+ I +NG+  G+F C++GVRQGDPLSPLLFCI EEVLS  IS  
Sbjct: 346  NELFCNWIKTILHSSKMFISMNGAQHGFFNCNRGVRQGDPLSPLLFCIVEEVLSRSISIL 405

Query: 330  VQSRAISSIRAGHG-----------------ISXXXXXXTISRILSDYEQLSGQYANRDK 202
                 I  I A                     +       +  + + Y   SGQ  N  K
Sbjct: 406  ADKGLIDLIAASRNNCLPFHCFYVDDLMVFCKAKMSSLIVLKSLFTRYADCSGQIMNIRK 465

Query: 201  STIYFGKFV-RQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
            S I+ G     +  +I+  +    GSLPFTYLG P+FKG P+ IH +PI D++ +K + W
Sbjct: 466  SFIFAGGITDTRMNNIVNILGFNVGSLPFTYLGAPIFKGKPKGIHFQPIADKVKAKLAKW 525

Query: 24   KGHSLSMA 1
            K   LS+A
Sbjct: 526  KASLLSIA 533


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 751

 Score =  189 bits (481), Expect = 5e-46
 Identities = 99/232 (42%), Positives = 138/232 (59%), Gaps = 17/232 (7%)
 Frame = -2

Query: 645 LASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGFSNKFCGWISSILQSA 466
           + SE  N LDR+   GN+ IKVDI KAFDT++W FL+ V+  FGF ++F   +  +L SA
Sbjct: 1   MVSEGFNLLDRKIVDGNVGIKVDIAKAFDTLNWQFLIEVLHRFGFGSRFTDLMLILLNSA 60

Query: 465 KLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQHVQSRAISSIRAGHGI 286
            LSIL+NGSP G+F C++GVRQGDPLSP+LFCIAEE LS  ++    S+ + SI    G 
Sbjct: 61  HLSILINGSPHGFFSCTKGVRQGDPLSPILFCIAEEALSRGLTALFSSKKVRSISLPRGC 120

Query: 285 SXXXXXXT----------------ISRILSDYEQLSGQYANRDKSTIYFG-KFVRQKRSI 157
           S                       +   L +Y   SGQ  N+DKST Y G     ++  +
Sbjct: 121 SLTHVLYADDLFIFCRGDTKSLRQLQSFLDNYGAASGQLVNKDKSTFYLGASHFHRRHQV 180

Query: 156 LRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNWKGHSLSMA 1
            + +  + G+ PF+YLGVP+FKG P R HL+ ++D+  ++ + WKG  LSMA
Sbjct: 181 KKILGFKLGTSPFSYLGVPIFKGKPCRKHLQALVDKAKARLAGWKGKLLSMA 232


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score =  189 bits (481), Expect = 5e-46
 Identities = 102/222 (45%), Positives = 131/222 (59%), Gaps = 18/222 (8%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GFI  R I  CV LASE IN L++R YGGN+A+KVDI KAFDT+ W+FLL V++ FGF
Sbjct: 551  QRGFIRDRDISKCVILASEAINLLEKRQYGGNVALKVDIAKAFDTLDWNFLLAVLQRFGF 610

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
              KF  WI  ILQSA+LS+L+NG  VG+F CS GVRQGDPLSPLLFC+ EEVLS  +S  
Sbjct: 611  DEKFVHWILVILQSARLSVLVNGKAVGFFTCSHGVRQGDPLSPLLFCLVEEVLSRALSMA 670

Query: 330  VQSRAISSIRAGHGIS-----------------XXXXXXTISRILSDYEQLSGQYANRDK 202
                 +  +    G+S                        + +I S Y ++SGQ  N  K
Sbjct: 671  ATDGQLIPMSYCRGVSFPTHILYADDVLIFCTGTKRNIRRLIKIFSQYSEVSGQLINNAK 730

Query: 201  STIYFGKFVRQKRSILRS-IRMREGSLPFTYLGVPLFKGVPR 79
            S  +       +  ++ S +    GSLPFTYLG P+F+G P+
Sbjct: 731  SRFFTSAMTGSRVQMISSLLGFNVGSLPFTYLGCPIFRGKPK 772


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  144 bits (363), Expect = 3e-32
 Identities = 88/247 (35%), Positives = 135/247 (54%), Gaps = 19/247 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF++GR I+D + LA E I  +D +A GGN+ +K+D+ KA+D ++WDFL+ V+  FGF
Sbjct: 507  QSGFVSGRLINDNILLAQELIGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILVLERFGF 566

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
            ++ +   I   + +   S+L+NG   GYF   +G+RQGD +SP+LF +A E LS  I++ 
Sbjct: 567  NDMWIDMIRRCITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRGINE- 625

Query: 330  VQSRAIS-------SIRAGH-GISXXXXXXT---------ISRILSDYEQLSGQYANRDK 202
            + SR IS       S+   H   +      T         I   L +YEQ+SGQ  N  K
Sbjct: 626  LFSRYISLHYHSGCSLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQK 685

Query: 201  STIYFGKFVRQKRS--ILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSN 28
            S       +   R   I ++I     +LP TYLG PLFKG  + +  + +I++I  + + 
Sbjct: 686  SCFVTANNMPSSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIRERITG 745

Query: 27   WKGHSLS 7
            W+   LS
Sbjct: 746  WENKILS 752


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score =  140 bits (353), Expect = 4e-31
 Identities = 91/253 (35%), Positives = 128/253 (50%), Gaps = 23/253 (9%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLD--RRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAF 517
            Q  FI GR I+D V +A E ++ L   +R     MA+K D+ KA+D + WDFL   MR F
Sbjct: 693  QAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLF 752

Query: 516  GFSNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLIS 337
            GF NK+ GWI + ++S   S+L+NGSP GY   ++G+RQGDPLSP LF +  ++LS+LI+
Sbjct: 753  GFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLIN 812

Query: 336  QHVQSRAISSIRAGHGI-----------------SXXXXXXTISRILSDYEQLSGQYANR 208
                S  +  +R G+G                  +       +  +   YE  SGQ  N 
Sbjct: 813  GRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINV 872

Query: 207  DKSTIYFGKFV----RQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIIS 40
             KS I FG  V    + K   +  I  + G     YLG+P   G  ++   E IIDR+  
Sbjct: 873  QKSMITFGSRVYGSTQSKLKQILEIPNQGGG--GKYLGLPEQFGRKKKEMFEYIIDRVKK 930

Query: 39   KFSNWKGHSLSMA 1
            + S W    LS A
Sbjct: 931  RTSTWSARFLSPA 943


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score =  140 bits (353), Expect = 4e-31
 Identities = 91/254 (35%), Positives = 127/254 (50%), Gaps = 24/254 (9%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLD--RRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAF 517
            Q  FI GR I+D V +A E ++ L   +R     MA+K D+ KA+D + WDFL   MR F
Sbjct: 919  QAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLF 978

Query: 516  GFSNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLIS 337
            GF NK+ GWI + ++S   S+L+NGSP GY   ++G+RQGDPLSP LF +  ++LS+LI+
Sbjct: 979  GFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLIN 1038

Query: 336  QHVQSRAISSIRAGHGI-----------------SXXXXXXTISRILSDYEQLSGQYANR 208
                S  +  +R G+G                  +       +  +   YE  SGQ  N 
Sbjct: 1039 GRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINV 1098

Query: 207  DKSTIYFGKFV-----RQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRII 43
             KS I FG  V      + + IL       G     YLG+P   G  ++   E IIDR+ 
Sbjct: 1099 QKSMITFGSRVYGSTQSRLKQILEIPNQGGGG---KYLGLPEQFGRKKKEMFEYIIDRVK 1155

Query: 42   SKFSNWKGHSLSMA 1
             + S W    LS A
Sbjct: 1156 KRTSTWSARFLSPA 1169


>gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao]
          Length = 1245

 Score =  140 bits (352), Expect = 5e-31
 Identities = 88/246 (35%), Positives = 130/246 (52%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E +  LD +A GGN+ +K+D+ KA+D +SWDFL  +M  FGF
Sbjct: 875  QSGFVNGRLISDNILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLSWDFLYLMMEQFGF 934

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQ- 334
            ++++   I + + +   S+L+NGS VGYF   +G+RQGD +SPLLF +A E LS  I+Q 
Sbjct: 935  NDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAAEYLSRGINQL 994

Query: 333  -------HVQSRA---ISSIRAGHGI-----SXXXXXXTISRILSDYEQLSGQYANRDKS 199
                   H  S     IS +     I             I   L +YE +SGQ  N  KS
Sbjct: 995  FSDHKSLHYLSGCFMPISHLAFADDIVIFTNGCRPALQKILIFLQEYEAVSGQQVNHQKS 1054

Query: 198  TIYF--GKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
                  G  + +++ I  +   +  +LP  YLG PL KG  +    + +I +I  + S W
Sbjct: 1055 CFITSNGCPMTRRQIIAHTTGFQHKTLPVIYLGAPLHKGPKKVALFDSLITKIRDRISGW 1114

Query: 24   KGHSLS 7
            +  +LS
Sbjct: 1115 ENKTLS 1120


>gb|EMJ15225.1| hypothetical protein PRUPE_ppa016668mg, partial [Prunus persica]
          Length = 152

 Score =  138 bits (347), Expect = 2e-30
 Identities = 65/111 (58%), Positives = 80/111 (72%)
 Frame = -2

Query: 690 QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
           QF F+ G+ I  C+ L SE IN LD R +GGN++IK D+ KAFDT++W FL  V+ AFGF
Sbjct: 26  QFSFLKGKHISYCILLTSEGINLLDNRNFGGNVSIKFDVAKAFDTLNWTFLTNVLTAFGF 85

Query: 510 SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEE 358
              F  W+ +IL  A   IL NGSPVG+FGCSQGVRQGD LSP+LF +AEE
Sbjct: 86  HEVFIKWVGAILSPACFLILFNGSPVGFFGCSQGVRQGDLLSPILFYLAEE 136


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  137 bits (346), Expect = 2e-30
 Identities = 88/247 (35%), Positives = 136/247 (55%), Gaps = 19/247 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GFI GR I D + LA E +  LD +A GGN+A+K+D+ KA+D ++WDFL  +++ FGF
Sbjct: 748  QSGFINGRLISDNILLAQELVGKLDTKARGGNVALKLDMAKAYDRLNWDFLYLMLKQFGF 807

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQ- 334
            ++++   I + + +   S+L+NGS VGYF   +G+RQGD +SPLLF +A + LS  I+Q 
Sbjct: 808  NDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAADYLSRGINQL 867

Query: 333  --HVQS--------RAISSIRAGHGI-----SXXXXXXTISRILSDYEQLSGQYANRDKS 199
              H +S          IS +     I             I   L +YE++ GQ  N  KS
Sbjct: 868  FSHHKSLHYLSGCFMPISRLAFADDIVIFTNGCRPALQKILVFLQEYEKMFGQQVNHQKS 927

Query: 198  TIYF--GKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHL-EPIIDRIISKFSN 28
                  G  + +++ I  +   +   LP  YLG PL K VP+++ L + +I +I  + S 
Sbjct: 928  CFITANGCSMTRRQIIAHTTGFQHKILPIIYLGAPLHK-VPKKVALFDSLITKIRDRISG 986

Query: 27   WKGHSLS 7
            W+  +LS
Sbjct: 987  WENKTLS 993


>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  137 bits (344), Expect = 4e-30
 Identities = 81/236 (34%), Positives = 125/236 (52%), Gaps = 19/236 (8%)
 Frame = -2

Query: 657  DCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGFSNKFCGWISSI 478
            D  ++  E I  ++RR    N+ +K+D+ KA+D +SW FL+ VMR FGF+ +    I  +
Sbjct: 420  DVTNMVKEIIRDINRRNKYHNVVVKLDMAKAYDRVSWKFLVRVMRNFGFAERIIDMIVRL 479

Query: 477  LQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQHVQ--------- 325
            + +   S+L+NG   G+F  ++G++QGDPLSP LF IA EVLS  ++   +         
Sbjct: 480  ISNNWYSVLMNGQSFGFFQSTRGLKQGDPLSPTLFIIAAEVLSRGLNSLFEDPDYIGYGM 539

Query: 324  ---SRAISSIRAGHGISXXXXXXTIS-----RILSDYEQLSGQYANRDKSTIYFGKFV-- 175
               S  +S +             T S      IL  YE++SGQ  N DKS IY  K V  
Sbjct: 540  PKWSPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQMINLDKSMIYLHKQVPN 599

Query: 174  RQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNWKGHSLS 7
            R    + R   +R+GS PFTYLG P+F G   + H E ++ ++ ++ + W+   +S
Sbjct: 600  RVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLKKVSNRMNTWQNKLMS 655


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  137 bits (344), Expect = 4e-30
 Identities = 88/246 (35%), Positives = 129/246 (52%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E I  L+ ++ GGN+A+K+D+ KA+D + W FL+ V++ FGF
Sbjct: 1593 QSGFVGGRLISDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGF 1652

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLS---NLI 340
            ++++ G I   + +   S+LLNG   GYF   +G+RQGDP+SP LF IA E LS   N +
Sbjct: 1653 NDQWIGMIQKCISNCWFSLLLNGRTEGYFKFERGLRQGDPISPQLFLIAAEYLSRGLNAL 1712

Query: 339  SQHVQSRAIS---SIRAGH-------GISXXXXXXTISRILS---DYEQLSGQYANRDKS 199
             +   S   S   SI   H        I        + RIL+   +YE++S Q  N  KS
Sbjct: 1713 YEQYPSLHYSTGVSIPVSHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQRINAQKS 1772

Query: 198  TIYFGKFVRQKRS--ILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
                   V   R   I ++       LP TYLG PL+KG  + I    ++ +I  + + W
Sbjct: 1773 CFVTHTNVSSSRRQIIAQTTGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGW 1832

Query: 24   KGHSLS 7
            +   LS
Sbjct: 1833 ENKILS 1838


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  136 bits (343), Expect = 5e-30
 Identities = 85/246 (34%), Positives = 132/246 (53%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E I  +D ++ GGN+ +K+D+ KA+D ++WDFL  +M  FGF
Sbjct: 1300 QSGFVNGRLISDNILLAQELIGKIDAKSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFGF 1359

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLS----NL 343
            +  +   I S + +   S+L+NGS  GYF   +G+RQGD +SP+LF +A + LS    +L
Sbjct: 1360 NAHWINMIKSCISNCWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADYLSRGLNHL 1419

Query: 342  ISQHVQSRAIS---------SIRAGHGISXXXXXXTISRILS---DYEQLSGQYANRDKS 199
             S +   + +S         S      I        + +ILS   +YEQ+SGQ  N  KS
Sbjct: 1420 FSCYSSLQYLSGCQMPISHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKS 1479

Query: 198  TIYF--GKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
                  G  + +++ I  +   +  +LP TYLG PL KG  + +  + +I +I  + S W
Sbjct: 1480 CFITANGCSLSRRQIISHTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKIRDRISGW 1539

Query: 24   KGHSLS 7
            +   LS
Sbjct: 1540 ENKILS 1545


>emb|CAB75484.1| putative protein [Arabidopsis thaliana]
          Length = 851

 Score =  136 bits (343), Expect = 5e-30
 Identities = 89/253 (35%), Positives = 128/253 (50%), Gaps = 23/253 (9%)
 Frame = -2

Query: 690 QFGFIAGRRIHDCVSLASECINCLD--RRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAF 517
           Q  FI GR I+D V +A E ++ L   +R     MA+K D+ KA+D + WDFL   MR F
Sbjct: 158 QAAFIPGRIINDNVMIAHEIMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLF 217

Query: 516 GFSNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLIS 337
           GF +K+ GWI + ++S   S+L+NGSP GY   ++G+RQGDPLSP LF +  ++LS+LI 
Sbjct: 218 GFCDKWIGWIMAAVKSVHYSVLINGSPHGYISPTRGIRQGDPLSPYLFILCGDILSHLIK 277

Query: 336 QHVQSRAISSIRAGHGI-----------------SXXXXXXTISRILSDYEQLSGQYANR 208
               S  I  +R G+G                  +       +  +   YE  SGQ  N 
Sbjct: 278 VKASSGDIRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINV 337

Query: 207 DKSTIYFGKFV----RQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIIS 40
            KS I FG  V    + +   L +I  + G     YLG+P   G  ++     IIDR+  
Sbjct: 338 QKSLITFGSRVYGSTQTRLKTLLNIPNQGGG--GKYLGLPEQFGRKKKEMFNYIIDRVKE 395

Query: 39  KFSNWKGHSLSMA 1
           + ++W    LS A
Sbjct: 396 RTASWSAKFLSPA 408


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  135 bits (341), Expect = 9e-30
 Identities = 82/246 (33%), Positives = 129/246 (52%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E I  LD++  GGN+A+K+D+ KA+D + W FL  V++  GF
Sbjct: 1386 QSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGF 1445

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSN----L 343
            + ++ G I   + +   S+LLNG  VGYF   +G+RQGD +SP LF +A E L+     L
Sbjct: 1446 NAQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLNAL 1505

Query: 342  ISQHVQ-------SRAISSIRAGHGI-----SXXXXXXTISRILSDYEQLSGQYANRDKS 199
              Q+         S ++S +     +             I   L +YE+LSGQ  N  KS
Sbjct: 1506 YDQYPSLHYSSGCSLSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKS 1565

Query: 198  TI--YFGKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
             +  +      +++ IL++       LP TYLG PL+KG  + +    ++ +I  + + W
Sbjct: 1566 CVVTHTNMASSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIEERITGW 1625

Query: 24   KGHSLS 7
            +  +LS
Sbjct: 1626 ENKTLS 1631


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  135 bits (339), Expect = 2e-29
 Identities = 86/246 (34%), Positives = 126/246 (51%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E I  LD ++ GGN+A+K+D+ KA+D + W FL+ V++ FGF
Sbjct: 1423 QSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGF 1482

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLS---NLI 340
            + ++ G I   + +   S+LLNG   GYF   +G+RQGD +SP LF +A E LS   N +
Sbjct: 1483 NEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLNAL 1542

Query: 339  SQHVQSRAISS---IRAGH-------GISXXXXXXTISRI---LSDYEQLSGQYANRDKS 199
                 S   SS   +   H        I        + RI   L +YE++SGQ  N  KS
Sbjct: 1543 YDQYPSLHYSSGVPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKS 1602

Query: 198  TIYFGKFVRQKRS--ILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
                   +   R   I ++       LP TYLG PL+KG  + I    ++ +I  + + W
Sbjct: 1603 CFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGW 1662

Query: 24   KGHSLS 7
            +   LS
Sbjct: 1663 ENKILS 1668


>emb|CAN60702.1| hypothetical protein VITISV_015869 [Vitis vinifera]
          Length = 3028

 Score =  134 bits (337), Expect = 3e-29
 Identities = 85/248 (34%), Positives = 129/248 (52%), Gaps = 20/248 (8%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q  F+ GR+I D   +A+E I+ L +R   G +  K+D+ KA+D I+WDFL+ V+++ GF
Sbjct: 2404 QNAFVEGRQILDAALIANEAIDSLLKRNESGVLC-KLDLEKAYDHINWDFLIFVLQSMGF 2462

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
              K+ GWIS  + +A  S+L+NG+P GYF  S+G+RQGDPLSP LF I  E LS LI++ 
Sbjct: 2463 GEKWIGWISWCISTATFSVLINGTPEGYFNSSRGLRQGDPLSPYLFVIGMEALSRLINRA 2522

Query: 330  VQSRAISSI----RAGHGI----------------SXXXXXXTISRILSDYEQLSGQYAN 211
            V    +S      R G+G+                +       +S +L  +E +SG   N
Sbjct: 2523 VGGGFLSGCRVDGRGGNGVLVSHLLFADDTLVFCEASEDQMVYLSWLLMWFEAISGLRIN 2582

Query: 210  RDKSTIYFGKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFS 31
             DKS I     V    ++      + G LP +YLG+PL          + + +R   + +
Sbjct: 2583 LDKSEILPVGRVXNLENLALEAGCKVGRLPSSYLGIPLGANHKSVAVWDGVEERFRKRLA 2642

Query: 30   NWKGHSLS 7
             WK   +S
Sbjct: 2643 LWKRQFIS 2650


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  134 bits (336), Expect = 4e-29
 Identities = 82/246 (33%), Positives = 129/246 (52%), Gaps = 18/246 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E ++ ++ R+ GGN+ +K+D+ KA+D ++W+FL  +M  FGF
Sbjct: 1387 QSGFVNGRLISDNILLAQELVDKINARSRGGNVVLKLDMAKAYDRLNWEFLYLMMEQFGF 1446

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQH 331
            +  +   I + + +   S+L+NGS VGYF   +G+RQGD +SP LF +A E LS  ++Q 
Sbjct: 1447 NALWINMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPSLFILAAEYLSRGLNQL 1506

Query: 330  VQ-----------SRAISSIRAGHGI-----SXXXXXXTISRILSDYEQLSGQYANRDKS 199
                         S ++S +     I             I   L +YEQ+SGQ  N  KS
Sbjct: 1507 FSRYNSLHYLSGCSMSVSHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQQVNHQKS 1566

Query: 198  TIYF--GKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISKFSNW 25
                  G  + +++ I +    +  +LP TYLG PL KG  +    + +I +I  + S W
Sbjct: 1567 CFITANGCPLSRRQIIAQVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRISGW 1626

Query: 24   KGHSLS 7
            +   LS
Sbjct: 1627 ENKILS 1632


>gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
          Length = 1659

 Score =  134 bits (336), Expect = 4e-29
 Identities = 87/236 (36%), Positives = 130/236 (55%), Gaps = 18/236 (7%)
 Frame = -2

Query: 690  QFGFIAGRRIHDCVSLASECINCLDRRAYGGNMAIKVDIRKAFDTISWDFLLGVMRAFGF 511
            Q GF+ GR I D + LA E I  LD +A GGN+ +K+D+ KA+D ++WDFL  +M+ FGF
Sbjct: 1023 QSGFVNGRLISDNILLAQELIGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGF 1082

Query: 510  SNKFCGWISSILQSAKLSILLNGSPVGYFGCSQGVRQGDPLSPLLFCIAEEVLSNLISQ- 334
            ++++   I + + +   S+L+NGS VGYF   +G+RQGD +SPLLF +A + LS  I+Q 
Sbjct: 1083 NDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAADYLSRGINQL 1142

Query: 333  --HVQS--------RAISSIRAGHGI-----SXXXXXXTISRILSDYEQLSGQYANRDKS 199
              H +S          IS +     I             I   L +YE++SGQ  N  KS
Sbjct: 1143 FSHHKSLLYLSGCFMPISHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQQVNHQKS 1202

Query: 198  TIYF--GKFVRQKRSILRSIRMREGSLPFTYLGVPLFKGVPRRIHLEPIIDRIISK 37
                  G  +  ++ I  +   +  +LP  YLGVPL KG P+++ L    D +I+K
Sbjct: 1203 CFITANGCPMTMRQIIAHTTGFQHKTLPVIYLGVPLHKG-PKKVTL---FDSLITK 1254


Top