BLASTX nr result

ID: Mentha29_contig00027141 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00027141
         (670 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] g...   350   2e-94
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   345   1e-92
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...   344   1e-92
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   342   6e-92
emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]   337   2e-90
ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [The...   337   3e-90
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...   337   3e-90
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...   337   3e-90
emb|CAA73042.1| polyprotein [Ananas comosus]                          335   8e-90
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...   334   1e-89
ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobrom...   334   2e-89
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...   333   2e-89
ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [The...   333   2e-89
ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [The...   333   3e-89
ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prun...   333   3e-89
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]   332   5e-89
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...   332   6e-89
ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [The...   332   6e-89
ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prun...   331   1e-88
ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [The...   330   3e-88

>ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao]
           gi|508702196|gb|EOX94092.1| Gag protease polyprotein
           [Theobroma cacao]
          Length = 269

 Score =  350 bits (898), Expect = 2e-94
 Identities = 161/222 (72%), Positives = 186/222 (83%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPGSTKMY  +K  +WW GMKRDVA FV +CL CQQVKA HQRP G LQ LP+PEWKWEH
Sbjct: 12  HPGSTKMYRTIKENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQSLPVPEWKWEH 71

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           + MDFV+ LP++ + N AIWVIVDRL+KSAHFL +  T+S+EKLA+LY+ EIVRLHGVPV
Sbjct: 72  VTMDFVLGLPRTQRGNDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPV 131

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDRD RFTS FW   Q+ +GT+L FSTAFHPQTDGQSERTIQ LEDMLRA V+D  G
Sbjct: 132 SIVSDRDPRFTSRFWLKFQEALGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIG 191

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
           +W+  LPLVEFAYNNS+Q++IGMAPYEALYGRKCR+PL WDE
Sbjct: 192 SWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWDE 233


>ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508708318|gb|EOY00215.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  345 bits (884), Expect = 1e-92
 Identities = 157/222 (70%), Positives = 184/222 (82%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPGSTKMY  +K  +WW GM+RD+A FV +CL CQQ+KA HQ+P G LQPL IPEWKWEH
Sbjct: 1112 HPGSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLSIPEWKWEH 1171

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            + MDFV+ LP++     AIWVIVDRL+KSAHFL I  T+S+E+LA+LY+ EIVRLHGVPV
Sbjct: 1172 VTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPV 1231

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD RFTS FW   Q+ +GT+L FSTAFHPQTDGQSERTIQ LEDMLRA V+D  G
Sbjct: 1232 SIVSDRDLRFTSRFWPKFQEALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIG 1291

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            +W+  LPLVEFAYNNS+Q++IGMAPYEALYGRKCR+PL WDE
Sbjct: 1292 SWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWDE 1333


>ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508702098|gb|EOX93994.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 811

 Score =  344 bits (883), Expect = 1e-92
 Identities = 155/222 (69%), Positives = 184/222 (82%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPGSTKMY  +K  +WW GMKRD+A FV +CL CQQ+KA HQ+  G LQPLPIPEWKWEH
Sbjct: 513  HPGSTKMYRTIKESYWWPGMKRDIAKFVAKCLTCQQIKAEHQKSSGTLQPLPIPEWKWEH 572

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            + MDFV+ LP++     AIWVIVDRL+KSAHFL I  T+S+E+LA+LY+ E+VRLHGVP+
Sbjct: 573  VTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEVVRLHGVPI 632

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD RFTS FW   Q+ +GT+L FST+FHPQTDGQSERTIQ LEDMLRA V+D  G
Sbjct: 633  SIVSDRDPRFTSRFWPKFQEALGTKLRFSTSFHPQTDGQSERTIQTLEDMLRACVIDFIG 692

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            +W+  LPLVEFAYNNS+Q++IGMAPYEALYGRKCR+PL WDE
Sbjct: 693  SWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWDE 734


>emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]
          Length = 1573

 Score =  342 bits (877), Expect = 6e-92
 Identities = 158/222 (71%), Positives = 181/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK  F W GMKRD+A FV  C  CQQVKA HQRP  +LQPLPIP+WKW++
Sbjct: 1210 HPGNTKMYQDLKRQFXWSGMKRDIAQFVANCQICQQVKAEHQRPAELLQPLPIPKWKWDN 1269

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV+ LP++  K   +WVIVDRL+KSAHFL ++ T S+  LAKLY++EIVRLHG+PV
Sbjct: 1270 ITMDFVIGLPRTRSKKNGVWVIVDRLTKSAHFLAMKTTDSMNSLAKLYIQEIVRLHGIPV 1329

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD +FTS FW+SLQ  +GTQLNFST FHPQTDGQSER IQILEDMLRA V+D GG
Sbjct: 1330 SIVSDRDPKFTSQFWQSLQRALGTQLNFSTVFHPQTDGQSERVIQILEDMLRACVLDFGG 1389

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            NW D LPL EFAYNN YQ++IGMAPYEALYGR CRSPL W E
Sbjct: 1390 NWADYLPLAEFAYNNXYQSSIGMAPYEALYGRPCRSPLCWIE 1431


>emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]
          Length = 1313

 Score =  337 bits (865), Expect = 2e-90
 Identities = 157/222 (70%), Positives = 180/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK  FWW GMKRD+A FV     CQQVKA HQRP G+LQPLPIPEWKW++
Sbjct: 904  HPGNTKMYQDLKRQFWWSGMKRDIAQFVANFQICQQVKAEHQRPAGLLQPLPIPEWKWDN 963

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV+ LP++  K   +WVIVD L+KSAHFL ++ T S+  LAKLY++EIVRLHG+ V
Sbjct: 964  ITMDFVIGLPRTRSKKNGVWVIVDCLTKSAHFLAMKTTDSMNSLAKLYIQEIVRLHGILV 1023

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD +FTS FW+SLQ  +GTQLNF+TAFHPQTDGQSER IQILEDMLRA V+D GG
Sbjct: 1024 SIVSDRDPKFTSQFWQSLQRALGTQLNFNTAFHPQTDGQSERVIQILEDMLRACVLDFGG 1083

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            NW D LPL EFAYNNSYQ++I  APYEALYGR CRSPL W E
Sbjct: 1084 NWADYLPLAEFAYNNSYQSSIXXAPYEALYGRPCRSPLCWIE 1125


>ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508774422|gb|EOY21678.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 448

 Score =  337 bits (863), Expect = 3e-90
 Identities = 150/222 (67%), Positives = 183/222 (82%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 40  HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 99

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 100 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 159

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 160 SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGV 219

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 220 RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 261


>ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508774222|gb|EOY21478.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 878

 Score =  337 bits (863), Expect = 3e-90
 Identities = 150/222 (67%), Positives = 183/222 (82%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 648  HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 707

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 708  IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 767

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 768  SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGV 827

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 828  RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 869


>ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
           cacao] gi|508716770|gb|EOY08667.1| Retrotransposon
           protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 521

 Score =  337 bits (863), Expect = 3e-90
 Identities = 150/222 (67%), Positives = 183/222 (82%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 113 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 172

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 173 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 232

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 233 SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGV 292

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 293 RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 334


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  335 bits (859), Expect = 8e-90
 Identities = 155/222 (69%), Positives = 181/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG TKMY DLK ++WW G+K+DV  FV +CL CQQVKA H+ P G LQ LPIP WKWE 
Sbjct: 539  HPGGTKMYKDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSLPIPVWKWEK 598

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP+S   + AIWVIVDRL+KSAHF+PI  T++ E+LA++Y+ EIVRLHGVP 
Sbjct: 599  ITMDFVTGLPRSQAGHDAIWVIVDRLTKSAHFIPIHTTWTGERLAQVYLDEIVRLHGVPT 658

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRDTRF SHFW+SLQD +GT+L+FSTAFHPQ+DGQSERTIQ LEDMLRA V+D  G
Sbjct: 659  SIVSDRDTRFVSHFWRSLQDALGTRLDFSTAFHPQSDGQSERTIQTLEDMLRACVIDFQG 718

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             W   LP+ EFAYNNSYQA+I MAP+EALYGRKCRSPL+W E
Sbjct: 719  GWSQHLPMAEFAYNNSYQASIKMAPFEALYGRKCRSPLHWSE 760


>ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508779195|gb|EOY26451.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 679

 Score =  334 bits (857), Expect = 1e-89
 Identities = 149/222 (67%), Positives = 182/222 (81%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 271 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 330

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           I MDFV  LP++     +IW++VD+L+KSAHFLP++ T+     A++YV EIVRLHG+P+
Sbjct: 331 IAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVYVDEIVRLHGIPI 390

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 391 SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGV 450

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 451 RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 492


>ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
           gi|508728383|gb|EOY20280.1| Uncharacterized protein
           TCM_045699 [Theobroma cacao]
          Length = 415

 Score =  334 bits (856), Expect = 2e-89
 Identities = 149/222 (67%), Positives = 183/222 (82%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 7   HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPTGLLQPLPVPEWKWEH 66

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           I MDFV  LP++     +IW++VDRL+KSAHFL ++ T+   + A++YV EIVRLHG+P+
Sbjct: 67  IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLLVKTTYGAAQYARVYVDEIVRLHGIPI 126

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDR+ +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 127 SIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGV 186

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 187 KWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 228


>ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508779254|gb|EOY26510.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1290

 Score =  333 bits (855), Expect = 2e-89
 Identities = 152/222 (68%), Positives = 180/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPGSTKMY  +K  +WW GMKRD+A FV +CL CQQ+KA HQ+  G LQPLPIPEWKWEH
Sbjct: 901  HPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLICQQIKAEHQKSSGTLQPLPIPEWKWEH 960

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            + MDFV+ LP++     AIWVI+ RL+KSAHFL I  T+S+E+LA+LY+ E+VRLHGVPV
Sbjct: 961  VTMDFVLGLPRTQSGKDAIWVIMGRLTKSAHFLAIHSTYSIERLARLYIDEVVRLHGVPV 1020

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD RFTS FW   Q+ +GT+L FSTAFHPQ DGQSERTIQ LEDMLRA V+D   
Sbjct: 1021 SIVSDRDPRFTSRFWPKFQEALGTKLRFSTAFHPQIDGQSERTIQTLEDMLRACVIDFIR 1080

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            +W+  LPLVEFAYNNS+Q++IGMA YEALYGRKCR+PL WDE
Sbjct: 1081 SWDRHLPLVEFAYNNSFQSSIGMATYEALYGRKCRTPLCWDE 1122


>ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508711429|gb|EOY03326.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  333 bits (855), Expect = 2e-89
 Identities = 149/222 (67%), Positives = 182/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 1039 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 1098

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 1099 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 1158

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTIQ LE MLRA V+D G 
Sbjct: 1159 SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRACVIDLGV 1218

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 1219 RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 1260


>ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716762|gb|EOY08659.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 937

 Score =  333 bits (854), Expect = 3e-89
 Identities = 149/222 (67%), Positives = 182/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 501  HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 560

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 561  IAMDFVTGLPRTNGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 620

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSE TIQ LEDMLRA V+D G 
Sbjct: 621  SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSEWTIQTLEDMLRACVIDLGV 680

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 681  RWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 722


>ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
           gi|462417788|gb|EMJ22433.1| hypothetical protein
           PRUPE_ppb019121mg [Prunus persica]
          Length = 552

 Score =  333 bits (854), Expect = 3e-89
 Identities = 149/222 (67%), Positives = 180/222 (81%)
 Frame = +3

Query: 3   HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
           HPGSTKMY  L+  +WW  MK+++A +V RCL CQQVKA  Q+P G+LQPLPIPEWKWE 
Sbjct: 145 HPGSTKMYHTLREHYWWPFMKKEIAEYVRRCLICQQVKAERQKPSGLLQPLPIPEWKWER 204

Query: 183 INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
           I MDFV  LP++  K+  +WVIVDRL+KSAHFLP++  +SL KLAK+++ EIVRLHGVPV
Sbjct: 205 ITMDFVFKLPRTQSKHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPV 264

Query: 363 SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
           SI SDRD RFTS FW  L +  GTQL FSTAFHPQTDGQSERTIQ LEDMLRA  +   G
Sbjct: 265 SIVSDRDPRFTSRFWTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLEDMLRACALQFRG 324

Query: 543 NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
           +W++ LPL+EFAYNNSYQ +IGM+P++ALYGR+CR+P YWDE
Sbjct: 325 DWDEKLPLMEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWDE 366


>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score =  332 bits (852), Expect = 5e-89
 Identities = 148/222 (66%), Positives = 186/222 (83%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DL+  +WW GM+RD+A FV RCL CQQVKA H RP G+ + LPIPEWKWE 
Sbjct: 1203 HPGTTKMYRDLRQHYWWSGMRRDIADFVSRCLCCQQVKAEHLRPGGVFKRLPIPEWKWER 1262

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDF+V LP++ +   +IWVIVDRL+KSAHFLP+Q +FS E+LA++Y++E+VRLHGVPV
Sbjct: 1263 ITMDFIVGLPRTPRGVDSIWVIVDRLTKSAHFLPVQCSFSAERLARIYIREVVRLHGVPV 1322

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR ++FTS+FW++ QD +GT+++ STAFHPQTDGQSERTIQ+LEDMLRA V+D GG
Sbjct: 1323 SIISDRGSQFTSNFWRTFQDELGTRVDLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGG 1382

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             W+  LPL EFAYNNSY ++I MAP+EALYGR+CRSP+ W E
Sbjct: 1383 QWDQFLPLAEFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 1424


>ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
            gi|508727367|gb|EOY19264.1| Uncharacterized protein
            TCM_044274 [Theobroma cacao]
          Length = 860

 Score =  332 bits (851), Expect = 6e-89
 Identities = 148/222 (66%), Positives = 182/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 482  HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 541

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++     +IW++VDRL+KSAHFLP++ T+   + A++YV EIVRLHG+P+
Sbjct: 542  IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPI 601

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR  +FTS FW  LQ+ +GT+L+FSTAFHPQTDGQSERTI+ LEDMLRA V+D G 
Sbjct: 602  SIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIKTLEDMLRACVIDLGV 661

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             WE  LPLVEFAYNNS+Q +I MA +EALYGR+CRSP+ W E
Sbjct: 662  KWEQYLPLVEFAYNNSFQTSIQMAAFEALYGRRCRSPIGWLE 703


>ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716781|gb|EOY08678.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 666

 Score =  332 bits (851), Expect = 6e-89
 Identities = 148/222 (66%), Positives = 181/222 (81%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPG+TKMY DLK V+WW+G+KRDVA FV +CL CQQVKA HQ+P G+LQPLP+PEWKWEH
Sbjct: 365  HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 424

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++     +IW++VDRL+KSAHFL ++ T+   + A++YV EIVRLHG+P+
Sbjct: 425  IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPI 484

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDR  +FTS FW  LQ+ +GT+L+FST FHPQTDGQSERTIQ LEDMLRA V+D G 
Sbjct: 485  SIVSDRGAQFTSRFWGKLQEALGTKLDFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGV 544

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
             WE  LPLVEFAYNNS+Q +I MAP+EALYGR+CRSP+ W E
Sbjct: 545  KWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 586


>ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prunus persica]
            gi|462394119|gb|EMJ00023.1| hypothetical protein
            PRUPE_ppb020037mg [Prunus persica]
          Length = 1279

 Score =  331 bits (848), Expect = 1e-88
 Identities = 148/221 (66%), Positives = 179/221 (80%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            HPGSTKMY  L+  +WW  MK+++A +V RCL CQQVKA  Q+P G+LQPLPIPEWKWE 
Sbjct: 900  HPGSTKMYHTLREHYWWPFMKKEIAEYVRRCLICQQVKAERQKPSGLLQPLPIPEWKWER 959

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            I MDFV  LP++  K+  +WVIVDRL+KSAHFLP++  +SL KLAK+++ EIVRLHGVPV
Sbjct: 960  ITMDFVFKLPRTHSKHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPV 1019

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD RFTS FW  L +  GTQL FSTAFHPQTDGQSERTIQ LEDMLRA  +   G
Sbjct: 1020 SIVSDRDPRFTSRFWTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLEDMLRACALQFRG 1079

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWD 665
            +W++ LPL+EFAYNNSYQ +IGM+P++ALYGR+CR+P YWD
Sbjct: 1080 DWDEKLPLMEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWD 1120


>ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716756|gb|EOY08653.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1110

 Score =  330 bits (845), Expect = 3e-88
 Identities = 152/222 (68%), Positives = 179/222 (80%)
 Frame = +3

Query: 3    HPGSTKMYMDLKNVFWWKGMKRDVASFVERCLACQQVKAVHQRPYGMLQPLPIPEWKWEH 182
            H  STKMY  +K  +WW GMKRD+A FV +CL CQQ+KA HQ+  G LQPLPIPEWKWEH
Sbjct: 807  HLESTKMYRTIKESYWWPGMKRDIAEFVAKCLTCQQIKAEHQKLSGTLQPLPIPEWKWEH 866

Query: 183  INMDFVVNLPKSVQKNTAIWVIVDRLSKSAHFLPIQMTFSLEKLAKLYVKEIVRLHGVPV 362
            + MDFV+ L ++     AIWVIVDRL+KSAHFL I  T+S+EKL KLY+ EIVRL+GVP+
Sbjct: 867  VTMDFVLGLLRTQSGKDAIWVIVDRLTKSAHFLAIHNTYSIEKLVKLYIDEIVRLYGVPI 926

Query: 363  SITSDRDTRFTSHFWKSLQDCMGTQLNFSTAFHPQTDGQSERTIQILEDMLRAIVVDKGG 542
            SI SDRD RFTS FW   Q+ +GT+L FSTAFHPQTDGQSERTIQ LEDMLRA V+D  G
Sbjct: 927  SIVSDRDPRFTSRFWSKFQEALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIG 986

Query: 543  NWEDLLPLVEFAYNNSYQATIGMAPYEALYGRKCRSPLYWDE 668
            +W+  LPLVEFAYNNS+Q++IGMAPYEALYGRKC++P  WDE
Sbjct: 987  SWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCQTPFCWDE 1028


Top