BLASTX nr result

ID: Cornus23_contig00025825 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00025825
         (291 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007022333.1| Uncharacterized protein TCM_032543 [Theobrom...   147   4e-33
ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] g...   145   1e-32
ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus ...   144   2e-32
ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800...   143   4e-32
ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634...   143   4e-32
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   143   6e-32
ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [The...   142   1e-31
ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [The...   142   1e-31
ref|XP_007019770.1| Uncharacterized protein TCM_036100 [Theobrom...   142   1e-31
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...   141   2e-31
ref|XP_007032766.1| Gag protease polyprotein [Theobroma cacao] g...   141   2e-31
ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobrom...   140   3e-31
ref|XP_007037156.1| DNA/RNA polymerases superfamily protein [The...   140   4e-31
ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [The...   140   4e-31
ref|XP_007033074.1| Uncharacterized protein TCM_019247 [Theobrom...   140   4e-31
ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prun...   140   5e-31
ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prun...   140   5e-31
emb|CAN81132.1| hypothetical protein VITISV_009934 [Vitis vinifera]   140   5e-31
emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]   140   5e-31
ref|XP_012453364.1| PREDICTED: uncharacterized protein LOC105775...   139   6e-31

>ref|XP_007022333.1| Uncharacterized protein TCM_032543 [Theobroma cacao]
           gi|508721961|gb|EOY13858.1| Uncharacterized protein
           TCM_032543 [Theobroma cacao]
          Length = 189

 Score =  147 bits (370), Expect = 4e-33
 Identities = 70/93 (75%), Positives = 79/93 (84%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW 
Sbjct: 15  DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPISIVSDRDPRFTSRFWP 74

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA++T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 75  KFQEALETKLKFSTAFHPQTDGQSERTIQTLED 107


>ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao]
           gi|508702196|gb|EOX94092.1| Gag protease polyprotein
           [Theobroma cacao]
          Length = 269

 Score =  145 bits (366), Expect = 1e-32
 Identities = 70/94 (74%), Positives = 79/94 (84%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           +DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 87  NDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSIVSDRDPRFTSRFW 146

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 147 LKFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 180


>ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis]
            gi|587945430|gb|EXC31837.1| Transposon Ty3-I Gag-Pol
            polyprotein [Morus notabilis]
          Length = 1088

 Score =  144 bits (364), Expect = 2e-32
 Identities = 65/94 (69%), Positives = 80/94 (85%)
 Frame = -3

Query: 283  HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
            +DA+WV+VDRLTKTAHFIP+    ++  LC+LY++RIV  HGVP++IVSDRD +FTS+FW
Sbjct: 910  YDAVWVVVDRLTKTAHFIPIRADYKVPKLCRLYIERIVTLHGVPVSIVSDRDAQFTSKFW 969

Query: 103  KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            KG Q A+ T L+FSTAFHPQTDGQSER IQILED
Sbjct: 970  KGLQNALGTELRFSTAFHPQTDGQSERVIQILED 1003


>ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800880, partial [Gossypium
            raimondii]
          Length = 1085

 Score =  143 bits (361), Expect = 4e-32
 Identities = 67/96 (69%), Positives = 79/96 (82%)
 Frame = -3

Query: 289  QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
            ++ D+IWVIVDRLTK+AHFIP+    ++E L +LYV  IVR HGVP++I+SDRDPRFTSR
Sbjct: 737  KKKDSIWVIVDRLTKSAHFIPVRTDYQLEKLAELYVSEIVRLHGVPISIISDRDPRFTSR 796

Query: 109  FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FW   QEA+ T L FSTAFHPQTDGQSER IQILED
Sbjct: 797  FWSKLQEALGTKLNFSTAFHPQTDGQSERVIQILED 832


>ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634770 [Jatropha curcas]
          Length = 1963

 Score =  143 bits (361), Expect = 4e-32
 Identities = 65/96 (67%), Positives = 79/96 (82%)
 Frame = -3

Query: 289 QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
           ++HDA+WVIVDRLTK+AHF+P+     +E L ++Y+  IVR HGVP++IVSDRDPRFTSR
Sbjct: 410 KKHDAVWVIVDRLTKSAHFLPIRSNYSLEKLAEMYIGEIVRLHGVPVSIVSDRDPRFTSR 469

Query: 109 FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           FW   Q+A+ T L FSTAFHPQTDGQSER IQILED
Sbjct: 470 FWASLQKALGTRLNFSTAFHPQTDGQSERIIQILED 505


>gb|AIG55302.1| gag-pol, partial [Camellia sinensis]
          Length = 923

 Score =  143 bits (360), Expect = 6e-32
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWV+VDRLTK+AHFIPM VRD M+ L  LY+  +VR HGVP+TIVSDRDP FT+R W+
Sbjct: 590 DAIWVVVDRLTKSAHFIPMRVRDSMDHLADLYIRDVVRLHGVPVTIVSDRDPCFTARLWQ 649

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             Q A+ T L FSTA+HPQTDGQSERTIQILED
Sbjct: 650 SLQSALGTKLTFSTAYHPQTDGQSERTIQILED 682


>ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508708185|gb|EOY00082.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1515

 Score =  142 bits (358), Expect = 1e-31
 Identities = 69/93 (74%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGV ++IVSDRDPRFTSRFW 
Sbjct: 1175 DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVSVSIVSDRDPRFTSRFWP 1234

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 1235 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 1267


>ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508702307|gb|EOX94203.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1336

 Score =  142 bits (358), Expect = 1e-31
 Identities = 68/93 (73%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            D IWVIVD+LTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW 
Sbjct: 1093 DVIWVIVDQLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 1152

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 1153 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 1185


>ref|XP_007019770.1| Uncharacterized protein TCM_036100 [Theobroma cacao]
           gi|508725098|gb|EOY16995.1| Uncharacterized protein
           TCM_036100 [Theobroma cacao]
          Length = 160

 Score =  142 bits (357), Expect = 1e-31
 Identities = 67/93 (72%), Positives = 78/93 (83%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D +VR HGVP++IVSDRDPRFTSRFW 
Sbjct: 15  DAIWVIVDRLTKSAHFLAIHSTFSIERLARLYIDEVVRLHGVPVSIVSDRDPRFTSRFWL 74

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 75  KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 107


>ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508702098|gb|EOX93994.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 811

 Score =  141 bits (356), Expect = 2e-31
 Identities = 66/93 (70%), Positives = 78/93 (83%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D +VR HGVP++IVSDRDPRFTSRFW 
Sbjct: 589 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPRFTSRFWP 648

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FST+FHPQTDGQSERTIQ LED
Sbjct: 649 KFQEALGTKLRFSTSFHPQTDGQSERTIQTLED 681


>ref|XP_007032766.1| Gag protease polyprotein [Theobroma cacao]
           gi|508711795|gb|EOY03692.1| Gag protease polyprotein
           [Theobroma cacao]
          Length = 689

 Score =  141 bits (355), Expect = 2e-31
 Identities = 68/93 (73%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP+ IVSD+DPRFTSRFW 
Sbjct: 497 DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVFIVSDQDPRFTSRFWP 556

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T LKFSTAFHPQTDGQSERTIQ L+D
Sbjct: 557 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLKD 589


>ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobroma cacao]
           gi|508722241|gb|EOY14138.1| Uncharacterized protein
           TCM_033423 [Theobroma cacao]
          Length = 809

 Score =  140 bits (354), Expect = 3e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSRFW 
Sbjct: 693 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 752

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            F EA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 753 KFHEALGTKLRFSTAFHPQTDGQSERTIQTLED 785


>ref|XP_007037156.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508774401|gb|EOY21657.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1188

 Score =  140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSR W 
Sbjct: 996  DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPISIVSDRDPRFTSRLWL 1055

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 1056 KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 1088


>ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716756|gb|EOY08653.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1110

 Score =  140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 78/93 (83%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR +GVP++IVSDRDPRFTSRFW 
Sbjct: 883  DAIWVIVDRLTKSAHFLAIHNTYSIEKLVKLYIDEIVRLYGVPISIVSDRDPRFTSRFWS 942

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 943  KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 975


>ref|XP_007033074.1| Uncharacterized protein TCM_019247 [Theobroma cacao]
           gi|508712103|gb|EOY04000.1| Uncharacterized protein
           TCM_019247 [Theobroma cacao]
          Length = 544

 Score =  140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSRFW 
Sbjct: 169 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 228

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FSTAFHPQ DGQSERTIQ LED
Sbjct: 229 KFQEALGTKLRFSTAFHPQKDGQSERTIQTLED 261


>ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
           gi|462417788|gb|EMJ22433.1| hypothetical protein
           PRUPE_ppb019121mg [Prunus persica]
          Length = 552

 Score =  140 bits (352), Expect = 5e-31
 Identities = 63/95 (66%), Positives = 76/95 (80%)
 Frame = -3

Query: 286 RHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRF 107
           +HD +WVIVDRLTK+AHF+P+     +  L ++++D IVR HGVP++IVSDRDPRFTSRF
Sbjct: 219 KHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRF 278

Query: 106 WKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           W    EA  T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 279 WTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLED 313


>ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prunus persica]
            gi|462394119|gb|EMJ00023.1| hypothetical protein
            PRUPE_ppb020037mg [Prunus persica]
          Length = 1279

 Score =  140 bits (352), Expect = 5e-31
 Identities = 63/95 (66%), Positives = 76/95 (80%)
 Frame = -3

Query: 286  RHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRF 107
            +HD +WVIVDRLTK+AHF+P+     +  L ++++D IVR HGVP++IVSDRDPRFTSRF
Sbjct: 974  KHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRF 1033

Query: 106  WKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            W    EA  T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 1034 WTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLED 1068


>emb|CAN81132.1| hypothetical protein VITISV_009934 [Vitis vinifera]
          Length = 730

 Score =  140 bits (352), Expect = 5e-31
 Identities = 65/94 (69%), Positives = 76/94 (80%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           ++AIWVIVDRLTK+AHF+PM V   M+ L  LY+  IVR HGVPL+IVSDRDP FTSRFW
Sbjct: 559 NNAIWVIVDRLTKSAHFLPMKVNFSMDHLASLYIKEIVRMHGVPLSIVSDRDPHFTSRFW 618

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
              Q+A+ T L FSTAFHPQTDGQS+R IQ+LED
Sbjct: 619 HSLQKALSTKLSFSTAFHPQTDGQSDRVIQVLED 652


>emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]
          Length = 984

 Score =  140 bits (352), Expect = 5e-31
 Identities = 65/94 (69%), Positives = 77/94 (81%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           ++AIWVIVDRLTK+AHF+PM V   ++ L  LYV  IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 643 NNAIWVIVDRLTKSAHFLPMKVNFSLDRLASLYVKEIVRMHGVPVSIVSDRDPRFTSRFW 702

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
              Q+++ T L FSTAFHPQTDGQSER IQ+LED
Sbjct: 703 HSLQKSLGTKLSFSTAFHPQTDGQSERVIQVLED 736


>ref|XP_012453364.1| PREDICTED: uncharacterized protein LOC105775392, partial [Gossypium
           raimondii]
          Length = 653

 Score =  139 bits (351), Expect = 6e-31
 Identities = 66/96 (68%), Positives = 78/96 (81%)
 Frame = -3

Query: 289 QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
           ++ DAIWVIVDRLTK+AHFIP+ +   ++ L +LYV  IVR HGVP +I+SDRDPRFTSR
Sbjct: 318 KKKDAIWVIVDRLTKSAHFIPIRIDYSLDRLAELYVAEIVRLHGVPKSIISDRDPRFTSR 377

Query: 109 FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           FW   QEA+ T L FSTAFHPQTDGQSER IQ+LED
Sbjct: 378 FWIKLQEALGTKLNFSTAFHPQTDGQSERMIQVLED 413


Top