BLASTX nr result
ID: Cornus23_contig00025825
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00025825 (291 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007022333.1| Uncharacterized protein TCM_032543 [Theobrom...   147   4e-33
ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] g...   145   1e-32
ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus ...   144   2e-32
ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800...   143   4e-32
ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634...   143   4e-32
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   143   6e-32
ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [The...   142   1e-31
ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [The...   142   1e-31
ref|XP_007019770.1| Uncharacterized protein TCM_036100 [Theobrom...   142   1e-31
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...   141   2e-31
ref|XP_007032766.1| Gag protease polyprotein [Theobroma cacao] g...   141   2e-31
ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobrom...   140   3e-31
ref|XP_007037156.1| DNA/RNA polymerases superfamily protein [The...   140   4e-31
ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [The...   140   4e-31
ref|XP_007033074.1| Uncharacterized protein TCM_019247 [Theobrom...   140   4e-31
ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prun...   140   5e-31
ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prun...   140   5e-31
emb|CAN81132.1| hypothetical protein VITISV_009934 [Vitis vinifera]   140   5e-31
emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]   140   5e-31
ref|XP_012453364.1| PREDICTED: uncharacterized protein LOC105775...   139   6e-31

>ref|XP_007022333.1| Uncharacterized protein TCM_032543 [Theobroma cacao]
 gi|508721961|gb|EOY13858.1| Uncharacterized protein TCM_032543 [Theobroma cacao]
          Length = 189

 Score = 147 bits (370), Expect = 4e-33
 Identities = 70/93 (75%), Positives = 79/93 (84%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 15  DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPISIVSDRDPRFTSRFWP 74

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA++T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 75  KFQEALETKLKFSTAFHPQTDGQSERTIQTLED 107

>ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao]
 gi|508702196|gb|EOX94092.1| Gag protease polyprotein [Theobroma cacao]
          Length = 269

 Score = 145 bits (366), Expect = 1e-32
 Identities = 70/94 (74%), Positives = 79/94 (84%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           +DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 87  NDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSIVSDRDPRFTSRFW 146

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 147 LKFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 180

>ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis]
 gi|587945430|gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis]
          Length = 1088

 Score = 144 bits (364), Expect = 2e-32
 Identities = 65/94 (69%), Positives = 80/94 (85%)
 Frame = -3

Query: 283  HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
            +DA+WV+VDRLTKTAHFIP+    ++  LC+LY++RIV  HGVP++IVSDRD +FTS+FW
Sbjct: 910  YDAVWVVVDRLTKTAHFIPIRADYKVPKLCRLYIERIVTLHGVPVSIVSDRDAQFTSKFW 969

Query: 103  KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            KG Q A+ T L+FSTAFHPQTDGQSER IQILED
Sbjct: 970  KGLQNALGTELRFSTAFHPQTDGQSERVIQILED 1003

>ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800880, partial [Gossypium raimondii]
          Length = 1085

 Score = 143 bits (361), Expect = 4e-32
 Identities = 67/96 (69%), Positives = 79/96 (82%)
 Frame = -3

Query: 289 QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
           ++ D+IWVIVDRLTK+AHFIP+    ++E L +LYV  IVR HGVP++I+SDRDPRFTSR
Sbjct: 737 KKKDSIWVIVDRLTKSAHFIPVRTDYQLEKLAELYVSEIVRLHGVPISIISDRDPRFTSR 796

Query: 109 FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           FW   QEA+ T L FSTAFHPQTDGQSER IQILED
Sbjct: 797 FWSKLQEALGTKLNFSTAFHPQTDGQSERVIQILED 832

>ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634770 [Jatropha curcas]
          Length = 1963

 Score = 143 bits (361), Expect = 4e-32
 Identities = 65/96 (67%), Positives = 79/96 (82%)
 Frame = -3

Query: 289 QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
           ++HDA+WVIVDRLTK+AHF+P+     +E L ++Y+  IVR HGVP++IVSDRDPRFTSR
Sbjct: 410 KKHDAVWVIVDRLTKSAHFLPIRSNYSLEKLAEMYIGEIVRLHGVPVSIVSDRDPRFTSR 469

Query: 109 FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           FW   Q+A+ T L FSTAFHPQTDGQSER IQILED
Sbjct: 470 FWASLQKALGTRLNFSTAFHPQTDGQSERIIQILED 505

>gb|AIG55302.1| gag-pol, partial [Camellia sinensis]
          Length = 923

 Score = 143 bits (360), Expect = 6e-32
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWV+VDRLTK+AHFIPM VRD M+ L  LY+  +VR HGVP+TIVSDRDP FT+R W+
Sbjct: 590 DAIWVVVDRLTKSAHFIPMRVRDSMDHLADLYIRDVVRLHGVPVTIVSDRDPCFTARLWQ 649

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             Q A+ T L FSTA+HPQTDGQSERTIQILED
Sbjct: 650 SLQSALGTKLTFSTAYHPQTDGQSERTIQILED 682

>ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508708185|gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1515

 Score = 142 bits (358), Expect = 1e-31
 Identities = 69/93 (74%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGV ++IVSDRDPRFTSRFW
Sbjct: 1175 DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVSVSIVSDRDPRFTSRFWP 1234

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 1235 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 1267

>ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508702307|gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1336

 Score = 142 bits (358), Expect = 1e-31
 Identities = 68/93 (73%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            D IWVIVD+LTK+AHF+ +     +E L QLY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 1093 DVIWVIVDQLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 1152

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T LKFSTAFHPQTDGQSERTIQ LED
Sbjct: 1153 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 1185

>ref|XP_007019770.1| Uncharacterized protein TCM_036100 [Theobroma cacao]
 gi|508725098|gb|EOY16995.1| Uncharacterized protein TCM_036100 [Theobroma cacao]
          Length = 160

 Score = 142 bits (357), Expect = 1e-31
 Identities = 67/93 (72%), Positives = 78/93 (83%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D +VR HGVP++IVSDRDPRFTSRFW
Sbjct: 15  DAIWVIVDRLTKSAHFLAIHSTFSIERLARLYIDEVVRLHGVPVSIVSDRDPRFTSRFWL 74

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 75  KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 107

>ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508702098|gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 811

 Score = 141 bits (356), Expect = 2e-31
 Identities = 66/93 (70%), Positives = 78/93 (83%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D +VR HGVP++IVSDRDPRFTSRFW
Sbjct: 589 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPRFTSRFWP 648

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FST+FHPQTDGQSERTIQ LED
Sbjct: 649 KFQEALGTKLRFSTSFHPQTDGQSERTIQTLED 681

>ref|XP_007032766.1| Gag protease polyprotein [Theobroma cacao]
 gi|508711795|gb|EOY03692.1| Gag protease polyprotein [Theobroma cacao]
          Length = 689

 Score = 141 bits (355), Expect = 2e-31
 Identities = 68/93 (73%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L QLY+D IVR HGVP+ IVSD+DPRFTSRFW
Sbjct: 497 DAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVFIVSDQDPRFTSRFWP 556

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T LKFSTAFHPQTDGQSERTIQ L+D
Sbjct: 557 KFQEALGTKLKFSTAFHPQTDGQSERTIQTLKD 589

>ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobroma cacao]
 gi|508722241|gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao]
          Length = 809

 Score = 140 bits (354), Expect = 3e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 693 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 752

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            F EA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 753 KFHEALGTKLRFSTAFHPQTDGQSERTIQTLED 785

>ref|XP_007037156.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508774401|gb|EOY21657.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1188

 Score = 140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280  DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
            DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSR W
Sbjct: 996  DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPISIVSDRDPRFTSRLWL 1055

Query: 100  GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
             FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 1056 KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 1088

>ref|XP_007028151.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508716756|gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1110

 Score = 140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 78/93 (83%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR +GVP++IVSDRDPRFTSRFW
Sbjct: 883 DAIWVIVDRLTKSAHFLAIHNTYSIEKLVKLYIDEIVRLYGVPISIVSDRDPRFTSRFWS 942

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 943 KFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 975

>ref|XP_007033074.1| Uncharacterized protein TCM_019247 [Theobroma cacao]
 gi|508712103|gb|EOY04000.1| Uncharacterized protein TCM_019247 [Theobroma cacao]
          Length = 544

 Score = 140 bits (353), Expect = 4e-31
 Identities = 67/93 (72%), Positives = 77/93 (82%)
 Frame = -3

Query: 280 DAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFWK 101
           DAIWVIVDRLTK+AHF+ +     +E L +LY+D IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 169 DAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDPRFTSRFWP 228

Query: 100 GFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            FQEA+ T L+FSTAFHPQ DGQSERTIQ LED
Sbjct: 229 KFQEALGTKLRFSTAFHPQKDGQSERTIQTLED 261

>ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
 gi|462417788|gb|EMJ22433.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
          Length = 552

 Score = 140 bits (352), Expect = 5e-31
 Identities = 63/95 (66%), Positives = 76/95 (80%)
 Frame = -3

Query: 286 RHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRF 107
           +HD +WVIVDRLTK+AHF+P+     +  L ++++D IVR HGVP++IVSDRDPRFTSRF
Sbjct: 219 KHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRF 278

Query: 106 WKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           W    EA  T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 279 WTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLED 313

>ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prunus persica]
 gi|462394119|gb|EMJ00023.1| hypothetical protein PRUPE_ppb020037mg [Prunus persica]
          Length = 1279

 Score = 140 bits (352), Expect = 5e-31
 Identities = 63/95 (66%), Positives = 76/95 (80%)
 Frame = -3

Query: 286  RHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRF 107
            +HD +WVIVDRLTK+AHF+P+     +  L ++++D IVR HGVP++IVSDRDPRFTSRF
Sbjct: 974  KHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRF 1033

Query: 106  WKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
            W    EA  T L+FSTAFHPQTDGQSERTIQ LED
Sbjct: 1034 WTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLED 1068

>emb|CAN81132.1| hypothetical protein VITISV_009934 [Vitis vinifera]
          Length = 730

 Score = 140 bits (352), Expect = 5e-31
 Identities = 65/94 (69%), Positives = 76/94 (80%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           ++AIWVIVDRLTK+AHF+PM V   M+ L  LY+  IVR HGVPL+IVSDRDP FTSRFW
Sbjct: 559 NNAIWVIVDRLTKSAHFLPMKVNFSMDHLASLYIKEIVRMHGVPLSIVSDRDPHFTSRFW 618

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
              Q+A+ T L FSTAFHPQTDGQS+R IQ+LED
Sbjct: 619 HSLQKALSTKLSFSTAFHPQTDGQSDRVIQVLED 652

>emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]
          Length = 984

 Score = 140 bits (352), Expect = 5e-31
 Identities = 65/94 (69%), Positives = 77/94 (81%)
 Frame = -3

Query: 283 HDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSRFW 104
           ++AIWVIVDRLTK+AHF+PM V   ++ L  LYV  IVR HGVP++IVSDRDPRFTSRFW
Sbjct: 643 NNAIWVIVDRLTKSAHFLPMKVNFSLDRLASLYVKEIVRMHGVPVSIVSDRDPRFTSRFW 702

Query: 103 KGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
              Q+++ T L FSTAFHPQTDGQSER IQ+LED
Sbjct: 703 HSLQKSLGTKLSFSTAFHPQTDGQSERVIQVLED 736

>ref|XP_012453364.1| PREDICTED: uncharacterized protein LOC105775392, partial [Gossypium raimondii]
          Length = 653

 Score = 139 bits (351), Expect = 6e-31
 Identities = 66/96 (68%), Positives = 78/96 (81%)
 Frame = -3

Query: 289 QRHDAIWVIVDRLTKTAHFIPMSVRDRMETLCQLYVDRIVRYHGVPLTIVSDRDPRFTSR 110
           ++ DAIWVIVDRLTK+AHFIP+ +   ++ L +LYV  IVR HGVP +I+SDRDPRFTSR
Sbjct: 318 KKKDAIWVIVDRLTKSAHFIPIRIDYSLDRLAELYVAEIVRLHGVPKSIISDRDPRFTSR 377

Query: 109 FWKGFQEAMDTHLKFSTAFHPQTDGQSERTIQILED 2
           FW   QEA+ T L FSTAFHPQTDGQSER IQ+LED
Sbjct: 378 FWIKLQEALGTKLNFSTAFHPQTDGQSERMIQVLED 413