BLASTX nr result
ID: Coptis21_contig00000517
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00000517
         (2148 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002285216.1| PREDICTED: protein-tyrosine sulfotransferase...   612   e-172
ref|XP_003538159.1| PREDICTED: protein-tyrosine sulfotransferase...   567   e-159
ref|XP_002529939.1| conserved hypothetical protein [Ricinus comm...   559   e-156
ref|XP_002314278.1| predicted protein [Populus trichocarpa] gi|2...   541   e-151
ref|XP_002436516.1| hypothetical protein SORBIDRAFT_10g004010 [S...   509   e-142

>ref|XP_002285216.1| PREDICTED: protein-tyrosine sulfotransferase [Vitis vinifera]
 gi|297746268|emb|CBI16324.3| unnamed protein product [Vitis vinifera]
          Length = 512

 Score = 612 bits (1577), Expect = e-172
 Identities = 314/486 (64%), Positives = 372/486 (76%), Gaps = 6/486 (1%)
 Frame = +3

Query: 345  VHVKSSEDDFVKCESGVKKWAASSLDEQVKKDKHTLQDLLFFLHVPRTGGRTYFHCFLRK 524
            V+ ++ DF CE VKKWA+SSLD +VK+DKHTLQDLLFFLHVPRTGGRTYFHCFL++
Sbjct: 28   VNASPAKHDFGHCERTVKKWASSSLDLEVKEDKHTLQDLLFFLHVPRTGGRTYFHCFLKR 87

Query: 525  LYSTYLECPRSYDKLRFDPSKPNCRLMVTHDDYSMMSRLPEEKTSVVTILRKPVDRVLST 704
            LY + LECPRSYDKLRFDPSKPNCRL+VTHDDYSMMS+LP EKTSVVTILR P+DRV S
Sbjct: 88   LYPSSLECPRSYDKLRFDPSKPNCRLLVTHDDYSMMSKLPREKTSVVTILRNPLDRVFSA 147

Query: 705  YEFSVEVAARFLVHPNLTSATQMTGRIRPKSNGVSTLDIWPWKYLVPWMREDLFSRREGR 884
            YEFSVEVAARFLVHPNLTSA QM RIR K+ GVSTLDIWPWKYLVPWMR+DLF+RR+ R
Sbjct: 148  YEFSVEVAARFLVHPNLTSAKQMALRIRSKTKGVSTLDIWPWKYLVPWMRDDLFARRDAR 207

Query: 885  KRGEFNDFKESNDPYNIQGIVMPLHEFINDPVAHEIIHNGATFQVAGLTNNSCVAEAHEV 1064
            K + ++ + ND YN++ IVMPLHE+INDP+A +IIHNGATFQVAGLTNNS +AE HEV
Sbjct: 208  K-DKGPNYVKGNDSYNMEEIVMPLHEYINDPIARDIIHNGATFQVAGLTNNSYLAEVHEV 266

Query: 1065 RQCVRKYRSLGGYVLDVAKKRLDDMLYVGLTEDHKGSATMFANVVGGQVLSQLHTLSSNM 1244
            R CV+KY++LG +VL+VAKKRLD+MLYVG+TEDHK SATMF N+VG QV+SQL SS+M
Sbjct: 267  RHCVQKYQTLGAFVLEVAKKRLDNMLYVGITEDHKESATMFGNMVGAQVISQLMASSSSM 326

Query: 1245 KKAAYNETDMSSSLEDTDHDTDHLQ---NSTKDEKGK---RLSSTGNAETINETMTVGDL 1406
            + AA N ++ S+S D+ D H Q NST E G+ + ST N ET E +TVG+L
Sbjct: 327  EGAANNLSEQSTSFPDSKSDNSHHQDPNNSTGQEAGEIDSTIPSTENVETTKENITVGEL 386

Query: 1407 MNSYEVCISSLRKAQARRRTASLKRVAPANFSKEARLQISEVVLQNIISLNSLDVELYKH 1586
            M SYEVCISSLRK Q+ RRT SLK ++PANFSKE RLQ+ ++VLQ IISLNSLDVELYK+
Sbjct: 387  MKSYEVCISSLRKTQSYRRTNSLKAISPANFSKETRLQVPQMVLQQIISLNSLDVELYKY 446

Query: 1587 AKYIFAQQGKRLTEKLIEAERQQYAFSNSYRVISWKEFSXXXXXXXXXXXXXXXXXXRRR 1766
            A+ IFA+Q K KL + Q+ F +Y WK S +RR
Sbjct: 447  AQSIFAKQHKHFMRKLDTTDMQESIFDIAYDNPLWKVVSLAISLVCLLLLIFLIVNAKRR 506

Query: 1767 TLKLKI 1784
            T KLKI
Sbjct: 507  TSKLKI 512

>ref|XP_003538159.1| PREDICTED: protein-tyrosine sulfotransferase-like [Glycine max]
          Length = 494

 Score = 567 bits (1461), Expect = e-159
 Identities = 292/475 (61%), Positives = 357/475 (75%), Gaps = 1/475 (0%)
 Frame = +3

Query: 360  SEDDFVKCESGVKKWAASSLDEQVKKD-KHTLQDLLFFLHVPRTGGRTYFHCFLRKLYST 536
            +E+D+ +CES VK WA SSLDE++ KD KHTL+DLLFFLHVPRTGGRTYFHCFL+KLY +
Sbjct: 27   AENDYGRCESVVKSWARSSLDEEMTKDDKHTLRDLLFFLHVPRTGGRTYFHCFLKKLYPS 86

Query: 537  YLECPRSYDKLRFDPSKPNCRLMVTHDDYSMMSRLPEEKTSVVTILRKPVDRVLSTYEFS 716
            YLECPRSYDKLRFDPSKP CRL+VTHDDYS+ S+LP E+TSVVTILR PVDRV STYEFS
Sbjct: 87   YLECPRSYDKLRFDPSKPKCRLLVTHDDYSITSKLPRERTSVVTILRDPVDRVFSTYEFS 146

Query: 717  VEVAARFLVHPNLTSATQMTGRIRPKSNGVSTLDIWPWKYLVPWMREDLFSRREGRKRGE 896
            +EVAARFLVHPNLTSAT+M R+ K+ GVSTLDIWPWKYLVPWMREDLF+RRE R
Sbjct: 147  IEVAARFLVHPNLTSATKMALRLSSKTKGVSTLDIWPWKYLVPWMREDLFARREARYSRG 206

Query: 897  FNDFKESNDPYNIQGIVMPLHEFINDPVAHEIIHNGATFQVAGLTNNSCVAEAHEVRQCV 1076
            N ESND Y+++ MPL E+INDPVA +++HNGATFQVAGLTNNS +AEAHEVR CV
Sbjct: 207  LN-IIESNDSYDMEDFAMPLQEYINDPVAVDVVHNGATFQVAGLTNNSYIAEAHEVRHCV 265

Query: 1077 RKYRSLGGYVLDVAKKRLDDMLYVGLTEDHKGSATMFANVVGGQVLSQLHTLSSNMKKAA 1256
            +KY++LG YVL VAKKRLD+MLYVGLTE+H+ SATMFANVVG QV+SQL+ +++++
Sbjct: 266  QKYKTLGKYVLQVAKKRLDEMLYVGLTEEHRKSATMFANVVGAQVISQLNAPNTSLETTD 325

Query: 1257 YNETDMSSSLEDTDHDTDHLQNSTKDEKGKRLSSTGNAETINETMTVGDLMNSYEVCISS 1436
            E SS D D D+ QNST D ++S+ E MTVG+LM++YEVCIS+
Sbjct: 326  KTE---RSSFTDNDPDSSEHQNSTLDRGESAVTSSEGGEATEFNMTVGELMDAYEVCISN 382

Query: 1437 LRKAQARRRTASLKRVAPANFSKEARLQISEVVLQNIISLNSLDVELYKHAKYIFAQQGK 1616
            LRKAQ+RRR +SLKR++P NF+KEARLQ+ E +L I SLN LD++LY++AK IF +Q K
Sbjct: 383  LRKAQSRRRISSLKRISPVNFTKEARLQVPEEILHKIRSLNDLDLQLYEYAKAIFNKQHK 442

Query: 1617 RLTEKLIEAERQQYAFSNSYRVISWKEFSXXXXXXXXXXXXXXXXXXRRRTLKLK 1781
            T LI E ++Y + W+ + RRRT K+K
Sbjct: 443  --TSLLITEESWDNISGSAYGL--WRVVTLAITCVFFIFLFLLIVNVRRRTSKVK 493

>ref|XP_002529939.1| conserved hypothetical protein [Ricinus communis]
 gi|223530569|gb|EEF32447.1| conserved hypothetical protein [Ricinus communis]
          Length = 433

 Score = 559 bits (1440), Expect = e-156
 Identities = 279/424 (65%), Positives = 336/424 (79%)
 Frame = +3

Query: 363  EDDFVKCESGVKKWAASSLDEQVKKDKHTLQDLLFFLHVPRTGGRTYFHCFLRKLYSTYL 542
            ++DF +CE VKKWA +SL+++VK+DKH L+DLLFFLHVPRTGGRTYFHCFLRKLYS
Sbjct: 24   KNDFSQCEKTVKKWAVASLEQEVKEDKHMLRDLLFFLHVPRTGGRTYFHCFLRKLYSNSQ 83

Query: 543  ECPRSYDKLRFDPSKPNCRLMVTHDDYSMMSRLPEEKTSVVTILRKPVDRVLSTYEFSVE 722
            ECPRSYDKLRFDPSK CRL+VTHDDYSMMS+LP+EKTSVVTILR PVDR+ STYEFS+E
Sbjct: 84   ECPRSYDKLRFDPSKQKCRLLVTHDDYSMMSKLPKEKTSVVTILRNPVDRIFSTYEFSIE 143

Query: 723  VAARFLVHPNLTSATQMTGRIRPKSNGVSTLDIWPWKYLVPWMREDLFSRREGRKRGEFN 902
            V ARFLVHPNLTSATQM R+RP++ GVSTLDIWPWKYLVPWMREDLF+RR+ RK N
Sbjct: 144  VGARFLVHPNLTSATQMASRLRPRNGGVSTLDIWPWKYLVPWMREDLFARRDARKLKGIN 203

Query: 903  DFKESNDPYNIQGIVMPLHEFINDPVAHEIIHNGATFQVAGLTNNSCVAEAHEVRQCVRK 1082
            K S DPYN++ IVMPL E+I DP+A +I+HNGATFQVAGLTNNS AE+HEVR CV+K
Sbjct: 204  HVK-SKDPYNMEEIVMPLREYITDPIARDIVHNGATFQVAGLTNNSYSAESHEVRHCVQK 262

Query: 1083 YRSLGGYVLDVAKKRLDDMLYVGLTEDHKGSATMFANVVGGQVLSQLHTLSSNMKKAAYN 1262
            Y LG VL VAKKRLD+MLYVGLTEDH+ SATMFA+VVG QV+SQ TL+S+M AA +
Sbjct: 263  YEILGELVLQVAKKRLDEMLYVGLTEDHRESATMFAHVVGAQVISQALTLNSSMDTAADS 322

Query: 1263 ETDMSSSLEDTDHDTDHLQNSTKDEKGKRLSSTGNAETINETMTVGDLMNSYEVCISSLR 1442
            +++ +SS+ D++ D+ MTV LM++YE CIS+LR
Sbjct: 323  KSEQTSSVSDSEPSDDN------------------------QMTVKKLMDAYEDCISNLR 358

Query: 1443 KAQARRRTASLKRVAPANFSKEARLQISEVVLQNIISLNSLDVELYKHAKYIFAQQGKRL 1622
            K QARRRT+SLKR+APANFSKE R ++ E++L+ I SLN+LD+ELYK+AK IFA+Q K
Sbjct: 359  KTQARRRTSSLKRIAPANFSKEDRRRVPEMILEQIRSLNNLDLELYKYAKDIFAKQHKHT 418

Query: 1623 TEKL 1634
            +KL
Sbjct: 419  VQKL 422

>ref|XP_002314278.1| predicted protein [Populus trichocarpa]
 gi|222850686|gb|EEE88233.1| predicted protein [Populus trichocarpa]
          Length = 447

 Score = 541 bits (1395), Expect = e-151
 Identities = 266/386 (68%), Positives = 315/386 (81%)
 Frame = +3

Query: 363  EDDFVKCESGVKKWAASSLDEQVKKDKHTLQDLLFFLHVPRTGGRTYFHCFLRKLYSTYL 542
            + DF CE VK WA SSL ++VK+DKHTL+DLLFFLHVPRTGGRTYFHCFL++LY+
Sbjct: 51   KSDFSHCEKVVKNWAFSSLQQRVKEDKHTLRDLLFFLHVPRTGGRTYFHCFLKRLYANAQ 110

Query: 543  ECPRSYDKLRFDPSKPNCRLMVTHDDYSMMSRLPEEKTSVVTILRKPVDRVLSTYEFSVE 722
            ECPRSYDKLRFDP K CRL+ THDDYSMMS+LP+EKTSVVTILR PVDR+ STYEFS+E
Sbjct: 111  ECPRSYDKLRFDPRKQECRLLATHDDYSMMSKLPKEKTSVVTILRNPVDRIFSTYEFSIE 170

Query: 723  VAARFLVHPNLTSATQMTGRIRPKSNGVSTLDIWPWKYLVPWMREDLFSRREGRKRGEFN 902
            VAARFLVHPNLTSAT+M GR+RP + GVSTLDIWPWKYLVPWMREDLF+RR+ RK
Sbjct: 171  VAARFLVHPNLTSATKMVGRLRPGATGVSTLDIWPWKYLVPWMREDLFARRDARKMMGSI 230

Query: 903  DFKESNDPYNIQGIVMPLHEFINDPVAHEIIHNGATFQVAGLTNNSCVAEAHEVRQCVRK 1082
            D K NDPYN++ +VMPL E+INDP AHE++HNG TFQVAGLTNNS AE+HEVR CV+K
Sbjct: 231  DIKR-NDPYNMEEMVMPLQEYINDPRAHELVHNGETFQVAGLTNNSYFAESHEVRCCVQK 289

Query: 1083 YRSLGGYVLDVAKKRLDDMLYVGLTEDHKGSATMFANVVGGQVLSQLHTLSSNMKKAAYN 1262
            ++ LG +VL+VAKKRLDDMLYVGLTEDH+ SATMFANVVG QV+SQ T +S+M+ AA +
Sbjct: 290  HKILGEHVLEVAKKRLDDMLYVGLTEDHRESATMFANVVGAQVISQALTENSSMESAANS 349

Query: 1263 ETDMSSSLEDTDHDTDHLQNSTKDEKGKRLSSTGNAETINETMTVGDLMNSYEVCISSLR 1442
            ++ SS ++ D D Q+ST D K + ST + E ETMTVG LM +YE CISSLR
Sbjct: 350  KSGQGSSHSESLPDNDDNQDSTSDHKADEIGSTEDLEEKKETMTVGKLMEAYEGCISSLR 409

Query: 1443 KAQARRRTASLKRVAPANFSKEARLQ 1520
            K Q+RRR +SLKR++PANFSKE+RLQ
Sbjct: 410  KTQSRRRKSSLKRISPANFSKESRLQ 435

>ref|XP_002436516.1| hypothetical protein SORBIDRAFT_10g004010 [Sorghum bicolor]
 gi|241914739|gb|EER87883.1| hypothetical protein SORBIDRAFT_10g004010 [Sorghum bicolor]
          Length = 515

 Score = 509 bits (1312), Expect = e-142
 Identities = 263/420 (62%), Positives = 320/420 (76%), Gaps = 2/420 (0%)
 Frame = +3

Query: 357  SSEDDFVKCESGVKKWAASSLDEQVKKDKHTLQDLLFFLHVPRTGGRTYFHCFLRKLYST 536
            SS+D + CE V+ WA SS + DK +L+DLLFFLH+PRTGGRTYFHCFL+KLY+
Sbjct: 30   SSDDGYKHCEGVVRGWADSSTGREKDGDKLSLKDLLFFLHIPRTGGRTYFHCFLKKLYTN 89

Query: 537  YLECPRSYDKLRFDPSKPNCRLMVTHDDYSMMSRLPEEKTSVVTILRKPVDRVLSTYEFS 716
            ECPRSYDKLRFDPS P+C+L+V+HDDYS+ S+LP E+TSVVTILR PVDRV STYEFS
Sbjct: 90   AQECPRSYDKLRFDPSHPDCKLVVSHDDYSLTSKLPRERTSVVTILRNPVDRVFSTYEFS 149

Query: 717  VEVAARFLVHPNLTSATQMTGRIRPKSNGVSTLDIWPWKYLVPWMREDLFSRREGRKRGE 896
            VEVAARFLVHPNLTSA MT R+ KS VSTLDIWPWKYLVPWMREDLF+RR+ R +
Sbjct: 150  VEVAARFLVHPNLTSAKLMTTRVLTKSRAVSTLDIWPWKYLVPWMREDLFARRDARGGDK 209

Query: 897  FNDFKESNDPYNIQGIVMPLHEFINDPVAHEIIHNGATFQVAGLTNNSCVAEAHEVRQCV 1076
            + K+ N Y+++ +VMPLH++INDPVAHEIIHNGATFQ+ GLTNNS A EVR CV
Sbjct: 210  VHSSKKVN-AYDVEDMVMPLHQYINDPVAHEIIHNGATFQITGLTNNSYFDGAPEVRHCV 268

Query: 1077 RKYRSLGGYVLDVAKKRLDDMLYVGLTEDHKGSATMFANVVGGQVLSQLHTLSSNMKKAA 1256
            RK+ LG VL+VAK RLD MLYVGLTEDH+ SA +FA++VG QVLSQ TL+ ++K+
Sbjct: 269  RKHPDLGRIVLEVAKNRLDQMLYVGLTEDHEESARLFAHMVGAQVLSQSGTLNLDLKEDV 328

Query: 1257 YNETDMSSSL-EDTDHDT-DHLQNSTKDEKGKRLSSTGNAETINETMTVGDLMNSYEVCI 1430
            +ETD SS+ E D +T DHL ++ + + L+ST N E MTVG LM +YE CI
Sbjct: 329  PSETDSHSSMVEPEDEETNDHLNSTHGWQNNEALNST-NDEHGKGNMTVGKLMEAYETCI 387

Query: 1431 SSLRKAQARRRTASLKRVAPANFSKEARLQISEVVLQNIISLNSLDVELYKHAKYIFAQQ 1610
            + LRK+Q+ RR SLK+V ANFSKEAR + E +L+ IISLNSLD+ELY HAK IF Q+
Sbjct: 388  AKLRKSQSGRRKISLKKVQEANFSKEARKLVPEAILKQIISLNSLDMELYDHAKKIFTQE 447