BLASTX nr result
ID: Bupleurum21_contig00032896
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00032896
         (645 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003543991.1| PREDICTED: uncharacterized protein LOC100811...   163   3e-38
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   156   4e-38
ref|XP_003533176.1| PREDICTED: uncharacterized protein LOC100777...   159   6e-37
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   158   9e-37
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   147   3e-35

>ref|XP_003543991.1| PREDICTED: uncharacterized protein LOC100811508 [Glycine max]
          Length = 1441

 Score =  163 bits (412), Expect = 3e-38
 Identities = 81/191 (42%), Positives = 117/191 (61%)
 Frame = +2

Query: 41   MDPLRVFIIAMEVLSTFLNASSRKPGFKFHASCSSVKLSHIIFADDVLLFSYGDSNSISA 220
            M P+ +F+I ME L   L    + P F  H+ C  + L+++ FADDVLLF  GDS S+S
Sbjct: 971  MSPM-LFVIIMEYLHRTLVKMQQNPDFNHHSKCEKIGLTNLTFADDVLLFCRGDSKSVSM 1029

Query: 221  LLEGVKGFSEISGLQLNPQKCSIFFGGIXXXXXXXXXXXFGMNLGCLPITYLGLPLITSR 400
            ++E ++ FS+ +GL++NP KC +FFGG+                G LP+ YLG+PL   R
Sbjct: 1030 MMETIRKFSDSTGLKVNPAKCQMFFGGMDGCSKENLRRITDFAEGKLPVRYLGVPLSCKR 1089

Query: 401  LNSLHCAPLVAKLCQRISCWTGRFLSFAGRLQLLKSILTSIAGYWSMYIFLPKFVLKKIK 580
            L      PL+ K+  R+  WT + LS+AGR+QL+KSI ++IA YW     LP+FVL+KI
Sbjct: 1090 LTIQQYMPLIDKIVDRVKHWTSKLLSYAGRIQLVKSITSAIAMYWMQCFPLPQFVLRKIN 1149

Query: 581  ALFGKFLWSGK 613
            A+   F+W+GK
Sbjct: 1150 AICRSFVWTGK 1160

>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1352

 Score =  156 bits (395), Expect(2) = 4e-38
 Identities = 78/185 (42%), Positives = 114/185 (61%)
 Frame = +2

Query: 56  VFIIAMEVLSTFLNASSRKPGFKFHASCSSVKLSHIIFADDVLLFSYGDSNSISALLEGV 235
           +++I M VLS  L+ ++ +    +H  C ++ L+H+ FADD+++FS G S SI   L
Sbjct: 813 LYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLCFADDIMVFSDGTSKSIQGTLAIF 872

Query: 236 KGFSEISGLQLNPQKCSIFFGGIXXXXXXXXXXXFGMNLGCLPITYLGLPLITSRLNSLH 415
           + F+ +S L+++ +K +IF  GI           F   LG LP+ YLGLPL+T R+
Sbjct: 873 EKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSD 932

Query: 416 CAPLVAKLCQRISCWTGRFLSFAGRLQLLKSILTSIAGYWSMYIFLPKFVLKKIKALFGK 595
             PLV K+  RI+ WT RFLSFAGRLQL+KS+L+SI  +W     LPK  L++I+ +F
Sbjct: 933 YLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSA 992

Query: 596 FLWSG 610
           FLWSG
Sbjct: 993 FLWSG 997

 Score = 27.3 bits (59), Expect(2) = 4e-38
 Identities = 9/19 (47%), Positives = 14/19 (73%)
 Frame = +1

Query: 7   CISSCMYSIKVNGSLEGFY 63
           CI +  +S++VNG L GF+
Sbjct: 779 CIGTASFSVQVNGELSGFF 797

>ref|XP_003533176.1| PREDICTED: uncharacterized protein LOC100777167 [Glycine max]
          Length = 1324

 Score =  159 bits (401), Expect = 6e-37
 Identities = 79/191 (41%), Positives = 115/191 (60%)
 Frame = +2

Query: 41   MDPLRVFIIAMEVLSTFLNASSRKPGFKFHASCSSVKLSHIIFADDVLLFSYGDSNSISA 220
            M P+ +F+I ME +   L    + P F  H+ C  + L+++ FADDVLLF  GDS S+S
Sbjct: 854  MSPM-LFVIIMEYMHRTLVKMQQNPDFNHHSKCEKIGLTNLTFADDVLLFCRGDSKSVSM 912

Query: 221  LLEGVKGFSEISGLQLNPQKCSIFFGGIXXXXXXXXXXXFGMNLGCLPITYLGLPLITSR 400
            ++E ++ FS+ +GL++NP KC +FFGG+                G LP+ YLG+PL   R
Sbjct: 913  MMETIRKFSDSTGLKVNPAKCQMFFGGMDGCSKENLRRITDFAEGKLPVRYLGMPLSCKR 972

Query: 401  LNSLHCAPLVAKLCQRISCWTGRFLSFAGRLQLLKSILTSIAGYWSMYIFLPKFVLKKIK 580
            L      PL+ K+  R+  WT + LS AGR+QL+KSI ++IA YW     LP+FVL+KI
Sbjct: 973  LTIQQYMPLIDKIVDRVKHWTSKLLSHAGRIQLVKSITSAIAMYWMQCFPLPQFVLRKIN 1032

Query: 581  ALFGKFLWSGK 613
             +   F+W+GK
Sbjct: 1033 DICRSFVWTGK 1043

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  158 bits (399), Expect = 9e-37
 Identities = 78/194 (40%), Positives = 114/194 (58%), Gaps = 2/194 (1%)
 Frame = +2

Query: 44  DPLRVFIIA--MEVLSTFLNASSRKPGFKFHASCSSVKLSHIIFADDVLLFSYGDSNSIS 217
           DPL  F+ A  ME LS  +    + P F FH  C  +KL+H++FADD+L+F+  D++SIS
Sbjct: 648 DPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSIS 707

Query: 218 ALLEGVKGFSEISGLQLNPQKCSIFFGGIXXXXXXXXXXXFGMNLGCLPITYLGLPLITS 397
            ++     FS+ SGLQ + +K  I+FGG+             M +G LP  YLG+PL +
Sbjct: 708 KIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASK 767

Query: 398 RLNSLHCAPLVAKLCQRISCWTGRFLSFAGRLQLLKSILTSIAGYWSMYIFLPKFVLKKI 577
           +LN   C PL+ K+  R   W    LS+AGRLQL+K+IL S+  YW     LPK ++K +
Sbjct: 768 KLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAV 827

Query: 578 KALFGKFLWSGKLN 619
           +    KFLW+G ++
Sbjct: 828 ETTCRKFLWTGTVD 841

>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  147 bits (372), Expect(2) = 3e-35
 Identities = 75/196 (38%), Positives = 114/196 (58%)
 Frame = +2

Query: 56  VFIIAMEVLSTFLNASSRKPGFKFHASCSSVKLSHIIFADDVLLFSYGDSNSISALLEGV 235
           +F+I+MEVLS  L+ ++    F FH  C ++ L+H+ FADD+++ + G   S+  ++E +
Sbjct: 71  LFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHLCFADDLMILTDGKVRSVDGIVEVM 130

Query: 236 KGFSEISGLQLNPQKCSIFFGGIXXXXXXXXXXXFGMNLGCLPITYLGLPLITSRLNSLH 415
             F++ SGLQ+N +K +++  G+           +   LG LP+ YLGLPL+T RL
Sbjct: 131 NLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPFGLGQLPVRYLGLPLVTKRLTKED 190

Query: 416 CAPLVAKLCQRISCWTGRFLSFAGRLQLLKSILTSIAGYWSMYIFLPKFVLKKIKALFGK 595
            +PL  ++  RI  WT R+LSFAGRL L+ S+L S   +W     LP   LK+I ++
Sbjct: 191 LSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSA 250

Query: 596 FLWSGKLNARAHYKVS 643
           FLWSG    R   KVS
Sbjct: 251 FLWSGPELHRRKAKVS 266

 Score = 26.6 bits (57), Expect(2) = 3e-35
 Identities = 8/20 (40%), Positives = 16/20 (80%)
 Frame = +1

Query: 4  KCISSCMYSIKVNGSLEGFY 63
          +CI++  +S++VNG L G++
Sbjct: 36 RCITTTSFSVQVNGELAGYF 55