BLASTX nr result
ID: Alisma22_contig00040905
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00040905
         (375 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_009117857.1 PREDICTED: uncharacterized protein LOC103842926 [...    88   5e-18
XP_019089249.1 PREDICTED: uncharacterized protein LOC109128031 [...    82   6e-16
XP_010451841.1 PREDICTED: uncharacterized protein LOC104734030 [...    82   6e-16
XP_010490261.1 PREDICTED: uncharacterized protein LOC104768010 [...    82   8e-16
AAC61290.1 putative retroelement pol polyprotein [Arabidopsis th...    81   3e-15
XP_013738037.1 PREDICTED: uncharacterized protein LOC106440823 [...    80   3e-15
JAU70191.1 hypothetical protein LE_TR14648_c0_g1_i1_g.46319 [Noc...    80   3e-15
OAP09371.1 hypothetical protein AXX17_AT2G09080 [Arabidopsis tha...    80   4e-15
XP_010462983.1 PREDICTED: uncharacterized protein LOC104743624 [...    79   1e-14
XP_018514215.1 PREDICTED: uncharacterized protein LOC108871752 [...    78   2e-14
XP_019085935.1 PREDICTED: uncharacterized protein LOC104710419 [...    77   2e-14
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...    78   2e-14
XP_010468183.1 PREDICTED: uncharacterized protein LOC104748204 [...    77   3e-14
XP_019084173.1 PREDICTED: uncharacterized protein LOC109125876 [...    75   3e-14
XP_019085409.1 PREDICTED: uncharacterized protein LOC109126354 [...    77   3e-14
XP_018489922.1 PREDICTED: uncharacterized protein LOC108860553 i...    76   4e-14
XP_018489919.1 PREDICTED: uncharacterized protein LOC108860553 i...    76   5e-14
XP_010468182.1 PREDICTED: uncharacterized protein LOC104748203 [...    77   5e-14
XP_013722277.1 PREDICTED: uncharacterized protein LOC106426115 [...    77   6e-14
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]               77   6e-14

>XP_009117857.1 PREDICTED: uncharacterized protein LOC103842926 [Brassica rapa]
Length = 390
Score = 87.8 bits (216), Expect = 5e-18
Identities = 51/114 (44%), Positives = 66/114 (57%), Gaps = 3/114 (2%)
Frame = -3

Query: 334 NTQDRGFSQ---SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAP 164
+TQ RGF Q SPSG F S N+ PL QI K GH AL+CWHRFDN+YQ + P
Sbjct: 256 STQGRGFHQHVSSPSG--SFTSSASENR--PLCQICGKLGHNALRCWHRFDNSYQLDDLP 311

Query: 163 SNLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
AAL ALRI + ++W DSGA+ H+ ++ L + Y G+DS+M
Sbjct: 312 --------AALTALRITDVTGHEWFPDSGASSHVTNSPHHLQQAQVYNGSDSVM 357

>XP_019089249.1 PREDICTED: uncharacterized protein LOC109128031 [Camelina sativa]
Length = 374
Score = 82.0 bits (201), Expect = 6e-16
Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 4/118 (3%)
Frame = -3

Query: 343 SQLNTQDRGF----SQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQF 176
+ T+ RGF SQ P G N+ + QI K GH A KCWHRFDN+YQF
Sbjct: 258 NNFTTKGRGFHQQISQEPGGTNKV-----------ICQICGKPGHPASKCWHRFDNSYQF 306

Query: 175 SNAPSNLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
N P L AAL + + + N+W +D GAT H+ ++ L + Y+GN+S+M
Sbjct: 307 DNVPQVL-----AALRVTDVTDHNGNEWVLDFGATTHVTNSPHHLQQAQVYEGNESVM 359

>XP_010451841.1 PREDICTED: uncharacterized protein LOC104734030 [Camelina sativa]
Length = 374
Score = 82.0 bits (201), Expect = 6e-16
Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 4/118 (3%)
Frame = -3

Query: 343 SQLNTQDRGF----SQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQF 176
+ T+ RGF SQ P G N+ + QI K GH A KCWHRFDN+YQF
Sbjct: 258 NNFTTKGRGFHQQISQEPGGTNKV-----------ICQICGKPGHPASKCWHRFDNSYQF 306

Query: 175 SNAPSNLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
N P L AAL + + + N+W +D GAT H+ ++ L + Y+GN+S+M
Sbjct: 307 DNVPQVL-----AALRVTDVTDHNGNEWVLDFGATTHVTNSPHHLQQAQVYEGNESVM 359

>XP_010490261.1 PREDICTED: uncharacterized protein LOC104768010 [Camelina sativa]
Length = 437
Score = 82.0 bits (201), Expect = 8e-16
Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 1/112 (0%)
Frame = -3

Query: 334 NTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNL 155
+T+ RGF Q S N + + D P+ QIY KRGH A+ CW+RFD Y N
Sbjct: 255 STRGRGFHQQFSSANP----STVSSDKPVCQIYSKRGHNAIDCWYRFDEEYSKPN----- 305

Query: 154 SENTNAALAALRINE-ADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
N +A +AL I++ D N W DSGAT HI + T +L ++ Y GND +M
Sbjct: 306 --NVASAFSALHISDVTDDNGWYPDSGATAHITNTTQRLQKVQPYFGNDVVM 355

>AAC61290.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1149
Score = 80.9 bits (198), Expect = 3e-15
Identities = 46/112 (41%), Positives = 67/112 (59%), Gaps = 1/112 (0%)
Frame = -3

Query: 334 NTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNL 155
+T+ RGF Q S +S + + P+ QI KRGH AL+CWHRFD++YQ S A +
Sbjct: 244 STRGRGFQQQFSS----SSSSVSASEKPMCQICGKRGHYALQCWHRFDDSYQHSEAAA-- 297

Query: 154 SENTNAALAALRINE-ADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
AA +AL I + +D + W DS AT HI +N+ +L ++ Y GND++M
Sbjct: 298 -----AAFSALHITDVSDDSGWVPDSAATAHITNNSSRLQQMQPYLGNDTVM 344

>XP_013738037.1 PREDICTED: uncharacterized protein LOC106440823 [Brassica napus]
Length = 410
Score = 80.5 bits (197), Expect = 3e-15
Identities = 43/111 (38%), Positives = 62/111 (55%)
Frame = -3

Query: 334 NTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNL 155
+++ RGF Q Q N + P+ QI + GHTAL+CW+RFD NYQ N P
Sbjct: 270 SSRGRGFHQQSVSTGQNNHTTSATQR-PICQICGRMGHTALRCWNRFDTNYQNDNLPQ-- 326

Query: 154 SENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALAAL+++E +W DSGAT H+ T L+++ Y G+++IM
Sbjct: 327 ------ALAALQVSETSGQEWYPDSGATAHVTSTTAGLNSLTPYNGSETIM 371

>JAU70191.1 hypothetical protein LE_TR14648_c0_g1_i1_g.46319 [Noccaea caerulescens]
Length = 395
Score = 80.1 bits (196), Expect = 3e-15
Identities = 46/113 (40%), Positives = 64/113 (56%), Gaps = 2/113 (1%)
Frame = -3

Query: 334 NTQDRGFSQ--SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPS 161
+T+ RGFSQ +P+G NQ S + + P+ QI + GHTALKCW+ FD+ YQ + P
Sbjct: 261 STRGRGFSQQVNPAGWNQSLSSDGNQNNRPMCQICGRMGHTALKCWNMFDHAYQSDDVPK 320

Query: 160 NLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALAAL I++ +W DSGAT HI + L N Y G+D ++
Sbjct: 321 --------ALAALHISDDSGMEWYPDSGATAHITASASSLQNPTPYHGSDMVL 365

>OAP09371.1 hypothetical protein AXX17_AT2G09080 [Arabidopsis thaliana]
Length = 2795
Score = 80.5 bits (197), Expect = 4e-15
Identities = 47/119 (39%), Positives = 67/119 (56%), Gaps = 4/119 (3%)
Frame = -3

Query: 346 SSQLNTQDRGFSQSPSGVNQFNSLNYYNKD-LPLFQIYQKRGHTALKCWHRFDNNYQFSN 170
S +T+ RGF Q S + +S +Y + D L QI K GH ALKCWHRF+N+YQ+
Sbjct: 1875 SGNYSTKGRGFPQQISSSSSSSSGSYNSTDNRVLCQICGKPGHPALKCWHRFNNSYQYEE 1934

Query: 169 APSNLSENTNAALAALRINEA---DFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
P AAL A+RI + + N+W DSGAT H+ ++ L + Y G+D++M
Sbjct: 1935 LP--------AALTAMRITDVTDHNGNEWVGDSGATTHVTNSPHNLQQSQPYGGSDAVM 1985

>XP_010462983.1 PREDICTED: uncharacterized protein LOC104743624 [Camelina sativa]
Length = 473
Score = 78.6 bits (192), Expect = 1e-14
Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
Frame = -3

Query: 334 NTQDRGFSQ--SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPS 161
+T+ RGFSQ + SG NQ S N N P+ QI + GH ALKCW+RFD +YQ + P
Sbjct: 251 STRGRGFSQQVNSSGWNQNQSGNSANPR-PVCQICGRTGHVALKCWNRFDASYQSDDVPQ 309

Query: 160 NLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALA L+++++ +W DSGAT HI T L ++ Y G ++++
Sbjct: 310 --------ALATLQVSDSSGREWLTDSGATAHITPTTDSLQSVTPYNGAENVI 354

>XP_018514215.1 PREDICTED: uncharacterized protein LOC108871752 [Brassica rapa]
Length = 392
Score = 78.2 bits (191), Expect = 2e-14
Identities = 46/113 (40%), Positives = 66/113 (58%), Gaps = 2/113 (1%)
Frame = -3

Query: 334 NTQDRGFSQ--SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPS 161
+T+ RGFSQ + SG NQ S + + P QI + GHTALKCW+RFDN YQ ++ P
Sbjct: 261 STRGRGFSQQVNTSGWNQALS---GDSNRPSCQICGRPGHTALKCWNRFDNAYQSTDIPK 317

Query: 160 NLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALAA+++++A ++W DS AT HI L N+ Y G +S++
Sbjct: 318 --------ALAAIQVSDATGHEWYPDSAATAHITSAASSLQNVSSYHGPESVL 362

>XP_019085935.1 PREDICTED: uncharacterized protein LOC104710419 [Camelina sativa]
Length = 331
Score = 77.4 bits (189), Expect = 2e-14
Identities = 43/106 (40%), Positives = 60/106 (56%)
Frame = -3

Query: 319 GFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNLSENTN 140
G+S G Q S + P+ QI + GHTA+KC++RFDNNYQ SE T
Sbjct: 206 GYSSRGRGFPQHQSSCSSQGERPVCQICGRIGHTAIKCYNRFDNNYQ--------SEATT 257

Query: 139 AALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
A ++LR+++ +W DSGAT H+ +T LH YKGND++M
Sbjct: 258 QAFSSLRVSDDSGREWHPDSGATAHVTSSTSGLH-ATAYKGNDTVM 302

>OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana]
Length = 2099
Score = 78.2 bits (191), Expect = 2e-14
Identities = 47/118 (39%), Positives = 64/118 (54%), Gaps = 3/118 (2%)
Frame = -3

Query: 346 SSQLNTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNA 167
S +T+ RGF Q S + N N+ + QI K GH ALKCWHRF+N+YQ+
Sbjct: 369 SGNYSTKGRGFPQQISSSTSGSYNNTENR--VVCQICGKPGHPALKCWHRFNNSYQYEEL 426

Query: 166 PSNLSENTNAALAALRINEA---DFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
P AAL A+RI + + N W DSGAT H+ ++T L + Y G+DS+M
Sbjct: 427 P--------AALTAMRITDVTDHNGNKWVGDSGATAHVTNSTHNLQQSQPYGGSDSVM 476

>XP_010468183.1 PREDICTED: uncharacterized protein LOC104748204 [Camelina sativa]
Length = 284
Score = 76.6 bits (187), Expect = 3e-14
Identities = 42/106 (39%), Positives = 60/106 (56%)
Frame = -3

Query: 319 GFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNLSENTN 140
G+S G Q S + + P+ QI + GHTALKC++RFDNNYQ
Sbjct: 180 GYSTRGRGFTQHQSSSPSSGQRPVCQICGRTGHTALKCYNRFDNNYQ---------AEAV 230

Query: 139 AALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
A ++LR+ +D N+W DSGAT H+ +T LH+ Y+GND++M
Sbjct: 231 QAFSSLRV--SDGNEWHPDSGATAHVTPSTDNLHSATTYEGNDTVM 274

>XP_019084173.1 PREDICTED: uncharacterized protein LOC109125876 [Camelina sativa]
Length = 221
Score = 75.5 bits (184), Expect = 3e-14
Identities = 43/113 (38%), Positives = 64/113 (56%), Gaps = 2/113 (1%)
Frame = -3

Query: 334 NTQDRGFSQ--SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPS 161
+T+ RGFSQ + SG NQ S N P+ QI + GH ALKCW+RFDN YQ + P
Sbjct: 88 STRGRGFSQQVNNSGWNQSQS-GVSNNIRPVCQICGRVGHVALKCWNRFDNTYQSDDVPQ 146

Query: 160 NLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALAAL+++++ +W DSG++ H+ +L + Y G +++M
Sbjct: 147 --------ALAALQVSDSCGREWVTDSGSSAHVTSTPNQLSAVTPYNGPETVM 191

>XP_019085409.1 PREDICTED: uncharacterized protein LOC109126354 [Camelina sativa]
Length = 440
Score = 77.4 bits (189), Expect = 3e-14
Identities = 39/84 (46%), Positives = 49/84 (58%)
Frame = -3

Query: 253 PLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNLSENTNAALAALRINEADFNDWCVDSGA 74
P QI K GHTA KCW+RFDNNYQ EN LAAL+++++ DW DSGA
Sbjct: 273 PTCQICCKVGHTAAKCWNRFDNNYQ--------GENLAQVLAALQVSDSSGRDWIPDSGA 324

Query: 73 TDHIIHNTGKLHNIRFYKGNDSIM 2
T H+ L ++ Y+G DSIM
Sbjct: 325 TSHVTTTEAALQHVTPYQGTDSIM 348

>XP_018489922.1 PREDICTED: uncharacterized protein LOC108860553 isoform X2 [Raphanus sativus]
Length = 301
Score = 76.3 bits (186), Expect = 4e-14
Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 3/114 (2%)
Frame = -3

Query: 334 NTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNL 155
+TQ RGF Q S + S N+ P QI K GH A KC+ RFD++YQ
Sbjct: 168 STQGRGFPQQISQSSGRGSSAPDNR--PTCQICNKFGHPAYKCYKRFDHSYQ-------- 217

Query: 154 SENTNAALAALRINEADFN---DWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
++ ++A+AA+R + +N +WC DSGAT HI +N L +++ Y GND+++
Sbjct: 218 ADEYHSAMAAMRAQDPPYNAGNEWCADSGATAHITNNPSHLQSVQAYSGNDTVL 271

>XP_018489919.1 PREDICTED: uncharacterized protein LOC108860553 isoform X1 [Raphanus sativus]
Length = 304
Score = 76.3 bits (186), Expect = 5e-14
Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 3/114 (2%)
Frame = -3

Query: 334 NTQDRGFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNL 155
+TQ RGF Q S + S N+ P QI K GH A KC+ RFD++YQ
Sbjct: 168 STQGRGFPQQISQSSGRGSSAPDNR--PTCQICNKFGHPAYKCYKRFDHSYQ-------- 217

Query: 154 SENTNAALAALRINEADFN---DWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
++ ++A+AA+R + +N +WC DSGAT HI +N L +++ Y GND+++
Sbjct: 218 ADEYHSAMAAMRAQDPPYNAGNEWCADSGATAHITNNPSHLQSVQAYSGNDTVL 271

>XP_010468182.1 PREDICTED: uncharacterized protein LOC104748203 [Camelina sativa]
Length = 360
Score = 76.6 bits (187), Expect = 5e-14
Identities = 42/106 (39%), Positives = 60/106 (56%)
Frame = -3

Query: 319 GFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNLSENTN 140
G+S G Q S + + P+ QI + GHTALKC++RFDNNYQ
Sbjct: 256 GYSTRGRGFTQHQSSSPSSGQRPVCQICGRTGHTALKCYNRFDNNYQ---------AEAV 306

Query: 139 AALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
A ++LR+ +D N+W DSGAT H+ +T LH+ Y+GND++M
Sbjct: 307 QAFSSLRV--SDGNEWHPDSGATAHVTPSTDNLHSATTYEGNDTVM 350

>XP_013722277.1 PREDICTED: uncharacterized protein LOC106426115 [Brassica napus]
Length = 1107
Score = 77.0 bits (188), Expect = 6e-14
Identities = 43/113 (38%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
Frame = -3

Query: 334 NTQDRGFSQ--SPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPS 161
+T+ RGFSQ + SG NQ S ++ + P+ QI + GH+ALKCW+RFDN YQ + P
Sbjct: 312 STRGRGFSQQVNSSGWNQSQS---HSDNRPVCQICGRTGHSALKCWNRFDNAYQSDDIPK 368

Query: 160 NLSENTNAALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
ALAA ++++ +W DSGAT HI L N+ Y +++++
Sbjct: 369 --------ALAAFQMSDESGKEWLPDSGATAHITSTPSTLQNVALYHVSETVL 413

>CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 77.0 bits (188), Expect = 6e-14
Identities = 41/106 (38%), Positives = 60/106 (56%)
Frame = -3

Query: 319 GFSQSPSGVNQFNSLNYYNKDLPLFQIYQKRGHTALKCWHRFDNNYQFSNAPSNLSENTN 140
G+S G +Q S + + P+ QI + GHTA+KC++RFDNNYQ SE
Sbjct: 257 GYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNYQ--------SEVPT 308

Query: 139 AALAALRINEADFNDWCVDSGATDHIIHNTGKLHNIRFYKGNDSIM 2
A +ALR+++ +W DS AT HI +T L N Y+GND+++
Sbjct: 309 QAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVL 354