BLASTX nr result
ID: Angelica23_contig00022685
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00022685 (1184 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] 343 4e-92 gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi... 342 1e-91 gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal... 342 1e-91 gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi... 339 7e-91 emb|CAB75469.1| copia-type reverse transcriptase-like protein [A... 339 9e-91 >emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] Length = 1352 Score = 343 bits (881), Expect = 4e-92 Identities = 174/399 (43%), Positives = 255/399 (63%), Gaps = 5/399 (1%) Frame = +3 Query: 3 VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182 VR+MEK+LRSL KF+++VT IEE+KDL ++I++L+GSLQA+E++ + +D + + Sbjct: 155 VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIAEQVLNM 214 Query: 183 QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359 QI K G++ N Q+ E Sbjct: 215 QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274 Query: 360 XXXXXXXCDKSQFQCYNCNKYGHFSYECRSP---KVKERSYFATAK-EDKDIGSAMFLTY 527 DKS +CYNC K+GH++ EC++P K +E++++ K +++D+ + +Y Sbjct: 275 --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKAHYVEEKIQEEDM--LLMASY 324 Query: 528 KGDEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSK 707 K DE+ + + WYLDSGASNHMCG K +F ELD+++ G V GD SK+ VKGKG + I K Sbjct: 325 KKDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLK 384 Query: 708 KGDKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRL 887 GD ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q LI V MSKNR+ Sbjct: 385 NGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRM 444 Query: 888 FTLDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEA 1067 F L+++ + +CLK K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P +CE Sbjct: 445 FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEG 504 Query: 1068 CVKGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184 C+ GKQ + SFP S RA++PLE++HTD+ GP SL Sbjct: 505 CLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543 >gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana] gi|12321387|gb|AAG50765.1|AC079131_10 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1320 Score = 342 bits (877), Expect = 1e-91 Identities = 176/397 (44%), Positives = 248/397 (62%), Gaps = 3/397 (0%) Frame = +3 Query: 3 VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182 VR+MEK+LRSL KF+++VT IEE+KDL ++I++L+GSLQA+E++ + +D + Sbjct: 155 VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214 Query: 183 QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359 QI K G++ N Q+ E Sbjct: 215 QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274 Query: 360 XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533 DKS +CYNC K+GH++ EC++P K+ A E+K M L +YK Sbjct: 275 --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326 Query: 534 DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713 DE+ + + WYLDSGASNHMCG K +F ELD+++ G V GD SK+ VKGKG + I K G Sbjct: 327 DEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386 Query: 714 DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893 D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q LI V MSKNR+F Sbjct: 387 DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446 Query: 894 LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073 L+++ + +CLK K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P +CE C+ Sbjct: 447 LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506 Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184 GKQ + SFP S RA++PLE++HTD+ GP SL Sbjct: 507 LGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543 >gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana] Length = 1352 Score = 342 bits (877), Expect = 1e-91 Identities = 176/397 (44%), Positives = 248/397 (62%), Gaps = 3/397 (0%) Frame = +3 Query: 3 VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182 VR+MEK+LRSL KF+++VT IEE+KDL ++I++L+GSLQA+E++ + +D + Sbjct: 155 VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214 Query: 183 QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359 QI K G++ N Q+ E Sbjct: 215 QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274 Query: 360 XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533 DKS +CYNC K+GH++ EC++P K+ A E+K M L +YK Sbjct: 275 --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326 Query: 534 DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713 DE+ + + WYLDSGASNHMCG K +F ELD+++ G V GD SK+ VKGKG + I K G Sbjct: 327 DEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386 Query: 714 DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893 D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q LI V MSKNR+F Sbjct: 387 DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446 Query: 894 LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073 L+++ + +CLK K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P +CE C+ Sbjct: 447 LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506 Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184 GKQ + SFP S RA++PLE++HTD+ GP SL Sbjct: 507 LGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543 >gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1352 Score = 339 bits (870), Expect = 7e-91 Identities = 175/397 (44%), Positives = 251/397 (63%), Gaps = 3/397 (0%) Frame = +3 Query: 3 VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182 VR+MEK+LRSL KF+++VT IEE+KDL ++I++L+GSLQA+E++ + +D +E+ L Sbjct: 155 VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDI--IEQVL 212 Query: 183 QIKVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQK-NEGYXXXXXXXXXXXXX 359 ++++ EN RG + +E Sbjct: 213 NMQITKEENGQSYQRRGGGQVRGRGRGGYGN------GRGWRPHEDNTNQRGENSSRGRG 266 Query: 360 XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533 DKS +CYNC K+GH++ EC++P K+ A E+K M L +YK Sbjct: 267 KGHPKSRYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326 Query: 534 DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713 DE+ + + WYLDSGASNHMCG K +F ELD+++ G V GD SK+ VKGKG + I K G Sbjct: 327 DEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386 Query: 714 DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893 D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q LI V MSKNR+F Sbjct: 387 DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446 Query: 894 LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073 L+++ + +CLK K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P +CE C+ Sbjct: 447 LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506 Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184 GKQ + SFP S RA++ LE++HTD+ GP SL Sbjct: 507 LGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSL 543 >emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1272 Score = 339 bits (869), Expect = 9e-91 Identities = 173/399 (43%), Positives = 252/399 (63%), Gaps = 5/399 (1%) Frame = +3 Query: 3 VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182 VR+MEK+LRSL KF+++VT IEE+KDL ++I++L+GSLQA+E++ + +D + Sbjct: 155 VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214 Query: 183 QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359 QI K G++ N Q+ E Sbjct: 215 QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274 Query: 360 XXXXXXXCDKSQFQCYNCNKYGHFSYECRSP---KVKERSYFATAK-EDKDIGSAMFLTY 527 DKS +CYNC K+GH++ EC++P K KE++ + K +++D+ + +Y Sbjct: 275 --------DKSSVKCYNCGKFGHYASECKAPSNKKFKEKANYVEEKIQEEDM--LLMASY 324 Query: 528 KGDEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSK 707 K DE+ + + WYLDSGASNHMCG K +F ELD+++ G V GD SK+ VKGKG + I K Sbjct: 325 KKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLK 384 Query: 708 KGDKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRL 887 GD ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR++ LI V MSKNR+ Sbjct: 385 NGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNLITKVPMSKNRM 444 Query: 888 FTLDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEA 1067 F L+++ + +CLK K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P +CE Sbjct: 445 FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEG 504 Query: 1068 CVKGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184 C+ G Q + SFP S RA++PLE++HTD+ GP SL Sbjct: 505 CLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543