BLASTX nr result

ID: Angelica23_contig00022685 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00022685
         (1184 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         343   4e-92
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   342   1e-91
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   342   1e-91
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   339   7e-91
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   339   9e-91

>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  343 bits (881), Expect = 4e-92
 Identities = 174/399 (43%), Positives = 255/399 (63%), Gaps = 5/399 (1%)
 Frame = +3

Query: 3    VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182
            VR+MEK+LRSL  KF+++VT IEE+KDL  ++I++L+GSLQA+E++  + +D +     +
Sbjct: 155  VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIAEQVLNM 214

Query: 183  QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359
            QI K   G++                            N  Q+ E               
Sbjct: 215  QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274

Query: 360  XXXXXXXCDKSQFQCYNCNKYGHFSYECRSP---KVKERSYFATAK-EDKDIGSAMFLTY 527
                    DKS  +CYNC K+GH++ EC++P   K +E++++   K +++D+   +  +Y
Sbjct: 275  --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKAHYVEEKIQEEDM--LLMASY 324

Query: 528  KGDEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSK 707
            K DE+ + + WYLDSGASNHMCG K +F ELD+++ G V  GD SK+ VKGKG + I  K
Sbjct: 325  KKDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLK 384

Query: 708  KGDKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRL 887
             GD ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q   LI  V MSKNR+
Sbjct: 385  NGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRM 444

Query: 888  FTLDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEA 1067
            F L+++  + +CLK   K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P  +CE 
Sbjct: 445  FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEG 504

Query: 1068 CVKGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184
            C+ GKQ + SFP   S RA++PLE++HTD+ GP    SL
Sbjct: 505  CLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  342 bits (877), Expect = 1e-91
 Identities = 176/397 (44%), Positives = 248/397 (62%), Gaps = 3/397 (0%)
 Frame = +3

Query: 3    VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182
            VR+MEK+LRSL  KF+++VT IEE+KDL  ++I++L+GSLQA+E++  + +D       +
Sbjct: 155  VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214

Query: 183  QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359
            QI K   G++                            N  Q+ E               
Sbjct: 215  QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274

Query: 360  XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533
                    DKS  +CYNC K+GH++ EC++P  K+    A   E+K     M L  +YK 
Sbjct: 275  --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326

Query: 534  DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713
            DE+ + + WYLDSGASNHMCG K +F ELD+++ G V  GD SK+ VKGKG + I  K G
Sbjct: 327  DEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386

Query: 714  DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893
            D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q   LI  V MSKNR+F 
Sbjct: 387  DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446

Query: 894  LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073
            L+++  + +CLK   K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P  +CE C+
Sbjct: 447  LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506

Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184
             GKQ + SFP   S RA++PLE++HTD+ GP    SL
Sbjct: 507  LGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  342 bits (877), Expect = 1e-91
 Identities = 176/397 (44%), Positives = 248/397 (62%), Gaps = 3/397 (0%)
 Frame = +3

Query: 3    VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182
            VR+MEK+LRSL  KF+++VT IEE+KDL  ++I++L+GSLQA+E++  + +D       +
Sbjct: 155  VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214

Query: 183  QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359
            QI K   G++                            N  Q+ E               
Sbjct: 215  QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274

Query: 360  XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533
                    DKS  +CYNC K+GH++ EC++P  K+    A   E+K     M L  +YK 
Sbjct: 275  --------DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326

Query: 534  DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713
            DE+ + + WYLDSGASNHMCG K +F ELD+++ G V  GD SK+ VKGKG + I  K G
Sbjct: 327  DEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386

Query: 714  DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893
            D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q   LI  V MSKNR+F 
Sbjct: 387  DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446

Query: 894  LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073
            L+++  + +CLK   K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P  +CE C+
Sbjct: 447  LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506

Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184
             GKQ + SFP   S RA++PLE++HTD+ GP    SL
Sbjct: 507  LGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  339 bits (870), Expect = 7e-91
 Identities = 175/397 (44%), Positives = 251/397 (63%), Gaps = 3/397 (0%)
 Frame = +3

Query: 3    VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182
            VR+MEK+LRSL  KF+++VT IEE+KDL  ++I++L+GSLQA+E++  + +D   +E+ L
Sbjct: 155  VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDI--IEQVL 212

Query: 183  QIKVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQK-NEGYXXXXXXXXXXXXX 359
             ++++  EN                             RG + +E               
Sbjct: 213  NMQITKEENGQSYQRRGGGQVRGRGRGGYGN------GRGWRPHEDNTNQRGENSSRGRG 266

Query: 360  XXXXXXXCDKSQFQCYNCNKYGHFSYECRSPKVKERSYFATAKEDKDIGSAMFL--TYKG 533
                    DKS  +CYNC K+GH++ EC++P  K+    A   E+K     M L  +YK 
Sbjct: 267  KGHPKSRYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKK 326

Query: 534  DEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSKKG 713
            DE+ + + WYLDSGASNHMCG K +F ELD+++ G V  GD SK+ VKGKG + I  K G
Sbjct: 327  DEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNG 386

Query: 714  DKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRLFT 893
            D ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR+Q   LI  V MSKNR+F 
Sbjct: 387  DHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFV 446

Query: 894  LDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEACV 1073
            L+++  + +CLK   K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P  +CE C+
Sbjct: 447  LNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCL 506

Query: 1074 KGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184
             GKQ + SFP   S RA++ LE++HTD+ GP    SL
Sbjct: 507  LGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSL 543


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  339 bits (869), Expect = 9e-91
 Identities = 173/399 (43%), Positives = 252/399 (63%), Gaps = 5/399 (1%)
 Frame = +3

Query: 3    VRVMEKLLRSLTRKFDYVVTSIEESKDLSTISIDELVGSLQAHEQRMNQYDDASHLEKAL 182
            VR+MEK+LRSL  KF+++VT IEE+KDL  ++I++L+GSLQA+E++  + +D       +
Sbjct: 155  VRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNM 214

Query: 183  QI-KVSIGENTXXXXXXXXXXXXXXXXXXXXXXXXXPFNRGQKNEGYXXXXXXXXXXXXX 359
            QI K   G++                            N  Q+ E               
Sbjct: 215  QITKEENGQSYQRRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRY 274

Query: 360  XXXXXXXCDKSQFQCYNCNKYGHFSYECRSP---KVKERSYFATAK-EDKDIGSAMFLTY 527
                    DKS  +CYNC K+GH++ EC++P   K KE++ +   K +++D+   +  +Y
Sbjct: 275  --------DKSSVKCYNCGKFGHYASECKAPSNKKFKEKANYVEEKIQEEDM--LLMASY 324

Query: 528  KGDEEGKKNTWYLDSGASNHMCGHKELFTELDDTISGEVTFGDSSKIPVKGKGTVTIVSK 707
            K DE+ + + WYLDSGASNHMCG K +F ELD+++ G V  GD SK+ VKGKG + I  K
Sbjct: 325  KKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLK 384

Query: 708  KGDKKYINDVYYIPALKSNIISLGQLVEKGYNIQMQDNSLIIRNQVRELIANVEMSKNRL 887
             GD ++I++VYYIP++K+NI+SLGQL+EKGY+I+++DN+L IR++   LI  V MSKNR+
Sbjct: 385  NGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNLITKVPMSKNRM 444

Query: 888  FTLDMQTKVQKCLKSIIKNDSWLWHLRYGHLGFSGLKLLSKTKMVDGLPEINEPENLCEA 1067
            F L+++  + +CLK   K +SWLWHLR+GHL F GL+LLS+ +MV GLP IN P  +CE 
Sbjct: 445  FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEG 504

Query: 1068 CVKGKQHRQSFPVGKSWRARRPLEIVHTDIAGPFDIPSL 1184
            C+ G Q + SFP   S RA++PLE++HTD+ GP    SL
Sbjct: 505  CLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSL 543


Top