BLASTX nr result

ID: Angelica22_contig00022079

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00022079
         (879 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ACN78973.1| copia-type polyprotein [Glycine max] gi|225016157...   333   4e-89
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   321   1e-85
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         320   4e-85
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   320   4e-85
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   318   8e-85

>gb|ACN78973.1| copia-type polyprotein [Glycine max] gi|225016157|gb|ACN78980.1|
           copia-type polyprotein [Glycine max]
          Length = 1042

 Score =  333 bits (853), Expect = 4e-89
 Identities = 161/283 (56%), Positives = 203/283 (71%)
 Frame = +2

Query: 26  GVVSFGDSSKIEVKGKGEISFTKKDGKQGRIEDVYYVPDMKNNILSLGQLLEKGYSIFMG 205
           G VSFGDSSK++++GKG I  + KDG    I DVYYVP +K+NILSLGQL+EKGY I M 
Sbjct: 52  GNVSFGDSSKVQIQGKGTILISLKDGAHKLITDVYYVPKLKSNILSLGQLVEKGYEIHMK 111

Query: 206 DKAMILKDKNGRTIAKIEMSQNRMFKLNLKNIQEKCLQAKSEDKATLWHKRLGHLHYGGL 385
           D  + L+DKN   IAK+ MS+NRMF LN+K  + KCL+A  +D++  WH R GHL++G L
Sbjct: 112 DCCLWLRDKNSNLIAKVFMSRNRMFTLNIKTNEAKCLKASIKDESWCWHMRFGHLNFGAL 171

Query: 386 KELANKGMVHGLPSMDHKGSFCEDCVLAKHTRSSFQKKATYRASKPLELVHTDICGPITP 565
           K L  + MV G+P ++H    CE C+L KH R SF K+A  RA +PL+LV+TD+CGPI P
Sbjct: 172 KSLGEEKMVKGMPQINHPNQLCEACLLGKHARRSFPKEANSRAKEPLQLVYTDVCGPINP 231

Query: 566 KSHSSKRYFITFIDDYSRRTWVYFLNEKSEALRXXXXXXXXXXXXTGLHIKALRSDRGGE 745
            S  + +YF+ FIDDYSR+TWVYFL +KSEA              +G  IKALRSDRGGE
Sbjct: 232 PSCGNNKYFLLFIDDYSRKTWVYFLKQKSEAFVAFKNFKALVEKESGYVIKALRSDRGGE 291

Query: 746 YTSNAFTQYCEEHGIRRFLTAPYSPQQNGVAERKNRTILDMVR 874
           +TS  F ++CE++GIRR LT P SPQQNGVAERKNRTIL+M R
Sbjct: 292 FTSKEFNEFCEKYGIRRPLTVPRSPQQNGVAERKNRTILNMTR 334


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  321 bits (823), Expect = 1e-85
 Identities = 157/291 (53%), Positives = 201/291 (69%)
 Frame = +2

Query: 5    DLKEINNGVVSFGDSSKIEVKGKGEISFTKKDGKQGRIEDVYYVPDMKNNILSLGQLLEK 184
            +L E   G V+ GD SK+EVKGKG I    K+G    I +VYY+P MK NILSLGQLLEK
Sbjct: 354  ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413

Query: 185  GYSIFMGDKAMILKDKNGRTIAKIEMSQNRMFKLNLKNIQEKCLQAKSEDKATLWHKRLG 364
            GY I + D  + ++D+    I K+ MS+NRMF LN++N   +CL+   ++++ LWH R G
Sbjct: 414  GYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFG 473

Query: 365  HLHYGGLKELANKGMVHGLPSMDHKGSFCEDCVLAKHTRSSFQKKATYRASKPLELVHTD 544
            HL++GGL+ L+ K MV GLP ++H    CE C+L K  + SF K+++ RA KPLEL+HTD
Sbjct: 474  HLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTD 533

Query: 545  ICGPITPKSHSSKRYFITFIDDYSRRTWVYFLNEKSEALRXXXXXXXXXXXXTGLHIKAL 724
            +CGPI PKS     YF+ FIDD+SR+TWVYFL EKSE               +GL IK +
Sbjct: 534  VCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTM 593

Query: 725  RSDRGGEYTSNAFTQYCEEHGIRRFLTAPYSPQQNGVAERKNRTILDMVRS 877
            RSDRGGE+TS  F +YCE++GIRR LT P SPQQNGVAERKNRTIL+M RS
Sbjct: 594  RSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARS 644


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  320 bits (819), Expect = 4e-85
 Identities = 156/291 (53%), Positives = 200/291 (68%)
 Frame = +2

Query: 5    DLKEINNGVVSFGDSSKIEVKGKGEISFTKKDGKQGRIEDVYYVPDMKNNILSLGQLLEK 184
            +L E   G V+ GD SK+EVKGKG I    K+G    I +VYY+P MK NILSLGQLLEK
Sbjct: 354  ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413

Query: 185  GYSIFMGDKAMILKDKNGRTIAKIEMSQNRMFKLNLKNIQEKCLQAKSEDKATLWHKRLG 364
            GY I + D  + ++D+    I K+ MS+NRMF LN++N   +CL+   ++++ LWH R G
Sbjct: 414  GYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFG 473

Query: 365  HLHYGGLKELANKGMVHGLPSMDHKGSFCEDCVLAKHTRSSFQKKATYRASKPLELVHTD 544
            HL++GGL+ L+ K MV GLP ++H    CE C+L K  + SF K+++ RA KPLEL+HTD
Sbjct: 474  HLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTD 533

Query: 545  ICGPITPKSHSSKRYFITFIDDYSRRTWVYFLNEKSEALRXXXXXXXXXXXXTGLHIKAL 724
            +CGPI PKS     YF+ FIDD+SR+TWVYFL EKSE               +GL IK +
Sbjct: 534  VCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTM 593

Query: 725  RSDRGGEYTSNAFTQYCEEHGIRRFLTAPYSPQQNGVAERKNRTILDMVRS 877
            RSDRGGE+TS  F +YCE++GIRR LT P SPQQNGV ERKNRTIL+M RS
Sbjct: 594  RSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARS 644


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  320 bits (819), Expect = 4e-85
 Identities = 156/291 (53%), Positives = 200/291 (68%)
 Frame = +2

Query: 5    DLKEINNGVVSFGDSSKIEVKGKGEISFTKKDGKQGRIEDVYYVPDMKNNILSLGQLLEK 184
            +L E   G V+ GD SK+EVKGKG I    K+G    I +VYY+P MK NILSLGQLLEK
Sbjct: 354  ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413

Query: 185  GYSIFMGDKAMILKDKNGRTIAKIEMSQNRMFKLNLKNIQEKCLQAKSEDKATLWHKRLG 364
            GY I + D  + ++D+    I K+ MS+NRMF LN++N   +CL+   ++++ LWH R G
Sbjct: 414  GYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFG 473

Query: 365  HLHYGGLKELANKGMVHGLPSMDHKGSFCEDCVLAKHTRSSFQKKATYRASKPLELVHTD 544
            HL++GGL+ L+ K MV GLP ++H    CE C+L K  + SF K+++ RA KPLEL+HTD
Sbjct: 474  HLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTD 533

Query: 545  ICGPITPKSHSSKRYFITFIDDYSRRTWVYFLNEKSEALRXXXXXXXXXXXXTGLHIKAL 724
            +CGPI PKS     YF+ FIDD+SR+TWVYFL EKSE               +GL IK +
Sbjct: 534  VCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTM 593

Query: 725  RSDRGGEYTSNAFTQYCEEHGIRRFLTAPYSPQQNGVAERKNRTILDMVRS 877
            RSDRGGE+TS  F +YCE++GIRR LT P SPQQNGV ERKNRTIL+M RS
Sbjct: 594  RSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARS 644


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  318 bits (816), Expect = 8e-85
 Identities = 156/291 (53%), Positives = 199/291 (68%)
 Frame = +2

Query: 5    DLKEINNGVVSFGDSSKIEVKGKGEISFTKKDGKQGRIEDVYYVPDMKNNILSLGQLLEK 184
            +L E   G V+ GD SK+EVKGKG I    K+G    I +VYY+P MK NILSLGQLLEK
Sbjct: 354  ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413

Query: 185  GYSIFMGDKAMILKDKNGRTIAKIEMSQNRMFKLNLKNIQEKCLQAKSEDKATLWHKRLG 364
            GY I + D  + ++DK    I K+ MS+NRMF LN++N   +CL+   ++++ LWH R G
Sbjct: 414  GYDIRLKDNNLSIRDKESNLITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFG 473

Query: 365  HLHYGGLKELANKGMVHGLPSMDHKGSFCEDCVLAKHTRSSFQKKATYRASKPLELVHTD 544
            HL++GGL+ L+ K MV GLP ++H    CE C+L    + SF K+++ RA KPLEL+HTD
Sbjct: 474  HLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGNQFKMSFPKESSSRAQKPLELIHTD 533

Query: 545  ICGPITPKSHSSKRYFITFIDDYSRRTWVYFLNEKSEALRXXXXXXXXXXXXTGLHIKAL 724
            +CGPI PKS     YF+ FIDD+SR+TWVYFL EKSE               +GL IK +
Sbjct: 534  VCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTM 593

Query: 725  RSDRGGEYTSNAFTQYCEEHGIRRFLTAPYSPQQNGVAERKNRTILDMVRS 877
            RSD GGE+TS  F +YCE++GIRR LT P SPQQNGVAERKNRTIL+M RS
Sbjct: 594  RSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARS 644

