BLASTX nr result

ID: Salvia21_contig00000819

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00000819
         (875 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABD96963.1| hypothetical protein [Cleome spinosa]                   78   2e-12
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...    71   3e-10
emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]         70   6e-10
gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Ar...    68   3e-09
gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subc...    68   3e-09

>gb|ABD96963.1| hypothetical protein [Cleome spinosa]
          Length = 408

 Score = 78.2 bits (191), Expect = 2e-12
 Identities = 56/209 (26%), Positives = 91/209 (43%), Gaps = 6/209 (2%)
 Frame = -1

Query: 836 ARATILLNLSSSVVRKVSHYACAKELWDELNSIYAAPSEVSTWSLQNQFMSFQMDSSKDV 657
           AR  I+L L+  V+RKV     A  +W +L  ++   S  +   L  +   F+MDSS+ +
Sbjct: 98  ARNLIVLALADQVLRKVISERTAFGIWRKLERLHIEQSLPNRMYLMQRVSGFRMDSSRTI 157

Query: 656 DTNMEIFNKLLHDLKLAGDDSIEKYAPQILLNSIPESFVEVKSALKYGGAKVTCDMI-IN 480
           + N++IF KLL DL        E+Y    LLNS+P ++ +++  LKY  A ++ + +   
Sbjct: 158 EENLDIFQKLLSDLHSLNVKVEEEYQAVYLLNSLPPAYEQLREVLKYSRATISVEEVKAA 217

Query: 479 GXXXXXXXXXXXXXXXXHGEVMFVNXXXXXXXXXXXKL-----CWNCGKPGHLSNKCRLP 315
                             GE + V            K      CW CGK GH   +CR  
Sbjct: 218 ARMKELELLAQGTLTRGTGEGLVVKGKPEKSGGGKKKAKDQVECWYCGKKGHYKKECRSR 277

Query: 314 KKNKSFEPEQRANNVFEEDGVYMVHDHDF 228
           +  +  E +    +V E D   ++   D+
Sbjct: 278 RAKEETEGKGVVASVQEYDSEVLLVCSDY 306


>gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1137

 Score = 71.2 bits (173), Expect = 3e-10
 Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 9/200 (4%)
 Frame = -1

Query: 872 VTDEKLH-DMNDNARATILLNLSSSVVRKVSHYACAKELWDELNSIYAAPSEVSTWSLQN 696
           + +EK   D ++ A   I +N+   V+R + +   A E W  L+ +Y   S  +   LQ 
Sbjct: 31  IEEEKARIDQDEKAMDMIFINVGDKVLRNIENSKTAAEAWATLDKLYLVKSLPNRVYLQL 90

Query: 695 QFMSFQMDSSKDVDTNMEIFNKLLHDLKLAGDDSIEKYAPQILLNSIPESFVEVKSALKY 516
           +  +++M  SK ++ N++ F K++ DL        ++    ++L+++P+S+  +K  LKY
Sbjct: 91  KVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKY 150

Query: 515 GGAKVTCDMIINGXXXXXXXXXXXXXXXXH-GEVMFVNXXXXXXXXXXXK------LCWN 357
           G   +  D +I+                   GE ++V            K      +CW 
Sbjct: 151 GREGIKLDDVISAAKSKELELRDSSGGSRPVGEGLYVRGKSQARGSDGPKSTEGKKVCWI 210

Query: 356 CGKPGHLSNKC-RLPKKNKS 300
           CGK GH   +C +  +KNK+
Sbjct: 211 CGKEGHFKRQCYKWLEKNKA 230


>emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]
          Length = 560

 Score = 70.1 bits (170), Expect = 6e-10
 Identities = 50/223 (22%), Positives = 90/223 (40%), Gaps = 19/223 (8%)
 Frame = -1

Query: 872 VTDEKLHDMNDNARATILLNLSSSVVRKVSHYACAKELWDELNSIYAAPSEVSTWSLQNQ 693
           V D+   + ++NA   I+ ++  +V+RK+ H   A E+W  LN  Y   S  +   +Q +
Sbjct: 66  VPDQVKIEKSENAMNIIITHVGDAVLRKIDHCKSAAEMWKTLNKQYMETSLPNRIYVQLK 125

Query: 692 FMSFQMDSSKDVDTNMEIFNKLLHDLKLAGDDSIEKYAPQILLNSIPESFVEVKSALKYG 513
           F SF+M+ SK ++ N+  F K++ +L     + +E+    + LN +   + ++K  LKYG
Sbjct: 126 FYSFKMNDSKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNGLSSRYSQLKHTLKYG 185

Query: 512 GAKVTC-DMIINGXXXXXXXXXXXXXXXXHGEVMFVN------------------XXXXX 390
              ++  D+I +                    V++ N                       
Sbjct: 186 NKALSLQDVISSARSLERELDEQKETDKNTSTVLYTNERGRPLTRNQNQNKGGQGRGRSK 245

Query: 389 XXXXXXKLCWNCGKPGHLSNKCRLPKKNKSFEPEQRANNVFEE 261
                   CW C K GH+   C   K+    E    A  + E+
Sbjct: 246 SNSNAKLTCWYCKKEGHVKKDCFARKRKLESENPGEAGVITEK 288


>gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
          Length = 1356

 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 52/221 (23%), Positives = 89/221 (40%), Gaps = 17/221 (7%)
 Frame = -1

Query: 872 VTDEKLHDMNDNARATILLNLSSSVVRKVSHYACAKELWDELNSIYAAPSEVSTWSLQNQ 693
           V D    + ++ A+  I+ ++S  V+ KV+HYA   +LW  LN  Y   S  +    Q +
Sbjct: 70  VPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYTQLK 129

Query: 692 FMSFQMDSSKDVDTNMEIFNKLLHDLKLAGDDSIEKYAPQILLNSIPESFVEVKSALKYG 513
             SF+M S+  +D N++ F +++ +L        E+    ++LNS+P S +++K  LKYG
Sbjct: 130 LYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTLKYG 189

Query: 512 GAKVTC-DMIINGXXXXXXXXXXXXXXXXHGEVMFV----------------NXXXXXXX 384
              +T  D+  +                    V++                         
Sbjct: 190 NKTLTVQDVTSSAKSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKGRSRSN 249

Query: 383 XXXXKLCWNCGKPGHLSNKCRLPKKNKSFEPEQRANNVFEE 261
                 CW C K GH+   C   KK    E +  A  + E+
Sbjct: 250 SKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEK 290


>gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
           Japonica Group]
          Length = 415

 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 52/194 (26%), Positives = 86/194 (44%), Gaps = 7/194 (3%)
 Frame = -1

Query: 863 EKLHDMNDNARATILLNLSSSVVRKVSHYACAKELWDELNSIYAAPSEVSTWSLQNQFMS 684
           +K ++M   A ATI L+LS SV+ +V      KE+WD+L S++ + S  S   L+ Q   
Sbjct: 51  DKWNEMKAQAAATIRLSLSDSVMYQVMDEKSPKEIWDKLASLHMSKSLTSKLYLKQQLYG 110

Query: 683 FQMDSSKDVDTNMEIFNKLLHDLKLAGDDSIEKYAPQILLNSIPESFVEVKSALKYGGAK 504
            Q+    D+  ++++FN+L+ DL        ++    ILL S+P S+  V + L +G   
Sbjct: 111 LQVQEESDLRKHVDVFNQLVVDLSKLDVKLDDEDKAIILLCSLPLSYEHVVTTLTHGKDT 170

Query: 503 VTCDMIING--XXXXXXXXXXXXXXXXHGEVMFV-----NXXXXXXXXXXXKLCWNCGKP 345
           V  + II+                    G+ + V     +             C+ C + 
Sbjct: 171 VKTEEIISSLLARDLRRSKKNEATKASQGKSLLVKDKHDHEAGVSKSKEKGARCYKCHEF 230

Query: 344 GHLSNKCRLPKKNK 303
           GH+   C L KK K
Sbjct: 231 GHIRRNCPLLKKRK 244

