BLASTX nr result

ID: Cephaelis21_contig00036739 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00036739
         (1128 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF79618.1|AC027665_19 F5M15.26 [Arabidopsis thaliana]             226   6e-57
gb|ABA97854.1| retrotransposon protein, putative, Ty3-gypsy subc...   218   3e-54
gb|ABA95229.1| retrotransposon protein, putative, Ty3-gypsy subc...   217   4e-54
gb|AAR13317.1| gag-pol polyprotein [Phaseolus vulgaris]               216   6e-54
gb|AAY99339.1| pol-polyprotein [Silene latifolia]                     211   3e-52

>gb|AAF79618.1|AC027665_19 F5M15.26 [Arabidopsis thaliana]
          Length = 1838

 Score =  226 bits (577), Expect = 6e-57
 Identities = 143/377 (37%), Positives = 198/377 (52%), Gaps = 4/377 (1%)
 Frame = -2

Query: 1121 SVVDIMYIDCFRKLGISFDELQPSTLPLTGFATQVVRPLGTISLAITVSEYPKDNTLICT 942
            S VD+++ D    + I+  +++P + PL GF    V  +GTI L I V            
Sbjct: 638  SSVDLIFKDVLTAMNITDRQIKPVSKPLAGFDGDFVMTIGTIKLPIFVGGL----IAWVK 693

Query: 941  FYVIDAPSPFNIILGQPWINGAKAVLSTYHLC*VFHRPWSCYRTGKANYGKRV*ESLFKQ 762
            F VI  P+ +N+ILG PWI+  +A+ STYH C  F        T    +  R  +     
Sbjct: 694  FVVIGKPAVYNVILGTPWIHQMQAIPSTYHQCVKFP-------THNGIFTLRAPKEAKTP 746

Query: 761  SQRYRPRGGEYCSFVFSDVNPPHLGVAPRGKLTGIQTEITDKVERVVLNPEFPDRVVQVG 582
            S+ Y     E C                             + E V ++   P R V VG
Sbjct: 747  SRSYEE--SELC-----------------------------RTEMVNIDESDPTRCVGVG 775

Query: 581  ATLPLEIKKQII*LLRKYVKVFARSSVDMSGVFSDLIVHRLNVDQLVKPVQQKRRILLQS 402
            A +   I+ ++I LL++  K FA S  DM G+   +  H LNVD   KPV+QKRR L   
Sbjct: 776  AEISPSIRLELIALLKRNSKTFAWSIEDMKGIDPAITAHELNVDPTFKPVKQKRRKLGPE 835

Query: 401  AAWQSRTKSENCDKTIS*RRF----YPTWLANSIMVKKSDSSWRMCIDYTDLNKYCPKDF 234
             A   R  +E  +K +   +     YP WLAN ++VKK +  WR+C+DYTDLNK CPKD 
Sbjct: 836  RA---RAVNEEVEKLLKAGQIIEVKYPEWLANPVVVKKKNGKWRVCVDYTDLNKACPKDS 892

Query: 233  HLLPVIDQKVEAFSGFEVLMFLDAYKGYHQILMAEEDAEKTTFITDIGIFCYKKMPFGLK 54
            + LP ID+ VEA SG  +L F+DA+ GY+QILM ++D EKT+F+TD G +CYK M FGLK
Sbjct: 893  YPLPHIDRLVEATSGNGLLSFMDAFSGYNQILMHKDDQEKTSFVTDRGTYCYKVMSFGLK 952

Query: 53   NAGATY*RMMDRVFTHQ 3
            NAGATY R ++++   Q
Sbjct: 953  NAGATYQRFVNKMLADQ 969


>gb|ABA97854.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1889

 Score =  218 bits (554), Expect = 3e-54
 Identities = 132/375 (35%), Positives = 199/375 (53%), Gaps = 2/375 (0%)
 Frame = -2

Query: 1121 SVVDIMYIDCFRKLGISFDELQPSTLPLTGFATQVVRPLGTISLAITVSEYPKDNTLICT 942
            S  D+++ D F+K+ I  D L  + +PL GF  Q V  +G ISL +   +          
Sbjct: 775  SSADVLFYDAFKKMQIPEDRLTNAGVPLQGFGGQQVHAIGKISLQVVFGKGTNVRKEEIV 834

Query: 941  FYVIDAPSPFNIILGQPWINGAKAVLSTYHLC*VFHRPWSCYRTGKANYGKRV*ESLFKQ 762
            F V+D P  +N ILG+  IN  +A++   ++C     P             R        
Sbjct: 835  FDVVDMPYQYNAILGRSTINIFEAIIHHNYICMKLPGPRGVITVRGEQLAAR-------- 886

Query: 761  SQRYRPRGGEYCSFVFSDVNPPHLGVAPRGKLTGIQTEITD-KVERVVLNPEFPDRVVQV 585
              +Y  +G          V   H+    +G+   IQ  I + K ++V L+   P + + +
Sbjct: 887  --KYELQGTP-------SVKGVHVVDQKQGEYIKIQKPIPEGKTKKVQLDEHDPGKFILI 937

Query: 584  GATLPLEIKKQII*LLRKYVKVFARSSVDMSGVFSDLIVHRLNVDQLVKPVQQKRRILLQ 405
            G  L   I+++I+ ++++ + VFA S  ++ GV   LI H L +    KP +QK R +  
Sbjct: 938  GENLEKHIEEEILKVVKENMAVFAWSPDELQGVDRSLIEHNLAIKSGYKPKKQKLRRMST 997

Query: 404  SAAWQSRTKSENCDKTIS*RR-FYPTWLANSIMVKKSDSSWRMCIDYTDLNKYCPKDFHL 228
                 ++ + E   K    R   +P WLAN ++VKK++  WRMCID+TDLNK CPKD   
Sbjct: 998  DRQQAAKIELEKLLKAKVIREVMHPEWLANPVLVKKANGKWRMCIDFTDLNKACPKDDFP 1057

Query: 227  LPVIDQKVEAFSGFEVLMFLDAYKGYHQILMAEEDAEKTTFITDIGIFCYKKMPFGLKNA 48
            LP IDQ V+A +G E++ FLDAY GYHQ+ M +ED EKT+FIT  G +C+ +MPFGLKNA
Sbjct: 1058 LPRIDQLVDATAGCELMSFLDAYSGYHQVFMVKEDEEKTSFITPFGSYCFIRMPFGLKNA 1117

Query: 47   GATY*RMMDRVFTHQ 3
            GAT+ R++ +V   Q
Sbjct: 1118 GATFARLIGKVLAKQ 1132


>gb|ABA95229.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1980

 Score =  217 bits (553), Expect = 4e-54
 Identities = 132/375 (35%), Positives = 204/375 (54%), Gaps = 2/375 (0%)
 Frame = -2

Query: 1121 SVVDIMYIDCFRKLGISFDELQPSTLPLTGFATQVVRPLGTISLAITVSEYPKDNTLICT 942
            S  D+++ D F+K+ I  D L  + +PL GF  Q V  +G ISL +   +          
Sbjct: 775  SSADVLFYDAFKKMQIPEDRLTNAGVPLQGFGGQQVHAIGKISLQVVFGKGTNVRKEEIV 834

Query: 941  FYVIDAPSPFNIILGQPWINGAKAVLSTYHLC*VFHRPWSCYRTGKANYGKRV*ESLFKQ 762
            F V+D P  +N ILG+  IN  +A++   ++C     P           G+++       
Sbjct: 835  FDVVDMPYQYNAILGRSTINIFEAIIHHNYICMKLPGPRGVITVR----GEQL------V 884

Query: 761  SQRYRPRGGEYCSFVFSDVNPPHLGVAPRGKLTGIQTEITD-KVERVVLNPEFPDRVVQV 585
            +++Y  +G          V   H+    +G+   IQ  I + K ++V L+   P + + +
Sbjct: 885  ARKYELQGTP-------SVKGVHVVDQKQGEYIKIQKPIPEGKTKKVQLDEHDPGKFILI 937

Query: 584  GATLPLEIKKQII*LLRKYVKVFARSSVDMSGVFSDLIVHRLNVDQLVKPVQQKRRILLQ 405
            G  L   I+++I+ ++++ + VFA S  ++ GV   LI H L +    KP +QK R +  
Sbjct: 938  GENLEKHIEEEILKVVKENMAVFAWSPDELQGVDRSLIEHNLAIKSGYKPKKQKLRRMST 997

Query: 404  SAAWQSRTKSENCDKTIS*RR-FYPTWLANSIMVKKSDSSWRMCIDYTDLNKYCPKDFHL 228
                 ++ + E   K    R   +P WLAN ++VKK++  WRMCID+TDLNK CPKD   
Sbjct: 998  DRQQAAKIELEKLLKAKVIREVMHPEWLANPVLVKKANGKWRMCIDFTDLNKACPKDDFP 1057

Query: 227  LPVIDQKVEAFSGFEVLMFLDAYKGYHQILMAEEDAEKTTFITDIGIFCYKKMPFGLKNA 48
            LP IDQ V+A +G E++ FLDAY GYHQ+ M +ED EKT+FIT  G +C+ +MPFGLKNA
Sbjct: 1058 LPRIDQLVDATAGCELMSFLDAYSGYHQVFMVKEDEEKTSFITPFGTYCFIRMPFGLKNA 1117

Query: 47   GATY*RMMDRVFTHQ 3
            GAT+ R++ +V   Q
Sbjct: 1118 GATFARLIGKVLAKQ 1132


>gb|AAR13317.1| gag-pol polyprotein [Phaseolus vulgaris]
          Length = 1859

 Score =  216 bits (551), Expect = 6e-54
 Identities = 147/399 (36%), Positives = 211/399 (52%), Gaps = 29/399 (7%)
 Frame = -2

Query: 1121 SVVDIMYIDCFRKLGISFDELQPSTLPLTGFATQVVRPLGTISLAITVSEYPKDNTLICT 942
            S VDI+Y + F+K+ I   E+QP    + GF+ + V   G I L  T  +     T+   
Sbjct: 660  SSVDILYWETFKKMKIPEAEIQPYNEQIVGFSRERVDTKGFIDLYTTFGDDYLSKTINIR 719

Query: 941  FYVIDAPSPFNIILGQPWINGAKAVLSTYHLC*VF--------------HRPWSCY---- 816
            + +++A + +NI+LG+P IN  KA++ST HL   F                   CY    
Sbjct: 720  YLLVNANTSYNILLGRPSINRLKAIVSTPHLAMKFPSVNGDIATVHIDQKTARECYVASL 779

Query: 815  -----RTGKANYGKRV*ESLFKQSQRYRPRGGEYCSFVFS--DVNPPHLGVAPRGKLTGI 657
                 R       +R  E   + ++R R RG E    + +  D++P         +L   
Sbjct: 780  KVEPTRRLYTTSAERTTERRGRSTER-RSRGRESRRHLVALVDLDP---------RLDDP 829

Query: 656  QTEITDKVERVVLNPEFPDRVVQVGATLPLEIKKQII*LLRKYVKVFARSSVDMSGVFSD 477
            + E  + ++ + L  +  DR   +G +L  + ++ I   L K   +FA ++ DM GV SD
Sbjct: 830  RMEAGEDLQPIFLRDK--DRKTYMGTSLKPDDRETIGKTLTKNADLFAWTAADMPGVKSD 887

Query: 476  LIVHRLNVDQLVKPVQQKRRILLQSAAWQSRTKSENCDKTIS*----RRFYPTWLANSIM 309
            +I HRL+V    +P+ QK+R L +     +R   E  DK I      +  Y TWLAN +M
Sbjct: 888  VITHRLSVYTEARPIAQKKRKLGEERRKAAR---EETDKLIQAGFIQKAHYTTWLANVVM 944

Query: 308  VKKSDSSWRMCIDYTDLNKYCPKDFHLLPVIDQKVEAFSGFEVLMFLDAYKGYHQILMAE 129
            VKK++  WRMC+DYTDLNK CPKD + LP ID+ V+  +G ++L FLDAY GY+QI M  
Sbjct: 945  VKKTNGKWRMCVDYTDLNKACPKDSYPLPTIDRLVDGAAGHQILSFLDAYSGYNQIQMYH 1004

Query: 128  EDAEKTTFITDIGIFCYKKMPFGLKNAGATY*RMMDRVF 12
             D EKT F TD   F Y+ MPFGLKNAGATY R+MD VF
Sbjct: 1005 RDREKTAFRTDSDNFFYEVMPFGLKNAGATYQRLMDHVF 1043


>gb|AAY99339.1| pol-polyprotein [Silene latifolia]
          Length = 1307

 Score =  211 bits (536), Expect = 3e-52
 Identities = 136/378 (35%), Positives = 199/378 (52%), Gaps = 8/378 (2%)
 Frame = -2

Query: 1112 DIMYIDCFRKLGISFDELQPSTLPLTGFATQVVRPLGTISLAITVSEYPKDNTLICTFYV 933
            +IM+ +CF  LG+  ++L P T PL  F+   + PLG+I L +   +      ++  F V
Sbjct: 88   NIMFRECFLNLGLKIEDLSPCTNPLYSFSGAGLVPLGSIRLPVMFGQTDAAKNVLSEFVV 147

Query: 932  IDAPSPFNIILGQPWINGAKAVLSTYHLC*VFHRPWSCYRTGKANYGKRV*ESLFKQ--S 759
            ID  S +N+++G+  ++ A AV+S   L  +       Y + +    K V +    +  +
Sbjct: 148  IDGSSAYNVLIGRVTLSEADAVMSIRALTLM-------YVSDQGEVQKLVSKDERDEVVN 200

Query: 758  QRYRPRGGEYCSFVFSDVNPPHLGVAPRGKLTGIQTEITDKVERVVLNPEF--PDRVVQV 585
             +   RG    S   +  +      + R +   + T     VE          P R V V
Sbjct: 201  VQISARGCNMQSLKVAKKSEKGKSPSLRQEGDPMSTNNVSMVEGAETEQVEIDPGRTVTV 260

Query: 584  GATLPLEIKKQII*LLRKYVKVFARSSVDMSGVFSDLIVHRLNVDQLVKPVQQKRRILLQ 405
            G  L  + +  ++ LLRK   VFA S+ +M GV  ++IVH+LNV    +PV+QK R    
Sbjct: 261  GVGLEPKFRADLLDLLRKNKDVFAYSAAEMPGVSREVIVHKLNVLSNARPVKQKMR---N 317

Query: 404  SAAWQSRTKSENCDKTIS*RRF----YPTWLANSIMVKKSDSSWRMCIDYTDLNKYCPKD 237
            S+A +        DK +         YP WLAN +MVKKS   WRMC+D+T+LNK CPKD
Sbjct: 318  SSAEKDDAIKAEVDKLLEAGFIMPCTYPEWLANVVMVKKSSGGWRMCVDFTNLNKACPKD 377

Query: 236  FHLLPVIDQKVEAFSGFEVLMFLDAYKGYHQILMAEEDAEKTTFITDIGIFCYKKMPFGL 57
             + LP ID  ++A + + +L  LDA+ GYHQ+ MAEED  K  FIT  G + YK M FGL
Sbjct: 378  CYPLPRIDSLIDATASYTMLSLLDAFSGYHQVFMAEEDVLKCAFITIHGTYMYKMMSFGL 437

Query: 56   KNAGATY*RMMDRVFTHQ 3
            KNAGATY R++D+VF  Q
Sbjct: 438  KNAGATYTRLVDKVFQDQ 455

