BLASTX nr result

ID: Cephaelis21_contig00035721 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00035721
         (1305 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   422   e-116
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...   417   e-114
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   417   e-114
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   411   e-112
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   408   e-111

>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  422 bits (1086), Expect = e-116
 Identities = 200/434 (46%), Positives = 283/434 (65%)
 Frame = -2

Query: 1304 MTPLFFHNFWHIVKFDVCNAVKSFFRSNFMLKAFNHTLVSFIPKTQHPSNISQYRPISFC 1125
            M  +F+  FWHI+  DV   V S    +      NHT ++ IPK ++P+  +++RPI+ C
Sbjct: 455  MHAIFYQKFWHIIGDDVTQFVSSILHGSISPSCINHTNIALIPKVKNPTTPAEFRPIALC 514

Query: 1124 NVIYKIISKILSERLKPCLPFCISENQSAFLPGRQIIDNVVIAHEYVHYLNSLRKGKKAF 945
            NV+YK++SK L  RLK  LP  +SENQSAF+PGR I DN +IA E  H +    + +K  
Sbjct: 515  NVVYKLVSKALVIRLKDFLPRLVSENQSAFVPGRLITDNALIAMEVFHSMKHRNRSRKGT 574

Query: 944  VALKLDMSKAYDRVEWRFLSYIMIHMGFDLQFVGWINRCIQSSSFSFNVNGEVKGYIRPS 765
            +A+KLDMSKAYDRVEW FL  +++ MGFD ++V  I  C+ S S+SF +NG V G + P+
Sbjct: 575  IAMKLDMSKAYDRVEWGFLRKLLLTMGFDGRWVNLIMSCVSSVSYSFIINGGVCGSVTPA 634

Query: 764  RGIRQGDPLSPYLFLICSEAFSHLLQNAVRNKEINGMKISRTGPEITHLLFADDSLLFCE 585
            RG+R GDPLSPYLF++ ++AFS ++Q  V+ K+++G K SR+GP I+HL FAD SLLF  
Sbjct: 635  RGLRHGDPLSPYLFILIADAFSKMIQKKVQEKQLHGAKASRSGPVISHLFFADVSLLFTR 694

Query: 584  ADIQQIRTVKRILQDYEHCSGQQVNLDKSSILFSKNATPSLKEQVCAEIPGVAVHTRSKY 405
            A  Q+   +  IL  YE  SGQ++N DKS + FSK  + + KE++   +    V    KY
Sbjct: 695  ASRQECAIIVEILNLYEQASGQKINYDKSEVSFSKGVSIAQKEELSNILQMKQVERHMKY 754

Query: 404  LGLPLVIGRSKNQVFQYVVEAAKSRILSWKNNFLSHAGKEVLLKSVIQALPTYSMSCYKL 225
            LG+P + GRS+  +F  +++    ++  WK   LS AGKE+LLKSVIQA+PTY M  YKL
Sbjct: 755  LGIPSITGRSRTAIFDSLMDRIWKKLQGWKEKLLSRAGKEILLKSVIQAIPTYLMGVYKL 814

Query: 224  SKQVCRQVEKLMAHYWWGKGDHHRKLHWSKWDNLTKSKSEGGLNFTSLESANDALLAKQL 45
               + +++   MA +WWG  D  R++HW  WD+L   K  GG+ F  L   NDALL +Q 
Sbjct: 815  PCSIIQKIHSAMARFWWGSSDTQRRIHWKNWDSLCTLKCFGGMGFRDLRVFNDALLGRQA 874

Query: 44   WRLIQDPHSLMAKV 3
            WRL+++PHSL+A+V
Sbjct: 875  WRLVREPHSLLARV 888


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score =  417 bits (1071), Expect = e-114
 Identities = 203/436 (46%), Positives = 282/436 (64%), Gaps = 2/436 (0%)
 Frame = -2

Query: 1304 MTPLFFHNFWHIVKFDVCNAVKSFFRSNFMLKAFNHTLVSFIPKTQHPSNISQYRPISFC 1125
            +T  F+ N W IV +DV   VK FF ++FM  + NHT +  IPK  +P+ +S YRPI+ C
Sbjct: 607  LTARFYKNCWDIVGYDVILEVKKFFETSFMKPSINHTNICMIPKITNPTTLSDYRPIALC 666

Query: 1124 NVIYKIISKILSERLKPCLPFCISENQSAFLPGRQIIDNVVIAHEYVHYLNSLRKGKKAF 945
            NV+YK+ISK L  RLK  L   +S++Q+AF+PGR I DNV+IAHE +H L   ++  K +
Sbjct: 667  NVLYKVISKCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTY 726

Query: 944  VALKLDMSKAYDRVEWRFLSYIMIHMGFDLQFVGWINRCIQSSSFSFNVNGEVKGYIRPS 765
            +A+K D+SKAYDRVEW FL   M   GF  +++GWI   ++S  +S  +NG   GYI P+
Sbjct: 727  MAVKTDVSKAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPT 786

Query: 764  RGIRQGDPLSPYLFLICSEAFSHLLQNAVRNKEINGMKISRTGPEITHLLFADDSLLFCE 585
            RGIRQGDPLSPYLF++C +  SHL+     + ++ G++I    P ITHL FADDSL FC+
Sbjct: 787  RGIRQGDPLSPYLFILCGDILSHLINGRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQ 846

Query: 584  ADIQQIRTVKRILQDYEHCSGQQVNLDKSSILFSKNATPSLKEQV--CAEIPGVAVHTRS 411
            A+++  + +K +   YE+ SGQ++N+ KS I F      S + ++    EIP        
Sbjct: 847  ANVRNCQALKDVFDVYEYYSGQKINVQKSMITFGSRVYGSTQSKLKQILEIPNQG--GGG 904

Query: 410  KYLGLPLVIGRSKNQVFQYVVEAAKSRILSWKNNFLSHAGKEVLLKSVIQALPTYSMSCY 231
            KYLGLP   GR K ++F+Y+++  K R  +W   FLS AGKE++LKSV  A+P Y+MSC+
Sbjct: 905  KYLGLPEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCF 964

Query: 230  KLSKQVCRQVEKLMAHYWWGKGDHHRKLHWSKWDNLTKSKSEGGLNFTSLESANDALLAK 51
            KL K +  ++E L+ ++WW K  + R + W  W  L  SK EGGL F  L   NDALLAK
Sbjct: 965  KLPKGIVSEIESLLMNFWWEKASNQRGIPWVAWKRLQYSKKEGGLGFRDLAKFNDALLAK 1024

Query: 50   QLWRLIQDPHSLMAKV 3
            Q WRLIQ P+SL A+V
Sbjct: 1025 QAWRLIQYPNSLFARV 1040


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score =  417 bits (1071), Expect = e-114
 Identities = 203/436 (46%), Positives = 282/436 (64%), Gaps = 2/436 (0%)
 Frame = -2

Query: 1304 MTPLFFHNFWHIVKFDVCNAVKSFFRSNFMLKAFNHTLVSFIPKTQHPSNISQYRPISFC 1125
            +T  F+ N W IV +DV   VK FF ++FM  + NHT +  IPK  +P+ +S YRPI+ C
Sbjct: 833  LTARFYKNCWDIVGYDVILEVKKFFETSFMKPSINHTNICMIPKITNPTTLSDYRPIALC 892

Query: 1124 NVIYKIISKILSERLKPCLPFCISENQSAFLPGRQIIDNVVIAHEYVHYLNSLRKGKKAF 945
            NV+YK+ISK L  RLK  L   +S++Q+AF+PGR I DNV+IAHE +H L   ++  K +
Sbjct: 893  NVLYKVISKCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTY 952

Query: 944  VALKLDMSKAYDRVEWRFLSYIMIHMGFDLQFVGWINRCIQSSSFSFNVNGEVKGYIRPS 765
            +A+K D+SKAYDRVEW FL   M   GF  +++GWI   ++S  +S  +NG   GYI P+
Sbjct: 953  MAVKTDVSKAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPT 1012

Query: 764  RGIRQGDPLSPYLFLICSEAFSHLLQNAVRNKEINGMKISRTGPEITHLLFADDSLLFCE 585
            RGIRQGDPLSPYLF++C +  SHL+     + ++ G++I    P ITHL FADDSL FC+
Sbjct: 1013 RGIRQGDPLSPYLFILCGDILSHLINGRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQ 1072

Query: 584  ADIQQIRTVKRILQDYEHCSGQQVNLDKSSILFSKNATPSLKEQV--CAEIPGVAVHTRS 411
            A+++  + +K +   YE+ SGQ++N+ KS I F      S + ++    EIP        
Sbjct: 1073 ANVRNCQALKDVFDVYEYYSGQKINVQKSMITFGSRVYGSTQSRLKQILEIPNQG--GGG 1130

Query: 410  KYLGLPLVIGRSKNQVFQYVVEAAKSRILSWKNNFLSHAGKEVLLKSVIQALPTYSMSCY 231
            KYLGLP   GR K ++F+Y+++  K R  +W   FLS AGKE++LKSV  A+P Y+MSC+
Sbjct: 1131 KYLGLPEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCF 1190

Query: 230  KLSKQVCRQVEKLMAHYWWGKGDHHRKLHWSKWDNLTKSKSEGGLNFTSLESANDALLAK 51
            KL K +  ++E L+ ++WW K  + R + W  W  L  SK EGGL F  L   NDALLAK
Sbjct: 1191 KLPKGIVSEIESLLMNFWWEKASNQRGIPWVAWKRLQYSKKEGGLGFRDLAKFNDALLAK 1250

Query: 50   QLWRLIQDPHSLMAKV 3
            Q WRLIQ P+SL A+V
Sbjct: 1251 QAWRLIQYPNSLFARV 1266


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  411 bits (1057), Expect = e-112
 Identities = 195/434 (44%), Positives = 286/434 (65%)
 Frame = -2

Query: 1304 MTPLFFHNFWHIVKFDVCNAVKSFFRSNFMLKAFNHTLVSFIPKTQHPSNISQYRPISFC 1125
            M  +F+  FWHIV  DV + + +    +      N+T ++ IPK ++P+  +++RPI+ C
Sbjct: 455  MHVIFYQRFWHIVGDDVTSFISNILHGHSSPSCVNNTNIALIPKVKNPTKAAEFRPIALC 514

Query: 1124 NVIYKIISKILSERLKPCLPFCISENQSAFLPGRQIIDNVVIAHEYVHYLNSLRKGKKAF 945
            NV+YK++SK +  RLK  LP  ISENQSAF+PGR I DN +IA E  H + +  + +K  
Sbjct: 515  NVLYKLMSKAIVMRLKSFLPEIISENQSAFVPGRLITDNALIAMEVFHSMKNRNRSRKGT 574

Query: 944  VALKLDMSKAYDRVEWRFLSYIMIHMGFDLQFVGWINRCIQSSSFSFNVNGEVKGYIRPS 765
            +A+KLDMSKAYDRVEW FL  +++ MGFD ++V  I   + S ++SF +NG V G + P+
Sbjct: 575  IAMKLDMSKAYDRVEWGFLRKLLLTMGFDGRWVNLIMEFVSSVTYSFIINGSVCGSVVPA 634

Query: 764  RGIRQGDPLSPYLFLICSEAFSHLLQNAVRNKEINGMKISRTGPEITHLLFADDSLLFCE 585
            RG+RQGDPLSPYLF++ ++AFS ++Q  V++K+++G K SR+GPEI+HL FADDSLLF  
Sbjct: 635  RGLRQGDPLSPYLFIMVADAFSKMIQRKVQDKQLHGAKASRSGPEISHLFFADDSLLFTR 694

Query: 584  ADIQQIRTVKRILQDYEHCSGQQVNLDKSSILFSKNATPSLKEQVCAEIPGVAVHTRSKY 405
            A+ Q+   +  IL  YE  SGQ++N +KS + +S+  + S K+++   +    V    KY
Sbjct: 695  ANRQECTIIVDILNQYELASGQKINYEKSEVSYSRGVSVSQKDELTNILNMRQVDRHEKY 754

Query: 404  LGLPLVIGRSKNQVFQYVVEAAKSRILSWKNNFLSHAGKEVLLKSVIQALPTYSMSCYKL 225
            LG+P + GRSK  +F  +++    ++  WK   LS AGKEVLLKSVIQA+PTY M  YK 
Sbjct: 755  LGIPSISGRSKKAIFDSLIDRIWKKLQGWKEKLLSRAGKEVLLKSVIQAIPTYLMGVYKF 814

Query: 224  SKQVCRQVEKLMAHYWWGKGDHHRKLHWSKWDNLTKSKSEGGLNFTSLESANDALLAKQL 45
               + ++++  MA +WWG  D  RK+HW  WD++   K  GG+ F  L   NDALL +Q 
Sbjct: 815  PVFIIQKIQSAMARFWWGSSDTQRKIHWKNWDSMCNLKCFGGMGFKDLTIFNDALLGRQA 874

Query: 44   WRLIQDPHSLMAKV 3
            WRL ++P SL+ +V
Sbjct: 875  WRLTREPQSLLGRV 888


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  408 bits (1048), Expect = e-111
 Identities = 196/435 (45%), Positives = 285/435 (65%), Gaps = 1/435 (0%)
 Frame = -2

Query: 1304 MTPLFFHNFWHIVKFDVCNAVKSFFRSNFMLKAFNHTLVSFIPKTQHPSNISQYRPISFC 1125
            +T  F+ + W IV  DV   VK FFR+++M ++ NHT +  IPK  +P  +S YRPI+ C
Sbjct: 833  LTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSINHTNICMIPKITNPETLSDYRPIALC 892

Query: 1124 NVIYKIISKILSERLKPCLPFCISENQSAFLPGRQIIDNVVIAHEYVHYLNSLRKGKKAF 945
            NV+YKIISK L ERLK  L   +S++Q+AF+PGR + DNV+IAHE +H L + ++  +++
Sbjct: 893  NVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGRLVNDNVMIAHEMMHSLKTRKRVSQSY 952

Query: 944  VALKLDMSKAYDRVEWRFLSYIMIHMGFDLQFVGWINRCIQSSSFSFNVNGEVKGYIRPS 765
            +A+K D+SKAYDRVEW FL   M   GF   ++ WI   ++S ++S  VNG   G I+P 
Sbjct: 953  MAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQ 1012

Query: 764  RGIRQGDPLSPYLFLICSEAFSHLLQNAVRNKEINGMKISRTGPEITHLLFADDSLLFCE 585
            RGIRQGDPLSPYLF++C++  +HL++N V   +I G++I    P +THL FADDSL FC+
Sbjct: 1013 RGIRQGDPLSPYLFILCADILNHLIKNRVAEGDIRGIRIGNGVPGVTHLQFADDSLFFCQ 1072

Query: 584  ADIQQIRTVKRILQDYEHCSGQQVNLDKSSILFSKNATPSLKEQVCAEIPGVAVH-TRSK 408
            ++++  + +K +   YE+ SGQ++N+ KS I F      + + ++   I G+  H    K
Sbjct: 1073 SNVRNCQALKDVFDVYEYYSGQKINMSKSMITFGSRVHGTTQNRL-KNILGIQSHGGGGK 1131

Query: 407  YLGLPLVIGRSKNQVFQYVVEAAKSRILSWKNNFLSHAGKEVLLKSVIQALPTYSMSCYK 228
            YLGLP   GR K  +F Y++E  K R  SW   +LS AGKE++LKSV  ++P Y+MSC+K
Sbjct: 1132 YLGLPEQFGRKKRDMFNYIIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFK 1191

Query: 227  LSKQVCRQVEKLMAHYWWGKGDHHRKLHWSKWDNLTKSKSEGGLNFTSLESANDALLAKQ 48
            L   +  ++E L+ ++WW K    R++ W  W  L  SK EGGL F  L   NDALLAKQ
Sbjct: 1192 LPLNIVSEIEALLMNFWWEKNAKKREIPWIAWKRLQYSKKEGGLGFRDLAKFNDALLAKQ 1251

Query: 47   LWRLIQDPHSLMAKV 3
            +WR+I +P+SL A++
Sbjct: 1252 VWRMINNPNSLFARI 1266

