BLASTX nr result

ID: Cephaelis21_contig00027549 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00027549
         (1494 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...   180   7e-46
gb|AAD29058.1| putative non-LTR retroelement reverse transcripta...   174   3e-43
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...   175   1e-42
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   166   3e-42
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   171   4e-42

>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score =  180 bits (457), Expect(2) = 7e-46
 Identities = 109/318 (34%), Positives = 171/318 (53%), Gaps = 3/318 (0%)
 Frame = +2

Query: 11   DFLPISLCNFVNKLFTRILCNGIKGLMPKLISEPQYAFFPGRDISDNVLLSQELLQHLDR 190
            DF PISLCN + K+  ++L N +K ++P +I E Q  F PGR I+DNVL++ E    L +
Sbjct: 512  DFRPISLCNVLYKIVAKVLANRMKMVLPMVIHESQSGFVPGRLITDNVLVAYECFHFLRK 571

Query: 191  KV*GDN--VVFKLDLMKAFDRVNWQFLSLLLQKFGFTSRFIDLIMNNIHGS*FSILFNGM 364
            K  G    +  KLD+ KA+DRV W FL  ++ K GF +R+  L+MN +  + FS+L NG 
Sbjct: 572  KKTGKKGYLGLKLDMSKAYDRVEWCFLENMMLKLGFPTRYTKLVMNCVTSARFSVLVNGQ 631

Query: 365  PTGFFKSSLGLK*GDPLSLFLFILFVEVLSRALTSLVQNRALASY-IGHRVSLFINYLCF 541
            P+  F  S GL+ GDPLS FLF++  E LS  L    + + +    IGHRVS  I++L F
Sbjct: 632  PSRNFFPSRGLRQGDPLSPFLFVVCAEGLSTLLRDAEEKKVIHGVKIGHRVSP-ISHLFF 690

Query: 542  TDELIIFTSRLRGFLCKLFKLLGDYEFSTGQMVNKPQEYVPSFEMMQCSS*SYHQSLFGI 721
             D+ ++F       +  +  +L  YE ++GQ +N  +  +     ++    +  Q     
Sbjct: 691  ADDSLLFIRATEEEVENVMDILSTYEAASGQKLNMEKSEMSYSRNLEPDKINTLQMKLAF 750

Query: 722  AKSSLLMRYLGYILFKGRRKQVYF*PLVDKMVARIVGWAGKLLSPGGHLIMIKHILSSIP 901
                   +YLG   F G  K+  F  + D++  ++ GW GK LS  G  ++IK +  +IP
Sbjct: 751  KTVEGHEKYLGLPTFIGSSKKRVFQAIQDRVWKKLKGWKGKYLSQAGREVLIKAVAQAIP 810

Query: 902  IHTFAAIDPPKAVLSQLE 955
             +       PK+++  +E
Sbjct: 811  TYAMQCFVIPKSIIDGIE 828



 Score = 31.6 bits (70), Expect(2) = 7e-46
 Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
 Frame = +3

Query: 966  KRFFWCNCNDHCRI-WRCWDQMCFPTNKNCLGIHSLEDIQLALAYKLWWK 1112
            + FFW    +  R+ W  W+++  P  +  LGI + +    AL  K  W+
Sbjct: 832  RNFFWGQKEEERRVAWVAWEKLFLPKKEGGLGIRNFDVFNRALLAKQAWR 881


>gb|AAD29058.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1229

 Score =  174 bits (442), Expect(2) = 3e-43
 Identities = 113/320 (35%), Positives = 173/320 (54%), Gaps = 5/320 (1%)
 Frame = +2

Query: 11   DFLPISLCNFVNKLFTRILCNGIKGLMPKLISEPQYAFFPGRDISDNVLLSQELLQHLDR 190
            D+ PI+LCN   K+  +I+   ++ ++PKLISE Q AF PGR ISDNVL++ E+L  L  
Sbjct: 398  DYRPIALCNIFYKIVAKIMTKRMQLILPKLISENQSAFVPGRVISDNVLITHEVLHFLRT 457

Query: 191  KV*GDN--VVFKLDLMKAFDRVNWQFLSLLLQKFGFTSRFIDLIMNNIHGS*FSILFNGM 364
                 +  +  K D+ KA+DRV W FL  +LQ+FGF S +ID ++  +    +S L NG 
Sbjct: 458  SSAKKHCSMAVKTDMSKAYDRVEWDFLKKVLQRFGFHSIWIDWVLECVTSVSYSFLINGT 517

Query: 365  PTGFFKSSLGLK*GDPLSLFLFILFVEVLSRALTSLVQNRALASYIGHRVSL---FINYL 535
            P G    + GL+ GDPLS  LFIL  EVLS   T   + R L    G RVS+    +N+L
Sbjct: 518  PQGKVVPTRGLRQGDPLSPCLFILCTEVLSGLCTRAQRLRQLP---GVRVSINGPRVNHL 574

Query: 536  CFTDELIIFTSRLRGFLCKLFKLLGDYEFSTGQMVNKPQEYVPSFEMMQCSS*SYHQSLF 715
             F D+ + F+        KL ++L  Y  ++GQ +N  +  V        S     + + 
Sbjct: 575  LFADDTMFFSKSDPESCNKLSEILSRYGKASGQSINFHKSSVTFSSKTPRSVKGQVKRIL 634

Query: 716  GIAKSSLLMRYLGYILFKGRRKQVYF*PLVDKMVARIVGWAGKLLSPGGHLIMIKHILSS 895
             I K     +YLG     GRRK+  F  ++DK+  +   WA + LS  G  +M+K +L+S
Sbjct: 635  KIRKEGGTGKYLGLPEHFGRRKRDIFGAIIDKIRQKSHSWASRFLSQAGKQVMLKAVLAS 694

Query: 896  IPIHTFAAIDPPKAVLSQLE 955
            +P+++ +    P A+  +++
Sbjct: 695  MPLYSMSCFKLPSALCRKIQ 714



 Score = 28.5 bits (62), Expect(2) = 3e-43
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 1/49 (2%)
 Frame = +3

Query: 969  RFFWCNCNDHCRI-WRCWDQMCFPTNKNCLGIHSLEDIQLALAYKLWWK 1112
            RF+W    D  +  W  W ++  P N   LG   +E    +L  KL W+
Sbjct: 719  RFWWDTKPDVRKTSWVAWSKLTNPKNAGGLGFRDIERCNDSLLAKLGWR 767


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 642

 Score =  175 bits (443), Expect(2) = 1e-42
 Identities = 102/316 (32%), Positives = 172/316 (54%), Gaps = 1/316 (0%)
 Frame = +2

Query: 23   ISLCNFVNKLFTRILCNGIKGLMPKLISEPQYAFFPGRDISDNVLLSQELLQHLDRKV*G 202
            I+L NF  K+  ++L + +  ++P +IS+ Q  F  GR+I D + L+ E +  LD K  G
Sbjct: 256  IALVNFKFKIINKVLADRLAKILPSIISKEQRGFVQGRNIRDCIALTSEAINVLDNKSFG 315

Query: 203  DNVVFKLDLMKAFDRVNWQFLSLLLQKFGFTSRFIDLIMNNIHGS*FSILFNGMPTGFFK 382
             N+  K+D+ KAFD +NW FL L+L+ FGF   F + I   +H S   I  NG   GFF 
Sbjct: 316  GNLALKIDVTKAFDTLNWDFLLLVLKTFGFNELFCNWIKTILHSSKMFISMNGAQHGFFN 375

Query: 383  SSLGLK*GDPLSLFLFILFVEVLSRALTSLVQNRALASYIGHRVSLFINYLCF-TDELII 559
             + G++ GDPLS  LF +  EVLSR++ S++ ++ L   I    +  + + CF  D+L++
Sbjct: 376  CNRGVRQGDPLSPLLFCIVEEVLSRSI-SILADKGLIDLIAASRNNCLPFHCFYVDDLMV 434

Query: 560  FTSRLRGFLCKLFKLLGDYEFSTGQMVNKPQEYVPSFEMMQCSS*SYHQSLFGIAKSSLL 739
            F       L  L  L   Y   +GQ++N  + ++ +  +      +   ++ G    SL 
Sbjct: 435  FCKAKMSSLIVLKSLFTRYADCSGQIMNIRKSFIFAGGITDTRM-NNIVNILGFNVGSLP 493

Query: 740  MRYLGYILFKGRRKQVYF*PLVDKMVARIVGWAGKLLSPGGHLIMIKHILSSIPIHTFAA 919
              YLG  +FKG+ K ++F P+ DK+ A++  W   LLS  G + ++K ++ S+ +HT + 
Sbjct: 494  FTYLGAPIFKGKPKGIHFQPIADKVKAKLAKWKASLLSIAGRIQLVKSVVQSMLVHTMSI 553

Query: 920  IDPPKAVLSQLEHFFQ 967
               P  +L ++E + +
Sbjct: 554  YSWPIKILKEMEKWIK 569



 Score = 26.6 bits (57), Expect(2) = 1e-42
 Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 1/64 (1%)
 Frame = +3

Query: 960  FFKRFFWC-NCNDHCRIWRCWDQMCFPTNKNCLGIHSLEDIQLALAYKLWWK*RTETGVW 1136
            + K F W  +      +   W ++C    +  LG+ SL  +  A   K+ W        W
Sbjct: 567  WIKNFIWSGDVTKRKMVTVAWRKICADYEEGGLGVKSLICLNEATNLKICWNLMQSDEQW 626

Query: 1137 AHIV 1148
            A+I+
Sbjct: 627  ANII 630


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score =  166 bits (420), Expect(2) = 3e-42
 Identities = 104/313 (33%), Positives = 163/313 (52%), Gaps = 2/313 (0%)
 Frame = +2

Query: 20   PISLCNFVNKLFTRILCNGIKGLMPKLISEPQYAFFPGRDISDNVLLSQELLQHL--DRK 193
            PISLCN   K+ ++ILC  +K ++P LISE Q AF  GR ISDN+L++QE+   L  +  
Sbjct: 290  PISLCNVGYKVISKILCQRLKTVLPNLISETQSAFVDGRLISDNILIAQEMFHGLRTNSS 349

Query: 194  V*GDNVVFKLDLMKAFDRVNWQFLSLLLQKFGFTSRFIDLIMNNIHGS*FSILFNGMPTG 373
                 +  K D+ KA+D+V W F+  LL+K GF  ++I  IM  I    + +L NG P G
Sbjct: 350  CKDKFMAIKTDMSKAYDQVEWNFIEALLRKMGFCEKWISWIMWCITTVQYKVLINGQPKG 409

Query: 374  FFKSSLGLK*GDPLSLFLFILFVEVLSRALTSLVQNRALASYIGHRVSLFINYLCFTDEL 553
                  GL+ GDPLS +LFIL  EVL   +    +   +        S  +++L F D+ 
Sbjct: 410  LIIPERGLRQGDPLSPYLFILCTEVLIANIRKAERQNLITGIKVATPSPAVSHLLFADDS 469

Query: 554  IIFTSRLRGFLCKLFKLLGDYEFSTGQMVNKPQEYVPSFEMMQCSS*SYHQSLFGIAKSS 733
            + F    +     + ++L  YE  +GQ +N  +  +     ++ S  +  + + GI    
Sbjct: 470  LFFCKANKEQCGIILEILKQYESVSGQQINFSKSSIQFGHKVEDSIKADIKLILGIHNLG 529

Query: 734  LLMRYLGYILFKGRRKQVYF*PLVDKMVARIVGWAGKLLSPGGHLIMIKHILSSIPIHTF 913
             +  YLG     G  K   F  + D++ +RI GW+ K LS GG  +MIK + +++P +  
Sbjct: 530  GMGSYLGLPESLGGSKTKVFSFVRDRLQSRINGWSAKFLSKGGKEVMIKSVAATLPRYVM 589

Query: 914  AAIDPPKAVLSQL 952
            +    PKA+ S+L
Sbjct: 590  SCFRLPKAITSKL 602



 Score = 33.9 bits (76), Expect(2) = 3e-42
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 2/48 (4%)
 Frame = +3

Query: 975  FWCNCNDHCR--IWRCWDQMCFPTNKNCLGIHSLEDIQLALAYKLWWK 1112
            FW + N   R   W  WD++C   +   LG  +++D   AL  K  W+
Sbjct: 609  FWWSSNGDSRGMHWMAWDKLCSSKSDGGLGFRNVDDFNSALLAKQLWR 656


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  171 bits (434), Expect(2) = 4e-42
 Identities = 97/317 (30%), Positives = 166/317 (52%), Gaps = 2/317 (0%)
 Frame = +2

Query: 11   DFLPISLCNFVNKLFTRILCNGIKGLMPKLISEPQYAFFPGRDISDNVLLSQELLQHLDR 190
            +F PI+LCN + KL ++ +   +K  +P++ISE Q AF PGR I+DN L++ E+   +  
Sbjct: 507  EFRPIALCNVLYKLMSKAIVMRLKSFLPEIISENQSAFVPGRLITDNALIAMEVFHSMKN 566

Query: 191  KV*G--DNVVFKLDLMKAFDRVNWQFLSLLLQKFGFTSRFIDLIMNNIHGS*FSILFNGM 364
            +       +  KLD+ KA+DRV W FL  LL   GF  R+++LIM  +    +S + NG 
Sbjct: 567  RNRSRKGTIAMKLDMSKAYDRVEWGFLRKLLLTMGFDGRWVNLIMEFVSSVTYSFIINGS 626

Query: 365  PTGFFKSSLGLK*GDPLSLFLFILFVEVLSRALTSLVQNRALASYIGHRVSLFINYLCFT 544
              G    + GL+ GDPLS +LFI+  +  S+ +   VQ++ L      R    I++L F 
Sbjct: 627  VCGSVVPARGLRQGDPLSPYLFIMVADAFSKMIQRKVQDKQLHGAKASRSGPEISHLFFA 686

Query: 545  DELIIFTSRLRGFLCKLFKLLGDYEFSTGQMVNKPQEYVPSFEMMQCSS*SYHQSLFGIA 724
            D+ ++FT   R     +  +L  YE ++GQ +N  +  V     +  S      ++  + 
Sbjct: 687  DDSLLFTRANRQECTIIVDILNQYELASGQKINYEKSEVSYSRGVSVSQKDELTNILNMR 746

Query: 725  KSSLLMRYLGYILFKGRRKQVYF*PLVDKMVARIVGWAGKLLSPGGHLIMIKHILSSIPI 904
            +     +YLG     GR K+  F  L+D++  ++ GW  KLLS  G  +++K ++ +IP 
Sbjct: 747  QVDRHEKYLGIPSISGRSKKAIFDSLIDRIWKKLQGWKEKLLSRAGKEVLLKSVIQAIPT 806

Query: 905  HTFAAIDPPKAVLSQLE 955
            +       P  ++ +++
Sbjct: 807  YLMGVYKFPVFIIQKIQ 823



 Score = 28.1 bits (61), Expect(2) = 4e-42
 Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
 Frame = +3

Query: 969  RFFWCNCNDHCRI-WRCWDQMCFPTNKNCLGIHSLEDIQL 1085
            RF+W + +   +I W+ WD MC   N  C G    +D+ +
Sbjct: 828  RFWWGSSDTQRKIHWKNWDSMC---NLKCFGGMGFKDLTI 864

