BLASTX nr result

ID: Cephaelis21_contig00001934 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00001934
         (1755 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADK92871.1| retrotransposon protein [Hypericum perforatum]          54   7e-12
ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    53   1e-09
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    48   6e-09
gb|EEC71228.1| hypothetical protein OsI_03168 [Oryza sativa Indi...    45   8e-09
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]              56   1e-08

>gb|ADK92871.1| retrotransposon protein [Hypericum perforatum]
          Length = 593

 Score = 53.9 bits (128), Expect(2) = 7e-12
 Identities = 47/193 (24%), Positives = 87/193 (45%), Gaps = 1/193 (0%)
 Frame = +2

Query: 1091 LWHIWKARNLWIF-*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLVVTNTQRART 1267
            LW IW+ RN  +F   + + +   ++K    WQ+F     +     A R   T+   A T
Sbjct: 384  LWIIWRVRNSIVFRTGEEIVICKELEKGFRFWQDFMDTEGNPTVRGAPR---TSKWNAPT 440

Query: 1268 AVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYVVELDAIRSA 1447
            A   G   I V +    A +    G   +DD G  ++A      N+ H  ++E  A+ + 
Sbjct: 441  A---GFYKINVDAGLR-AERGGQVGIVVRDDTGAFVMATTRSFPNLVHPTLLEGQAVYTG 496

Query: 1448 LTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCKFIFIQRK 1627
            L  A    ++RV++  D   VV  L +     +D + I++D  +L S F   +   ++R+
Sbjct: 497  LEFANALGLERVELESDCLPVVMQLSKGYTDRSDLSNIIDDCKMLLSNFQQVRIAHVRRE 556

Query: 1628 WNECSYRLAQFVL 1666
             N+ ++ +A+  +
Sbjct: 557  ANQAAHEMAKMTI 569



 Score = 44.3 bits (103), Expect(2) = 7e-12
 Identities = 20/43 (46%), Positives = 24/43 (55%)
 Frame = +3

Query: 885  LATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWK 1013
            L+    L RRG+QV + C  C     T  HLFF CP +L IWK
Sbjct: 305  LSVRTNLTRRGIQVDEVCPCCAGPSETAAHLFFCCPYTLDIWK 347


>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
            gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis
            thaliana] gi|7269807|emb|CAB79667.1| putative protein
            [Arabidopsis thaliana] gi|67633766|gb|AAY78807.1|
            putative reverse transcriptase/RNA-dependent DNA
            polymerase [Arabidopsis thaliana]
            gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein
            [Arabidopsis thaliana]
          Length = 575

 Score = 53.1 bits (126), Expect(2) = 1e-09
 Identities = 46/206 (22%), Positives = 94/206 (45%)
 Frame = +2

Query: 1061 QKGDELTGYFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLV 1240
            +K  +L  + LW +WK RN  +F  +      V+++A  + +E+   T ++      ++ 
Sbjct: 354  EKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVN 413

Query: 1241 VTNTQRARTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYV 1420
             ++  R R      V     A+ +    +  G G+  ++++G +          ++ +  
Sbjct: 414  RSSCGRWRPPPHQWVKCNTDATWNR-DNERCGIGWVLRNEKGEVKWMGARALPKLKSVLE 472

Query: 1421 VELDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLC 1600
             EL+A+R A+ +  +   + V    D ++++  L       +   TI +D+  L SQF  
Sbjct: 473  AELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPSLKPTI-QDLQRLLSQFTE 531

Query: 1601 CKFIFIQRKWNECSYRLAQFVLSRKN 1678
             KF+FI R+ N  + R+A+  LS  N
Sbjct: 532  VKFVFIPREGNTLAERVARESLSFLN 557



 Score = 37.7 bits (86), Expect(2) = 1e-09
 Identities = 27/114 (23%), Positives = 45/114 (39%), Gaps = 2/114 (1%)
 Frame = +3

Query: 690  WTAKNNGIFIVKSAYNLSQTLKEGRMAKAETSKAREDG--RRMWRXXXXXXXXXXXXXXX 863
            W   ++G + VKS Y +   +   R +  E S+   +   +++W+               
Sbjct: 215  WDYTSSGDYTVKSGYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWK-- 272

Query: 864  XXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLAPI 1025
              C+   L    AL  R +     C RC S K T+ HL F C  +   W ++ I
Sbjct: 273  --CLSNSLPVAGALAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSI 324


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score = 48.1 bits (113), Expect(2) = 6e-09
 Identities = 50/204 (24%), Positives = 86/204 (42%), Gaps = 3/204 (1%)
 Frame = +2

Query: 1076 LTGYFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLVVTNTQ 1255
            L  + LW +WK RN  +F  +      V+ KA  +   +    R +  PQ    V ++T+
Sbjct: 1156 LIPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWN--NRKEPQPQ----VTSSTR 1209

Query: 1256 RARTAVEPGVISICVASESHFAGKD---SGAGYTFQDDQGNLLLANVVFCSNVQHLYVVE 1426
                  +P        +      KD    G G+  ++  G LL   +    + Q +   E
Sbjct: 1210 DRCVKWQPPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWLGLRALPSQQSVLETE 1269

Query: 1427 LDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCK 1606
            ++A+R A+ +  + N  RV    D + +V+ +Q +    +    I +DI  L   F   K
Sbjct: 1270 VEALRWAVLSLSRFNYRRVIFESDSQYLVSLIQNEMDIPSLAPRI-QDIRNLLRHFEEVK 1328

Query: 1607 FIFIQRKWNECSYRLAQFVLSRKN 1678
            F F +R+ N  + R A+  LS  N
Sbjct: 1329 FQFTRREGNNVADRTARESLSLMN 1352



 Score = 40.0 bits (92), Expect(2) = 6e-09
 Identities = 28/121 (23%), Positives = 48/121 (39%), Gaps = 2/121 (1%)
 Frame = +3

Query: 669  KTRTVLVWTAKNNGIFIVKSAYNLSQTLKEGRMAKAETSKAREDG--RRMWRXXXXXXXX 842
            +TR    W    +G + VKS Y +   +   R    E  +   D   +++W+        
Sbjct: 1005 ETRDRFTWEYSRSGHYSVKSGYWVMTEIINQRNNPQEVLQPSLDPIFQQIWKLDVPPKIH 1064

Query: 843  XXXXXXXXXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLAP 1022
                     C++  L+    L  R +    +C RC S   T+ HL F CP +   W ++P
Sbjct: 1065 HFLWR----CVNNCLSVASNLAYRHLAREKSCVRCPSHGETVNHLLFKCPFARLTWAISP 1120

Query: 1023 I 1025
            +
Sbjct: 1121 L 1121


>gb|EEC71228.1| hypothetical protein OsI_03168 [Oryza sativa Indica Group]
          Length = 995

 Score = 45.1 bits (105), Expect(2) = 8e-09
 Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 4/111 (3%)
 Frame = +3

Query: 690  WTAKNNGIFIVKSAYNLSQT----LKEGRMAKAETSKAREDGRRMWRXXXXXXXXXXXXX 857
            W     G+F V+SAYNL ++    + +    + + S A  D  + W+             
Sbjct: 673  WPHDKRGLFTVRSAYNLVRSNLFVVAQSSNGRGQHSGANVDS-QFWKALWTINAPGKMLI 731

Query: 858  XXXXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIW 1010
                 +H  L T   L+RR V   + C  CG     IEH+F  CP +  +W
Sbjct: 732  HLWRSVHDCLPTGFQLRRRHVPATEGCIFCGHDD-RIEHVFLVCPFAATVW 781



 Score = 42.7 bits (99), Expect(2) = 8e-09
 Identities = 48/195 (24%), Positives = 73/195 (37%), Gaps = 4/195 (2%)
 Frame = +2

Query: 1040 IATDSHIQKGDELTGYFLWHIWKARNLW----IF*HQRLKVHVVVQKAITEWQEFEAVTR 1207
            +   SHIQK   +    L HIW+ARN      +  H R  +H +V            V  
Sbjct: 808  LTRSSHIQK--TVLAVTLRHIWEARNFSRNNPVITHPRQVIHKIVSYVDM------IVQH 859

Query: 1208 SQKTPQAGRLVVTNTQRARTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANV 1387
              K   A    +       T   PG++ I   +    A   +G  +  +D     +LA  
Sbjct: 860  CPKDRNASGCDLPLPVTKWTPPPPGMVLINSDAALFQASNQTGLAFVIRDHSATCMLAAN 919

Query: 1388 VFCSNVQHLYVVELDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILE 1567
               + +    + E   IR AL  AK      V +  D   V+  +Q      +    I+ 
Sbjct: 920  KRITGLLSPELAEALVIRFALEHAKAEGFQNVLMASDCLSVIKRIQSGARDLSVVGVIVR 979

Query: 1568 DILLLKSQFLCCKFI 1612
            DI  L+++FL C FI
Sbjct: 980  DIKKLETEFLECSFI 994


>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 56.2 bits (134), Expect(2) = 1e-08
 Identities = 50/191 (26%), Positives = 84/191 (43%), Gaps = 1/191 (0%)
 Frame = +2

Query: 1085 YFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQ-AGRLVVTNTQRA 1261
            Y LW++WKARN  +F +       ++ ++  E  E   +   +   Q A +  V  +  A
Sbjct: 1145 YILWNLWKARNRLVFDNNITAPSDILNRSFMESSEARCLLAKRTGLQTAFQTWVVWSPPA 1204

Query: 1262 RTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYVVELDAIR 1441
                +      C  S SH A     AG   +++ G L +A  +      + ++ EL  +R
Sbjct: 1205 AGFTKLNSDGAC-KSHSHLAS----AGGLLRNENG-LWVAGYICNIGTANSFLAELWGLR 1258

Query: 1442 SALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCKFIFIQ 1621
              L  AK R   ++    D + VV  L++  P T D + +++D  LL   F   K   I 
Sbjct: 1259 EGLLLAKNRGFTKLIAETDSEAVVQVLRKDGPVTPDASILVKDCKLLLDHFQEIKVTHIL 1318

Query: 1622 RKWNECSYRLA 1654
            R+ N+C+  LA
Sbjct: 1319 REGNQCADFLA 1329



 Score = 31.2 bits (69), Expect(2) = 1e-08
 Identities = 16/45 (35%), Positives = 21/45 (46%)
 Frame = +3

Query: 885  LATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLA 1019
            L   V  KRRG+    +C  CG    T++HLF  C  +   W  A
Sbjct: 1061 LMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSA 1105


Top