BLASTX nr result

ID: Cephaelis21_contig00002238 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002238
         (1340 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABB46931.2| retrotransposon protein, putative, unclassified [...    99   3e-27
emb|CAN64220.1| hypothetical protein VITISV_014001 [Vitis vinifera]    84   4e-27
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...    76   4e-26
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...    91   6e-26
gb|EEC81314.1| hypothetical protein OsI_24468 [Oryza sativa Indi...    99   1e-25

>gb|ABB46931.2| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1853

 Score = 98.6 bits (244), Expect(2) = 3e-27
 Identities = 70/229 (30%), Positives = 114/229 (49%), Gaps = 7/229 (3%)
 Frame = +2

Query: 464  INLNRAQAD--LLGALAHVEYFWRQKAREKWLQKGDWNTKFFHSSVVSKRDRLQISKLKD 637
            + L+RA AD    G  +  +  W Q++R  WL++GD NT+FFH+  V +  + +I+KL+D
Sbjct: 850  VRLDRALADDGWRGLFSTAQMIWLQRSRIAWLKEGDRNTRFFHNKAVWRAKKNKITKLRD 909

Query: 638  HSGAWVDDLDLFC*HACDFFQAQLSEKISSFDAAKVNSLLDYIPQLGGNSILLRPVTLKE 817
                           A ++FQ +L     S D ++V SL+        N  L +  + +E
Sbjct: 910  SDDTVHSTTKELERMATEYFQ-RLFTADPSIDHSRVTSLMKPKVTDAMNEELCKTFS-EE 967

Query: 818  DIINRSRHCGE-----K**FNERFFQHH*DVISIDLFQVV*EFSTGVPIPRSIGSIWIVL 982
            +I N     G         F  RF+Q +  ++  D+ + V EF +   +P  +    IVL
Sbjct: 968  EIANALFQIGPLKAPGPDGFPGRFYQRNWAILKDDIVRAVQEFFSLGTMPSGVNETAIVL 1027

Query: 983  LPKKDNPMTFGDFQPISFCNFINKFFTRILCDHLKPLLPGLILDPQSAF 1129
            +PK + P    DF+PIS CN + K  ++ L + L+P+L  L+   QSAF
Sbjct: 1028 IPKTEQPQELKDFRPISLCNVVYKIVSKCLVNRLRPILDDLVSQNQSAF 1076



 Score = 51.2 bits (121), Expect(2) = 3e-27
 Identities = 26/59 (44%), Positives = 37/59 (62%), Gaps = 2/59 (3%)
 Frame = +3

Query: 1128 FLPG*DITDNVLLAQELL*HLDKRVRGHNLF--FKLDLMKAFDRVNWMFICLFLLKFGF 1298
            F+PG  ITDN L+A E   H+ +     N +  +KLDL KA+DRV+W F+   ++K GF
Sbjct: 1076 FVPGRLITDNALIAFEYFHHIQRNKNPENAYSAYKLDLSKAYDRVDWEFLEQAMVKLGF 1134


>emb|CAN64220.1| hypothetical protein VITISV_014001 [Vitis vinifera]
          Length = 1937

 Score = 83.6 bits (205), Expect(3) = 4e-27
 Identities = 77/264 (29%), Positives = 120/264 (45%), Gaps = 7/264 (2%)
 Frame = +2

Query: 359  NKHSFGNIFYXXXXXXXXXXXXE-RVFDSS*VDADLINLNRAQADLLGALAHVEYFWRQK 535
            NK   GN+ +            E +  +S     D+   NRA  D        E  WRQK
Sbjct: 1051 NKEVVGNVSFNRAEAFSRLQRWEAKENESPLTPGDVEAKNRALEDYKKWALLEETSWRQK 1110

Query: 536  AREKWLQKGDWNTKFFHSSVVSKRDRLQISKLKDHSGAWVDDLDLFC*HACDFFQAQLSE 715
            +RE WL++GD NTK+FH    +K  R  +SK+K  +G  +  ++      C  +Q+ LS+
Sbjct: 1111 SREIWLKEGDKNTKYFHKMANAKARRNFLSKIK-VNGVNLSSVEDIKEGVCRAYQSLLSD 1169

Query: 716  KISSFDAAKVNSLLDYIPQLG-GNSILLRPVTLKEDIINR-SRHCGEK----**FNERFF 877
              S      +N L     +LG G +  L  +  +E+I    S  CG+K      F   F+
Sbjct: 1170 --SGDWRPSINGL--NFKELGEGLASSLEVMFSEEEIFAALSSFCGDKAPGPDGFTMAFW 1225

Query: 878  QHH*DVISIDLFQVV*EFSTGVPIPRSIGSIWIVLLPKKDNPMTFGDFQPISFCNFINKF 1057
                DV+  ++  +  EF       RS+ S +++L+PKK+      DF PIS    + K 
Sbjct: 1226 LFCWDVVKPEILGLFREFYLHGTFQRSLNSTFLLLIPKKEGTEDLSDFXPISLVXSVYKL 1285

Query: 1058 FTRILCDHLKPLLPGLILDPQSAF 1129
              ++L + LK  +  +I D Q AF
Sbjct: 1286 LAKVLANRLKSXMGEVISDSQHAF 1309



 Score = 45.1 bits (105), Expect(3) = 4e-27
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
 Frame = +3

Query: 1128 FLPG*DITDNVLLAQELL*HLDKRVRGHN--LFFKLDLMKAFDRVNWMFICLFLLKFGF 1298
            F+ G  I D VL+A E L   D R++G+N  L  K+D+ KAFD V W F+   + K GF
Sbjct: 1309 FVHGRQILDAVLIANEAL---DSRLKGNNPGLLLKMDIEKAFDHVKWDFLMDVMSKMGF 1364



 Score = 40.8 bits (94), Expect(3) = 4e-27
 Identities = 28/92 (30%), Positives = 39/92 (42%), Gaps = 1/92 (1%)
 Frame = +1

Query: 67   QNLETLDRLLFSTQWLHQFSDSSVELLSRAISHHSPLFYNYSSPVSVPFLFKFHDILLRY 246
            Q    LDR L S  W   FS  +   L R +S HSP+        +    F+F ++ L+ 
Sbjct: 953  QAASRLDRFLISDPWEDHFSAITQSALPRLVSDHSPIVLEAGGFSTGKSPFRFENMWLKL 1012

Query: 247  KDFLQMVAASWELHQV-GFGMFHFAMKLHHLK 339
              F  +V   W  + V G+     A KL  LK
Sbjct: 1013 DGFKDLVRCWWNGYSVEGYSSHCIAEKLKALK 1044


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score = 76.3 bits (186), Expect(3) = 4e-26
 Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 7/269 (2%)
 Frame = +2

Query: 344  LFFYLNKHSFGNIFYXXXXXXXXXXXXERVFDSS*VDADLINLNRAQADLL--GALAHVE 517
            +F + N+  FG++              +++ DS    +D +     +A L+   AL + +
Sbjct: 289  VFRHWNRTVFGDVDRKVRMAVEEVNRIQQIIDSVGF-SDQLYAQELEAHLILTKALHYQD 347

Query: 518  YFWRQKAREKWLQKGDWNTKFFHSSVVSKRDRLQISKLKDHSGAWVDDLDLFC*HACDFF 697
              WR+K R++    GD NT +FH     +  +  IS L+D      D   +   H  ++F
Sbjct: 348  ELWREKLRDQRFIHGDRNTAYFHRISKVRATKNTISFLQDGDAVITDPARIEV-HVLNYF 406

Query: 698  QAQLSEKISSF-DAAKVNSLLDYIPQLGGNSILLRPV--TLKEDI--INRSRHCGEK**F 862
            QA  S   S   +   V+++   +  +  NS+L  P+   +K  +  +N     G    F
Sbjct: 407  QAIFSVDNSCIQNDLVVDTIPSLVSNVDNNSLLRLPLWGEVKNAVFTLNGDGAPGPNG-F 465

Query: 863  NERFFQHH*DVISIDLFQVV*EFSTGVPIPRSIGSIWIVLLPKKDNPMTFGDFQPISFCN 1042
               F+Q + D++  D+ Q V +F     + ++I S  IVL+PK       GD++PI+  N
Sbjct: 466  GGHFYQTYWDIVGADVIQSVQDFFISGQLAQNINSNLIVLIPKVPGARVMGDYRPIALAN 525

Query: 1043 FINKFFTRILCDHLKPLLPGLILDPQSAF 1129
            F  K  ++IL D L  +   +I   Q  F
Sbjct: 526  FQFKIISKILADRLADITMRIISVEQRGF 554



 Score = 48.1 bits (113), Expect(3) = 4e-26
 Identities = 24/57 (42%), Positives = 36/57 (63%)
 Frame = +3

Query: 1128 FLPG*DITDNVLLAQELL*HLDKRVRGHNLFFKLDLMKAFDRVNWMFICLFLLKFGF 1298
            F+   DI+  V+LA E +  L+KR  G N+  K+D+ KAFD ++W F+   L +FGF
Sbjct: 554  FIRDRDISKCVILASEAINLLEKRQYGGNVALKVDIAKAFDTLDWNFLLAVLQRFGF 610



 Score = 41.6 bits (96), Expect(3) = 4e-26
 Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 6/95 (6%)
 Frame = +1

Query: 82  LDRLLFSTQWLHQFSDSSVELLSRAI-----SHHSPLFYNYSSPVSVPF-LFKFHDILLR 243
           LDR + + +W++ +  SS   L  +      S H PL  +     S     FKF      
Sbjct: 196 LDRAICNEEWVNFWRSSSCSALGNSALVRHQSDHHPLLMSMDFCTSQRSGNFKFFKTWTE 255

Query: 244 YKDFLQMVAASWELHQVGFGMFHFAMKLHHLKGTF 348
           ++D  ++VA +W  H  G GM     KL H+K  F
Sbjct: 256 HEDCRRIVAENWSKHTRGHGMTRLQAKLKHMKQVF 290


>emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score = 91.3 bits (225), Expect(3) = 6e-26
 Identities = 77/266 (28%), Positives = 123/266 (46%), Gaps = 9/266 (3%)
 Frame = +2

Query: 359  NKHSFGNIFYXXXXXXXXXXXXERVFDSS*VDADLINLNR-AQADLLGALAHVEYFWRQK 535
            N + FG+I              ++  D   +D + +   R AQADL   +   E +W Q+
Sbjct: 283  NLNEFGSIDANIRKLEDCIANFDKEADERELDKEELEKRREAQADLWKWMKRKEIYWAQR 342

Query: 536  AREKWLQKGDWNTKFFHSSVVSKRDRLQISKLKDHSGAWVDDLDLFC*HACDFFQAQLSE 715
            +R  WL+ GD NTKFFH+ + S + R  +    +  G   +D       A  FF+     
Sbjct: 343  SRITWLKAGDKNTKFFHA-IASNKKRKNMMACIETDGQSTNDPSQIKKEARAFFK----- 396

Query: 716  KISSFDAAKVNSL----LDYIPQLGGNSILLRPVTLKEDIINRSRHCGEK**----FNER 871
            KI   D  K  +L    L  + Q   NS L+ P T +E     S    +K      FN +
Sbjct: 397  KIFKEDHVKRPTLENLHLKRLSQNQANS-LITPFTTEEIDTAVSSCASDKAPGPDGFNFK 455

Query: 872  FFQHH*DVISIDLFQVV*EFSTGVPIPRSIGSIWIVLLPKKDNPMTFGDFQPISFCNFIN 1051
            F +   D+I  D++ +V +F     +P+   + +I L+PK DNP +  D++PIS   FI 
Sbjct: 456  FVKSAWDIIKTDIYGIVNDFWETGCLPQGCNTAYIALIPKIDNPSSLKDYRPISMVGFIY 515

Query: 1052 KFFTRILCDHLKPLLPGLILDPQSAF 1129
            K   ++L   L+ ++  LI   QS++
Sbjct: 516  KIVAKLLAKRLQSVISSLISPLQSSY 541



 Score = 40.8 bits (94), Expect(3) = 6e-26
 Identities = 22/66 (33%), Positives = 34/66 (51%)
 Frame = +1

Query: 82  LDRLLFSTQWLHQFSDSSVELLSRAISHHSPLFYNYSSPVSVPFLFKFHDILLRYKDFLQ 261
           LDRLL S +W+    +  V +L R +S H PL  +       P  F+F++  L     ++
Sbjct: 195 LDRLLVSPEWVSHCPNIKVSILQRGLSDHCPLLVHSHIQEWGPKPFRFNNCWLTDPKCMK 254

Query: 262 MVAASW 279
           +V ASW
Sbjct: 255 IVEASW 260



 Score = 33.1 bits (74), Expect(3) = 6e-26
 Identities = 20/62 (32%), Positives = 30/62 (48%)
 Frame = +3

Query: 1128 FLPG*DITDNVLLAQELL*HLDKRVRGHNLFFKLDLMKAFDRVNWMFICLFLLKFGFHFT 1307
            ++ G  I D  L+A E++    KR     +  KLD  KA+D V+W F+   L +  F   
Sbjct: 541  YVKGRQILDGALVASEIIESCKKR-NIEAILLKLDFHKAYDSVSWNFLQWTLDQMNFPVK 599

Query: 1308 LC 1313
             C
Sbjct: 600  WC 601


>gb|EEC81314.1| hypothetical protein OsI_24468 [Oryza sativa Indica Group]
          Length = 797

 Score = 99.0 bits (245), Expect(2) = 1e-25
 Identities = 73/231 (31%), Positives = 110/231 (47%), Gaps = 5/231 (2%)
 Frame = +2

Query: 452  DADLINLNRAQADLLGALAHVEYFWRQKAREKWLQKGDWNTKFFHSSVVSKRDRLQISKL 631
            +AD   + +A   +   L   E  W Q++R  WL++GD NTKFFHS  V +  + +IS+L
Sbjct: 509  NADSREIRQASDRMNELLYREEMLWLQRSRISWLKEGDHNTKFFHSKAVWRAKKNRISRL 568

Query: 632  KDHSGAWVDDLDLFC*HACDFFQAQLSEKISSFDAAKVNSLLDYIPQLGGNSILLRPVTL 811
            +   G   +        A D+F+ ++ +   + D A V+ L         N  L +  T 
Sbjct: 569  RASDGTIHNTATTIEKLATDYFK-EIYKADPNLDQASVSQLFQKKVTAAMNENLCKEFT- 626

Query: 812  KEDIINRSRHCGE-----K**FNERFFQHH*DVISIDLFQVV*EFSTGVPIPRSIGSIWI 976
             E+I +     G         F  RF+Q +   I  D+   V EF     +P       I
Sbjct: 627  DEEIADALFQIGPLKAPGPDGFPARFYQRNWATIKSDVVAAVKEFFHSGIMPEGANETAI 686

Query: 977  VLLPKKDNPMTFGDFQPISFCNFINKFFTRILCDHLKPLLPGLILDPQSAF 1129
            VL+PK D P+   DF+PIS CN + K  ++ L + L+P+L  LIL  QSAF
Sbjct: 687  VLIPKIDQPVELKDFRPISLCNVLYKIVSKCLVNRLRPILDELILVNQSAF 737



 Score = 45.4 bits (106), Expect(2) = 1e-25
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
 Frame = +3

Query: 1128 FLPG*DITDNVLLAQELL*HLDKRVRGHNLF--FKLDLMKAFDRVNWMFICLFLLKFGF 1298
            F+PG  ITDN LLA E    + K    +     +KLDL KA+DRV+W+F+   + K GF
Sbjct: 737  FVPGRLITDNSLLAFECFHFIQKNKHQNKAACAYKLDLSKAYDRVDWVFLEQAMYKLGF 795


Top