BLASTX nr result

ID: Cephaelis21_contig00034691 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00034691
         (1835 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABA98491.1| retrotransposon protein, putative, unclassified [...   111   3e-61
gb|EEC68887.1| hypothetical protein OsI_37529 [Oryza sativa Indi...   120   1e-58
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...    87   1e-56
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   103   1e-55
gb|EEE61581.1| hypothetical protein OsJ_15963 [Oryza sativa Japo...   105   7e-55

>gb|ABA98491.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1621

 Score =  111 bits (278), Expect(3) = 3e-61
 Identities = 86/316 (27%), Positives = 150/316 (47%), Gaps = 24/316 (7%)
 Frame = +2

Query: 863  KLDMMKAFDRVAWPFLRWLLLRFGFHAAFLTLIMNNLTSAWFLIMLNGSFSGFFKSSRVL 1042
            KLDM KA+DRV W FL  ++L+ GFH  ++ LIM  +++  + I +NG  S  F   R L
Sbjct: 830  KLDMSKAYDRVEWSFLHDMILKLGFHTDWVNLIMKCVSTVTYRIRVNGELSESFSPGRGL 889

Query: 1043 KRGDPLSPLPFILLTEALSRGLKKLVHEWNFSPYALMRDAPIITHLCFAEVTGVL----- 1207
            ++GDPLSP  F+L  E  S  L K   E       + + AP ++HL FA+ + +L     
Sbjct: 890  RQGDPLSPYLFLLCAEGFSALLSKTEEEGRLHGIRICQGAPSVSHLLFADDSLILCRANG 949

Query: 1208 ---------YAVY------VLS*RSMSDVLANVSTKRRVPSLCLIVVLWLKLRFCRKSWA 1342
                       +Y      V++ +  S V+ + +T   +    ++  L ++     + + 
Sbjct: 950  GEAQQLQTILQIYEECSGQVIN-KDKSAVMFSPNTS-SLEKRAVMAALNMQRETTNERY- 1006

Query: 1343 LGNPLF----RCNILADFSIRVGTKKLTITPMWKKFRGVWRVKGSFLSSGGHLVLIRHVL 1510
            LG P+F    R  I +    R+          W++ +G W+ K   LS  G  +LI+ V 
Sbjct: 1007 LGLPVFVGRSRTKIFSYLKERI----------WQRIQG-WKEK--LLSRAGKEILIKAVA 1053

Query: 1511 QAMPTHILDTMDPPKGVLGHLEQIFSKFF*GSSTGAVRRIFRSWDRMAYPVVENGIGVRR 1690
            QA+PT  +   +  K +   + ++ +K++  +     +  + SW+++  P    G+G R 
Sbjct: 1054 QAIPTFAMGCFELTKDLCDQISKMIAKYWWSNQEKDNKMHWLSWNKLTLPKNMGGLGFRD 1113

Query: 1691 LEDFHSAFTCKLWWKL 1738
            +  F+ A   K  W+L
Sbjct: 1114 IYIFNLAMLAKQGWRL 1129



 Score = 87.4 bits (215), Expect(3) = 3e-61
 Identities = 43/105 (40%), Positives = 63/105 (60%)
 Frame = +3

Query: 501  LPWAFYRSCWDIIAQDLLLAVQEFFAGAPISRIISSTVMVLLPKKLSPVTFTDFWPISLC 680
            +P  FY++CWD++ + +   V E   G  I    +   +VL+PK   P    D  PISLC
Sbjct: 707  MPAGFYKACWDVVGEKVTDEVLEVLRGGAIPEGWNDITIVLIPKVKKPELIKDLRPISLC 766

Query: 681  NFINKIFTRILCDRLASILPHLVSDEQSAFLHGRDISDSILLAQE 815
            N   K+ +++L +RL  ILP ++S  QSAF+ GR ISD+IL+A E
Sbjct: 767  NVCYKLVSKVLANRLKKILPDVISPAQSAFVPGRLISDNILIADE 811



 Score = 86.3 bits (212), Expect(3) = 3e-61
 Identities = 45/122 (36%), Positives = 69/122 (56%)
 Frame = +1

Query: 142 ELFRKQRARVKWLRKGDHNTKFFHASTLEKRNKLRVSRIRHVDGHWLDADADIRAHAVDF 321
           +++ KQRA   WL KGD NT FFHAS  E+R + R++++R  DG W++ + D RA  ++F
Sbjct: 589 DIYWKQRAHTNWLNKGDRNTSFFHASCSERRRRNRINKLRREDGSWVEREEDKRAMIIEF 648

Query: 322 FQGLLRDDGVLVTPYDNILSVIPTLVFIENNVALLRPMTLDEVRSAVFGLYPDSAPGGDG 501
           F+ L   +G   +    +L V+   V    N +L    T +EV+ A+  +    APG DG
Sbjct: 649 FKQLFTSNGGQNS--QKLLDVVDRKVSGAMNESLRAEFTREEVKEALDAIGDLKAPGPDG 706

Query: 502 YP 507
            P
Sbjct: 707 MP 708


>gb|EEC68887.1| hypothetical protein OsI_37529 [Oryza sativa Indica Group]
          Length = 1765

 Score =  120 bits (301), Expect(3) = 1e-58
 Identities = 84/296 (28%), Positives = 149/296 (50%), Gaps = 4/296 (1%)
 Frame = +2

Query: 863  KLDMMKAFDRVAWPFLRWLLLRFGFHAAFLTLIMNNLTSAWFLIMLNGSFSGFFKSSRVL 1042
            KLD+ KA+DRV W FL   + + GF   ++  IM  +TS  +++  NG+    F  +R L
Sbjct: 1049 KLDLSKAYDRVDWRFLEMAMNKLGFARRWVNWIMKCVTSVRYMVKFNGTLLQSFAPTRGL 1108

Query: 1043 KRGDPLSPLPFILLTEALSRGLKKLVHEWNFSPYALMRDAPIITHLCFAEVTGVLYAVYV 1222
            ++GDPL P  F+ + + LS  LK+ V + + +P+ + R AP I+HL FA+ T + +  + 
Sbjct: 1109 RQGDPLLPFLFLFVADGLSLLLKEKVAQNSLTPFKVCRAAPGISHLLFADDTLLFFKAHQ 1168

Query: 1223 LS*RSMSDVLAN--VSTKRRV-PSLCLIVVLWLKLRFCRKSWALGNPLFRCNILADFSI- 1390
                 + +VL++  + T + + P+ C I++                P     I   F + 
Sbjct: 1169 REAEVVKEVLSSYAMGTGQLINPAKCSILM-----------GGASTPAVSEAISEIFPVE 1217

Query: 1391 RVGTKKLTITPMWKKFRGVWRVKGSFLSSGGHLVLIRHVLQAMPTHILDTMDPPKGVLGH 1570
            R  T +     +WK+   V +   + LS+GG  VLI+ V+QA+P +++     P+ V+  
Sbjct: 1218 RDRTFQSLQAKIWKR---VIQWGENHLSTGGKEVLIKAVIQAIPVYVMGIFKLPESVIDD 1274

Query: 1571 LEQIFSKFF*GSSTGAVRRIFRSWDRMAYPVVENGIGVRRLEDFHSAFTCKLWWKL 1738
            L ++   F+  S  G  +  +++WD +  P    G+G R    F+ A   +  W+L
Sbjct: 1275 LTKLTKNFWWDSMNGQRKTHWKAWDSLTKPKSLGGLGFRDYRLFNQALLARQAWRL 1330



 Score = 89.4 bits (220), Expect(3) = 1e-58
 Identities = 48/104 (46%), Positives = 65/104 (62%)
 Frame = +3

Query: 504  PWAFYRSCWDIIAQDLLLAVQEFFAGAPISRIISSTVMVLLPKKLSPVTFTDFWPISLCN 683
            P  FY+  W  I  D++ AV+ FF    +   ++ T +VL+PKK  PV   DF PISLCN
Sbjct: 927  PARFYQRNWGTIKADIIGAVRRFFQTGLMPEGVNDTAIVLIPKKEQPVDLRDFRPISLCN 986

Query: 684  FINKIFTRILCDRLASILPHLVSDEQSAFLHGRDISDSILLAQE 815
             + K+ ++ L +RL  IL  LVS EQSAF+ GR I+D+ LLA E
Sbjct: 987  VVYKVVSKCLVNRLRPILDDLVSVEQSAFVQGRMITDNALLAFE 1030



 Score = 66.6 bits (161), Expect(3) = 1e-58
 Identities = 38/125 (30%), Positives = 64/125 (51%)
 Frame = +1

Query: 133  FDEELFRKQRARVKWLRKGDHNTKFFHASTLEKRNKLRVSRIRHVDGHWLDADADIRAHA 312
            + EE+   QR+RV WL+  D NTKFFH+  + +  K ++S++R  +     +   + + A
Sbjct: 805  YREEMLWLQRSRVNWLKDEDRNTKFFHSRAVWRAKKNKISKLRDANETVHSSTMKLESMA 864

Query: 313  VDFFQGLLRDDGVLVTPYDNILSVIPTLVFIENNVALLRPMTLDEVRSAVFGLYPDSAPG 492
             ++FQ +   D  L    + +  +I   V    N  L    T DE+  A+F + P  +PG
Sbjct: 865  TEYFQDVYTADPNLNP--ETVTRLIQEKVTDIMNEKLCEDFTEDEISQAIFQIGPLKSPG 922

Query: 493  GDGYP 507
             DG+P
Sbjct: 923  PDGFP 927


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score = 87.0 bits (214), Expect(4) = 1e-56
 Identities = 46/116 (39%), Positives = 66/116 (56%)
 Frame = +2

Query: 845  GHNIMMKLDMMKAFDRVAWPFLRWLLLRFGFHAAFLTLIMNNLTSAWFLIMLNGSFSGFF 1024
            G N+ +K+D+ KAFD + W FL  +L RFGF   F+  I+  L SA   +++NG   GFF
Sbjct: 580  GGNVALKVDIAKAFDTLDWNFLLAVLQRFGFDEKFVHWILVILQSARLSVLVNGKAVGFF 639

Query: 1025 KSSRVLKRGDPLSPLPFILLTEALSRGLKKLVHEWNFSPYALMRDAPIITHLCFAE 1192
              S  +++GDPLSPL F L+ E LSR L     +    P +  R     TH+ +A+
Sbjct: 640  TCSHGVRQGDPLSPLLFCLVEEVLSRALSMAATDGQLIPMSYCRGVSFPTHILYAD 695



 Score = 86.7 bits (213), Expect(4) = 1e-56
 Identities = 42/101 (41%), Positives = 65/101 (64%)
 Frame = +3

Query: 513 FYRSCWDIIAQDLLLAVQEFFAGAPISRIISSTVMVLLPKKLSPVTFTDFWPISLCNFIN 692
           FY++ WDI+  D++ +VQ+FF    +++ I+S ++VL+PK        D+ PI+L NF  
Sbjct: 469 FYQTYWDIVGADVIQSVQDFFISGQLAQNINSNLIVLIPKVPGARVMGDYRPIALANFQF 528

Query: 693 KIFTRILCDRLASILPHLVSDEQSAFLHGRDISDSILLAQE 815
           KI ++IL DRLA I   ++S EQ  F+  RDIS  ++LA E
Sbjct: 529 KIISKILADRLADITMRIISVEQRGFIRDRDISKCVILASE 569



 Score = 64.7 bits (156), Expect(4) = 1e-56
 Identities = 49/172 (28%), Positives = 88/172 (51%), Gaps = 2/172 (1%)
 Frame = +1

Query: 1   GNIFENLN*AEMHLKALEQNFDASGSAGNLFNLN-QAQADYFRA-HFDEELFRKQRARVK 174
           G++   +  A   +  ++Q  D+ G +  L+    +A     +A H+ +EL+R++    +
Sbjct: 299 GDVDRKVRMAVEEVNRIQQIIDSVGFSDQLYAQELEAHLILTKALHYQDELWREKLRDQR 358

Query: 175 WLRKGDHNTKFFHASTLEKRNKLRVSRIRHVDGHWLDADADIRAHAVDFFQGLLRDDGVL 354
           ++  GD NT +FH  +  +  K  +S ++  D    D  A I  H +++FQ +   D   
Sbjct: 359 FIH-GDRNTAYFHRISKVRATKNTISFLQDGDAVITDP-ARIEVHVLNYFQAIFSVDNSC 416

Query: 355 VTPYDNILSVIPTLVFIENNVALLRPMTLDEVRSAVFGLYPDSAPGGDGYPG 510
           +   D ++  IP+LV   +N +LLR     EV++AVF L  D APG +G+ G
Sbjct: 417 IQN-DLVVDTIPSLVSNVDNNSLLRLPLWGEVKNAVFTLNGDGAPGPNGFGG 467



 Score = 52.0 bits (123), Expect(4) = 1e-56
 Identities = 26/75 (34%), Positives = 41/75 (54%)
 Frame = +1

Query: 1180 LFCRGHRCTLRSLCAFLEEYERCTGQCINKEKSSFFVSNRCSVAQASVLSQILGIRKSSL 1359
            +FC G +  +R L     +Y   +GQ IN  KS FF S   + ++  ++S +LG    SL
Sbjct: 699  IFCTGTKRNIRRLIKIFSQYSEVSGQLINNAKSRFFTS-AMTGSRVQMISSLLGFNVGSL 757

Query: 1360 PMQYLGGFLYQGRNK 1404
            P  YLG  +++G+ K
Sbjct: 758  PFTYLGCPIFRGKPK 772


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  103 bits (256), Expect(3) = 1e-55
 Identities = 86/316 (27%), Positives = 148/316 (46%), Gaps = 21/316 (6%)
 Frame = +2

Query: 854  IMMKLDMMKAFDRVAWPFLRWLLLRFGFHAAFLTLIMNNLTSAWFLIMLNGSFSGFFKSS 1033
            I +K D+ KA+DRV WPFL   +   GF   ++ LIM  + S  + +++NG+  G    S
Sbjct: 571  IAIKTDISKAYDRVEWPFLEKAMRGLGFADHWIRLIMECVKSVRYQVLINGTPHGEIIPS 630

Query: 1034 RVLKRGDPLSPLPFILLTEALSRGLKKLVHEWNFSPYALMRDAPIITHLCFA-------- 1189
            R L++GDPLSP  F++ TE L + L+    +   +   + R AP I+HL FA        
Sbjct: 631  RGLRQGDPLSPYLFVICTEMLVKMLQSAEQKNQITGLKVARGAPPISHLLFADDSMFYCK 690

Query: 1190 ---EVTGVLYAV---YVLS*RSMSDVL-------ANVSTKRRVPSLCLIVVLWLKLRFCR 1330
               E  G +  +   Y L+     + L        ++S +RR    CL+     KL   R
Sbjct: 691  VNDEALGQIIRIIEEYSLASGQRVNYLKSSIYFGKHISEERR----CLVK---RKLGIER 743

Query: 1331 KSWALGNPLFRCNILADFSIRVGTKKLTITPMWKKFRGVWRVKGSFLSSGGHLVLIRHVL 1510
            +    G  ++     +    +V T       + KK  G W  + +FLS GG  +L++ V 
Sbjct: 744  EG---GEGVYLGLPESFQGSKVATLSYLKDRLGKKVLG-W--QSNFLSPGGKEILLKAVA 797

Query: 1511 QAMPTHILDTMDPPKGVLGHLEQIFSKFF*GSSTGAVRRIFRSWDRMAYPVVENGIGVRR 1690
             A+PT+ +     PK +   +E + ++F+  +        +++W  ++ P    G+G + 
Sbjct: 798  MALPTYTMSCFKIPKTICQQIESVMAEFWWKNKKEGRGLHWKAWCHLSRPKAVGGLGFKE 857

Query: 1691 LEDFHSAFTCKLWWKL 1738
            +E F+ A   K  W++
Sbjct: 858  IEAFNIALLGKQLWRM 873



 Score = 84.7 bits (208), Expect(3) = 1e-55
 Identities = 43/102 (42%), Positives = 62/102 (60%)
 Frame = +3

Query: 516 YRSCWDIIAQDLLLAVQEFFAGAPISRIISSTVMVLLPKKLSPVTFTDFWPISLCNFINK 695
           Y+  W+ +   +   VQ FF    I   ++ T + L+PK L     TDF PISLCN I K
Sbjct: 456 YQQFWETMGDQITEMVQAFFRSGSIEEGMNKTNICLIPKILKAEKMTDFRPISLCNVIYK 515

Query: 696 IFTRILCDRLASILPHLVSDEQSAFLHGRDISDSILLAQEWL 821
           +  +++ +RL  ILP L+S+ Q+AF+ GR ISD+IL+A E L
Sbjct: 516 VIGKLMANRLKKILPSLISETQAAFVKGRLISDNILIAHELL 557



 Score = 78.6 bits (192), Expect(3) = 1e-55
 Identities = 43/125 (34%), Positives = 70/125 (56%)
 Frame = +1

Query: 136 DEELFRKQRARVKWLRKGDHNTKFFHASTLEKRNKLRVSRIRHVDGHWLDADADIRAHAV 315
           +EE F ++++R+ W+R GD NTK+FHA+T  +R + R+ ++   +G    +D D+   A 
Sbjct: 331 NEEQFWQEKSRIMWMRNGDRNTKYFHAATKNRRAQNRIQKLIDEEGREWTSDEDLGRVAE 390

Query: 316 DFFQGLLRDDGVLVTPYDNILSVIPTLVFIENNVALLRPMTLDEVRSAVFGLYPDSAPGG 495
            +F+ L   + V  T  +  L  +  LV  + N  LL P+T +EV+ A F + P   PG 
Sbjct: 391 AYFKKLFASEDVGYTVEE--LENLTPLVSDQMNNNLLAPITKEEVQRATFSINPHKCPGP 448

Query: 496 DGYPG 510
           DG  G
Sbjct: 449 DGMNG 453


>gb|EEE61581.1| hypothetical protein OsJ_15963 [Oryza sativa Japonica Group]
          Length = 1494

 Score =  105 bits (261), Expect(3) = 7e-55
 Identities = 83/311 (26%), Positives = 142/311 (45%), Gaps = 18/311 (5%)
 Frame = +2

Query: 860  MKLDMMKAFDRVAWPFLRWLLLRFGFHAAFLTLIMNNLTSAWFLIMLNGSFSGFFKSSRV 1039
            +KLD+ KA+DRV W FL   L + GF   +   IM+ +TS  + + LNG+    F  +R 
Sbjct: 825  LKLDLSKAYDRVDWGFLDGALQKLGFGNIWRKWIMSCVTSVRYSVRLNGNMLEPFYPTRG 884

Query: 1040 LKRGDPLSPLPFILLTEALSRGLKKLVHEWNFSPYALMRDAPIITHLCFAEVTGVLYAVY 1219
            L+ GDPL+P  F+ + + LS  L++   E    P  + R AP ++HL FA+ + + +   
Sbjct: 885  LREGDPLNPYLFLFIADGLSNILQRRRDERQIQPLKVCRSAPGVSHLLFADDSLLFFKAE 944

Query: 1220 VLS---*RSMSDVLANVSTKRRVPSLCLIV---------------VLWLKLRFCRKSWAL 1345
            V+     +   D+    + +   P  C ++               VL ++ R C     L
Sbjct: 945  VIQATRIKEALDLYERCTGQLINPKECSLLFSALCPQERQDGIKAVLQVE-RTCFDDKCL 1003

Query: 1346 GNPLFRCNILADFSIRVGTKKLTITPMWKKFRGVWRVKGSFLSSGGHLVLIRHVLQAMPT 1525
            G P     + A+       +   I   ++K    W  +  FLS  G   LI+ V QA+PT
Sbjct: 1004 GLPTPDGRMKAE-------QFQPIKERFEKRLTDWSER--FLSLAGKEALIKSVAQALPT 1054

Query: 1526 HILDTMDPPKGVLGHLEQIFSKFF*GSSTGAVRRIFRSWDRMAYPVVENGIGVRRLEDFH 1705
            + +     P+      EQ+   F+ G   G  +  + +W+++  P +  G+G R +  F+
Sbjct: 1055 YTMGVFKMPERFCEEYEQLVRNFWWGHEKGEKKVHWIAWEKLTSPKLLGGLGFRDIRCFN 1114

Query: 1706 SAFTCKLWWKL 1738
             A   +  W+L
Sbjct: 1115 QALLARQAWRL 1125



 Score = 83.2 bits (204), Expect(3) = 7e-55
 Identities = 40/104 (38%), Positives = 66/104 (63%)
 Frame = +3

Query: 504  PWAFYRSCWDIIAQDLLLAVQEFFAGAPISRIISSTVMVLLPKKLSPVTFTDFWPISLCN 683
            P  F++  W ++ +D++  V+EFF        ++ TV+V++PK  +PV   DF P+SLCN
Sbjct: 704  PARFFQRNWGVLKRDVIEGVREFFETGEWKEGMNDTVIVMIPKTNAPVEMKDFRPVSLCN 763

Query: 684  FINKIFTRILCDRLASILPHLVSDEQSAFLHGRDISDSILLAQE 815
             I K+  + L +RL  +L  ++S+ QSAF+ GR I+D+ L+A E
Sbjct: 764  VIYKVVAKCLVNRLRPLLQEIISETQSAFVPGRMITDNALVAFE 807



 Score = 75.5 bits (184), Expect(3) = 7e-55
 Identities = 41/129 (31%), Positives = 67/129 (51%)
 Frame = +1

Query: 139 EELFRKQRARVKWLRKGDHNTKFFHASTLEKRNKLRVSRIRHVDGHWLDADADIRAHAVD 318
           EE++ KQR+R+ WL++GD NT++FH     +  K  + ++R  DG     + ++   A  
Sbjct: 584 EEIWWKQRSRITWLKEGDRNTRYFHLKASWRARKNLIKKLRRSDGMMCSKEEELGEIARS 643

Query: 319 FFQGLLRDDGVLVTPYDNILSVIPTLVFIENNVALLRPMTLDEVRSAVFGLYPDSAPGGD 498
           FF+ L   D  L      +L++    +  E N  L +P T +E+  A+F + P  APG D
Sbjct: 644 FFRDLYTKDESLNP--GELLNMFEPKITDEMNGMLTKPFTDEEISDALFQIGPLKAPGPD 701

Query: 499 GYPGLSIDR 525
           G+P     R
Sbjct: 702 GFPARFFQR 710


Top