BLASTX nr result

ID: Cephaelis21_contig00004102 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00004102
         (1372 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    93   4e-32
gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...    86   7e-31
emb|CAN67932.1| hypothetical protein VITISV_013913 [Vitis vinifera]    81   8e-30
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...    84   2e-29
gb|AAN04214.1| Putative retroelement [Oryza sativa Japonica Grou...    89   2e-29

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score = 92.8 bits (229), Expect(2) = 4e-32
 Identities = 45/144 (31%), Positives = 79/144 (54%)
 Frame = +2

Query: 8    LRSSHFSILVNGVSAGFF*ASRGLKQGDPLSPLLFILLTEAMSRGLKHLVSTNRVQHYAL 187
            ++S  + +L+NG   G    SRGL+QGDPLSP LF++ TE + + L+     N++    +
Sbjct: 610  VKSVRYQVLINGTPHGEIIPSRGLRQGDPLSPYLFVICTEMLVKMLQSAEQKNQITGLKV 669

Query: 188  YPGAPVITHLCFTDDLVIFTRANRRSVRELAAFLQIFEVASGQRINREKSDFILSRRCTT 367
              GAP I+HL F DD + + + N  ++ ++   ++ + +ASGQR+N  KS     +  + 
Sbjct: 670  ARGAPPISHLLFADDSMFYCKVNDEALGQIIRIIEEYSLASGQRVNYLKSSIYFGKHISE 729

Query: 368  HHSHILSQMLGIKRTAFTNALFGM 439
                ++ + LGI+R        G+
Sbjct: 730  ERRCLVKRKLGIEREGGEGVYLGL 753



 Score = 73.2 bits (178), Expect(2) = 4e-32
 Identities = 64/330 (19%), Positives = 118/330 (35%), Gaps = 37/330 (11%)
 Frame = +3

Query: 483  LLDKISHKLDAWKGQXXXXXXXXXXXXHVLQSLPVHTLAAMNPPRSILRQLEGFFVRFFW 662
            L D++  K+  W+               V  +LP +T++    P++I +Q+E     F+W
Sbjct: 768  LKDRLGKKVLGWQSNFLSPGGKEILLKAVAMALPTYTMSCFKIPKTICQQIESVMAEFWW 827

Query: 663  GSQPEHTRQIWRSWDNLAYPVQEGGLGLRKLDTVLEAFFAK-LWWKIHHSLGIWAVYVNG 839
             ++ E     W++W +L+ P   GGLG ++++    A   K LW  I     + A     
Sbjct: 828  KNKKEGRGLHWKAWCHLSRPKAVGGLGFKEIEAFNIALLGKQLWRMITEKDSLMAKVFKS 887

Query: 840  VTWKR------------SFYRSRLQSVKHFVDIHFRICVGDGSS-NFWTANWLGTSPLLT 980
              + +            SF    +   +  +    R  +G+G + N WT  W+G  P   
Sbjct: 888  RYFSKSDPLNAPLGSRPSFAWKSIYEAQVLIKQGIRAVIGNGETINVWTDPWIGAKPAKA 947

Query: 981  DGVTPSNPDLTLKEACSQEV-----------WQEDLFAMDLSNTALQKI*EMHCHFSDDQ 1127
                  +  ++   A S  V           W  +L ++   +   + I  +     + +
Sbjct: 948  AQAVKRSHLVSQYAANSIHVVKDLLLPDGRDWNWNLVSLLFPDNTQENILALRPGGKETR 1007

Query: 1128 D*FIWDLT------------PXXXXXXXXXXXXXXXXKEINPFMKWVWRSSVPLKMSILL 1271
            D F W+ +                               ++P  + +W+  VP K+   L
Sbjct: 1008 DRFTWEYSRSGHYSVKSGYWVMTEIINQRNNPQEVLQPSLDPIFQQIWKLDVPPKIHHFL 1067

Query: 1272 WRVLHGLLSTDDILQRMGFFLASKCPLCPS 1361
            WR ++  LS    L          C  CPS
Sbjct: 1068 WRCVNNCLSVASNLAYRHLAREKSCVRCPS 1097


>gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1656

 Score = 86.3 bits (212), Expect(2) = 7e-31
 Identities = 46/130 (35%), Positives = 71/130 (54%)
 Frame = +2

Query: 14   SSHFSILVNGVSAGFF*ASRGLKQGDPLSPLLFILLTEAMSRGLKHLVSTNRVQHYALYP 193
            +S  S+L+NG     F  SRGL+QGDPLSP LF+ + + +SR +  +   + +    + P
Sbjct: 1221 TSSLSVLINGKPGPSFLPSRGLRQGDPLSPFLFLFVNDVLSRMINKMCQDSLLTPVTIGP 1280

Query: 194  GAPVITHLCFTDDLVIFTRANRRSVRELAAFLQIFEVASGQRINREKSDFILSRRCTTHH 373
                ++HL F DD + F RA  ++   L+  L  + +ASGQ IN EKS    S       
Sbjct: 1281 NNLPVSHLFFADDSLFFLRATLQNCETLSDLLHTYCIASGQLINVEKSSIFFSPNTPPEI 1340

Query: 374  SHILSQMLGI 403
            +H+LS ++ I
Sbjct: 1341 AHLLSSIMQI 1350



 Score = 75.5 bits (184), Expect(2) = 7e-31
 Identities = 61/244 (25%), Positives = 104/244 (42%), Gaps = 24/244 (9%)
 Frame = +3

Query: 420  PMPYLGCSLYQGRTRRLYFQPLLDKISHKLDAWKGQXXXXXXXXXXXXHVLQSLPVHTLA 599
            P  YLG   +  R+++     + D I  K+  WK               V  ++P + + 
Sbjct: 1356 PGTYLGLPTFWHRSKKKALGFIKDSILRKVKGWKQATLSQAGKEVLIKAVATAIPAYPMG 1415

Query: 600  AMNPPRSILRQLEGFFVRFFWGSQPEHTRQI-WRSWDNLAYPVQEGGLGLRKLDTVLEAF 776
                P ++ ++L G    F+WG+    TR I W+SWD LA P ++GG+G R L+    + 
Sbjct: 1416 CFKFPSTLCKELNGILADFWWGNVD--TRGIHWKSWDFLARPKKDGGMGFRNLEDFNNSL 1473

Query: 777  FAKLWWKIHHS-LGIWAVYVNGVTWKRSFYR------------SRLQSVKHFVDIHFRIC 917
             AK  W++H +   +WA  +  + + RS +             + L   ++F+       
Sbjct: 1474 LAKQAWRLHQNPFALWARVLEQLYYPRSSFLEAPKGPNPSWIWNSLLIGRNFIHKEALWN 1533

Query: 918  VGDG-SSNFWTANWLGTSPLLTDGVTPSNPDLTLKEACSQ---------EVWQEDLFAMD 1067
            +G+G S N    NW+ + P       PS   L L E  S+         + W+ D FA  
Sbjct: 1534 IGNGFSVNIVGDNWIPSIP-------PSTVSLPLNEDNSRVCELINWDTKSWELDHFAET 1586

Query: 1068 LSNT 1079
            ++ T
Sbjct: 1587 ITPT 1590


>emb|CAN67932.1| hypothetical protein VITISV_013913 [Vitis vinifera]
          Length = 2077

 Score = 81.3 bits (199), Expect(2) = 8e-30
 Identities = 53/156 (33%), Positives = 83/156 (53%)
 Frame = +2

Query: 2    GNLRSSHFSILVNGVSAGFF*ASRGLKQGDPLSPLLFILLTEAMSRGLKHLVSTNRVQHY 181
            G L S  F++LVNG + G+  ASRGL+QGDPLSP LF ++ + +SR L  +   N ++ +
Sbjct: 1006 GCLSSVSFAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKVEERNVLEGF 1065

Query: 182  ALYPGAPVITHLCFTDDLVIFTRANRRSVRELAAFLQIFEVASGQRINREKSDFILSRRC 361
             +      ++HL F +D + F+      +  L   L +F   SG ++N +KS+ I     
Sbjct: 1066 RVGRNRTRVSHLQFANDTIFFSSTREEDMMTLKNVLLVFGHISGLKVNLDKSN-IYGINL 1124

Query: 362  TTHHSHILSQMLGIKRTAFTNALFGMFLVSGPNKKA 469
              +H   L++ML  K  AF   +  + L  G N KA
Sbjct: 1125 EQNHLFRLAEMLDCK--AFGWPILYLCLPLGGNPKA 1158



 Score = 77.0 bits (188), Expect(2) = 8e-30
 Identities = 74/312 (23%), Positives = 113/312 (36%), Gaps = 30/312 (9%)
 Frame = +3

Query: 471  YFQPLLDKISHKLDAWKGQXXXXXXXXXXXXHVLQSLPVHTLAAMNPPRSILRQLEGFFV 650
            ++ P++++IS +LD W+                L  +P + L+      S+  ++E    
Sbjct: 1161 FWDPVIERISRRLDGWQMAYLSFGGRITLIQSCLTHMPCYFLSLFKISASVAAKIERMQR 1220

Query: 651  RFFWGSQPEHTRQIWRSWDNLAYPVQEGGLGLRKLDTVLEAFFAKLWWKIHHS------- 809
             F W S  E  R    +WD +  P   GGLG  K+     A   K  W+           
Sbjct: 1221 DFLWSSVGEGKRDHLVNWDVVCKPKSRGGLGFGKISVRNIALLGKWLWRYPREGSALWHQ 1280

Query: 810  --LGIWAVYVNG------VTWKRSFYRSRLQSVKHFVDIHFRICVGDGS-SNFWTANWLG 962
              L I+  + NG      V W        +  V        R  VGDG    FW   W G
Sbjct: 1281 VILSIYGSHSNGWDVNNTVRWSHRCPWKAIALVFQEFSKFTRFVVGDGDIIRFWEDLWWG 1340

Query: 963  TSPL------LTDGVTPSNPDLTLKEACSQEVWQEDLFAMDLSNTALQKI*EM-----HC 1109
              PL      L   VT  N  ++     +        F  +LS++ ++ +  +       
Sbjct: 1341 DQPLGVQYPRLLSIVTDKNAPISSILGYTHPFSWNFNFRRNLSDSEIEDLEGLMRSLDRL 1400

Query: 1110 HFSDD-QD*FIWDLTPXXXXXXXXXXXXXXXXKEINPFM--KWVWRSSVPLKMSILLWRV 1280
            H S    D   W L+P                 E  P    K+VW S VP K+   +W V
Sbjct: 1401 HISPSVPDKRSWSLSPSGLFVVKSFFLALSQYSESPPVFPTKFVWNSQVPFKVKSFVWLV 1460

Query: 1281 LHGLLSTDDILQ 1316
             H  ++T+D+LQ
Sbjct: 1461 AHKKVNTNDLLQ 1472


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score = 84.3 bits (207), Expect(2) = 2e-29
 Identities = 80/335 (23%), Positives = 126/335 (37%), Gaps = 22/335 (6%)
 Frame = +3

Query: 417  LPMPYLGCSLYQGRTRRLYFQPLLDKISHKLDAWKGQXXXXXXXXXXXXHVLQSLPVHTL 596
            LP+ YLG  L   R       PLL+++  ++ +W  +             VL S+    L
Sbjct: 767  LPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWL 826

Query: 597  AAMNPPRSILRQLEGFFVRFFWGSQPEHTRQIWRSWDNLAYPVQEGGLGLRKLDTVLEAF 776
            AA   PR  +R+LE     F W     ++ +   SW  +  P  EGGLGLR L    +  
Sbjct: 827  AAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVC 886

Query: 777  FAKLWWKI-HHSLGIWAVYV------NGVTW-------KRSFYRSRLQSVKHFVDIHFRI 914
              KL WKI  HS  +W  +V      N   W       + S+   +L   +       ++
Sbjct: 887  CLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKV 946

Query: 915  CVGDG-SSNFWTANWLGTSPLLTDGVTPSNPDLTLKEACS-QEVW---QEDLFAMDLSNT 1079
             VG+G  ++FW  NW     LL         DL +    + +E W   ++     D+ N 
Sbjct: 947  EVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNV 1006

Query: 1080 ALQKI*EMHCHFSDDQD*FIWDLTPXXXXXXXXXXXXXXXXKEIN---PFMKWVWRSSVP 1250
                + +     ++ +D  +W                    +  +   P+ K +W S   
Sbjct: 1007 IEDALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHAT 1066

Query: 1251 LKMSILLWRVLHGLLSTDDILQRMGFFLASKCPLC 1355
             K S   W   HG L T D +      +A+ C  C
Sbjct: 1067 PKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFC 1101



 Score = 72.8 bits (177), Expect(2) = 2e-29
 Identities = 43/127 (33%), Positives = 67/127 (52%), Gaps = 2/127 (1%)
 Frame = +2

Query: 8    LRSSHFSILVNGVSAGFF*ASRGLKQGDPLSPLLFILLTEAMSRGLKHLVSTNRVQHYAL 187
            + ++ FS+ VNG  AG+F +SRGL+QG  LSP LF++  + +S   K L      +H+  
Sbjct: 633  ITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLS---KMLDKAAAARHFGY 689

Query: 188  YPGAPV--ITHLCFTDDLVIFTRANRRSVRELAAFLQIFEVASGQRINREKSDFILSRRC 361
            +P      +THL F DDL++ +    RS+  +      F   SG RI+ EKS   L+   
Sbjct: 690  HPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLS 749

Query: 362  TTHHSHI 382
             T  + +
Sbjct: 750  ATARNEV 756


>gb|AAN04214.1| Putative retroelement [Oryza sativa Japonica Group]
            gi|31430304|gb|AAP52238.1| retrotransposon protein,
            putative, unclassified [Oryza sativa Japonica Group]
          Length = 764

 Score = 89.4 bits (220), Expect(2) = 2e-29
 Identities = 74/339 (21%), Positives = 132/339 (38%), Gaps = 23/339 (6%)
 Frame = +3

Query: 408  AQPLPMPYLGCSLYQGRTRRLYFQPLLDKISHKLDAWKGQXXXXXXXXXXXXHVLQSLPV 587
            A  +P  YLG  +   +          +KI  +L  WK                L S+P+
Sbjct: 280  AGKMPFTYLGIPISMNKLTNADLDIPPNKIEKRLATWKCGYLSYGGKAILINSCLSSIPL 339

Query: 588  HTLAAMNPPRSILRQLEGFFVRFFWGSQPEHTRQIWRSWDNLAYPVQEGGLGLRKLDTVL 767
            + +     P  +  +++    RFFW    +  +     W+ L  P + GGLG      + 
Sbjct: 340  YMMGVYLLPEGVHNKMDSIRARFFWEGLEKKRKYHMIKWEALCRPKEFGGLGFIDTRKMN 399

Query: 768  EAFFAKLWWKIHHS-------------LGIWAVYVNGVTWKRSFYRSRLQSVKHFVDIHF 908
             A   K  +++                +     +      + S +   L  VK ++D+  
Sbjct: 400  IALLCKWIYRLESGKEDPCCVLLRNKYMKDGGGFFQSKAEESSQFWKGLHEVKKWMDLGS 459

Query: 909  RICVGDG-SSNFWTANWLGTSPLLTDGVT----PSNPDLTLKEACSQEVWQEDLFAMDLS 1073
               VG+G ++NFW+  W+G +PL T         ++ + T+ + C +  W  +L    L 
Sbjct: 460  SYKVGNGKATNFWSDVWIGETPLKTQYPNIYRMCADKEKTVSQMCLEGDWYIEL-RRSLG 518

Query: 1074 NTALQKI*EMH-----CHFSDDQD*FIWDLTPXXXXXXXXXXXXXXXXKEINPFMKWVWR 1238
               L +  ++H      H  +++D  IW LT                    +  M+ +WR
Sbjct: 519  ERDLNEWNDLHNTPREIHLKEERDCIIWKLTKNGFYKAKTLYQALSFGGVKDKVMQDLWR 578

Query: 1239 SSVPLKMSILLWRVLHGLLSTDDILQRMGFFLASKCPLC 1355
            SS+PLK+ IL W +L G +     L++M +  +  C LC
Sbjct: 579  SSIPLKVKILFWLMLKGRIQAAGQLKKMKWSGSPNCKLC 617



 Score = 67.4 bits (163), Expect(2) = 2e-29
 Identities = 39/108 (36%), Positives = 62/108 (57%)
 Frame = +2

Query: 26  SILVNGVSAGFF*ASRGLKQGDPLSPLLFILLTEAMSRGLKHLVSTNRVQHYALYPGAPV 205
           ++ +NG    FF   RG++QGDPLSPLLF L+ +A+S  L +  +   V H  L PG   
Sbjct: 160 AVNINGEVKDFFKTYRGVRQGDPLSPLLFNLVADALSEMLNN--AKQAVPH--LVPGG-- 213

Query: 206 ITHLCFTDDLVIFTRANRRSVRELAAFLQIFEVASGQRINREKSDFIL 349
           +THL + DD ++F      ++  +   L  +E  SG +IN +KS+ ++
Sbjct: 214 LTHLQYADDTILFMTNTEENIVTVKFLLYCYEAMSGLKINYQKSEIMV 261

