BLASTX nr result

ID: Coptis24_contig00009157

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00009157
         (2753 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...   431   e-118
gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indi...   417   e-114
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   414   e-113
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   412   e-112
gb|ABA98491.1| retrotransposon protein, putative, unclassified [...   408   e-111

>emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1362

 Score =  431 bits (1107), Expect = e-118
 Identities = 251/781 (32%), Positives = 398/781 (50%), Gaps = 11/781 (1%)
 Frame = +1

Query: 4    RMRPILQRIIGPQQSAFLQGRSIHDNALLAHEAFHVIKNR-ANVNAKRFALKLDMHKAYD 180
            R++ IL  II P QSAF+  R I DNAL+A E FH +K + AN N    ALKLDM KAYD
Sbjct: 527  RLKVILPAIISPNQSAFVPRRLITDNALVAFEIFHAMKRKDANKNGV-CALKLDMSKAYD 585

Query: 181  KIEWPFLEEVLTRFGFGENWIRKVMFCVKSPTFSILLNGSTFGFFSPTRGLRQGDPMSPF 360
            ++EW FLE V+ + GF + WI +VM C+ S +F+  +NG   G  SP+RGLRQGDP+SP+
Sbjct: 586  RVEWCFLERVMKKMGFCDGWIDRVMACISSVSFTFNVNGVVEGSLSPSRGLRQGDPISPY 645

Query: 361  LFVLCAEVLSLNLNKLQLQGLVHGIKLSRNNSPITHLMYADDTIIFGKGSVQEADEISRC 540
            LF+LCA+  S  L+K   +  +HG ++ R    ++HL +ADD+I+F K SVQE   ++  
Sbjct: 646  LFLLCADAFSTLLSKAASEKKIHGAQICRGAPVVSHLFFADDSILFTKASVQECSMVADI 705

Query: 541  LDRYCEASGQSINKTKSQFLHHKGLPGTEIRIFHRIFGTPCTHNFPTYLGIPPVFNGKNK 720
            + +Y  ASGQ +N +K++ +  + +          + G         YLG+P +  G++K
Sbjct: 706  ISKYERASGQQVNLSKTEVVFSRSVDRERRSAIVNVLGVKEVDRQEKYLGLPTII-GRSK 764

Query: 721  RSHFLPLIQKFNNRLASWKASLLSKGGRLTLINSVLSALPTHIMACIQIPMNIIEDMDKI 900
            +  F  + ++   +L  WK  LLS+ G+  LI SV  A+PT++M+   +P  +I+++  +
Sbjct: 765  KVTFACIKERIWKKLQGWKEKLLSRPGKEVLIKSVAQAIPTYMMSVFSLPSGLIDEIHSL 824

Query: 901  RRNFLWAHEPGERKMHYFGWDLACTPIQFGGLGIKDLQMQNLALLGKKAWELC-NPRSLW 1077
               F W      RKMH+  WD  C P   GGLG +DL   N +LL K+AW LC   ++L 
Sbjct: 825  LARFWWGSSDTNRKMHWHSWDTLCYPKSMGGLGFRDLHCFNQSLLAKQAWRLCTGDQTLL 884

Query: 1078 ATQMKDKYFKNQHFMHAHLSTGASRTWRSMYYLRNMLVIGRRWKVGDGISIRAWRDNWIP 1257
               ++ +YFK+   + A      S TWRS++  +++L+ G +W VG G  IR W D WI 
Sbjct: 885  YRLLQARYFKSSELLEARRGYNPSFTWRSIWGSKSLLLEGLKWCVGSGERIRVWEDAWIL 944

Query: 1258 GTPIKMPLMSKPNGCFMERVCDFIDPVQRQWNTSLIREFWPDEYAKRIISIPLTITAKED 1437
            G    M    + +     +VCD ID  +  WN   +++ + +E  + ++SIPL+    +D
Sbjct: 945  GEGAHMVPTPQADSNLDLKVCDLIDVARGAWNIESVQQTFVEEEWELVLSIPLSRFLPDD 1004

Query: 1438 STFWWPNKYGRYDVKSAY------NMMQNYCDPNKEKVPIFSKIWKLNLPQKVRFFLWQL 1599
              +WWP++ G + V+S Y       +        + +  ++ ++W+L  P K+  FLW+ 
Sbjct: 1005 HRYWWPSRNGIFSVRSCYWLGRLGPVRTWQLQHGERETELWRRVWQLQGPPKLSHFLWRA 1064

Query: 1600 THQMIPTADFLSHRGFQVEARXXXXXXXXXXXKHLFFQCPFARAVWLGLGLSVSIAHFDF 1779
                +     L  R   V+A             H  F C FARA+W   G +  + +   
Sbjct: 1065 CKGSLAVKGRLFSRHISVDATCSVCGDPDESINHALFDCTFARAIWQVSGFASLMMNAPL 1124

Query: 1780 NQMSEGLIHMMKCLGSADGSRTWITTLGAGLWYIWLARNGKKF-TDTKQFPLASVIKAKR 1956
            +  SE L  + K       ++    T+ + +W  W  RN   F  +    PL +  +  +
Sbjct: 1125 SSFSERLEWLAK-----HATKEEFRTMCSFMWAGWFCRNKLIFENELSDAPLVAK-RFSK 1178

Query: 1957 MVVTMEQINGENKPNDHNLMEIRVKWDCPPRGWFKLNTDGSSHGNPGAAGAGCVLRGDNG 2136
            +V    +  G               W  PP G FK+N D     N G  G G V+R ++G
Sbjct: 1179 LVADYCEYAGSVFRGSGGGCGSSALWSPPPTGMFKVNFDAHLSPN-GEVGLGVVIRANDG 1237

Query: 2137 NF-ILAIAVPIAFESAFMAETVALRMGIKAVKELNLANVVVETDCRLLHQIVTNK-DSLA 2310
               +L +    A  +A MAE +A    ++    L    +V+E D  ++   V +K + +A
Sbjct: 1238 GIKMLGVKRVAARWTAVMAEAMAALFAVEVAHRLGFGRIVLEGDAMMVINAVKHKCEGVA 1297

Query: 2311 P 2313
            P
Sbjct: 1298 P 1298


>gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indica Group]
          Length = 1300

 Score =  417 bits (1073), Expect = e-114
 Identities = 264/829 (31%), Positives = 416/829 (50%), Gaps = 38/829 (4%)
 Frame = +1

Query: 4    RMRPILQRIIGPQQSAFLQGRSIHDNALLAHEAFHVIKNRANVNAKRFALKLDMHKAYDK 183
            RMRPIL  ++ P QSAF+ GR I DNALLA E FH I+   + N    A KLD+ KAYD+
Sbjct: 446  RMRPILDEVVSPNQSAFVPGRLITDNALLAFECFHFIQKNRSPNNAACAYKLDLSKAYDR 505

Query: 184  IEWPFLEEVLTRFGFGENWIRKVMFCVKSPTFSILLNGSTFGFFSPTRGLRQGDPMSPFL 363
            ++W FLE+ + + GF   W+  +M C+ +  ++I  NG+    F+PTRGLRQGDP+SPFL
Sbjct: 506  VDWRFLEQAMYKMGFAHRWVSWIMSCITTVRYAIKFNGTLLSTFAPTRGLRQGDPLSPFL 565

Query: 364  FVLCAEVLSLNLNKLQLQGLVHGIKLSRNNSPITHLMYADDTIIFGKGSVQEADEISRCL 543
            F+  A+ LSL L +   QG +  +++ R    I+HL++ADDT++F K + ++A+ I+  L
Sbjct: 566  FLFVADGLSLLLEEKVNQGAISPVRICRRAPGISHLLFADDTLLFLKSNKEQAEIINSVL 625

Query: 544  DRYCEASGQSINKTKSQFLHHKGLPGTE-------IRIFHRIFGTPCTHNFPTYLGIPPV 702
              Y  ++GQ +N +K   +  +  P +E       ++I + +F          YLG P  
Sbjct: 626  GDYAASTGQLVNPSKCSIMFGEATPLSEQTDIKATLQITNNVFE-------DKYLGFPTP 678

Query: 703  FNGKNKRSHFLPLIQKFNNRLASWKASLLSKGGRLTLINSVLSALPTHIMACIQIPMNII 882
             +G+  +  F  L ++   R+  W  + LS GG+  LI SV+ ALP ++M   ++P ++ 
Sbjct: 679  -DGRMHKGKFQSLHERVWKRIIQWGENFLSSGGKEVLIKSVMQALPVYVMGIFKLPDSVC 737

Query: 883  EDMDKIRRNFLWAHEPGERKMHYFGWDLACTPIQFGGLGIKDLQMQNLALLGKKAWELCN 1062
            ED+ K  RNF W    G+R+ H+  WD    P Q GG+G +D ++ N ALL +++W +  
Sbjct: 738  EDLSKAVRNFWWGAGDGKRRTHWRAWDSLTKPKQCGGMGFRDFRLFNQALLARQSWRILE 797

Query: 1063 -PRSLWATQMKDKYFKNQHFMHAHLSTGASRTWRSMYYLRNMLVIGRRWKVGDGISIRAW 1239
             P SL A  +K KYF N   +    S  AS  WR + Y   +L  G  W+VG+G +IR W
Sbjct: 798  FPESLCARVLKAKYFPNGSLIDTSFSGNASPGWRGIEYGLELLKQGIIWRVGNGRTIRIW 857

Query: 1240 RDNWIPGTPIKMPLMSKPNGCFMERVCDFIDPVQRQWNTSLIREFWPDEYAKRIISIPLT 1419
            RD WIP    + P+  K     ++ V + +D    +W++  I++ +     ++I+SI  +
Sbjct: 858  RDPWIPRDFSRRPITHKGTS-RVKWVSELLDQ-NGEWDSHKIQQIFLPIDVEKILSIHTS 915

Query: 1420 ITAKEDSTFWWPNKYGRYDVKSAY-------NMMQNYCDPNKEKVPIFSKIWKLNLPQKV 1578
               + D   W  +K GR+ V+SAY       N++ +     +E    ++++W  ++PQKV
Sbjct: 916  RFHENDFVAWHSDKLGRFSVRSAYHLALSLSNVVASSSSSGQELSKAWNQLWSCHVPQKV 975

Query: 1579 RFFLWQLTHQMIPTADFLSHRGFQVEARXXXXXXXXXXXKHLFFQCPFARAVWLGLGLSV 1758
            R F+W+     + T      +  +  +             H   +CP A+ +W  +  + 
Sbjct: 976  RIFIWRAASNSLATMVNKKKKRLEHCSMCSICGTEEEDVAHALCRCPHAKYLWEVMRRAK 1035

Query: 1759 SI-AHFDFNQMSEGLIHMMKCLGSADGSRTWITTLGAGLWYIWLARN----GK------- 1902
            +I    D N      I  +    S   S+    TL   LW IW  RN    GK       
Sbjct: 1036 AITVQADRNWTGADWIFDI----SERISKEERPTLLMMLWRIWYVRNEITHGKAAVPAEV 1091

Query: 1903 ----------KFTDTKQFPLASVIKAKRMVVTMEQINGENKPNDHNLMEIRVKWDCPPRG 2052
                         + +QFP A++ K K ++         N P    +  + V+W  P  G
Sbjct: 1092 SQRFISSYITSLLEIRQFPDANLCKGKHVIRCAAAGAQVNHP---RVNSVPVRWVRPQAG 1148

Query: 2053 WFKLNTDGSSHGNPGAAGAGCVLRGDNGNFILAIAVPIAFE-SAFMAETVALRMGIKAVK 2229
            W KLN DGS     G+ G G VLR   G  I A    +    SA  AE VA + GI    
Sbjct: 1149 WMKLNVDGSYDPRDGSGGIGAVLRNSEGKLIFAACGSMCRPVSALEAELVACKEGIILAL 1208

Query: 2230 ELNLANVVVETDCRLLHQIVTNKDSLAPAEVQELIMEIRDWMKDGNFIM 2376
            +     ++VETDC  L ++V  +  +  +++  LI EI+D +K    I+
Sbjct: 1209 QWTFLPIIVETDCLELVKLVAEQGKVM-SDLGFLIREIKDLVKGNREIV 1256


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  414 bits (1063), Expect = e-113
 Identities = 252/808 (31%), Positives = 400/808 (49%), Gaps = 23/808 (2%)
 Frame = +1

Query: 4    RMRPILQRIIGPQQSAFLQGRSIHDNALLAHEAFHVIKNRANVNAKRFALKLDMHKAYDK 183
            R++ IL  +I   Q+AF++GR I DN L+AHE  H + +    + +  A+K D+ KAYD+
Sbjct: 524  RLKKILPSLISETQAAFVKGRLISDNILIAHELLHALSSNNKCSEEFIAIKTDISKAYDR 583

Query: 184  IEWPFLEEVLTRFGFGENWIRKVMFCVKSPTFSILLNGSTFGFFSPTRGLRQGDPMSPFL 363
            +EWPFLE+ +   GF ++WIR +M CVKS  + +L+NG+  G   P+RGLRQGDP+SP+L
Sbjct: 584  VEWPFLEKAMRGLGFADHWIRLIMECVKSVRYQVLINGTPHGEIIPSRGLRQGDPLSPYL 643

Query: 364  FVLCAEVLSLNLNKLQLQGLVHGIKLSRNNSPITHLMYADDTIIFGKGSVQEADEISRCL 543
            FV+C E+L   L   + +  + G+K++R   PI+HL++ADD++ + K + +   +I R +
Sbjct: 644  FVICTEMLVKMLQSAEQKNQITGLKVARGAPPISHLLFADDSMFYCKVNDEALGQIIRII 703

Query: 544  DRYCEASGQSINKTKSQFLHHKGLPGTEIRIFHRIFGTPCTHNFPTYLGIPPVFNGKNKR 723
            + Y  ASGQ +N  KS     K +      +  R  G         YLG+P  F G +K 
Sbjct: 704  EEYSLASGQRVNYLKSSIYFGKHISEERRCLVKRKLGIEREGGEGVYLGLPESFQG-SKV 762

Query: 724  SHFLPLIQKFNNRLASWKASLLSKGGRLTLINSVLSALPTHIMACIQIPMNIIEDMDKIR 903
            +    L  +   ++  W+++ LS GG+  L+ +V  ALPT+ M+C +IP  I + ++ + 
Sbjct: 763  ATLSYLKDRLGKKVLGWQSNFLSPGGKEILLKAVAMALPTYTMSCFKIPKTICQQIESVM 822

Query: 904  RNFLWAHEPGERKMHYFGWDLACTPIQFGGLGIKDLQMQNLALLGKKAWELCNPR-SLWA 1080
              F W ++   R +H+  W     P   GGLG K+++  N+ALLGK+ W +   + SL A
Sbjct: 823  AEFWWKNKKEGRGLHWKAWCHLSRPKAVGGLGFKEIEAFNIALLGKQLWRMITEKDSLMA 882

Query: 1081 TQMKDKYFKNQHFMHAHLSTGASRTWRSMYYLRNMLVIGRRWKVGDGISIRAWRDNWIPG 1260
               K +YF     ++A L +  S  W+S+Y  + ++  G R  +G+G +I  W D WI  
Sbjct: 883  KVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLIKQGIRAVIGNGETINVWTDPWIGA 942

Query: 1261 TPIKMP-------LMSKPNGCFMERVCDFIDPVQRQWNTSLIREFWPDEYAKRIISIPLT 1419
             P K         L+S+     +  V D + P  R WN +L+   +PD   + I+++   
Sbjct: 943  KPAKAAQAVKRSHLVSQYAANSIHVVKDLLLPDGRDWNWNLVSLLFPDNTQENILALRPG 1002

Query: 1420 ITAKEDSTFWWPNKYGRYDVKSAYNMMQNYCDP--NKEKV------PIFSKIWKLNLPQK 1575
                 D   W  ++ G Y VKS Y +M    +   N ++V      PIF +IWKL++P K
Sbjct: 1003 GKETRDRFTWEYSRSGHYSVKSGYWVMTEIINQRNNPQEVLQPSLDPIFQQIWKLDVPPK 1062

Query: 1576 VRFFLWQLTHQMIPTADFLSHRGFQVEARXXXXXXXXXXXKHLFFQCPFARAVWLGLGLS 1755
            +  FLW+  +  +  A  L++R    E              HL F+CPFAR  W    L 
Sbjct: 1063 IHHFLWRCVNNCLSVASNLAYRHLAREKSCVRCPSHGETVNHLLFKCPFARLTWAISPLP 1122

Query: 1756 VSIAHFDFNQMSEGLIHMMKCLGSADGSRTWITTLGAGLWYIWLARNGKKFTDTKQFPLA 1935
                      +   + H++    S          +   LW +W  RN   F   ++F   
Sbjct: 1123 APPGGEWAESLFRNMHHVLSVHKSQPEESDHHALIPWILWRLWKNRNDLVFKG-REFTAP 1181

Query: 1936 SVIKAKRMVVTMEQINGENKPNDHNLMEIR---VKWDCPPRGWFKLNTDGSSHGNPGAAG 2106
             VI   +    M+  N   +P        R   VKW  P  GW K NTDG+   + G  G
Sbjct: 1182 QVI--LKATEDMDAWNNRKEPQPQVTSSTRDRCVKWQPPSHGWVKCNTDGAWSKDLGNCG 1239

Query: 2107 AGCVLRGDNGNFI-LAIAVPIAFESAFMAETVALRMGIKAVKELNLANVVVETDCRLLHQ 2283
             G VLR   G  + L +    + +S    E  ALR  + ++   N   V+ E+D + L  
Sbjct: 1240 VGWVLRNHTGRLLWLGLRALPSQQSVLETEVEALRWAVLSLSRFNYRRVIFESDSQYLVS 1299

Query: 2284 IVTNK---DSLAPAEVQELIMEIRDWMK 2358
            ++ N+    SLAP      I +IR+ ++
Sbjct: 1300 LIQNEMDIPSLAPR-----IQDIRNLLR 1322


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  412 bits (1059), Expect = e-112
 Identities = 252/770 (32%), Positives = 390/770 (50%), Gaps = 5/770 (0%)
 Frame = +1

Query: 4    RMRPILQRIIGPQQSAFLQGRSIHDNALLAHEAFHVIKNRANVNAKRFALKLDMHKAYDK 183
            R++  L R++   QSAF+ GR I DNAL+A E FH +K+R        A+KLDM KAYD+
Sbjct: 528  RLKDFLPRLVSENQSAFVPGRLITDNALIAMEVFHSMKHRNRSRKGTIAMKLDMSKAYDR 587

Query: 184  IEWPFLEEVLTRFGFGENWIRKVMFCVKSPTFSILLNGSTFGFFSPTRGLRQGDPMSPFL 363
            +EW FL ++L   GF   W+  +M CV S ++S ++NG   G  +P RGLR GDP+SP+L
Sbjct: 588  VEWGFLRKLLLTMGFDGRWVNLIMSCVSSVSYSFIINGGVCGSVTPARGLRHGDPLSPYL 647

Query: 364  FVLCAEVLSLNLNKLQLQGLVHGIKLSRNNSPITHLMYADDTIIFGKGSVQEADEISRCL 543
            F+L A+  S  + K   +  +HG K SR+   I+HL +AD +++F + S QE   I   L
Sbjct: 648  FILIADAFSKMIQKKVQEKQLHGAKASRSGPVISHLFFADVSLLFTRASRQECAIIVEIL 707

Query: 544  DRYCEASGQSINKTKSQFLHHKGLPGTEIRIFHRIFGTPCTHNFPTYLGIPPVFNGKNKR 723
            + Y +ASGQ IN  KS+    KG+   +      I           YLGIP +  G+++ 
Sbjct: 708  NLYEQASGQKINYDKSEVSFSKGVSIAQKEELSNILQMKQVERHMKYLGIPSI-TGRSRT 766

Query: 724  SHFLPLIQKFNNRLASWKASLLSKGGRLTLINSVLSALPTHIMACIQIPMNIIEDMDKIR 903
            + F  L+ +   +L  WK  LLS+ G+  L+ SV+ A+PT++M   ++P +II+ +    
Sbjct: 767  AIFDSLMDRIWKKLQGWKEKLLSRAGKEILLKSVIQAIPTYLMGVYKLPCSIIQKIHSAM 826

Query: 904  RNFLWAHEPGERKMHYFGWDLACTPIQFGGLGIKDLQMQNLALLGKKAWELC-NPRSLWA 1080
              F W     +R++H+  WD  CT   FGG+G +DL++ N ALLG++AW L   P SL A
Sbjct: 827  ARFWWGSSDTQRRIHWKNWDSLCTLKCFGGMGFRDLRVFNDALLGRQAWRLVREPHSLLA 886

Query: 1081 TQMKDKYFKNQHFMHAHLSTGASRTWRSMYYLRNMLVIGRRWKVGDGISIRAWRDNWIPG 1260
              MK KY+ N  F+ A L    S +WRS++  + +L  G  W++G+G ++R W D W+  
Sbjct: 887  RVMKAKYYSNHDFLDAPLGVSTSYSWRSIWSSKALLKEGMVWRIGNGTNVRIWEDPWVL- 945

Query: 1261 TPIKMPLMSKPNGCFMERVCDFIDPVQRQWNTSLIREFWPDEYAKRIISIPLTITAKEDS 1440
              +   + S+ +G  +  V + ID  + +W  SLI   + +   K I+SIPL+    +D 
Sbjct: 946  DELGRFITSEKHG-NLNMVSELIDFDRMEWKVSLIETVFNERDIKCILSIPLSSLPLKDE 1004

Query: 1441 TFWWPNKYGRYDVKSAYNMMQNYCDPNKEKVPIFSKIWKLNLPQKVRFFLWQLTHQMIPT 1620
              W   K   Y VK+AY + +     +  +  I   IW + +  KV+ FLW+L    +P 
Sbjct: 1005 LTWAFTKNAHYSVKTAYMLGKGGNLDSFHQAWI--DIWSMEVSPKVKHFLWRLGTNTLPV 1062

Query: 1621 ADFLSHRGFQVEARXXXXXXXXXXXKHLFFQCPFARAVWLGLGLSVSIAHFDFNQMSEGL 1800
               L HR    +              H  F CPF R +W+  G     A      M+E L
Sbjct: 1063 RSLLKHRHMLDDDLCPRGCGEPESQFHAIFGCPFIRDLWVDSGCDNFRALTTDTAMTEAL 1122

Query: 1801 IHMMKCLGSADGSRTWITTLGAGL-WYIWLARNGKKFTDTKQFPLASVIKAKRMVVTMEQ 1977
            +       ++ G    + T GA + W +W  RN   F  +   P   + +  R+V     
Sbjct: 1123 V-------NSHGLDASVRTKGAFMAWVLWSERNSIVFNQSSTPPHILLARVSRLVEEHGT 1175

Query: 1978 INGENKPNDH--NLMEIRVKWDCPPRGWFKLNTDGSSHGNPGAAGAGCVLRGDNGNFILA 2151
                  PN +   +   RV W  PP    KLN D +S  + G  G   + R  +G  + A
Sbjct: 1176 YTARIYPNRNCCAIPSARV-WAAPPPEVIKLNVD-ASLASAGWVGLSVIARDSHGTVLFA 1233

Query: 2152 IAVPIAFE-SAFMAETVALRMGIKAVKELNLANVVVETDCRLLHQIVTNK 2298
                +  + SA +AE  A+ M ++  +    A ++VE+DC    Q+V N+
Sbjct: 1234 AVRKVRAQWSAEIAEAKAIEMALRLGRRYGFAAIIVESDC----QVVVNR 1279


>gb|ABA98491.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1621

 Score =  408 bits (1049), Expect = e-111
 Identities = 256/797 (32%), Positives = 399/797 (50%), Gaps = 16/797 (2%)
 Frame = +1

Query: 4    RMRPILQRIIGPQQSAFLQGRSIHDNALLAHEAFHVIKNRANVNAKRFALKLDMHKAYDK 183
            R++ IL  +I P QSAF+ GR I DN L+A E  H ++N+ +      A KLDM KAYD+
Sbjct: 780  RLKKILPDVISPAQSAFVPGRLISDNILIADEMTHYMRNKRSGQVGYAAFKLDMSKAYDR 839

Query: 184  IEWPFLEEVLTRFGFGENWIRKVMFCVKSPTFSILLNGSTFGFFSPTRGLRQGDPMSPFL 363
            +EW FL +++ + GF  +W+  +M CV + T+ I +NG     FSP RGLRQGDP+SP+L
Sbjct: 840  VEWSFLHDMILKLGFHTDWVNLIMKCVSTVTYRIRVNGELSESFSPGRGLRQGDPLSPYL 899

Query: 364  FVLCAEVLSLNLNKLQLQGLVHGIKLSRNNSPITHLMYADDTIIFGKGSVQEADEISRCL 543
            F+LCAE  S  L+K + +G +HGI++ +    ++HL++ADD++I  + +  EA ++   L
Sbjct: 900  FLLCAEGFSALLSKTEEEGRLHGIRICQGAPSVSHLLFADDSLILCRANGGEAQQLQTIL 959

Query: 544  DRYCEASGQSINKTKSQFLHHKGLPGTEIRIFHRIFGTPCTHNFPTYLGIPPVFNGKNKR 723
              Y E SGQ INK KS  +        E R                YLG+ PVF G+++ 
Sbjct: 960  QIYEECSGQVINKDKSAVMFSPNTSSLEKRAVMAALNMQRETTNERYLGL-PVFVGRSRT 1018

Query: 724  SHFLPLIQKFNNRLASWKASLLSKGGRLTLINSVLSALPTHIMACIQIPMNIIEDMDKIR 903
              F  L ++   R+  WK  LLS+ G+  LI +V  A+PT  M C ++  ++ + + K+ 
Sbjct: 1019 KIFSYLKERIWQRIQGWKEKLLSRAGKEILIKAVAQAIPTFAMGCFELTKDLCDQISKMI 1078

Query: 904  RNFLWAHEPGERKMHYFGWDLACTPIQFGGLGIKDLQMQNLALLGKKAWELC-NPRSLWA 1080
              + W+++  + KMH+  W+    P   GGLG +D+ + NLA+L K+ W L  +P SL +
Sbjct: 1079 AKYWWSNQEKDNKMHWLSWNKLTLPKNMGGLGFRDIYIFNLAMLAKQGWRLIQDPDSLCS 1138

Query: 1081 TQMKDKYFKNQHFMHAHLSTGASRTWRSMYYLRNMLVIGRRWKVGDGISIRAWRDNWIPG 1260
              ++ KYF          ++  S TWRS+     +L  G  W+VGDG  I  W D WIP 
Sbjct: 1139 RVLRAKYFPLGDCFRPKQTSNVSYTWRSIQKGLRVLQNGMIWRVGDGSKINIWADPWIPR 1198

Query: 1261 TPIKMPLMSKPNGC-FMERVCDFIDPVQRQWNTSLIRE-FWPDEYAKRIISIPLTITAKE 1434
               + P+   P G   + +V + IDP    W+  L+ + FW ++ A  I SIP+ +   E
Sbjct: 1199 GWSRKPM--TPRGANLVTKVEELIDPYTGTWDEDLLSQTFWEEDVA-AIKSIPVHV-EME 1254

Query: 1435 DSTFWWPNKYGRYDVKSAYNMM--------QNYCD--PNKEK--VPIFSKIWKLNLPQKV 1578
            D   W  +  G + VKSAY +         +N C    N E      + K+WKL +P K+
Sbjct: 1255 DVLAWHFDARGCFTVKSAYKVQREMERRASRNGCPGVSNWESGDDDFWKKLWKLGVPGKI 1314

Query: 1579 RFFLWQLTHQMIPTADFLSHRGFQVEARXXXXXXXXXXXKHLFFQCPFARAVWLGLGLSV 1758
            + FLW++ H  +     L HRG  V+ R            HLFF+C   + VW  L L  
Sbjct: 1315 KHFLWRMCHNTLALRANLHHRGMDVDTRCVMCGRYNEDAGHLFFKCKPVKKVWQALNLEE 1374

Query: 1759 SIAHFDFNQMSEGLIHMMKCLGSADGSRTWITTLGAGLWYIWLARNGKKFTDTKQFPLAS 1938
              +  +     + ++  + C    + +   +      LW  W  RN  +     + P  +
Sbjct: 1375 LRSMLEQQTSGKNVLQSIYCRPENERTSAIVC-----LWQWWKERNEVREGGIPRSP--A 1427

Query: 1939 VIKAKRMVVTMEQINGENKPNDHNLMEIRVKWDCPPRGWFKLNTDGSSHGNPGAAGAGCV 2118
             +    M    E +    K       E  V W  PP  + K+NTDG+   N    G G V
Sbjct: 1428 ELSHLIMSQAGEFVRMNVKEKSPRTGECAV-WRRPPLNFVKINTDGAYSSNMKQGGWGFV 1486

Query: 2119 LRGDNGNFILAIAVPIAF-ESAFMAETVALRMGIKAVKELNLANVVVETDCRLLHQIVTN 2295
            ++   G  + A A P A+ + AF AE VA    IK   E  ++ + +ETD  +L   + +
Sbjct: 1487 IKDQTGAVLQAGAGPAAYLQDAFHAEVVACAAAIKTASERGMSRIELETDSMMLRYAIQD 1546

Query: 2296 KDSLAPAEVQELIMEIR 2346
             +S   + +  +I+EI+
Sbjct: 1547 -NSFNLSSLGGVILEIK 1562

