BLASTX nr result

ID: Catharanthus22_contig00005008 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00005008
         (5228 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       179   3e-59
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   201   6e-57
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               189   5e-56
gb|AAD15471.1| putative non-LTR retroelement reverse transcripta...   190   2e-51
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   194   4e-46
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                194   4e-46
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               191   2e-45
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   189   9e-45
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   189   1e-44
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   186   1e-43
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   184   5e-43
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           184   5e-43
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   184   5e-43
gb|EMJ14085.1| hypothetical protein PRUPE_ppa021750mg, partial [...   127   3e-42
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   180   7e-42
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   179   9e-42
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...   122   9e-41
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   170   5e-39
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   141   8e-39
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   169   1e-38

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  179 bits (455), Expect(2) = 3e-59
 Identities = 110/300 (36%), Positives = 169/300 (56%), Gaps = 4/300 (1%)
 Frame = -2

Query: 1531 LKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSA 1352
            LKQ N   + +IPK  + + T  DFR IS  N  YKVI+  ++ RL  +L  +I +AQSA
Sbjct: 500  LKQWNATTIVLIPKIVNPTCT-SDFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQSA 558

Query: 1351 FVKGRSMVENIHLVEEIMRVYT----KDENITKVHFED*SKESL*LHFFGIIAIGPLCLR 1184
            F+ GRS+ EN+ L  +++  Y         + KV  +  + +S+   F  I A+  L + 
Sbjct: 559  FLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVDLKK-AFDSVRWEFV-IAALRALAIP 616

Query: 1183 FSRKVYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQ 1004
              + + ++   + T T +++SING N GFFK  +GLRQGDP+SP+LF++ ME  S  L+ 
Sbjct: 617  -EKFINWISQCISTPT-FTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHS 674

Query: 1003 ATPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLK 824
                   +YHPK   L + +L FADD+MIF  G   S+  + + L  F   S  K N  K
Sbjct: 675  RYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDK 734

Query: 823  HSIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGW 644
              ++LAG+N  E +  A   GF  G++P RYLG+ L    L++A++ PLL+K++   + W
Sbjct: 735  SHLYLAGLNQLESNANAAY-GFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSW 793



 Score = 80.1 bits (196), Expect(2) = 3e-59
 Identities = 48/148 (32%), Positives = 79/148 (53%), Gaps = 2/148 (1%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +  F  K++  +  +GD  TK FH +    +  N I+++ + +G    S + I D    +
Sbjct: 349  ESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASY 408

Query: 1797 YKGLLGKKQDVIPIETSVMDSGL--KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPD 1624
            +  LLG + D   +E + M+  L  + S  Q   L   F  E+I+ ALF +  +KS  PD
Sbjct: 409  FGSLLGDEVDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPD 468

Query: 1623 GYTSKFFKQAWNIVGGSFCEAVLEFFSS 1540
            G+T++FF  +W+IVG    +A+ EFFSS
Sbjct: 469  GFTAEFFIDSWSIVGAEVTDAIKEFFSS 496


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  201 bits (512), Expect(2) = 6e-57
 Identities = 119/303 (39%), Positives = 179/303 (59%), Gaps = 7/303 (2%)
 Frame = -2

Query: 1531 LKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSA 1352
            L + N  I+ ++PK  + + T+ DFR IS CN FYK+I+  +++RL   L  ++  +QS 
Sbjct: 327  LMELNSTIITLVPKVANPT-TMSDFRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQST 385

Query: 1351 FVKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SKESL*LHFFGIIAIGPLCLRFSR 1175
            F+ GR + +NI L +EI+  Y K +   +  F  D  K +  + +  IIA       F+ 
Sbjct: 386  FIPGRRIGDNILLAQEIICDYHKADGQPRCTFMVDMMKANDTVEWDFIIAT---LQAFNI 442

Query: 1174 KVYYVGY--GLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLS----RK 1013
                +G+     ++  +S+ +NGE  GFF  +RGLRQGDP+SP+LF+I ME LS    R+
Sbjct: 443  PSTLIGWIKSCISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRR 502

Query: 1012 LNQATPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKAN 833
            +N +    CF YH +C++L L +L FADDL++F  GD  SV+T+HD   +F  +SS KAN
Sbjct: 503  INCSP---CFRYHWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKAN 559

Query: 832  CLKHSIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTI 653
              +  IFLAGV+ +    +  +T FS G+ P RYLGI L    L++ D  PLLD++   I
Sbjct: 560  VSESKIFLAGVDGNSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRI 619

Query: 652  KGW 644
            K W
Sbjct: 620  KSW 622



 Score = 50.4 bits (119), Expect(2) = 6e-57
 Identities = 35/146 (23%), Positives = 59/146 (40%), Gaps = 1/146 (0%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +K+   K++  +L +GDK +  F   + ++  RN IA++   DG                
Sbjct: 219  EKLLKKKSRVQWLKKGDKNSTFFFKTMTKHRNRNRIATINRSDGPDLAK----------- 267

Query: 1797 YKGLLGKKQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGY 1618
                                           SL  +F  ++I+   F +  +KSP PDG+
Sbjct: 268  -------------------------------SLCNEFTHDDIRAVFFSMNPNKSPGPDGF 296

Query: 1617 TSKFFKQAWNIVGGS-FCEAVLEFFS 1543
               FF++AW ++G +    AV EFFS
Sbjct: 297  NGCFFQKAWLVIGDNVVAAAVKEFFS 322


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  189 bits (481), Expect(2) = 5e-56
 Identities = 113/298 (37%), Positives = 172/298 (57%), Gaps = 2/298 (0%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK  S   + D+R IS CNV YKVIS  +++RL  +LP  I   QS+F
Sbjct: 60   KGVNSTILALIPKKLESKE-MKDYRPISCCNVMYKVISKILANRLKLLLPQFIAGNQSSF 118

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            VK R ++EN+ L  ++++ Y KD    +   + D SK S  + +  +I    L      +
Sbjct: 119  VKDRLLIENVLLATDLVKDYHKDSISERCAIKIDISKASDSVQWSFLI--NTLTAMHFPE 176

Query: 1171 VYYVGYGLCTTT-SYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATP 995
            ++     LC TT S+S+ +NGE  GFF+  RGLRQG  +SP+LF+ICM+ LS+ L++   
Sbjct: 177  MFIHWIRLCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVG 236

Query: 994  EDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSI 815
                 YHP C+++ L +L+FADDLMI T G   S++ + +V   F   S  K +  K +I
Sbjct: 237  IGRIGYHPHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTI 296

Query: 814  FLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            F AG++ + ++ + T   F  G +P RYLG+ L    L   D+ PL++++   I  WS
Sbjct: 297  FSAGLSSTSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWS 354



 Score = 59.3 bits (142), Expect(2) = 5e-56
 Identities = 25/46 (54%), Positives = 35/46 (76%)
 Frame = -3

Query: 1680 EEIKEALFDIGDDKSPRPDGYTSKFFKQAWNIVGGSFCEAVLEFFS 1543
            EEIK+ LF + +DKSP PDG+TS+FFK++W I+G  F  A+  FF+
Sbjct: 9    EEIKKVLFSMPNDKSPGPDGFTSEFFKESWEILGPEFILAIQSFFA 54


>gb|AAD15471.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1277

 Score =  190 bits (482), Expect(2) = 2e-51
 Identities = 116/297 (39%), Positives = 172/297 (57%), Gaps = 4/297 (1%)
 Frame = -2

Query: 1519 NHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAFVKG 1340
            N  I+A+IPKK+ ++  + D+R IS CNV YKVIS  I++RL  +LP+ I   QSAFV+ 
Sbjct: 652  NATILALIPKKDEAT-LMRDYRPISCCNVIYKVISKIIANRLKVMLPTFILQNQSAFVRE 710

Query: 1339 RSMVENIHLVEEIMRVYTKDENITKVHFE-D*SK--ESL*LHFFGIIAIGPL-CLRFSRK 1172
            R ++EN+ L  E+++ Y KD    +   + D SK  +S+   F     +  L  L+F  K
Sbjct: 711  RLLMENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFL----LNTLEALKFPEK 766

Query: 1171 VYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
              +      +T ++S+ +N E  GFF  +RGLRQG  +SP+LF+ICM  LS  ++ A   
Sbjct: 767  FRHWIKLCISTATFSVQVNSEQAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVH 826

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
                YHPKC+KL L +L FADDLM+F  G   SV+ V ++ K F   S    +  K +++
Sbjct: 827  RNIGYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKDFAGKSGLHISLEKSTLY 886

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            LA V+   ++ I +   F+ G +P RYLG  L    +  AD+ PLLDKV + I  W+
Sbjct: 887  LAEVSELNRNNILSAFPFASGQLPVRYLGFPLLTKQMTTADYSPLLDKVRSKISSWT 943



 Score = 43.5 bits (101), Expect(2) = 2e-51
 Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 3/118 (2%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            K+K  ++  GD+    FH   +    +N I  +   +G    + +EI  +   F++  L 
Sbjct: 532  KSKLHWMKVGDRNNSYFHKAAQVRRMQNSIREIQGPNGVVLQTSEEIKGEAERFFQEFLN 591

Query: 1779 KKQDV---IPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYT 1615
             +      + +E        + S+     L ++   EEI++ LF +  +KSP PDGYT
Sbjct: 592  HQPSDFQGMTVEELQNLMSFRCSATDQDMLTREVTSEEIQKVLFAMPSNKSPGPDGYT 649


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  194 bits (493), Expect = 4e-46
 Identities = 115/297 (38%), Positives = 175/297 (58%), Gaps = 1/297 (0%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK  +   + D+R IS CNV YKVIS  I++RL  +LP  I   QSAF
Sbjct: 507  KGINSTILALIPKKTEARE-MKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAF 565

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            VK R ++EN+ L  E+++ Y KD   T+   + D SK    + +  +I +  + L F R+
Sbjct: 566  VKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTI-LGFPRE 624

Query: 1171 VYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
              +      TT S+S+ +NGE  G+F+  RGLRQG  +SP+LF+ICM+ LS+ L++A   
Sbjct: 625  FIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAA 684

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
              F YHPKC+ + L +L+FADDLM+ + G   S++ +  V   F   S  + +  K +++
Sbjct: 685  RHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVY 744

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            LAG++ + ++ +A    FS G +P RYLG+ L    L   D  PLL++V   I  W+
Sbjct: 745  LAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWT 801



 Score = 94.0 bits (232), Expect = 7e-16
 Identities = 74/286 (25%), Positives = 115/286 (40%), Gaps = 14/286 (4%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            K+K  +   GD+ TK FH         N I  +   DG   T  DEI  +   F++  L 
Sbjct: 360  KSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFLQ 419

Query: 1779 ------------KKQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKS 1636
                        + Q ++P+  S  D           SL++    EEI++ LF +  DKS
Sbjct: 420  LIPNDFEGVTITELQQLLPVRCSDADQQ---------SLIRPVTAEEIRKVLFRMPSDKS 470

Query: 1635 PRPDGYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVL--WV 1462
            P PDGYTS+FFK  W I+G  F  AV  FF+                P+++    +  + 
Sbjct: 471  PGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYR 530

Query: 1461 ILDIFHXXXXXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*GFIQR 1282
             +   +             L  ++  F+     +     K+ +     +L   L     +
Sbjct: 531  PISCCNVLYKVISKIIANRLKLVLPKFIAG---NQSAFVKDRLLIENLLLATELVKDYHK 587

Query: 1281 TRTSPKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
               S +C +KID+ KA+DS+    L  V   L FP  FI W+  C+
Sbjct: 588  DTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICI 633


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  194 bits (493), Expect = 4e-46
 Identities = 118/300 (39%), Positives = 174/300 (58%), Gaps = 4/300 (1%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK+ ++  + D+R IS CNV YKVIS  I++RL  +LP+ I   QSAF
Sbjct: 80   KGLNATILALIPKKDEAT-LMRDYRPISCCNVIYKVISKIIANRLKVMLPTFILQNQSAF 138

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SK--ESL*LHFFGIIAIGPL-CLRF 1181
            V+ R ++EN+ L  E+++ Y KD    +   + D SK  +S+   F     +  L  L F
Sbjct: 139  VRERLLIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFL----LNTLEALNF 194

Query: 1180 SRKVYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQA 1001
                 +      +T ++S+ +NGE  GFF  +RGLRQG  +SP+LF+ICM  LS  ++ A
Sbjct: 195  PENFCHWIKLCISTATFSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVA 254

Query: 1000 TPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKH 821
                   YHPKC+KL L +L FADDLM+F  G   SV+ V ++ K F   S    +  K 
Sbjct: 255  AVHRNIGYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKS 314

Query: 820  SIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            +++LAGV+   ++ I +   F+ G +P RYLG+ L    +  AD+ PLLDKV + I  W+
Sbjct: 315  TLYLAGVSELNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWT 374



 Score = 76.3 bits (186), Expect = 1e-10
 Identities = 50/187 (26%), Positives = 83/187 (44%), Gaps = 1/187 (0%)
 Frame = -3

Query: 1701 LVQDFIVEEIKEALFDIGDDKSPRPDGYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXX 1522
            L ++   EE ++ LF +  +K P PDGYTS+FFK  W+I G  F  A+  FF        
Sbjct: 22   LTREVTSEENQKVLFAMPSNKFPGPDGYTSEFFKATWSITGQDFIAAIKSFFIKGFLPKG 81

Query: 1521 XXXXXXX*SPRRSILVVLWVILDIFHXXXXXXXXXXXVYLLDLVRSFLVSLTMHNR-LL* 1345
                     P++    ++     I               + + ++  L +  + N+    
Sbjct: 82   LNATILALIPKKDEATLMRDYRPI--SCCNVIYKVISKIIANRLKVMLPTFILQNQSAFV 139

Query: 1344 KEEVWWRIFIL*KRL*GFIQRTRTSPKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFI 1165
            +E +     +L   L     +   SP+C +KID+ KA+DS+  + L   L AL+FPE F 
Sbjct: 140  RERLLIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFC 199

Query: 1164 MWVMACV 1144
             W+  C+
Sbjct: 200  HWIKLCI 206


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  191 bits (486), Expect = 2e-45
 Identities = 111/297 (37%), Positives = 175/297 (58%), Gaps = 1/297 (0%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK  ++  + D+R IS CNV YKVIS  I++RL  +LP  I   QSAF
Sbjct: 154  KGINSIILALIPKKL-AAKEMRDYRPISCCNVLYKVISKIIANRLKLLLPRFIAENQSAF 212

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFED*SKESL*LHFFGIIAIGPLCLRFSRKV 1169
            VK R ++EN+ L  E+++ Y KD    +   +    ++     +  +    + + FS   
Sbjct: 213  VKDRLLIENLLLATELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFS-PT 271

Query: 1168 YYVGYGLC-TTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
            +     LC TT S+S+ +NG+ +G+F+ +RGLRQG  +SP+LF+ICM+ LS+ L++A   
Sbjct: 272  FIHWINLCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGV 331

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
              F +HPKC++L L +L+FADDLM+ + G   S++ + +V   F   S  + +  K +++
Sbjct: 332  RKFGFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLY 391

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            +AGV+   K  IA    F  G +P RYLG+ L    L  AD+ PLL+++   I  W+
Sbjct: 392  MAGVSPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWT 448



 Score = 79.3 bits (194), Expect = 2e-11
 Identities = 68/266 (25%), Positives = 109/266 (40%), Gaps = 8/266 (3%)
 Frame = -3

Query: 1917 KLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLGK-KQDVIPIETSVM 1741
            K FH  V     +N+I  +   DG      D+I  +   F+K  L    +D + +E   +
Sbjct: 22   KTFHRAVIERETKNMIKEIYCTDGRVVQG-DDIMVEAEKFFKEFLQLIPEDFVGVEVREL 80

Query: 1740 DSGLKISSLQA*S--LVQDFIVEEIKEALFDIGDDKSPRPDGYTSKFFKQAWNIVGGSFC 1567
               L+     + +  L ++   EEIK  LF +  DKSP PDGYTS+F+K  W+I+G  F 
Sbjct: 81   QDLLQFRCTNSDNEMLTREVSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFT 140

Query: 1566 EAVLEFFSSREXXXXXXXXXXX*SPRRSILVVL-----WVILDIFHXXXXXXXXXXXVYL 1402
              V  FF                 P++     +         ++ +             L
Sbjct: 141  LPVQSFFQKGFLPKGINSIILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLL 200

Query: 1401 LDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*GFIQRTRTSPKCTLKIDLKKAYDSI 1222
            L    +   S  + +RLL +        +L   L     +   S +C +KID+ KA+DS+
Sbjct: 201  LPRFIAENQSAFVKDRLLIEN------LLLATELVKDYHKDSISARCAIKIDISKAFDSV 254

Query: 1221 SLELLQ*VLYALDFPERFIMWVMACV 1144
                L   L A++F   FI W+  C+
Sbjct: 255  QWSFLTNTLVAMNFSPTFIHWINLCI 280


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  189 bits (481), Expect = 9e-45
 Identities = 116/300 (38%), Positives = 173/300 (57%), Gaps = 4/300 (1%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK+ +   + D+R IS CNV YKVIS  +++RL  +LPS I   QSAF
Sbjct: 804  KGLNATILALIPKKDEAIE-MKDYRPISCCNVLYKVISKILANRLKLLLPSFILQNQSAF 862

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SK--ESL*LHFFGIIAIGPLCLRFS 1178
            VK R ++EN+ L  E+++ Y K+    +   + D SK  +S+   F     +  L     
Sbjct: 863  VKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFL----LNTLEALNF 918

Query: 1177 RKVYYVGYGLC-TTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQA 1001
             + +     LC +T ++S+ +NGE  GFF   RGLRQG  +SP+LF+ICM  LS  +++A
Sbjct: 919  PETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEA 978

Query: 1000 TPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKH 821
                   YHPKCEK+ L +L FADDLM+F  G   S++ V +V K F   S  + +  K 
Sbjct: 979  AVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKS 1038

Query: 820  SIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            +I+LAGV+ S++    +   F+ G +P RYLG+ L    +  AD+ PL++ V   I  W+
Sbjct: 1039 TIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWT 1098



 Score = 90.5 bits (223), Expect = 7e-15
 Identities = 69/276 (25%), Positives = 116/276 (42%), Gaps = 4/276 (1%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            K+K  ++  GD     FH   +    RN I  +   +     + +EI  +   F+   L 
Sbjct: 657  KSKLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLN 716

Query: 1779 KKQDV---IPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSK 1609
            ++      I +E        + S      L ++   EEI++ LF + ++KSP PDGYTS+
Sbjct: 717  RQSGDFHGISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSE 776

Query: 1608 FFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVLWVILDIFHXXXXX 1429
            FFK  W++ G  F  A+  FF                 P++   + +     I       
Sbjct: 777  FFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPI--SCCNV 834

Query: 1428 XXXXXXVYLLDLVRSFLVSLTMHNR-LL*KEEVWWRIFIL*KRL*GFIQRTRTSPKCTLK 1252
                    L + ++  L S  + N+    KE +     +L   L     +   +P+C +K
Sbjct: 835  LYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMK 894

Query: 1251 IDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            ID+ KA+DS+  + L   L AL+FPE F  W+  C+
Sbjct: 895  IDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCI 930


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  189 bits (479), Expect = 1e-44
 Identities = 117/300 (39%), Positives = 171/300 (57%), Gaps = 4/300 (1%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK+ +   + D+R IS CNV YK IS  +++RL  ILP  I   QSAF
Sbjct: 228  KGVNSTILALIPKKKEARE-IKDYRPISCCNVLYKAISKILANRLKRILPKFIVGNQSAF 286

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SK--ESL*LHFFGIIAIGPLCLRFS 1178
            VK R ++EN+ L  E+++ Y KD   T+   + D SK  +SL   F   +      + F 
Sbjct: 287  VKDRLLIENVLLATELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAA---MNFP 343

Query: 1177 RKVYYVGYGLC-TTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQA 1001
             +  +    LC +T S+S+ +NGE  G+F+  RGLRQG  +SP+LF+I M+ LSR L++A
Sbjct: 344  GEFIH-WISLCMSTASFSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKA 402

Query: 1000 TPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKH 821
                 F YHP+C+ L L +L FADDLMI T G   SV  +  VL  F      K    K 
Sbjct: 403  AGAREFGYHPRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKT 462

Query: 820  SIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            +++LAGV+   +  +++   F  G +P RYLG+ L    L  +D+ PL+D++   I  W+
Sbjct: 463  TLYLAGVSDHSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWT 522



 Score = 87.0 bits (214), Expect = 8e-14
 Identities = 67/277 (24%), Positives = 114/277 (41%), Gaps = 14/277 (5%)
 Frame = -3

Query: 1932 GDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG--------- 1780
            GD+  K FH  +      N I  +   DG   TS  +I  + + +++  L          
Sbjct: 90   GDRNNKTFHRAITTREAVNSIREIVTRDGLVVTSQQDIQTEAVNYFQDFLQTIPADYEGM 149

Query: 1779 ---KKQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSK 1609
               + ++++P   S  D  L         L +    EEIK+ +F +  DKSP PDGYTS+
Sbjct: 150  CVEELENLLPFRCSEDDHRL---------LTRVVTGEEIKKVIFSMPKDKSPGPDGYTSE 200

Query: 1608 FFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSIL--VVLWVILDIFHXXX 1435
            F+K +W I+G     A+  FF+                P++     +  +  +   +   
Sbjct: 201  FYKASWEIIGDEVIIAIQSFFAKGFLPKGVNSTILALIPKKKEAREIKDYRPISCCNVLY 260

Query: 1434 XXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*GFIQRTRTSPKCTL 1255
                      L  ++  F+V    +     K+ +     +L   L     +   S +C +
Sbjct: 261  KAISKILANRLKRILPKFIVG---NQSAFVKDRLLIENVLLATELVKDYHKDSISTRCAM 317

Query: 1254 KIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            KID+ KA+DS+    L  VL A++FP  FI W+  C+
Sbjct: 318  KIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWISLCM 354


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  186 bits (472), Expect = 1e-43
 Identities = 118/311 (37%), Positives = 178/311 (57%), Gaps = 4/311 (1%)
 Frame = -2

Query: 1537 RTLKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQ 1358
            R L Q N   V M+PKK ++   + +FR IS CN  YKVIS  ++ RL  ILP  I  +Q
Sbjct: 497  RLLGQWNSTAVTMVPKKPNADR-ITEFRPISCCNAIYKVISKLLARRLENILPLWISPSQ 555

Query: 1357 SAFVKGRSMVENIHLVEEIMRVYTK----DENITKVHFED*SKESL*LHFFGIIAIGPLC 1190
            SAFVKGR + EN+ L  E+++ + +       + KV     + +S+   F  II      
Sbjct: 556  SAFVKGRLLTENVLLATELVQGFGQANISSRGVLKVDLRK-AFDSVGWGF--IIETLKAA 612

Query: 1189 LRFSRKVYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKL 1010
                R V ++   + T+TS+S++++G   G+FKG +GLRQGDP+SP LF+I ME LSR L
Sbjct: 613  NAPPRFVNWIKQCI-TSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLL 671

Query: 1009 NQATPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANC 830
                 +    YHPK  ++++ +LAFADDLMIF  G   S++ +  VL+ F ++S  + N 
Sbjct: 672  ENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNT 731

Query: 829  LKHSIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIK 650
             K +++ AG+  ++K       GF  G+ PFRYLG+ L    L+ +D+  L+DK++    
Sbjct: 732  EKSAVYTAGLEDTDKEDTLAF-GFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFN 790

Query: 649  GWSRVAWKTVS 617
             W   A KT+S
Sbjct: 791  HW---ATKTLS 798



 Score = 85.5 bits (210), Expect = 2e-13
 Identities = 75/280 (26%), Positives = 117/280 (41%), Gaps = 8/280 (2%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            K++  +L  GD  T  FH ++      N I  + ++ G    + DE+    + F+K L G
Sbjct: 353  KSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFG 412

Query: 1779 KKQDVIPIETSVMDSGL---KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSK 1609
                +I  E     + L   K        L  +    +IK   F +  +KSP PDGYTS+
Sbjct: 413  SSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSE 472

Query: 1608 FFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVLWVILDI-----FH 1444
            FFK+ W+IVG S   AV EFF S               P++     +     I      +
Sbjct: 473  FFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIY 532

Query: 1443 XXXXXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*GFIQRTRTSPK 1264
                         +L L  S   S  +  RLL +      + +  + + GF Q    S +
Sbjct: 533  KVISKLLARRLENILPLWISPSQSAFVKGRLLTE-----NVLLATELVQGFGQ-ANISSR 586

Query: 1263 CTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
              LK+DL+KA+DS+    +   L A + P RF+ W+  C+
Sbjct: 587  GVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCI 626


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  184 bits (466), Expect = 5e-43
 Identities = 108/296 (36%), Positives = 166/296 (56%)
 Frame = -2

Query: 1531 LKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSA 1352
            LKQ N   + +IPK  ++  T+ +FR IS  N  YKVIS  ++SRL  +L ++I ++QSA
Sbjct: 360  LKQWNATTLVLIPKTSNAC-TISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSA 418

Query: 1351 FVKGRSMVENIHLVEEIMRVYTKDENITKVHFED*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            F+ GRS+ EN+ L  E++  Y +     +   +   K++     +  +      L    +
Sbjct: 419  FLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPER 478

Query: 1171 VYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
                 +   TT S+++S+NG   GFF+  +GLRQGDP+SP+LF++ ME  S+ L      
Sbjct: 479  YINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDS 538

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
               +YHPK   L + +L FADD+MIF  G   S+  + + L  F D S  K N  K  +F
Sbjct: 539  GYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLF 598

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGW 644
             AG++ SE+   A   GF  G+ P RYLG+ L    L++AD+GPLL+K+S  ++ W
Sbjct: 599  QAGLDLSERITSAAY-GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSW 653



 Score = 99.0 bits (245), Expect = 2e-17
 Identities = 74/282 (26%), Positives = 125/282 (44%), Gaps = 4/282 (1%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +  F  +++  +  +GD  T  FH +V      N I S+ + +G    S   I D  + +
Sbjct: 209  ESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVTY 268

Query: 1797 YKGLLGKKQDVIPIETSVMDSGL--KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPD 1624
            Y+ LLG  +    +E   M+  L  + S  Q   L + F  +EIK A   +  +K+  PD
Sbjct: 269  YERLLGSIESPFSMEQEDMNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPD 328

Query: 1623 GYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVLWVILDIFH 1444
            GY+ +FF+  W+I+G     A+ EFF S +             P+ S    +     I  
Sbjct: 329  GYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI-- 386

Query: 1443 XXXXXXXXXXXVYLLDLVRSFLVSLTMHNR--LL*KEEVWWRIFIL*KRL*GFIQRTRTS 1270
                         L   ++  L ++  H++   L    +   + +  + + G+  R   S
Sbjct: 387  SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGY-NRLNIS 445

Query: 1269 PKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            P+  LK+DLKKA+DS+  E +   L AL  PER+I W+  C+
Sbjct: 446  PRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCI 487


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  184 bits (466), Expect = 5e-43
 Identities = 108/296 (36%), Positives = 166/296 (56%)
 Frame = -2

Query: 1531 LKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSA 1352
            LKQ N   + +IPK  ++  T+ +FR IS  N  YKVIS  ++SRL  +L ++I ++QSA
Sbjct: 360  LKQWNATTLVLIPKTSNAC-TISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSA 418

Query: 1351 FVKGRSMVENIHLVEEIMRVYTKDENITKVHFED*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            F+ GRS+ EN+ L  E++  Y +     +   +   K++     +  +      L    +
Sbjct: 419  FLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPER 478

Query: 1171 VYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
                 +   TT S+++S+NG   GFF+  +GLRQGDP+SP+LF++ ME  S+ L      
Sbjct: 479  YINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDS 538

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
               +YHPK   L + +L FADD+MIF  G   S+  + + L  F D S  K N  K  +F
Sbjct: 539  GYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLF 598

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGW 644
             AG++ SE+   A   GF  G+ P RYLG+ L    L++AD+GPLL+K+S  ++ W
Sbjct: 599  QAGLDLSERITSAAY-GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSW 653



 Score = 99.0 bits (245), Expect = 2e-17
 Identities = 74/282 (26%), Positives = 125/282 (44%), Gaps = 4/282 (1%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +  F  +++  +  +GD  T  FH +V      N I S+ + +G    S   I D  + +
Sbjct: 209  ESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVTY 268

Query: 1797 YKGLLGKKQDVIPIETSVMDSGL--KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPD 1624
            Y+ LLG  +    +E   M+  L  + S  Q   L + F  +EIK A   +  +K+  PD
Sbjct: 269  YERLLGSIESPFSMEQEDMNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPD 328

Query: 1623 GYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVLWVILDIFH 1444
            GY+ +FF+  W+I+G     A+ EFF S +             P+ S    +     I  
Sbjct: 329  GYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI-- 386

Query: 1443 XXXXXXXXXXXVYLLDLVRSFLVSLTMHNR--LL*KEEVWWRIFIL*KRL*GFIQRTRTS 1270
                         L   ++  L ++  H++   L    +   + +  + + G+  R   S
Sbjct: 387  SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGY-NRLNIS 445

Query: 1269 PKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            P+  LK+DLKKA+DS+  E +   L AL  PER+I W+  C+
Sbjct: 446  PRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCI 487


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  184 bits (466), Expect = 5e-43
 Identities = 109/298 (36%), Positives = 177/298 (59%), Gaps = 2/298 (0%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+I KK H  S + D+R IS CNV YK++S  +++RL EILP+ I   QSAF
Sbjct: 654  KGINTTILALISKK-HEVSGMKDYRPISCCNVLYKIVSKLMANRLKEILPASIAPNQSAF 712

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            +K R M+EN+ L  E+++ Y K+   ++   + D SK    + +  +I +  L      +
Sbjct: 713  IKDRLMMENLLLASELVKDYHKESISSRSALKIDISKAFDFVQWPFLINV--LKAIHLPE 770

Query: 1171 VYYVGYGLCT-TTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATP 995
            ++     LC  T S+S+ +NGE  GFF+ +RGLRQG  +SP+L++ICM  LS  L++A  
Sbjct: 771  MFIHWIELCIGTASFSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAV 830

Query: 994  EDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSI 815
            E   +YHP+C  + L +L FADD+M+F+ G   S++    + + F  +S  K +  K +I
Sbjct: 831  EKKISYHPRCRNMNLTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTI 890

Query: 814  FLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            F+AG++ + K+ I     F  G++P +YLG+ L    +  +D+ PL++K+   I  W+
Sbjct: 891  FMAGISPNAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWT 948


>gb|EMJ14085.1| hypothetical protein PRUPE_ppa021750mg, partial [Prunus persica]
          Length = 922

 Score =  127 bits (318), Expect(2) = 3e-42
 Identities = 96/305 (31%), Positives = 158/305 (51%), Gaps = 10/305 (3%)
 Frame = -2

Query: 1522 TNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAFVK 1343
            TN   + +IPKK +S   V D+R IS     YKVIS  ++SRL E+L + I  +Q AFV+
Sbjct: 227  TNETFICLIPKKANSVK-VTDYRPISLVTSLYKVISKVLASRLREVLGNTISQSQGAFVQ 285

Query: 1342 GRSMVENIHLVEEIMRVYTKDEN---ITKVHFED*SKESL*LHFFGIIAIGPLCLRFSRK 1172
             R +++ + +  E++    K +    + K+ FE  + + +  +F     +  +  R    
Sbjct: 286  KRQILDAVLVANEVVEEVRKQKRKGLVFKIDFEK-AYDHVEWNF-----VDDVMARKGFG 339

Query: 1171 VYYVGY--GLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQAT 998
            V + G+  G   + ++S+ ING+  G F+  RGLRQGDP+SPFLF +  + LSR + +A 
Sbjct: 340  VKWRGWIIGCLESVNFSIMINGKPRGKFRASRGLRQGDPLSPFLFTLVSDVLSRLIERA- 398

Query: 997  PEDCFNYH---PKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCL 827
             +D    H      +++++ +L FADD +    G       +  +LK F DVS  K N  
Sbjct: 399  -QDVNLVHGIVSGHDQVEVSHLQFADDTIFLLDGKEEYWLNLLQLLKLFCDVSGMKINKA 457

Query: 826  KHSIFLAGVNYSEK--SGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTI 653
            K  I   G+N+S +  + +A   G   G  P  YLG+ L G    +  + P+++KV   +
Sbjct: 458  KSCIL--GINFSTEVLNNMAGSWGCEVGCWPMVYLGLPLGGNPRALNFWNPVMEKVEKRL 515

Query: 652  KGWSR 638
            + W R
Sbjct: 516  QKWKR 520



 Score = 75.9 bits (185), Expect(2) = 3e-42
 Identities = 46/135 (34%), Positives = 69/135 (51%), Gaps = 3/135 (2%)
 Frame = -3

Query: 1935 QGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLGKKQDVIPI 1756
            +GD  TK FH +     KRN I  +  ED         I  + + F+KGL    ++V   
Sbjct: 91   EGDGNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDANIEREVIRFFKGLYSSNKNV--- 147

Query: 1755 ETSVMDSGLK---ISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSKFFKQAWNI 1585
                   GL    IS ++A  L + F +EE+++A+FD G DKSP PDG++  FF+  W +
Sbjct: 148  --GWGVEGLNWCPISQVEADWLERPFDLEEVQKAVFDCGKDKSPGPDGFSMSFFQSCWEV 205

Query: 1584 VGGSFCEAVLEFFSS 1540
            V G   + + +FF S
Sbjct: 206  VKGDLMKVMQDFFQS 220


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  180 bits (456), Expect = 7e-42
 Identities = 113/297 (38%), Positives = 166/297 (55%), Gaps = 1/297 (0%)
 Frame = -2

Query: 1528 KQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAF 1349
            K  N  I+A+IPKK+ +   + D+R IS CNV YKVIS  I++RL  +LP  I   QSAF
Sbjct: 395  KGINSTILALIPKKKEAKE-MKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAF 453

Query: 1348 VKGRSMVENIHLVEEIMRVYTKDENITKVHFE-D*SKESL*LHFFGIIAIGPLCLRFSRK 1172
            VK R ++EN+ L  EI++ Y KD   ++   + D SK    + +  +I +    + F  +
Sbjct: 454  VKDRLLIENVLLATEIVKDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLE-AMNFPPE 512

Query: 1171 VYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPE 992
              +      TT S+S+ +NGE  G F   R LRQG  +SP+LF+I M+ LS+ L++A   
Sbjct: 513  FTHWITLCITTASFSVQVNGELAGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGA 572

Query: 991  DCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
              F YHPKC  + L +L+FADDLMI + G   S+  +  VL  F   S  K +  K +++
Sbjct: 573  RQFGYHPKCRAIGLTHLSFADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMY 632

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGWS 641
            LAGV  S    I     F  G +P RYLG+ L    L  +D  PL++++   I+ W+
Sbjct: 633  LAGVQASVYQEIVQKFSFDVGKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWT 689



 Score = 89.4 bits (220), Expect = 2e-14
 Identities = 70/292 (23%), Positives = 124/292 (42%), Gaps = 14/292 (4%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +K    ++K  +L  GD+  K FH  V     +N I  +   DG+  +  ++I  +    
Sbjct: 242  EKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHH 301

Query: 1797 YKGLLG------------KKQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFD 1654
            ++  L             + QD++P   S  D  +  + + A         EEI + +F 
Sbjct: 302  FREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSA---------EEIHKVVFS 352

Query: 1653 IGDDKSPRPDGYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILV 1474
            + +DKSP PDGYT++F+K AWNI+G  F  A+  FF+                P++    
Sbjct: 353  MPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAK 412

Query: 1473 VL--WVILDIFHXXXXXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL 1300
             +  +  +   +             L  ++  F+V    +     K+ +     +L   +
Sbjct: 413  EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVG---NQSAFVKDRLLIENVLLATEI 469

Query: 1299 *GFIQRTRTSPKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
                 +   S +C LKID+ KA+DS+  + L  VL A++FP  F  W+  C+
Sbjct: 470  VKDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCI 521


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  179 bits (455), Expect = 9e-42
 Identities = 106/305 (34%), Positives = 169/305 (55%), Gaps = 6/305 (1%)
 Frame = -2

Query: 1540 TRTLKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNA 1361
            +R  +  N  +V ++PK +H++  V +FR I+ C V YK+IS  +++R+  I+  +++ A
Sbjct: 488  SRMHRPINCIVVTLLPKVQHATR-VKEFRPIACCTVIYKIISKMLTNRMKGIIGEVVNEA 546

Query: 1360 QSAFVKGRSMVENIHLVEEIMRVYTKDEN----ITKVHFED*SKESL*LHFFGIIAIGPL 1193
            QS F+ GR + +NI L  E++R YT+       I KV     + +S+   F     +  L
Sbjct: 547  QSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRK-AYDSVEWSF-----LETL 600

Query: 1192 CLRFSRKVYYVGYGL--CTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLS 1019
               F     +VG+ +   +T SYS+ +NG     F+ ++GLRQGDP+SPFLF +CMEYLS
Sbjct: 601  LYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLS 660

Query: 1018 RKLNQATPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFK 839
            R L +      FN+HPKCE+L + +L FADDL++F   D  S+  ++   + F   S   
Sbjct: 661  RCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLA 720

Query: 838  ANCLKHSIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSN 659
            A+  K +I+  GV+      +A       G +PFRYLG+ L    L  A   PL++ ++N
Sbjct: 721  ASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITN 780

Query: 658  TIKGW 644
              + W
Sbjct: 781  RAQTW 785



 Score =  113 bits (282), Expect = 1e-21
 Identities = 77/283 (27%), Positives = 128/283 (45%), Gaps = 11/283 (3%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            K++  +L QGD  +KLF + VK     N I  +  EDG      DE+ ++ L FYK LLG
Sbjct: 347  KSRITWLQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLG 406

Query: 1779 KKQDVIP-IETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSKFF 1603
             +   +  ++ + +  G  +S+    SL+++    EI EAL  IG+DK+P  DG+ + FF
Sbjct: 407  TRASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFF 466

Query: 1602 KQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPR----------RSILVVLWVILD 1453
            K++W  +       + EFF++               P+          R I     +   
Sbjct: 467  KKSWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKI 526

Query: 1452 IFHXXXXXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*GFIQRTRT 1273
            I               + +    F+    + + +L   E+          + G+  R   
Sbjct: 527  ISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASEL----------IRGY-TRKHM 575

Query: 1272 SPKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            SP+C +K+D++KAYDS+    L+ +LY   FP RF+ W+M CV
Sbjct: 576  SPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIMECV 618


>emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score =  122 bits (305), Expect(2) = 9e-41
 Identities = 88/296 (29%), Positives = 150/296 (50%), Gaps = 4/296 (1%)
 Frame = -2

Query: 1519 NHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSAFVKG 1340
            N A +A+IPK ++ SS + D+R IS     YK+++  ++ RL  ++ SLI   QS++VKG
Sbjct: 486  NTAYIALIPKIDNPSS-LKDYRPISMVGFIYKIVAKLLAKRLQSVISSLISPLQSSYVKG 544

Query: 1339 RSMVENIHLVEEIMRVYTK---DENITKVHFED*SKESL*LHFFGIIAIGPLCLRFSRKV 1169
            R +++   +  EI+    K   +  + K+ F   + +S+  +F          + F  K 
Sbjct: 545  RQILDGALVASEIIESCKKRNIEAILLKLDFHK-AYDSVSWNFLQWTLDQ---MNFPVKW 600

Query: 1168 YYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQATPED 989
                    T+ S S+ +NG     FK  RGLRQGDP+SPFLF++  E LS+ +++AT   
Sbjct: 601  CEWIKTCVTSASASILVNGSPTPPFKLHRGLRQGDPLSPFLFVLVGEVLSQMISKATSLQ 660

Query: 988  CFNYHPKCEK-LKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHSIF 812
             +   P C +  ++ +L +ADD ++F   +  S+K +   L  F  VS  + N  K S+ 
Sbjct: 661  LWRGIPACSRGSEITHLQYADDTLMFCEANTNSLKNIQKTLIIFQLVSGLQVNFHKSSLM 720

Query: 811  LAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGW 644
               V  S     A       G++PF YLG+ +     ++  + P++DK+   +  W
Sbjct: 721  GLNVTSSWIQEAANSLMCKIGTIPFSYLGLPIGDNPARIRTWDPIIDKLEKKLASW 776



 Score = 75.9 bits (185), Expect(2) = 9e-41
 Identities = 45/143 (31%), Positives = 75/143 (52%)
 Frame = -3

Query: 1968 FA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKG 1789
            +A +++  +L  GDK TK FH++     ++N++A + E DG +T    +I  +   F+K 
Sbjct: 339  WAQRSRITWLKAGDKNTKFFHAIASNKKRKNMMACI-ETDGQSTNDPSQIKKEARAFFKK 397

Query: 1788 LLGKKQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSK 1609
            +   K+D +   T       ++S  QA SL+  F  EEI  A+     DK+P PDG+  K
Sbjct: 398  IF--KEDHVKRPTLENLHLKRLSQNQANSLITPFTTEEIDTAVSSCASDKAPGPDGFNFK 455

Query: 1608 FFKQAWNIVGGSFCEAVLEFFSS 1540
            F K AW+I+       V +F+ +
Sbjct: 456  FVKSAWDIIKTDIYGIVNDFWET 478


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  170 bits (431), Expect = 5e-39
 Identities = 103/298 (34%), Positives = 159/298 (53%)
 Frame = -2

Query: 1537 RTLKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQ 1358
            + LKQ N   + +IPK  ++SS + DFR IS  N  YKVIS  ++ RL + LP+ I ++Q
Sbjct: 395  KLLKQWNATNLVLIPKITNASS-MSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQ 453

Query: 1357 SAFVKGRSMVENIHLVEEIMRVYTKDENITKVHFED*SKESL*LHFFGIIAIGPLCLRFS 1178
            SAF+ GR  +EN+ L  E++  Y K         +   +++     +  I      L   
Sbjct: 454  SAFMPGRLFLENVLLATELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVP 513

Query: 1177 RKVYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQAT 998
             K         +T S+S+ +NG + G F   +GLRQGDP+SP+LF++ ME  S  L    
Sbjct: 514  EKFTCWILECLSTASFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRY 573

Query: 997  PEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLKHS 818
                  YHPK  +L++ +L FADD+MIF  G   S+  + + L+ F   S    N  K  
Sbjct: 574  TSGYIAYHPKTSQLEISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQ 633

Query: 817  IFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVSNTIKGW 644
            ++ AG++ SE   +A+  GF  GS+P RYLG+ L    L +A++ PL++K++     W
Sbjct: 634  LYHAGLSQSESDSMASY-GFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSW 690



 Score = 87.4 bits (215), Expect = 6e-14
 Identities = 65/275 (23%), Positives = 122/275 (44%), Gaps = 3/275 (1%)
 Frame = -3

Query: 1959 KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLG 1780
            +++  +L +GD  +  FH +       N I  +++  G        + +  + +++  LG
Sbjct: 252  RSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENHCVEYFQSNLG 311

Query: 1779 KKQDVIPIETSVMDSGL--KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSKF 1606
             +Q +   E + + + L  + S  Q  SL   F  E+IK A F +  +K+  PDG++ +F
Sbjct: 312  SEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNKASGPDGFSPEF 371

Query: 1605 FKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPRRSILVVLWVILDIFHXXXXXX 1426
            F   W I+GG   EA+ EFF+S +             P+ +    +     I        
Sbjct: 372  FCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDFRPI--SCLNTV 429

Query: 1425 XXXXXVYLLDLVRSFLVSLTMHNR-LL*KEEVWWRIFIL*KRL*GFIQRTRTSPKCTLKI 1249
                   L D ++ FL +   H++       ++    +L   L     +   +P   LK+
Sbjct: 430  YKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKNIAPSSMLKV 489

Query: 1248 DLKKAYDSISLELLQ*VLYALDFPERFIMWVMACV 1144
            DL+KA+DS+  + +   L AL+ PE+F  W++ C+
Sbjct: 490  DLRKAFDSVRWDFIVSALRALNVPEKFTCWILECL 524


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  141 bits (356), Expect(2) = 8e-39
 Identities = 103/306 (33%), Positives = 162/306 (52%), Gaps = 6/306 (1%)
 Frame = -2

Query: 1543 FTRTLKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDN 1364
            F R +  T   ++A    K+  ++T  DFR IS C +  K+++  +++RL ++LPSLI  
Sbjct: 450  FPRGVTSTTLVLLA----KKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISE 505

Query: 1363 AQSAFVKGRSMVENIHLVEEIM-RVYTKDENITKVHFED*SKESL*LHFFGIIAIGPLCL 1187
             QS FV GR + +NI L +E++ ++  K      V   D  K    L++  +I +     
Sbjct: 506  NQSGFVSGRLINDNILLAQELIGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILV---LE 562

Query: 1186 RFSRKVYYVGY--GLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRK 1013
            RF     ++       T   +S+ ING + G+FK +RGLRQGD ISP LF++  EYLSR 
Sbjct: 563  RFGFNDMWIDMIRRCITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRG 622

Query: 1012 LNQA-TPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKA 836
            +N+  +     +YH  C  L + +LAFADD+MIFT G    ++ + + L+ +  +S  + 
Sbjct: 623  INELFSRYISLHYHSGC-SLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRV 681

Query: 835  NCLKHSIFLAGVNY--SEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDKVS 662
            N  K S F+   N   S +  I+   GF + ++P  YLG  L     KV  F  L++K+ 
Sbjct: 682  NHQK-SCFVTANNMPSSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIR 740

Query: 661  NTIKGW 644
              I GW
Sbjct: 741  ERITGW 746



 Score = 49.7 bits (117), Expect(2) = 8e-39
 Identities = 38/139 (27%), Positives = 63/139 (45%), Gaps = 14/139 (10%)
 Frame = -3

Query: 1920 TKLFHSLVKRNSK------RNLIASVTEEDGTATTSLDEIHDQFLCFYKGLLGK------ 1777
            T+L ++  K N++      RN I  + + +GT       I    + F++ LL        
Sbjct: 316  TQLQYAYAKLNNQMQKKRVRNSIFKIQDSEGTLMEEPGLIESSAVEFFENLLKAENYDLS 375

Query: 1776 --KQDVIPIETSVMDSGLKISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPDGYTSKFF 1603
              K + IP   S  D+ L  +  Q         ++E+K+A+F I  D    PDG++S F+
Sbjct: 376  RFKAEFIPQMLSDADNNLLCAEPQ---------LQEVKDAVFAIDKDSVVGPDGFSSFFY 426

Query: 1602 KQAWNIVGGSFCEAVLEFF 1546
            +Q W I+      AV +FF
Sbjct: 427  QQCWPIIAEDLLAAVRDFF 445


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
            putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  169 bits (428), Expect = 1e-38
 Identities = 107/292 (36%), Positives = 157/292 (53%), Gaps = 4/292 (1%)
 Frame = -2

Query: 1531 LKQTNHAIVAMIPKKEHSSSTVGDFRHISYCNVFYKVISNCISSRLGEILPSLIDNAQSA 1352
            LKQ N   + +IPK  ++S T  DFR IS  N  YKVI+  ++ RL ++L  +I  +QSA
Sbjct: 478  LKQWNATTIVLIPKFPNASCT-SDFRPISCMNTLYKVIARLLTDRLQKLLSCVISPSQSA 536

Query: 1351 FVKGRSMVENIHLVEEIMRVYTKDENITKVHFED*SKESL*LHFFGI----IAIGPLCLR 1184
            F+ GR + EN+ L  E++  Y    N   +      K  L   F  +    I    L L 
Sbjct: 537  FLPGRLLAENVLLATEMVHGY----NWRNISLRGMLKVDLRKAFDSVRWEFIIAALLALG 592

Query: 1183 FSRKVYYVGYGLCTTTSYSLSINGENIGFFKGQRGLRQGDPISPFLFMICMEYLSRKLNQ 1004
               K     +   +T ++++S+NG   GFFK  +GLRQGDP+SP+LF++ ME  S+ LN 
Sbjct: 593  VPTKFINWIHQCISTPTFTVSVNGCCGGFFKSAKGLRQGDPLSPYLFVLAMEVFSKLLNS 652

Query: 1003 ATPEDCFNYHPKCEKLKLCNLAFADDLMIFTLGDFLSVKTVHDVLKHFGDVSSFKANCLK 824
                    YHPK   L + +L FADD+MIF  G   S+  + + L+ F   S  K N  K
Sbjct: 653  RFDSGYIRYHPKASDLSISHLMFADDVMIFFDGGSSSLHGICETLEDFASWSGLKVNNDK 712

Query: 823  HSIFLAGVNYSEKSGIATITGFSYGSMPFRYLGILLAGVYLKVADFGPLLDK 668
               F AG+  +E++ +A   GF  G +P RYLG+ L    L++A++ PLL+K
Sbjct: 713  SHFFCAGLEQAERNSLAAY-GFPQGCLPIRYLGLPLMCRKLRIAEYEPLLEK 763



 Score = 60.5 bits (145), Expect = 8e-06
 Identities = 73/305 (23%), Positives = 117/305 (38%), Gaps = 12/305 (3%)
 Frame = -3

Query: 1977 KKIFA*KTKCGFLLQGDKCTKLFHSLVKRNSKRNLIASVTEEDGTATTSLDEIHDQFLCF 1798
            +  F  +++  +  +GD  T+ FH +       N I ++ ++ GT   S   I D    +
Sbjct: 341  ESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCALY 400

Query: 1797 YKGLLGKKQDVIPIETSVMDSGL--KISSLQA*SLVQDFIVEEIKEALFDIGDDKSPRPD 1624
            ++ LL    D   +E   M+  L  +    Q   L   F  E+IK A F +  +K+  PD
Sbjct: 401  FENLLSDDNDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACGPD 460

Query: 1623 GYTSKFFKQAWNIVGGSFCEAVLEFFSSREXXXXXXXXXXX*SPR----------RSILV 1474
            G+                  AV EFF S               P+          R I  
Sbjct: 461  GF--------------PVTAAVREFFISGNLLKQWNATTIVLIPKFPNASCTSDFRPI-- 504

Query: 1473 VLWVILDIFHXXXXXXXXXXXVYLLDLVRSFLVSLTMHNRLL*KEEVWWRIFIL*KRL*G 1294
                 ++  +             LL  V S   S  +  RLL +      + +  + + G
Sbjct: 505  ---SCMNTLYKVIARLLTDRLQKLLSCVISPSQSAFLPGRLLAE-----NVLLATEMVHG 556

Query: 1293 FIQRTRTSPKCTLKIDLKKAYDSISLELLQ*VLYALDFPERFIMWVMACVLQHPTPLVSM 1114
            +  R   S +  LK+DL+KA+DS+  E +   L AL  P +FI W+  C+   PT  VS+
Sbjct: 557  YNWR-NISLRGMLKVDLRKAFDSVRWEFIIAALLALGVPTKFINWIHQCI-STPTFTVSV 614

Query: 1113 EKILG 1099
                G
Sbjct: 615  NGCCG 619

