BLASTX nr result
ID: Mentha27_contig00010814
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00010814
         (2261 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  465   e-128
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...  432   e-118
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  428   e-117
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]              415   e-113
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...  411   e-112
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  407   e-110
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...  403   e-109
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  399   e-108
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...  398   e-108
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]      393   e-106
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...  385   e-104
emb|CAB72467.1| putative protein [Arabidopsis thaliana]              384   e-103
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...  382   e-103
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]               382   e-103
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...  380   e-102
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]          380   e-102
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]            373   e-100
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...  367   1e-98
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...
366 2e-98 emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li... 365 6e-98 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 465 bits (1197), Expect = e-128 Identities = 259/683 (37%), Positives = 379/683 (55%), Gaps = 10/683 (1%) Frame = -2 Query: 2245 GYRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVD 2066 G LS + + LIR V+ EI AL IG+DKAPG DG+ + FFKK+W + ++ A + Sbjct: 423 GKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEIYAGIQ 482 Query: 2065 EFFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRL 1886 EFF+ + R +N VV+L+PK H V +FRPIAC V+YKII+K+L++RM ++ + Sbjct: 483 EFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRMKGIIGEV 542 Query: 1885 ISPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVL 1706 ++ +QS FI GR+I DN LA ELI+ Y RK ++ RC++K+D+RKAYD + W FL +L Sbjct: 543 VNEAQSGFIPGRHIADNILLASELIRGYTRKH-MSPRCIMKVDIRKAYDSVEWSFLETLL 601 Query: 1705 YGLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSR 1526 Y F F+ WI+ CV++ ++S+ +NG + ++GLRQGDPMSP LF CM+YLSR Sbjct: 602 YEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSR 661 Query: 1525 LLHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAI 1346 L + F HPKC+ +ITHL FADDLL+F R D S+ + ++F+ SGLA Sbjct: 662 CLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAA 721 Query: 1345 NKSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNF 1166 + KS+I+ GV E+ + G LP +YLG+PL SK LT L+ I+N Sbjct: 722 SHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNR 781 Query: 1165 IHRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSS 1001 W LS AGRL+LI+S+L ++ YW PL VI + K+ RKFLW + Sbjct: 782 AQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETK 841 Query: 1000 YCPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDI 821 PV+W T+ P+ GG + ++ WN+A K LW I K D LW++WIH+ Y++ QDI Sbjct: 842 KAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDI 901 Query: 820 WEFPFPKRDAPHITNILRIRDRLILDCGGNLNDAKTKLAGWFTGKGTSEAYEHFRTKGEK 641 
+ + I++ RD L N+ D G +AY+ GE+ Sbjct: 902 LTVNISNQTTWILRKIVKARDHL-----SNIGDWDEICIG--DKFSMKKAYKKISENGER 954 Query: 640 KFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLKHSDIA--RGCVLCDSSDETHDHLFF 467 W + I +Y PK LW+ L RL T+DR+ + LC + ET HLFF Sbjct: 955 VRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQHLFF 1014 Query: 466 TCEKSLAVWSGICSWLRCRNQMIT---IPSAVRRFQREKAGSGIIRKAKWVALGATVQYL 296 +C S VWS IC +R N ++ I S+V R+K G I+ + V + Sbjct: 1015 SCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKKKGKLIV-----MLYTEFVYAI 1069 Query: 295 WQARNLKYVAKKPFEVSHVIKEI 227 W+ RN + + + + V+++I Sbjct: 1070 WKQRNKRTFTGENKDENEVLRKI 1092 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 432 bits (1110), Expect = e-118 Identities = 255/686 (37%), Positives = 368/686 (53%), Gaps = 14/686 (2%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR S E+ L+ ++ E+ F I +K+PGPDGYT FF++ W ++ +V A+ Sbjct: 705 YRYSLHEQNLLVAEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKS 764 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF+ G + + LN T+++LIPK ++ + D+RPI+C NV+YK I+K+L++R+ LL I Sbjct: 765 FFTYGFLPKGLNSTILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFI 824 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 +P+QSAFI R +M+N LA EL+K Y K G++ RC +KIDL KA+D + W FL + L Sbjct: 825 APNQSAFISDRLLMENLLLASELVKDYH-KDGLSPRCAMKIDLSKAFDSVQWPFLLNTLA 883 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L+ FIHWI C+++A+FS+ +NG LRQG +SP LF+ CM+ LS + Sbjct: 884 ALDIPEKFIHWINLCISTASFSVQVNG-----------LRQGCSLSPYLFVICMNVLSAM 932 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L F +HP+C +THL FADD+++F G S+ + ++F A SGL I+ Sbjct: 933 LDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNIS 992 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 KS +F+ + IL F F 
G+LPV+YLGLPL +K +T D L+ +I + I Sbjct: 993 LEKSTLFMASISSETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRI 1052 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-GSSYCP-- 992 W LS AGRL+L+ SV+ + +W+ A LP I I ++ FLW G+ P Sbjct: 1053 SSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHK 1112 Query: 991 --VSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 V+W VC P+ EGGLGLR L NK K +W + + +LW+ WI +R + Sbjct: 1113 AKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIR--TVA 1170 Query: 817 EFPFPKRDAPHITNILR-IRDRL-ILDCGGNLNDAKTKL----AGWFTGKGTS-EAYEHF 659 E R H +IL I + L L C G + L G F K S E + Sbjct: 1171 EALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQI 1230 Query: 658 RTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLK--HSDIARGCVLCDSSDET 485 R +G K WHKAIW S PKF+ WLA RL T D++ + I+ CVLC+ S E+ Sbjct: 1231 REQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAES 1290 Query: 484 HDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREKAGSGIIRKAKWVALGATV 305 DHLFF+C S +W + L P+ + + SG R AT+ Sbjct: 1291 RDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDF-SGTKRFLLRYVFQATI 1349 Query: 304 QYLWQARNLKYVAKKPFEVSHVIKEI 227 LW+ RN + P H+IK I Sbjct: 1350 HTLWRERNKRRHGDLPIPSDHIIKFI 1375 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 428 bits (1100), Expect = e-117 Identities = 245/690 (35%), Positives = 365/690 (52%), Gaps = 13/690 (1%) Frame = -2 Query: 2257 VMGAGYRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVV 2078 V+ G +LS +L++P+++ EI AL DI D KAPG DG+ S FFKK+W ++ ++ Sbjct: 422 VVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQEIY 481 Query: 2077 AAVDEFFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPL 1898 + +FF G + + +N T V+LIPK D+RPIAC + +YKII+KIL+ R+ + Sbjct: 482 EGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQAV 541 Query: 1897 LQRLISPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFL 1718 + ++ +Q+ FI R+I DN LA ELI+ Y R R ++ RC++K+D+RKAYD + W FL Sbjct: 542 ITEVVDCAQTGFIPERHIGDNILLATELIRGYNR-RHVSPRCVIKVDIRKAYDSVEWVFL 600 Query: 1717 RDVLYGLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMD 1538 +L L F FI WI+ CV + ++SI +NG Q+GLRQGDP+SP LF M+ Sbjct: 601 ESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFALSME 660 Query: 1537 YLSRLLHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATS 1358 YLSR + F HPKC+ +THL FADDLL+F R D S+ + F+ S Sbjct: 661 YLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFNSFSKAS 720 Query: 1357 GLAINKSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQ 1178 GL + KS I+ GGV E ++ + P G+LP +YLG+PLASK L LI + Sbjct: 721 GLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKPLIDK 780 Query: 1177 ISNFIHRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW---- 1010 I+ W LS AGRL+L++++L ++ YW Q PLP +I + RKFLW Sbjct: 781 ITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFLWTGTV 840 Query: 1009 GSSY-CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLR 833 +SY PV+W + P+ GGL + ++ +WNKA K LW I K D LW++W++A Y++ Sbjct: 841 DTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYYIK 900 Query: 832 GQDIWEFPFPKRDAPHITNILRIRDRLILDCGGNLNDAKTKLAGW-----FTGKGTSEAY 668 Q+I + + I R+ L T+ GW + Y Sbjct: 901 RQNIENVTVSSNTSWILRKIFESRELL------------TRTGGWEAVSNHMNFSIKKTY 948 Query: 667 
EHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLK--HSDIARGCVLCDSS 494 + + E W + I + PK LWLA+ RL T +R+ + D++ C +C + Sbjct: 949 KLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGNE 1008 Query: 493 DETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREKAGSGIIRKAKWVAL- 317 ET HLFF C S +W + +L + Q A + +KA S R +V + Sbjct: 1009 IETIQHLFFNCIYSKEIWGKVLLYLNLQPQ--ADAQAKKELAIKKARSTKDRNKLYVMMF 1066 Query: 316 GATVQYLWQARNLKYVAKKPFEVSHVIKEI 227 +V +W RN K + +K I Sbjct: 1067 TESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 415 bits (1066), Expect = e-113 Identities = 214/501 (42%), Positives = 300/501 (59%), Gaps = 6/501 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 +R + + L R VS EI+T LF + DK+PGPDGYTS F+K WD++ + V Sbjct: 86 FRCTNSDNEMLTREVSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTLPVQS 145 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF KG + + +N +++LIPK + D+RPI+C NV+YK+I+KI+++R+ LL R I Sbjct: 146 FFQKGFLPKGINSIILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLLLPRFI 205 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 + +QSAF+K R +++N LA EL+K Y K I+ARC +KID+ KA+D + W FL + L Sbjct: 206 AENQSAFVKDRLLIENLLLATELVKDYH-KDSISARCAIKIDISKAFDSVQWSFLTNTLV 264 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 +NF P FIHWI C+T+A+FS+ +NG G+ + +RGLRQG +SP LF+ CMD LS++ Sbjct: 265 AMNFSPTFIHWINLCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKM 324 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L F HPKC +THL+FADDL++ G S+ + + +EF SGL I+ Sbjct: 325 LDKAAGVRKFGFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRIS 384 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 KS +++ GV P K EI F F G LPV+YLGLPL +K LT+ DY+ L+ QI I Sbjct: 385 LEKSTLYMAGVSPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRI 444 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGSS-----Y 998 
W++ S AGR LI+SVL + +WL A LP I I KL FLW S Sbjct: 445 ATWTFRFFSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHK 504 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 +SW VC P+ EGGLGLR+L N K +W I + +++LW KW+ +R + IW Sbjct: 505 AKISWDIVCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIW 564 Query: 817 EFPFPKRDAPHI-TNILRIRD 758 I IL+IRD Sbjct: 565 SLKQSTSMGSWIWRKILKIRD 585 Score = 67.0 bits (162), Expect = 4e-08 Identities = 34/141 (24%), Positives = 67/141 (47%), Gaps = 7/141 (4%) Frame = -2 Query: 682 TSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRL----KHSDIARG 515 T + + + WHK +W + PK+++ WLA+ RL T DR+ ++ Sbjct: 687 TRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGN 746 Query: 514 CVLCDSSDETHDHLFFTCEKSLAVWSGICSWL---RCRNQMITIPSAVRRFQREKAGSGI 344 CVLC ++ +T +HLFF+C + VW+ + + R + + + + +++ + Sbjct: 747 CVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHFQDRVEGFL 806 Query: 343 IRKAKWVALGATVQYLWQARN 281 R AT+ ++W+ RN Sbjct: 807 TR----YIFQATIYHVWRERN 823 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. 
vesca] Length = 958 Score = 411 bits (1057), Expect = e-112 Identities = 245/681 (35%), Positives = 357/681 (52%), Gaps = 13/681 (1%) Frame = -2 Query: 2230 PEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAA-VDEFFS 2054 P+ L + +IR F + +K+PGPDG+ FF+K W ++ ++VVAA V EFFS Sbjct: 263 PDLAKSLCNEFTHDDIRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFS 322 Query: 2053 KGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLISPS 1874 G +L +LN T+++L+PK ++ +SDFRPI+C N YKII K+L++R+ L ++ PS Sbjct: 323 YGSLLMELNSTIITLVPKVANPTTMSDFRPISCCNTFYKIIAKLLANRLKGTLHLIVGPS 382 Query: 1873 QSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLYGLN 1694 QS FI GR I DN LA+E+I Y + G RC +D+ KA D + WDF+ L N Sbjct: 383 QSTFIPGRRIGDNILLAQEIICDYHKADG-QPRCTFMVDMMKANDTVEWDFIIATLQAFN 441 Query: 1693 FHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRLLHA 1514 I WI +C++SA FS+ +NG GF +RGLRQGDP+SP LF+ M+ LS + Sbjct: 442 IPSTLIGWIKSCISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQR 501 Query: 1513 RTHAST-FIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAINKS 1337 R + S F +H +CD +++HL FADDLL+F GD +S+R L D F + S L N S Sbjct: 502 RINCSPCFRYHWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVS 561 Query: 1336 KSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFIHR 1157 +S IFL GV +L++ F GT PV+YLG+PL + L D + L+ +I I Sbjct: 562 ESKIFLAGVDGNSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKS 621 Query: 1156 WSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSYCP 992 W LS AGRL+LI+SVL ++ YW L LP V+ I K +R FLW G + Sbjct: 622 WENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATK 681 Query: 991 VSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIWEF 812 V+W +CLP+ EGGLG++DL WNKAL +WN+ + + W W+ L+G W Sbjct: 682 VAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNA 741 Query: 811 PFPKRDAPHITNILRIRDRLILDCGGNLNDAKTKLAGWFTGKGTSEAYEHFRTKGEKKFW 632 P P + + +L+IR+ L C +N + G G+ TS ++++ G Sbjct: 742 PLPSICSWNWRKLLKIRE---LCCSFFVN-----IIG--DGRATSLWFDNWHPLGPLTL- 790 Query: 631 
HKAIWRSYI--PPKFSVTLWLALQGRLKTLDRLKHSDIARGCV----LCDSSDETHDHLF 470 W S I S + L G T +R V L ETH+HLF Sbjct: 791 ---RWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLF 847 Query: 469 FTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQ 290 F C S +W+ + S ++ + G+ + +AL A V +W+ Sbjct: 848 FDCAYSFGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWR 907 Query: 289 ARNLKYVAKKPFEVSHVIKEI 227 RN + + + V K I Sbjct: 908 ERNNRRFRNESLPPAVVFKGI 928 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 407 bits (1045), Expect = e-110 Identities = 206/480 (42%), Positives = 295/480 (61%), Gaps = 5/480 (1%) Frame = -2 Query: 2239 RLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDEF 2060 R S ++ LIRPV+ EIR LF + DK+PGPDGYTS FFK W+++ ++ AV F Sbjct: 440 RCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSF 499 Query: 2059 FSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLIS 1880 F+KG + + +N T+++LIPK + + D+RPI+C NV+YK+I+KI+++R+ +L + I+ Sbjct: 500 FTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIA 559 Query: 1879 PSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLYG 1700 +QSAF+K R +++N LA EL+K Y K I+ RC +KID+ KA+D + W FL +V Sbjct: 560 GNQSAFVKDRLLIENLLLATELVKDYH-KDTISTRCAIKIDISKAFDSVQWPFLINVFTI 618 Query: 1699 LNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRLL 1520 L F FIHWI C+T+A+FS+ +NG G+ + RGLRQG +SP LF+ CMD LS++L Sbjct: 619 LGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKML 678 Query: 1519 HARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAINK 1340 A F +HPKC T +THL+FADDL++ G S+ + +EF SGL I+ Sbjct: 679 DKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISL 738 Query: 1339 SKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFIH 1160 KS ++L G+ + E+ + F F G LPV+YLGLPL +K L+T D L+ Q+ I Sbjct: 739 EKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIG 798 Query: 1159 RWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSYC 995 W+ 
LS AGRL LI SVL + +WL A LP I + K+ FLW S+ Sbjct: 799 SWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKA 858 Query: 994 PVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIWE 815 +SW VC P+ EGGLGLR L N K +W I + +++LW+KW+ LR WE Sbjct: 859 KISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWE 918 Score = 80.9 bits (198), Expect = 2e-12 Identities = 49/168 (29%), Positives = 79/168 (47%), Gaps = 5/168 (2%) Frame = -2 Query: 682 TSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLKH--SDIARGCV 509 T + + H R+ + WHK IW S+ PK+S WLA GRL T DR+ + + IA C+ Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCI 1099 Query: 508 LCDSSDETHDHLFFTCEKSLAVWSGICSWL---RCRNQMITIPSAVRRFQREKAGSGIIR 338 C + ET DHLFFTC + +W + + + + +I A+ Q + + R Sbjct: 1100 FCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRR 1159 Query: 337 KAKWVALGATVQYLWQARNLKYVAKKPFEVSHVIKEIKLDVYRVLYSL 194 AT+ +W+ RN + + P S ++ I + L S+ Sbjct: 1160 ----YVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSSI 1203 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 403 bits (1035), Expect = e-109 Identities = 198/480 (41%), Positives = 298/480 (62%), Gaps = 5/480 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 ++ R L VS +I++ F + +K+PGPDGYTS FFKK W ++ ++AAV E Sbjct: 432 FKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQE 491 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF G +L + N T V+++PK + +++FRPI+C N +YK+I+K+L+ R+ +L I Sbjct: 492 FFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISKLLARRLENILPLWI 551 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 SPSQSAF+KGR + +N LA EL++ + + I++R ++K+DLRKA+D + W F+ + L Sbjct: 552 SPSQSAFVKGRLLTENVLLATELVQGFGQAN-ISSRGVLKVDLRKAFDSVGWGFIIETLK 610 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 N P F++WI C+TS +FSI ++G G+ +G +GLRQGDP+SP+LF+ M+ LSRL Sbjct: 611 
AANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRL 670 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L + + +HPK I+ LAFADDL++F G S+R ++ LE F SGL +N Sbjct: 671 LENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMN 730 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 KS ++ G+ +K + L FGF GT P +YLGLPL + L DY+ LI +I+ Sbjct: 731 TEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARF 789 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGSSY----- 998 + W+ LS AGRL+LI SV+ +WL + LP + I ++ +FLWG+ Sbjct: 790 NHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGD 849 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 VSW+ CLP+ EGGLGLR+ WNK L+ + +W + A+ D+LW+ W HA LR + W Sbjct: 850 IKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFW 909 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 399 bits (1026), Expect = e-108 Identities = 203/480 (42%), Positives = 288/480 (60%), Gaps = 5/480 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR S ++ L R V+ EI+ LF + ++K+PGPDGYTS FFK W L D +AA+ Sbjct: 736 YRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGPDFIAAIQS 795 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF KG + + LN T+++LIPK + D+RPI+C NV+YK+I+KIL++R+ LL I Sbjct: 796 FFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILANRLKLLLPSFI 855 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 +QSAF+K R +M+N LA EL+K Y K +T RC +KID+ KA+D + W FL + L Sbjct: 856 LQNQSAFVKERLLMENVLLATELVKDYH-KESVTPRCAMKIDISKAFDSVQWQFLLNTLE 914 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 LNF F HWI C+++ATFS+ +NG GF RGLRQG +SP LF+ CM+ LS + Sbjct: 915 ALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHM 974 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 
1343 + +HPKC+ +THL FADDL++F G S+ + + +EF SGL I+ Sbjct: 975 IDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQIS 1034 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 KS I+L GV ++++ L F F G LPV+YLGLPL +K +TT DY+ LI + I Sbjct: 1035 LEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKI 1094 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGS-----SY 998 W+ +LS AGRL L+ SV+ + +W+ A LP I I KL FLW Sbjct: 1095 SSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKK 1154 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 ++W ++C P+KEGGLG++ LA NK K +W + + +LW+ WI +R W Sbjct: 1155 AKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFW 1214 Score = 76.6 bits (187), Expect = 4e-11 Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 2/94 (2%) Frame = -2 Query: 682 TSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLK--HSDIARGCV 509 T + + RT ++ W+K +W Y PK+S LWL +Q RL T DR+K +S C Sbjct: 1338 TKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCT 1397 Query: 508 LCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRN 407 LC++++ET DHLFF+C+ + VW + L N Sbjct: 1398 LCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTN 1431 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 398 bits (1023), Expect = e-108 Identities = 202/480 (42%), Positives = 291/480 (60%), Gaps = 5/480 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 +R S ++ L R V+ EI+ +F + DK+PGPDGYTS F+K +W+++ ++V+ A+ Sbjct: 160 FRCSEDDHRLLTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFYKASWEIIGDEVIIAIQS 219 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF+KG + + +N T+++LIPK + D+RPI+C NV+YK I+KIL++R+ +L + I Sbjct: 220 FFAKGFLPKGVNSTILALIPKKKEAREIKDYRPISCCNVLYKAISKILANRLKRILPKFI 279 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 +QSAF+K R +++N LA EL+K Y K I+ RC +KID+ KA+D + W FL VL Sbjct: 280 
VGNQSAFVKDRLLIENVLLATELVKDYH-KDSISTRCAMKIDISKAFDSLQWSFLTHVLA 338 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 +NF FIHWI C+++A+FSI +NG G+ R RGLRQG +SP LF+ MD LSR+ Sbjct: 339 AMNFPGEFIHWISLCMSTASFSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRM 398 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L A F +HP+C T +THL FADDL++ G S+ + L +F A GL I Sbjct: 399 LDKAAGAREFGYHPRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKIC 458 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K+ ++L GV + + + + F G LPV+YLGLPL +K LTT DY+ LI QI I Sbjct: 459 MEKTTLYLAGVSDHSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRI 518 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGS-----SY 998 W+ LS AGRL LI SVL + +W+ A LP IN I ++ LW Sbjct: 519 GMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKK 578 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 VSW +C P+KEGGLGL+ L NK K +W + + D+LW+KW L+ + W Sbjct: 579 AKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFW 638 Score = 86.7 bits (213), Expect = 4e-14 Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 2/154 (1%) Frame = -2 Query: 682 TSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLK--HSDIARGCV 509 T + + H RT ++ WHK +W ++ PKFS WLA++ RL T DR+ ++ CV Sbjct: 762 TKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCV 821 Query: 508 LCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREKAGSGIIRKAK 329 C S ET DHLFF C S +W+ I + +++ T SAV + + I Sbjct: 822 FCSSPMETRDHLFFQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLS 880 Query: 328 WVALGATVQYLWQARNLKYVAKKPFEVSHVIKEI 227 ++ +W+ RN + +K S++I++I Sbjct: 881 RYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQI 914 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 393 bits (1010), Expect = e-106 Identities = 200/499 (40%), Positives = 297/499 (59%), Gaps = 5/499 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR SP + 
EL S +IR ALF + +K+ GPDG+T+ FF +W ++ +V A+ E Sbjct: 433 YRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKE 492 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FFS G +L++ N T + LIPK + SDFRPI+C N +YK+I ++L+ R+ LL +I Sbjct: 493 FFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVI 552 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 S +QSAF+ GR++ +N LA +L+ Y I+ R M+K+DL+KA+D + W+F+ L Sbjct: 553 SSAQSAFLPGRSLAENVLLATDLVHGYNWSN-ISPRGMLKVDLKKAFDSVRWEFVIAALR 611 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L FI+WI C+++ TF+++INGG+ GF + +GLRQGDP+SP LF+ M+ S L Sbjct: 612 ALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNL 671 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 LH+R + +HPK I+HL FADD+++F G S+ + +TL++F + SGL +N Sbjct: 672 LHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVN 731 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K KSH++L G+ E +GFP GTLP++YLGLPL ++ L +Y L+ +I+ Sbjct: 732 KDKSHLYLAGLNQLES-NANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARF 790 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGSSY----- 998 W LS AGR++LI SV+ G +W+ LP I RI L +FLW + Sbjct: 791 RSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKG 850 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 VSW +CLP+ EGGLGLR L WNK L + +W + D+LW W H +L W Sbjct: 851 IKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFW 910 Query: 817 EFPFPKRDAPHITNILRIR 761 + D+ +L +R Sbjct: 911 AVEGGQSDSWTWKRLLSLR 929 Score = 60.1 bits (144), Expect = 4e-06 Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 7/144 (4%) Frame = -2 Query: 691 GKGTSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLK-----HSD 527 G ++ +E R K K W +IW PK++ +W++ RL T RL SD Sbjct: 1032 GFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSD 1091 Query: 526 
IARGCVLCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREKAGSG 347 CVLC + E+ DHL CE S VW + + R ++ + S + + R+ + Sbjct: 1092 ---ACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEA 1148 Query: 346 --IIRKAKWVALGATVQYLWQARN 281 ++RK + V LW+ RN Sbjct: 1149 PPLLRK---IVSQVVVYNLWRQRN 1169 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 385 bits (988), Expect = e-104 Identities = 206/525 (39%), Positives = 311/525 (59%), Gaps = 10/525 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR SP +++ L P S +I+ A F + +KA GPDG++ FF W ++ +V A+ E Sbjct: 330 YRCSPAQQVSLDTPFSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHE 389 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF+ G +L++ N T + LIPK ++ +SDFRPI+C N VYK+I+K+L+ R+ L I Sbjct: 390 FFTSGKLLKQWNATNLVLIPKITNASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAI 449 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 S SQSAF+ GR ++N LA EL+ Y +K I M+K+DLRKA+D + WDF+ L Sbjct: 450 SHSQSAFMPGRLFLENVLLATELVHGYNKKN-IAPSSMLKVDLRKAFDSVRWDFIVSALR 508 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 LN F WIL C+++A+FS+ +NG S G +GLRQGDPMSP LF+ M+ S L Sbjct: 509 ALNVPEKFTCWILECLSTASFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGL 568 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L +R + +HPK +I+HL FADD+++F G S+ + ++LE+F SGL +N Sbjct: 569 LQSRYTSGYIAYHPKTSQLEISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMN 628 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 +K+ ++ G+ E + + +GF G+LPV+YLGLPL S+ LT +YA LI +I+ Sbjct: 629 TNKTQLYHAGLSQSES-DSMASYGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARF 687 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGS-----SY 998 + W LS AGR++L+ SV+ G+ +W+ + LP I +I L +FLW S Sbjct: 688 NSWVVRLLSFAGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGI 747 Query: 997 
CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQ--D 824 V+W VCLP+ EGG+GLR AV N+ L+ + +W + + + +LW+ W H ++ G+ Sbjct: 748 AKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWVAW-HKQHSLGKSTS 806 Query: 823 IWEFPFPKRDAPHITNILRIR---DRLILDCGGNLNDAKTKLAGW 698 W P D+ + +LR+R +R I GN DA W Sbjct: 807 FWNQPEKPHDSWNWKCLLRLRVVAERFIRCNVGNGRDASFWFDNW 851 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 384 bits (985), Expect = e-103 Identities = 196/470 (41%), Positives = 282/470 (60%), Gaps = 5/470 (1%) Frame = -2 Query: 2212 LIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDEFFSKGLILRK 2033 L R VS EI+ LF + +DK+PGPDG+TS FFK++W++L + + A+ FF+ G + + Sbjct: 2 LTRVVSAEEIKKVLFSMPNDKSPGPDGFTSEFFKESWEILGPEFILAIQSFFALGFLPKG 61 Query: 2032 LNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLISPSQSAFIKG 1853 +N T+++LIPK + D+RPI+C NV+YK+I+KIL++R+ LL + I+ +QS+F+K Sbjct: 62 VNSTILALIPKKLESKEMKDYRPISCCNVMYKVISKILANRLKLLLPQFIAGNQSSFVKD 121 Query: 1852 RNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFIH 1673 R +++N LA +L+K Y K I+ RC +KID+ KA D + W FL + L ++F FIH Sbjct: 122 RLLIENVLLATDLVKDYH-KDSISERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIH 180 Query: 1672 WILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRLLHARTHASTF 1493 WI C+T+ +FS+ +NG GF + RGLRQG +SP LF+ CMD LS+LL Sbjct: 181 WIRLCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRI 240 Query: 1492 IHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAINKSKSHIFLGG 1313 +HP C +THL+FADDL++ G S+ + + + F+ SGL I+ KS IF G Sbjct: 241 GYHPHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAG 300 Query: 1312 VRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFIHRWSYSNLSR 1133 + + ++ F F G LP++YLGLPL +K L++ DYA LI QI I WS LS Sbjct: 301 LSSTSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSF 360 Query: 1132 AGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSYCPVSWKTVCL 968 AGR LI S++ +WL A LP I I KL FLW S +SW VC Sbjct: 361 AGRFNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCK 420 Query: 967 
PRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 P+ EGGLGLR L N K +W I + D+LW+KW+ L+ + W Sbjct: 421 PKSEGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEHNLLKREIFW 470 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 382 bits (982), Expect = e-103 Identities = 193/465 (41%), Positives = 284/465 (61%), Gaps = 7/465 (1%) Frame = -2 Query: 2191 GEIRTALFDIGDD--KAPGPDGYTSAFFKKNWDLLNNDVVAAVDEFFSKGLILRKLNHTV 2018 G + T+ DI ++ K+PGPDGYT FFK W +L D+V A+ FF KG + + +N T+ Sbjct: 601 GRVCTSHDDIKEEAHKSPGPDGYTVEFFKTAWPVLGRDLVIAIQSFFLKGFLPKGINTTI 660 Query: 2017 VSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLISPSQSAFIKGRNIMD 1838 ++LI K G+ D+RPI+C NV+YKI++K++++R+ +L I+P+QSAFIK R +M+ Sbjct: 661 LALISKKHEVSGMKDYRPISCCNVLYKIVSKLMANRLKEILPASIAPNQSAFIKDRLMME 720 Query: 1837 NFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFIHWILTC 1658 N LA EL+K Y K I++R +KID+ KA+D + W FL +VL ++ FIHWI C Sbjct: 721 NLLLASELVKDYH-KESISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELC 779 Query: 1657 VTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRLLHARTHASTFIHHPK 1478 + +A+FS+ +NG GF R +RGLRQG +SP L++ CM+ LS +L +HP+ Sbjct: 780 IGTASFSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPR 839 Query: 1477 CDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAINKSKSHIFLGGVRPYE 1298 C ++THL FADD+++F G S++ E+F A S L I+ KS IF+ G+ P Sbjct: 840 CRNMNLTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNA 899 Query: 1297 KLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFIHRWSYSNLSRAGRLE 1118 K IL+ F F GTLPVKYLGLPL +K +T DY L+ +I I W+ LS AGRL+ Sbjct: 900 KTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQ 959 Query: 1117 LIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSYCPVSWKTVCLPRKEG 953 LI+SVL + +WL LP + I K+ FLW + ++W VC ++EG Sbjct: 960 LIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEG 1019 Query: 952 GLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 GLGL+ L N+ K +W I + D+LW+KW++ +R + W Sbjct: 1020 GLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFW 1064 Score = 63.2 bits 
(152), Expect = 5e-07 Identities = 33/90 (36%), Positives = 48/90 (53%), Gaps = 2/90 (2%) Frame = -2 Query: 682 TSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRL-KHSDIAR-GCV 509 +S+ ++ R+ + W++ +W S PK+S WLA RL T D++ K + AR CV Sbjct: 1187 SSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCV 1246 Query: 508 LCDSSDETHDHLFFTCEKSLAVWSGICSWL 419 C ET DHLFF+C S VW + L Sbjct: 1247 FCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 382 bits (981), Expect = e-103 Identities = 192/480 (40%), Positives = 284/480 (59%), Gaps = 5/480 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 +R S ++ L R V+ E + LF + +K PGPDGYTS FFK W + D +AA+ Sbjct: 12 FRCSATDQDMLTREVTSEENQKVLFAMPSNKFPGPDGYTSEFFKATWSITGQDFIAAIKS 71 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF KG + + LN T+++LIPK + D+RPI+C NV+YK+I+KI+++R+ +L I Sbjct: 72 FFIKGFLPKGLNATILALIPKKDEATLMRDYRPISCCNVIYKVISKIIANRLKVMLPTFI 131 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 +QSAF++ R +++N LA EL+K Y K I+ RC +KID+ KA+D + W FL + L Sbjct: 132 LQNQSAFVRERLLIENVLLATELVKDYH-KDSISPRCAMKIDISKAFDSVQWQFLLNTLE 190 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 LNF F HWI C+++ATFS+ +NG GF +RGLRQG +SP LF+ CM+ LS + Sbjct: 191 ALNFPENFCHWIKLCISTATFSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHM 250 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 + +HPKC +THL FADDL++F G S+ + + +EF SGL I+ Sbjct: 251 IDVAAVHRNIGYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHIS 310 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 KS ++L GV + IL F F G LPV+YLGLPL +K +TT DY+ L+ ++ + I Sbjct: 311 LEKSTLYLAGVSELNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKI 370 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGS-----SY 998 W+ +LS AGRL LI SV+ + +W+ A LP I I KL FLW Sbjct: 371 
SSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKK 430 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 ++W ++C ++EGGLG++ L NK K +W + ++ +LW+ W+ +R W Sbjct: 431 AKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFW 490 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 380 bits (976), Expect = e-102 Identities = 192/481 (39%), Positives = 288/481 (59%), Gaps = 5/481 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR S ++ EL + + EI+ A + +K GPDGY+ FF+ W ++ +V+AA+ E Sbjct: 293 YRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHE 352 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF G +L++ N T + LIPKTS+ +S+FRPI+C N +YK+I+K+L+SR+ LL +I Sbjct: 353 FFDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVI 412 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 SQSAF+ GR++ +N LA E++ Y R I+ R M+K+DL+KA+D + W+F+ L Sbjct: 413 GHSQSAFLPGRSLAENVLLATEMVHGYNRLN-ISPRGMLKVDLKKAFDSVKWEFVTAALR 471 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L +I+WI C+T+ +F+I++NG + GF R +GLRQGDP+SP LF+ M+ S+L Sbjct: 472 ALAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKL 531 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L++R + +HPK I+HL FADD+++F G SM + +TL++F SGL +N Sbjct: 532 LYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVN 591 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K KS +F G+ E++ +GFP GT P++YLGLPL + L DY L+ ++S + Sbjct: 592 KDKSQLFQAGLDLSERI-TSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARL 650 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSY 998 W LS AGR +LI SV+ G+ +W+ LP I +I L KFLW G Sbjct: 651 RSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKS 710 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 VSW CLP+ 
EGGLG R WNK L + +W + + +LW +W L W Sbjct: 711 SKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFW 770 Query: 817 E 815 + Sbjct: 771 Q 771 Score = 60.8 bits (146), Expect = 3e-06 Identities = 45/169 (26%), Positives = 71/169 (42%), Gaps = 4/169 (2%) Frame = -2 Query: 691 GKGTSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLKHSDIARG- 515 G ++ +E R + K W K++W PK + W A RL T RL + Sbjct: 891 GFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSA 950 Query: 514 -CVLCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREK--AGSGI 344 C LC ET DHL C+ S VW + L R +++ + + + R+ A + Sbjct: 951 ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSL 1010 Query: 343 IRKAKWVALGATVQYLWQARNLKYVAKKPFEVSHVIKEIKLDVYRVLYS 197 +RK V V LW+ RNL + S V + + ++ V+ S Sbjct: 1011 LRK---VVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILS 1056 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 380 bits (976), Expect = e-102 Identities = 192/481 (39%), Positives = 288/481 (59%), Gaps = 5/481 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 YR S ++ EL + + EI+ A + +K GPDGY+ FF+ W ++ +V+AA+ E Sbjct: 293 YRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHE 352 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF G +L++ N T + LIPKTS+ +S+FRPI+C N +YK+I+K+L+SR+ LL +I Sbjct: 353 FFDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVI 412 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 SQSAF+ GR++ +N LA E++ Y R I+ R M+K+DL+KA+D + W+F+ L Sbjct: 413 GHSQSAFLPGRSLAENVLLATEMVHGYNRLN-ISPRGMLKVDLKKAFDSVKWEFVTAALR 471 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L +I+WI C+T+ +F+I++NG + GF R +GLRQGDP+SP LF+ M+ S+L Sbjct: 472 ALAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKL 531 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L++R + +HPK I+HL FADD+++F G SM + +TL++F SGL +N Sbjct: 532 
LYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVN 591 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K KS +F G+ E++ +GFP GT P++YLGLPL + L DY L+ ++S + Sbjct: 592 KDKSQLFQAGLDLSERI-TSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARL 650 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-----GSSY 998 W LS AGR +LI SV+ G+ +W+ LP I +I L KFLW G Sbjct: 651 RSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKS 710 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIW 818 VSW CLP+ EGGLG R WNK L + +W + + +LW +W L W Sbjct: 711 SKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFW 770 Query: 817 E 815 + Sbjct: 771 Q 771 Score = 59.7 bits (143), Expect = 6e-06 Identities = 44/169 (26%), Positives = 71/169 (42%), Gaps = 4/169 (2%) Frame = -2 Query: 691 GKGTSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDRLKHSDIARG- 515 G ++ +E R + K W +++W PK + W A RL T RL + Sbjct: 891 GFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSA 950 Query: 514 -CVLCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQREK--AGSGI 344 C LC ET DHL C+ S VW + L R +++ + + + R+ A + Sbjct: 951 ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSL 1010 Query: 343 IRKAKWVALGATVQYLWQARNLKYVAKKPFEVSHVIKEIKLDVYRVLYS 197 +RK V V LW+ RNL + S V + + ++ V+ S Sbjct: 1011 LRK---VVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILS 1056 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 373 bits (958), Expect = e-100 Identities = 214/610 (35%), Positives = 331/610 (54%), Gaps = 12/610 (1%) Frame = -2 Query: 2029 NHTVVSLIPKTSHDPG--VSDFRPIACTNVVYKIITKILSSRMAPLLQRLISPSQSAFIK 1856 ++ + +P S G +S +RP++C NV+YKII+KI+++R+ +L + I+ +Q+AF+K Sbjct: 35 SYICIHFLPLLSSPTGHFISHYRPLSCCNVIYKIISKIIANRLKMVLPKFIAGNQTAFVK 94 Query: 1855 GRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFI 1676 R +++N LA EL+K Y K +++RC +KID+ KA++ + W F+R++L ++F F+ Sbjct: 95 DRLLIENLLLATELVKDYH-KESVSSRCAIKIDISKAFNSVQWSFIRNILLSMDFPMEFV 153 Query: 1675 
HWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRLLHARTHAST 1496 HWI+ C+++A+FS+ +NG GF + +RGLRQG +SP LF+ MD LS+LL A Sbjct: 154 HWIMLCISTASFSVQVNGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKK 213 Query: 1495 FIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAINKSKSHIFLG 1316 F +H +C +THL+FADDL++ G S+ + + + F SGL I+ KS I+L Sbjct: 214 FGYHSRCKELSLTHLSFADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLA 273 Query: 1315 GVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFIHRWSYSNLS 1136 GV EI + F G LPV+YLGLPL +K LT DY+ L+ I I W+ LS Sbjct: 274 GVTEDVYHEIQNRYQFDVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLS 333 Query: 1135 RAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLW-GSSYCP----VSWKTVC 971 AGRL LI SVL + +WL A LP I I K+ FLW G P V W VC Sbjct: 334 YAGRLNLITSVLWSICNFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVC 393 Query: 970 LPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLWIKWIHAEYLRGQDIWEFPFPKRDA 791 P++EGGLGLR L N+ K +W I + T++LW++WI L+ W Sbjct: 394 KPKQEGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFW-------SV 446 Query: 790 PHITNILRIRDRLILDCGGNLNDAKTKLAGWFTGKGTSEAYEHFRTKGEKKFWHKAIWRS 611 TN+ + R G ++ K + T + + R WH IW + Sbjct: 447 QTTTNMDSVLWR------GRNDEYMPKFS-------TRDTWNQTRNTSTPVTWHMGIWFA 493 Query: 610 YIPPKFSVTLWLALQGRLKTLDRLK--HSDIARGCVLCDSSDETHDHLFFTCEKSLAVWS 437 + PKFS WLA+Q RL T D++ + ++ CVLC+++ ET +HLFF+C + +W Sbjct: 494 HATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWE 553 Query: 436 GICSWL---RCRNQMITIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVA 266 + + + TI ++V R + S + R AT+ +W RN + Sbjct: 554 NLAKNIYKAKFSTNWSTILTSVSTTWRNRTESFLAR----YIFQATIHTIWHERNGRRHG 609 Query: 265 KKPFEVSHVI 236 ++ +H+I Sbjct: 610 ERSNSATHLI 619 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 367 bits (942), Expect = 1e-98 Identities = 188/465 (40%), Positives = 280/465 (60%), Gaps = 5/465 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 +R S ++ +L R S +I+ A F + +KA GPDGY+S FFK W ++ +V AV E Sbjct: 434 
FRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQE 493 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF G +L++ N T + LIPK ++ ++DFRPI+C N +YK+I K+L+SR+ LL +I Sbjct: 494 FFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVI 553 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 SPSQSAF+ GR + +N LA E++ Y K I++R M+K+DLRKA+D + WDF+ Sbjct: 554 SPSQSAFLPGRLLSENVLLATEIVHGYNTKN-ISSRGMLKVDLRKAFDSVRWDFIISAFR 612 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L F+ WI C+++ FS+ +NG S GF + +GLRQGDP+SP LF+ M+ S L Sbjct: 613 ALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQGDPLSPYLFVLAMEVFSSL 672 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L AR A +HPK I+HL FADD+++F G S+ + + L++F + SGL +N Sbjct: 673 LKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSSSLHGISEALDDFASWSGLHVN 732 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K K++++L G E L I +GFP TLP++YLGLPL S+ L +Y ++ Sbjct: 733 KDKTNLYLAGTDEVEALAISH-YGFPISTLPIRYLGLPLMSRKLKISEY-----ELVKRF 786 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGSSY----- 998 W+ +LS AGR++LI SV+ G+ +W+ L + +I L +FLW S Sbjct: 787 RSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKKIESLCSRFLWSGSIDASKG 846 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLW 863 ++W VCLP+ EGG+GLR WNK + + +W + A D LW Sbjct: 847 AKIAWSGVCLPKNEGGVGLRRFTPWNKTFYLRFIWPLFADNDVLW 891 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 366 bits (940), Expect = 2e-98 Identities = 212/569 (37%), Positives = 311/569 (54%), Gaps = 14/569 (2%) Frame = -2 Query: 1945 VYKIITKILSSRMAPLLQ------RLISPSQSAFIKGRNIMDNFYLAEELIKTYERKRGI 1784 V K + +L SR + L R + +Q+AF+ G+ + D+ LA EL++ YERK G Sbjct: 345 VLKFYSALLGSRESNLAGLNIPAIRNVGKNQAAFVPGQQLHDHVMLAFELLRGYERKHG- 403 Query: 1783 TARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFIHWILTCVTSATFSIAINGGSHGFV 1604 T +CM++ID++KAYD + WD L +L L F FI WI+ V S T+ ING + 
Sbjct: 404 TPKCMLQIDIQKAYDTVHWDALEHILRELGFPDQFIKWIMIAVRSVTYVFNINGRFTRRL 463 Query: 1603 RGQRGLRQGDPMSPTLFLFCMDYLSRLLHARTHASTFIHHPKCDTTDITHLAFADDLLLF 1424 +RG+RQGDP+SP LF+ M+YL+R+L F +H KC+ IT+L FADDLLLF Sbjct: 464 EARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSKCEKMKITNLCFADDLLLF 523 Query: 1423 GRGDPDSMRVLRDTLEEFTATSGLAINKSKSHIFLGGVRPYEKLEILELFGFPEGTLPVK 1244 RGD S++++ D F + GL +N SK +I+ G V K ++L + GF EG +P + Sbjct: 524 SRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISGFKEGKMPFR 583 Query: 1243 YLGLPLASKSLTTPDYASLITQISNFIHRWSYSNLSRAGRLELIRSVLQGVECYWLQALP 1064 YLG+PL+SK L Y LI +I I WS LS AGR++LI+SV+ +W+Q LP Sbjct: 584 YLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLP 643 Query: 1063 LPGTVINRITKLIRKFLW-GSS----YCPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKT 899 LP VI RI + R FLW G+S P++W+ VC P+ GGL + +LA+WNK K Sbjct: 644 LPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKL 703 Query: 898 LWNIHAKTDTLWIKWIHAEYLRGQDIWEFPFPKRDAPHITNILRIRDRLILDCGGNLNDA 719 LWN+ K+D LWIKW+H Y+RGQ IW K + +++++++R L+L + D Sbjct: 704 LWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLRP-LLLQYQSRMQDV 762 Query: 718 -KTKLAGWFTGKGTSEAYEHFRTKGEKKFWHKAIWRSYIPPKFSVTLWLALQGRLKTLDR 542 K K + Y + EK W + + P+ LW A RL + DR Sbjct: 763 FKMK-----------KIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDR 811 Query: 541 LKH--SDIARGCVLCDSSDETHDHLFFTCEKSLAVWSGICSWLRCRNQMITIPSAVRRFQ 368 L ++ C C SS E+H+HLFF C + +W+ + +WL+ + T + Sbjct: 812 LIKFGLNVDANCAFC-SSMESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWIT 870 Query: 367 REKAGSGIIRKAKWVALGATVQYLWQARN 281 R+ G G A T+ ++W RN Sbjct: 871 RKCKGKGWRAMLLKCAFTETIYHIWAYRN 899 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 365 bits (936), Expect = 6e-98 Identities = 187/465 (40%), Positives = 279/465 (60%), Gaps = 5/465 (1%) Frame = -2 Query: 2242 YRLSPEERMELIRPVSLGEIRTALFDIGDDKAPGPDGYTSAFFKKNWDLLNNDVVAAVDE 2063 +R S ++ +L R S +I+ A F + +KA GPDGY+S FFK W ++ +V AV E Sbjct: 434 
FRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQE 493 Query: 2062 FFSKGLILRKLNHTVVSLIPKTSHDPGVSDFRPIACTNVVYKIITKILSSRMAPLLQRLI 1883 FF G +L++ N T + LIPK ++ ++DFRPI+C N +YK+I K+L+SR+ LL +I Sbjct: 494 FFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVI 553 Query: 1882 SPSQSAFIKGRNIMDNFYLAEELIKTYERKRGITARCMVKIDLRKAYDCISWDFLRDVLY 1703 SPSQSAF+ GR + +N LA E++ Y K I++R M+K+DLRKA+D + WDF+ Sbjct: 554 SPSQSAFLPGRLLSENVLLATEIVHGYNTKN-ISSRGMLKVDLRKAFDSVRWDFIISAFR 612 Query: 1702 GLNFHPCFIHWILTCVTSATFSIAINGGSHGFVRGQRGLRQGDPMSPTLFLFCMDYLSRL 1523 L F+ WI C+++ FS+ +NG S GF + +GLRQGDP+SP LF+ M+ S L Sbjct: 613 ALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQGDPLSPYLFVLAMEVFSSL 672 Query: 1522 LHARTHASTFIHHPKCDTTDITHLAFADDLLLFGRGDPDSMRVLRDTLEEFTATSGLAIN 1343 L AR A +HPK I+HL FADD+++F G S+ + + L++F + SGL +N Sbjct: 673 LKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSSSLHGISEALDDFASWSGLHVN 732 Query: 1342 KSKSHIFLGGVRPYEKLEILELFGFPEGTLPVKYLGLPLASKSLTTPDYASLITQISNFI 1163 K K++++L G E L I +GFP TLP++YLGLPL S+ L +Y ++ Sbjct: 733 KDKTNLYLAGTDEVEALAISH-YGFPISTLPIRYLGLPLMSRKLKISEY-----ELVKRF 786 Query: 1162 HRWSYSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVINRITKLIRKFLWGSSY----- 998 W+ +LS AGR++LI SV+ G+ +W+ L + +I L +FLW S Sbjct: 787 RSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKKIESLCSRFLWSGSIDASKG 846 Query: 997 CPVSWKTVCLPRKEGGLGLRDLAVWNKALHSKTLWNIHAKTDTLW 863 ++W VCLP+ EGG+ LR WNK + + +W + A D LW Sbjct: 847 AKIAWSGVCLPKNEGGVALRRFTPWNKTFYLRFIWPLFADNDVLW 891