BLASTX nr result
ID: Zingiber23_contig00022992
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Zingiber23_contig00022992 (5654 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga... 621 e-174 emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga... 619 e-174 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 577 e-161 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 566 e-158 dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 555 e-155 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 525 e-145 gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali... 513 e-142 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 503 e-139 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 503 e-139 dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal... 497 e-137 dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ... 492 e-136 emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li... 486 e-134 gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00... 486 e-134 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 453 e-124 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 452 e-124 gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip... 446 e-122 ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298... 442 e-121 gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] 433 e-118 gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] 428 e-116 emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694... 419 e-114 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 621 bits (1602), Expect = e-174 Identities = 362/1098 (32%), Positives = 586/1098 (53%), Gaps = 21/1098 (1%) Frame = +1 Query: 2278 MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANN 2457 M SWN+RG + K +++ L +I V A +ET++ + + + NN Sbjct: 1 MLCVSWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNN 60 Query: 2458 FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 2637 +++S R RI + W V++ + Q M C ++ +A VYG H+I R+ L Sbjct: 61 YSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQDQSHKLKMVA--VYGLHTIADRKSL 118 Query: 2638 WNSLTLLGENIAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIG 2817 W+ L + +P II+GD+N++ D+ G V++ E D ++F+ L++ RS Sbjct: 119 WSGLLQCVQQ-QDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRSTW 177 Query: 2818 CRLTWTNGSVS-----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREK 2982 +W+N S+ S++D+A VN WL E V++LPPG +SDHS + + + Sbjct: 178 SYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPG-ISDHSPLLFNLMTGRP 236 Query: 2983 RCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI 3162 + +PFKF N+ F + V +W+ S N +F L+A LK REL ++ Q I Sbjct: 237 QGGKPFKFMNVMAEQGEFLETVEKAWN---SVNGRFKLQAIWLNLKAVKRELKQMKTQKI 293 Query: 3163 SEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRATL-----LAEAEKLFYQQRAKSVYL 3327 + K++ Q ND ++ + + E QQ+++ +L Sbjct: 294 GLAHEKVKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWL 353 Query: 3328 KNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLG-KTGQCT 3504 + D +K F VK + N I L + G DEV EE + +++ LLG + Sbjct: 354 QQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLM 413 Query: 3505 PLDSNTISGNHLSQAQ-REELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKI 3681 +D NT+ G AQ +E L+ + + EI AL IG DKAPG DGFNA F+ +W I Sbjct: 414 GVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSI 473 Query: 3682 GDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSA 3861 + A ++EFF++ ++ + N + L+PK H+ V ++RPI+CCTV YK+ISK+L+ Sbjct: 474 KQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTN 533 Query: 3862 RLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVD 4041 R+ ++ +++ AQ F+ R I +NI LA EL+R Y RK +SPRC++KVD++KA+D+V+ Sbjct: 534 RMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVE 593 Query: 4042 WGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFG 4221 W FL + L +GFP +F GWI EC+STVSYSV +NG F R+GLRQGDP+SP LF Sbjct: 594 WSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFA 653 Query: 4222 LCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDF 4401 LC+E SR ++E F++HP+C + +THL +ADDLL+F RAD SSL+ + F Sbjct: 654 LCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKF 713 Query: 4402 GDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGS 4581 +GL A+ KSN+Y G++++ EL Q G PFRYLG+PL +++L A Sbjct: 714 SHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKP 773 Query: 4582 LLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW 4761 L+E IT + +W LS+AGR++LI+S+L ++ +W I P+ VI + + R F+W Sbjct: 774 LVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLW 833 Query: 4762 S-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQ 4926 + TK P++W+TI PK GG+ + +++ WN + + LW I+ +RD LWV+WIH Sbjct: 834 TGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHS 893 Query: 4927 HYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGTALAY 5106 +Y++ +D + +++ +++ RD L ++IG E + AY Sbjct: 894 YYIKRQDILTVNISNQTTWILRKIVKARDHL----SNIG----DWDEICIGDKFSMKKAY 945 Query: 5107 DFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCALCQQH 5277 +V W +I + PK +F +W++ H +L T D +S + LC+ Sbjct: 946 KKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRND 1005 Query: 5278 EESNGHIFFECAIAKSLLDRVVRWMDIKFDITSMAKLL-DVFQRHYKGKGRQVKARYLAV 5454 E+ H+FF C+ + + ++ M S +++ V + K KG+ + Y Sbjct: 1006 GETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKKKGKLIVMLY--- 1062 Query: 5455 SSMIYLIWEARNRSRFEG 5508 + +Y IW+ RN+ F G Sbjct: 1063 TEFVYAIWKQRNKRTFTG 1080 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 619 bits (1597), Expect = e-174 Identities = 366/1102 (33%), Positives = 581/1102 (52%), Gaps = 24/1102 (2%) Frame = +1 Query: 2278 MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANN 2457 MKI +WN+RG + +K V+H L ++I + + ET++ I ++ + NN Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60 Query: 2458 FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 2637 + S RGRI + W + V+++VL T Q + V+ S +F VYG H+I R+ L Sbjct: 61 YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120 Query: 2638 WNSLTLLGENIAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIG 2817 W L EP I++GDYN++ + QD+ G VS E DL FV L++ + G Sbjct: 121 WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180 Query: 2818 CRLTWTNGSV-----SSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREK 2982 +W N S+ SS++D++ VN W+ + VE+ G +SDHS I + + Sbjct: 181 LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIFNLATQHD 239 Query: 2983 RCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI 3162 RPFKF N GF ++V +W S N +F +K RL+ R L H + Sbjct: 240 EGGRPFKFLNFLADQNGFVEVVKEAWG---SANHRFKMKNIWVRLQAVKRALKSFHSKKF 296 Query: 3163 SEQALRAKSVLEDAQRSLLSGAGPPNDYEVIR---KRATLLAEAEKL------FYQQRAK 3315 S +A +E+ +R L + P +V + L+A+ K +Q+++ Sbjct: 297 S----KAHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSR 352 Query: 3316 SVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLG-KT 3492 +L D +KFF +K RN I+ L GD+ T E+ E +++ LLG + Sbjct: 353 IQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSS 412 Query: 3493 GQCTPLDSNTIS-GNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAA 3669 Q +D + + G LS +LV PIT +EI AL DI KAPG DGFN+ F+ + Sbjct: 413 SQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKS 472 Query: 3670 WDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISK 3849 W I + + +FF +G + K N T++ LIPK D + DYRPI+CC+ YK+ISK Sbjct: 473 WLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISK 532 Query: 3850 VLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAF 4029 +L+ RL V++ ++D AQ F+ +R IG+NI LA EL+R Y R+ +SPRC+IKVD++KA+ Sbjct: 533 ILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKAY 592 Query: 4030 DTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSP 4209 D+V+W FL S L GFP F WI C+ TVSYS+ LNG F ++GLRQGDPLSP Sbjct: 593 DSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSP 652 Query: 4210 LLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGC 4389 LF L +E SR M + F++HP+C ++LTHL +ADDLL+FARAD SS++ IM Sbjct: 653 FLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAA 712 Query: 4390 LTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIA 4569 F +GL+A+ KS +Y G+ + +L G PFRYLG+PLA+++L + Sbjct: 713 FNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFS 772 Query: 4570 NYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISR 4749 L++ IT + W H LS+AGR++L++++L ++ +W I P+P +I + R Sbjct: 773 QCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCR 832 Query: 4750 SFVW-----STKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVK 4914 F+W ++ P++W + PK GG + ++ WN + + LW I ++D LWV+ Sbjct: 833 KFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVR 892 Query: 4915 WIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGT 5094 W++ +Y++ ++ + + S +++ + + R+ LL RT A + + Sbjct: 893 WVNAYYIKRQNIENVTVSSNTSWILRKIFESRE-LLTRTGGWEAVSNHMNFSIKK----- 946 Query: 5095 ALAYDFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCAL 5265 Y + V W +I + PK +F +WL +L TA+ +S + S +C + Sbjct: 947 --TYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKM 1004 Query: 5266 CQQHEESNGHIFFECAIAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARY 5445 C E+ H+FF C +K + +V+ +++++ + AK ++ K R K Sbjct: 1005 CGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADAQAKKELAIKKARSTKDRN-KLYV 1063 Query: 5446 LAVSSMIYLIWEARNRSRFEGI 5511 + + +Y IW RN F GI Sbjct: 1064 MMFTESVYAIWLLRNAKVFRGI 1085 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 577 bits (1486), Expect = e-161 Identities = 328/954 (34%), Positives = 506/954 (53%), Gaps = 21/954 (2%) Frame = +1 Query: 2290 SWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNFNYS 2469 SWN+RGF+ +++ + K + + +ET++ + + F G N+ ++ Sbjct: 6 SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFA 65 Query: 2470 NRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSL 2649 GRI ++WD V + VL + Q + C V+ F+ TFVY + RR LW+ L Sbjct: 66 ALGRIWVVWDP-AVEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSEL 124 Query: 2650 TLLGENIA---EPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGC 2820 LL N +PWII+GD+N L D G + + + T + DL G Sbjct: 125 ELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGN 184 Query: 2821 RLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRC 2988 TW N ++ K+DR +VN WLIA SY F SDH V+ + Sbjct: 185 HYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAME-FSDHCPSCVNISNQSGGR 243 Query: 2989 NRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDKLHFQHIS 3165 N+PFK N H F + + +WD G+A F L K LKG +R ++ H+ + Sbjct: 244 NKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLE 303 Query: 3166 EQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRA----TLLAEAEKLFYQQRAKSVYLKN 3333 ++ ++A L+ Q +LL A P + + K A LA AE+ F Q+++ ++LK Sbjct: 304 KRVVQAAQNLKTCQNNLL--AAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKC 361 Query: 3334 SDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLD 3513 D T FFH ++ N I L Q+G + DE+ V +F++L G + + Sbjct: 362 GDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAE 421 Query: 3514 S----NTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKI 3681 N+++ + R+ L ++ +IK+ F + ++K+PGPDG+ ++F+ W + Sbjct: 422 GISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIV 481 Query: 3682 GDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSA 3861 G IAAV+EFF G+LL QWN T++ ++PK ++ +T++RPISCC YKVISK+L+ Sbjct: 482 GPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISKLLAR 541 Query: 3862 RLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVD 4041 RL N++ + P+Q AFVK R + EN+ LA EL++ + + IS R ++KVDL+KAFD+V Sbjct: 542 RLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSRGVLKVDLRKAFDSVG 601 Query: 4042 WGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFG 4221 WGF++ L P +F WI +CI++ S+S++++G L G+F G +GLRQGDPLSP LF Sbjct: 602 WGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFV 661 Query: 4222 LCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDF 4401 + +E+ SR ++ YHP+ +EV ++ LA+ADDL++F SSL I L F Sbjct: 662 IAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESF 721 Query: 4402 GDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGS 4581 + +GL N KS +Y AG+E+ + + L GF G FPFRYLG+PL +LR ++Y Sbjct: 722 KNLSGLEMNTEKSAVYTAGLEDTDKEDTLAF-GFVNGTFPFRYLGLPLLHRKLRRSDYSQ 780 Query: 4582 LLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW 4761 L++ I + W TLS AGR++LI SV+ FWLS +P + I + F+W Sbjct: 781 LIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLW 840 Query: 4762 STK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQ 4926 +SW CLPK EGG GLR+ WN TL R +W + +RRDSLWV W H Sbjct: 841 GNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHA 900 Query: 4927 HYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGEN 5088 + LR +FW+ S + K +L +R + + L+ W+ N Sbjct: 901 NRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSN 954 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 566 bits (1458), Expect = e-158 Identities = 347/963 (36%), Positives = 508/963 (52%), Gaps = 27/963 (2%) Frame = +1 Query: 2293 WNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNFNYSN 2472 WNIRGF+ G + +K + IET + + G N+ +S+ Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67 Query: 2473 RGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLT 2652 G+I ++WD V + V+ + Q + C V + + + VY + + +R+ LW + Sbjct: 68 LGKIWVMWDPS-VQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIV 126 Query: 2653 ------LLGENIAEPWIIMGDYNSLLTVQDKEGGQPVS---NYELHDLERFVHTCGLVDL 2805 ++G+ PW+++GD+N +L Q E PVS + + D + L DL Sbjct: 127 NMVVSGIIGDR---PWLVLGDFNQVLNPQ--EHSNPVSLNVDINMRDFRDCLLAAELSDL 181 Query: 2806 RSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDH-SCGIVSTL 2970 R G TW N S V+ K+DR +VN W A + S + SDH SCG+V Sbjct: 182 RYKGNTFTWWNKSHTTPVAKKIDRILVNDSWN-ALFPSSLGIFGSLDFSDHVSCGVVLEE 240 Query: 2971 AREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDKL 3147 K RPFKF+N + F +LV +W + G++ F + KL LK +++ +L Sbjct: 241 TSIK-AKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRL 299 Query: 3148 HFQHISEQALRAKSVLEDAQRSLLSGAGPPN-DYEVIRKRAT-LLAEAEKLFYQQRAKSV 3321 ++ + ++ A L Q L+ P N +E+ +R +L AE+ F++Q+++ Sbjct: 300 NYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFRQKSRIS 359 Query: 3322 YLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQC 3501 + D TK+FH + N N+I L +G S + + + YF LLG + Sbjct: 360 WFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGD--EV 417 Query: 3502 TPL-----DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTA 3666 P D N + S AQ EL +NE+I+ ALF + +K+ GPDGF A+F+ Sbjct: 418 DPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFID 477 Query: 3667 AWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVIS 3846 +W +G + A+KEFF G LLKQWN T+I LIPK + C +D+RPISC YKVI+ Sbjct: 478 SWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIA 537 Query: 3847 KVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKA 4026 ++L+ RL ++S ++ AQ AF+ RS+ EN+ LA +L+ Y ISPR M+KVDLKKA Sbjct: 538 RLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVDLKKA 597 Query: 4027 FDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLS 4206 FD+V W F+++AL P++F WI +CIST +++VS+NGG GFF +GLRQGDPLS Sbjct: 598 FDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLS 657 Query: 4207 PLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMG 4386 P LF L +E FS + HYHP+ + + ++HL +ADD+++F SL+ I Sbjct: 658 PYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICE 717 Query: 4387 CLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRI 4566 L DF +GL+ N KS+LY+AG+ QL GF G P RYLG+PL +LRI Sbjct: 718 TLDDFASWSGLKVNKDKSHLYLAGL-NQLESNANAAYGFPIGTLPIRYLGLPLMNRKLRI 776 Query: 4567 ANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGIS 4746 A Y LLE IT + SW LS AGRI+LI SV+ G FW+S +P G I +I + Sbjct: 777 AEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLC 836 Query: 4747 RSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWV 4911 F+WS K +SW+ +CLPK EGG GLR L WN TL R +W + +DSLW Sbjct: 837 SRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWA 896 Query: 4912 KWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENG 5091 W H H+L FW G SDS K LL +R + Q +K W+ +N Sbjct: 897 DWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWY---DNW 953 Query: 5092 TAL 5100 T+L Sbjct: 954 TSL 956 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 555 bits (1431), Expect = e-155 Identities = 322/939 (34%), Positives = 502/939 (53%), Gaps = 31/939 (3%) Frame = +1 Query: 2293 WNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNFNYSN 2472 WN+RG +K+ KH ++ +++ Q +ET++ + + ++ + F + N+ ++ Sbjct: 6 WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNR 65 Query: 2473 RGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLT 2652 RGRI +LW + V L + + Q + C V+ F +FVY + + R+ LW+ L Sbjct: 66 RGRIWVLWRKN-VRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELK 124 Query: 2653 LLGENIA---EPWIIMGDYNSLLTVQDKEGG--QPVSNYELHDLERFVHTCGLVDLRSIG 2817 ++ +PW ++GD+N L + + P+ + D ++ ++ C L D+ + G Sbjct: 125 DHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQG 184 Query: 2818 CRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIV--STLARE 2979 TW N G + KLDR ++N W +SY F GC C I S + Sbjct: 185 PLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAGGCSDHLRCRISLNSEAGNK 244 Query: 2980 KRCNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLKGKLRELDKL 3147 + +PFKF N T E F+ +V++ W + +S + F L LK K+R + + Sbjct: 245 VQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSMARD 304 Query: 3148 HFQHISEQALRAKSVLEDAQRSLLSGAGPP--NDYEVIRKRATLLAEAEKLFYQQRAKSV 3321 ++S++A A +L Q L+ + R +A E+ + +Q++K Sbjct: 305 RLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDRVAILEEKYLKQKSKLH 364 Query: 3322 YLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQC 3501 + + D+ TK FH N I ++ G T DE+ E +F++ L Q Sbjct: 365 WCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFL----QL 420 Query: 3502 TPLDSNTISGNHLSQ--------AQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKF 3657 P D ++ L Q A ++ L+ P+T EEI+ LF + +DK+PGPDG+ ++F Sbjct: 421 IPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEF 480 Query: 3658 YTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYK 3837 + A W+ IGD+F AV+ FF G L K N T +ALIPK + + DYRPISCC V YK Sbjct: 481 FKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYK 540 Query: 3838 VISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDL 4017 VISK+++ RL V+ + Q AFVKDR + EN+ LA EL++ Y + IS RC IK+D+ Sbjct: 541 VISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDI 600 Query: 4018 KKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGD 4197 KAFD+V W FL++ GFP++F WI+ CI+T S+SV +NG L G+F RGLRQG Sbjct: 601 SKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGC 660 Query: 4198 PLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNL 4377 LSP LF +C++V S+ + +A+ F YHP+C + LTHL++ADDL++ + + S+ Sbjct: 661 ALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIER 720 Query: 4378 IMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATER 4557 I+ +F +GLR + KS +Y+AG+ R E+ F G P RYLG+PL T+R Sbjct: 721 IIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKR 780 Query: 4558 LRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIY 4737 L + LLE + K++ SW LS+AGR+ LI SVL + FWL+ +P I ++ Sbjct: 781 LSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELE 840 Query: 4738 GISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDS 4902 + +F+WS + ISW +C PK+EGG GLR L+ N + +W I S +S Sbjct: 841 KMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNS 900 Query: 4903 LWVKWIHQHYLRSEDFWHWRGKPSDSPLI-KCLLQIRDV 5016 LWVKW+ QH LR+ FW + S I K LL+ R+V Sbjct: 901 LWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREV 939 Score = 62.4 bits (150), Expect = 2e-06 Identities = 41/137 (29%), Positives = 60/137 (43%), Gaps = 9/137 (6%) Frame = +1 Query: 5116 RPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKM---CALCQQHEES 5286 R +VPW VIW S PK+ F WL AHG+L T D + + + C CQ E+ Sbjct: 1048 RSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLET 1107 Query: 5287 NGHIFFECAIAKSLLDRVVRWMDI-----KFDITS-MAKLLDVFQRHYKGKGRQVKARYL 5448 H+FF C+ V W+D+ K TS +++ + RY+ Sbjct: 1108 RDHLFFTCSFTS------VIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYV 1161 Query: 5449 AVSSMIYLIWEARNRSR 5499 + IY++W RN R Sbjct: 1162 -FQATIYIVWRERNGRR 1177 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 525 bits (1351), Expect = e-145 Identities = 313/830 (37%), Positives = 450/830 (54%), Gaps = 22/830 (2%) Frame = +1 Query: 2593 TFVYGFHSIMARRPLWNSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVS-NYEL 2760 +FVY + R+ LWN + + I +PW ++GD+N +L + + + Sbjct: 4 SFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRPT 63 Query: 2761 HDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPP 2928 + L DL G TW N V+ KLDR +VN +W S F P Sbjct: 64 RIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLFGEP 123 Query: 2929 GCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSW-DEPISGNAQFALKAK 3105 SDHS +S ++ R +PF+F N E F L+ W ++G+A + + K Sbjct: 124 D-FSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVK 182 Query: 3106 LTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPN---DYEVIRKRATLL 3276 L LK +R+ + ++ I ++ A L AQ LL+ P N + E RK +L Sbjct: 183 LKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEAETQRKWR-IL 241 Query: 3277 AEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEE 3456 AEAE F+ QR++ +L+ D + +FH + N I L+ GD + Sbjct: 242 AEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENH 301 Query: 3457 FVGYFQDLLGKTGQCTPL----DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADK 3624 V YFQ LG Q PL D + + S AQ+ L P ++E+IKNA F + +K Sbjct: 302 CVEYFQSNLGSE-QGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNK 360 Query: 3625 APGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDY 3804 A GPDGF+ +F+ A W IG + A+ EFF GKLLKQWN T++ LIPK ++ ++D+ Sbjct: 361 ASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDF 420 Query: 3805 RPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKR 3984 RPISC YKVISK+L+ RL + + + + +Q AF+ R EN+ LA EL+ Y +K Sbjct: 421 RPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKN 480 Query: 3985 ISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGF 4164 I+P M+KVDL+KAFD+V W F++SAL P++F WI EC+ST S+SV LNG G Sbjct: 481 IAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSAGH 540 Query: 4165 FSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLL 4344 F +GLRQGDP+SP LF L +EVFS ++ YHP+ +++E++HL +ADD+++ Sbjct: 541 FWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMI 600 Query: 4345 FARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPF 4524 F SSL+ I+ L DF +GL N K+ LY AG+ Q + + GF+ G P Sbjct: 601 FFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGL-SQSESDSMASYGFKLGSLPV 659 Query: 4525 RYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSIL 4704 RYLG+PL + +L IA Y L+E IT + SW LS AGR++L+ SV+ G+ FW+S Sbjct: 660 RYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSF 719 Query: 4705 PIPSGVIDKIYGISRSFVWSTK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLLCR 4869 +P G I KI + F+WS++ ++WS +CLPK EGG GLR N TL R Sbjct: 720 ILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLR 779 Query: 4870 TLWNIQSRRDSLWVKWIHQHYL-RSEDFWHWRGKPSDSPLIKCLLQIRDV 5016 +W + S SLWV W QH L +S FW+ KP DS KCLL++R V Sbjct: 780 MIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVV 829 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 513 bits (1321), Expect = e-142 Identities = 346/1109 (31%), Positives = 527/1109 (47%), Gaps = 45/1109 (4%) Frame = +1 Query: 2380 IETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRV 2559 +ET++ + + +I + F M +N+ ++ GRI ++W V L V+ +SQ + C V Sbjct: 337 LETRVIESKVPVIFAKVFKDWQMVSNYEFNRLGRIWVVWSSS-VQLQVIFKSSQMIVCLV 395 Query: 2560 RCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIA---EPWIIMGDYNSLLTVQDKE 2730 R D F+ +F+Y + + R+ LW L L ++A +PW++ GD+N L +++ Sbjct: 396 RVEHYDVEFICSFIYASNFVEERKKLWQDLHNLQNSVAFRNKPWLLFGDFNETLKMEEHS 455 Query: 2731 GG--QPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVNSQWLI 2892 P+ + D + V C L D+R+ G TW N G + KLDR ++N ++ Sbjct: 456 SYAVSPMVTPGMRDFQIVVRYCSLEDMRTHGPLFTWGNKRNEGLICKKLDRVLLNPEYNS 515 Query: 2893 ADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSWDEPI 3072 A SY GC SDH G + ++ PFKF N+ H F V W Sbjct: 516 AYPHSYCIMDSGGC-SDHLRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDFWKNTT 574 Query: 3073 ----SGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPN 3240 S + F KL LK L++L + ++S+ RA E+ R N Sbjct: 575 ELFPSTSTLFRFSKKLKELKPILKDLSR---NNLSDLTRRATYAYEELCRCQTKSLTTLN 631 Query: 3241 DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSG 3420 ++++ E L +++ K +L N AI ++ G Sbjct: 632 PHDIVD---------ESLAFERWEKERHLLN-------------------AIHEVMDPQG 663 Query: 3421 DETTSLDEVAEEFVGYFQDLLGKTGQCTPLDSNTISGNHL--------SQAQREELVVPI 3576 + D++ E V +F DLL P D IS + L S ++ LV I Sbjct: 664 TRPPNQDDIKIEAVRFFSDLLSSQ----PSDFTGISVDELKGILQYRYSLHEQNLLVAEI 719 Query: 3577 TNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTS 3756 T E+ F I +K+PGPDG+ +F+ W IG + A+K FF +G L K N T Sbjct: 720 TEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKGLNSTI 779 Query: 3757 IALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGE 3936 +ALIPK ++ + DYRPISCC V YK ISK+L+ RL ++ + P Q AF+ DR + E Sbjct: 780 LALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISDRLLME 839 Query: 3937 NIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECI 4116 N+ LA EL++ Y + +SPRC +K+DL KAFD+V W FL++ L P++F WI+ CI Sbjct: 840 NLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWINLCI 899 Query: 4117 STVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRC 4296 ST S+SV +N GLRQG LSP LF +C+ V S + + + F YHPRC Sbjct: 900 STASFSVQVN-----------GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRC 948 Query: 4297 AEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLR 4476 + LTHL +ADD+++F+ SL ++ DF +GL + KS L++A + + Sbjct: 949 RNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETC 1008 Query: 4477 GELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIEL 4656 +L F G P RYLG+PL T+R+ +A+ LLE I ++ SW LS+AGR++L Sbjct: 1009 ASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQL 1068 Query: 4657 IRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPPISWSTICLPKEEGG 4821 + SV+ + FW+S +P I +I IS +F+WS ++W +C PK EGG Sbjct: 1069 LNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGG 1128 Query: 4822 YGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLR--SEDFWHWRGKPSDSPLIKC 4995 GLR L N + +W + S + SLWV WI + +R +E R + ++ Sbjct: 1129 LGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRDDILND 1188 Query: 4996 LLQIRDVLLQR----------TTSIGA---AKMKLTE-WFAEGENGTALAYDFFRPKKPK 5133 + + + LL R SIG AK E W E G Sbjct: 1189 IEEELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIREQGLV------------ 1236 Query: 5134 VPWFSVIWKSILMPKHRFTVWLLAHGKLRTAD---SLSYEPSKMCALCQQHEESNGHIFF 5304 W IW S PK F WL AH +L T D S + S +C LC ES H+FF Sbjct: 1237 KQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRDHLFF 1296 Query: 5305 ECAIAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARYLAVSSMIYLIWEA 5484 C + + DR+ R + + T+ LL + ++ RY+ + I+ +W Sbjct: 1297 SCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDFSGTKRFLLRYV-FQATIHTLWRE 1355 Query: 5485 RNRSRFEGIVPCVERMFHKVQIHVWRFVD 5571 RN+ R G +P H+ +F+D Sbjct: 1356 RNKRR-HGDLPIPSD-------HIIKFID 1376 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 503 bits (1295), Expect = e-139 Identities = 294/797 (36%), Positives = 436/797 (54%), Gaps = 22/797 (2%) Frame = +1 Query: 2686 IMGDYNSLLTVQDKEGGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VS 2850 ++GD+N +L Q+ ++ + + D + L DL G TW N S ++ Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 2851 SKLDRAMVNSQWLIADYESYVEFLPPGCL--SDH-SCGIVSTLAREKRCNRPFKFYNMWT 3021 KLDR + N W S+ F G L SDH SCG+V A RPFKF+N Sbjct: 61 KKLDRILANDSWCNLYPSSHGLF---GNLDFSDHVSCGVVLE-ANGISAKRPFKFFNFLL 116 Query: 3022 LHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLE 3198 +E F ++V +W + G++ + + KL +K +++ +L++ I + A +L Sbjct: 117 KNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLI 176 Query: 3199 DAQRSLLSGAGPPN---DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLV 3369 Q L+ N + E RK LL+ AE+ F+ QR++ + D T +FH +V Sbjct: 177 TCQNLTLANPSVSNAALELEAQRKWV-LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235 Query: 3370 KRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL---DSNTISGNHL 3540 N I L +G S + + V Y++ LLG + D N + Sbjct: 236 DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC 295 Query: 3541 SQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFH 3720 SQ Q EL T++EIK A + +K GPDG++ +F+ W IG + +AA+ EFF Sbjct: 296 SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355 Query: 3721 HGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPA 3900 G+LLKQWN T++ LIPK+ ++ ++++RPISC YKVISK+L++RL ++S+++ + Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415 Query: 3901 QVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGF 4080 Q AF+ RS+ EN+ LA E++ Y R ISPR M+KVDLKKAFD+V W F+ +AL Sbjct: 416 QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475 Query: 4081 PQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA 4260 P+++ WIH+CI+T S+++S+NG GFF +GLRQGDPLSP LF L +EVFS+ + Sbjct: 476 PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535 Query: 4261 SRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKS 4440 HYHP+ ++ ++HL +ADD+++F SS++ I L DF D +GL+ N KS Sbjct: 536 YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595 Query: 4441 NLYVAGME--EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVIS 4614 L+ AG++ E++ GF G FP RYLG+PL +LRIA+YG LLE ++ ++ S Sbjct: 596 QLFQAGLDLSERITSAAY---GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652 Query: 4615 WPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPP 4779 W LS AGR +LI SV+ G+ FW+S +P G I KI + F+W+ K Sbjct: 653 WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712 Query: 4780 ISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHW 4959 +SW CLPK EGG G R WN TLL R +W + R SLW +W H L FW Sbjct: 713 VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQV 772 Query: 4960 RGKPSDSPLIKCLLQIR 5010 +D K LL +R Sbjct: 773 NALQTDPWTWKMLLNLR 789 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 503 bits (1295), Expect = e-139 Identities = 294/797 (36%), Positives = 436/797 (54%), Gaps = 22/797 (2%) Frame = +1 Query: 2686 IMGDYNSLLTVQDKEGGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VS 2850 ++GD+N +L Q+ ++ + + D + L DL G TW N S ++ Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 2851 SKLDRAMVNSQWLIADYESYVEFLPPGCL--SDH-SCGIVSTLAREKRCNRPFKFYNMWT 3021 KLDR + N W S+ F G L SDH SCG+V A RPFKF+N Sbjct: 61 KKLDRILANDSWCNLYPSSHGLF---GNLDFSDHVSCGVVLE-ANGISAKRPFKFFNFLL 116 Query: 3022 LHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLE 3198 +E F ++V +W + G++ + + KL +K +++ +L++ I + A +L Sbjct: 117 KNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLI 176 Query: 3199 DAQRSLLSGAGPPN---DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLV 3369 Q L+ N + E RK LL+ AE+ F+ QR++ + D T +FH +V Sbjct: 177 TCQNLTLANPSVSNAALELEAQRKWV-LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235 Query: 3370 KRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL---DSNTISGNHL 3540 N I L +G S + + V Y++ LLG + D N + Sbjct: 236 DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC 295 Query: 3541 SQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFH 3720 SQ Q EL T++EIK A + +K GPDG++ +F+ W IG + +AA+ EFF Sbjct: 296 SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355 Query: 3721 HGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPA 3900 G+LLKQWN T++ LIPK+ ++ ++++RPISC YKVISK+L++RL ++S+++ + Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415 Query: 3901 QVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGF 4080 Q AF+ RS+ EN+ LA E++ Y R ISPR M+KVDLKKAFD+V W F+ +AL Sbjct: 416 QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475 Query: 4081 PQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA 4260 P+++ WIH+CI+T S+++S+NG GFF +GLRQGDPLSP LF L +EVFS+ + Sbjct: 476 PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535 Query: 4261 SRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKS 4440 HYHP+ ++ ++HL +ADD+++F SS++ I L DF D +GL+ N KS Sbjct: 536 YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595 Query: 4441 NLYVAGME--EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVIS 4614 L+ AG++ E++ GF G FP RYLG+PL +LRIA+YG LLE ++ ++ S Sbjct: 596 QLFQAGLDLSERITSAAY---GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652 Query: 4615 WPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPP 4779 W LS AGR +LI SV+ G+ FW+S +P G I KI + F+W+ K Sbjct: 653 WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712 Query: 4780 ISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHW 4959 +SW CLPK EGG G R WN TLL R +W + R SLW +W H L FW Sbjct: 713 VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQV 772 Query: 4960 RGKPSDSPLIKCLLQIR 5010 +D K LL +R Sbjct: 773 NALQTDPWTWKMLLNLR 789 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 497 bits (1279), Expect = e-137 Identities = 293/915 (32%), Positives = 472/915 (51%), Gaps = 24/915 (2%) Frame = +1 Query: 2278 MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANN 2457 MK+ WNIRG + + V+ + + V +ET + + + ++ G M +N Sbjct: 1 MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSN 60 Query: 2458 FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 2637 + S GRI ++WD +S+ V T Q M C ++ + F FVYG +S + RR L Sbjct: 61 YCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSL 119 Query: 2638 WNSLTLLGENI---AEPWIIMGDYNSLLTVQDKEG-GQPVSNYE-LHDLERFVHTCGLVD 2802 W + +L PW+++GD+N + + Q + N + DL+ + L D Sbjct: 120 WEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSD 179 Query: 2803 LRSIGCRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTL 2970 L S G TW+N + KLDRA+ N +W + F PPG SDH+ I+ Sbjct: 180 LPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILID 238 Query: 2971 AREKRCNRPFKFYNMWTLHEGFQDLVASSWD-EPISGNAQFALKAKLTRLKGKLRELDKL 3147 + + FK+++ + H + ++++W+ + G+ F+L+ L K R L++L Sbjct: 239 NQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTLNRL 298 Query: 3148 HFQHISEQALRAKSVLEDAQRSLLSGAGPPNDY-----EVIRKRATLLAEAEKLFYQQRA 3312 F +I ++ ++ + LED Q LL+ P+D V RK+ A A + F++Q++ Sbjct: 299 RFSNIQQRTAQSLTRLEDIQVELLTS---PSDTLFRREHVARKQWIFFAAALESFFRQKS 355 Query: 3313 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKT 3492 + +L D T+FFH V + N I L G ++D++ + Y+ LLG Sbjct: 356 RIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIP 415 Query: 3493 GQ-CTPLDSNTISGN---HLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFY 3660 + TP I G +L + EEI LF + +KAPGPDGF +F+ Sbjct: 416 SENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFF 475 Query: 3661 TAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKV 3840 AW + +AA++EFF G L + +N T+I LIPK + +T +RP++CCT YKV Sbjct: 476 IEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKV 535 Query: 3841 ISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLK 4020 I++++S RL + + QV F+K R + EN+ LA EL+ + + R ++VD+ Sbjct: 536 ITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCLQVDIS 595 Query: 4021 KAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDP 4200 KA+D V+W FL++ L P F WI CIS+ SYS++ NG L GFF G++G+RQGDP Sbjct: 596 KAYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDP 655 Query: 4201 LSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLI 4380 +S LF L ++V S+S+ + F+ HP C +THL++ADD+L+F+ SS+ I Sbjct: 656 MSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGI 715 Query: 4381 MGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERL 4560 + L DF +GL N K+ L + G L G G P RYLG+PL ++++ Sbjct: 716 LTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKM 775 Query: 4561 RIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYG 4740 R +Y L++ I + SW LS AGR++L++SV+ FW S+ P+ + K+ Sbjct: 776 RRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASVFIFPNQCLQKLEQ 835 Query: 4741 ISRSFVW-----STKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSL 4905 + +F+W S + ISW+ +C PKE GG GL+ L +WN L + +W + + SL Sbjct: 836 MCNAFLWSGAPNSARGAKISWNIVCSPKEAGGLGLKRLSSWNRILALKLIWLLFTSAGSL 895 Query: 4906 WVKWIHQHYLRSEDF 4950 WV W+ H + +F Sbjct: 896 WVSWVRLHLIGRRNF 910 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 492 bits (1266), Expect = e-136 Identities = 303/904 (33%), Positives = 465/904 (51%), Gaps = 28/904 (3%) Frame = +1 Query: 2281 KIASWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNF 2460 K+ WN+RGF+ + G + + IET + + G N+ Sbjct: 4 KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63 Query: 2461 NYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLW 2640 +S G+I +LWD V + V+ + Q + C + + F+ + VY + R+ LW Sbjct: 64 EFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122 Query: 2641 NSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTC----GLV 2799 N L L + + WI++GD+N +L + N + R +C L Sbjct: 123 NELVQLALSPVVVGRSWIVLGDFNQILNPES------AINANIGRKIRAFRSCLLDSDLY 176 Query: 2800 DLRSIGCRLTWTNGSVS----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVST 2967 DL G TW N S K+DR +VN W +Y F P SC +V Sbjct: 177 DLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLD 236 Query: 2968 LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDK 3144 A K RPF+F+N + + F L+ +W +SG+A + + KL LK + + Sbjct: 237 PAVLK-AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSR 295 Query: 3145 LHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRAT----LLAEAEKLFYQQRA 3312 ++ I ++ A +++ QR L+ P + + AT +LA+AE+ F+ Q++ Sbjct: 296 ENYSDIEKRVSEAHAIVLHRQRITLTN--PSVVHATLELEATRKWQILAKAEESFFCQKS 353 Query: 3313 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSL----DEVAEEFVGYFQDL 3480 +L D T +FH + N I L G+ + + + E +F+ L Sbjct: 354 SISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESL 413 Query: 3481 L-GKTGQ--CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNA 3651 L G G+ D N + S Q +L ++ +I+ A F + +KA GPDG+++ Sbjct: 414 LCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSS 473 Query: 3652 KFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVF 3831 +F+ W +G + AV+EFF G+LLKQWN T++ LIPK +S +TD+RPISC Sbjct: 474 EFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTL 533 Query: 3832 YKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKV 4011 YKVI+K+L++RL +++ ++ P+Q AF+ R + EN+ LA E++ Y K IS R M+KV Sbjct: 534 YKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKV 593 Query: 4012 DLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQ 4191 DL+KAFD+V W F++SA P++F WI++CIST +SV +NG GFF +GLRQ Sbjct: 594 DLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQ 653 Query: 4192 GDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSL 4371 GDPLSP LF L +EVFS +K HYHP+ A++ ++HL +ADD+++F SSL Sbjct: 654 GDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSSSL 713 Query: 4372 NLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLAT 4551 + I L DF +GL N K+NLY+AG +E + H GF P RYLG+PL + Sbjct: 714 HGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIRYLGLPLMS 772 Query: 4552 ERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDK 4731 +L+I+ Y + K+ SW +LS AGR++LI SV+ G+ FW+S + G + K Sbjct: 773 RKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKK 827 Query: 4732 IYGISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRR 4896 I + F+WS +K I+WS +CLPK EGG GLR WN T R +W + + Sbjct: 828 IESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVGLRRFTPWNKTFYLRFIWPLFADN 887 Query: 4897 DSLW 4908 D LW Sbjct: 888 DVLW 891 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 486 bits (1252), Expect = e-134 Identities = 301/904 (33%), Positives = 463/904 (51%), Gaps = 28/904 (3%) Frame = +1 Query: 2281 KIASWNIRGFHKALKHGGVQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNF 2460 K+ WN+RGF+ + G + + IET + + G N+ Sbjct: 4 KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63 Query: 2461 NYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLW 2640 +S G+I +LWD V + V+ + Q + C + + F+ + VY + R+ LW Sbjct: 64 EFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122 Query: 2641 NSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTC----GLV 2799 N L L + + WI++GD+N +L + N + R +C L Sbjct: 123 NELVQLALSPVVVGRSWIVLGDFNQILNPES------AINANIGRKIRAFRSCLLDSDLY 176 Query: 2800 DLRSIGCRLTWTNGSVS----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVST 2967 DL G TW N S K+DR +VN W +Y F P SC +V Sbjct: 177 DLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLD 236 Query: 2968 LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDK 3144 A K RPF+F+N + + F L+ +W +SG+A + + KL LK + + Sbjct: 237 PAVLK-AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSR 295 Query: 3145 LHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRAT----LLAEAEKLFYQQRA 3312 ++ I ++ A +++ QR L+ P + + AT +LA+AE+ F+ Q++ Sbjct: 296 ENYSDIEKRVSEAHAIVLHRQRITLTN--PSVVHATLELEATRKWQILAKAEESFFCQKS 353 Query: 3313 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSL----DEVAEEFVGYFQDL 3480 +L D T +FH + N I L G+ + + + E +F+ L Sbjct: 354 SISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESL 413 Query: 3481 L-GKTGQ--CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNA 3651 L G G+ D N + S Q +L ++ +I+ A F + +KA GPDG+++ Sbjct: 414 LCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSS 473 Query: 3652 KFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVF 3831 +F+ W +G + AV+EFF G+LLKQWN T++ LIPK +S +TD+RPISC Sbjct: 474 EFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTL 533 Query: 3832 YKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKV 4011 YKVI+K+L++RL +++ ++ P+Q AF+ R + EN+ LA E++ Y K IS R M+KV Sbjct: 534 YKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKV 593 Query: 4012 DLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQ 4191 DL+KAFD+V W F++SA P++F WI++CIST +SV +NG GFF +GLRQ Sbjct: 594 DLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQ 653 Query: 4192 GDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSL 4371 GDPLSP LF L +EVFS +K YHP+ A++ ++HL +ADD+++F SSL Sbjct: 654 GDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSSSL 713 Query: 4372 NLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLAT 4551 + I L DF +GL N K+NLY+AG +E + H GF P RYLG+PL + Sbjct: 714 HGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIRYLGLPLMS 772 Query: 4552 ERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDK 4731 +L+I+ Y + K+ SW +LS AGR++LI SV+ G+ FW+S + G + K Sbjct: 773 RKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKK 827 Query: 4732 IYGISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRR 4896 I + F+WS +K I+WS +CLPK EGG LR WN T R +W + + Sbjct: 828 IESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVALRRFTPWNKTFYLRFIWPLFADN 887 Query: 4897 DSLW 4908 D LW Sbjct: 888 DVLW 891 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 486 bits (1251), Expect = e-134 Identities = 334/1070 (31%), Positives = 514/1070 (48%), Gaps = 65/1070 (6%) Frame = +1 Query: 2560 RCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIA---EPWIIMGDYNSLLTVQDKE 2730 R D + + VY + + R+ LW L LL +++ +PWI++GD+N +L + Sbjct: 45 RTEFLDDSVVVSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHS 104 Query: 2731 GGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIA 2895 ++ N + + L DL G TW N S V+ KLDR +VN W Sbjct: 105 QATSLNVNRRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSR 164 Query: 2896 DYESYVEFLPPGCLSDH-SCGIVST--LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE 3066 +Y F P SDH SCG++ + REKR PF+FYN + F LV W Sbjct: 165 FPSAYAVFGEPD-FSDHASCGVIINPLMHREKR---PFRFYNFLLQNPDFISLVGELWYS 220 Query: 3067 -PISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPN- 3240 + G++ F + KL LK +R +F ++ ++ A +++ Q LS PN Sbjct: 221 INVVGSSMFKMSKKLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNA 280 Query: 3241 --DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQ 3414 + E RK +L +AE+ F+ QR++ ++ D T +FH + N I + Sbjct: 281 ALEMEAQRKWL-ILVKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDD 339 Query: 3415 SGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL-----DSNTISGNHLSQAQREELVVPIT 3579 +G + + + E + YF +LLG G+ P D + + S Q++EL + + Sbjct: 340 NGVKIDTQLGIKEHCIEYFSNLLG--GEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFS 397 Query: 3580 NEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSI 3759 ++IK+A F ++K GPDGF +F+ W IG + AV EFF LLKQWN T++ Sbjct: 398 RQDIKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTL 457 Query: 3760 ALIPKSDHSPCVTDYRPISCCT----VFYKVISKVLSARLANVMSSLLDPAQVAFVKDRS 3927 LIPK ++ + D+RPISC YKVI+++L+ RL ++S ++ P Q AF+ R Sbjct: 458 VLIPKITNASKMNDFRPISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRF 517 Query: 3928 IGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIH 4107 + EN+ LA EL++ Y R+ I PR M+KVDL+KAFD++ W F++SAL G P +F WI Sbjct: 518 LAENVLLATELVQGYNRQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWIT 577 Query: 4108 ECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYH 4287 +CIST ++SV +NG GFF RGLRQG+PLSP LF L +EVFS + + HYH Sbjct: 578 QCISTPTFSVCVNGNTGGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYH 637 Query: 4288 PRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEE 4467 P+ + + ++HL +ADD+++F SSL+ I L DF +GL N K++LY+AG++ Sbjct: 638 PKTSPLSISHLMFADDIMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAGLDR 697 Query: 4468 QLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGR 4647 + +LRIA YG LLE + K+ SW LS AGR Sbjct: 698 ---------------------IEASTIARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGR 736 Query: 4648 IELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPPISWSTICLPKE 4812 ++LI SV+ G+ FW+S +P G + +I + F+WS K ++WS +CLPKE Sbjct: 737 VQLIASVISGIINFWISTFILPKGCVKRIEALCARFLWSGNIDVKKGAKVAWSEVCLPKE 796 Query: 4813 EGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWV-KWIHQHYL------------------ 4935 EGG GLR T+L TLW+ ++ S W W L Sbjct: 797 EGGVGLR-----RFTVLNTTLWD--GKKISFWFDNWSPLGPLFKLFGSSGPRALCIPIQA 849 Query: 4936 ----RSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGTALA 5103 D P + L+ + + L S + + + F A Sbjct: 850 KVADACSDVGWLISPPRTDQALALLIHLTTIALPCFDSSPDTFVWIVDDFTCHGFSAART 909 Query: 5104 YDFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCALCQQ 5274 ++ RPKKP W +W +PKH F +W+ +L T L+ + C LC Sbjct: 910 WEAMRPKKPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTTDCCLCSS 969 Query: 5275 HEESNGHIFFECAIAKSLLDRVVRWMDIKFDIT-------SMAKLLDVFQRHYKGKGRQV 5433 ES H+ C + V W + F +T S A+LL + R K + Sbjct: 970 RPESRDHLLLYCVFS------AVIWKLVFFRLTPSQAIFNSWAELLS-WTRINSSKAPSL 1022 Query: 5434 KARYLAVSSMIYLIWEARNRSRFEGIVPCVERMFHKVQ---IHVWRFVDI 5574 R +A + ++ +W+ RN I +FH + +++R++ I Sbjct: 1023 -LRKIAAQASVFHLWKQRNNVLHNSIFISPATVFHFIDRELENLYRYIQI 1071 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 453 bits (1165), Expect = e-124 Identities = 274/813 (33%), Positives = 426/813 (52%), Gaps = 34/813 (4%) Frame = +1 Query: 2680 WIIMGDYNSLLTVQDKEGGQPVSNYELHDLER-----FVHTCGLVDLRSIGCRLTWTNGS 2844 W++ D N+ LT K + L E F++ V+ R R+ W + Sbjct: 431 WVVWRD-NARLTPVFKSSQMITCSILLEGKEEEFFCSFIYASNFVEER----RILWED-- 483 Query: 2845 VSSKLDRAMV-NSQWLIA-DYESYVE-----------FLPPGCLSDHSCGIVSTLAREKR 2985 + S D ++ N W++ D+ +E + PPG G + A Sbjct: 484 IRSHHDSPLIRNKPWILCGDFNEILEGGEHSNYDNSPYTPPGMRDFQEIGRLMLEAAATG 543 Query: 2986 CNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLKGKLRELDKLHF 3153 +PFKF N+ T F +V S W +S +A + KL LK LREL K Sbjct: 544 GRKPFKFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKL 603 Query: 3154 QHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRA--TLLAEAEKLFYQQRAKSVYL 3327 + ++ A +L + Q + L+ E ++ T L+E E+ F +Q++K ++ Sbjct: 604 GDLPKRTREAHILLCEKQATTLANPSQETIAEELKAYTDWTHLSELEEGFLKQKSKLHWM 663 Query: 3328 KNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQ--- 3498 D +FH + RN+I ++ + + + +E+ E +F + L + Sbjct: 664 NVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFH 723 Query: 3499 -CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWD 3675 + D + S + L +T EEI+ LF + +K+PGPDG+ ++F+ A W Sbjct: 724 GISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWS 783 Query: 3676 KIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVL 3855 G DFIAA++ FF G L K N T +ALIPK D + + DYRPISCC V YKVISK+L Sbjct: 784 LTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKIL 843 Query: 3856 SARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDT 4035 + RL ++ S + Q AFVK+R + EN+ LA EL++ Y ++ ++PRC +K+D+ KAFD+ Sbjct: 844 ANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDS 903 Query: 4036 VDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLL 4215 V W FL++ L FP+ F WI CIST ++SV +NG L GFF RGLRQG LSP L Sbjct: 904 VQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYL 963 Query: 4216 FGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLT 4395 F +C+ V S + EA+ YHP+C ++ LTHL +ADDL++F S+ ++ Sbjct: 964 FVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFK 1023 Query: 4396 DFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANY 4575 +F ++GL+ + KS +Y+AG+ R + L F G P RYLG+PL T+++ A+Y Sbjct: 1024 EFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADY 1083 Query: 4576 GSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSF 4755 L+EA+ K+ SW +LS+AGR+ L+ SV+ + FW+S +P+G I +I + +F Sbjct: 1084 SPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAF 1143 Query: 4756 VWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWI 4920 +WS K I+WS+IC PK+EGG G++ L N + +W + S + SLWV WI Sbjct: 1144 LWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWI 1203 Query: 4921 HQHYLRSEDFWHWRGKPS-DSPLIKCLLQIRDV 5016 +R FW + S S + K LL+ R++ Sbjct: 1204 WTFIIRKGTFWSANERSSLGSWMWKKLLKYREL 1236 Score = 61.6 bits (148), Expect = 4e-06 Identities = 52/199 (26%), Positives = 84/199 (42%), Gaps = 7/199 (3%) Frame = +2 Query: 1157 YEDIETVEDSVGYCLVGCFMGRHPGRNGVHAIASR-W-----RVPYKFYMHKSGWVVYRF 1318 +ED + D + L+G F P VH I ++ W V Y+ + + +R Sbjct: 58 FEDSTPLWDDL---LIGRFPQISPHIAKVHVIVNKIWPLGDKTVKIDAYVVDTKTIKFRI 114 Query: 1319 DTPEDRDKVLRGGPYFAFGIPLFL-KYMPRCFLFEEDGRYIPAWIQIHKLPPDCWTSKVL 1495 R +VLR G + +P+ + K+ P + + + +P WI I +P +T K L Sbjct: 115 RESSVRARVLRRGMWNIADMPMIVSKWSPVAEDAQPEIKTMPMWITIKNVPRSMFTWKGL 174 Query: 1496 GLIGSEIGKPLYTDKLTQTRERLTYARLLVEVSVIGEKVKMVPITLPTGVQLDLEIIYES 1675 + S IG+P T A++ VE + E K TGV +E Y Sbjct: 175 SFLASPIGEPKKLHPDTVLCNSFEEAKVFVEADLTQEMPKQFRFKSETGVDAMVEYKYPW 234 Query: 1676 MPDFCEECRRIGHQKEACM 1732 +P C C + GH +E C+ Sbjct: 235 LPPRCSSCSKWGHIQEVCL 253 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 452 bits (1163), Expect = e-124 Identities = 328/1124 (29%), Positives = 521/1124 (46%), Gaps = 27/1124 (2%) Frame = +1 Query: 2200 STTSYANRQPKWAT-ITGPSSPLYTWIMKIASWNIRGFHKALKHGGVQHLLKQKEIQVMA 2376 ST + KW I P + L ++ A N R + V+ + + V Sbjct: 22 STKAKKKELKKWHNRIRPPVTELQGFVDGRAGLNSRNRQRV-----VRSWIASNNLLVGC 76 Query: 2377 SIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVSLDVLMATSQYMHCR 2556 +ET + + + ++ G M +N+ S GRI ++WD +S+ V T Q M C Sbjct: 77 FLETHVAQENANSVLASTLPGWRMDSNYCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCS 135 Query: 2557 VRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENI---AEPWIIMGDYNSLLTVQDK 2727 ++ + F FVYG +S + RR LW + +L PW+++GD+N + + Sbjct: 136 IKIPSLLQSFAVAFVYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEH 195 Query: 2728 EG-GQPVSNYE-LHDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVNSQWL 2889 Q + N + DL+ + L DL S G TW+N + KLDRA+ N +W Sbjct: 196 YSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWF 255 Query: 2890 IADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE- 3066 + F PPG SDH+ I+ + + FK+++ + H + ++++W+E Sbjct: 256 AVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEEN 314 Query: 3067 PISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDY 3246 + G+ F+L+ L K R L++L F +I ++ ++ + LED Q LL+ P+D Sbjct: 315 TLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTS---PSDT 371 Query: 3247 -----EVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTK 3411 V RK+ A A + F++Q+++ +L D T+FFH V + N I L Sbjct: 372 LFRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRG 431 Query: 3412 QSGDETTSLDEVAEEFVGYFQDLLGKTGQ-CTPLDSNTISGN---HLSQAQREELVVPIT 3579 G ++D++ + Y+ LLG + TP I G +L + Sbjct: 432 DDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPS 491 Query: 3580 NEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSI 3759 EEI LF + +KAPGPDGF +F+ AW + +AA++EFF G L + +N T+I Sbjct: 492 EEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAI 551 Query: 3760 ALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGEN 3939 LIPK + +T +RP++CCT YKVI++++S RL + + QV F+K R + EN Sbjct: 552 TLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCEN 611 Query: 3940 IHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECIS 4119 + LA EL+ + + R ++VD+ KA+D V+W FL++ L P F WI CIS Sbjct: 612 VLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCIS 671 Query: 4120 TVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCA 4299 + SYS++ NG L GFF G++G+RQGDP+S LF L ++V S+S+ + F+ HP C Sbjct: 672 SASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCL 731 Query: 4300 EVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRG 4479 +THL++ADD+L+F+ SS+ I+ L DF +GL N K+ L + G Sbjct: 732 APIITHLSFADDVLVFSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARNR 791 Query: 4480 ELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELI 4659 L G G P RYLG+PL ++++R +Y L++ I + SW LS AGR++L+ Sbjct: 792 SLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQLL 851 Query: 4660 RSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS---TKHPPISWSTICLPKEEGGYGL 4830 + + + L P + + G++ SF W T H P+ T G L Sbjct: 852 NWIWR--KLCKLRPFARPFIICEVGSGVTASF-WHDNWTDHGPLLHLTGPAGPLLAGLPL 908 Query: 4831 RDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIR 5010 + L W I S R V + Q L PS + LI C Sbjct: 909 NSVV---RDALRDDTWRISSSRSRNPVITLLQRVL-----------PSAASLIDC--PHD 952 Query: 5011 DVLLQRTTSIGAAKMKLTEWFAEGENGTALAYDFFRPKKPKVPWFSVIWKSILMPKHRFT 5190 D L K+ TA + + +P V W +W +PK F Sbjct: 953 DTYL----------WKIGHHAPSNRFSTADTWSYLQPSSTSVLWHKAVWFKDHVPKQAFI 1002 Query: 5191 VWLLAHGKLRTADSL---SYEPSKMCALCQQHEESNGHIFFECAIAKSLLDRVVRWMDIK 5361 W++AH +L T D L + C LC +ES H+FF C + + +R +++ Sbjct: 1003 CWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESREHLFFRCQFSSEIWSFFMRALNLN 1062 Query: 5362 FDITSMAKLLDVFQRHYKGKGRQVK-ARYLAVSSMIYLIWEARN 5490 M LL + R + L + +Y IW RN Sbjct: 1063 PPPQFMHCLLWTLT---ASRDRNITLITKLLFHASVYFIWRERN 1103 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 446 bits (1146), Expect = e-122 Identities = 270/760 (35%), Positives = 407/760 (53%), Gaps = 30/760 (3%) Frame = +1 Query: 2626 RRPLWNSLTLLGENI---AEPWIIMGDYNSLLTVQDKEGGQ--PVSNYELHDLERFVHTC 2790 R+ LWN L ++ ++PWII GD+N +L +++ + PV+ + D + V+ C Sbjct: 4 RKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHC 63 Query: 2791 GLVDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGI 2958 + DL G TW+N ++ KLDR +VN WL + SY F GC C I Sbjct: 64 SITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRI 123 Query: 2959 VSTLAREK--RCNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLK 3120 + + RPFKF N+ T E F V S W+E +S ++ F KL LK Sbjct: 124 NLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLK 183 Query: 3121 GKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRATL--LAEAEKL 3294 LR L K ++ +Q A L Q ++ P + E A +A E+ Sbjct: 184 PLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKWDHIAVLEEK 243 Query: 3295 FYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQ 3474 F +QR+K +L DR K FH V +N+I ++ G + +++ E +F+ Sbjct: 244 FLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFR 303 Query: 3475 DLLGKTGQCTPLDSNTISGNHL--------SQAQREELVVPITNEEIKNALFDIGADKAP 3630 + L Q P D I+ L S + +E L ++ EEI +F + DK+P Sbjct: 304 EFL----QLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSP 359 Query: 3631 GPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRP 3810 GPDG+ A+FY AW+ IG +FI A++ FF G L K N T +ALIPK + + DYRP Sbjct: 360 GPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRP 419 Query: 3811 ISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRIS 3990 ISCC V YKVISK+++ RL V+ + Q AFVKDR + EN+ LA E+++ Y + +S Sbjct: 420 ISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVS 479 Query: 3991 PRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFS 4170 RC +K+D+ KAFD+V W FL++ L FP +F WI CI+T S+SV +NG L G FS Sbjct: 480 SRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFS 539 Query: 4171 GRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFA 4350 R LRQG LSP LF + ++V S+ + +A F YHP+C + LTHL++ADDL++ + Sbjct: 540 SARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILS 599 Query: 4351 RADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRY 4530 V S++ I+ L +F +GL+ + KS +Y+AG++ + E++ F G P RY Sbjct: 600 DGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRY 659 Query: 4531 LGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPI 4710 LG+PL ++RL ++ L+E + KK+ +W LS AGR+ LI S L + FW++ + Sbjct: 660 LGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRL 719 Query: 4711 PSGVIDKIYGISRSFVW-----STKHPPISWSTICLPKEE 4815 P I +I + +F+W S+ +SW IC PK+E Sbjct: 720 PRACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKKE 759 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca] Length = 958 Score = 442 bits (1138), Expect = e-121 Identities = 276/912 (30%), Positives = 432/912 (47%), Gaps = 39/912 (4%) Frame = +1 Query: 2905 SYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNA 3084 S + FLPPG +SDH+ +V + PFKF+N E F +V++ W + G+ Sbjct: 137 SVLSFLPPG-ISDHAAMVVKVGLPFRIRKAPFKFFNFLADREDFIPIVSAVWATNVWGSK 195 Query: 3085 QFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKR 3264 QF + KL +K + + L+ ++ E+ L+ KS ++ Sbjct: 196 QFQVWRKLKLVKNQFKLLNC----NVVEKLLKKKSRVQ---------------------- 229 Query: 3265 ATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDE 3444 +LK D+ + FF + ++ RN I + + G + Sbjct: 230 -------------------WLKKGDKNSTFFFKTMTKHRNRNRIATINRSDGPDLAK--S 268 Query: 3445 VAEEFVGYFQDLLGKTGQCTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADK 3624 + EF T+++I+ F + +K Sbjct: 269 LCNEF---------------------------------------THDDIRAVFFSMNPNK 289 Query: 3625 APGPDGFNAKFYTAAWDKIGDDFIAA-VKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTD 3801 +PGPDGFN F+ AW IGD+ +AA VKEFF +G LL + N T I L+PK + ++D Sbjct: 290 SPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNSTIITLVPKVANPTTMSD 349 Query: 3802 YRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARK 3981 +RPISCC FYK+I+K+L+ RL + ++ P+Q F+ R IG+NI LAQE++ Y + Sbjct: 350 FRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKA 409 Query: 3982 RISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYG 4161 PRC VD+ KA DTV+W F+++ L + P GWI CIS+ +SV +NG L G Sbjct: 410 DGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAG 469 Query: 4162 FFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDG-FHYHPRCAEVELTHLAYADDL 4338 FF+ RRGLRQGDPLSP LF + +EV S ++ F YH RC ++ L+HL +ADDL Sbjct: 470 FFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDL 529 Query: 4339 LLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVF 4518 L+F D +S+ + ++F + L+AN +S +++AG++ +L +T F G Sbjct: 530 LMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTC 589 Query: 4519 PFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLS 4698 P RYLGIPL T +LR+ + LL+ I ++ SW LS AGR++LI+SVL ++ +W S Sbjct: 590 PVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWAS 649 Query: 4699 ILPIPSGVIDKIYGISRSFVWSTK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLL 4863 L +P V+ I R F+W+ ++WS ICLPK EGG G++DL WN L+ Sbjct: 650 HLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALM 709 Query: 4864 CRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIG 5043 +WN+ S + W W+ + L+ FW+ S + LL+IR++ +I Sbjct: 710 ISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCSFFVNII 769 Query: 5044 AAKMKLTEWFAE----------------GENG--------------TALAYDFFRPKKPK 5133 + WF GE+G T+ A++ RP + Sbjct: 770 GDGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFI 829 Query: 5134 VPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKMCALCQQHEESNGHIFFECA 5313 VPW+ ++W F E++ H+FF+CA Sbjct: 830 VPWYRLVW---------FVA-----------------------------ETHNHLFFDCA 851 Query: 5314 IAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARYLAVSSMIYLIWEARNR 5493 + + V+ D+ + + + ++KG V LA+ +++Y IW RN Sbjct: 852 YSFGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNN 911 Query: 5494 SRF--EGIVPCV 5523 RF E + P V Sbjct: 912 RRFRNESLPPAV 923 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 433 bits (1114), Expect = e-118 Identities = 303/1088 (27%), Positives = 499/1088 (45%), Gaps = 90/1088 (8%) Frame = +1 Query: 2335 VQHLLKQKEIQVMASIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVS 2514 ++ L ++++A +E +D + ++ + NN +W H V Sbjct: 869 IKKLQLMHRLKILAILEPMVDTSKAEYFRRKMGFEKVIVNNSQK--------IWLFHSVE 920 Query: 2515 L--DVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIAEPWII 2688 +VL+ Q +H RV D TFVY + R PLWN L L ++ PWI+ Sbjct: 921 FICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLRNLAADMEGPWIV 980 Query: 2689 MGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTNGSVSSKLDRA 2868 GD+N +L +++ G + D + CGL+D G TWTN + +LDR Sbjct: 981 GGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTWTNNRMFQRLDRM 1040 Query: 2869 MVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLV 3048 + N QW+ + ++ L SDH ++S ++ F+F + W LH F V Sbjct: 1041 VYNQQWINKFPITRIQHLNRDG-SDHCPLLLSCSNSSEKAPSSFRFLHAWALHHNFNASV 1099 Query: 3049 ASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI------SEQALRAKSVLEDAQR 3210 +W+ PI+G+ A +K RLK L+ +K F I +E+ + +L ++ Sbjct: 1100 EGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEKRVEECEILHQQEQ 1159 Query: 3211 SLLSGAGPPNDYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRN 3390 ++ S Y + K+ ++ E++F++Q++ ++ +R TKFFH +++ R+ Sbjct: 1160 TIGSRIQLNKSYAQLNKQLSM----EEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRS 1215 Query: 3391 AIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLDSNTISGNHLSQAQREELVV 3570 I K+ +Q G+ +++ + + +F LL +++ + +S L Sbjct: 1216 HIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSLCPSIISDTDNGFLCA 1275 Query: 3571 PITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNH 3750 T +E+K A+F I + A GPDGF++ FY WD I D AVKEFFH + + Sbjct: 1276 EPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTS 1335 Query: 3751 TSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSI 3930 T++ LIPK+ + +++RPIS CTV K+I+K+L+ RLA ++ S++ Q FV R I Sbjct: 1336 TTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLI 1395 Query: 3931 GENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHE 4110 +NI LAQEL+ K +K +K+D+ KA+D +DW FL L GF Q+ G I + Sbjct: 1396 SDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQK 1455 Query: 4111 CISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA-SRLDGFHYH 4287 CIS +S+ LNG G+F RGLRQGD +SP LF L E +R + + HY Sbjct: 1456 CISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLNALYDQYPSLHYS 1515 Query: 4288 PRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYV-AGME 4464 C+ + ++HLA+ADD+++FA S+L IM L ++ +G R NP KS + M Sbjct: 1516 SGCS-LSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTNMA 1574 Query: 4465 EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAG 4644 R +L TGF P YLG PL ++ + L+ I +++ W TLS G Sbjct: 1575 SSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPGG 1634 Query: 4645 RIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW--STKHPPI---SWSTICLPK 4809 RI L+RS L + + L +L P V+++I + +F+W ST I SW I LP Sbjct: 1635 RITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTASKRIHWASWGKIALPI 1694 Query: 4810 EEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSE--------------- 4944 EGG +R+++ + W ++ +SLW +++ Y + Sbjct: 1695 AEGGLDIRNVEDVCEAFSMKLWWRFRT-TNSLWTQFMRAKYCGGQLPTDVQPKLHDSQTW 1753 Query: 4945 -----------------------DFWH--WRGKPSDSPLI-------KCLLQIRDVLLQR 5028 FWH W G + PL+ + Q+ D L Sbjct: 1754 KRMVTISSITEQNIRWRIGHGELFFWHDCWMG---EEPLVNRNQAFASSMAQVSDFFLNN 1810 Query: 5029 TTSIGAAKMKLTEWFAE----------------------GENGTALAYDFFRPKKPKVPW 5142 + ++ K L + E G+ T A+ R +K + P Sbjct: 1811 SWNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPV 1870 Query: 5143 FSVIWKSILMPKHRFTVWLLAHG------KLRTADSLSYEPSKMCALCQQHEESNGHIFF 5304 F+ IW + F +W L H K++T ++ + C C+ EES H+ + Sbjct: 1871 FNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTK---GFQLASRCRCCKS-EESLMHVMW 1926 Query: 5305 ECAIAKSL 5328 + +A + Sbjct: 1927 KNPVANQV 1934 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 428 bits (1101), Expect = e-116 Identities = 288/1030 (27%), Positives = 483/1030 (46%), Gaps = 87/1030 (8%) Frame = +1 Query: 2491 LWDMHRVSL--DVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGE 2664 +W H + L D+++ Q +H R+ ++ F ATFVY + R LW+ L L Sbjct: 950 IWLFHSLELHSDIILDHPQCLHVRLTSPWLEKPFFATFVYAKCTRSERTLLWDCLRRLAA 1009 Query: 2665 NIAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTNGS 2844 + EPW++ GD+N +L +++ G + D + CGL+D G TWTN Sbjct: 1010 DNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTWTNNR 1069 Query: 2845 VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTL 3024 + +LDR + N QW+ + ++ L SDH ++S ++ F+F + W L Sbjct: 1070 MFQRLDRVVYNHQWINMFPITRIQHLNRDG-SDHCPLLISCFISSEKSPSSFRFQHAWVL 1128 Query: 3025 HEGFQDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI------SEQALRAK 3186 H F+ V +W+ PI+G+ A K RLK L+ +K F I +E+ + Sbjct: 1129 HHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEEC 1188 Query: 3187 SVLEDAQRSLLSGAGPPNDYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDL 3366 +L ++++ S Y + K+ + E++F++Q++ ++ +R TKFFH Sbjct: 1189 EILHQQEQTVGSRINLNKSYAQLNKQLNV----EEIFWKQKSGVKWVVEGERNTKFFHMR 1244 Query: 3367 VKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLD----SNTISGN 3534 +++ R+ I K+ + G +++ + + YF LL + P D N++ + Sbjct: 1245 MQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLL----KAEPCDISRFQNSLIPS 1300 Query: 3535 HLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEF 3714 +S ++ E L +E+K+A+FDI + A GPDGF++ FY W+ I D + AV++F Sbjct: 1301 IISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDF 1360 Query: 3715 FHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLD 3894 FH + + T++ L+PK + +++RPIS CTV K+I+K+LS RLA ++ S++ Sbjct: 1361 FHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIIT 1420 Query: 3895 PAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGY 4074 Q FV R I +NI LAQEL+RK K +K+D+ KA+D +DW FL+ L + Sbjct: 1421 ENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHF 1480 Query: 4075 GFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMK 4254 GF +Q+ G I +CIS +S+ LNG + G+F RGLRQGD +SP LF L E SR + Sbjct: 1481 GFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLN 1540 Query: 4255 EA-SRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANP 4431 + HY + ++HLA+ADD+L+F S+L I+ L ++ + +G R N Sbjct: 1541 ALYDQYPSLHYSSG-VPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINA 1599 Query: 4432 LKSNLYV-AGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKV 4608 KS + R + TGF + P YLG PL ++ + L+ I +++ Sbjct: 1600 QKSCFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERI 1659 Query: 4609 ISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW-----STKH 4773 W LS GRI L+RSVL + + L +L P V++++ + SF+W S + Sbjct: 1660 TGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRI 1719 Query: 4774 PPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSE--- 4944 SW+ I LP EGG +R L + W ++ DSLW +++ Y R + Sbjct: 1720 HWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRT-TDSLWTRFMRMKYCRGQLPM 1778 Query: 4945 -----------------------------------DFWH--WRGK----PSDSPLIKCLL 5001 FWH W G+ S+ ++ Sbjct: 1779 QTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTSSMV 1838 Query: 5002 QIRDVLLQRTTSIGAAKMKLTEWFAE----------------------GENGTALAYDFF 5115 Q+ D + +I K L + + G+ T A+ Sbjct: 1839 QVCDFFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLI 1898 Query: 5116 RPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKMCALCQ--QHEESN 5289 R +K P F+ IW + F +W L H + + + ++ + C+ + EES Sbjct: 1899 RKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEESI 1958 Query: 5290 GHIFFECAIA 5319 H+ ++ +A Sbjct: 1959 MHVMWDNPVA 1968 >emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana] Length = 1141 Score = 419 bits (1078), Expect = e-114 Identities = 260/733 (35%), Positives = 392/733 (53%), Gaps = 17/733 (2%) Frame = +1 Query: 2455 NFNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRP 2634 N+ +S+ G+I +LWD V + ++ + Q + C V + + + VY + R+ Sbjct: 54 NYGFSDLGKIWVLWDPS-VEVVIVAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKE 112 Query: 2635 LWNSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVS---NYELHDLERFVHTCGL 2796 LW +T L + PWI++GD+N +L E + VS + + D + L Sbjct: 113 LWREITALVASPVTFNRPWILLGDFNQVL--HPHEHSRHVSLNVDRRIRDFRECLLDAEL 170 Query: 2797 VDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDH-SCGIV 2961 DL G TW N S V+ K+DR +VN W S+ F PP SDH SCG+V Sbjct: 171 SDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPD-FSDHASCGVV 229 Query: 2962 STLAREKRCNRPFKFYNMWTLHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLREL 3138 L K RPFKF+N + F +LV W + G++ F + KL LK +++ Sbjct: 230 LELDPIK-AKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKALKKPIKDF 288 Query: 3139 DKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVI--RKRATLLAEAEKLFYQQRA 3312 +L++ ++ ++ A L Q L N + +++ +LA AE+ F++QR+ Sbjct: 289 SRLNYSNLEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRS 348 Query: 3313 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKT 3492 + + D T++FH + N I L SG + S +A+ YF++LL Sbjct: 349 RVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCALYFENLLSDD 408 Query: 3493 GQCTPL---DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYT 3663 L D N + +Q +L ++E+IK A F + ++KA GPDGF Sbjct: 409 NDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACGPDGFPVT--- 465 Query: 3664 AAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVI 3843 AAV+EFF G LLKQWN T+I LIPK ++ C +D+RPISC YKVI Sbjct: 466 -----------AAVREFFISGNLLKQWNATTIVLIPKFPNASCTSDFRPISCMNTLYKVI 514 Query: 3844 SKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKK 4023 +++L+ RL ++S ++ P+Q AF+ R + EN+ LA E++ Y + IS R M+KVDL+K Sbjct: 515 ARLLTDRLQKLLSCVISPSQSAFLPGRLLAENVLLATEMVHGYNWRNISLRGMLKVDLRK 574 Query: 4024 AFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPL 4203 AFD+V W F+++AL+ G P +F WIH+CIST +++VS+NG GFF +GLRQGDPL Sbjct: 575 AFDSVRWEFIIAALLALGVPTKFINWIHQCISTPTFTVSVNGCCGGFFKSAKGLRQGDPL 634 Query: 4204 SPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIM 4383 SP LF L +EVFS+ + YHP+ +++ ++HL +ADD+++F SSL+ I Sbjct: 635 SPYLFVLAMEVFSKLLNSRFDSGYIRYHPKASDLSISHLMFADDVMIFFDGGSSSLHGIC 694 Query: 4384 GCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLR 4563 L DF +GL+ N KS+ + AG+E+ R L GF QG P RYLG+PL +LR Sbjct: 695 ETLEDFASWSGLKVNNDKSHFFCAGLEQAERNSLA-AYGFPQGCLPIRYLGLPLMCRKLR 753 Query: 4564 IANYGSLLEAITK 4602 IA Y LLE K Sbjct: 754 IAEYEPLLEKSPK 766