BLASTX nr result
ID: Mentha26_contig00005098
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha26_contig00005098 (4420 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga... 437 e-119 emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga... 433 e-118 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 400 e-108 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 394 e-106 dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 393 e-106 emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li... 389 e-105 dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ... 389 e-105 dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal... 378 e-101 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 368 2e-98 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 358 1e-95 emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694... 353 5e-94 gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip... 352 1e-93 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 349 8e-93 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 349 8e-93 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 342 7e-91 gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00... 323 4e-85 gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] 317 4e-83 emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga... 313 4e-82 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 309 9e-81 ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom... 304 2e-79 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 437 bits (1123), Expect = e-119 Identities = 259/786 (32%), Positives = 402/786 (51%), Gaps = 6/786 (0%) Frame = +1 Query: 2074 MIIATWNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHN 2253 M+ +WN+RGM K ++ + HKI + +LET + W +++N Sbjct: 1 MLCVSWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNN 60 Query: 2254 FDIVSNGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDM 2433 + + RI + W V+V + ++Q++ ++ + + YG +TI DR + Sbjct: 61 YSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDI--QDQSHKLKMVAVYGLHTIADRKSL 118 Query: 2434 WDSLILHVPLDAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTGC 2613 W L+ V P + GDFN V ++R+ ++ E DF L ++ ST Sbjct: 119 WSGLLQCVQQQDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRSTWS 178 Query: 2614 FFTFA----GKD-VFSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLFQHVQN 2778 +++++ G+D V SRID+ +N +WL ++LP GI SDHS + L Sbjct: 179 YYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPGI-SDHSPLLFNLMTGRPQ 237 Query: 2779 FKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNI 2958 K F+F N E F ++++ W N+ K + + L ++ L+Q+ Sbjct: 238 GGKPFKFMNVMAEQGEFLETVEKAW-NSVNGRFKLQAIWLNLKAVKRELKQMKTQKIGLA 296 Query: 2959 SEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINF 3138 EK R +L+ Q Q D D N +H E + L Q+++ + Sbjct: 297 HEKVKNLRHQLQDLQSQDDFDH-NDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQ 355 Query: 3139 SDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFG-KNTPRTPV 3315 D ++K F + VK N I + E+G D + + +++Y L G + + V Sbjct: 356 GDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLMGV 415 Query: 3316 DWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXX 3495 D + + G LS+ + +LIR V+ EI AL IG+DKAPG DGF + FF Sbjct: 416 DLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQ 475 Query: 3496 XXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRM 3675 A + EFF+ + R +N +V+L+PK H + +FRPIAC V+YKII+K+LT+RM Sbjct: 476 EIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRM 535 Query: 3676 SPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISW 3855 + ++++ AQS FI GR+I DN LA ELIR Y RK ++ RC++K+D+RKAYD + W Sbjct: 536 KGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKH-MSPRCIMKVDIRKAYDSVEW 594 Query: 3856 DFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLF 4035 FL +LY F F+ WI+ CV++ ++S+ +NG + ++GLRQGDPMSP LF Sbjct: 595 SFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFAL 654 Query: 4036 CMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFT 4215 CMEYLSR + + F HPKC + THL FADDLL+F R D S+ + A +F+ Sbjct: 655 CMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFS 714 Query: 4216 ATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPL 4395 SGL + KS+I+ GV +E+ D G LP +YLG+PL SK LT PL Sbjct: 715 HASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPL 774 Query: 4396 ISQISN 4413 + I+N Sbjct: 775 VEMITN 780 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 433 bits (1114), Expect = e-118 Identities = 260/786 (33%), Positives = 403/786 (51%), Gaps = 7/786 (0%) Frame = +1 Query: 2074 MIIATWNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHN 2253 M I TWN+RG+ K V+ + KI + + ET F W++++N Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60 Query: 2254 FDIVSNGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDM 2433 + GRI + W +N V++N++SV +QVI V N F A YG +TI DR + Sbjct: 61 YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120 Query: 2434 WDSLILHVPL-DAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTG 2610 W+ L V + P + GD+N V +R+ SE E +D L +AP+TG Sbjct: 121 WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180 Query: 2611 CFFTFAGKDV-----FSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLFQHVQ 2775 F+++ K + SRID++ +N W+ E+ GI SDHS I L Sbjct: 181 LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGI-SDHSPLIFNLATQHD 239 Query: 2776 NFKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNN 2955 + F+F N + F +KE W +A R K + + +L ++ L+ + F+ Sbjct: 240 EGGRPFKFLNFLADQNGFVEVVKEAWGSANHRF-KMKNIWVRLQAVKRALKSFHSKKFSK 298 Query: 2956 ISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHIN 3135 + R +L A Q + ++ + + T +++ L Q+++ + ++ Sbjct: 299 AHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQ-LRKWSTIDESILKQKSRIQWLS 357 Query: 3136 FSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTP- 3312 D ++K+F + +K RN I ++ + G+ + I + ++Y L G ++ + Sbjct: 358 LGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEA 417 Query: 3313 VDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXX 3492 +D V+ G +LS+ + L++P++ EI AL DI D KAPG DGF S FF Sbjct: 418 IDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIK 477 Query: 3493 XXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSR 3672 + +FF G + + +N T V+LIPK D+RPIAC + +YKII+KILT R Sbjct: 478 QEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKR 537 Query: 3673 MSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCIS 3852 + + +++ AQ+ FI R+I DN LA ELIR Y R+ ++ RC++K+D+RKAYD + Sbjct: 538 LQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRH-VSPRCVIKVDIRKAYDSVE 596 Query: 3853 WDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFL 4032 W FL +L L F FI WI+ CV + ++SI +NG ++GLRQGDP+SP LF Sbjct: 597 WVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFA 656 Query: 4033 FCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEF 4212 MEYLSR + F HPKC THL FADDLL+F R D S+ + A + F Sbjct: 657 LSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFNSF 716 Query: 4213 TATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSP 4392 + SGL + KS I+ GGV E +++ D P G+LP +YLG+PLASK L + P Sbjct: 717 SKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKP 776 Query: 4393 LISQIS 4410 LI +I+ Sbjct: 777 LIDKIT 782 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 400 bits (1029), Expect = e-108 Identities = 248/786 (31%), Positives = 400/786 (50%), Gaps = 12/786 (1%) Frame = +1 Query: 2089 WNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIVS 2268 WNIRG ++ + + +K G++ET L GW+F+ N+ Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67 Query: 2269 NGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLI 2448 G+I + W+ + V V +++ Q+I V S + ++ Y + R ++W ++ Sbjct: 68 LGKIWVMWDPS-VQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIV 126 Query: 2449 LHVPL----DAPAFVCGDFNCVQDPSERVGKRTPS-EKELADFVDTSAFLTLQDAPSTGC 2613 V D P V GDFN V +P E + + + + DF D L D G Sbjct: 127 NMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGN 186 Query: 2614 FFTFAGKD----VFSRIDRTLINTIWLENNWFCRTEFLPRGI-ISDHSACISTLFQHVQN 2778 FT+ K V +IDR L+N W N F + + + SDH +C L + Sbjct: 187 TFTWWNKSHTTPVAKKIDRILVNDSW--NALFPSSLGIFGSLDFSDHVSCGVVLEETSIK 244 Query: 2779 FKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNI 2958 K+ F+F N +++ F N +++NW V ++S KL L+ ++ +R +++ + Sbjct: 245 AKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSEL 304 Query: 2959 SEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINF 3138 ++ A L Q ++ DP +K L +E++F Q+++ Sbjct: 305 EKRTKEAHDFLIGCQDRTLADP-TPINASFELEAERKWHILTAAEESFFRQKSRISWFAE 363 Query: 3139 SDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVD 3318 D +TKYFH + N+IS + NG+ + I+ Y+ L G ++ Sbjct: 364 GDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDEVDPYLME 423 Query: 3319 WSVMGA--GFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXX 3492 + M +R S L S+ +IR ALF + +K+ GPDGFT+ FF Sbjct: 424 QNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVG 483 Query: 3493 XXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSR 3672 ++ EFFS G +L++ N T + LIPK + SDFRPI+C N +YK+I ++LT R Sbjct: 484 AEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIARLLTDR 543 Query: 3673 MSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCIS 3852 + L +IS AQSAF+ GR++ +N LA +L+ Y S I+ R M+K+DL+KA+D + Sbjct: 544 LQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNW-SNISPRGMLKVDLKKAFDSVR 602 Query: 3853 WDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFL 4032 W+F+ L L FI WI C+++ TF+++INGG+ GF + +GLRQGDP+SP LF+ Sbjct: 603 WEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFV 662 Query: 4033 FCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEF 4212 ME S L+H+R + +HPK ++ +HL FADD+++F G S+ + + LD+F Sbjct: 663 LAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDF 722 Query: 4213 TATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSP 4392 + SGL VNK KSH++L G+ E +GFP GTLP++YLGLPL ++ L +Y P Sbjct: 723 ASWSGLKVNKDKSHLYLAGLNQLESNANA-AYGFPIGTLPIRYLGLPLMNRKLRIAEYEP 781 Query: 4393 LISQIS 4410 L+ +I+ Sbjct: 782 LLEKIT 787 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 394 bits (1013), Expect = e-106 Identities = 238/791 (30%), Positives = 383/791 (48%), Gaps = 16/791 (2%) Frame = +1 Query: 2086 TWNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIV 2265 +WN+RG + ++ R K ILET + GW + N++ Sbjct: 6 SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFA 65 Query: 2266 SNGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSL 2445 + GRI + W+ V+V ++S Q I V F Y R +W L Sbjct: 66 ALGRIWVVWDP-AVEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSEL 124 Query: 2446 IL----HVPLDAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTGC 2613 L D P + GDFN DP + + + + +F + + D P G Sbjct: 125 ELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGN 184 Query: 2614 FFTF----AGKDVFSRIDRTLINTIWL-----ENNWFCRTEFLPRGIISDHSACISTLFQ 2766 +T+ + +IDR L+N WL FC EF SDH + Sbjct: 185 HYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEF------SDHCPSCVNISN 238 Query: 2767 HVQNFKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTH 2946 K F+ N M HP F ++ W + LS K L+ +R NR H Sbjct: 239 QSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREH 298 Query: 2947 FNNISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTK 3126 ++ + ++ A L+ Q P + + L +E+ FL Q+++ Sbjct: 299 YSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAGLEKEAHRSWAELALAEERFLCQKSRVL 357 Query: 3127 HINFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPR 3306 + D +T +FH ++ N I ++ + G + + +D++ +LFG ++ Sbjct: 358 WLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHL 417 Query: 3307 TPVDW-SVMGAGFRLSSDDQSA--LIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXX 3477 + S + + R D+ + L VS +I++ F + +K+PGPDG+TS FF Sbjct: 418 ISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKT 477 Query: 3478 XXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITK 3657 A+V EFF G +L + N T V+++PK + I++FRPI+C N +YK+I+K Sbjct: 478 WSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISK 537 Query: 3658 ILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKA 3837 +L R+ L ISP+QSAF+KGR + +N LA EL++ + + + I++R ++K+DLRKA Sbjct: 538 LLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQ-ANISSRGVLKVDLRKA 596 Query: 3838 YDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMS 4017 +D + W F+ + L N P F+ WI C+TS +FSI ++G G+ +G +GLRQGDP+S Sbjct: 597 FDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPLS 656 Query: 4018 PTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRD 4197 P+LF+ ME LSRL+ + + +HPK + + LAFADDL++F G S+R ++ Sbjct: 657 PSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKS 716 Query: 4198 ALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTT 4377 L+ F SGL +N KS ++ G+ +K++ L FGF GT P +YLGLPL + L Sbjct: 717 VLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYLGLPLLHRKLRR 775 Query: 4378 NDYSPLISQIS 4410 +DYS LI +I+ Sbjct: 776 SDYSQLIDKIA 786 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 393 bits (1010), Expect = e-106 Identities = 254/796 (31%), Positives = 392/796 (49%), Gaps = 23/796 (2%) Frame = +1 Query: 2089 WNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIVS 2268 WN+RG+ ++ K + ++ I ++ ++ET + W+ + N++ Sbjct: 6 WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNR 65 Query: 2269 NGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLI 2448 GRI + W N V ++ I Q++ +V + F + Y +E+R +W L Sbjct: 66 RGRIWVLWRKN-VRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELK 124 Query: 2449 LH----VPLDAPAFVCGDFNCVQDPSERVGKR-----TPSEKELADFVDTSAFLTLQDAP 2601 H + P + GDFN D +E TP + DF + +L D Sbjct: 125 DHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPG---MRDFQQVINYCSLTDMA 181 Query: 2602 STGCFFTFAGKD----VFSRIDRTLINTIWLENNWFCRT-EFLPRGIISDHSACISTLFQ 2766 + G FT+ K + ++DR LIN W N F ++ G SDH C +L Sbjct: 182 AQGPLFTWCNKREHGLIMKKLDRVLINDCW--NQTFSQSYSVFEAGGCSDHLRCRISLNS 239 Query: 2767 HVQNFK---KDFRFCNAWMEHPSFQNSLKENWVNAP---VREGKQEQLSAKLHRLRPILR 2928 N K F+F NA + F+ + W + + + S L L+P +R Sbjct: 240 EAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIR 299 Query: 2929 QLNRTHFNNISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLA 3108 + R N+S+KA A L A Q + +P + + + E+ +L Sbjct: 300 SMARDRLGNLSKKANEAYKILCAKQHVNLTNP-SSMAMEEENAAYSRWDRVAILEEKYLK 358 Query: 3109 QRAKTKHINFSDKSTKYFHSLVKRNMIRNTISFIRRENG--ETTGD-VQTIIADFIDYYS 3279 Q++K D++TK FH NTI I +G +T GD ++ F + Sbjct: 359 QKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFL 418 Query: 3280 DLFGKNTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTS 3459 L + + R S DQ +LIRPV+ EIR LF + DK+PGPDG+TS Sbjct: 419 QLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTS 478 Query: 3460 AFFXXXXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVV 3639 FF +V FF+KG + + +N TI++LIPK T + D+RPI+C NV+ Sbjct: 479 EFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVL 538 Query: 3640 YKIITKILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVK 3819 YK+I+KI+ +R+ L K I+ QSAF+K R +++N LA EL++ Y K I+ RC +K Sbjct: 539 YKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYH-KDTISTRCAIK 597 Query: 3820 IDLRKAYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLR 3999 ID+ KA+D + W FL +V L F FI+WI C+T+A+FS+ +NG G+ + RGLR Sbjct: 598 IDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLR 657 Query: 4000 QGDPMSPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPES 4179 QG +SP LF+ CM+ LS+++ A F +HPKC + THL+FADDL++ G S Sbjct: 658 QGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRS 717 Query: 4180 MRVLRDALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLA 4359 + + DEF SGL ++ KS ++L G+ + E+ D F F G LPV+YLGLPL Sbjct: 718 IERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLI 777 Query: 4360 SKSLTTNDYSPLISQI 4407 +K L+T D PL+ Q+ Sbjct: 778 TKRLSTTDCLPLLEQV 793 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 389 bits (999), Expect = e-105 Identities = 247/784 (31%), Positives = 400/784 (51%), Gaps = 18/784 (2%) Frame = +1 Query: 2089 WNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIVS 2268 WN+RG + + + +K G++ET L GW+F+ N++ Sbjct: 8 WNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENYEFSV 67 Query: 2269 NGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLI 2448 G+I + W+ + V V +I Q+I + S + F ++ Y R ++W+ L+ Sbjct: 68 LGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELWNELV 126 Query: 2449 L----HVPLDAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTGCF 2616 V + V GDFN + +P + +++ F L D G Sbjct: 127 QLALSPVVVGRSWIVLGDFNQILNPESAINANIG--RKIRAFRSCLLDSDLYDLVYKGSS 184 Query: 2617 FTF----AGKDVFSRIDRTLINTIWLENNWFCRTEFLPRGI--ISDHSACISTLFQHVQN 2778 +T+ + + + +IDR L+N W N + + G SDHS+C L V Sbjct: 185 YTWWNKCSSRPLAKKIDRILVNDHW---NTLFPSAYANFGEPDFSDHSSCEVVLDPAVLK 241 Query: 2779 FKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNI 2958 K+ FRF N ++ +P F ++ENW + V ++S KL L+ + +R ++++I Sbjct: 242 AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSDI 301 Query: 2959 SEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINF 3138 ++ + A A + QR + +P + +K Q L +E++F Q++ + Sbjct: 302 EKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAEESFFCQKSSISWLYE 360 Query: 3139 SDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLF--------GK 3294 D +T YFH + NTI+F+ + GE Q I ++ + F G+ Sbjct: 361 GDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVEGE 420 Query: 3295 NTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXX 3474 N+ D +++ FR S D + L R S ++I+ A F + +KA GPDG++S FF Sbjct: 421 NS-LAQSDMNLL-LSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKG 478 Query: 3475 XXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIIT 3654 +V EFF G +L++ N T + LIPK T+ ++DFRPI+C N +YK+I Sbjct: 479 VWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIA 538 Query: 3655 KILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRK 3834 K+LTSR+ L ++ISP+QSAF+ GR + +N LA E++ Y K+ I++R M+K+DLRK Sbjct: 539 KLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKN-ISSRGMLKVDLRK 597 Query: 3835 AYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPM 4014 A+D + WDF+ L F+ WI C+++ FS+ +NG S GF + +GLRQGDP+ Sbjct: 598 AFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQGDPL 657 Query: 4015 SPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLR 4194 SP LF+ ME S L+ AR A +HPK +HL FADD+++F G S+ + Sbjct: 658 SPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSSSLHGIS 717 Query: 4195 DALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLT 4374 +ALD+F + SGL VNK K++++L G E I +GFP TLP++YLGLPL S+ L Sbjct: 718 EALDDFASWSGLHVNKDKTNLYLAGTDEVEALAI-SHYGFPISTLPIRYLGLPLMSRKLK 776 Query: 4375 TNDY 4386 ++Y Sbjct: 777 ISEY 780 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 389 bits (998), Expect = e-105 Identities = 247/784 (31%), Positives = 400/784 (51%), Gaps = 18/784 (2%) Frame = +1 Query: 2089 WNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIVS 2268 WN+RG + + + +K G++ET L GW+F+ N++ Sbjct: 8 WNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENYEFSV 67 Query: 2269 NGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLI 2448 G+I + W+ + V V +I Q+I + S + F ++ Y R ++W+ L+ Sbjct: 68 LGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELWNELV 126 Query: 2449 L----HVPLDAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTGCF 2616 V + V GDFN + +P + +++ F L D G Sbjct: 127 QLALSPVVVGRSWIVLGDFNQILNPESAINANIG--RKIRAFRSCLLDSDLYDLVYKGSS 184 Query: 2617 FTF----AGKDVFSRIDRTLINTIWLENNWFCRTEFLPRGI--ISDHSACISTLFQHVQN 2778 +T+ + + + +IDR L+N W N + + G SDHS+C L V Sbjct: 185 YTWWNKCSSRPLAKKIDRILVNDHW---NTLFPSAYANFGEPDFSDHSSCEVVLDPAVLK 241 Query: 2779 FKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNI 2958 K+ FRF N ++ +P F ++ENW + V ++S KL L+ + +R ++++I Sbjct: 242 AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSDI 301 Query: 2959 SEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINF 3138 ++ + A A + QR + +P + +K Q L +E++F Q++ + Sbjct: 302 EKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAEESFFCQKSSISWLYE 360 Query: 3139 SDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLF--------GK 3294 D +T YFH + NTI+F+ + GE Q I ++ + F G+ Sbjct: 361 GDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVEGE 420 Query: 3295 NTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXX 3474 N+ D +++ FR S D + L R S ++I+ A F + +KA GPDG++S FF Sbjct: 421 NS-LAQSDMNLL-LSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKG 478 Query: 3475 XXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIIT 3654 +V EFF G +L++ N T + LIPK T+ ++DFRPI+C N +YK+I Sbjct: 479 VWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIA 538 Query: 3655 KILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRK 3834 K+LTSR+ L ++ISP+QSAF+ GR + +N LA E++ Y K+ I++R M+K+DLRK Sbjct: 539 KLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKN-ISSRGMLKVDLRK 597 Query: 3835 AYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPM 4014 A+D + WDF+ L F+ WI C+++ FS+ +NG S GF + +GLRQGDP+ Sbjct: 598 AFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQGDPL 657 Query: 4015 SPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLR 4194 SP LF+ ME S L+ AR A +HPK +HL FADD+++F G S+ + Sbjct: 658 SPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSSSLHGIS 717 Query: 4195 DALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLT 4374 +ALD+F + SGL VNK K++++L G E I +GFP TLP++YLGLPL S+ L Sbjct: 718 EALDDFASWSGLHVNKDKTNLYLAGTDEVEALAI-SHYGFPISTLPIRYLGLPLMSRKLK 776 Query: 4375 TNDY 4386 ++Y Sbjct: 777 ISEY 780 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 378 bits (970), Expect = e-101 Identities = 244/793 (30%), Positives = 384/793 (48%), Gaps = 13/793 (1%) Frame = +1 Query: 2074 MIIATWNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHN 2253 M + WNIRG+ ++ VR+ I + + + LET + L GW N Sbjct: 1 MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSN 60 Query: 2254 FDIVSNGRILLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDM 2433 + GRI + W+ + + V + Q++ ++ +F A YG + DR + Sbjct: 61 YCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSL 119 Query: 2434 WDSLIL---HVPLDA-PAFVCGDFNCVQDPSER--VGKRTPSEKELADFVDTSAFLTLQD 2595 W+ +++ PL P + GDFN + SE + + + + + D L D Sbjct: 120 WEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSD 179 Query: 2596 APSTGCFFTFAGKD----VFSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLF 2763 PS G FFT++ + ++DR L N W F P G SDH+ CI + Sbjct: 180 LPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILID 238 Query: 2764 QHVQNFKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRT 2943 KK F++ + HPS+ +L W + L L + R LNR Sbjct: 239 NQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTLNRL 298 Query: 2944 HFNNISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKT 3123 F+NI ++ + LE Q + P + K+ + ++F Q+++ Sbjct: 299 RFSNIQQRTAQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAAALESFFRQKSRI 357 Query: 3124 KHINFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFG---K 3294 + ++ D +T++FH V + N I F+R ++G +V I I YYS L G + Sbjct: 358 RWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIPSE 417 Query: 3295 NTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXX 3474 N V+ FR S S L S EI LF + +KAPGPDGF FF Sbjct: 418 NVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIE 477 Query: 3475 XXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIIT 3654 A++ EFF G + R N T ++LIPK T ++ FRP+AC +YK+IT Sbjct: 478 AWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVIT 537 Query: 3655 KILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRK 3834 +I++ R+ F+ + + Q FIKGR + +N LA EL+ +E G T R +++D+ K Sbjct: 538 RIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEA-DGETTRGCLQVDISK 596 Query: 3835 AYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPM 4014 AYD ++W+FL ++L L+ FI+WI C++SA++SI NG GF +GK+G+RQGDPM Sbjct: 597 AYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDPM 656 Query: 4015 SPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLR 4194 S LF+ M+ LS+ + F HP C + THL+FADD+L+F G S+ + Sbjct: 657 SSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGIL 716 Query: 4195 DALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLT 4374 LD+F SGL +N+ K+ + L G + + D G G+LPV+YLG+PL S+ + Sbjct: 717 TILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKMR 776 Query: 4375 TNDYSPLISQISN 4413 DY PL+ +I++ Sbjct: 777 RQDYQPLVDRINS 789 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 368 bits (944), Expect = 2e-98 Identities = 239/784 (30%), Positives = 378/784 (48%), Gaps = 13/784 (1%) Frame = +1 Query: 2101 GMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQGWNFMHNFDIVSNGRI 2280 G+ ++ VR+ I + + + LET + L GW N+ GRI Sbjct: 53 GLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSNYCCSELGRI 112 Query: 2281 LLCWNSNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLIL--- 2451 + W+ + + V + Q++ ++ +F A YG + DR +W+ +++ Sbjct: 113 WIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSLWEDILVLSR 171 Query: 2452 HVPLDA-PAFVCGDFNCVQDPSER--VGKRTPSEKELADFVDTSAFLTLQDAPSTGCFFT 2622 PL P + GDFN + SE + + + + + D L D PS G FFT Sbjct: 172 TSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVFFT 231 Query: 2623 FAGKD----VFSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLFQHVQNFKKD 2790 ++ + ++DR L N W F P G SDH+ CI + KK Sbjct: 232 WSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSKKS 290 Query: 2791 FRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKA 2970 F++ + HPS+ +L W + L L + R LNR F+NI ++ Sbjct: 291 FKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRT 350 Query: 2971 TVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKS 3150 + LE Q + P + K+ + ++F Q+++ + ++ D + Sbjct: 351 AQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDAN 409 Query: 3151 TKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFG---KNTPRTPVDW 3321 T++FH V + N I F+R ++G +V I I YYS L G +N V+ Sbjct: 410 TRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSVEK 469 Query: 3322 SVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXX 3501 FR S S L S EI LF + +KAPGPDGF FF Sbjct: 470 IKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSV 529 Query: 3502 XASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSP 3681 A++ EFF G + R N T ++LIPK T ++ FRP+AC +YK+IT+I++ R+ Sbjct: 530 VAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKL 589 Query: 3682 FLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDF 3861 F+ + + Q FIKGR + +N LA EL+ +E G T R +++D+ KAYD ++W+F Sbjct: 590 FIDQAVQANQVGFIKGRLLCENVLLASELVDNFEA-DGETTRGCLQVDISKAYDNVNWEF 648 Query: 3862 LRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCM 4041 L ++L L+ FI+WI C++SA++SI NG GF +GK+G+RQGDPMS LF+ M Sbjct: 649 LINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLVM 708 Query: 4042 EYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTAT 4221 + LS+ + F HP C + THL+FADD+L+F G S+ + LD+F Sbjct: 709 DVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGILTILDDFRQG 768 Query: 4222 SGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLIS 4401 SGL +N+ K+ + L G + + D G G+LPV+YLG+PL S+ + DY PL+ Sbjct: 769 SGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLVD 828 Query: 4402 QISN 4413 +I++ Sbjct: 829 RINS 832 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 358 bits (920), Expect = 1e-95 Identities = 219/662 (33%), Positives = 341/662 (51%), Gaps = 12/662 (1%) Frame = +1 Query: 2461 LDAPAFVCGDFNCVQDPSER-VGKRTPSEKELADFVDTSAFLTLQDAPSTGCFFTFAGK- 2634 +D P V GDFN + PSE ++ F +T +L D G FT+ K Sbjct: 32 IDKPWTVLGDFNQILHPSEHSTSDGFNVDRPTRIFRETILLASLTDLSFRGNTFTWWNKR 91 Query: 2635 ---DVFSRIDRTLINTIWLEN-----NWFCRTEFLPRGIISDHSACISTLFQHVQNFKKD 2790 V ++DR L+N W F +F SDHS+C +L KK Sbjct: 92 SRAPVAKKLDRILVNDKWTTTFPSSLGLFGEPDF------SDHSSCELSLMSASPRSKKP 145 Query: 2791 FRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKA 2970 FRF N ++ +F + + W + V ++S KL L+ ++R +R ++++I ++ Sbjct: 146 FRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRT 205 Query: 2971 TVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKS 3150 A L AQ P +K + L +E +F QR++ + D + Sbjct: 206 KEAHDALLLAQSVLLASPC-PSNAAIEAETQRKWRILAEAEASFFYQRSRVNWLREGDMN 264 Query: 3151 TKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVDWSVM 3330 + YFH + N I F+ G+ Q + ++Y+ G + + + Sbjct: 265 SSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENHCVEYFQSNLGSEQGLPLFEQADI 324 Query: 3331 G--AGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXX 3504 +R S Q +L P S +I+NA F + +KA GPDGF+ FF Sbjct: 325 SNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVT 384 Query: 3505 ASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPF 3684 ++ EFF+ G +L++ N T + LIPK T+ +SDFRPI+C N VYK+I+K+LT R+ F Sbjct: 385 EAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDFRPISCLNTVYKVISKLLTDRLKDF 444 Query: 3685 LQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFL 3864 L IS +QSAF+ GR ++N LA EL+ Y +K+ I M+K+DLRKA+D + WDF+ Sbjct: 445 LPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKN-IAPSSMLKVDLRKAFDSVRWDFI 503 Query: 3865 RDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCME 4044 L LN F WIL C+++A+FS+ +NG S G +GLRQGDPMSP LF+ ME Sbjct: 504 VSALRALNVPEKFTCWILECLSTASFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAME 563 Query: 4045 YLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATS 4224 S L+ +R + +HPK + + +HL FADD+++F G S+ + ++L++F S Sbjct: 564 VFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWS 623 Query: 4225 GLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQ 4404 GL +N +K+ ++ G+ E + +GF G+LPV+YLGLPL S+ LT +Y+PLI + Sbjct: 624 GLLMNTNKTQLYHAGLSQSESDSMAS-YGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEK 682 Query: 4405 IS 4410 I+ Sbjct: 683 IT 684 >emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana] Length = 1141 Score = 353 bits (905), Expect = 5e-94 Identities = 240/771 (31%), Positives = 384/771 (49%), Gaps = 26/771 (3%) Frame = +1 Query: 2170 GILETXXXXXXXXXXXPTFLQGWNFMHNFDIVSNGRILLCWNSNTVDVNIISVEKQVIHA 2349 G++E L GW F N+ G+I + W+ + V+V I++ Q+I Sbjct: 27 GVIEKHVKQPKDKKFINALLPGWFFDENYGFSDLGKIWVLWDPS-VEVVIVAKSLQMITC 85 Query: 2350 NVTCRISGNNFHYALCYGFYTIEDRMDMWDSLIL----HVPLDAPAFVCGDFNCVQDPSE 2517 V S ++ Y + R ++W + V + P + GDFN V P E Sbjct: 86 EVLFPNSRTWIVISVVYAANEDDKRKELWREITALVASPVTFNRPWILLGDFNQVLHPHE 145 Query: 2518 RVGKRTPS-EKELADFVDTSAFLTLQDAPSTGCFFTFAGKD----VFSRIDRTLINTIWL 2682 + + ++ + DF + L D G FT+ K V +IDR L+N W Sbjct: 146 HSRHVSLNVDRRIRDFRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESW- 204 Query: 2683 ENNWFCRTE--FLPRGIISDHSACISTLFQHVQNFKKDFRFCNAWMEHPSFQNSLKENWV 2856 +N F + F P SDH++C L K+ F+F N +++P F N + + W Sbjct: 205 -SNLFPSSFGLFGPPDF-SDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWY 262 Query: 2857 NAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKATVARAELEAAQRQSDRDPLNXX 3036 + V ++S KL L+ ++ +R +++N+ ++ A L + Q + +P + Sbjct: 263 STNVVGSSMFRVSKKLKALKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNP-SLE 321 Query: 3037 XXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKSTKYFHSLVKRNMIRNTISFIRR 3216 +K Q L T+E++F QR++ D +T+YFH + NTI+ + Sbjct: 322 NAAHELEAQRKWQILATAEESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVD 381 Query: 3217 ENGETTGDVQTIIADFIDYYSD--LFGKNTPRTPVDWSVMGAGFRLSSDDQSALIR---P 3381 ++G T D Q IAD Y + L N P + L DD + L+ P Sbjct: 382 DSG-TQIDSQQGIADHCALYFENLLSDDNDP------------YSLEQDDMNLLLTYRCP 428 Query: 3382 VSHI----------EIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXXASVDEFFSK 3531 S + +I+ A F + +KA GPDGF A+V EFF Sbjct: 429 YSQVADLEAMFSDEDIKAAFFGLPSNKACGPDGFPVT--------------AAVREFFIS 474 Query: 3532 GIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPFLQKLISPAQ 3711 G +L++ N T + LIPK + SDFRPI+C N +YK+I ++LT R+ L +ISP+Q Sbjct: 475 GNLLKQWNATTIVLIPKFPNASCTSDFRPISCMNTLYKVIARLLTDRLQKLLSCVISPSQ 534 Query: 3712 SAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLYGLNF 3891 SAF+ GR + +N LA E++ Y ++ I+ R M+K+DLRKA+D + W+F+ L L Sbjct: 535 SAFLPGRLLAENVLLATEMVHGYNWRN-ISLRGMLKVDLRKAFDSVRWEFIIAALLALGV 593 Query: 3892 HPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHAR 4071 FI WI C+++ TF++++NG GF + +GLRQGDP+SP LF+ ME S+L+++R Sbjct: 594 PTKFINWIHQCISTPTFTVSVNGCCGGFFKSAKGLRQGDPLSPYLFVLAMEVFSKLLNSR 653 Query: 4072 THASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATSGLTVNKSKS 4251 + +HPK + +HL FADD+++F G S+ + + L++F + SGL VN KS Sbjct: 654 FDSGYIRYHPKASDLSISHLMFADDVMIFFDGGSSSLHGICETLEDFASWSGLKVNNDKS 713 Query: 4252 HIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQ 4404 H F G+ E+ L +GFP+G LP++YLGLPL + L +Y PL+ + Sbjct: 714 HFFCAGLEQAERNS-LAAYGFPQGCLPIRYLGLPLMCRKLRIAEYEPLLEK 763 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 352 bits (902), Expect = 1e-93 Identities = 222/685 (32%), Positives = 347/685 (50%), Gaps = 20/685 (2%) Frame = +1 Query: 2413 IEDRMDMWDSLILH----VPLDAPAFVCGDFNCVQDPSERVGKRTP--SEKELADFVDTS 2574 +E+R ++W+ L H + P + GDFN + D E R + + DF Sbjct: 1 MEERKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAV 60 Query: 2575 AFLTLQDAPSTGCFFTFAGKD----VFSRIDRTLINTIWLENNWFCRT-EFLPRGIISDH 2739 ++ D G FT++ K + ++DR L+N +WL++ F R+ G SDH Sbjct: 61 NHCSITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQS--FPRSYSVFEAGGCSDH 118 Query: 2740 SACISTL---FQHVQNFKKDFRFCNAWMEHPSFQNSLKENWVNAP---VREGKQEQLSAK 2901 C L V K+ F+F N E F +++ W + + S K Sbjct: 119 LRCRINLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKK 178 Query: 2902 LHRLRPILRQLNRTHFNNISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHL 3081 L L+P+LR L + N+ ++ A L Q +P + K H+ Sbjct: 179 LKGLKPLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANP-SPSSMQEENEAYAKWDHI 237 Query: 3082 DTSEKNFLAQRAKTKHINFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIAD 3261 E+ FL QR+K ++ D++ K FH V +N+I I +G + I + Sbjct: 238 AVLEEKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTE 297 Query: 3262 FIDYYSD---LFGKNTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDK 3432 ++ + L + V+ +R S D+ L VS EI +F + +DK Sbjct: 298 AEHHFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDK 357 Query: 3433 APGPDGFTSAFFXXXXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDF 3612 +PGPDG+T+ F+ ++ FF+KG + + +N TI++LIPK + D+ Sbjct: 358 SPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDY 417 Query: 3613 RPIACTNVVYKIITKILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKS 3792 RPI+C NV+YK+I+KI+ +R+ L K I QSAF+K R +++N LA E+++ Y + S Sbjct: 418 RPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDS 477 Query: 3793 GITARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHG 3972 +++RC +KID+ KA+D + W FL +VL +NF P F +WI C+T+A+FS+ +NG G Sbjct: 478 -VSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAG 536 Query: 3973 FVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLL 4152 R LRQG +SP LF+ M+ LS+++ A F +HPKC + THL+FADDL+ Sbjct: 537 VFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLM 596 Query: 4153 LFGRGDPESMRVLRDALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLP 4332 + G S+ + L EF SGL ++ KS ++L GV+ QEI+ F F G LP Sbjct: 597 ILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLP 656 Query: 4333 VKYLGLPLASKSLTTNDYSPLISQI 4407 V+YLGLPL SK LT +D PLI Q+ Sbjct: 657 VRYLGLPLVSKRLTASDCLPLIEQL 681 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 349 bits (895), Expect = 8e-93 Identities = 214/656 (32%), Positives = 339/656 (51%), Gaps = 14/656 (2%) Frame = +1 Query: 2485 GDFNCVQDPSERVGKRTPS---EKELADFVDTSAFLTLQDAPSTGCFFTFAGKD----VF 2643 GDFN V P E PS ++ + DF + + L D G FT+ K + Sbjct: 3 GDFNQVLLPQEH--SNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 2644 SRIDRTLINTIWLE-----NNWFCRTEFLPRGIISDHSACISTLFQHVQNFKKDFRFCNA 2808 ++DR L N W + F +F SDH +C L + + K+ F+F N Sbjct: 61 KKLDRILANDSWCNLYPSSHGLFGNLDF------SDHVSCGVVLEANGISAKRPFKFFNF 114 Query: 2809 WMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKATVARAE 2988 +++ F N + +NW + V ++S KL ++ ++ +R +++ I + A Sbjct: 115 LLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHEL 174 Query: 2989 LEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKSTKYFHS 3168 L Q + +P + +K L +E++F QR++ D +T YFH Sbjct: 175 LITCQNLTLANP-SVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHR 233 Query: 3169 LVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVDWSVMGA--GF 3342 +V NTI+ + NG Q I+ + YY L G ++ M + Sbjct: 234 MVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTY 293 Query: 3343 RLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXXASVDEF 3522 R S D S L + + EI+ A + +K GPDG++ FF A++ EF Sbjct: 294 RCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEF 353 Query: 3523 FSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPFLQKLIS 3702 F G +L++ N T + LIPKT++ IS+FRPI+C N +YK+I+K+LTSR+ L +I Sbjct: 354 FDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIG 413 Query: 3703 PAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLYG 3882 +QSAF+ GR++ +N LA E++ Y R + I+ R M+K+DL+KA+D + W+F+ L Sbjct: 414 HSQSAFLPGRSLAENVLLATEMVHGYNRLN-ISPRGMLKVDLKKAFDSVKWEFVTAALRA 472 Query: 3883 LNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLI 4062 L +I WI C+T+ +F+I++NG + GF R +GLRQGDP+SP LF+ ME S+L+ Sbjct: 473 LAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLL 532 Query: 4063 HARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATSGLTVNK 4242 ++R + +HPK +HL FADD+++F G SM + + LD+F SGL VNK Sbjct: 533 YSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNK 592 Query: 4243 SKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQIS 4410 KS +F G+ ++ +GFP GT P++YLGLPL + L DY PL+ ++S Sbjct: 593 DKSQLFQAGL-DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLS 647 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 349 bits (895), Expect = 8e-93 Identities = 214/656 (32%), Positives = 339/656 (51%), Gaps = 14/656 (2%) Frame = +1 Query: 2485 GDFNCVQDPSERVGKRTPS---EKELADFVDTSAFLTLQDAPSTGCFFTFAGKD----VF 2643 GDFN V P E PS ++ + DF + + L D G FT+ K + Sbjct: 3 GDFNQVLLPQEH--SNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 2644 SRIDRTLINTIWLE-----NNWFCRTEFLPRGIISDHSACISTLFQHVQNFKKDFRFCNA 2808 ++DR L N W + F +F SDH +C L + + K+ F+F N Sbjct: 61 KKLDRILANDSWCNLYPSSHGLFGNLDF------SDHVSCGVVLEANGISAKRPFKFFNF 114 Query: 2809 WMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKATVARAE 2988 +++ F N + +NW + V ++S KL ++ ++ +R +++ I + A Sbjct: 115 LLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHEL 174 Query: 2989 LEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKSTKYFHS 3168 L Q + +P + +K L +E++F QR++ D +T YFH Sbjct: 175 LITCQNLTLANP-SVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHR 233 Query: 3169 LVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVDWSVMGA--GF 3342 +V NTI+ + NG Q I+ + YY L G ++ M + Sbjct: 234 MVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTY 293 Query: 3343 RLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXXASVDEF 3522 R S D S L + + EI+ A + +K GPDG++ FF A++ EF Sbjct: 294 RCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEF 353 Query: 3523 FSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPFLQKLIS 3702 F G +L++ N T + LIPKT++ IS+FRPI+C N +YK+I+K+LTSR+ L +I Sbjct: 354 FDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIG 413 Query: 3703 PAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLYG 3882 +QSAF+ GR++ +N LA E++ Y R + I+ R M+K+DL+KA+D + W+F+ L Sbjct: 414 HSQSAFLPGRSLAENVLLATEMVHGYNRLN-ISPRGMLKVDLKKAFDSVKWEFVTAALRA 472 Query: 3883 LNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLI 4062 L +I WI C+T+ +F+I++NG + GF R +GLRQGDP+SP LF+ ME S+L+ Sbjct: 473 LAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLL 532 Query: 4063 HARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATSGLTVNK 4242 ++R + +HPK +HL FADD+++F G SM + + LD+F SGL VNK Sbjct: 533 YSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNK 592 Query: 4243 SKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQIS 4410 KS +F G+ ++ +GFP GT P++YLGLPL + L DY PL+ ++S Sbjct: 593 DKSQLFQAGL-DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLS 647 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 342 bits (878), Expect = 7e-91 Identities = 198/552 (35%), Positives = 295/552 (53%), Gaps = 6/552 (1%) Frame = +1 Query: 2782 KKDFRFCNAWMEHPSFQNSLKENWVNAP---VREGKQEQLSAKLHRLRPILRQLNRTHFN 2952 +K F+F N + P F ++ +W ++ V + S KL L+P LR+L + Sbjct: 545 RKPFKFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKLG 604 Query: 2953 NISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHI 3132 ++ ++ A L Q + +P + HL E+ FL Q++K + Sbjct: 605 DLPKRTREAHILLCEKQATTLANP-SQETIAEELKAYTDWTHLSELEEGFLKQKSKLHWM 663 Query: 3133 NFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPR-- 3306 N D + YFH + +RN+I IR N ET + I + ++++ + + Sbjct: 664 NVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFH 723 Query: 3307 -TPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXX 3483 V+ +R S DQ+ L R V+ EI+ LF + ++K+PGPDG+TS FF Sbjct: 724 GISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWS 783 Query: 3484 XXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKIL 3663 A++ FF KG + + LN TI++LIPK + D+RPI+C NV+YK+I+KIL Sbjct: 784 LTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKIL 843 Query: 3664 TSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYD 3843 +R+ L I QSAF+K R +M+N LA EL++ Y ++S +T RC +KID+ KA+D Sbjct: 844 ANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKES-VTPRCAMKIDISKAFD 902 Query: 3844 CISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPT 4023 + W FL + L LNF F +WI C+++ATFS+ +NG GF RGLRQG +SP Sbjct: 903 SVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPY 962 Query: 4024 LFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDAL 4203 LF+ CM LS +I +HPKC THL FADDL++F G S+ + + Sbjct: 963 LFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVF 1022 Query: 4204 DEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTND 4383 EF SGL ++ KS I+L GV ++ + L F F G LPV+YLGLPL +K +TT D Sbjct: 1023 KEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTAD 1082 Query: 4384 YSPLISQISNFI 4419 YSPLI + I Sbjct: 1083 YSPLIEAVKTKI 1094 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 323 bits (828), Expect = 4e-85 Identities = 205/641 (31%), Positives = 329/641 (51%), Gaps = 16/641 (2%) Frame = +1 Query: 2422 RMDMWDSLIL-HVPLDA---PAFVCGDFNCVQDPSERVGKRTPS-EKELADFVDTSAFLT 2586 R ++W+ L+L V L P + GDFN V P+E + + + + F D Sbjct: 67 RKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVNRRMKVFRDCLFEAE 126 Query: 2587 LQDAPSTGCFFTF----AGKDVFSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACIS 2754 L D G FT+ A + V ++DR L+N W F SDH++C Sbjct: 127 LCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYAVF-GEPDFSDHASCGV 185 Query: 2755 TLFQHVQNFKKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQL 2934 + + K+ FRF N +++P F + + E W + V ++S KL L+ +R Sbjct: 186 IINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSKKLKALKNPIRTF 245 Query: 2935 NRTHFNNISEKATVARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQR 3114 + +F+N+ ++ A + Q ++ DP +K L +E++F QR Sbjct: 246 SMENFSNLEKRVKEAHNLVLYRQNKTLSDP-TIPNAALEMEAQRKWLILVKAEESFFCQR 304 Query: 3115 AKTKHINFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGK 3294 ++ + D +T YFH + NTI I +NG I I+Y+S+L G Sbjct: 305 SRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIEYFSNLLGG 364 Query: 3295 NTPRTPV---DWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAF 3465 + D+ ++ FR S D + L S +I++A F +K GPDGF F Sbjct: 365 EVGPPMLIQEDFDLL-LPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGPDGFPVEF 423 Query: 3466 FXXXXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTN---- 3633 F +V EFF+ ++L++ N T + LIPK T+ ++DFRPI+C + Sbjct: 424 FKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNASKMNDFRPISCNDFGPI 483 Query: 3634 VVYKIITKILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCM 3813 +YK+I ++LT+R+ L ++ISP QSAF+ GR + +N LA EL++ Y R++ I R M Sbjct: 484 TLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATELVQGYNRQN-IDPRGM 542 Query: 3814 VKIDLRKAYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRG 3993 +K+DLRKA+D I WDF+ L + F+YWI C+++ TFS+ +NG + GF + RG Sbjct: 543 LKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSVCVNGNTGGFFKSTRG 602 Query: 3994 LRQGDPMSPTLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDP 4173 LRQG+P+SP LF+ ME S L+++R A +HPK + +HL FADD+++F G Sbjct: 603 LRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISHLMFADDIMVFFDGGS 662 Query: 4174 ESMRVLRDALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEI 4296 S+ + +AL++F SGL +N+ K+H++L G+ E I Sbjct: 663 SSLHGISEALEDFAFWSGLVLNREKTHLYLAGLDRIEASTI 703 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 317 bits (811), Expect = 4e-83 Identities = 169/424 (39%), Positives = 241/424 (56%), Gaps = 2/424 (0%) Frame = +1 Query: 3154 KYFHSLVKRNMIRNTISFIRRENGETT--GDVQTIIADFIDYYSDLFGKNTPRTPVDWSV 3327 K FH V +N I I +G D+ F + L ++ V Sbjct: 22 KTFHRAVIERETKNMIKEIYCTDGRVVQGDDIMVEAEKFFKEFLQLIPEDFVGVEVRELQ 81 Query: 3328 MGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXXA 3507 FR ++ D L R VS EI+ LF + DK+PGPDG+TS F+ Sbjct: 82 DLLQFRCTNSDNEMLTREVSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTL 141 Query: 3508 SVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPFL 3687 V FF KG + + +N I++LIPK + D+RPI+C NV+YK+I+KI+ +R+ L Sbjct: 142 PVQSFFQKGFLPKGINSIILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLLL 201 Query: 3688 QKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLR 3867 + I+ QSAF+K R +++N LA EL++ Y + S I+ARC +KID+ KA+D + W FL Sbjct: 202 PRFIAENQSAFVKDRLLIENLLLATELVKDYHKDS-ISARCAIKIDISKAFDSVQWSFLT 260 Query: 3868 DVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEY 4047 + L +NF P FI+WI C+T+A+FS+ +NG G+ + KRGLRQG +SP LF+ CM+ Sbjct: 261 NTLVAMNFSPTFIHWINLCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDV 320 Query: 4048 LSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATSG 4227 LS+++ F HPKC THL+FADDL++ G S+ + + DEF SG Sbjct: 321 LSKMLDKAAGVRKFGFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSG 380 Query: 4228 LTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQI 4407 L ++ KS +++ GV P KQEI F F G LPV+YLGLPL +K LT+ DYSPL+ QI Sbjct: 381 LRISLEKSTLYMAGVSPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQI 440 Query: 4408 SNFI 4419 I Sbjct: 441 KKRI 444 >emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1383 Score = 313 bits (803), Expect = 4e-82 Identities = 230/792 (29%), Positives = 376/792 (47%), Gaps = 16/792 (2%) Frame = +1 Query: 2080 IATWNIRGMQQAPKKNAVRALITKHKIDIFGILETXXXXXXXXXXXPTFLQG---WNFMH 2250 I +WNIRG+ K+ ++R LI + + ET + W F Sbjct: 4 ILSWNIRGLNARMKRASLRKLIAINNPGCVFVQETKMENINARLMRTCWKSNEIEWIFSP 63 Query: 2251 NFDIVSNGRILLCWNSNTVDVNIISVEKQVIHAN---VTCRISGNNFHYALC--YGFYTI 2415 + S+G IL W D NI + VIH + ++ S + F L Y I Sbjct: 64 SRG--SSGGILAIW-----DKNIFNANSNVIHQSWIAISGIFSTDQFECTLITVYNPCEI 116 Query: 2416 EDRMDMWDSLI-LHVPLDAPAFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQ 2592 R ++W +I P + GDFN V PSER G + S + DF L L Sbjct: 117 AARSEVWKQIIEFQNSNPLPCLLVGDFNEVLRPSER-GSLSFSHNGINDFKSFVQELKLL 175 Query: 2593 DAPSTGCFFTFAGKDVFSRIDRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLFQHV 2772 + PS+ +T+ + S +DR L++ W+ + + L RG+ SDH C + H+ Sbjct: 176 EIPSSSRAYTWYRANSKSLLDRLLVSPEWVSHCPNIKVSILQRGL-SDH--CPLLVHSHI 232 Query: 2773 QNF-KKDFRFCNAWMEHPSFQNSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHF 2949 Q + K FRF N W+ P ++ +W ++P + + KL + L++ N F Sbjct: 233 QEWGPKPFRFNNCWLTDPKCMKIVEASWSSSP-----KISVVEKLKETKKRLKEWNLNEF 287 Query: 2950 NNISEKAT-----VARAELEAAQRQSDRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQR 3114 +I +A + EA +R+ D++ L + ++ + AQR Sbjct: 288 GSIDANIRKLEDCIANFDKEADERELDKEELEKRREAQADLWKWMKR-----KEIYWAQR 342 Query: 3115 AKTKHINFSDKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGK 3294 ++ + DK+TK+FH++ +N ++ I + G++T D I + ++ +F + Sbjct: 343 SRITWLKAGDKNTKFFHAIASNKKRKNMMACIETD-GQSTNDPSQIKKEARAFFKKIFKE 401 Query: 3295 NTPRTPVDWSVMGAGFRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXX 3474 + + P ++ RLS + ++LI P + EI A+ DKAPGPDGF F Sbjct: 402 DHVKRPTLENLHLK--RLSQNQANSLITPFTTEEIDTAVSSCASDKAPGPDGFNFKFVKS 459 Query: 3475 XXXXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIIT 3654 V++F+ G + + N ++LIPK + + D+RPI+ +YKI+ Sbjct: 460 AWDIIKTDIYGIVNDFWETGCLPQGCNTAYIALIPKIDNPSSLKDYRPISMVGFIYKIVA 519 Query: 3655 KILTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRK 3834 K+L R+ + LISP QS+++KGR I+D +A E+I + ++++ ++K+D K Sbjct: 520 KLLAKRLQSVISSLISPLQSSYVKGRQILDGALVASEIIESCKKRN--IEAILLKLDFHK 577 Query: 3835 AYDCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPM 4014 AYD +SW+FL+ L +NF + WI TCVTSA+ SI +NG + RGLRQGDP+ Sbjct: 578 AYDSVSWNFLQWTLDQMNFPVKWCEWIKTCVTSASASILVNGSPTPPFKLHRGLRQGDPL 637 Query: 4015 SPTLFLFCMEYLSRLIHARTHASTFVHHPKCN-STDTTHLAFADDLLLFGRGDPESMRVL 4191 SP LF+ E LS++I T + P C+ ++ THL +ADD L+F + S++ + Sbjct: 638 SPFLFVLVGEVLSQMISKATSLQLWRGIPACSRGSEITHLQYADDTLMFCEANTNSLKNI 697 Query: 4192 RDALDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSL 4371 + L F SGL VN KS + V QE + GT+P YLGLP+ Sbjct: 698 QKTLIIFQLVSGLQVNFHKSSLMGLNVTSSWIQEAANSLMCKIGTIPFSYLGLPIGDNPA 757 Query: 4372 TTNDYSPLISQI 4407 + P+I ++ Sbjct: 758 RIRTWDPIIDKL 769 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 309 bits (791), Expect = 9e-81 Identities = 169/429 (39%), Positives = 244/429 (56%), Gaps = 7/429 (1%) Frame = +1 Query: 3142 DKSTKYFHSLVKRNMIRNTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVDW 3321 D++ K FH + N+I I +G Q I + ++Y+ D P D+ Sbjct: 91 DRNNKTFHRAITTREAVNSIREIVTRDGLVVTSQQDIQTEAVNYFQDFL----QTIPADY 146 Query: 3322 SVMGAG-------FRLSSDDQSALIRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXX 3480 M FR S DD L R V+ EI+ +F + DK+PGPDG+TS F+ Sbjct: 147 EGMCVEELENLLPFRCSEDDHRLLTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFYKASW 206 Query: 3481 XXXXXXXXASVDEFFSKGIILRKLNHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKI 3660 ++ FF+KG + + +N TI++LIPK I D+RPI+C NV+YK I+KI Sbjct: 207 EIIGDEVIIAIQSFFAKGFLPKGVNSTILALIPKKKEAREIKDYRPISCCNVLYKAISKI 266 Query: 3661 LTSRMSPFLQKLISPAQSAFIKGRNIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAY 3840 L +R+ L K I QSAF+K R +++N LA EL++ Y + S I+ RC +KID+ KA+ Sbjct: 267 LANRLKRILPKFIVGNQSAFVKDRLLIENVLLATELVKDYHKDS-ISTRCAMKIDISKAF 325 Query: 3841 DCISWDFLRDVLYGLNFHPCFIYWILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSP 4020 D + W FL VL +NF FI+WI C+++A+FSI +NG G+ R RGLRQG +SP Sbjct: 326 DSLQWSFLTHVLAAMNFPGEFIHWISLCMSTASFSIQVNGELAGYFRSARGLRQGCSLSP 385 Query: 4021 TLFLFCMEYLSRLIHARTHASTFVHHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDA 4200 LF+ M+ LSR++ A F +HP+C + THL FADDL++ G S+ + Sbjct: 386 YLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKV 445 Query: 4201 LDEFTATSGLTVNKSKSHIFLGGVRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTN 4380 L++F A GL + K+ ++L GV +Q + + F G LPV+YLGLPL +K LTT+ Sbjct: 446 LNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTS 505 Query: 4381 DYSPLISQI 4407 DYSPLI QI Sbjct: 506 DYSPLIDQI 514 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 304 bits (779), Expect = 2e-79 Identities = 214/710 (30%), Positives = 335/710 (47%), Gaps = 2/710 (0%) Frame = +1 Query: 2296 SNTVDVNIISVEKQVIHANVTCRISGNNFHYALCYGFYTIEDRMDMWDSLI-LHVPLDAP 2472 S T +I Q +H +T F + Y T +R +WD L L ++ P Sbjct: 953 SPTAKNYVIFDHPQCLHVRLTSPWLETPFFVTIVYAKCTRSERTLLWDCLRRLADDIEVP 1012 Query: 2473 AFVCGDFNCVQDPSERVGKRTPSEKELADFVDTSAFLTLQDAPSTGCFFTFAGKDVFSRI 2652 V GDFN + ER+ P E + DF T L D G FT+ +F R+ Sbjct: 1013 WLVGGDFNVILKREERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNSFTWTNNRMFQRL 1072 Query: 2653 DRTLINTIWLENNWFCRTEFLPRGIISDHSACISTLFQHVQNFKKDFRFCNAWMEHPSFQ 2832 DR + N W+ R + L R SDH + + F + FRF +AW+ H F+ Sbjct: 1073 DRIVYNHHWINKFPVTRIQHLNRDG-SDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFK 1131 Query: 2833 NSLKENWVNAPVREGKQEQLSAKLHRLRPILRQLNRTHFNNISEKATVARAELEAAQRQS 3012 S++ NW N P+ + +K HRL+ L+ N+ F +I K A +E + Sbjct: 1132 TSVESNW-NLPINGSGLQAFWSKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILH 1190 Query: 3013 DRDPLNXXXXXXXXXXXKKCQHLDTSEKNFLAQRAKTKHINFSDKSTKYFHSLVKRNMIR 3192 ++ + + L+ E F Q++ K + +++TK+FH +++ IR Sbjct: 1191 QQEQTFESRIKLNKSYAQLNKQLNIEEL-FWKQKSGVKWVVEGERNTKFFHMRMQKKRIR 1249 Query: 3193 NTISFIRRENGETTGDVQTIIADFIDYYSDLFGKNTPRTPVDWSVMGAGFRLSSDDQSAL 3372 + I ++ G D + + I+Y+S L K P + +S+ + L Sbjct: 1250 SHIFKVQDPEGRWIEDQEQLKHSAIEYFSSLL-KVEPCYDSRFQSSLIPSIISNSENELL 1308 Query: 3373 IRPVSHIEIRNALFDIGDDKAPGPDGFTSAFFXXXXXXXXXXXXASVDEFFSKGIILRKL 3552 S E+++A+F I + A GPDGF+S F+ +V +FF I R + Sbjct: 1309 CAEPSLQEVKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGV 1368 Query: 3553 NHTIVSLIPKTTHDPGISDFRPIACTNVVYKIITKILTSRMSPFLQKLISPAQSAFIKGR 3732 T + L+PK + SDFRPI+ V+ KIITK+L++R++ L +I+ QS F+ GR Sbjct: 1369 TSTTLILLPKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGR 1428 Query: 3733 NIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLYGLNFHPCFIYW 3912 I DN LAQELI KS +K+D+ KAYD + W FL VL F+ +I Sbjct: 1429 LISDNILLAQELIGKLNTKSR-GGNLALKLDMMKAYDKLDWSFLFKVLQHFGFNGQWIKM 1487 Query: 3913 ILTCVTSATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHASTFV 4092 I C+++ FS+ +NG + G+ + +RGLRQGD +SP LF+ EYLSR ++A + Sbjct: 1488 IQKCISNCWFSLLLNGRTEGYFKSERGLRQGDSISPQLFIIAAEYLSRGLNALYDQYPSL 1547 Query: 4093 HHPKCNSTDTTHLAFADDLLLFGRGDPESMRVLRDALDEFTATSGLTVNKSKS-HIFLGG 4269 H+ S +HLAFADD+L+F G +++ + L E+ SG +N KS + Sbjct: 1548 HYSSGVSISVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTN 1607 Query: 4270 VRPFEKQEILDLFGFPEGTLPVKYLGLPLASKSLTTNDYSPLISQISNFI 4419 V +Q I GF L + YLG PL ++ L+++I I Sbjct: 1608 VSSSRRQIIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLVAKIEERI 1657