BLASTX nr result
ID: Rauwolfia21_contig00005392
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00005392
         (4521 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   530   e-173
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   512   e-167
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       493   e-152
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   473   e-147
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   452   e-138
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           452   e-137
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   462   e-136
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   444   e-130
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   444   e-130
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   412   e-129
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   424   e-127
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   460   e-126
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   365   e-123
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              429   e-117
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   388   e-115
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   419   e-114
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   365   e-113
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694... 
416 e-113 gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] 383 e-108 ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659... 395 e-107 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 530 bits (1365), Expect(2) = e-173 Identities = 288/841 (34%), Positives = 454/841 (53%), Gaps = 10/841 (1%) Frame = -3 Query: 3043 MRIASWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGG-WNHFH 2867 M+I +WN+RGLN P+K V+H + +I + ++ ET++ +I + KFG W+ + Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQK-KFGNRWSWIN 59 Query: 2866 NFHLHNAGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRP 2687 N+ GRI + W + + + + Q I F + +YG HT+ R+ Sbjct: 60 NYACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKV 119 Query: 2686 LWDTXXXXXXXXXXXXXL-GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSI 2510 LW+ L GD+N V A +RLNG +VS ET DL L A L + + Sbjct: 120 LWEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTT 179 Query: 2509 GSFHTWTNNTV-----LCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHS 2345 G F++W N ++ ++D++ N AW + + + +G +SDHSP I L Sbjct: 180 GLFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIFNLATQH 238 Query: 2344 RPTKKNFKFFNMWCDHADFEQLISEHWEEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSH 2165 + FKF N D F +++ E W H K + +L+ +K LK+ + K FS Sbjct: 239 DEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSK 298 Query: 2164 ISSRAEKARNDFD--QALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKH 1991 + E+ R QAL E + LQ + DL + R S + S Q+++ + Sbjct: 299 AHCQVEELRRKLAAVQALPEVSQV---SELQEEEKDLIAQLRKWSTIDESILKQKSRIQW 355 Query: 1990 LTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGD-C 1814 L+ D +KFF + +K RN I + G T + E+ E +FY LLGT Sbjct: 356 LSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQL 415 Query: 1813 QPINLEICQDGPLITQNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNI 1634 + I+L + + G ++ L++PI+I EI AL I D K+PG DG+ + F+KK+W + Sbjct: 416 EAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLV 475 Query: 1633 
VGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILA 1454 + + + I++FF +G + + IN T + L+PK D A D+RPIACC+ YK+I+KIL Sbjct: 476 IKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILT 535 Query: 1453 DRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTI 1274 RL + ++D AQ+ F+P R + DNI + EL++ YNR+ +SPRC++K+D++KAYD++ Sbjct: 536 KRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKAYDSV 595 Query: 1273 CWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLF 1094 W FL+ ML L FP F+ WIM CV T SYS+ +NG F Q+GLRQGDPLSPFLF Sbjct: 596 EWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLF 655 Query: 1093 VICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTD 914 + +EY SR + K+ F++HPKC +K++H ASS+ +++ Sbjct: 656 ALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFNS 715 Query: 913 FGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYG 734 F SGL+A+ KS I+ G+ E + + + +PIG +PFRYLG+P+ ++KL Q Sbjct: 716 FSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCK 775 Query: 733 LLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFL 554 LIDKI + + W A LS+AG+L+L++ +L + W + P+P ++ + + CR FL Sbjct: 776 PLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFL 835 Query: 553 W 551 W Sbjct: 836 W 836 Score = 108 bits (269), Expect(2) = e-173 Identities = 62/182 (34%), Positives = 94/182 (51%), Gaps = 3/182 (1%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIW 360 APVAW L PKS GGL + + WN + + K+LW I K+D LW+RW++ Y+ +I Sbjct: 846 APVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYYIKRQNIE 905 Query: 359 DVQNKKDHSPLMKRILQIRNQLLQMEGSCTAAITRIESWMRLGNFSSSLAYEWLRPKGTK 180 +V + S ++++I + R +LL G A + NFS Y+ L+ Sbjct: 906 NVTVSSNTSWILRKIFESR-ELLTRTGGWEAVSNHM-------NFSIKKTYKLLQEDYEN 957 Query: 179 TIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLSFQD---NTECCLCNNATESHRHLFF 9 +W + I PK F LWLA +RL T +R+S + + C +C N E+ +HLFF Sbjct: 958 VVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGNEIETIQHLFF 1017 Query: 8 QC 3 C Sbjct: 1018 NC 1019 
>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 512 bits (1318), Expect(2) = e-167 Identities = 281/841 (33%), Positives = 443/841 (52%), Gaps = 10/841 (1%) Frame = -3 Query: 3043 MRIASWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHN 2864 M SWN+RG+N P K +++ + H+I V A+LET++ + S++ W +N Sbjct: 1 MLCVSWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNN 60 Query: 2863 FHLHNAGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFI--YGFHTVVSRR 2690 + RI I W P+ + Q + +C + S + + YG HT+ R+ Sbjct: 61 YSHSARERIWIGWRPAWVNVTLTHTQEQLM----VCDIQDQSHKLKMVAVYGLHTIADRK 116 Query: 2689 PLWDTXXXXXXXXXXXXXLGDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSI 2510 LW +GDFN V +++RL GT V+ ET D Q L + L + S Sbjct: 117 SLWSGLLQCVQQQDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRST 176 Query: 2509 GSFHTWTNNT-----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHS 2345 S+++W+N++ VL ++D+A N W +LP G +SDHSP + L Sbjct: 177 WSYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPG-ISDHSPLLFNLMTGR 235 Query: 2344 RPTKKNFKFFNMWCDHADFEQLISEHWEEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSH 2165 K FKF N+ + +F + + + W K + LK +K LK + + Sbjct: 236 PQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAVKRELKQMKTQKIGL 295 Query: 2164 ISSRAEKARNDFD--QALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKH 1991 + + R+ Q+ ++F N +Q + R S E S Q+++ Sbjct: 296 AHEKVKNLRHQLQDLQSQDDFD---HNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITW 352 Query: 1990 LTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDC- 1814 L D +K F + VK N I + DG V + EV +E L+FY LLGT Sbjct: 353 LQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTL 412 Query: 1813 QPINLEICQDGPLITQNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNI 1634 ++L + G ++ L+R ++ EI AL IG+DK+PG DG+ A F+KK+W Sbjct: 413 MGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGS 472 Query: 1633 VGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILA 1454 + + I EFF + + R IN V+ L+PK HA+ V +FRPIACC V YK+I+K+L Sbjct: 473 IKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLT 532 Query: 1453 
DRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTI 1274 +R+ +G ++++AQS F+PGR + DNI + EL++ Y RK +SPRCI+K+D++KAYD++ Sbjct: 533 NRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSV 592 Query: 1273 CWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLF 1094 W FL+ +L FP +FV WIMECVST SYS+ +NG F+ ++GLRQGDP+SPFLF Sbjct: 593 EWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLF 652 Query: 1093 VICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTD 914 +C+EY SR L + F++HPKC L I+H SS+ + Sbjct: 653 ALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQK 712 Query: 913 FGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYG 734 F + SGL A+ KS+I+ G+ + + + ++ +G +PFRYLG+P+ ++KL Q Sbjct: 713 FSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCK 772 Query: 733 LLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFL 554 L++ I + +TW AK LS+AG+L+LI+++L + W + P+ V+ + +CR FL Sbjct: 773 PLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFL 832 Query: 553 W 551 W Sbjct: 833 W 833 Score = 106 bits (265), Expect(2) = e-167 Identities = 61/185 (32%), Positives = 89/185 (48%), Gaps = 6/185 (3%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIW 360 APVAW + PKS GG + +K WN + + K+LW I K+D LW+RWI Y+ I Sbjct: 843 APVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDIL 902 Query: 359 DVQNKKDHSPLMKRILQIRNQLLQMEGSCTAAITRIESWMRL---GNFSSSLAYEWLRPK 189 V + ++++I++ R+ L + I W + FS AY+ + Sbjct: 903 TVNISNQTTWILRKIVKARDHL-----------SNIGDWDEICIGDKFSMKKAYKKISEN 951 Query: 188 GTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLS---FQDNTECCLCNNATESHRH 18 G + W + I Y PK F LW+ RL T DR+S Q + LC N E+ +H Sbjct: 952 GERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQH 1011 Query: 17 LFFQC 3 LFF C Sbjct: 1012 LFFSC 1016 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 493 bits (1270), Expect(2) = e-152 Identities = 295/852 (34%), Positives = 459/852 (53%), Gaps = 17/852 (1%) Frame = -3 
Query: 3028 WNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLHN 2849 WNIRG N ++G + +K ++ V+ET + K + + GW+ N+ + Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67 Query: 2848 AGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLW---- 2681 G+I ++WDPS ++ + Q I + + + +Y + V SR+ LW Sbjct: 68 LGKIWVMWDPSVQVVV-VAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIV 126 Query: 2680 DTXXXXXXXXXXXXXLGDFNCVMKASERLNGTEVS-SYETRDLLQCCLSAGLSDLNSIGS 2504 + LGDFN V+ E N ++ RD C L+A LSDL G+ Sbjct: 127 NMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGN 186 Query: 2503 FHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPT 2336 TW N + V K+DR + N++W + + + F S SDH C V L + S Sbjct: 187 TFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIF-GSLDFSDHVSCGVVLEETSIKA 245 Query: 2335 KKNFKFFNMWCDHADFEQLISEHWEE-PIHGTKQFTLCKKLKRLKGPLKALNKKHFSHIS 2159 K+ FKFFN + DF L+ ++W + G+ F + KKLK LK P+K ++ ++S + Sbjct: 246 KRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSELE 305 Query: 2158 SRAEKARNDFDQALEEFHLQ---PANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHL 1988 R ++A +DF ++ L P N + +L+ + K L+ AE SF+ Q+++ Sbjct: 306 KRTKEA-HDFLIGCQDRTLADPTPINASFELEA---ERKWHILTAAEESFFRQKSRISWF 361 Query: 1987 TYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQP 1808 D TK+FH + N I+A+ +G + S ++ ++ +LLG E D P Sbjct: 362 AEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDEVD--P 419 Query: 1807 INLEICQDGPLITQN----QSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAW 1640 +E L++ Q +L S ++I++ALFS+ +KS GPDG+TA+F+ +W Sbjct: 420 YLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSW 479 Query: 1639 NIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKI 1460 +IVG + + AI EFF+SG LL+ N T I L+PK + + DFRPI+C N YKVIA++ Sbjct: 480 SIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIARL 539 Query: 1459 LADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYD 1280 L DRL L +I AQSAF+PGRS+ +N+ + +L+ YN ISPR +LK+DLKKA+D Sbjct: 540 LTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVDLKKAFD 599 Query: 1279 
TICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPF 1100 ++ W+F+ L L P KF++WI +C+STP++++ ING GFFK +GLRQGDPLSP+ Sbjct: 600 SVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPY 659 Query: 1099 LFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTL 920 LFV+ +E FS L+ ++ YHPK L ISH + S+ + TL Sbjct: 660 LFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETL 719 Query: 919 TDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQ 740 DF + SGL+ N KS ++ AG+ E A PIG +P RYLG+P++ KL++ + Sbjct: 720 DDFASWSGLKVNKDKSHLYLAGLNQLESN-ANAAYGFPIGTLPIRYLGLPLMNRKLRIAE 778 Query: 739 YGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRS 560 Y L++KI + ++W K LS AG+++LI +V+ G+ W S +P + +I S+C Sbjct: 779 YEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSR 838 Query: 559 FLWGVNKHQLLG 524 FLW N Q G Sbjct: 839 FLWSGNIEQAKG 850 Score = 75.5 bits (184), Expect(2) = e-152 Identities = 42/105 (40%), Positives = 58/105 (55%), Gaps = 4/105 (3%) Frame = -2 Query: 533 VAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIWDV 354 V+W LCLPKSEGGLGLR L WN +L +++W + KDSLW W +L S W V Sbjct: 853 VSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAV 912 Query: 353 QNKKDHSPLMKRILQIR---NQLLQME-GSCTAAITRIESWMRLG 231 + + S KR+L +R +Q L + G+ A ++W LG Sbjct: 913 EGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLG 957 Score = 63.5 bits (153), Expect = 8e-07 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 3/78 (3%) Frame = -2 Query: 227 FSSSLAYEWLRPKGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLSFQDNTE--- 57 FS++ +E +RPK T W IW + PKY+FN+W++ +RL TR RL+ + + Sbjct: 1033 FSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDA 1092 Query: 56 CCLCNNATESHRHLFFQC 3 C LC+ A+ES HL C Sbjct: 1093 CVLCSFASESRDHLLLIC 1110 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 473 bits (1216), Expect(2) = e-147 Identities = 283/847 (33%), Positives = 432/847 (51%), Gaps = 19/847 (2%) Frame = -3 Query: 3031 
SWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLH 2852 SWN+RG N +++ R K + ++LET++ + + R + + F GW N+ Sbjct: 6 SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFA 65 Query: 2851 NAGRILIIWDPSTTILEPIILDAQFILARAICKVTALS--FHICFIYGFHTVVSRRPLWD 2678 GRI ++WDP+ +E +L K+ +S F + F+Y + RR LW Sbjct: 66 ALGRIWVVWDPA---VEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWS 122 Query: 2677 TXXXXXXXXXXXXXL----GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSI 2510 GDFN + + G + + +C L++ +SDL Sbjct: 123 ELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFR 182 Query: 2509 GSFHTW----TNNTVLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSR 2342 G+ +TW NN + K+DR + N++W ++ +F SDH P V + S Sbjct: 183 GNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAME-FSDHCPSCVNISNQSG 241 Query: 2341 PTKKNFKFFNMWCDHADFEQLISEHWEEPIH-GTKQFTLCKKLKRLKGPLKALNKKHFSH 2165 K FK N H +F + I W+ + G+ FTL KK K LKG ++ N++H+S Sbjct: 242 GRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSG 301 Query: 2164 ISSRAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSE---AERSFYFQQAKCK 1994 + R +A + P++ L+ K RS +E AE F Q+++ Sbjct: 302 LEKRVVQAAQNLKTCQNNLLAAPSSYLAGLE----KEAHRSWAELALAEERFLCQKSRVL 357 Query: 1993 HLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDC 1814 L D T FFH ++ N I + G ++ E+ +DF+ L G+ Sbjct: 358 WLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHL 417 Query: 1813 QPINLEICQDGPLITQ----NQSRDLLRP-ISIDEIKSALFSIGDDKSPGPDGYTAQFYK 1649 I+ E +T+ +R LL +S +IKS F++ +KSPGPDGYT++F+K Sbjct: 418 --ISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFK 475 Query: 1648 KAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVI 1469 K W+IVG A+ EFF SG LL N T + +VPK +A + +FRPI+CCN YKVI Sbjct: 476 KTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVI 535 Query: 1468 AKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKK 1289 +K+LA RL L I +QSAFV GR + +N+ + EL++ + + IS R +LK+DL+K Sbjct: 536 SKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSRGVLKVDLRK 595 Query: 1288 
AYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPL 1109 A+D++ W F+ + L N PP+FV+WI +C+++ S+S+ ++G + G+FKG +GLRQGDPL Sbjct: 596 AFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPL 655 Query: 1108 SPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILL 929 SP LFVI +E SR L + YHPK ++IS ASS+ + Sbjct: 656 SPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIK 715 Query: 928 STLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLK 749 S L F N SGL N KS+++TAG++ +K+ L A G PFRYLG+P++ KL+ Sbjct: 716 SVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYLGLPLLHRKLR 774 Query: 748 VCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISI 569 Y LIDKI + W K LS AG+L+LI +V+ T W S +P + I + Sbjct: 775 RSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQM 834 Query: 568 CRSFLWG 548 C FLWG Sbjct: 835 CNRFLWG 841 Score = 79.3 bits (194), Expect(2) = e-147 Identities = 66/259 (25%), Positives = 99/259 (38%), Gaps = 82/259 (31%) Frame = -2 Query: 533 VAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIWDV 354 V+W++ CLPK+EGGLGLR TWN +L +++W + A++DSLW+ W L V+ W+ Sbjct: 852 VSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNA 911 Query: 353 QNKKDHSPLMKRILQIR-------------NQLL----------------------QMEG 279 + HS + K IL +R QLL Q+ G Sbjct: 912 EAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTG 971 Query: 278 SCTAAITRIES----WM---------RLGNFSSSL-------------AYEWLRPKGTKT 177 +A+ S W+ L N S+L Y W + T Sbjct: 972 IHESAVVTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSST 1031 Query: 176 IWIKQIWKEYIPPKYSFNLWLAA------------------KSRLQTRDRLSFQDNTE-- 57 + ++ E + + + LW AA +RL R R + Sbjct: 1032 SFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPS 1091 Query: 56 -CCLCNNATESHRHLFFQC 3 CC+C TE+ HLF C Sbjct: 1092 LCCVCQRETETRDHLFIHC 1110 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 452 bits (1162), Expect(2) = e-138 Identities = 256/709 (36%), Positives = 398/709 (56%), Gaps = 
15/709 (2%) Frame = -3 Query: 2632 GDFNCVMKASERLNGTEVS-SYETRDLLQCCLSAGLSDLNSIGSFHTWTNNT----VLCK 2468 GDFN V+ E N ++ RD C LSDL G+ TW N + + K Sbjct: 3 GDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIAKK 62 Query: 2467 LDRAMANEAW---FSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDH 2297 LDR +AN++W + S+H N SDH C V L + K+ FKFFN + Sbjct: 63 LDRILANDSWCNLYPSSHGLFGNL----DFSDHVSCGVVLEANGISAKRPFKFFNFLLKN 118 Query: 2296 ADFEQLISEHW-EEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFD-- 2126 DF ++ ++W + G+ + + KKLK +K P+K ++ ++S I R ++A Sbjct: 119 EDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITC 178 Query: 2125 QALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLV 1946 Q L + +N AL+L+ + K LS AE SF+ Q+++ D T +FH +V Sbjct: 179 QNLTLANPSVSNAALELEA---QRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235 Query: 1945 KRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEICQDGPLIT- 1769 N I ++ +G + S ++ + +Y+ LLG+ P ++E L+T Sbjct: 236 DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIES--PFSMEQEDMNLLLTY 293 Query: 1768 ---QNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEF 1598 Q+Q +L + + DEIK+A S+ +K+ GPDGY+ +F++ W+I+G + AI EF Sbjct: 294 RCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEF 353 Query: 1597 FTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLID 1418 F SG LL+ N T + L+PK+ +A T+ +FRPI+C N YKVI+K+L RL L +I Sbjct: 354 FDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIG 413 Query: 1417 KAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGL 1238 +QSAF+PGRS+ +N+ + E++ YNR ISPR +LK+DLKKA+D++ W+F+ L L Sbjct: 414 HSQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRAL 473 Query: 1237 NFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLN 1058 P ++++WI +C++TPS+++ +NG GFF+ +GLRQGDPLSP+LFV+ +E FS+ L Sbjct: 474 AIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLY 533 Query: 1057 RAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANAL 878 + + YHPK G L ISH +SS+ + TL DF + SGL+ N Sbjct: 534 
SRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKD 593 Query: 877 KSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKT 698 KS +F AG+ E+ + A P G P RYLG+P++ KL++ YG L++K+ + L++ Sbjct: 594 KSQLFQAGLDLSER-ITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652 Query: 697 WTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLW 551 W +K LS AG+ +LI +V+ G W S +P + KI S+C FLW Sbjct: 653 WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701 Score = 69.7 bits (169), Expect(2) = e-138 Identities = 64/263 (24%), Positives = 97/263 (36%), Gaps = 80/263 (30%) Frame = -2 Query: 551 GR**APVAWKDLCLPKSEGGLGL--------------------RELKTW-----NNSLLA 447 GR + V+W D CLPKSEGGLG R+ W ++ L Sbjct: 707 GRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGH 766 Query: 446 KVLWNINAKKDSLW-------IRWIDQVYLHG-------VSIW---------------DV 354 W +NA + W +R + + ++ VS W DV Sbjct: 767 ASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDV 826 Query: 353 QNKKDHSPLMKRILQ-------------------IRNQLLQMEGSCTAAITRIESW---- 243 ++ P ++ I + L + ++ SW Sbjct: 827 GSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDD 886 Query: 242 MRLGNFSSSLAYEWLRPKGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRL---SF 72 + FS++ +E LRP+ W K +W + PK++FN W A +RL TR RL Sbjct: 887 VDCQGFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL 946 Query: 71 QDNTECCLCNNATESHRHLFFQC 3 + ECCLC+ TE+ HL C Sbjct: 947 VSSAECCLCSFDTETRDHLLLLC 969 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 452 bits (1162), Expect(2) = e-137 Identities = 256/709 (36%), Positives = 398/709 (56%), Gaps = 15/709 (2%) Frame = -3 Query: 2632 GDFNCVMKASERLNGTEVS-SYETRDLLQCCLSAGLSDLNSIGSFHTWTNNT----VLCK 2468 GDFN V+ E N ++ RD C LSDL G+ TW N + + K Sbjct: 3 GDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIAKK 62 Query: 2467 LDRAMANEAW---FSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDH 2297 LDR +AN++W + S+H N SDH C V L + K+ FKFFN + Sbjct: 63 LDRILANDSWCNLYPSSHGLFGNL----DFSDHVSCGVVLEANGISAKRPFKFFNFLLKN 118 Query: 2296 
ADFEQLISEHW-EEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFD-- 2126 DF ++ ++W + G+ + + KKLK +K P+K ++ ++S I R ++A Sbjct: 119 EDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITC 178 Query: 2125 QALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLV 1946 Q L + +N AL+L+ + K LS AE SF+ Q+++ D T +FH +V Sbjct: 179 QNLTLANPSVSNAALELEA---QRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235 Query: 1945 KRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEICQDGPLIT- 1769 N I ++ +G + S ++ + +Y+ LLG+ P ++E L+T Sbjct: 236 DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIES--PFSMEQEDMNLLLTY 293 Query: 1768 ---QNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEF 1598 Q+Q +L + + DEIK+A S+ +K+ GPDGY+ +F++ W+I+G + AI EF Sbjct: 294 RCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEF 353 Query: 1597 FTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLID 1418 F SG LL+ N T + L+PK+ +A T+ +FRPI+C N YKVI+K+L RL L +I Sbjct: 354 FDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIG 413 Query: 1417 KAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGL 1238 +QSAF+PGRS+ +N+ + E++ YNR ISPR +LK+DLKKA+D++ W+F+ L L Sbjct: 414 HSQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRAL 473 Query: 1237 NFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLN 1058 P ++++WI +C++TPS+++ +NG GFF+ +GLRQGDPLSP+LFV+ +E FS+ L Sbjct: 474 AIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLY 533 Query: 1057 RAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANAL 878 + + YHPK G L ISH +SS+ + TL DF + SGL+ N Sbjct: 534 SRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKD 593 Query: 877 KSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKT 698 KS +F AG+ E+ + A P G P RYLG+P++ KL++ YG L++K+ + L++ Sbjct: 594 KSQLFQAGLDLSER-ITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652 Query: 697 WTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLW 551 W +K LS AG+ +LI +V+ G W S +P + KI S+C FLW Sbjct: 653 WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701 Score = 68.6 bits 
(166), Expect(2) = e-137 Identities = 63/263 (23%), Positives = 97/263 (36%), Gaps = 80/263 (30%) Frame = -2 Query: 551 GR**APVAWKDLCLPKSEGGLGL--------------------RELKTW-----NNSLLA 447 GR + V+W D CLPKSEGGLG R+ W ++ L Sbjct: 707 GRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGH 766 Query: 446 KVLWNINAKKDSLW-------IRWIDQVYLHG-------VSIW---------------DV 354 W +NA + W +R + + ++ VS W DV Sbjct: 767 ASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDV 826 Query: 353 QNKKDHSPLMKRILQ-------------------IRNQLLQMEGSCTAAITRIESW---- 243 ++ P ++ I + L + ++ SW Sbjct: 827 GSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDD 886 Query: 242 MRLGNFSSSLAYEWLRPKGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRL---SF 72 + FS++ +E LRP+ W + +W + PK++FN W A +RL TR RL Sbjct: 887 VDCQGFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL 946 Query: 71 QDNTECCLCNNATESHRHLFFQC 3 + ECCLC+ TE+ HL C Sbjct: 947 VSSAECCLCSFDTETRDHLLLLC 969 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 462 bits (1189), Expect(2) = e-136 Identities = 284/847 (33%), Positives = 449/847 (53%), Gaps = 16/847 (1%) Frame = -3 Query: 3043 MRIASWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHN 2864 M++ WNIRGLN +Q VR I + + V LET + + ++ + GW N Sbjct: 1 MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSN 60 Query: 2863 FHLHNAGRILIIWDPSTTILEPIILDAQFILARAICKVTAL--SFHICFIYGFHTVVSRR 2690 + GRI I+WDPS ++L + I+ +I K+ +L SF + F+YG ++ + RR Sbjct: 61 YCCSELGRIWIVWDPSISVL--VFKRTDQIMFCSI-KIPSLLQSFAVAFVYGRNSELDRR 117 Query: 2689 PLWDTXXXXXXXXXXXXXL----GDFNCVMKASERLN-GTEVSSYETRDLLQCCL-SAGL 2528 LW+ GDFN + ASE + + + + LQCCL + L Sbjct: 118 SLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQL 177 Query: 2527 SDLNSIGSFHTWTN----NTVLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVA 2360 SDL S G F TW+N N +L KLDRA+AN WF+ + +A F P G SDH+PCI+ Sbjct: 178 SDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIIL 236 Query: 2359 
LFQHSRPTKKNFKFFNMWCDHADFEQLISEHWE-EPIHGTKQFTLCKKLKRLKGPLKALN 2183 + P+KK+FK+F+ H + +S WE + G+ F+L + LK K + LN Sbjct: 237 IDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTLN 296 Query: 2182 KKHFSHISSRAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQA 2003 + FS+I R ++ + E P++T + + K + + A SF+ Q++ Sbjct: 297 RLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVARK-QWIFFAAALESFFRQKS 355 Query: 2002 KCKHLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTE 1823 + + L D T+FFH V + N I + DG + ++ + +Y +LLG Sbjct: 356 RIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIP 415 Query: 1822 GD-CQPINLEICQDG-PLITQNQSRDLLRPI-SIDEIKSALFSIGDDKSPGPDGYTAQFY 1652 + P ++E + P + L I S +EI LFS+ +K+PGPDG+ +F+ Sbjct: 416 SENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFF 475 Query: 1651 KKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKV 1472 +AW IV AI EFF SG+L R N T I L+PK A + FRP+ACC YKV Sbjct: 476 IEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKV 535 Query: 1471 IAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLK 1292 I +I++ RL + + + Q F+ GR + +N+ + EL+ ++ + R L++D+ Sbjct: 536 ITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCLQVDIS 595 Query: 1291 KAYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDP 1112 KAYD + W+FL ++L L+ P F+HWI C+S+ SYS+ NGE+ GFF+G++G+RQGDP Sbjct: 596 KAYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDP 655 Query: 1111 LSPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGIL 932 +S LFV+ ++ S+SL+ A N F+ HP C I+H ASS+ + Sbjct: 656 MSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGI 715 Query: 931 LSTLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKL 752 L+ L DF SGL N K+ + G + + + I G +P RYLG+P++++K+ Sbjct: 716 LTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKM 775 Query: 751 KVCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIIS 572 + Y L+D+I+S +WTA++LS AG+L+L+++V+ T W SV P + K+ Sbjct: 776 RRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASVFIFPNQCLQKLEQ 835 Query: 571 ICRSFLW 
551 +C +FLW Sbjct: 836 MCNAFLW 842 Score = 55.5 bits (132), Expect(2) = e-136 Identities = 20/49 (40%), Positives = 30/49 (61%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWI 393 A ++W +C PK GGLGL+ L +WN L K++W + SLW+ W+ Sbjct: 852 AKISWNIVCSPKEAGGLGLKRLSSWNRILALKLIWLLFTSAGSLWVSWV 900 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 444 bits (1142), Expect(2) = e-130 Identities = 276/849 (32%), Positives = 431/849 (50%), Gaps = 19/849 (2%) Frame = -3 Query: 3040 RIASWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNF 2861 ++ WN+RG N+ + G + ++ ++ET + K + + N GW+ N+ Sbjct: 4 KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63 Query: 2860 HLHNAGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLW 2681 G+I ++WDPS ++ I Q I + + F + +Y + +R+ LW Sbjct: 64 EFSVLGKIWVLWDPSVKVVV-IGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122 Query: 2680 DTXXXXXXXXXXXXXL----GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNS 2513 + GDFN ++ +N + R C L + L DL Sbjct: 123 NELVQLALSPVVVGRSWIVLGDFNQILNPESAINAN--IGRKIRAFRSCLLDSDLYDLVY 180 Query: 2512 IGSFHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHS 2345 GS +TW N + K+DR + N+ W + + ANF SDHS C V L Sbjct: 181 KGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPD-FSDHSSCEVVLDPAV 239 Query: 2344 RPTKKNFKFFNMWCDHADFEQLISEHWEE-PIHGTKQFTLCKKLKRLKGPLKALNKKHFS 2168 K+ F+FFN + + DF QLI E+W + G+ + + KKLK LK P+ +++++S Sbjct: 240 LKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYS 299 Query: 2167 HISSRAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHL 1988 I R +A P+ L++ + K + L++AE SF+ Q++ L Sbjct: 300 DIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATR-KWQILAKAEESFFCQKSSISWL 358 Query: 1987 TYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSC---EVVKEF-LDFYDNLL-GTE 1823 D T +FH + N I + G + E +KE +F+++LL G E Sbjct: 359 YEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVE 418 Query: 1822 GDCQPINLEICQDGPLITQ-----NQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQ 1658 G+ N D L+ +Q DL R S 
+I+ A FS+ +K+ GPDGY+++ Sbjct: 419 GE----NSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSE 474 Query: 1657 FYKKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTY 1478 F+K W +VG + ++A+ EFF SG LL+ N T + L+PK ++S + DFRPI+C N Y Sbjct: 475 FFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLY 534 Query: 1477 KVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKID 1298 KVIAK+L RL L +I +QSAF+PGR + +N+ + E++ YN K IS R +LK+D Sbjct: 535 KVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKVD 594 Query: 1297 LKKAYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQG 1118 L+KA+D++ WDF+ L P KFV WI +C+STP +S+ +NG GFFK +GLRQG Sbjct: 595 LRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQG 654 Query: 1117 DPLSPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVG 938 DPLSP+LFV+ +E FS L + YHPK L ISH +SS+ Sbjct: 655 DPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSSSLH 714 Query: 937 ILLSTLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAE 758 + L DF + SGL N K++++ AG E + + PI +P RYLG+P+++ Sbjct: 715 GISEALDDFASWSGLHVNKDKTNLYLAGTDEVE-ALAISHYGFPISTLPIRYLGLPLMSR 773 Query: 757 KLKVCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKI 578 KLK+ +Y L+ ++W K+LS AG+++LI +V+ G W S + + KI Sbjct: 774 KLKISEYELV-----KRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKKI 828 Query: 577 ISICRSFLW 551 S+C FLW Sbjct: 829 ESLCSRFLW 837 Score = 53.5 bits (127), Expect(2) = e-130 Identities = 20/45 (44%), Positives = 28/45 (62%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLW 405 A +AW +CLPK+EGG+GLR WN + + +W + A D LW Sbjct: 847 AKIAWSGVCLPKNEGGVGLRRFTPWNKTFYLRFIWPLFADNDVLW 891 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 444 bits (1143), Expect(2) = e-130 Identities = 276/849 (32%), Positives = 431/849 (50%), Gaps = 19/849 (2%) Frame = -3 Query: 3040 RIASWNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNF 2861 ++ WN+RG N+ + G + ++ ++ET + K + + N GW+ N+ Sbjct: 
4 KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63 Query: 2860 HLHNAGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLW 2681 G+I ++WDPS ++ I Q I + + F + +Y + +R+ LW Sbjct: 64 EFSVLGKIWVLWDPSVKVVV-IGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122 Query: 2680 DTXXXXXXXXXXXXXL----GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNS 2513 + GDFN ++ +N + R C L + L DL Sbjct: 123 NELVQLALSPVVVGRSWIVLGDFNQILNPESAINAN--IGRKIRAFRSCLLDSDLYDLVY 180 Query: 2512 IGSFHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHS 2345 GS +TW N + K+DR + N+ W + + ANF SDHS C V L Sbjct: 181 KGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPD-FSDHSSCEVVLDPAV 239 Query: 2344 RPTKKNFKFFNMWCDHADFEQLISEHWEE-PIHGTKQFTLCKKLKRLKGPLKALNKKHFS 2168 K+ F+FFN + + DF QLI E+W + G+ + + KKLK LK P+ +++++S Sbjct: 240 LKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYS 299 Query: 2167 HISSRAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHL 1988 I R +A P+ L++ + K + L++AE SF+ Q++ L Sbjct: 300 DIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATR-KWQILAKAEESFFCQKSSISWL 358 Query: 1987 TYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSC---EVVKEF-LDFYDNLL-GTE 1823 D T +FH + N I + G + E +KE +F+++LL G E Sbjct: 359 YEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVE 418 Query: 1822 GDCQPINLEICQDGPLITQ-----NQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQ 1658 G+ N D L+ +Q DL R S +I+ A FS+ +K+ GPDGY+++ Sbjct: 419 GE----NSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSE 474 Query: 1657 FYKKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTY 1478 F+K W +VG + ++A+ EFF SG LL+ N T + L+PK ++S + DFRPI+C N Y Sbjct: 475 FFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLY 534 Query: 1477 KVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKID 1298 KVIAK+L RL L +I +QSAF+PGR + +N+ + E++ YN K IS R +LK+D Sbjct: 535 KVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKVD 594 Query: 1297 LKKAYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQG 1118 L+KA+D++ WDF+ L P KFV WI +C+STP +S+ +NG GFFK +GLRQG Sbjct: 595 
LRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQG 654 Query: 1117 DPLSPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVG 938 DPLSP+LFV+ +E FS L + YHPK L ISH +SS+ Sbjct: 655 DPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSSSLH 714 Query: 937 ILLSTLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAE 758 + L DF + SGL N K++++ AG E + + PI +P RYLG+P+++ Sbjct: 715 GISEALDDFASWSGLHVNKDKTNLYLAGTDEVE-ALAISHYGFPISTLPIRYLGLPLMSR 773 Query: 757 KLKVCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKI 578 KLK+ +Y L+ ++W K+LS AG+++LI +V+ G W S + + KI Sbjct: 774 KLKISEYELV-----KRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKKI 828 Query: 577 ISICRSFLW 551 S+C FLW Sbjct: 829 ESLCSRFLW 837 Score = 51.2 bits (121), Expect(2) = e-130 Identities = 19/45 (42%), Positives = 27/45 (60%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLW 405 A +AW +CLPK+EGG+ LR WN + + +W + A D LW Sbjct: 847 AKIAWSGVCLPKNEGGVALRRFTPWNKTFYLRFIWPLFADNDVLW 891 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 412 bits (1058), Expect(2) = e-129 Identities = 250/751 (33%), Positives = 395/751 (52%), Gaps = 21/751 (2%) Frame = -3 Query: 2731 ICFIYGFHTVVSRRPLWDTXXXXXXXXXXXXXL----GDFNCVMKASERLNGTEVSSYET 2564 + +Y + ++R+ LW+ GDFN V+ +E T ++ Sbjct: 55 VSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVNRR 114 Query: 2563 RDLLQCCL-SAGLSDLNSIGSFHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLP 2399 + + CL A L DL G+ TW N + V KLDR + NE+W S + A F Sbjct: 115 MKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYAVFGE 174 Query: 2398 SGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQLISEHWEE-PIHGTKQFTLCK 2222 SDH+ C V + K+ F+F+N + DF L+ E W + G+ F + K Sbjct: 175 PD-FSDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSK 233 Query: 2221 KLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHLQPA--NTALQLQIADLKLKA 2048 KLK LK P++ + ++FS++ R ++A N + P N AL+++ + K Sbjct: 234 KLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAALEMEA---QRKW 290 Query: 2047 
RSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEV 1868 L +AE SF+ Q+++ + D T +FH + N I I +G + + Sbjct: 291 LILVKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGI 350 Query: 1867 VKEFLDFYDNLLGTEGDCQPINLEICQDGPLI-----TQNQSRDLLRPISIDEIKSALFS 1703 + ++++ NLLG G+ P L I +D L+ + +Q ++L S +IKSA FS Sbjct: 351 KEHCIEYFSNLLG--GEVGPPML-IQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFS 407 Query: 1702 IGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHAS 1523 +K+ GPDG+ +F+K+ W+++G + + A+ EFFTS LL+ N T + L+PK +AS Sbjct: 408 FPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNAS 467 Query: 1522 TVGDFRPIACCN----VTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQE 1355 + DFRPI+C + YKVIA++L +RL L +I QSAF+PGR + +N+ + E Sbjct: 468 KMNDFRPISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATE 527 Query: 1354 LLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSL 1175 L++ YNR+ I PR +LK+DL+KA+D+I WDF+ L + P +FV+WI +C+STP++S+ Sbjct: 528 LVQGYNRQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSV 587 Query: 1174 RINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISH 995 +NG GFFK RGLRQG+PLSPFLFV+ +E FS LN + + YHPK L ISH Sbjct: 588 CVNGNTGGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISH 647 Query: 994 XXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAA 815 +SS+ + L DF SGL N K+ ++ AG+ +EA+ Sbjct: 648 LMFADDIMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAGLDR------IEAS 701 Query: 814 NIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQG 635 I A KL++ +YG L++K+ ++W+ K LS AG+++LI +V+ G Sbjct: 702 TI---------------ARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISG 746 Query: 634 TECLWFSVLPVPCAVMDKIISICRSFLWGVN 542 W S +P + +I ++C FLW N Sbjct: 747 IINFWISTFILPKGCVKRIEALCARFLWSGN 777 Score = 80.5 bits (197), Expect(2) = e-129 Identities = 62/205 (30%), Positives = 93/205 (45%), Gaps = 26/205 (12%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWI-RW--IDQVY-LHG 372 A VAW ++CLPK EGG+GLR N +L W+ KK S W W + ++ L G Sbjct: 784 
AKVAWSEVCLPKEEGGVGLRRFTVLNTTL-----WD--GKKISFWFDNWSPLGPLFKLFG 836 Query: 371 VS-----IWDVQNKKDHS----------PLMKRILQIRNQLLQMEGSCTAAITRIESWM- 240 S +Q K + P + L + L + C + W+ Sbjct: 837 SSGPRALCIPIQAKVADACSDVGWLISPPRTDQALALLIHLTTIALPCFDSSPDTFVWIV 896 Query: 239 ---RLGNFSSSLAYEWLRPKGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLS-- 75 FS++ +E +RPK W K +W + PK++FN+W++ +RL TR RL+ Sbjct: 897 DDFTCHGFSAARTWEAMRPKKPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAW 956 Query: 74 -FQDNTECCLCNNATESHRHLFFQC 3 T+CCLC++ ES HL C Sbjct: 957 GVTTTTDCCLCSSRPESRDHLLLYC 981 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 424 bits (1089), Expect(2) = e-127 Identities = 254/752 (33%), Positives = 397/752 (52%), Gaps = 25/752 (3%) Frame = -3 Query: 2731 ICFIYGFHTVVSRRPLW----DTXXXXXXXXXXXXXLGDFNCVMKASERLNGTEVS-SYE 2567 + F+Y V+R+ LW D LGDFN ++ SE + Sbjct: 3 LSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRP 62 Query: 2566 TRDLLQCCLSAGLSDLNSIGSFHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLP 2399 TR + L A L+DL+ G+ TW N V KLDR + N+ W ++ + + F Sbjct: 63 TRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLFGE 122 Query: 2398 SGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQLISEHW-EEPIHGTKQFTLCK 2222 SDHS C ++L S +KK F+F N +F LI W + G+ + + Sbjct: 123 PD-FSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSV 181 Query: 2221 KLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHLQP--ANTALQLQIADLKLKA 2048 KLK LK ++ ++ ++S I R ++A + A P +N A++ A+ + K Sbjct: 182 KLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIE---AETQRKW 238 Query: 2047 RSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEV 1868 R L+EAE SF++Q+++ L D + +FH + NHI ++ G + Sbjct: 239 RILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNL 298 Query: 1867 VKEFLDFYDNLLGTEGDCQPINLEICQDGPLITQNQSRDLLR-------------PISID 1727 ++++ + LG+E Q PL Q +LL P S + Sbjct: 299 ENHCVEYFQSNLGSE-----------QGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSE 347 Query: 1726 
EIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIAL 1547 +IK+A FS+ +K+ GPDG++ +F+ W I+G + ++AI EFFTSG LL+ N T + L Sbjct: 348 QIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVL 407 Query: 1546 VPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIH 1367 +PK +AS++ DFRPI+C N YKVI+K+L DRL L I +QSAF+PGR L+N+ Sbjct: 408 IPKITNASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVL 467 Query: 1366 MVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTP 1187 + EL+ YN+K I+P +LK+DL+KA+D++ WDF+ L LN P KF WI+EC+ST Sbjct: 468 LATELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTA 527 Query: 1186 SYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGL 1007 S+S+ +NG G F +GLRQGDP+SP+LFV+ +E FS L + + +YHPK L Sbjct: 528 SFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQL 587 Query: 1006 KISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANALKSSIFTAGIQGREKQVV 827 +ISH +SS+ ++ +L DF SGL N K+ ++ AG+ E Sbjct: 588 EISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESD-S 646 Query: 826 LEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQA 647 + + +G +P RYLG+P+++ KL + +Y LI+KI + +W + LS AG+++L+ + Sbjct: 647 MASYGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLAS 706 Query: 646 VLQGTECLWFSVLPVPCAVMDKIISICRSFLW 551 V+ G W S +P + KI S+C FLW Sbjct: 707 VISGIVNFWISSFILPLGCIKKIESLCSRFLW 738 Score = 60.8 bits (146), Expect(2) = e-127 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 1/80 (1%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYL-HGVSI 363 A VAW +CLPK+EGG+GLR N +L +++W + + SLW+ W Q L S Sbjct: 748 AKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSF 807 Query: 362 WDVQNKKDHSPLMKRILQIR 303 W+ K S K +L++R Sbjct: 808 WNQPEKPHDSWNWKCLLRLR 827 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 460 bits (1184), Expect = e-126 Identities = 275/887 (31%), Positives = 450/887 (50%), Gaps = 25/887 (2%) Frame = -3 Query: 3028 
WNIRGLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLHN 2849 WN+RGLN K + ++ I+E+ ++ET++ + K S+++ F W+ N+ + Sbjct: 6 WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNR 65 Query: 2848 AGRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLW---- 2681 GRI ++W + L PI Q + + F F+Y + V R+ LW Sbjct: 66 RGRIWVLWRKNVR-LSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELK 124 Query: 2680 DTXXXXXXXXXXXXXLGDFNCVMKASERLNGT--EVSSYETRDLLQCCLSAGLSDLNSIG 2507 D LGDFN + +E + + RD Q L+D+ + G Sbjct: 125 DHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQG 184 Query: 2506 SFHTWTNNT----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRP 2339 TW N ++ KLDR + N+ W + + F GC SDH C ++L + Sbjct: 185 PLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAGGC-SDHLRCRISLNSEAGN 243 Query: 2338 TK---KNFKFFNMWCDHADFEQLISEHWEEP----IHGTKQFTLCKKLKRLKGPLKALNK 2180 K FKF N D DF+ ++S +W++ + + F K LK LK ++++ + Sbjct: 244 KVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSMAR 303 Query: 2179 KHFSHISSRAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAK 2000 ++S +A +A P++ A++ + A R ++ E + Q++K Sbjct: 304 DRLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDR-VAILEEKYLKQKSK 362 Query: 1999 CKHLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGT-E 1823 D+ TK FH N I I DG V T E+ E F+ L Sbjct: 363 LHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFLQLIP 422 Query: 1822 GDCQPINL-EICQDGPL-ITQNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYK 1649 D + + + E+ Q P+ + + L+RP++ +EI+ LF + DKSPGPDGYT++F+K Sbjct: 423 NDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFFK 482 Query: 1648 KAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVI 1469 W I+G +F+ A+ FFT G L + IN T++AL+PK A + D+RPI+CCNV YKVI Sbjct: 483 ATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVI 542 Query: 1468 AKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKK 1289 +KI+A+RL + L I QSAFV R +++N+ + EL+K Y++ IS RC +KID+ K Sbjct: 543 SKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISK 602 Query: 1288 
AYDTICWDFLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPL 1109 A+D++ W FL ++ L FP +F+HWI C++T S+S+++NGE+ G+F+ RGLRQG L Sbjct: 603 AFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCAL 662 Query: 1108 SPFLFVICVEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILL 929 SP+LFVIC++ S+ L++AA HF YHPKC + ++H S+ ++ Sbjct: 663 SPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERII 722 Query: 928 STLTDFGNKSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLK 749 +F SGLR + KS+++ AG+ + V + G +P RYLG+P++ ++L Sbjct: 723 KVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLS 782 Query: 748 VCQYGLLIDKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISI 569 L++++ + +WT++ LS+AG+L LI +VL W + +P + ++ + Sbjct: 783 TTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKM 842 Query: 568 CRSFLW-----GVNKHQLLGKIYACPNPREDLALES*KHGTTPSWLK 443 C +FLW NK ++ + P L L S K LK Sbjct: 843 CSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLK 889 Score = 72.0 bits (175), Expect = 2e-09 Identities = 33/80 (41%), Positives = 49/80 (61%), Gaps = 1/80 (1%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIW 360 A ++W +C PK EGGLGLR LK N+ K++W I + +SLW++W+DQ L S W Sbjct: 858 AKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFW 917 Query: 359 DV-QNKKDHSPLMKRILQIR 303 +V Q S + K++L+ R Sbjct: 918 EVKQTVSQGSWIWKKLLKYR 937 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 365 bits (938), Expect(2) = e-123 Identities = 253/827 (30%), Positives = 400/827 (48%), Gaps = 23/827 (2%) Frame = -3 Query: 2944 VLETKINDVKCSRIMRNKFGGWNHFHNFHLHNAGRILIIWDPSTTILEPIILDAQFILAR 2765 VLET++ + K I F W N+ + GRI ++W S L+ I +Q I+ Sbjct: 336 VLETRVIESKVPVIFAKVFKDWQMVSNYEFNRLGRIWVVWSSSVQ-LQVIFKSSQMIVCL 394 Query: 2764 AICKVTALSFHICFIYGFHTVVSRRPLW----DTXXXXXXXXXXXXXLGDFNCVMKASER 2597 + + F FIY + V R+ LW + GDFN +K E Sbjct: 395 
VRVEHYDVEFICSFIYASNFVEERKKLWQDLHNLQNSVAFRNKPWLLFGDFNETLKMEEH 454 Query: 2596 LNGTEVSSYET---RDLLQCCLSAGLSDLNSIGSFHTWTNNT---VLCK-LDRAMANEAW 2438 + VS T RD L D+ + G TW N ++CK LDR + N Sbjct: 455 -SSYAVSPMVTPGMRDFQIVVRYCSLEDMRTHGPLFTWGNKRNEGLICKKLDRVLLNPE- 512 Query: 2437 FSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQLISEHWEE 2258 ++S + + SG SDH L + K FKF N+ H +F + + W+ Sbjct: 513 YNSAYPHSYCIMDSGGCSDHLRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDFWKN 572 Query: 2257 PIH----GTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHLQPAN 2090 + F KKLK LK LK L++ + S ++ RA A EE Sbjct: 573 TTELFPSTSTLFRFSKKLKELKPILKDLSRNNLSDLTRRAT-------YAYEELCRCQTK 625 Query: 2089 TALQLQIADLKLKARSLSEAERSFYFQQ-AKCKHLTYSDRGTKFFHSLVKRNTKRNHIAA 1913 + L D+ + S F++ K +HL N I Sbjct: 626 SLTTLNPHDI---------VDESLAFERWEKERHLL-------------------NAIHE 657 Query: 1912 ITKMDGTVTTSSCEVVKEFLDFYDNLLGTE-GDCQPINLEICQDGPL---ITQNQSRDLL 1745 + GT + ++ E + F+ +LL ++ D I+++ + G L + ++ L+ Sbjct: 658 VMDPQGTRPPNQDDIKIEAVRFFSDLLSSQPSDFTGISVDELK-GILQYRYSLHEQNLLV 716 Query: 1744 RPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEFFTSGSLLRMIN 1565 I+ E+ FSI +KSPGPDGYT +F+++ W+++G + + AI FFT G L + +N Sbjct: 717 AEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKGLN 776 Query: 1564 HTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRS 1385 T++AL+PK +A + D+RPI+CCNV YK I+K+LA+RL L I QSAF+ R Sbjct: 777 STILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISDRL 836 Query: 1384 MLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGLNFPPKFVHWIM 1205 +++N+ + EL+K Y++ +SPRC +KIDL KA+D++ W FL + L L+ P KF+HWI Sbjct: 837 LMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWIN 896 Query: 1204 ECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLNRAAKNAHFSYH 1025 C+ST S+S+++N GLRQG LSP+LFVIC+ S L++ A F YH Sbjct: 897 LCISTASFSVQVN-----------GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYH 945 Query: 1024 PKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANALKSSIFTAGIQG 845 P+C + ++H A S+ +L+ DF SGL + KS++F A I Sbjct: 946 
PRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISS 1005 Query: 844 REKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKTWTAKNLSHAGK 665 +L G +P RYLG+P++ +++ + L++KI S + +W + LS+AG+ Sbjct: 1006 ETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGR 1065 Query: 664 LELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLWG---VNKHQ 533 L+L+ +V+ W S +P A + +I I +FLW +N H+ Sbjct: 1066 LQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHK 1112 Score = 106 bits (264), Expect(2) = e-123 Identities = 63/186 (33%), Positives = 91/186 (48%), Gaps = 7/186 (3%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVS-I 363 A VAW D+C PKSEGGLGLR L N K++W + + K SLW+ WI + V+ Sbjct: 1113 AKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIRTVAEA 1172 Query: 362 WDVQNKKDHSPLMKRILQIRNQLLQMEGSCT---AAITRIESWMRLGNFSSSLAYEWLRP 192 ++ H + ++ + L G CT ++ R F S + +R Sbjct: 1173 LSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIRE 1232 Query: 191 KGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLSFQD---NTECCLCNNATESHR 21 +G W K IW PK++F WLAA RL T D+++ + ++ C LCN + ES Sbjct: 1233 QGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRD 1292 Query: 20 HLFFQC 3 HLFF C Sbjct: 1293 HLFFSC 1298 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 429 bits (1102), Expect = e-117 Identities = 269/804 (33%), Positives = 425/804 (52%), Gaps = 16/804 (1%) Frame = -3 Query: 3016 GLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLHNAGRI 2837 GLN +Q VR I + + V LET + + ++ + GW N+ GRI Sbjct: 53 GLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSNYCCSELGRI 112 Query: 2836 LIIWDPSTTILEPIILDAQFILARAICKVTAL--SFHICFIYGFHTVVSRRPLWDTXXXX 2663 I+WDPS ++L + I+ +I K+ +L SF + F+YG ++ + RR LW+ Sbjct: 113 WIVWDPSISVL--VFKRTDQIMFCSI-KIPSLLQSFAVAFVYGRNSELDRRSLWEDILVL 169 Query: 2662 XXXXXXXXXL----GDFNCVMKASERLN-GTEVSSYETRDLLQCCL-SAGLSDLNSIGSF 2501 GDFN + ASE + + + + LQCCL + LSDL S G F Sbjct: 170 SRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVF 229 Query: 2500 
HTWTN----NTVLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTK 2333 TW+N N +L KLDRA+AN WF+ + +A F P G SDH+PCI+ + P+K Sbjct: 230 FTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSK 288 Query: 2332 KNFKFFNMWCDHADFEQLISEHWEE-PIHGTKQFTLCKKLKRLKGPLKALNKKHFSHISS 2156 K+FK+F+ H + +S WEE + G+ F+L + LK K + LN+ FS+I Sbjct: 289 KSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQ 348 Query: 2155 RAEKARNDFDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSD 1976 R ++ + E P++T + + K + + A SF+ Q+++ + L D Sbjct: 349 RTAQSLTRLEDIQVELLTSPSDTLFRREHVARK-QWIFFAAALESFFRQKSRIRWLHEGD 407 Query: 1975 RGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGD-CQPINL 1799 T+FFH V + N I + DG + ++ + +Y +LLG + P ++ Sbjct: 408 ANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSV 467 Query: 1798 EICQDG-PLITQNQSRDLLRPI-SIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGV 1625 E + P + L I S +EI LFS+ +K+PGPDG+ +F+ +AW IV Sbjct: 468 EKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKS 527 Query: 1624 QFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRL 1445 AI EFF SG+L R N T I L+PK A + FRP+ACC YKVI +I++ RL Sbjct: 528 SVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRL 587 Query: 1444 SVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWD 1265 + + + Q F+ GR + +N+ + EL+ ++ + R L++D+ KAYD + W+ Sbjct: 588 KLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWE 647 Query: 1264 FLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVIC 1085 FL ++L L+ P F+HWI C+S+ SYS+ NGE+ GFF+G++G+RQGDP+S LFV+ Sbjct: 648 FLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLV 707 Query: 1084 VEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGN 905 ++ S+SL+ A N F+ HP C I+H ASS+ +L+ L DF Sbjct: 708 MDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGILTILDDFRQ 767 Query: 904 KSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLI 725 SGL N K+ + G + + + I G +P RYLG+P++++K++ Y L+ Sbjct: 768 GSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLV 827 Query: 724 
DKIHSYLKTWTAKNLSHAGKLELI 653 D+I+S +WTA++LS AG+L+L+ Sbjct: 828 DRINSRFTSWTARHLSFAGRLQLL 851 Score = 71.2 bits (173), Expect = 4e-09 Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 6/126 (4%) Frame = -2 Query: 362 WDVQNKKDHSP---LMKRILQIRNQLLQMEGSCTAAITRIESWMRLGNFSSSLAYEWLRP 192 W + + + +P L++R+L L+ T + +I FS++ + +L+P Sbjct: 921 WRISSSRSRNPVITLLQRVLPSAASLIDCPHDDTY-LWKIGHHAPSNRFSTADTWSYLQP 979 Query: 191 KGTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRL---SFQDNTECCLCNNATESHR 21 T +W K +W + PK +F W+ A +RL TRDRL F C LCN+ ES Sbjct: 980 SSTSVLWHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESRE 1039 Query: 20 HLFFQC 3 HLFF+C Sbjct: 1040 HLFFRC 1045 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 388 bits (996), Expect(2) = e-115 Identities = 239/778 (30%), Positives = 381/778 (48%), Gaps = 13/778 (1%) Frame = -3 Query: 2845 GRILIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLWDTXXX 2666 GRI ++W + L P+ +Q I + + F FIY + V RR LW+ Sbjct: 428 GRIWVVWRDNAR-LTPVFKSSQMITCSILLEGKEEEFFCSFIYASNFVEERRILWEDIRS 486 Query: 2665 XXXXXXXXXXL----GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSIGSFH 2498 GDFN E L G E S+Y+ G+ D IG Sbjct: 487 HHDSPLIRNKPWILCGDFN------EILEGGEHSNYDNSPYTP----PGMRDFQEIG--- 533 Query: 2497 TWTNNTVLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKF 2318 R M A + +K FKF Sbjct: 534 ------------RLMLEAA-------------------------------ATGGRKPFKF 550 Query: 2317 FNMWCDHADFEQLISEHWEEP----IHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRA 2150 N+ F ++ HW + + + KKLK LK L+ L K+ + R Sbjct: 551 VNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKLGDLPKRT 610 Query: 2149 EKARNDFDQALEEFHLQPANTALQLQIADLKLKA--RSLSEAERSFYFQQAKCKHLTYSD 1976 +A E+ AN + + +LK LSE E F Q++K + D Sbjct: 611 REAHI---LLCEKQATTLANPSQETIAEELKAYTDWTHLSELEEGFLKQKSKLHWMNVGD 667 Query: 1975 RGTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTE-GDCQPINL 1799 +FH + RN I I + +S E+ E F++ L + GD I++ Sbjct: 668 GNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFHGISV 727 
Query: 1798 EICQD--GPLITQNQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGV 1625 E ++ + L R ++ +EI+ LF++ ++KSPGPDGYT++F+K W++ G Sbjct: 728 EDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGP 787 Query: 1624 QFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRL 1445 F AI FF G L + +N T++AL+PK D A + D+RPI+CCNV YKVI+KILA+RL Sbjct: 788 DFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILANRL 847 Query: 1444 SVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWD 1265 + L + I + QSAFV R +++N+ + EL+K Y+++ ++PRC +KID+ KA+D++ W Sbjct: 848 KLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQ 907 Query: 1264 FLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVIC 1085 FL + L+ LNFP F HWI C+ST ++S+++NGE+ GFF RGLRQG LSP+LFVIC Sbjct: 908 FLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVIC 967 Query: 1084 VEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGN 905 + S ++ AA + + YHPKC + ++H S+ +++ +F Sbjct: 968 MNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAG 1027 Query: 904 KSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLI 725 +SGL+ + KS+I+ AG+ ++ L + G +P RYLG+P++ +++ Y LI Sbjct: 1028 RSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLI 1087 Query: 724 DKIHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLW 551 + + + + +WTA++LS+AG+L L+ +V+ W S +P + +I +C +FLW Sbjct: 1088 EAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLW 1145 Score = 56.6 bits (135), Expect(2) = e-115 Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 1/80 (1%) Frame = -2 Query: 539 APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIW 360 A +AW +C PK EGGLG++ L N K++W + + + SLW+ WI + + W Sbjct: 1155 AKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFW 1214 Query: 359 DVQNKKD-HSPLMKRILQIR 303 + S + K++L+ R Sbjct: 1215 SANERSSLGSWMWKKLLKYR 1234 Score = 67.4 bits (163), Expect = 6e-08 Identities = 42/125 (33%), Positives = 61/125 (48%), Gaps = 8/125 (6%) Frame = -2 Query: 353 
QNKKDHSPLMKRILQIRNQLLQMEGSCTAAITRIESWMRLGN-----FSSSLAYEWLRPK 189 Q+++ + + RI +L Q E A I W L N F + + + +R Sbjct: 1292 QHRQHRAAIYNRINAEIQRLQQQERE---AGPDISLWRSLKNDFNKRFITKVTWNNVRTH 1348 Query: 188 GTKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLSFQDNTE---CCLCNNATESHRH 18 + W K +W Y PKYSF LWL ++RL T DR+ ++ + C LCNNA E+ H Sbjct: 1349 QPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDH 1408 Query: 17 LFFQC 3 LFF C Sbjct: 1409 LFFSC 1413 Score = 63.5 bits (153), Expect = 8e-07 Identities = 50/226 (22%), Positives = 92/226 (40%), Gaps = 8/226 (3%) Frame = -1 Query: 4002 VKYKYYTHRSGWLVFKFENEEDKAKVLQGGPYFVFGRPLMIKSLPYCFQFDETDFHDVPV 3823 VK Y + + F+ +A+VL+ G + + P+++ + + + +P+ Sbjct: 98 VKIDAYVVDTKTIKFRIRESSVRARVLRRGMWNIADMPMIVSKWSPVAEDAQPEIKTMPM 157 Query: 3822 WVTLPGLPLECWHPMALSKICSKVGKPISSDGLTASRDRLSYARVLVEVDASKPLVKSVP 3643 W+T+ +P + LS + S +G+P T + A+V VE D ++ + K Sbjct: 158 WITIKNVPRSMFTWKGLSFLASPIGEPKKLHPDTVLCNSFEEAKVFVEADLTQEMPKQFR 217 Query: 3642 IKLPNGQTRVQEIRFEHEPRFCTSCKMLGHDLENC---NGSHHMSTPSAALEKGNNQAST 3472 K G + E ++ P C+SC GH E C + +STP+ + + Sbjct: 218 FKSETGVDAMVEYKYPWLPPRCSSCSKWGHIQEVCLTRPSPNQLSTPTEIETEDKTEPPL 277 Query: 3471 RRGR-----SKEPSKHNQRDGKGLENATTSSMLPPTQKGSSATEVA 3349 + + SK PS + G + M PT + EVA Sbjct: 278 MKEKPLEILSKSPSATLTKTLNGDSHTQKVPMKNPTVLQNKGKEVA 323 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 419 bits (1077), Expect = e-114 Identities = 244/716 (34%), Positives = 384/716 (53%), Gaps = 22/716 (3%) Frame = -3 Query: 2632 GDFNCVMKASERLNGTE--VSSYETRDLLQCCLSAGLSDLNSIGSFHTWTN----NTVLC 2471 GDFN ++ E N E V++ RD ++DL G TW+N + + Sbjct: 29 GDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHCSITDLAYHGPLFTWSNKRENDLIAK 88 Query: 2470 KLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPT---KKNFKFFNMWCD 2300 KLDR + N+ W S + F GC SDH C + L + K+ FKF N+ + Sbjct: 89 KLDRVLVNDVWLQSFPRSYSVFEAGGC-SDHLRCRINLNVGAGAVVKGKRPFKFVNVITE 147 Query: 2299 HADFEQLISEHWEEP----IHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARND 
2132 F + +W E + + F KKLK LK L+ L K+ ++ + ++A Sbjct: 148 MEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLKPLLRNLGKERLGNLVKQTKEAFET 207 Query: 2131 FDQALEEFHLQPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHS 1952 Q P+ +++Q + + K ++ E F Q++K L DR K FH Sbjct: 208 LCQKQAMKMANPSPSSMQEE-NEAYAKWDHIAVLEEKFLKQRSKLHWLDIGDRNNKAFHR 266 Query: 1951 LVKRNTKRNHIAAITKMDGTVTTSSCEV-------VKEFLDFYDNLLGTEGDCQPINLEI 1793 V +N I I DG+V + ++ +EFL N D + I +E Sbjct: 267 AVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFREFLQLIPN------DFEGIAVEE 320 Query: 1792 CQDG-PLITQNQSRDLL-RPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQF 1619 QD P + +++L +S +EI +FS+ +DKSPGPDGYTA+FYK AWNI+G +F Sbjct: 321 LQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPDGYTAEFYKGAWNIIGAEF 380 Query: 1618 SQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSV 1439 AI FF G L + IN T++AL+PK A + D+RPI+CCNV YKVI+KI+A+RL + Sbjct: 381 ILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRPISCCNVLYKVISKIIANRLKL 440 Query: 1438 TLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFL 1259 L I QSAFV R +++N+ + E++K Y++ +S RC LKID+ KA+D++ W FL Sbjct: 441 VLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVSSRCALKIDISKAFDSVQWKFL 500 Query: 1258 KDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVE 1079 ++L+ +NFPP+F HWI C++T S+S+++NGE+ G F R LRQG LSP+LFVI ++ Sbjct: 501 INVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFSSARELRQGCSLSPYLFVISMD 560 Query: 1078 YFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKS 899 S+ L++A F YHPKC + ++H S+ ++ L +F S Sbjct: 561 VLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILSDGKVRSIDGIVKVLYEFAKWS 620 Query: 898 GLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDK 719 GL+ + KS+++ AG+Q Q +++ + +G +P RYLG+P+V+++L LI++ Sbjct: 621 GLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRYLGLPLVSKRLTASDCLPLIEQ 680 Query: 718 IHSYLKTWTAKNLSHAGKLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLW 551 + ++ WT++ LS AG+L LI + L W + +P A + +I +C +FLW Sbjct: 681 LRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRLPRACIREIDKLCSAFLW 736 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca 
subsp. vesca] Length = 958 Score = 365 bits (937), Expect(2) = e-113 Identities = 224/699 (32%), Positives = 332/699 (47%), Gaps = 2/699 (0%) Frame = -3 Query: 2632 GDFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSIGSFHTWTNNTVLCKLDRAM 2453 GDFN + E + G + + C ++ L DLN Sbjct: 97 GDFNVTRRCEETIGGNSRFTNAMDEFNSCLHNSKLDDLNY-------------------- 136 Query: 2452 ANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQLIS 2273 + +FLP G +SDH+ +V + R K FKFFN D DF ++S Sbjct: 137 -----------SVLSFLPPG-ISDHAAMVVKVGLPFRIRKAPFKFFNFLADREDFIPIVS 184 Query: 2272 EHWEEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHLQPA 2093 W + G+KQF + +KLK +K K LN Sbjct: 185 AVWATNVWGSKQFQVWRKLKLVKNQFKLLN------------------------------ 214 Query: 2092 NTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLVKRNTKRNHIAA 1913 + + LK +S + + L D+ + FF + ++ RN IA Sbjct: 215 -----CNVVEKLLKKKS-------------RVQWLKKGDKNSTFFFKTMTKHRNRNRIAT 256 Query: 1912 ITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEICQDGPLITQNQSRDLLRPIS 1733 I + DG + ++ L + Sbjct: 257 INRSDGP------------------------------------------DLAKSLCNEFT 274 Query: 1732 IDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQF-SQAIMEFFTSGSLLRMINHTV 1556 D+I++ FS+ +KSPGPDG+ F++KAW ++G + A+ EFF+ GSLL +N T+ Sbjct: 275 HDDIRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNSTI 334 Query: 1555 IALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLD 1376 I LVPK + +T+ DFRPI+CCN YK+IAK+LA+RL TL ++ +QS F+PGR + D Sbjct: 335 ITLVPKVANPTTMSDFRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGD 394 Query: 1375 NIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGLNFPPKFVHWIMECV 1196 NI + QE++ Y++ PRC +D+ KA DT+ WDF+ L N P + WI C+ Sbjct: 395 NILLAQEIICDYHKADGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCI 454 Query: 1195 STPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLNRAAK-NAHFSYHPK 1019 S+ +S+ +NGE+ GFF +RGLRQGDPLSP+LFVI +E S + R + F YH + Sbjct: 455 SSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWR 514 Query: 1018 CGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANALKSSIFTAGIQGRE 839 C L +SH +SV L ++F + S L+AN +S IF 
AG+ G Sbjct: 515 CDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNS 574 Query: 838 KQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKTWTAKNLSHAGKLE 659 VL+ N +G P RYLGIP++ KL++ L+D+I + +K+W K LS AG+L+ Sbjct: 575 SDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQ 634 Query: 658 LIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLWGVN 542 LIQ+VL + W S L +P V+ I R FLW N Sbjct: 635 LIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGN 673 Score = 75.5 bits (184), Expect(2) = e-113 Identities = 55/201 (27%), Positives = 84/201 (41%), Gaps = 18/201 (8%) Frame = -2 Query: 551 GR**APVAWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHG 372 GR VAW ++CLPK EGGLG+++L WN +L+ +WN+ + + W W+ L G Sbjct: 676 GRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKG 735 Query: 371 VSIWDVQNKKDHSPLMKRILQIR----NQLLQMEGSCTAAITRIESWMRLG----NFSSS 216 S W+ S +++L+IR + + + G A ++W LG +SS+ Sbjct: 736 NSFWNAPLPSICSWNWRKLLKIRELCCSFFVNIIGDGRATSLWFDNWHPLGPLTLRWSSN 795 Query: 215 LAYE-------WLRPKG---TKTIWIKQIWKEYIPPKYSFNLWLAAKSRLQTRDRLSFQD 66 + E L P G T + W +I P Y +W A Sbjct: 796 IIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRL-VWFVA-------------- 840 Query: 65 NTECCLCNNATESHRHLFFQC 3 E+H HLFF C Sbjct: 841 -----------ETHNHLFFDC 850 >emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana] Length = 1141 Score = 416 bits (1069), Expect = e-113 Identities = 283/903 (31%), Positives = 439/903 (48%), Gaps = 29/903 (3%) Frame = -3 Query: 3016 GLNLPLKQNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLHNAGRI 2837 G N+P +NG + K +R V+E + K + + GW N+ + G+I Sbjct: 4 GFNIPSHRNGFKKWFKVNRPIFGGVIEKHVKQPKDKKFINALLPGWFFDENYGFSDLGKI 63 Query: 2836 LIIWDPSTTILEPIILDAQFILARAICKVTALSFHICFIYGFHTVVSRRPLWDTXXXXXX 2657 ++WDPS ++ + Q I + + I +Y + R+ LW Sbjct: 64 WVLWDPSVEVVI-VAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKELWREITALVA 122 Query: 2656 XXXXXXXL----GDFNCVMKASERLNGTEVS-SYETRDLLQCCLSAGLSDLNSIGSFHTW 2492 GDFN V+ E ++ RD +C L A LSDL GS TW Sbjct: 123 
SPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRDFRECLLDAELSDLVYKGSSFTW 182 Query: 2491 TNNT----VLCKLDRAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNF 2324 N + V K+DR + NE+W + + F P SDH+ C V L K+ F Sbjct: 183 WNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPD-FSDHASCGVVLELDPIKAKRPF 241 Query: 2323 KFFNMWCDHADFEQLISEHWEEP-IHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAE 2147 KFFN + +F L+ + W + G+ F + KKLK LK P+K ++ ++S++ R E Sbjct: 242 KFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKALKKPIKDFSRLNYSNLEKRTE 301 Query: 2146 KARNDFDQALEEFHLQPANTALQLQIADLKL--KARSLSEAERSFYFQQAKCKHLTYSDR 1973 +A + L +L N +L+ +L+ K + L+ AE SF+ Q+++ D Sbjct: 302 EAH---ETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRSRVTWFAEGDG 358 Query: 1972 GTKFFHSLVKRNTKRNHIAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEI 1793 T++FH + N I + GT S + +++NLL + D P +LE Sbjct: 359 NTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCALYFENLLSDDND--PYSLEQ 416 Query: 1792 CQDGPLITQ----NQSRDLLRPISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGV 1625 L+T +Q DL S ++IK+A F + +K+ GPDG+ Sbjct: 417 DDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACGPDGFPV------------ 464 Query: 1624 QFSQAIMEFFTSGSLLRMINHTVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRL 1445 + A+ EFF SG+LL+ N T I L+PK +AS DFRPI+C N YKVIA++L DRL Sbjct: 465 --TAAVREFFISGNLLKQWNATTIVLIPKFPNASCTSDFRPISCMNTLYKVIARLLTDRL 522 Query: 1444 SVTLGTLIDKAQSAFVPGRSMLDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWD 1265 L +I +QSAF+PGR + +N+ + E++ YN + IS R +LK+DL+KA+D++ W+ Sbjct: 523 QKLLSCVISPSQSAFLPGRLLAENVLLATEMVHGYNWRNISLRGMLKVDLRKAFDSVRWE 582 Query: 1264 FLKDMLDGLNFPPKFVHWIMECVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVIC 1085 F+ L L P KF++WI +C+STP++++ +NG GFFK +GLRQGDPLSP+LFV+ Sbjct: 583 FIIAALLALGVPTKFINWIHQCISTPTFTVSVNGCCGGFFKSAKGLRQGDPLSPYLFVLA 642 Query: 1084 VEYFSRSLNRAAKNAHFSYHPKCGGLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGN 905 +E FS+ LN + + YHPK L ISH +SS+ + TL DF + Sbjct: 643 MEVFSKLLNSRFDSGYIRYHPKASDLSISHLMFADDVMIFFDGGSSSLHGICETLEDFAS 702 Query: 904 KSGLRANALKSSIFTAGIQGREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLI 725 SGL+ N KS F AG++ E+ L A P G +P RYLG+P++ KL++ +Y L+ Sbjct: 703 
WSGLKVNNDKSHFFCAGLEQAERN-SLAAYGFPQGCLPIRYLGLPLMCRKLRIAEYEPLL 761 Query: 724 DKIHSYLKTWTAKNLSHAGKLELIQAV---LQGTECLWFSVLPVPCAVMDKII------- 575 +K + KN H ++ + V L + S P + ++ Sbjct: 762 EK--------SPKNSDHGQQIVYLTQVEFNLLLPLSMVSSTFGCPLSCCQRVALRRLKAF 813 Query: 574 ---SICRSFLWGVNKHQLLGKIYACPNPREDLALES*KHGTTPSWLKYYGTSMQRRTLFG 404 S R L V + + LG ++A + LALE GT +G S+ + FG Sbjct: 814 VLGSFERETLMVVEEQRSLGLLFASQKMKVGLALEDSPSGTKRFVCVLFGFSLIIKVRFG 873 Query: 403 YDG 395 + G Sbjct: 874 FLG 876 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 383 bits (983), Expect(2) = e-108 Identities = 245/820 (29%), Positives = 395/820 (48%), Gaps = 4/820 (0%) Frame = -3 Query: 2995 QNGVRHLIKEHRIDVLAVLETKINDVKCSRIMRNKFGGWNHFHNFHLHNAGRILIIWDPS 2816 Q ++ L HR+ +LA+LE ++ K + R K G F ++N+ +I + S Sbjct: 866 QRRIKKLQLMHRLKILAILEPMVDTSK-AEYFRRKMG----FEKVIVNNSQKIWLFH--S 918 Query: 2815 TTILEPIILD-AQFILARAICKVTALSFHICFIYGFHTVVSRRPLWDTXXXXXXXXXXXX 2639 + ++LD Q + R L F+Y T R PLW+ Sbjct: 919 VEFICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLRNLAADMEGPW 978 Query: 2638 XLG-DFNCVMKASERLNGTEVSSYETRDLLQCCLSAGLSDLNSIGSFHTWTNNTVLCKLD 2462 +G DFN ++K ERL G + D L GL D G+ TWTNN + +LD Sbjct: 979 IVGGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTWTNNRMFQRLD 1038 Query: 2461 RAMANEAWFSSNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQ 2282 R + N+ W + L SDH P +++ S +F+F + W H +F Sbjct: 1039 RMVYNQQWINKFPITRIQHLNRDG-SDHCPLLLSCSNSSEKAPSSFRFLHAWALHHNFNA 1097 Query: 2281 LISEHWEEPIHGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHL 2102 + +W PI+G+ K KRLK LK NK F I S ++A ++ E H Sbjct: 1098 SVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEKRVEEC-EILHQ 1156 Query: 2101 QPANTALQLQIADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLVKRNTKRNH 1922 Q ++Q+ + E F+ Q++ K + +R TKFFH +++ R+H Sbjct: 1157 QEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSH 1216 Query: 1921 IAAITKMDGTVTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEICQDGPLITQNQSRDLLR 1742 I I + DG ++ + +DF+ +LL E C + +I+ + L Sbjct: 1217 
IFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAES-CDDTRFQSSLCPSIISDTDNGFLCA 1275 Query: 1741 PISIDEIKSALFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEFFTSGSLLRMINH 1562 ++ E+K A+F I + + GPDG+++ FY++ W+I+ +A+ EFF + + + Sbjct: 1276 EPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTS 1335 Query: 1561 TVIALVPKSDHASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRSM 1382 T + L+PK+ AS +FRPI+ C V K+I KILA+RL+ L ++I + QS FV GR + Sbjct: 1336 TTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLI 1395 Query: 1381 LDNIHMVQELLKHYNRKRISPRCILKIDLKKAYDTICWDFLKDMLDGLNFPPKFVHWIME 1202 DNI + QEL+ ++K LK+D+ KAYD + W FL +L L F +++ I + Sbjct: 1396 SDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQK 1455 Query: 1201 CVSTPSYSLRINGEMHGFFKGQRGLRQGDPLSPFLFVICVEYFSRSLNRAAKNAHFSYHP 1022 C+S +SL +NG G+FK +RGLRQGD +SP LF++ EY +R LN A + + S H Sbjct: 1456 CISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLN-ALYDQYPSLHY 1514 Query: 1021 KCG-GLKISHXXXXXXXXXXXXXXASSVGILLSTLTDFGNKSGLRANALKSSIFT-AGIQ 848 G L +SH S++ +++ L ++ SG R N KS + T + Sbjct: 1515 SSGCSLSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTNMA 1574 Query: 847 GREKQVVLEAANIPIGHMPFRYLGIPIVAEKLKVCQYGLLIDKIHSYLKTWTAKNLSHAG 668 +Q++L+A +P YLG P+ KV + L+ KI + W K LS G Sbjct: 1575 SSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPGG 1634 Query: 667 KLELIQAVLQGTECLWFSVLPVPCAVMDKIISICRSFLWG 548 ++ L+++ L VL P V+++I + +FLWG Sbjct: 1635 RITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWG 1674 Score = 39.3 bits (90), Expect(2) = e-108 Identities = 23/75 (30%), Positives = 37/75 (49%) Frame = -2 Query: 530 AWKDLCLPKSEGGLGLRELKTWNNSLLAKVLWNINAKKDSLWIRWIDQVYLHGVSIWDVQ 351 +W + LP +EGGL +R ++ + K+ W +SLW +++ Y G DVQ Sbjct: 1686 SWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRT-TNSLWTQFMRAKYCGGQLPTDVQ 1744 Query: 350 NKKDHSPLMKRILQI 306 K S KR++ I Sbjct: 1745 PKLHDSQTWKRMVTI 1759 Score = 65.5 bits (158), Expect = 2e-07 Identities = 47/163 (28%), Positives = 73/163 (44%), Gaps = 6/163 (3%) Frame = -1 Query: 4008 WNVKYKYYTHRSGWLVFKFENEEDKAKVLQGGPYFVFGRPLMIKSLPYCFQFDETDFHDV 
3829 + V++ Y H ++ NE+D ++ +F+ + + + F+ E + V Sbjct: 20 YEVRWLDYKH----VLIHLSNEQDFNRIWTKQNWFIATQKMRVFKWTPEFE-PEKESAVV 74 Query: 3828 PVWVTLPGLPLECWHPMALSKICSKVGKPISSDGLTASRDRLSYARVLVEVDASKPLVKS 3649 PVW++ P L + AL I VGKP+ D TA+ R S ARV VE D K V Sbjct: 75 PVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCVEYDCRKSPVDQ 134 Query: 3648 VPIKLPNGQT------RVQEIRFEHEPRFCTSCKMLGHDLENC 3538 V I + N +T Q + F P +C C +GH +C Sbjct: 135 VWIVVQNRKTGEVMNGYSQRVEFAQMPAYCDHCCHVGHKETDC 177 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 395 bits (1016), Expect = e-107 Identities = 202/506 (39%), Positives = 301/506 (59%), Gaps = 1/506 (0%) Frame = -3 Query: 2788 DAQFILARAICKVTALSFHICFIYGFHTVVSRRPLW-DTXXXXXXXXXXXXXLGDFNCVM 2612 +AQ I CK TA F + FIYG H++++RR LW + +GDFN ++ Sbjct: 458 NAQLIHCAIDCKTTAKRFQVSFIYGLHSIMARRSLWINLNSINANMNCPWLLIGDFNSIL 517 Query: 2611 KASERLNGTEVSSYETRDLLQCCLSAGLSDLNSIGSFHTWTNNTVLCKLDRAMANEAWFS 2432 ++R NG E+++YE +D + C GL +N+ G +TWTN+ V KLDRA+ N+AWF+ Sbjct: 518 SPTDRFNGAELNAYELQDFVDCYSDLGLGSINTHGPLYTWTNSRVWSKLDRALCNQAWFN 577 Query: 2431 SNHAGMANFLPSGCLSDHSPCIVALFQHSRPTKKNFKFFNMWCDHADFEQLISEHWEEPI 2252 S + +SDH+P +V FKF N+ DH +F +++++ W++ I Sbjct: 578 SFGNSACEVMEFISISDHTPLVVTTELVVPRGNSPFKFNNLIVDHPNFLRIVADGWKQNI 637 Query: 2251 HGTKQFTLCKKLKRLKGPLKALNKKHFSHISSRAEKARNDFDQALEEFHLQPANTALQLQ 2072 HG F +CKKLK LK PLK L K+ FS+IS+R E A +++ L P + +L Sbjct: 638 HGCSMFKVCKKLKALKAPLKNLFKQEFSNISNRVELAEAEYNSVLNSIKQNPQDPSLLAL 697 Query: 2071 IADLKLKARSLSEAERSFYFQQAKCKHLTYSDRGTKFFHSLVKRNTKRNHIAAITKMDGT 1892 + + L +AE + Q K K+L +D+ +KFFH+L+KRN IAAI DG Sbjct: 698 ANRTRGQTIMLRKAESMKFAQLIKNKYLLQADKCSKFFHALIKRNKHSRFIAAIRLEDGH 757 Query: 1891 VTTSSCEVVKEFLDFYDNLLGTEGDCQPINLEICQDGPLITQNQSRDLLRPISIDEIKSA 1712 T+S E+ F++ + N Q ++ IC GP + + LL P S ++ + Sbjct: 758 NTSSQDEIALAFVNHFRNFFSAHELTQTPSISICNRGPKVPTDCFAALLCPTSKQKVWNI 817 Query: 1711 LFSIGDDKSPGPDGYTAQFYKKAWNIVGVQFSQAIMEFFTSGSLLRMINHTVIALVPKSD 1532 + + ++K+PGPDG+ F+KKAWNIVG A+ EFFT+G 
+L+ +NH +I L+PK D Sbjct: 818 ISVMANNKAPGPDGFNVLFFKKAWNIVGDDIFAAVNEFFTTGKILKQLNHAIIVLIPKHD 877 Query: 1531 HASTVGDFRPIACCNVTYKVIAKILADRLSVTLGTLIDKAQSAFVPGRSMLDNIHMVQEL 1352 AS V FRPI+CCN+ YK+++KILA+R++ L T+I + Q+AF+ R M+DNI +VQE+ Sbjct: 878 QASQVNHFRPISCCNLLYKIVSKILANRIAPVLETIIGETQTAFIKNRKMMDNIFLVQEI 937 Query: 1351 LKHYNRKRISPRCILKIDLKKAYDTI 1274 L+ Y RKR SPRC+LKIDL KAYD I Sbjct: 938 LRKYARKRPSPRCLLKIDLHKAYDFI 963 Score = 254 bits (648), Expect = 3e-64 Identities = 127/259 (49%), Positives = 162/259 (62%), Gaps = 5/259 (1%) Frame = -1 Query: 4299 SRTLGTEDTYTSDSDCSASQDRATKPP----QTTAPWADLFKTNRSPQMGLALT-EIKDQ 4135 S TLG D+ T+D D S S + P + PW +LFK NRSP G + Sbjct: 100 SCTLGDNDS-TTDDDSSHSCGSKSSPQLDNNKALTPWVNLFKDNRSPSKGFGMKFSPPPS 158 Query: 4134 PEEVTIMSHESFDVHTAWGFCIVGYIAGRFPGKTALLRVCDEWNVKYKYYTHRSGWLVFK 3955 +EV + + + AWG ++GY+AGRFPGK ALL C +W VK+ Y H SGWLVFK Sbjct: 159 DDEVLLEETDLQPLEEAWGHSLIGYVAGRFPGKKALLDCCKKWGVKFSYSAHESGWLVFK 218 Query: 3954 FENEEDKAKVLQGGPYFVFGRPLMIKSLPYCFQFDETDFHDVPVWVTLPGLPLECWHPMA 3775 FE+E+D +VL GPYF+F RPL++K +P F F + +PVWV L LPLE W+P A Sbjct: 219 FESEDDLNQVLSAGPYFIFQRPLLLKVMPAFFDFGNEELSKIPVWVKLRNLPLELWNPQA 278 Query: 3774 LSKICSKVGKPISSDGLTASRDRLSYARVLVEVDASKPLVKSVPIKLPNGQTRVQEIRFE 3595 L KI SK+G PI SD LTAS+ +S+AR LVEVDAS L+ V +LP G+T VQ+I +E Sbjct: 279 LGKILSKIGSPIRSDHLTASKGSISFARALVEVDASLELIDEVRFRLPTGKTFVQKIEYE 338 Query: 3594 HEPRFCTSCKMLGHDLENC 3538 + P FCT CKM GH L NC Sbjct: 339 NRPSFCTHCKMTGHRLTNC 357