BLASTX nr result
ID: Cocculus23_contig00009173
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus23_contig00009173 (1974 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 332 4e-88 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 328 4e-87 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 327 9e-87 gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] 326 3e-86 gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 326 3e-86 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 322 3e-85 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 319 3e-84 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 313 1e-82 gb|AAC95175.1| putative non-LTR retroelement reverse transcripta... 312 4e-82 ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298... 298 5e-78 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 295 7e-77 emb|CAB72467.1| putative protein [Arabidopsis thaliana] 291 6e-76 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 273 3e-70 gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA... 270 2e-69 gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] 262 4e-67 emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera] 246 4e-62 emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga... 241 7e-61 gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali... 238 1e-59 ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom... 237 1e-59 dbj|BAB01344.1| reverse transcriptase-like protein [Arabidopsis ... 237 1e-59 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 332 bits (851), Expect = 4e-88 Identities = 188/543 (34%), Positives = 287/543 (52%), Gaps = 5/543 (0%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AG+F+S RG+RQG +SPYLF I M+VLS + H CK G Sbjct: 638 FSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMG 697 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL+V ++ + + FA++SGL I+ +KS ++LAG+S+ + + Sbjct: 698 LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVA 757 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + F G PV+YLGLPLI+ RL + C PL++ + ++ SW SR LS+AGR L+ SV Sbjct: 758 DRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSV 817 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 L S+ +W AF +P +LE + FL SG+ + + ISW + +P +EGGL +R Sbjct: 818 LWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLR 877 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLK-GEXXXXXXXXXXXXXXXRRIFKLR 894 K+ N LKL+WK++++ +SLWV+WV L+ +++ K R Sbjct: 878 SLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYR 937 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 ++AK VGNG+ TSFW D W + G L + I LGI R V N R Sbjct: 938 EVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQR 997 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTS---GLFSTKSAWEITRFKHPKCS 1245 R + I + + T ED+V+W+ + FST+ W TR + Sbjct: 998 --RHRNDVYNVIEDALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVP 1055 Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425 W K+IWF PK++ W ++PT ++ + ++ CIFC ET HLFF+C Sbjct: 1056 WHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTC 1115 Query: 1426 HFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNERN 1605 F+ +W +A+ F + + + W++ + I + ++ + + FQA+IY VW ERN Sbjct: 1116 SFTSVIWVDLARGIFKTQYTSHWQSIIEAITNS-QHHRVEWFLRRYVFQATIYIVWRERN 1174 Query: 1606 RRR 1614 RR Sbjct: 1175 GRR 1177 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 328 bits (842), Expect = 4e-87 Identities = 191/541 (35%), Positives = 280/541 (51%), Gaps = 6/541 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 F++ VNG+ GFFRS +G+RQGDP+SPYLF +AMEV S + G++ Sbjct: 492 FTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLS 551 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ISHLMFADD+++F S++++ + FA++SGL++NK KS +F AG+ +L + + Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERIT 609 Query: 361 EAS-GFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537 A+ GF G FP++YLGLPL+ +L I+ PL++ + ++L+SW S++LS+AGRT L+ S Sbjct: 610 SAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISS 669 Query: 538 VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714 V+ L +W F +P K+ES+ +FL +GS K S +SW D P EGGL Sbjct: 670 VIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGF 729 Query: 715 RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894 R + NK LL+LIW L SLW +W L + + LR Sbjct: 730 RSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLR 789 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 LA+ I VGNG + SFW D W + G L + ++ L I +A VA GWR Sbjct: 790 PLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWR 849 Query: 1075 FPYSRIPLIREIWNQCSSL-YCLPTLEEDEVVWKATTSGL--FSTKSAWEITRFKHPKCS 1245 P SR I + +SL P + D W FS WE+ R + P Sbjct: 850 LPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPRRPVKR 909 Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425 WAK +WF +PKHA W L ++PT +L +V S+ C C ET HL C Sbjct: 910 WAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLC 969 Query: 1426 HFSLGVWSLI-AQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602 FS VW ++ + C +W L + S+ + S++ K+ Q +Y++W +R Sbjct: 970 DFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQ--STAAAPSLLRKVVAQLVVYNLWRQR 1027 Query: 1603 N 1605 N Sbjct: 1028 N 1028 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 327 bits (839), Expect = 9e-87 Identities = 190/541 (35%), Positives = 280/541 (51%), Gaps = 6/541 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 F++ VNG+ GFFRS +G+RQGDP+SPYLF +AMEV S + G++ Sbjct: 492 FTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLS 551 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ISHLMFADD+++F S++++ + FA++SGL++NK KS +F AG+ +L + + Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERIT 609 Query: 361 EAS-GFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537 A+ GF G FP++YLGLPL+ +L I+ PL++ + ++L+SW S++LS+AGRT L+ S Sbjct: 610 SAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISS 669 Query: 538 VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714 V+ L +W F +P K+ES+ +FL +GS K S +SW D P EGGL Sbjct: 670 VIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGF 729 Query: 715 RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894 R + NK LL+LIW L SLW +W L + + LR Sbjct: 730 RSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLR 789 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 LA+ I VGNG + SFW D W + G L + ++ L I +A VA GWR Sbjct: 790 PLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWR 849 Query: 1075 FPYSRIPLIREIWNQCSSL-YCLPTLEEDEVVWKATTSGL--FSTKSAWEITRFKHPKCS 1245 P SR I + +SL P + D W FS WE+ R + P Sbjct: 850 LPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPRRPVKR 909 Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425 WA+ +WF +PKHA W L ++PT +L +V S+ C C ET HL C Sbjct: 910 WARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLC 969 Query: 1426 HFSLGVWSLI-AQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602 FS VW ++ + C +W L + S+ + S++ K+ Q +Y++W +R Sbjct: 970 DFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQ--STAAAPSLLRKVVAQLVVYNLWRQR 1027 Query: 1603 N 1605 N Sbjct: 1028 N 1028 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 326 bits (835), Expect = 3e-86 Identities = 193/552 (34%), Positives = 287/552 (51%), Gaps = 14/552 (2%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG G+F+S+RG+RQG +SPYLF I M+VLS + + C+ G Sbjct: 285 FSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLG 344 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL+V + ++ L + F + SGL I+ +KS +++AG+S ++ + Sbjct: 345 LTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIA 404 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 F G PV+YLGLPL++ RL + PL++ ++ ++ +W R S+AGR L++SV Sbjct: 405 AKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSV 464 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS--SKHKFSLISWKDIARPLEEGGLNI 714 L S+ +W AF +P +++ + FL SGS S HK + ISW + +P EGGL + Sbjct: 465 LWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHK-AKISWDIVCKPKAEGGLGL 523 Query: 715 RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL--KGEXXXXXXXXXXXXXXXRRIFK 888 R K+ N LKL+W++I+N +SLW +WV YL K R+I K Sbjct: 524 RNLKEANDVSCLKLVWRIISNSNSLWTKWV-AEYLIRKKSIWSLKQSTSMGSWIWRKILK 582 Query: 889 LRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDG 1068 +RD+AK VGNG S SFW D W HG L + + I LGI R A VA D Sbjct: 583 IRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVA-----DA 637 Query: 1069 W---RFPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGL---FSTKSAWEITRFK 1230 W R L+ EI + + ED V+W+ FST+ W + + Sbjct: 638 WTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWHLIKAT 697 Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSS----CIFCWTGEE 1398 SW K +WF PK+A W ++PT ++ LK S S C+ C + Sbjct: 698 SSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRM--LKWNSSGSVSGNCVLCTNNSK 755 Query: 1399 TEKHLFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQAS 1578 T +HLFFSC ++ VW+ +A+ + + + W L I F + ++ + + FQA+ Sbjct: 756 TLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHF-QDRVEGFLTRYIFQAT 814 Query: 1579 IYHVWNERNRRR 1614 IYHVW ERN RR Sbjct: 815 IYHVWRERNGRR 826 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 326 bits (835), Expect = 3e-86 Identities = 192/544 (35%), Positives = 278/544 (51%), Gaps = 6/544 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AG+FRS RGIRQG +SPYLF I+MEVLS + CK G Sbjct: 43 FSVQVNGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLG 102 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL++ +++ + +M FA+ SGL+IN +K+ ++ AG+S + ++ Sbjct: 103 LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMI 162 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 F G PV+YLGLPL++ RL PL + + +++ +W SR LS+AGR L+ SV Sbjct: 163 SRYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSV 222 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 L S +W AF +PS+ ++ SI FL SG H + + +SW DI +P +EGGL +R Sbjct: 223 LWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLR 282 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894 + N +LKLIW++ +N DSLWV+W LK E +++ K R Sbjct: 283 SLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYR 342 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 + AK V NG TSFW D W G L + QI LGI RN VA N R Sbjct: 343 ETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNR--R 400 Query: 1075 FPYSRIPLIREIWNQCSSLY-CLPTLEEDEVVWKA---TTSGLFSTKSAWEITRFKHPKC 1242 R + +I + Y L ED +W+ FSTK W R K + Sbjct: 401 RRKHRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEV 460 Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFS 1422 +W K +WF PK+ W ++ T ++++ C FC T ET HLFFS Sbjct: 461 AWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFS 520 Query: 1423 CHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602 C ++ +W+ IA+ F W+ + I + ++ I S + + FQ +++ VW ER Sbjct: 521 CSYASAIWTAIAKNVLQHRFSTDWQTIVNYI-SETQTDRIRSFLSRYIFQLTVHTVWKER 579 Query: 1603 NRRR 1614 N RR Sbjct: 580 NDRR 583 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 322 bits (826), Expect = 3e-85 Identities = 183/541 (33%), Positives = 281/541 (51%), Gaps = 6/541 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 F+V +NG GFF+S +G+RQGDP+SPYLF +AME S++ G + Sbjct: 632 FTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLS 691 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ISHLMFADD+++F S +L+ + FA +SGL++NK KS+++LAG+ + LE + Sbjct: 692 ISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGL-NQLESNAN 750 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 A GF G P++YLGLPL++ +L I+ +PL++ + ++ +SW ++ LS+AGR L+ SV Sbjct: 751 AAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSV 810 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717 + +W F +P ++ES+ RFL SG+ + K +SW + P EGGL +R Sbjct: 811 IFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLR 870 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLRD 897 + NK ++LIW+L KDSLW W H +L +R+ LR Sbjct: 871 RLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRP 930 Query: 898 LAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWRF 1077 LA + VGNG +W D W + G L I ++ SL + A VA DGWR Sbjct: 931 LAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRL 990 Query: 1078 PYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGL----FSTKSAWEITRFKHPKCS 1245 P SR + I + ++ T +ED ++ + +G FS WE R K S Sbjct: 991 PVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAKTWEAIRPKATVKS 1050 Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425 WA IWF +PK+A +W L ++ T +L + S +C+ C E+ HL C Sbjct: 1051 WASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLIC 1110 Query: 1426 HFSLGVWSLIAQK-CFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602 FS VW L+ ++ C +SW L + + SS ++ K+ Q +Y++W +R Sbjct: 1111 EFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQ--SSPEAPPLLRKIVSQVVVYNLWRQR 1168 Query: 1603 N 1605 N Sbjct: 1169 N 1169 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 319 bits (818), Expect = 3e-84 Identities = 186/553 (33%), Positives = 277/553 (50%), Gaps = 9/553 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FS+ VNG AG+FRS RG+RQG +SPYLF I+M+VLS + CK G Sbjct: 359 FSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLG 418 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL++ +++ + ++ FA GL+I +K+ ++LAG+S + + Sbjct: 419 LTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMS 478 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 F G PV+YLGLPL++ RL S PLID + ++ W SR LS+AGR L+ SV Sbjct: 479 SRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSV 538 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 L S+ +W AF +P ++ I L SG + K + +SW +I +P +EGGL ++ Sbjct: 539 LWSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQ 598 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894 ++ NK LKLIW+L++ +DSLWV+W LK E RR+ K R Sbjct: 599 SLREANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHR 658 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGW- 1071 ++AK V NG +TSFW D W G L I +GI R H + + W Sbjct: 659 EVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISR-----HMTLAEAWS 713 Query: 1072 --RFPYSRIPLIREIWNQCSSLYCLPTLE-EDEVVWKA---TTSGLFSTKSAWEITRFKH 1233 R R+ ++ E Y +E ED ++W+ FSTK W R Sbjct: 714 RRRRKRHRVEILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSS 773 Query: 1234 PKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHL 1413 + +W K +WF PK + W ++ T ++ ++C+FC + ET HL Sbjct: 774 NQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHL 833 Query: 1414 FFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVW 1593 FF C +S +W+ IA+ + F W A + I D + I S + + FQ SI+ +W Sbjct: 834 FFQCCYSSEIWTSIAKNVYKDRFSTKWSAVVNYI-SDSQPDRIQSFLSRYTFQVSIHSIW 892 Query: 1594 NERNRRRFQSRGR 1632 ERN RR + R Sbjct: 893 RERNSRRHGEKSR 905 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 313 bits (803), Expect = 1e-82 Identities = 187/549 (34%), Positives = 287/549 (52%), Gaps = 11/549 (2%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AGFF S RG+RQG +SPYLF I M VLS + + ++ C+ G Sbjct: 935 FSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIG 994 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL+VF+ ++ + + + FA SGL+I+ +KS I+LAG+S++ + Sbjct: 995 LTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTL 1054 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + F G PV+YLGLPL++ ++ + PLI+ +++K+ SW +RSLS+AGR LL SV Sbjct: 1055 SSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSV 1114 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 + S+ +W A+ +P+ ++E + FL SG + K + I+W I +P +EGGL I+ Sbjct: 1115 IVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIK 1174 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894 + NK LKLIW+L++ + SLWV W+ + KG +++ K R Sbjct: 1175 SLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYR 1234 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGI----DRNALVAHFRIN 1062 +LAK V NG STSFW D W + G L I LGI + ++ + Sbjct: 1235 ELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHR 1294 Query: 1063 DGWRFPYSRIPL-IREIWNQCSSLYCLPTLEEDEVVWKATTSGL---FSTKSAWEITRFK 1230 Y+RI I+ + Q D +W++ + F TK W R Sbjct: 1295 QHRAAIYNRINAEIQRLQQQERE------AGPDISLWRSLKNDFNKRFITKVTWNNVRTH 1348 Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKH 1410 P+ +W K +WFP PK++ L+W ++ T ++K +C C EET H Sbjct: 1349 QPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDH 1408 Query: 1411 LFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSI-VVKLAFQASIYH 1587 LFFSC ++ VW + Q+ ++ + W L+ S+ D + + + FQASIYH Sbjct: 1409 LFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLCT--SNLPRDHLFLFRYVFQASIYH 1466 Query: 1588 VWNERNRRR 1614 +W ERN RR Sbjct: 1467 IWRERNARR 1475 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 312 bits (799), Expect = 4e-82 Identities = 187/544 (34%), Positives = 286/544 (52%), Gaps = 6/544 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG +GFFRS+RG+RQG +SPYL+ I M VLS + + + C+ Sbjct: 785 FSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMN 844 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADD++VF +S ++ L + FA S L+I+ +KS IF+AGIS N + S++ Sbjct: 845 LTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSIL 904 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + F G PVKYLGLPL++ R+ S PL++ + +++ SW +R LS+AGR L++SV Sbjct: 905 QQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSV 964 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 L S+ +W F +P + ++E + FL SG + K + I+W ++ + EEGGL ++ Sbjct: 965 LSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLK 1024 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894 K+ N+ LLKLIW++++ +DSLWV+WV+ ++ E R+I K R Sbjct: 1025 PLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQR 1084 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 D A+ V +G TSFW D W G L + I LGI NA VA + + R Sbjct: 1085 DKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVA--EVMNTHR 1142 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKA---TTSGLFSTKSAWEITRFKHPKCS 1245 R + +I +Q + + D +WK T FS+ W+ R +C Sbjct: 1143 RKRHRADFLNQIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSSKTWQQIRSISLRCD 1202 Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425 W + +WF PK++ + W ++ T K+ C+FC ET HLFFSC Sbjct: 1203 WYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHLFFSC 1262 Query: 1426 HFSLGVWSLIAQKCFNSTFHASWE-ASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602 +S VW + + N +W + L+D S + ++ AFQASI+ +W ER Sbjct: 1263 PYSSHVWFSLTKGLLNGRNILNWNLITPHLLDS--SRPYLHVFTLRYAFQASIHSLWRER 1320 Query: 1603 NRRR 1614 N RR Sbjct: 1321 NCRR 1324 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca] Length = 958 Score = 298 bits (764), Expect = 5e-78 Identities = 177/583 (30%), Positives = 270/583 (46%), Gaps = 2/583 (0%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLN-HCKPF 177 FSV VNG AGFF +RG+RQGDP+SPYLF IAMEVLS + + + C Sbjct: 459 FSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQL 518 Query: 178 GISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSL 357 +SHL FADDLL+F N++ +F S L+ N +S IFLAG+ N DS+ Sbjct: 519 NLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSV 578 Query: 358 VEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537 ++ + F G PV+YLG+PLI+++L + C PL+D +E++++SW+++ LS+AGR L++S Sbjct: 579 LQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQS 638 Query: 538 VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714 VL S+ +YW+ +P + +E +R FL +G+ S + ++W +I P EGGL I Sbjct: 639 VLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGI 698 Query: 715 RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894 + NKA ++ IW L+++ + W WV LKG R++ K+R Sbjct: 699 KDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIR 758 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 +L I+G+GR+TS W D WH G L L I G+ ++A++ Sbjct: 759 ELCCSFFVNIIGDGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAML---------- 808 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGLFSTKSAWEITRFKHPKCSWAK 1254 T +G +ST SAW R W + Sbjct: 809 ----------------------------------TPNGFYSTSSAWNTLRPSRFIVPWYR 834 Query: 1255 LIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSCHFS 1434 L+WF ET HLFF C +S Sbjct: 835 LVWFV-----------------------------------------AETHNHLFFDCAYS 853 Query: 1435 LGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNERNRRR 1614 G+W+ + KC S W + + ++ NS+ +++KLA QA +Y +W ERN RR Sbjct: 854 FGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRR 913 Query: 1615 FQSRGRXXXXXXXXXXXXVMSKLKNVSSNLSTSNRFLAENWGL 1743 F++ + L + + SN ++ W L Sbjct: 914 FRNESLPPAVVFKGIVESIRLCLLSWKIPHTPSNAYIFHEWRL 956 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 295 bits (754), Expect = 7e-77 Identities = 180/545 (33%), Positives = 267/545 (48%), Gaps = 8/545 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FS+ V+GS G+F+ +G+RQGDP+SP LF IAME+LS + +++ G + Sbjct: 631 FSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVR 690 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGI-SSNLEDSL 357 IS L FADDL++F +++L ++ SF SGLE+N +KS ++ AG+ ++ ED+L Sbjct: 691 ISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL 750 Query: 358 VEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537 A GF+ G FP +YLGLPL+ +L S LID + ++ W +++LS+AGR L+ S Sbjct: 751 --AFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISS 808 Query: 538 VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKHKFSL-ISWKDIARPLEEGGLNI 714 V+ S +W +F +P +E + RFL + + +SW++ P EGGL + Sbjct: 809 VIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGL 868 Query: 715 RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894 R NK L+LIW L +DSLWV W H L+ + I LR Sbjct: 869 RNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLR 928 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 LAK + VGNG+ S+W D W N G L I GI +A+V + GW Sbjct: 929 PLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEASSSTGWI 988 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLE----EDEVVW--KATTSGLFSTKSAWEITRFKHP 1236 P +R + N S+L P ED W + ++S FS+K WE R + Sbjct: 989 LPSAR-TRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWECLRQRDT 1047 Query: 1237 KCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLF 1416 WA +W+ IPK+A W L ++P + H S C C ET HLF Sbjct: 1048 TKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETETRDHLF 1107 Query: 1417 FSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWN 1596 C +W + + S W+ + + + S + KLA Q +I+H+W Sbjct: 1108 IHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLS--NQGSFSGTLKKLAVQTAIFHIWK 1165 Query: 1597 ERNRR 1611 ERN R Sbjct: 1166 ERNSR 1170 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 291 bits (746), Expect = 6e-76 Identities = 173/483 (35%), Positives = 253/483 (52%), Gaps = 10/483 (2%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AGFF+S RG+RQG +SPYLF I M+VLS + + +G + HCK G Sbjct: 191 FSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYHPHCKRMG 250 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL++ ++ + + F+++SGL+I+ +KS IF AG+SS L Sbjct: 251 LTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLH 310 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 F G P++YLGLPL++ RL PLI+ + ++ SW SR LS+AGR L+ S+ Sbjct: 311 THFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSI 370 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLEEGGLNIR 717 + S +W AF +P + ++E + FL SG++ K + ISW + +P EGGL +R Sbjct: 371 IWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLR 430 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894 K+ N LKL+W++I++ DSLWV+WV LK E ++I K R Sbjct: 431 SLKEANDVCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYR 490 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGW- 1071 +AK VGNG STSFW D W G L I +GI R VA D W Sbjct: 491 GVAKRFCKAEVGNGESTSFWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVA-----DAWT 545 Query: 1072 --RFPYSRIPLIREIWNQCSSLYCLPTLEEDE--VVWKATT---SGLFSTKSAWEITRFK 1230 R + R ++ I S+ + T ++ + V+WK FSTK+ W R Sbjct: 546 SRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTT 605 Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKH 1410 + +W K +WFP PK++ +W ++ T ++ ++ C FC G ET H Sbjct: 606 SNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDH 665 Query: 1411 LFF 1419 LFF Sbjct: 666 LFF 668 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 273 bits (697), Expect = 3e-70 Identities = 163/484 (33%), Positives = 243/484 (50%), Gaps = 6/484 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV++NG AG F S +G+RQGDP+SPYLF +AMEV S + + G++ Sbjct: 529 FSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLE 588 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ISHLMFADD+++F S++L+ + + FA +SGL +N K+ ++ AG+S + DS+ Sbjct: 589 ISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMA 648 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + GF G PV+YLGLPL+S +L I+ PLI+ + ++ SW R LS+AGR LL SV Sbjct: 649 -SYGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASV 707 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717 + + +W +F +P K+ES+ RFL S K + ++W + P EGG+ +R Sbjct: 708 ISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLR 767 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894 N+ L++IW L +N SLWV W L K + + +LR Sbjct: 768 RFAVSNRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLR 827 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 +A+ I VGNGR SFW D W G L + L + NA ++ ++GW Sbjct: 828 VVAERFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKISDVCTSEGWS 887 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEE----DEVVWKATTSGLFSTKSAWEITRFKHPKC 1242 R + +++ ++ D VV G FS + W R Sbjct: 888 IADPRSDQALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQG-FSAAATWSALRPSSAPV 946 Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFS 1422 WA+ +WF PKHA +W L ++PT +L M ++C C ET HLF S Sbjct: 947 PWARAVWFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLHPETRDHLFLS 1006 Query: 1423 CHFS 1434 C F+ Sbjct: 1007 CDFA 1010 >gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490 [Arabidopsis thaliana] Length = 657 Score = 270 bits (689), Expect = 2e-69 Identities = 164/496 (33%), Positives = 251/496 (50%), Gaps = 13/496 (2%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 +S+ NG GFF ++GIRQGDP+S +LF + M++L+ + G L C Sbjct: 179 YSIAYNGELIGFFVGKKGIRQGDPMSSHLFVLVMDILARSLDLGAVEGRFVLHPKCLAPM 238 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 I+HL FADD+LVF S ++L A L ++ F + SGL IN +K+ + L G + + Sbjct: 239 ITHLSFADDILVFCDGSLSSLVAILDILDVFKKGSGLGINLQKTALLLDGGNFERNRIMA 298 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + G +G PV+YLG+PL+S +++ QPL+D + S+ SW +R LS+AGR LL+SV Sbjct: 299 ASLGVSQGSLPVRYLGVPLMSQKMKKHDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSV 358 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717 + S +W+ F +P+ KLE + FL SG+ + + + ISW + E GGL ++ Sbjct: 359 IYSTINFWASIFILPNQCLHKLEQMCNAFLWSGAPNSAREAKISWDIVCSSKESGGLGLK 418 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLRD 897 NK LKLIW L T SLWV WV R++ KLR+ Sbjct: 419 RLSSWNKVLALKLIWLLFTASGSLWVSWVR-------------------WVWRKLCKLRE 459 Query: 898 LAKDCICTIVGNGRSTSFWVDIWHNHGVL----ALCIPELIQISLGIDRNALVAHFRIND 1065 +A+ + VG+G + FW D W HG L L P+L+ +S+ ++V ND Sbjct: 460 VARPFVICEVGSGITARFWQDNWTGHGPLIHLTGLTGPQLVGLSI----TSVVRDAIRND 515 Query: 1066 GWRFPYSR-----IPLIREIWNQCSSLYCLPTLEEDEVVWKA---TTSGLFSTKSAWEIT 1221 W SR I L++ + +L + +D +WK S FST W Sbjct: 516 DWWIASSRSRNPVILLLKSLLPPVGNL--VDCEHDDSYLWKVGDRVPSSKFSTADTWRAL 573 Query: 1222 RFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEET 1401 + SW K +WF +PKHA + W ++ T +L+ ++ + C+ C +ET Sbjct: 574 QPFSVSVSWHKAVWFTNQVPKHAFISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDET 633 Query: 1402 EKHLFFSCHFSLGVWS 1449 HLFF+C FS +W+ Sbjct: 634 RDHLFFACRFSSRIWT 649 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 262 bits (670), Expect = 4e-67 Identities = 153/454 (33%), Positives = 238/454 (52%), Gaps = 6/454 (1%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AGFF S+RG+RQG +SPYLF I M VLS + + ++ CK Sbjct: 211 FSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLS 270 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL+VF+ ++ + + + FA SGL I+ +KS ++LAG+S ++++ Sbjct: 271 LTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNIL 330 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 A F G PV+YLGLPL++ ++ + PL+D + SK+ SW +RSLS+AGR L+ SV Sbjct: 331 SAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSV 390 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717 + SL +W A+ +P+ ++E + FL SG + K + I+W + + +EGGL I+ Sbjct: 391 IVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIK 450 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894 + NK LKLIW+L++ + SLWV WV + KG +++ K R Sbjct: 451 SLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLKYR 510 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 D+AK + +G STSFW D W G L I +GI A VA + R Sbjct: 511 DVAKSMCKVEIKSGSSTSFWYDNWSQLGQLVDVTNARRTIDMGIPLAATVA--TVLASHR 568 Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEV-VWKATTSGL---FSTKSAWEITRFKHPKC 1242 + R + +I + S+ ++ +W+++ F TK W R H Sbjct: 569 TKHHRTAIYNKIEAEIQSILQRERSGAPDIFLWRSSGDNFRQSFITKVTWHNIRVIHTHR 628 Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLK 1344 W K +WF PK++ L+W ++ T ++K Sbjct: 629 QWYKGVWFSYNTPKYSFLLWLAIHDRLSTGDRIK 662 >emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera] Length = 1848 Score = 246 bits (627), Expect = 4e-62 Identities = 189/577 (32%), Positives = 268/577 (46%), Gaps = 37/577 (6%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSVL+NG+P GFF+S RG+RQGDP+SPYLF I MEV SS + G+ ++ C+ G Sbjct: 1252 FSVLINGTPKGFFQSSRGLRQGDPLSPYLFVIXMEVFSSFLNRAVDNGY---ISGCQVKG 1308 Query: 181 -------ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISS 339 ISHL+FADD LVF QAS + L L+ F SG+ IN KS + G Sbjct: 1309 RNEGGIQISHLLFADDTLVFCQASQDQLTYLSWLLMWFEAXSGMRINLDKSELIPVGRVV 1368 Query: 340 NLEDSLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGR 519 +++D ++ G G P YLGLPL + ++ + + +L WK + LS GR Sbjct: 1369 DIDDLALD-FGCKVGSLPSTYLGLPLGAPFKSVAMWDGVEERFRKRLTMWKRQYLSKGGR 1427 Query: 520 TVLLRSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLE 696 L+RS L +L IY+ +PSS+ +LE I R FL G S + K L+ WK + + Sbjct: 1428 ATLIRSTLSNLPIYYMSVLRLPSSVRSRLEQIQRDFLWGGGSLERKPHLVRWKVVCLSKK 1487 Query: 697 EGGLNIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRY--LKGEXXXXXXXXXXXXXX 870 +GGL I+C ++NKA L K W+ +++LW + + G+Y +G Sbjct: 1488 KGGLGIKCLSNLNKALLSKWNWRYANEREALWNQVIRGKYGEDRGGWSTREVREAHGVGL 1547 Query: 871 XRRIFKLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAH 1050 + I DL I VGNGR SFW D W L P + +L I++ A VA Sbjct: 1548 WKGIRMDWDLVGARISFSVGNGRRVSFWRDRWCGXAPLCDSFPSI--YALSIEKEAWVA- 1604 Query: 1051 FRINDGWRFPYSRIPLI---REIWNQCSS--------------LYCLPTL-----EEDEV 1164 D W PL+ R WN C S L CL E+D+V Sbjct: 1605 ----DVWD------PLVQGGRGGWNPCFSRALNDWEMEEAELFLGCLHGKRVIGDEDDKV 1654 Query: 1165 VWKATTSGLFSTKSAWEITRFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLK 1344 VW T SG+FS KS + P + IW + PK + W K T+ ++ Sbjct: 1655 VWTETKSGIFSAKSLYLALEADCPSSFPSSCIWKVWVQPKISFFAWEAAWGKALTLDLVQ 1714 Query: 1345 HLKMVDSSSCIFCWTGEETEKHLFFSCHFSLGVWSLIAQKCFNSTFHASW--EASLRLID 1518 ++ C C EET HL C + +W L+ S F SW S+R Sbjct: 1715 RRGWSLANRCYMCMEKEETIDHLLLHCSKTRVLWELLF-----SLFGVSWVMPCSVRETL 1769 Query: 1519 KDFSSNSIDSIVVKLAFQASI---YHVWNERNRRRFQ 1620 + ++S+ K+ A + + VW RNR F+ Sbjct: 1770 LSWQTSSVGKKHRKVWRAAPLHIFWTVWKARNRLAFK 1806 >emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1381 Score = 241 bits (616), Expect = 7e-61 Identities = 197/695 (28%), Positives = 306/695 (44%), Gaps = 40/695 (5%) Frame = +1 Query: 4 SVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKP-FG 180 S+L+NGSP + RG+RQGDP+SP+LF + +E L+ + K + L + C+ Sbjct: 613 SILINGSPTPPIKLHRGLRQGDPLSPFLFDLVVEPLNLLIKKAVSLKLWDGIETCRNGLR 672 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 I+HL +ADD ++F L+ + F SGL++N KS++ + NL + Sbjct: 673 ITHLQYADDTIIFCPPKLEFLSNIKKTLILFQLASGLQVNFHKSSLLGVNVHENLLNDFA 732 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 + G P YLGLP+ +S P+I +E KL SWKS LS GR L+++ Sbjct: 733 KHLLCKVGKLPFTYLGLPIGGNITRLSLWDPVISKLEKKLASWKSNLLSIGGRLTLIKAC 792 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSG-SSKHKFSLISWKDIARPLEEGGLNIR 717 L +L +Y+ F IP + K+ +I RRFL SG SSK L+SW IA P GGL + Sbjct: 793 LSNLPLYYMSLFPIPKGVLGKIVAIQRRFLWSGNSSKKGMPLVSWDLIALPKHLGGLGLG 852 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRY-LKGEXXXXXXXXXXXXXXXRRIF--- 885 N A L K IW+ + +LW + VHG+Y LK I Sbjct: 853 NLHHKNTALLFKWIWRFLNEPHALWRQVVHGKYGLKDSFTTRDLSLSSYGGPWNGICNAI 912 Query: 886 ----KLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHF 1053 + + LA + +G+G +T FW D+W L P L ++SL D + F Sbjct: 913 LKSPQAKKLAFHQVRVQIGDGSNTLFWHDVWVGANPLKTECPRLFRLSLQQDAYVSLCGF 972 Query: 1054 RINDGWRFP--YSRIPLIREIWNQCSSL-----YCLPTLEEDEVVWKATTSGLFSTKS-A 1209 WR+ +SR R++ Q + L L +D ++W + SG+FS KS + Sbjct: 973 WDGLCWRWSLLWSRPLRQRDLHEQATLLNIINRAVLQKDGKDHLIWAPSKSGIFSVKSFS 1032 Query: 1210 WEITRFKHPKCSWAKLIWFPQMIPKHASL-VWRLCLKKIPTMHKLKHLKMV--DSSSCIF 1380 E+ + + A + ++P + VW + L ++ T KL +LK++ + SSCIF Sbjct: 1033 LELANMEESRSFEATKELWKGLVPFRIEIFVWFVILGRLNTKEKLLNLKLISNEDSSCIF 1092 Query: 1381 CWTGEETEKHLFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVK 1560 C + E+ HLF C +S +W Q ++ +W + K+ ++ I K Sbjct: 1093 CSSSIESTNHLFLECSYSKELWHWWFQ-----IWNVAWVLPSSI--KELFTHWIPPFKGK 1145 Query: 1561 L-------AFQASIYHVWNERNRRRFQSRGRXXXXXXXXXXXXVMSKLKNVSSNLSTS-- 1713 F ++ +W ERN R FQ + + +K + S Sbjct: 1146 FFKKVWMSCFFIILWTIWKERNSRIFQEKPNSKLQLKELILLRLGWWIKGWNEPFPYSAE 1205 Query: 1714 ---NRFLAENWGLPLKLNVEFLEV----KWKPP---EREWQLACDGSFSSSRASCGGLLR 1863 L NW P+K + W PP +W + S ++S GG+LR Sbjct: 1206 DIVRNPLCLNWLTPVKPQKAIMPAPFPQHWSPPSIGSLKWNVDASIKSSLQKSSIGGVLR 1265 Query: 1864 SKKGELKLAFHSDCQIESSLRSEVKGLLFGLRIVA 1968 KG F S +EV + L+I A Sbjct: 1266 DHKGNFICMFSSPIPFMEINNAEVLAIHRALKISA 1300 >gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 238 bits (606), Expect = 1e-59 Identities = 143/449 (31%), Positives = 222/449 (49%), Gaps = 11/449 (2%) Frame = +1 Query: 100 MEVLSSIFKHELMLGHLQLLNHCKPFGISHLMFADDLLVFLQASSNNLNAFLGLMRSFAE 279 M+VLS + CK G++HL FADDL+V ++ + + +FA+ Sbjct: 1 MDVLSKLLDKAAGQRKFGYHPRCKQIGLTHLSFADDLMVLSDGKVRSIEGIVDVFDTFAK 60 Query: 280 YSGLEINKKKSNIFLAGISSNLEDSLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLI 459 S L+I+ +KS ++LAG+S +++ F G PV+YLGLPL++ + + PLI Sbjct: 61 CSDLKISMEKSTVYLAGLSHTTRQEVIDRFSFAVGTLPVRYLGLPLVTKQFSSTDYLPLI 120 Query: 460 DLMESKLQSWKSRSLSWAGRTVLLRSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSG 639 D ++ K+ SW +R LS+ GR L+ S+L S+ +W GAF +P +++ + +L SG Sbjct: 121 DHIKQKICSWSARFLSYTGRLNLISSILWSICNFWMGAFRLPRDCIREIDKMCSAYLWSG 180 Query: 640 ----SSKHKFSLISWKDIARPLEEGGLNIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVH 807 +SK K I+W + +P EEGGL +R K+ N LKLIW++I++ DSLWV+W+ Sbjct: 181 GELNTSKAK---IAWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQ 237 Query: 808 GRYLKGE-XXXXXXXXXXXXXXXRRIFKLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVL 984 LK R+I K RD+A+ + NG TSFW D W + G L Sbjct: 238 SSLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGAQTSFWYDDWSDLGRL 297 Query: 985 ALCIPELIQISLGIDRNALVAHFRINDGW---RFPYSRIPLIREIWNQCSSLYCLPTLEE 1155 + I LGI+++A V + W R R + + + + E Sbjct: 298 IESAGDRGAIDLGINKHATVV-----EAWGNRRRRRHRANFLNRVEERLVLSWNSRNQAE 352 Query: 1156 DEVVWKATTS---GLFSTKSAWEITRFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIP 1326 D +WK + +FSTK W R K +W K +WF Q IPKHA +W ++ Sbjct: 353 DCALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLS 412 Query: 1327 TMHKLKHLKMVDSSSCIFCWTGEETEKHL 1413 T ++ M ++CI C E+ HL Sbjct: 413 TGDRMTLWNMGVDATCILCNNALESRDHL 441 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 237 bits (605), Expect = 1e-59 Identities = 191/678 (28%), Positives = 307/678 (45%), Gaps = 20/678 (2%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCK--P 174 FS+L+NG G+F+S+RG+RQGD ISP LF +A E LS + L++ P Sbjct: 1499 FSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLN--ALYDQYPSLHYSSGVP 1556 Query: 175 FGISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFL-AGISSNLED 351 +SHL FADD+L+F S + L L ++ + E SG IN +KS I ++ Sbjct: 1557 LSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQ 1616 Query: 352 SLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLL 531 + +A+GF L P+ YLG PL ++ L+ +E ++ W+++ LS GR LL Sbjct: 1617 IIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLL 1676 Query: 532 RSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKHK-FSLISWKDIARPLEEGGL 708 RSVL SL IY P + ++ + FL GS+ K SW IA P+ EGGL Sbjct: 1677 RSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGL 1736 Query: 709 NIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFK 888 +IR ++ +A +KL W+ T DSLW R++ +Y +G+ +R+ Sbjct: 1737 DIRSLAEVFEAFSMKLWWRFRTT-DSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLT 1795 Query: 889 LRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDG 1068 + + + VG G + FW D W L E + V F N+ Sbjct: 1796 SSTITEQHMRWRVGQG-NVFFWHDCWMGEAPLISSNQEFTSSMV------QVCDFFTNNS 1848 Query: 1069 WRFPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGLFSTKSAWEITRFKHPKCSW 1248 W + L +E+ ++ + + + T+ +DE W T +G FSTKSAW++ R + Sbjct: 1849 WNIEKLKTVLQQEVVDEIAKI-PIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPV 1907 Query: 1249 AKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSCH 1428 IW + + +WRL IP K+K + +S C C EE+ H+ + Sbjct: 1908 FNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRC-RCCKSEESIMHVMWDNP 1966 Query: 1429 FSLGVWS--------LIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIY 1584 ++ VW+ LI C + +W D+ +V L ++ Sbjct: 1967 VAMQVWNYFAKLFQILIINPCTINQIIGAW-----FYSGDYCKPGHIRTLVPLFI---LW 2018 Query: 1585 HVWNERNRRRFQSRGRXXXXXXXXXXXXV--MSKLKNVSSNLSTSNRFLAENWGLPLKLN 1758 +W ERN + ++ G + +S + + ++ +A+ WG+ + Sbjct: 2019 FLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQ-- 2076 Query: 1759 VEFLE----VKW-KPPEREWQLACDGSFSSS-RASCGGLLRSKKGELKLAFHSDCQIESS 1920 E L W KP E++L DGS S A+ GG+LR GE+ F + ++S Sbjct: 2077 AESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQNS 2136 Query: 1921 LRSEVKGLLFGLRIVADY 1974 L++E+ L GL + DY Sbjct: 2137 LQAELLALYRGLILCRDY 2154 >dbj|BAB01344.1| reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1115 Score = 237 bits (605), Expect = 1e-59 Identities = 146/395 (36%), Positives = 208/395 (52%), Gaps = 3/395 (0%) Frame = +1 Query: 1 FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180 FSV VNG AG+FRS RGIRQG +SPYLF I+MEVLS + CK G Sbjct: 555 FSVQVNGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGAKRFGFHPKCKNLG 614 Query: 181 ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360 ++HL FADDL++ +++ + +M FA+ SGL+IN +K+ ++ AG+S + ++ Sbjct: 615 LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLKINMEKTTLYTAGVSDHNRHMMI 674 Query: 361 EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540 F PV+YLGLPL++ RL PL + + +++ +W SR LS+AGR L+ SV Sbjct: 675 SRYPFGLAQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSV 734 Query: 541 LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLEEGGLNIR 717 L S +W AF +PS+ ++ SI FL SG + + +SW DI +P ++GGL +R Sbjct: 735 LWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELNRRKAKVSWDDICKP-KQGGLGLR 793 Query: 718 CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894 + N +LKLIW++ +N DSLWV+W LK E +++ K R Sbjct: 794 SLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLKPNSSLGSWMWKKMLKYR 853 Query: 895 DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074 + AK V NG TSFW D W G L + QI LGI RN VA N R Sbjct: 854 ETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNR--R 911 Query: 1075 FPYSRIPLIREIWNQCSSLY-CLPTLEEDEVVWKA 1176 R + +I + Y L ED +W+A Sbjct: 912 RRKHRTEQLNDIEAALNQKYQTRILLREDAALWRA 946