BLASTX nr result
ID: Akebia27_contig00030210
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00030210
         (1321 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   235   4e-59
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   217   8e-54
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              216   1e-53
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   213   2e-52
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   213   2e-52
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   212   3e-52
ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, part...   211   8e-52
gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza ...   211   8e-52
gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indi...   210   1e-51
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   209   3e-51
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   208   5e-51
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...   207   6e-51
emb|CAN68838.1| hypothetical protein VITISV_030956 [Vitis vinifera]   207   6e-51
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   207   8e-51
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   206   1e-50
ref|XP_004298219.1| PREDICTED: uncharacterized protein LOC101304...   205   4e-50
ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247...   205   4e-50
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   205   4e-50
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   203   2e-49
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   202   2e-49

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 235 bits (599), Expect = 4e-59
 Identities = 138/441 (31%), Positives = 225/441 (51%), Gaps = 5/441 (1%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIA---TTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172
            VYA      R  LW +++ +A   TT   PW++LGD N +L+  +   G       +E+F
Sbjct: 108  VYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEF 167

Query: 173  RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352
            RECL  + + DL     H TW N Q    I  K+DR+LVN+ W+     ++ SF     S
Sbjct: 168  RECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFS 227

Query: 353  DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWE-IKVSGNPMFKLIMKLKNV 529
            DH P+ V IS +     +PFK  NF     EF+  ++  W+ +   G+ MF L  K K +
Sbjct: 228  DHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFL 287

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
            K  ++T+++  +  ++  +     +L   Q N+  +P +  LA  E+   +  + L+  E
Sbjct: 288  KGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAGLEKEAHRSWAELALAE 346

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            E    QKSRV WLK GDS+T  FH+ M  RRA N+I  +  + G  + +  +++   +
Sbjct: 347  ERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDF 406

Query: 890  FKATFGTPAK-CNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGP 1066
            FK  FG+ +   +    SQ       + +E  R                F +  +K+ GP
Sbjct: 407  FKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGP 466

Query: 1067 DGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPIS 1246
            DG+++ FF++ W I+    + A++    +G++  + N+TA+T++ K     R+ EFRPIS
Sbjct: 467  DGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPIS 526

Query: 1247 CCNVVCKAISKVLANRLKPLL 1309
            CCN + K ISK+LA RL+ +L
Sbjct: 527  CCNAIYKVISKLLARRLENIL 547

>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score = 217 bits (553), Expect = 8e-54
 Identities = 139/448 (31%), Positives = 223/448 (49%), Gaps = 8/448 (1%)
 Frame = +2

Query:
2 VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDE--KIEGRMPSKNSIE 166 VY R LW DI ++ T + PW++LGD N E I + + +E Sbjct: 107 VYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGME 166 Query: 167 DFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSG 346 D + CL +++L DL + TW N Q + I+ KLDR L N EW F A + F P G Sbjct: 167 DLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG 226 Query: 347 LSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIK-VSGNPMFKLIMKLK 523 SDH+P ++ I + + FK+F+F + ++ + AWE + G+ MF L LK Sbjct: 227 DSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLK 286 Query: 524 NVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK 703 KL +T ++ +F N+ + L ++Q+ + SP + L E K+ F + Sbjct: 287 VAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAA 345 Query: 704 CEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVI 883 ES +QKSR+ WL GD++TR FH+++ +A N I ++ ++G + + I+ +I Sbjct: 346 ALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLI 405 Query: 884 RHFKATFGTPAKCNTSVFS--Q*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKA 1057 ++ G P++ N + FS + + L + + F M +KA Sbjct: 406 AYYSHLLGIPSE-NVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKA 464 Query: 1058 LGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFR 1237 GPDGF FF AW I+ + AI+ +G + R NATAITLI KV RL +FR Sbjct: 465 PGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFR 524 Query: 1238 PISCCNVVCKAISKVLANRLKPLLHKLV 1321 P++CC + K I+++++ RLK + + V Sbjct: 525 PVACCTTIYKVITRIISRRLKLFIDQAV 552 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 216 bits (551), Expect = 1e-53 Identities = 139/448 (31%), Positives = 223/448 (49%), Gaps = 8/448 (1%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDE--KIEGRMPSKNSIE 166 VY R LW DI ++ T + PW++LGD N E I + + +E Sbjct: 150 VYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGME 209 Query: 167 DFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSG 346 D + CL +++L DL + TW N Q + I+ KLDR L N EW F A + F P G 
Sbjct: 210 DLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG 269 Query: 347 LSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIK-VSGNPMFKLIMKLK 523 SDH+P ++ I + + FK+F+F + ++ + AWE + G+ MF L LK Sbjct: 270 DSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLK 329 Query: 524 NVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK 703 KL +T ++ +F N+ + L ++Q+ + SP + L E K+ F + Sbjct: 330 VAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAA 388 Query: 704 CEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVI 883 ES +QKSR+ WL GD++TR FH+++ +A N I ++ ++G + + I+ +I Sbjct: 389 ALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLI 448 Query: 884 RHFKATFGTPAKCNTSVFS--Q*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKA 1057 ++ G P++ N + FS + + L + + F M +KA Sbjct: 449 AYYSHLLGIPSE-NVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKA 507 Query: 1058 LGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFR 1237 GPDGF FF AW I+ + AI+ +G + R NATAITLI KV RL +FR Sbjct: 508 PGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFR 567 Query: 1238 PISCCNVVCKAISKVLANRLKPLLHKLV 1321 P++CC + K I+++++ RLK + + V Sbjct: 568 PVACCTTIYKVITRIISRRLKLFIDQAV 595 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 213 bits (541), Expect = 2e-52 Identities = 135/449 (30%), Positives = 235/449 (52%), Gaps = 9/449 (2%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172 VYAS + R ELW ++ +A + V W+VLGD N LN + I + K I F Sbjct: 109 VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAF 166 Query: 173 RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352 R CL ++ L DL + TW N+ + K+DR+LVN+ W F A+++F S Sbjct: 167 RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFS 226 Query: 353 DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNV 529 DHS V + RPF+FFN++ + +F+ +++E W VSG+ M+++ KLK++ Sbjct: 227 DHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHL 286 
Query: 530 KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709 KL + +S+ + +++ + + + + + Q +P + A E ++ L+K E Sbjct: 287 KLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAE 345 Query: 710 ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889 ES QKS ++WL GD++T FH+ R++ N I + + GE + + I+ E I+ Sbjct: 346 ESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIK-EGIKE 404 Query: 890 FKATFGTPAKCNTS-VFSQ*RDDLHHQLN---EEDRMXXXXXXXXXXXXXXXFQMMP-DK 1054 F C S + D++ L+ D++ F +P +K Sbjct: 405 HSCNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNK 464 Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234 A GPDG+S+ FF+ W ++ + +A++ +G++ ++ NAT + LI K+ S++ +F Sbjct: 465 ASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDF 524 Query: 1235 RPISCCNVVCKAISKVLANRLKPLLHKLV 1321 RPISC N + K I+K+L +RLK LL++++ Sbjct: 525 RPISCLNTLYKVIAKLLTSRLKKLLNEVI 553 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 213 bits (541), Expect = 2e-52 Identities = 135/449 (30%), Positives = 235/449 (52%), Gaps = 9/449 (2%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172 VYAS + R ELW ++ +A + V W+VLGD N LN + I + K I F Sbjct: 109 VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAF 166 Query: 173 RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352 R CL ++ L DL + TW N+ + K+DR+LVN+ W F A+++F S Sbjct: 167 RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFS 226 Query: 353 DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNV 529 DHS V + RPF+FFN++ + +F+ +++E W VSG+ M+++ KLK++ Sbjct: 227 DHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHL 286 Query: 530 KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709 KL + +S+ + +++ + + + + + Q +P + A E ++ L+K E Sbjct: 287 KLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAE 345 Query: 710 
ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889 ES QKS ++WL GD++T FH+ R++ N I + + GE + + I+ E I+ Sbjct: 346 ESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIK-EGIKE 404 Query: 890 FKATFGTPAKCNTS-VFSQ*RDDLHHQLN---EEDRMXXXXXXXXXXXXXXXFQMMP-DK 1054 F C S + D++ L+ D++ F +P +K Sbjct: 405 HSCNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNK 464 Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234 A GPDG+S+ FF+ W ++ + +A++ +G++ ++ NAT + LI K+ S++ +F Sbjct: 465 ASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDF 524 Query: 1235 RPISCCNVVCKAISKVLANRLKPLLHKLV 1321 RPISC N + K I+K+L +RLK LL++++ Sbjct: 525 RPISCLNTLYKVIAKLLTSRLKKLLNEVI 553 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 212 bits (539), Expect = 3e-52 Identities = 138/449 (30%), Positives = 222/449 (49%), Gaps = 9/449 (2%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMT---VPWMVLGDMNVTLNHDEKIEGRMPSKNS-IED 169 VYA+ +A R ELW ++ ++ +++ PW++LGD N L E + + N ++ Sbjct: 58 VYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVNRRMKV 117 Query: 170 FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGL 349 FR+CLFEA L DL + TW N+ + KLDR+LVN W F A++ F Sbjct: 118 FRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYAVFGEPDF 177 Query: 350 SDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKN 526 SDH+ V I+ RPF+F+NF + +F+++V E W I V G+ MFK+ KLK Sbjct: 178 SDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSKKLKA 237 Query: 527 VKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKC 706 +K ++T+S F N++ +K+ + + Q P + A + A K + L K Sbjct: 238 LKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAALEMEAQRKWL-ILVKA 296 Query: 707 EESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIR 886 EES Q+SRV W+ GDS+T FH+ R+A N I I +NG + + I+ I Sbjct: 297 EESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIE 356 Query: 887 HFKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGP 1066 +F G + L + 
+ + + F +K GP Sbjct: 357 YFSNLLGGEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGP 416 Query: 1067 DGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPIS 1246 DGF FF+ W +I + A+ + + ++ NAT + LI K+ S++N+FRPIS Sbjct: 417 DGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNASKMNDFRPIS 476 Query: 1247 CCN----VVCKAISKVLANRLKPLLHKLV 1321 C + + K I+++L NRL+ LL +++ Sbjct: 477 CNDFGPITLYKVIARLLTNRLQCLLSQVI 505 >ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica] gi|462421129|gb|EMJ25392.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica] Length = 916 Score = 211 bits (536), Expect = 8e-52 Identities = 134/425 (31%), Positives = 203/425 (47%), Gaps = 4/425 (0%) Frame = +2 Query: 59 IATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFRECLFEARLQDLKAERCHLTWC 238 + T +PW+ GD N L DEK+ GR + + FR+ + +D+ TW Sbjct: 1 LEATNYLPWLCCGDFNEILRADEKLGGRRRREGQMLGFRQAIDTCGFKDMGYTGPKYTWW 60 Query: 239 -NRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSPAVVTISKKRKICGRPFK 415 N +E RI +LDRVL +W F L SDH P VTIS++ + GR K Sbjct: 61 RNNPMEIRI--RLDRVLATADWCSRFLGTKVIHLNPTKSDHLPLKVTISERMLLNGRRKK 118 Query: 416 FFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDLKTWSKNKFGNMDNSI 586 F F WA+ M +Q+ W+ G+ F KLK + L WSK FG++ N I Sbjct: 119 LFRFEEMWAEHVNCMQTIQDGWQRTCRGSAPFTTTEKLKCTRHQLLGWSKCNFGHLPNQI 178 Query: 587 KDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESAAKQKSRVNWLKLGDSH 766 K + L L ++P + A+ K++ L E +Q+SR WLK GD + Sbjct: 179 KITREKLGELL----DAPPSHHTVELRNALTKQLDSLMAKNEVYWRQRSRATWLKAGDRN 234 Query: 767 TRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKATFGTPAKCNTSVFSQ* 946 ++ FH R RN I++++ E+G + E + V+ +F+ F + +S +++ Sbjct: 235 SKFFHYKASSCRRRNTISALEDEHGHWQTTEQGLTQTVVNYFQHLFSS---IGSSDYTEV 291 Query: 947 RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFSACFFQRAWIIINRDFL 1126 D + ++ EE FQM P KA GPD FS F+Q+ W I+ D + Sbjct: 292 VDGVRGRVTEEMNQALLAEFTPEEIKIALFQMHPSKAPGPDDFSPFFYQKYWQIVGEDMV 351 Query: 1127 KAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNVVCKAISKVLANRLKPL 1306 A+ GK+ +++N T + LI KV P + + RPIS CNV 
K +KVLA LK + Sbjct: 352 AAVLHFFKTGKLLKKINFTHVALIPKVHEPKNMTQLRPISLCNVFNKIGAKVLATHLKAI 411 Query: 1307 LHKLV 1321 L L+ Sbjct: 412 LPTLI 416 >gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza sativa Japonica Group] Length = 1833 Score = 211 bits (536), Expect = 8e-52 Identities = 137/444 (30%), Positives = 220/444 (49%), Gaps = 4/444 (0%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 VY AQ R +W ++ I + PW+++GD N + E R S++ + DFRE Sbjct: 85 VYGEPRAQDRHLMWSLLRRIRSNSGDPWLMIGDFNEAMWQTEHKSHRKRSESQMRDFREV 144 Query: 182 LFEARLQDLKAERCHLTWCNRQIEGR-IMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358 L E L D+ + T+CN Q EGR + +LDR + + W F +A + L + SDH Sbjct: 145 LSECDLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDH 204 Query: 359 SPAVVTISKKRKICGRPFKFFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNV 529 +P + + ++ RP K + W +S V+QEAW + + + + K+K Sbjct: 205 APLL--LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVT 262 Query: 530 KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709 L +WSK+K GN+ IKDL+ L L+ NI D + ++ KE+ + E Sbjct: 263 MTKLVSWSKDKIGNVRKKIKDLREKLGELR-NI----GLLDTDNEVHSVKKELEEMLHRE 317 Query: 710 ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889 E KQ+SR+ WLK GD +TR FH R +NKI +K +G ++ +++ E+ R Sbjct: 318 EIWWKQRSRITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKNDGSTTMNKKEMK-EINRS 376 Query: 890 FKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069 F T V + H +++E+ FQ+ P KA GPD Sbjct: 377 FFQQLYTKDDNLNPV--NLLNMFHEKISEQMNADLIKPFTNEEISDALFQIGPLKAPGPD 434 Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249 GF A F QR W ++ + + A+++ + + VN T I +I K + + +FRPIS Sbjct: 435 GFPARFLQRNWGLLKGEVIAAVRNFFEDEVMQEGVNDTVIVMIPKKNLAEDMKDFRPISL 494 Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321 CNVV K ++K L NR++P+L +++ Sbjct: 495 CNVVYKVVAKCLVNRMRPMLQEII 518 >gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indica Group] Length = 1874 Score = 210 bits (534), Expect = 1e-51 Identities = 136/444 (30%), 
Positives = 220/444 (49%), Gaps = 4/444 (0%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 +Y AQ R +W ++ I + PW+++GD N + E R S++ + DFRE Sbjct: 60 LYGEPRAQDRHLMWSLLRRIRSNSGDPWLMIGDFNEAMWQTEHKSHRKRSESQMRDFREV 119 Query: 182 LFEARLQDLKAERCHLTWCNRQIEGR-IMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358 L E L D+ + T+CN Q EGR + +LDR + + W F +A + L + SDH Sbjct: 120 LSECDLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDH 179 Query: 359 SPAVVTISKKRKICGRPFKFFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNV 529 +P + + ++ RP K + W +S V+QEAW + + + + K+K Sbjct: 180 APLL--LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVT 237 Query: 530 KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709 L +WSK+K GN+ IKDL+ L L+ NI D + ++ KE+ + E Sbjct: 238 MTKLVSWSKDKIGNVRKKIKDLREKLGELR-NI----GLLDTDNEVHSVKKELEEMLHRE 292 Query: 710 ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889 E KQ+SR+ WLK GD +TR FH R +NKI +K +G ++ +++ E+ R Sbjct: 293 EIWWKQRSRITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKNDGSTTMNKKEMK-EISRS 351 Query: 890 FKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069 F T V + H +++E+ FQ+ P KA GPD Sbjct: 352 FFQQLYTKDDNLNPV--NLLNMFHEKISEQMNADLIKPFTDEEISDALFQIGPLKAPGPD 409 Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249 GF A F QR W ++ + + A+++ + + VN T I +I K + + +FRPIS Sbjct: 410 GFPARFLQRNWGLLKGEVIAAVRNFFEDEVMQEGVNDTVIVMIPKKNLAEDMKDFRPISL 469 Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321 CNVV K ++K L NR++P+L +++ Sbjct: 470 CNVVYKVVAKCLVNRMRPMLQEII 493 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 209 bits (531), Expect = 3e-51 Identities = 130/443 (29%), Positives = 215/443 (48%), Gaps = 3/443 (0%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 VYA R LW ++ +A M PW+V GD N+ L +E++ G P + SIEDF Sbjct: 951 
VYAKCTRSERTPLWNCLRNLAADMEGPWIVGGDFNIILKREERLYGADPHEGSIEDFASV 1010 Query: 182 LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361 L + L D E TW N R+ +LDR++ N +WI++F L SDH Sbjct: 1011 LLDCGLLDGGFEGNPFTWTNN----RMFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHC 1066 Query: 362 PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541 P +++ S + F+F + WA F V+ W + ++G+ + K K +K L Sbjct: 1067 PLLLSCSNSSEKAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHL 1126 Query: 542 KTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK---CEE 712 K W+K FG++ ++IK+ + + ++ + + K + L+K EE Sbjct: 1127 KWWNKTVFGDIFSNIKEAEKRVEECEI----LHQQEQTIGSRIQLNKSYAQLNKQLSMEE 1182 Query: 713 SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892 KQKS V W+ G+ +T+ FH M+++R R+ I I+ ++G ++ D ++ I F Sbjct: 1183 IFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFF 1242 Query: 893 KATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072 + + +T S + +++ D F + P+ A GPDG Sbjct: 1243 SSLLKAESCDDTRFQSSLCPSI---ISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDG 1299 Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252 FS+ F+Q+ W II D +A+K I + + +T + LI K S+ +EFRPIS C Sbjct: 1300 FSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLC 1359 Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321 V+ K I+K+LANRL +L ++ Sbjct: 1360 TVMNKIITKILANRLAKILPSII 1382 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 208 bits (529), Expect = 5e-51 Identities = 132/444 (29%), Positives = 218/444 (49%), Gaps = 13/444 (2%) Frame = +2 Query: 29 RWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMP--SKNSIEDFRECLFEA 193 R ELW D++ + + + PW++ GD N L+ +E R + + DF+ + Sbjct: 4 RKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHC 63 Query: 194 RLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSPAVV 373 + DL TW N++ I KLDRVLVN+ W+ F ++S F G SDH + Sbjct: 64 SITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRI 123 Query: 374 
TISKKRKIC---GRPFKFFNFWADDSEFMTVVQEAWE----IKVSGNPMFKLIMKLKNVK 532 ++ RPFKF N + F+ V+ W I +S + +F+ KLK +K Sbjct: 124 NLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLK 183 Query: 533 LDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEE 712 L+ K + GN+ K+ +L Q +P + + A K ++ EE Sbjct: 184 PLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKW-DHIAVLEE 242 Query: 713 SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892 KQ+S+++WL +GD + ++FH+++ R A+N I I +G + E I+ E HF Sbjct: 243 KFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHF 302 Query: 893 KATFGT-PAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069 + P + +D L ++ ++ D+ F M DK+ GPD Sbjct: 303 REFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPD 362 Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249 G++A F++ AW II +F+ AI+S G + + +N+T + LI K + + ++RPISC Sbjct: 363 GYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRPISC 422 Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321 CNV+ K ISK++ANRLK +L K + Sbjct: 423 CNVLYKVISKIIANRLKLVLPKFI 446 >ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum lycopersicum] Length = 1333 Score = 207 bits (528), Expect = 6e-51 Identities = 134/442 (30%), Positives = 213/442 (48%), Gaps = 2/442 (0%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 VYA Q+R LW DI + PW ++GD NV + EK+ GR + N +F Sbjct: 50 VYAKCKDQLRKPLW-DIMLKRSETMYPWSIIGDFNVITSTSEKLGGRDYNINKSLEFINI 108 Query: 182 LFEARLQDLKAERCHLTWCNRQIEG-RIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358 + L D+ TWCN + +G RI +LDR + N++WI+ + + LPS SDH Sbjct: 109 IEACGLVDMGYHGQDYTWCNHRKDGARIWKRLDRGMTNDKWIETIPHSSITHLPSVGSDH 168 Query: 359 SPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLD 538 P ++ I + + FKF N W ++ F+ V++ W+ V GNPM+ KL+ + Sbjct: 169 CPLLMEICDIQSNTIKYFKFLNCWTENDSFLETVEKCWKRDVIGNPMWNFHTKLRRLTKT 228 Query: 539 LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESA 718 L+ WSK ++G++ +K L L NI ++ + AI E SK E 
Sbjct: 229 LRIWSKQEYGDVFEKVK-LYEDLVKKAENIIIDNYSAKNSEKLNAINAEYIKFSKMEYKI 287 Query: 719 AKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKA 898 +QK++++WL+ GD++T+ FH ++ +R R I + E+G ++ E +I +++ Sbjct: 288 LQQKTQLHWLQEGDANTKYFHTVIRGKRNRMSIHKLMDESGNWIKGEEEIAKHACDYYEK 347 Query: 899 TF-GTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGF 1075 F G K + ++ + +E M P A GPDGF Sbjct: 348 IFTGMNGKIKEDIL----QCINPMITQEQNKDLDRIPDMDELRRTIMSMNPHSAPGPDGF 403 Query: 1076 SACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCN 1255 F+Q + II D L A+K + R + +TLI K++ P RL +FRPIS N Sbjct: 404 GGKFYQVCFDIIKEDLLAAVKHFYVGNIMPRYLTHACLTLIPKIDHPCRLKDFRPISLSN 463 Query: 1256 VVCKAISKVLANRLKPLLHKLV 1321 K ISK+L+ RL +L +V Sbjct: 464 FTNKIISKILSTRLALILPSIV 485 >emb|CAN68838.1| hypothetical protein VITISV_030956 [Vitis vinifera] Length = 1881 Score = 207 bits (528), Expect = 6e-51 Identities = 142/443 (32%), Positives = 220/443 (49%), Gaps = 3/443 (0%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 VY ++ +R +LW ++ IA + W V GD NV EK+ G + S++DF + Sbjct: 937 VYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEKLGGSRLTP-SMKDFDDF 995 Query: 182 LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361 + + L DL TW N Q+ + +LDR L +NEW F ++ LP SDH Sbjct: 996 ISDCELIDLPLRSASFTWSNMQVNP-VCKRLDRFLYSNEWEQTFPQSIQGVLPRWTSDHW 1054 Query: 362 PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNVKLD 538 P V+ + K PF+F N W F W E + +G K + KL+ VK Sbjct: 1055 PIVLE-TNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWEGHKFMRKLQFVKAK 1113 Query: 539 LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIK-EISFLSKCEES 715 LK W+K FG + +D+ S+L N E + +L A +RAI K E+ L EE Sbjct: 1114 LKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLA-QRAIKKGELEELILREEI 1172 Query: 716 AAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFK 895 +QK+RV W+K GD +++ FH+ RR R I ++ ENG+ + + I+ E++R+F+ Sbjct: 1173 HWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESIKEEILRYFE 1232 Query: 896 
ATFGTPAKCNTSVFSQ*RDDLH-HQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072 + +P+ + V + L ++ E + FQM DKA GPDG Sbjct: 1233 KLYTSPSGESWRV-----EGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAPGPDG 1287 Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252 F+ FQ W +I D +K +G I + NA+ I L+ K M R+++FRPIS Sbjct: 1288 FTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRPISLI 1347 Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321 + K I+KVLA R++ +LH+ + Sbjct: 1348 TSLYKIIAKVLAGRIREVLHETI 1370 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 207 bits (527), Expect = 8e-51 Identities = 128/440 (29%), Positives = 211/440 (47%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181 VYA R ELW ++ I+ M PW+V GD N ++ DE++ G +P S+ED Sbjct: 952 VYAKCTRIERRELWTSLRIISDGMQAPWLVGGDFNSIVSCDERLNGAIPHDGSMEDLSST 1011 Query: 182 LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361 LF+ L D E TW N R+ +LDRV+ N EW + F+ L SDH Sbjct: 1012 LFDCGLLDAGFEGNSFTWTNN----RMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDHC 1067 Query: 362 PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541 P +++ S + F+F + W +F++ V+++W + + K + +K DL Sbjct: 1068 PLLISCSNTNQRGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRDL 1127 Query: 542 KTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESAA 721 K W+K+ FG++ ++ + ++N ++P + +A K LS EE Sbjct: 1128 KWWNKHIFGDIFKILRLAEVEAEQRELNFQQNPSAANRELMHKAYAKLNRQLS-IEELFW 1186 Query: 722 KQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKAT 901 +QKS V WL G+ +T+ FH M+++R RN I I+ + G L + I+ + F+ Sbjct: 1187 QQKSGVKWLVEGERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNL 1246 Query: 902 FGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFSA 1081 +C+ S F ++ D F + D GPDGFS+ Sbjct: 1247 LKAE-QCDISRFDP--SITPRIISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGFSS 1303 Query: 1082 CFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNVV 
1261 F+Q W II +D +A+ + R + +T + L+ K + S+ +EFRPIS C V+ Sbjct: 1304 LFYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVL 1363 Query: 1262 CKAISKVLANRLKPLLHKLV 1321 K ++K+LANRL +L ++ Sbjct: 1364 NKIVTKLLANRLSKILPSII 1383 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 206 bits (525), Expect = 1e-50 Identities = 134/454 (29%), Positives = 231/454 (50%), Gaps = 14/454 (3%) Frame = +2 Query: 2 VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLN---HDEKIEGRMPSKNSI 163 VYAS + R LW ++K + + PW +LGD N TL+ H + M + + Sbjct: 107 VYASNYVEERKVLWSELKDHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPG-M 165 Query: 164 EDFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPS 343 DF++ + L D+ A+ TWCN++ G IM KLDRVL+N+ W F++++S F Sbjct: 166 RDFQQVINYCSLTDMAAQGPLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAG 225 Query: 344 GLSDHSPAVVTISKK--RKICG-RPFKFFNFWADDSEFMTVVQEAWE----IKVSGNPMF 502 G SDH ++++ + K+ G +PFKF N D +F +V W+ + +S + +F Sbjct: 226 GCSDHLRCRISLNSEAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLF 285 Query: 503 KLIMKLKNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIK 682 + LK +K +++ ++++ GN+ + L Q +P + + +E A Sbjct: 286 RFSKNLKGLKPKIRSMARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAME-EENAAYS 344 Query: 683 EISFLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEA 862 ++ EE KQKS+++W ++GD +T++FH++ R A N I I +G Sbjct: 345 RWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGD 404 Query: 863 DIEMEVIRHFKATFGT-PAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQ 1039 +I+ E R F+ P ++ + L + ++ D+ F+ Sbjct: 405 EIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFR 464 Query: 1040 MMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPS 1219 M DK+ GPDG+++ FF+ W II +F A++S G + + +N+T + LI K Sbjct: 465 MPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAR 524 Query: 1220 RLNEFRPISCCNVVCKAISKVLANRLKPLLHKLV 1321 + ++RPISCCNV+ K ISK++ANRLK +L K + Sbjct: 525 EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFI 558 >ref|XP_004298219.1| PREDICTED: 
uncharacterized protein LOC101304768 [Fragaria vesca subsp. vesca] Length = 1687 Score = 205 bits (521), Expect = 4e-50 Identities = 143/456 (31%), Positives = 223/456 (48%), Gaps = 17/456 (3%) Frame = +2 Query: 5 YASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFRECL 184 Y + D+Q+R W ++ IA ++ PW+V GD N L +K G + I FRE + Sbjct: 106 YGNPDSQLRHFSWDLLRRIAKSVRGPWIVFGDFNELLCIGDKRGGGERPEAQIRRFREAV 165 Query: 185 FEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSP 364 E LQ+++ TW G ++ +LDR +N E F + + G SDH Sbjct: 166 DECGLQEVEFSGPTFTWKR----GTLLERLDRCFINEEAGVLFPRFHEAHVDVGASDHLS 221 Query: 365 AVVTISKKRKICGRP--------FKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKL 520 V + + CGR F+F FWA + E VV +AW+ GN + + KL Sbjct: 222 LV--LFSEGLNCGRKGGWKGLRRFQFEPFWAKEQESKQVVADAWQS--DGNQLNNVRAKL 277 Query: 521 KNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNK--DLAADER-AIIKEIS 691 V +L+ W++NKFG + I+ L L + PF+ ++ + R AI+ E++ Sbjct: 278 AGVSKELQRWNENKFGLIPKKIRQLNKELE-------QCPFDSSDEVVQNRRNAIVAELN 330 Query: 692 FLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIE 871 + EES +Q+SR+NWL+ GD +T+ FH K R +N++ I GE++ E +I+ Sbjct: 331 KSLEIEESIWRQRSRINWLQEGDRNTKFFHGFAKGRGRKNRVLGIMSSTGEWIEQETEIQ 390 Query: 872 MEVIRHFKATFGTPAKCN------TSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXX 1033 HF F T C+ +V + DD++ +LN+ Sbjct: 391 QAFNTHFSQLF-TSEGCDHMELVLDTVQRKVTDDMNAKLNKPFTKLDIDEALK------- 442 Query: 1034 FQMMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEM 1213 QM PDK+ G DGFSA F+Q W I+ + L G +++N T + LI K+E Sbjct: 443 -QMGPDKSPGEDGFSARFYQAYWEIVGDEVSNRCLQVLNEGASVKDLNHTLLALIPKIEN 501 Query: 1214 PSRLNEFRPISCCNVVCKAISKVLANRLKPLLHKLV 1321 P + +FRPIS CNV+ K ISK + NR+K LL +++ Sbjct: 502 PQGVADFRPISLCNVLYKLISKAMVNRMKVLLPEVI 537 >ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247390 [Solanum lycopersicum] Length = 612 Score = 205 bits (521), Expect = 4e-50 Identities = 125/441 (28%), Positives = 211/441 (47%), Gaps = 1/441 (0%) Frame = +2 Query: 2 
VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
+YA +R LW + + A+ T PW +GD NV + DEK+ G + DF
Sbjct: 77   IYAKCKEYLRRPLWDKLLHHASVSTNPWCAVGDYNVIFDVDEKLGGLPYNMRKSMDFIAL 136

Query: 182  LFEARLQDLKAERCHLTWCNRQ-IEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358
+ L D+ TW N++ RI +LDR LVN+ W+++ + + L + SDH
Sbjct: 137  IEACGLVDIGFSGHRFTWSNKRGFNNRIWKRLDRALVNDLWLEKMPQTTITHLSTTGSDH 196

Query: 359  SPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLD 538
P ++ + + F+F N W D+ FM V+ W+ + GN M+K K+K +
Sbjct: 197  CPYLLEMVSTEVDRIKYFRFLNCWVDNPNFMLTVKNCWDRPMEGNAMWKFHQKMKRLSNT 256

Query: 539  LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESA 718
L WS+N+FG++ ++ + ++ + N + + + I FL K E++
Sbjct: 257  LSVWSRNEFGDIFQKVRMYEEQVHEAEENYIRDQTDSNRITLHELNAQYIKFL-KIEDTI 315

Query: 719  AKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKA 898
KQK+++ K GD++ + FH ++ RR + I I ENG+++ E +I HF A
Sbjct: 316  LKQKTQLQLFKDGDTNFKYFHSIIRARRRKLFIHKIITENGDWIQGENNIAQNACDHFNA 375

Query: 899  TFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFS 1078
F + N + Q + + +N++ F M P+ A GPDG +
Sbjct: 376  IFTSE---NKHINEQNLECIPRMVNKDQNTQLTKLPDMDELKEVVFSMNPNSAAGPDGMN 432

Query: 1079 ACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNV 1258
FF++ II D ++ + I + + + I L+ KV ++L EFRPIS N
Sbjct: 433  GYFFKKCLNIIKNDLVEVLHPFFSGQMIPKYFSHSCIVLLPKVNNTNKLTEFRPISLSNF 492

Query: 1259 VCKAISKVLANRLKPLLHKLV 1321
K ISK+++NRL P+L L+
Sbjct: 493  TSKIISKLVSNRLSPILLSLI 513

>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana]
Length = 1164

Score = 205 bits (521), Expect = 4e-50
Identities = 135/450 (30%), Positives = 221/450 (49%), Gaps = 14/450 (3%)
Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIED- 169
VYASTD R LW +I + V PW VLGD N L+ PS++S D
Sbjct: 6    VYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILH---------PSEHSTSDG 56

Query: 170  ---------FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEA 322
Sbjct: 57   FNVDRPTRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSS 116

Query: 323  FSSFLPSGLSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPM 499
F SDHS +++ +PF+F NF D F++++ W V+G+ M
Sbjct: 117  LGLFGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAM 176

Query: 500  FKLIMKLKNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAII 679
+++ +KLK +K ++ +S++ + +++ K+ +L Q + SP + AA E
Sbjct: 177  YRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSN-AAIEAETQ 235

Query: 680  KEISFLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADE 859
++ L++ E S Q+SRVNWL+ GD ++ FH+ R++ N I + G+ + +
Sbjct: 236  RKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQ 295

Query: 860  ADIEMEVIRHFKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQ 1039
++E + +F++ G+ + + L ++ + ++ F
Sbjct: 296  QNLENHCVEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFS 355

Query: 1040 MMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPS 1219
+ +KA GPDGFS FF W II + +AI +GK+ ++ NAT + LI K+ S
Sbjct: 356  LPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNAS 415

Query: 1220 RLNEFRPISCCNVVCKAISKVLANRLKPLL 1309
+++FRPISC N V K ISK+L +RLK L
Sbjct: 416  SMSDFRPISCLNTVYKVISKLLTDRLKDFL 445

>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
Length = 2367

Score = 203 bits (516), Expect = 2e-49
Identities = 125/443 (28%), Positives = 217/443 (48%), Gaps = 3/443 (0%)
Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
VYA R LW ++ +A + VPW+V GD N+ L +E++ G P + ++EDF
Sbjct: 1158 VYAKCTRSERTLLWDCLRRLAADIEVPWLVGGDFNIILKREERLYGSAPHEGAMEDFAST 1217

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361
L + L D E TW N R+ +LDR++ N+ WI++F L SDH
Sbjct: 1218 LLDCGLLDGGFEGNPFTWTNN----RMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDHC 1273

Query: 362  PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541
Sbjct: 1274 PLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHL 1333

Query: 542  LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK---CEE 712
K W+K FG++ + +K+ + + ++ N+ + K + L+K EE
Sbjct: 1334 KWWNKVMFGDIFSKLKEAEKRVEECEI----LHQNEQTVESIIKLNKSYAQLNKQLNIEE 1389

Query: 713  SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892
KQKS V W+ G+ +T+ FH M+++R R+ I ++ +G ++ D+ ++ I++F
Sbjct: 1390 IFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYF 1449

Query: 893  KATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072
+ C+ S F R + ++ + F + P+ A GPDG
Sbjct: 1450 SSLLKFEP-CDDSRFQ--RSLIPSIISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDG 1506

Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252
FS+ F+Q+ W II D L A++ I R V +T + L+ K S+ ++FRPIS C
Sbjct: 1507 FSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLC 1566

Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321
V+ K I+K+L+NRL +L ++
Sbjct: 1567 TVMNKIITKLLSNRLAKILPSII 1589

>emb|CAA18234.1| putative protein [Arabidopsis thaliana]
gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana]
Length = 1141

Score = 202 bits (515), Expect = 2e-49
Identities = 135/445 (30%), Positives = 214/445 (48%), Gaps = 9/445 (2%)
Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIAT---TMTVPWMVLGDMNVTLN-HDEKIEGRMPSKNSIED 169
VYA+ + R ELWR+I + T PW++LGD N L+ H+ + I D
Sbjct: 101  VYAANEDDKRKELWREITALVASPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRD 160

Query: 170  FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGL 349
FRECL +A L DL + TW N+ + K+DR+LVN W + F +F F P
Sbjct: 161  FRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPDF 220

Query: 350  SDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKN 526
SDH+ V + RPFKFFNF + EF+ +V + W V G+ MF++ KLK
Sbjct: 221  SDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKA 280

Query: 527  VKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKC 706
Sbjct: 281  LKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNP-SLENAAHELEAQRKWQILATA 339

Query: 707  EESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIR 886
EES +Q+SRV W GD +TR FH+ R++ N IT++ ++G + D + +
Sbjct: 340  EESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSG----TQIDSQQGIAD 395

Query: 887  HFKATFGTPAKCNTSVFSQ*RDDLH----HQLNEEDRMXXXXXXXXXXXXXXXFQMMPDK 1054
H F + +S +DD++ ++ F + +K
Sbjct: 396  HCALYFENLLSDDNDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNK 455

Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234
A GPDGF R + I +G + ++ NAT I LI K S ++F
Sbjct: 456  ACGPDGFPVTAAVREFFI--------------SGNLLKQWNATTIVLIPKFPNASCTSDF 501

Query: 1235 RPISCCNVVCKAISKVLANRLKPLL 1309
RPISC N + K I+++L +RL+ LL
Sbjct: 502  RPISCMNTLYKVIARLLTDRLQKLL 526