BLASTX nr result
ID: Akebia27_contig00035719
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00035719
         (1035 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  179   1e-42
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...  171   6e-40
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...  168   4e-39
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]             167   7e-39
ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, part...  162   3e-37
ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, part...  157   7e-36
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...  156   1e-35
ref|XP_007203344.1| hypothetical protein PRUPE_ppa020282mg [Prun...  155   2e-35
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  152   3e-34
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...  149   1e-33
emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulga...  149   2e-33
ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664...  147   5e-33
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...  147   7e-33
ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prun...  147   9e-33
ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prun...  146   1e-32
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...  146   1e-32
emb|CAH65761.1| H0215A08.3 [Oryza sativa Indica Group]               146   2e-32
gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thal...  145   2e-32
gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arab...
145 2e-32 ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis t... 145 2e-32 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 179 bits (455), Expect = 1e-42 Identities = 101/289 (34%), Positives = 158/289 (54%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 +N WIN++ D+ V +R GISDH P++ +L G+ FKF NF AD F+ VV++ Sbjct: 204 VNVAWINQYPDVVVEYREAGISDHSPLIFNLATQHDEGGRPFKFLNFLADQNGFVEVVKE 263 Query: 326 AWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 AW + M + +L+ VK LK++ KF L++ L Q AL E Sbjct: 264 AWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKAHCQVEELRRKLAAVQ-ALPEVSQV 322 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 L E+E+ + + +E+IL+QKSRI WL LGDSN+ FF ++K R+ARN I L Sbjct: 323 SELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLGDSNSKFFFTAIKVRKARNKIVLL 382 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 Q D G +T+ +I+ +F++ LLG + ++ + + + V +L+ L I Sbjct: 383 QNDRGDQLTENTEIQNEICNFYRRLLGTS-SSQLEAIDLHVVRVGAKLSATSCAQLVQPI 441 Query: 866 SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRS 1012 + +EI AL +D+ KAPG DGF + FFKK+W ++ Q+ + I F + Sbjct: 442 TIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQEIYEGILDFFEN 490 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 171 bits (432), Expect = 6e-40 Identities = 100/298 (33%), Positives = 159/298 (53%), Gaps = 3/298 (1%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 +N W+ +F A SDHCP V++ + + K FK NF P F+ ++ Sbjct: 206 VNDSWLIASPLSYGSFCAMEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRV 265 Query: 326 AWNVKTY-GTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPL 502 W+ Y G+ MF L++K K +KG ++T++R + L + + L+ Q L AP Sbjct: 266 TWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAP- 324 Query: 503 CKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQC 682 +LA E+ ++ L EE L QKSR+ WL+ GDSNT FFH+ + +RRA N I Sbjct: 325 
SSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHY 384 Query: 683 LQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYE--TERDFSHIDLSEVKNRLNYEDVLMLG 856 L GR + + ++++ +DFFK L G++ + S I+ S + + + +L Sbjct: 385 LLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAEGISQIN-SLTRFKCDENTRQLLE 443 Query: 857 MDISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINE 1030 ++SE +IK F + NK+PGPDG+T+ FFKKTW +VG A+ FRS +L+ + Sbjct: 444 AEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQ 501 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 168 bits (425), Expect = 4e-39 Identities = 105/296 (35%), Positives = 150/296 (50%), Gaps = 5/296 (1%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N EW F F PG SDH P ++ + + P K+FK+F+F + PS+L + A Sbjct: 208 NGEWFAVFPSALAVFDPPGDSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTA 267 Query: 329 WNVKTY-GTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W T G+ MF L Q LK K +T +RL+F N+ T++ LE+ Q+ L +P Sbjct: 268 WEANTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-S 326 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 L +E + ++ E+ RQKSRI WL GD+NT FFH++V + +A N I+ L Sbjct: 327 DTLFRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFL 386 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 + D G V + + I+ I ++ +LLG E FS + ++K L + L + Sbjct: 387 RGDDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFS---VEKIKGLLPFRCDSFLASQL 443 Query: 866 ----SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKL 1021 SEEEI LF M NKAPGPDGF FF + W +V AI F S L Sbjct: 444 TTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNL 499 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 167 bits (423), Expect = 7e-39 Identities = 105/296 (35%), Positives = 150/296 (50%), Gaps = 5/296 (1%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N EW F F PG SDH P ++ + + P K+FK+F+F + PS+L + A Sbjct: 251 
NGEWFAVFPSALAVFDPPGDSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTA 310 Query: 329 WNVKTY-GTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W T G+ MF L Q LK K +T +RL+F N+ T++ LE+ Q+ L +P Sbjct: 311 WEENTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-S 369 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 L +E + ++ E+ RQKSRI WL GD+NT FFH++V + +A N I+ L Sbjct: 370 DTLFRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFL 429 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 + D G V + + I+ I ++ +LLG E FS + ++K L + L + Sbjct: 430 RGDDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFS---VEKIKGLLPFRCDSFLASQL 486 Query: 866 ----SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKL 1021 SEEEI LF M NKAPGPDGF FF + W +V AI F S L Sbjct: 487 TTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNL 542 >ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica] gi|462421129|gb|EMJ25392.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica] Length = 916 Score = 162 bits (409), Expect = 3e-37 Identities = 92/297 (30%), Positives = 156/297 (52%), Gaps = 3/297 (1%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNF---WADDPSFLGVVQ 322 ++W + F +V P SDH P+ V++ + ++ G+ K F F WA+ + + +Q Sbjct: 78 ADWCSRFLGTKVIHLNPTKSDHLPLKVTISERMLLNGRRKKLFRFEEMWAEHVNCMQTIQ 137 Query: 323 QAWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPL 502 W G+ F T+KLK + +L WS+ FG+L ++ L E L +AP Sbjct: 138 DGWQRTCRGSAPFTTTEKLKCTRHQLLGWSKCNFGHLPNQIKITREKLGE----LLDAPP 193 Query: 503 CKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQC 682 H E + + +L+ E RQ+SR WL+ GD N+ FFH S R RN I Sbjct: 194 SHHTVELRNALTKQLDSLMAKNEVYWRQRSRATWLKAGDRNSKFFHYKASSCRRRNTISA 253 Query: 683 LQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMD 862 L+ + G T E+ + ++ +++F++L + + D++ + + V+ R+ E L + Sbjct: 254 LEDEHGHWQTTEQGLTQTVVNYFQHLFSSIGSS--DYTEV-VDGVRGRVTEEMNQALLAE 310 Query: 863 
ISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINEV 1033 + EEIK ALFQM +KAPGPD F+ F++K W +VG+D A+ F++ KL+ ++ Sbjct: 311 FTPEEIKIALFQMHPSKAPGPDDFSPFFYQKYWQIVGEDMVAAVLHFFKTGKLLKKI 367 >ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica] gi|462399232|gb|EMJ04900.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica] Length = 1367 Score = 157 bits (397), Expect = 7e-36 Identities = 92/294 (31%), Positives = 152/294 (51%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQAW 331 ++W + F +V P SDH P+ K F+F WA+ + + +Q W Sbjct: 483 ADWCSRFLGTKVIHLNPTKSDHLPLK-----------KLFRFEEMWAEHVNCMQTIQDGW 531 Query: 332 NVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLCKH 511 + G+ F T+KLK +L WS+ FG+L ++ L E L +AP H Sbjct: 532 QRTSRGSAPFTTTEKLKCTCHQLLGWSKCNFGHLPNQIKITQEKLGE----LLDAPPSHH 587 Query: 512 LAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCLQL 691 E ++ + +L+ E RQ+SR WL+ GD N+ FFH SRR RN I L+ Sbjct: 588 TVELRNVLTKQLDSLMAKNEVYWRQRSRATWLKAGDRNSKFFHYKASSRRRRNTISALED 647 Query: 692 DCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDISE 871 + G T E+ + ++ +++F++L + +E ++ + + V+ R+ E L + + Sbjct: 648 EHGHWQTTEQGLTQTVVNYFQHLFSSTGSSE--YTGV-VDGVRGRVTEEMNQTLLAEFTP 704 Query: 872 EEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINEV 1033 EEIK ALFQM +KAPGPDGF+ F++K W +VG+D A+ F++ KL+ ++ Sbjct: 705 EEIKIALFQMHPSKAPGPDGFSPFFYQKYWQIVGEDVVAAVLHFFKTGKLLKKI 758 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 156 bits (395), Expect = 1e-35 Identities = 90/300 (30%), Positives = 149/300 (49%), Gaps = 5/300 (1%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 +N W + F F P SDH V + + + + F+F+NF +P F+ +V + Sbjct: 157 VNESWCSRFPSAYAVFGEPDFSDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGE 216 Query: 326 AW-NVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPL 502 W ++ G+ MF++++KLK +K ++T+S F NL + Q P Sbjct: 
217 LWYSINVVGSSMFKMSKKLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPT 276 Query: 503 CKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQC 682 + A E ++ L++ EE+ Q+SR+ W+ GDSNT++FH+ SR+A N I Sbjct: 277 IPNAA-LEMEAQRKWLILVKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHI 335 Query: 683 LQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYE----TERDFSHIDLSEVKNRLNYEDVLM 850 + D G + + I++ I++F NLLG + DF + + R +++ Sbjct: 336 IIDDNGVKIDTQLGIKEHCIEYFSNLLGGEVGPPMLIQEDFDLL----LPFRCSHDQKKE 391 Query: 851 LGMDISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINE 1030 L M S ++IK A F NK GPDGF FFK+TW ++G + T A+S F S L+ + Sbjct: 392 LAMSFSRQDIKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQ 451 >ref|XP_007203344.1| hypothetical protein PRUPE_ppa020282mg [Prunus persica] gi|462398875|gb|EMJ04543.1| hypothetical protein PRUPE_ppa020282mg [Prunus persica] Length = 1496 Score = 155 bits (393), Expect = 2e-35 Identities = 93/294 (31%), Positives = 149/294 (50%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQAW 331 ++W + F +V P SDH P+ K F+F WA+ + + +Q W Sbjct: 572 ADWCSRFLGTKVIHLNPTKSDHLPLK-----------KLFRFEEMWAEHVNCMQTIQDGW 620 Query: 332 NVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLCKH 511 G+ F T+KLK + +L WS+ FG+L ++ L E L +AP H Sbjct: 621 QRTCRGSAPFTTTEKLKCTRHKLLGWSKCNFGHLPNQIKITREKLGE----LLDAPPSHH 676 Query: 512 LAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCLQL 691 AE + + +L+ E RQ SR WL+ GD N+ FFH SRR RN I L+ Sbjct: 677 TAELRNALTKQLDSLMAKNEVYWRQCSRATWLKAGDRNSKFFHYKASSRRRRNTISALED 736 Query: 692 DCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDISE 871 + G T E+ + ++ +++F++L + +E ++ + + V+ R+ E L + Sbjct: 737 EHGHWQTTEQGLTQTVVNYFQHLFSSTGSSE--YTEV-VDGVRGRVTEEMNQALLAVFTP 793 Query: 872 EEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINEV 1033 EEIK ALFQM +KAPGPDGF+ F++K W +VG+D A+ F++ KL+ + Sbjct: 794 EEIKIALFQMHPSKAPGPDGFSPFFYQKYWPIVGEDVVAAVLHFFKTGKLLKRI 847 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1110 Score = 152 bits (383), Expect = 3e-34 Identities = 91/294 (30%), Positives = 148/294 (50%), Gaps = 2/294 (0%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 +N W+ + ++ V + PGISDH P++ +L GK FKF N A+ FL V++ Sbjct: 201 VNLVWLGMYAEVSVQYLPPGISDHSPLLFNLMTGRPQGGKPFKFMNVMAEQGEFLETVEK 260 Query: 326 AWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 AWN + + LK VK LK K G L+ L++ Q + + Sbjct: 261 AWNSVNGRFKLQAIWLNLKAVKRELKQMKTQKIGLAHEKVKNLRHQLQDLQ-SQDDFDHN 319 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 + + + + + E++IL+QKSRI WLQ GD+N+ F +VK+R A N I L Sbjct: 320 DIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQGDTNSKLFFTAVKARHAINRIDML 379 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNR--LNYEDVLMLGM 859 + GR + D +++++ ++F+K LLG T +DL+ V+ L+ + L Sbjct: 380 NTEDGRVIQDADEVQEEILEFYKKLLGTRAST---LMGVDLNTVRGGKCLSAQAKESLIR 436 Query: 860 DISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKL 1021 +++ EI AL + +KAPG DGF A FFKK+W + Q+ I F + ++ Sbjct: 437 EVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEIYAGIQEFFNNSRM 490 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 149 bits (377), Expect = 1e-33 Identities = 89/300 (29%), Positives = 150/300 (50%), Gaps = 5/300 (1%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 +N +W F F P SDH +SL K F+F NF D +FL ++ Sbjct: 105 VNDKWTTTFPSSLGLFGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICL 164 Query: 326 AW-NVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPL 502 W + G+ M+R++ KLK +K ++ +SR + ++ T L Q L +P Sbjct: 165 KWFSTSVTGSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASP- 223 Query: 503 CKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQC 682 C A E ++ L E + Q+SR+NWL+ GD N+++FH+ +R++ NHI Sbjct: 224 CPSNAAIEAETQRKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHF 283 Query: 683 
LQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYE----DVLM 850 L G + ++++E +++F++ LG +E+ + +++ N L+Y + Sbjct: 284 LSDPVGDRIEGQQNLENHCVEYFQSNLG----SEQGLPLFEQADISNLLSYRCSPAQQVS 339 Query: 851 LGMDISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINE 1030 L S E+IK+A F + NKA GPDGF+ FF W ++G + T+AI F S KL+ + Sbjct: 340 LDTPFSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQ 399 >emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1379 Score = 149 bits (375), Expect = 2e-33 Identities = 87/292 (29%), Positives = 145/292 (49%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQ 325 LN EWINEF + ++ G+SDHCP++ ++ K F+F N W DP L +V + Sbjct: 199 LNPEWINEFPSMRLSLLQRGLSDHCPLLTNIHTQNWGP-KPFRFQNCWLTDPHCLEIVNK 257 Query: 326 AWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W +++ PM KL+ VK RLK W+R +FG++ T+ ++ +++ E L Sbjct: 258 TW-LESTNMPMI---DKLRRVKIRLKAWNRDEFGHIDTNIKIMEDEIQKFDTISNERELD 313 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 + E+ + + ++ +E Q SRI WL+ GD NT FFH +++ RN I + Sbjct: 314 EQEIERRKEAQSDLWMWMKRKELYWAQNSRILWLKHGDRNTKFFHMVASNKKRRNFIASI 373 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 +++ GR + I++ A+ FFK + + + N+L+ L Sbjct: 374 KVN-GRRIEKPNQIKEEAVTFFKEIFTEEFTERPTLEGLQF----NQLSQNQADSLIQPF 428 Query: 866 SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKL 1021 S+EEI +A+ +KAPGPDGF F K W + +D + + + KL Sbjct: 429 SDEEIDYAVNSCASDKAPGPDGFNFKFIKNAWETIKEDVYTLVREFWATSKL 480 >ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664381 [Glycine max] Length = 515 Score = 147 bits (372), Expect = 5e-33 Identities = 96/300 (32%), Positives = 151/300 (50%), Gaps = 9/300 (3%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N W+ D + AP +SDH + +S KD FK+ N A F V++ Sbjct: 203 NLNWLQMHIDSTLKILAPSVSDHALMFLSCKDQSSRLRGRFKYRNSLARLNGFHDEVKKN 262 Query: 329 
WNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEA--PL 502 WN+ +G PM++L KL ++ LK NLS+ + L++ ++E + LQ+A L Sbjct: 263 WNLGVHGNPMYKLWTKLSRLQSVLK--------NLSSPLNGLREKIDEARRNLQQAHEDL 314 Query: 503 CKHLAEKERMV-----AGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRAR 667 C+ + + E L E+ LRQK++INW++ GD N ++FH ++K R Sbjct: 315 CRDRFNVDNINRVKDRTSELLQLNELEDNDLRQKAKINWIRQGDGNNSYFHATIKGRYKH 374 Query: 668 NHIQCLQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVK--NRLNYED 841 N I+ L + G +T EDIE+ + F+ LLG+ +E + + +++ ++ N LN Sbjct: 375 NAIRSLIKEDGSCITSHEDIEEEVLKFYSALLGS---SESNLAGLNIPAIRNGNTLNQFQ 431 Query: 842 VLMLGMDISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKL 1021 ML +S EI + MD NK PG DG+ GFFK W +VG D +AI F +L Sbjct: 432 RDMLIGPVSNAEIDTTIKGMDVNKTPGIDGYGVGFFKDAWSIVGSDVREAILDFFLRNRL 491 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 147 bits (371), Expect = 7e-33 Identities = 82/286 (28%), Positives = 146/286 (51%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N +WIN+F + SDHCP+++S + +F+F + WA +F V+ Sbjct: 1043 NQQWINKFPITRIQHLNRDGSDHCPLLLSCSNSSEKAPSSFRFLHAWALHHNFNASVEGN 1102 Query: 329 WNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLCK 508 WN+ G+ + K K +K LK W++ FG++ ++ +K +EE +I Q+ Sbjct: 1103 WNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEKRVEECEILHQQEQTIG 1162 Query: 509 HLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCLQ 688 + + A + + + EE +QKS + W+ G+ NT FFH ++ +R R+HI +Q Sbjct: 1163 SRIQLNKSYA-QLNKQLSMEEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKIQ 1221 Query: 689 LDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDIS 868 G + D E +++SAIDFF +LL D + S + ++ D L + + Sbjct: 1222 EQDGNWIEDPEQLQQSAIDFFSSLL---KAESCDDTRFQSSLCPSIISDTDNGFLCAEPT 1278 Query: 869 EEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCF 1006 +E+K A+F +D A GPDGF++ F+++ W ++ D +A+ F Sbjct: 1279 
LQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFF 1324 >ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prunus persica] gi|462398983|gb|EMJ04651.1| hypothetical protein PRUPE_ppa022115mg [Prunus persica] Length = 1755 Score = 147 bits (370), Expect = 9e-33 Identities = 91/296 (30%), Positives = 148/296 (50%), Gaps = 2/296 (0%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGK--TFKFFNFWADDPSFLGVVQQ 325 + W N F V P SDH PI+V ++ K + F F W ++Q Sbjct: 614 TSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRYRRFHFEAMWTTHVDCEKTIKQ 673 Query: 326 AWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W PM L +K+K++ L+ WS+ FG++ +T L+ L +L +AP Sbjct: 674 VWESVGNLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEETRVLRAKLA----SLFQAPYS 729 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 + + E R+V L+ E Q+SR NWL+ GD NT++FHQ +RR RN I+ L Sbjct: 730 ERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSYFHQKATNRRRRNIIKGL 789 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 + G T + I ID+F +L ++ + + LS ++ ++ + +L D Sbjct: 790 EDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEI---LSALEPKVTADMQQVLIADF 846 Query: 866 SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINEV 1033 S +EIK A+FQM +KAPGPDG F++K W +VG D A+ + +S +++ ++ Sbjct: 847 SYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDDVVAAVRAFLQSNEMLRQL 902 >ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prunus persica] gi|462413471|gb|EMJ18520.1| hypothetical protein PRUPE_ppa019733mg [Prunus persica] Length = 1275 Score = 146 bits (369), Expect = 1e-32 Identities = 91/296 (30%), Positives = 148/296 (50%), Gaps = 2/296 (0%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGK--TFKFFNFWADDPSFLGVVQQ 325 + W N F V P SDH PI+V ++ K + F F W ++Q Sbjct: 160 TSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRYHRFHFEAMWTTHVDCEKTIKQ 219 Query: 326 AWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W PM L +K+K++ L+ WS+ FG++ +T L+ L +L +AP Sbjct: 220 VWESVGDLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEETRVLRAKLA----SLFQAPYS 275 
Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 + + E R+V L+ E Q+SR NWL+ GD NT++FHQ +RR RN I+ L Sbjct: 276 ERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSYFHQKATNRRRRNIIKGL 335 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMDI 865 + G T + I ID+F +L ++ + + LS ++ ++ + +L D Sbjct: 336 EDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEI---LSALEPKVTADMQQVLIADF 392 Query: 866 SEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLINEV 1033 S +EIK A+FQM +KAPGPDG F++K W +VG D A+ + +S +++ ++ Sbjct: 393 SYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDDVVAAVRAFLQSNEMLRQL 448 >emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1383 Score = 146 bits (369), Expect = 1e-32 Identities = 85/279 (30%), Positives = 139/279 (49%), Gaps = 1/279 (0%) Frame = +2 Query: 146 LNSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKG-KTFKFFNFWADDPSFLGVVQ 322 ++ EW++ +I+V+ G+SDHCP++V + G K F+F N W DP + +V+ Sbjct: 200 VSPEWVSHCPNIKVSILQRGLSDHCPLLVH--SHIQEWGPKPFRFNNCWLTDPKCMKIVE 257 Query: 323 QAWNVKTYGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPL 502 +W+ +P + +KLKE K RLK W+ +FG++ + +L+ + E L Sbjct: 258 ASWS----SSPKISVVEKLKETKKRLKEWNLNEFGSIDANIRKLEDCIANFDKEADEREL 313 Query: 503 CKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQC 682 K EK R + ++ +E Q+SRI WL+ GD NT FFH +++ +N + C Sbjct: 314 DKEELEKRREAQADLWKWMKRKEIYWAQRSRITWLKAGDKNTKFFHAIASNKKRKNMMAC 373 Query: 683 LQLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDVLMLGMD 862 ++ D G++ D I+K A FFK + + ++ L RL+ L Sbjct: 374 IETD-GQSTNDPSQIKKEARAFFKKIFKEDHVKRPTLENLHL----KRLSQNQANSLITP 428 Query: 863 ISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQD 979 + EEI A+ +KAPGPDGF F K W ++ D Sbjct: 429 FTTEEIDTAVSSCASDKAPGPDGFNFKFVKSAWDIIKTD 467 >emb|CAH65761.1| H0215A08.3 [Oryza sativa Indica Group] Length = 1186 Score = 146 bits (368), Expect = 2e-32 Identities = 100/295 (33%), Positives = 155/295 (52%), Gaps = 9/295 (3%) Frame = +2 Query: 152 SEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQAW 331 S+W + F + 
+SDHCP+++S P+ P+G F+F +FW F VV+++W Sbjct: 208 SDWEDMFPQCFLMALPSTVSDHCPLLLSTY-PVHPRGGRFRFESFWCKLEGFEEVVRESW 266 Query: 332 NVKTYGT-PMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTS-------RLKKTLEETQIAL 487 + P+ RL +KL+ +L++WS K GN+ R K E ++L Sbjct: 267 SAPCSTVDPVARLDRKLRTAARKLRSWSDKKVGNVKLQMEMARELIGRFDKAEESRILSL 326 Query: 488 QEAPLCKHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRAR 667 QE L K+L K Y L E + Q+SRI WLQ GD+NT FFHQ +R+ R Sbjct: 327 QERFLHKNLKLK-------YLALASLERTMAWQRSRITWLQEGDANTRFFHQQAAARKRR 379 Query: 668 NHIQCLQLDCGRTV-TDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKNRLNYEDV 844 N IQ QL+ V T E+ ++ A ++F NLLG+A+ +R FS +D+ ++ + + Sbjct: 380 NFIQ--QLEHNEAVATSPEEKDQLAHEYFTNLLGSAH--QRSFS-LDMDFLQQQ--GASL 432 Query: 845 LMLGMDISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFR 1009 L +EEE+ A+ +KAPGPDGFT+ F+ W ++ QD +A+ R Sbjct: 433 PQLETPFTEEEVWAAIRDTPRDKAPGPDGFTSLFYHTCWQILRQDVMEALHQLHR 487 >gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thaliana] Length = 668 Score = 145 bits (367), Expect = 2e-32 Identities = 93/294 (31%), Positives = 148/294 (50%), Gaps = 2/294 (0%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N +W + F F G+SDH P ++ L++ K F++F+F + P+FL + A Sbjct: 228 NGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVA 287 Query: 329 WNVKT-YGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W + G+ MF L + LK K K +R FGN+ T +LE Q L P Sbjct: 288 WEEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNP-S 346 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 L E + +++ E+ RQKSRI WLQ GD+NT FFH+ + + +A+N I+ L Sbjct: 347 DSLFRVEHVARKKWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFL 406 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKN-RLNYEDVLMLGMD 862 ++D V + +++ + ++ +LLG+ + S + ++ R N L Sbjct: 407 RMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSAL 466 Query: 863 ISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLI 1024 S++EI A+F M NKAPGPD FTA FF ++W +V A+ FR+ L+ Sbjct: 467 
PSDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLL 520 >gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arabidopsis thaliana] Length = 602 Score = 145 bits (367), Expect = 2e-32 Identities = 93/294 (31%), Positives = 148/294 (50%), Gaps = 2/294 (0%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N +W + F F G+SDH P ++ L++ K F++F+F + P+FL + A Sbjct: 228 NGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVA 287 Query: 329 WNVKT-YGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W + G+ MF L + LK K K +R FGN+ T +LE Q L P Sbjct: 288 WEEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNP-S 346 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 685 L E + +++ E+ RQKSRI WLQ GD+NT FFH+ + + +A+N I+ L Sbjct: 347 DSLFRVEHVARKKWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFL 406 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKN-RLNYEDVLMLGMD 862 ++D V + +++ + ++ +LLG+ + S + ++ R N L Sbjct: 407 RMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSAL 466 Query: 863 ISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLI 1024 S++EI A+F M NKAPGPD FTA FF ++W +V A+ FR+ L+ Sbjct: 467 PSDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLL 520 >ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis thaliana] gi|332193872|gb|AEE31993.1| DNAse I-like superfamily protein [Arabidopsis thaliana] Length = 626 Score = 145 bits (367), Expect = 2e-32 Identities = 93/294 (31%), Positives = 148/294 (50%), Gaps = 2/294 (0%) Frame = +2 Query: 149 NSEWINEFNDIEVAFRAPGISDHCPIVVSLKDPLIPKGKTFKFFNFWADDPSFLGVVQQA 328 N +W + F F G+SDH P ++ L++ K F++F+F + P+FL + A Sbjct: 292 NGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVA 351 Query: 329 WNVKT-YGTPMFRLTQKLKEVKGRLKTWSRLKFGNLSTDTSRLKKTLEETQIALQEAPLC 505 W + G+ MF L + LK K K +R FGN+ T +LE Q L P Sbjct: 352 WEEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNP-S 410 Query: 506 KHLAEKERMVAGEYSNLIRCEEAILRQKSRINWLQLGDSNTAFFHQSVKSRRARNHIQCL 
685 L E + +++ E+ RQKSRI WLQ GD+NT FFH+ + + +A+N I+ L Sbjct: 411 DSLFRVEHVARKKWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFL 470 Query: 686 QLDCGRTVTDEEDIEKSAIDFFKNLLGNAYETERDFSHIDLSEVKN-RLNYEDVLMLGMD 862 ++D V + +++ + ++ +LLG+ + S + ++ R N L Sbjct: 471 RMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSAL 530 Query: 863 ISEEEIKHALFQMDENKAPGPDGFTAGFFKKTWHLVGQDFTKAISSCFRSKKLI 1024 S++EI A+F M NKAPGPD FTA FF ++W +V A+ FR+ L+ Sbjct: 531 PSDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLL 584