BLASTX nr result
ID: Mentha27_contig00045864
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha27_contig00045864 (817 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006375647.1| hypothetical protein POPTR_0014s18610g, part... 118 2e-24 dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal... 110 6e-22 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 110 8e-22 ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668... 105 2e-20 gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thal... 104 4e-20 gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arab... 104 4e-20 ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis t... 104 4e-20 dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ... 103 6e-20 emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li... 103 6e-20 ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659... 102 2e-19 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 102 2e-19 ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781... 101 3e-19 emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694... 101 3e-19 ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256... 100 6e-19 ref|XP_004239563.1| PREDICTED: uncharacterized protein LOC101259... 100 6e-19 ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom... 100 1e-18 ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom... 99 2e-18 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 97 7e-18 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 97 7e-18 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 96 2e-17 >ref|XP_006375647.1| hypothetical protein POPTR_0014s18610g, partial [Populus trichocarpa] gi|550324501|gb|ERP53444.1| hypothetical protein POPTR_0014s18610g, partial [Populus trichocarpa] Length = 303 Score = 118 bits (296), Expect = 2e-24 Identities = 67/223 (30%), Positives = 111/223 (49%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASD 611 DF D L L DV TGC F+W + + SK+DR +IN W +F + + D Sbjct: 80 DFQDCCFDLGLHDVNFTGCHFSWTNSSVWSKLDRVLINPSWSSLQRLTHVHFGSPSVFLD 139 Query: 610 HTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLR 431 H+P + L ++ +++F F N W H F Q + W++ + G L +L L+ Sbjct: 140 HSPAVVRLDPYMQG-RQNFNFFNMWATHDQFLQVVSSCWSS-PVYGTPMYILCRRLKLLK 197 Query: 430 PILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAER 251 L++LNR H+N++SE+ + QLE Q ++ N+ + + R K L AE+ Sbjct: 198 GPLKELNRLHFNHISERVSRLESQLEQLQNAFQQDRDNQFLFAQDRFLRSKLSSLKFAEK 257 Query: 250 DFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENG 122 F +Q+ K + SD +K+FH+L+ + RN I + +G Sbjct: 258 QFFSQKIKCNFLKHSDNGSKFFHALLGQNHQRNFILAIMCSHG 300 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 110 bits (275), Expect = 6e-22 Identities = 73/239 (30%), Positives = 112/239 (46%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTW----RDKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+PS G FFTW +D I K+DR + N W + F G SDH PCI Sbjct: 177 LSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG-DSDHAPCII 235 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 + + K+ F++ + HPS+ L W ++ G L L + R L Sbjct: 236 LIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTL 295 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR ++N+ ++TA + +LED Q P + L R E ARK++ A F Q+ Sbjct: 296 NRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRR-EHVARKQWIFFAAALESFFRQK 354 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ + ++ D +T++FH V + N I FLR ++G +V I + YYS L G Sbjct: 355 SRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLG 413 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 110 bits (274), Expect = 8e-22 Identities = 73/239 (30%), Positives = 112/239 (46%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTW----RDKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+PS G FFTW +D I K+DR + N W + F G SDH PCI Sbjct: 220 LSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG-DSDHAPCII 278 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 + + K+ F++ + HPS+ L W ++ G L L + R L Sbjct: 279 LIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKVAKLCCRTL 338 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR ++N+ ++TA + +LED Q P + L R E ARK++ A F Q+ Sbjct: 339 NRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRR-EHVARKQWIFFAAALESFFRQK 397 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ + ++ D +T++FH V + N I FLR ++G +V I + YYS L G Sbjct: 398 SRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLG 456 >ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668030 [Glycine max] Length = 411 Score = 105 bits (262), Expect = 2e-20 Identities = 69/251 (27%), Positives = 113/251 (45%) Frame = -1 Query: 811 PKEKDYVDFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFL 632 P + DFVD + L L + + G +TW + + SK+DR + N W + Sbjct: 156 PNAYELQDFVDCYSDLGLGSINTHGPLYTWTNGRVWSKLDRALCNQAWFNSFGNSACEVM 215 Query: 631 TSGIASDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLA 452 SDHTP + T V F+F NA M+HP+F + + D W +I G ++ Sbjct: 216 EFISISDHTPLVVTTELVVPRGNSPFKFNNAIMDHPNFLRIVADSWK-QNIHGYSMFKVC 274 Query: 451 IKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQ 272 KL L+ L+ L + + N+S + A + + P + + R + Sbjct: 275 KKLKALKAPLKNLFKQEFRNISNRVELAEAEYNSVLNSLKQNPQDPSLLALANRTRGQTI 334 Query: 271 QLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIV 92 L AE AQ K K++ +DK +K+FH+L+ R + I+ +R E+G T I Sbjct: 335 MLRKAESMKFAQLIKNKYLLQADKCSKFFHALIKRNRHSRFIAAIRLEDGHNTSSQDEIS 394 Query: 91 ADFVGYYSDLF 59 FV ++ +LF Sbjct: 395 LAFVNHFRNLF 405 >gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thaliana] Length = 668 Score = 104 bits (259), Expect = 4e-20 Identities = 72/239 (30%), Positives = 107/239 (44%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTWR----DKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+PS G +TW D I K+DR + N W + F SG+ SDH+PCI Sbjct: 197 LVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGV-SDHSPCII 255 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 L + K+ FR+ + HP+F +L W G L L + + L Sbjct: 256 ILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCKLL 315 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR + N+ KT A LE Q P + L R E ARKK+ A F Q+ Sbjct: 316 NRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFR-VEHVARKKWNFFAAALESFYRQK 374 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ K + D +T++FH ++ + +N I FLR ++ +V + V YY+ L G Sbjct: 375 SRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLG 433 >gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arabidopsis thaliana] Length = 602 Score = 104 bits (259), Expect = 4e-20 Identities = 72/239 (30%), Positives = 107/239 (44%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTWR----DKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+PS G +TW D I K+DR + N W + F SG+ SDH+PCI Sbjct: 197 LVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGV-SDHSPCII 255 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 L + K+ FR+ + HP+F +L W G L L + + L Sbjct: 256 ILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCKLL 315 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR + N+ KT A LE Q P + L R E ARKK+ A F Q+ Sbjct: 316 NRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFR-VEHVARKKWNFFAAALESFYRQK 374 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ K + D +T++FH ++ + +N I FLR ++ +V + V YY+ L G Sbjct: 375 SRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLG 433 >ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis thaliana] gi|332193872|gb|AEE31993.1| DNAse I-like superfamily protein [Arabidopsis thaliana] Length = 626 Score = 104 bits (259), Expect = 4e-20 Identities = 72/239 (30%), Positives = 107/239 (44%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTWR----DKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+PS G +TW D I K+DR + N W + F SG+ SDH+PCI Sbjct: 261 LVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGV-SDHSPCII 319 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 L + K+ FR+ + HP+F +L W G L L + + L Sbjct: 320 ILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCKLL 379 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR + N+ KT A LE Q P + L R E ARKK+ A F Q+ Sbjct: 380 NRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFR-VEHVARKKWNFFAAALESFYRQK 438 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ K + D +T++FH ++ + +N I FLR ++ +V + V YY+ L G Sbjct: 439 SRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLG 497 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 103 bits (258), Expect = 6e-20 Identities = 65/218 (29%), Positives = 113/218 (51%), Gaps = 4/218 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+ G +TW +KC ++ KIDR ++N W +NF SDH+ C Sbjct: 175 LYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDF-SDHSSCEV 233 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 L V KR FRF N ++ +P F Q + + W + +++G +++ KL +L+ + Sbjct: 234 VLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCF 293 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 +R +Y+++ ++ + A + QR++ P + + ELEA +K+Q L AE F Q+ Sbjct: 294 SRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAEESFFCQK 352 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGE 119 + + D +T YFH + + +K NTI+FL + GE Sbjct: 353 SSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGE 390 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 103 bits (258), Expect = 6e-20 Identities = 65/218 (29%), Positives = 113/218 (51%), Gaps = 4/218 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 L D+ G +TW +KC ++ KIDR ++N W +NF SDH+ C Sbjct: 175 LYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDF-SDHSSCEV 233 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 L V KR FRF N ++ +P F Q + + W + +++G +++ KL +L+ + Sbjct: 234 VLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCF 293 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 +R +Y+++ ++ + A + QR++ P + + ELEA +K+Q L AE F Q+ Sbjct: 294 SRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAEESFFCQK 352 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGE 119 + + D +T YFH + + +K NTI+FL + GE Sbjct: 353 SSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGE 390 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 102 bits (253), Expect = 2e-19 Identities = 66/244 (27%), Positives = 110/244 (45%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASD 611 DFVD + L L + + G +TW + + SK+DR + N W + SD Sbjct: 535 DFVDCYSDLGLGSINTHGPLYTWTNSRVWSKLDRALCNQAWFNSFGNSACEVMEFISISD 594 Query: 610 HTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLR 431 HTP + T V F+F N ++HP+F + + D W +I G ++ KL L+ Sbjct: 595 HTPLVVTTELVVPRGNSPFKFNNLIVDHPNFLRIVADGWK-QNIHGCSMFKVCKKLKALK 653 Query: 430 PILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAER 251 L+ L + ++N+S + A + + P + + R + L AE Sbjct: 654 APLKNLFKQEFSNISNRVELAEAEYNSVLNSIKQNPQDPSLLALANRTRGQTIMLRKAES 713 Query: 250 DFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYY 71 AQ K K++ +DK +K+FH+L+ R K I+ +R E+G T I FV ++ Sbjct: 714 MKFAQLIKNKYLLQADKCSKFFHALIKRNKHSRFIAAIRLEDGHNTSSQDEIALAFVNHF 773 Query: 70 SDLF 59 + F Sbjct: 774 RNFF 777 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 102 bits (253), Expect = 2e-19 Identities = 73/255 (28%), Positives = 119/255 (46%), Gaps = 7/255 (2%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSG 623 DF D L D+ G FTW +K ++ KIDR ++N W F S + Sbjct: 168 DFRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSW--NALFPSSLGIFGS 225 Query: 622 IA-SDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIK 446 + SDH C L + KR F+F N +++ F + D W T+++ G +++ K Sbjct: 226 LDFSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKK 285 Query: 445 LHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQ--RLSDREPLNRLIREAELEARKKYQ 272 L L+ ++ +R +Y+ L ++T A L Q L+D P+N ELEA +K+ Sbjct: 286 LKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASF---ELEAERKWH 342 Query: 271 QLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIV 92 L AE F Q+++ D +TKYFH + + + N+IS L NG+ + I+ Sbjct: 343 ILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGIL 402 Query: 91 ADFVGYYSDLFGKKI 47 Y+ L G ++ Sbjct: 403 DLCASYFGSLLGDEV 417 >ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781932 [Glycine max] Length = 952 Score = 101 bits (252), Expect = 3e-19 Identities = 65/251 (25%), Positives = 112/251 (44%) Frame = -1 Query: 811 PKEKDYVDFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFL 632 P + DFVD + L L + + G +TW + + SK+DR + N +W + Sbjct: 603 PNAYELQDFVDCCSDLGLGSINTHGPLYTWTNGRVWSKLDRALCNQVWFNSFGNSACEVM 662 Query: 631 TSGIASDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLA 452 SDHTP + T V F+F NA ++HP+F + + D W +I G ++ Sbjct: 663 EFISISDHTPLVVTTKLVVPRGNSPFKFNNAIVDHPNFSRIVADGWK-QNIHGCSMFKVC 721 Query: 451 IKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQ 272 KL L+ L+ L + ++N+S + A V+ + P + + R + Sbjct: 722 KKLKVLKASLKNLFKQEFSNISNRVELAEVEYNSVLNSLKQNPQDHSLLALANRTRGQTI 781 Query: 271 QLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIV 92 E AQ K +++ D +K+FH+L+ R + I+ +R E+G T I Sbjct: 782 MFRKVESMKFAQLIKNRYLLQVDICSKFFHALIKRNRHSRFIAAIRLEDGHNTSSQDEIA 841 Query: 91 ADFVGYYSDLF 59 FV ++ +LF Sbjct: 842 LAFVNHFRNLF 852 >emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana] Length = 1141 Score = 101 bits (252), Expect = 3e-19 Identities = 75/252 (29%), Positives = 117/252 (46%), Gaps = 12/252 (4%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSG 623 DF + L D+ G FTW +K ++ KIDR ++N W SN S Sbjct: 160 DFRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESW--------SNLFPSS 211 Query: 622 IA-------SDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQ 464 SDH C L KR F+F N +++P F + D W + ++ G Sbjct: 212 FGLFGPPDFSDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSM 271 Query: 463 EQLAIKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLS-DREPLNRLIREAELEA 287 +++ KL L+ ++ +R +Y+NL ++T A L Q L+ D L E LEA Sbjct: 272 FRVSKKLKALKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNPSLENAAHE--LEA 329 Query: 286 RKKYQQLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGD 107 ++K+Q L AE F QR++ D +T+YFH + + +K NTI+ L ++G T D Sbjct: 330 QRKWQILATAEESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSG-TQID 388 Query: 106 VKTIVADFVGYY 71 + +AD Y Sbjct: 389 SQQGIADHCALY 400 >ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256493 [Solanum lycopersicum] Length = 441 Score = 100 bits (249), Expect = 6e-19 Identities = 72/262 (27%), Positives = 118/262 (45%), Gaps = 12/262 (4%) Frame = -1 Query: 805 EKDYVDFVDTAAYLTLQDVPSTGCFFTWRDKCI-----SSKIDRTMINTIWLEKDWFCRS 641 E + DF D + + ++ G ++TW +K I S +IDR N W++K Sbjct: 63 ENEIKDFADCVKAMGIHELQWKGSYYTWSNKQIGNARVSRRIDRAFGNDEWMDKWGHVIL 122 Query: 640 NFLTSGIASDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQE 461 + G+ SDH+ L Q + + F+F N W EH F +E W KQE Sbjct: 123 EYGNPGV-SDHSTMQLVLHQSNQHVRASFKFFNIWTEHDLFLDLVEKVW--------KQE 173 Query: 460 Q-------LAIKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIRE 302 + + KL L+P+L+QLNR + +S + AR +L D Q + + L+ + Sbjct: 174 KDRDAIKKVWYKLKALQPVLKQLNRKEFKYISNQIEEARNELIDIQNQLCHQAKDELVTK 233 Query: 301 AELEARKKYQQLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENG 122 E E K ++L + L Q+ +AK I L D + KY S++ + + I L +G Sbjct: 234 -EKELLTKLEKLSLIKESALRQKVRAKWIKLGDANNKYLSSVIKERNHKKNIRILMSLDG 292 Query: 121 ETTGDVKTIVADFVGYYSDLFG 56 + + I +FV + L G Sbjct: 293 RKLSEPQEIQDEFVLFDKSLMG 314 >ref|XP_004239563.1| PREDICTED: uncharacterized protein LOC101259634 [Solanum lycopersicum] Length = 425 Score = 100 bits (249), Expect = 6e-19 Identities = 69/236 (29%), Positives = 112/236 (47%), Gaps = 1/236 (0%) Frame = -1 Query: 709 ISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCISTLFQKVETFKRDFRFCNAWME 530 ISS+IDR N W++K + G+ SDH+ L Q + + F+F N W E Sbjct: 167 ISSRIDRAFGNDAWMDKWGHVILEYGNPGV-SDHSSMQLLLHQNYQQVRASFKFFNVWTE 225 Query: 529 HPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQLNRTHYNNLSEKTAAARVQLED 350 H SF + +E W + + + KL L+P+L+QLNR + + ++ AR L D Sbjct: 226 HESFLELVETVWK-QNKGRDAMKMVWYKLKALQPVLKQLNRREFKYIGKQIEEARNDLAD 284 Query: 349 AQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQRAKAKHINLSDKSTKYFHSLVN 170 Q + + L+ + E + K ++ E L Q+A+AK I L D + KYF S++ Sbjct: 285 IQNQLCNQANDDLVTK-EKDLLTKLEKWSLIEESSLRQKARAKWIKLGDANNKYFSSVIK 343 Query: 169 RKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFGKKIPRSP-VDWSVMGAGY 5 + + I L +G+ D + I +FV +Y L G P ++ VM G+ Sbjct: 344 ERNYKKHIRSLMSIDGKMLYDPQEIQDEFVLFYKSLMGTAADNLPAINVRVMKRGH 399 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 99.8 bits (247), Expect = 1e-18 Identities = 68/253 (26%), Positives = 116/253 (45%), Gaps = 3/253 (1%) Frame = -1 Query: 811 PKEKDYVDFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFL 632 P E DF T L D G FTW + + ++DR + N W+ K R L Sbjct: 1206 PHEGAMEDFASTLLDCGLLDGGFEGNPFTWTNNRMFQRLDRIVYNHHWINKFPITRIQHL 1265 Query: 631 TSGIASDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLA 452 SDH P + + F E FRF +AW+ H FK ++E W + I G + Sbjct: 1266 NRD-GSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNW-NLPINGSGLQAFW 1323 Query: 451 IKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQ 272 K H L+ L+ N+ + ++ K A ++E+ + L N E+ ++ K Y Sbjct: 1324 SKQHRLKQHLKWWNKVMFGDIFSKLKEAEKRVEECEILHQ----NEQTVESIIKLNKSYA 1379 Query: 271 QLD---NAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVK 101 QL+ N E F Q++ K + +++TK+FH+ + +K++R+ I ++ +G D + Sbjct: 1380 QLNKQLNIEEIFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQE 1439 Query: 100 TIVADFVGYYSDL 62 + + Y+S L Sbjct: 1440 QLKQSAIKYFSSL 1452 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 98.6 bits (244), Expect = 2e-18 Identities = 68/253 (26%), Positives = 115/253 (45%), Gaps = 3/253 (1%) Frame = -1 Query: 811 PKEKDYVDFVDTAAYLTLQDVPSTGCFFTWRDKCISSKIDRTMINTIWLEKDWFCRSNFL 632 P E DF T L D G FTW + + ++DR + N W+ K R L Sbjct: 1034 PHEGAMEDFASTLLDCGLLDGGFEGNSFTWTNNRMFQRLDRIVYNHHWINKFPVTRIQHL 1093 Query: 631 TSGIASDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLA 452 SDH P + + F E FRF +AW+ H FK ++E W + I G + Sbjct: 1094 NRD-GSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNW-NLPINGSGLQAFW 1151 Query: 451 IKLHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQ 272 K H L+ L+ N+ + ++ K A ++E+ + L +E E+ ++ K Y Sbjct: 1152 SKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILHQQEQ----TFESRIKLNKSYA 1207 Query: 271 QLD---NAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVK 101 QL+ N E F Q++ K + +++TK+FH + +K++R+ I ++ G D + Sbjct: 1208 QLNKQLNIEELFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQDPEGRWIEDQE 1267 Query: 100 TIVADFVGYYSDL 62 + + Y+S L Sbjct: 1268 QLKHSAIEYFSSL 1280 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 97.1 bits (240), Expect = 7e-18 Identities = 71/250 (28%), Positives = 117/250 (46%), Gaps = 5/250 (2%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSG 623 DF + + L D+ G FTW +K I+ K+DR + N W + + S+ L Sbjct: 28 DFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIAKKLDRILANDSWC--NLYPSSHGLFGN 85 Query: 622 IA-SDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIK 446 + SDH C L + KR F+F N +++ F + D W + ++ G +++ K Sbjct: 86 LDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKK 145 Query: 445 LHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQL 266 L ++ ++ +R +Y+ + +T A L Q L+ P + ELEA++K+ L Sbjct: 146 LKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQNLTLANP-SVSNAALELEAQRKWVLL 204 Query: 265 DNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVAD 86 AE F QR++ D +T YFH +V+ +K NTI+ L NG + I+ Sbjct: 205 SCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDH 264 Query: 85 FVGYYSDLFG 56 V YY L G Sbjct: 265 CVTYYERLLG 274 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 97.1 bits (240), Expect = 7e-18 Identities = 71/250 (28%), Positives = 117/250 (46%), Gaps = 5/250 (2%) Frame = -1 Query: 790 DFVDTAAYLTLQDVPSTGCFFTWRDKC----ISSKIDRTMINTIWLEKDWFCRSNFLTSG 623 DF + + L D+ G FTW +K I+ K+DR + N W + + S+ L Sbjct: 28 DFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIAKKLDRILANDSWC--NLYPSSHGLFGN 85 Query: 622 IA-SDHTPCISTLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIK 446 + SDH C L + KR F+F N +++ F + D W + ++ G +++ K Sbjct: 86 LDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKK 145 Query: 445 LHNLRPILRQLNRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQL 266 L ++ ++ +R +Y+ + +T A L Q L+ P + ELEA++K+ L Sbjct: 146 LKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQNLTLANP-SVSNAALELEAQRKWVLL 204 Query: 265 DNAERDFLAQRAKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVAD 86 AE F QR++ D +T YFH +V+ +K NTI+ L NG + I+ Sbjct: 205 SCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDH 264 Query: 85 FVGYYSDLFG 56 V YY L G Sbjct: 265 CVTYYERLLG 274 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 95.9 bits (237), Expect = 2e-17 Identities = 60/239 (25%), Positives = 105/239 (43%), Gaps = 4/239 (1%) Frame = -1 Query: 760 LQDVPSTGCFFTW----RDKCISSKIDRTMINTIWLEKDWFCRSNFLTSGIASDHTPCIS 593 + D+P G +TW + I+ KIDR ++N WL +F SDH P Sbjct: 176 ISDLPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEF-SDHCPSCV 234 Query: 592 TLFQKVETFKRDFRFCNAWMEHPSFKQTLEDYWATMSITGGKQEQLAIKLHNLRPILRQL 413 + + + F+ N M HP F + + W ++ G L+ K L+ +R Sbjct: 235 NISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTF 294 Query: 412 NRTHYNNLSEKTAAARVQLEDAQRLSDREPLNRLIREAELEARKKYQQLDNAERDFLAQR 233 NR HY+ L ++ A L+ Q P + + E EA + + +L AE FL Q+ Sbjct: 295 NREHYSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAGLEKEAHRSWAELALAEERFLCQK 353 Query: 232 AKAKHINLSDKSTKYFHSLVNRKKLRNTISFLRRENGETTGDVKTIVADFVGYYSDLFG 56 ++ + D +T +FH ++ ++ N I +L + G + + V ++ +LFG Sbjct: 354 SRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFG 412