BLASTX nr result
ID: Akebia26_contig00033121
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia26_contig00033121 (835 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596... 179 1e-42 ref|XP_004253277.1| PREDICTED: uncharacterized protein LOC101244... 175 2e-41 ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262... 174 3e-41 ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260... 174 3e-41 ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom... 172 1e-40 ref|XP_007010390.1| Retrotransposon, unclassified-like protein [... 171 2e-40 ref|XP_004237272.1| PREDICTED: uncharacterized protein LOC101266... 170 5e-40 ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom... 170 7e-40 ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A... 170 7e-40 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 168 2e-39 ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom... 168 3e-39 ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom... 166 8e-39 ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom... 165 2e-38 ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258... 165 2e-38 gb|AAC63844.1| putative non-LTR retroelement reverse transcripta... 165 2e-38 ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobrom... 164 4e-38 ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom... 164 5e-38 ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261... 164 5e-38 ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein A... 164 5e-38 ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom... 162 1e-37 >ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum] Length = 1135 Score = 179 bits (453), Expect = 1e-42 Identities = 101/278 (36%), Positives = 152/278 (54%), Gaps = 7/278 (2%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 + + K SHL +ADD I+F + S+ + L YEK S Q I+L KS ++ ++ Sbjct: 537 YGMPKWSPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQMINLDKSMIYLHKQ 596 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 V +R LVK I G +GS P YLG P+F G+ + L+ K+ +M+ W++KL+S Sbjct: 597 VPNRVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLKKVSNRMNTWQNKLMSF 656 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 G R L++HVL SIPVYL++ + PK+I L FA FFW ++ G H VAW K+C Sbjct: 657 GERYILIAHVLQSIPVYLLAAMNPPKSIIDQLHKLFAIFFWSNSSGARNKHWVAWDKMCY 716 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTE-KGLWIEFLKKKYARDGNWWNPTNSRG--- 121 PK+EG LG R++ +V K K+ W T+ LW F+ KY + +PT +RG Sbjct: 717 PKVEGGLGFRSLHDVSKAFFAKLWWNFRTDTSSLWASFMWNKYCKK---MHPTVARGQGA 773 Query: 120 SKLWR---SIRSFLQPTIRLSKRLIGDGASSSLFLHNW 16 S +WR ++R ++ I + +SS + NW Sbjct: 774 SHVWRKMITVREEVEHNIWWQIK----AGNSSFWFDNW 807 >ref|XP_004253277.1| PREDICTED: uncharacterized protein LOC101244169 [Solanum lycopersicum] Length = 764 Score = 175 bits (443), Expect = 2e-41 Identities = 92/276 (33%), Positives = 157/276 (56%), Gaps = 2/276 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F+++KKG +HL FADD+IIFT + SL I +E YE S+Q+++ +KS F V + Sbjct: 226 FNMNKKGPQVNHLSFADDIIIFTSTDNTSLQLIMKVIEDYEAVSDQKVNKEKSYFMVTPK 285 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 ++ ++ +K I GF + P NYLG PL++G +V K+++K+ GW+SK+L+ Sbjct: 286 TSNGIIDNIKRITGFSMKNSPINYLGCPLYIGGQRIIYFSEVVDKVIKKISGWQSKILNF 345 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL SIP++ ++ + KT ++ A FFWG + K H +W + Sbjct: 346 GGKITLIKHVLQSIPIHTLAAISPHKTTINHIKKLMADFFWGIDKEGKKYHWASWDTMAY 405 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112 P EG +G+R + ++ K K W T+ LW FLK KY + + N+ S + Sbjct: 406 PTNEGGIGVRLLDDICKAFQYKHWWDFRTKNSLWSNFLKSKYCQRAHPVAKKYNTGDSLM 465 Query: 111 WRSI-RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7 WR + R+ ++ +++ ++ +S+L+ NW G+ Sbjct: 466 WRYLTRNRIEVEVQIRWQI--QSGTSNLWWDNWTGN 499 >ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262707 [Solanum lycopersicum] Length = 764 Score = 174 bits (442), Expect = 3e-41 Identities = 93/276 (33%), Positives = 155/276 (56%), Gaps = 2/276 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F+++KKG +HL FADD+IIFT + SL I +E YE S+Q+++ +KS F V + Sbjct: 226 FNMNKKGPQVNHLSFADDIIIFTSTDNTSLQLIMKVIEDYEAVSDQKVNKEKSYFMVTLK 285 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 ++ ++ +K I GF + P NYLG PL++G +V K+++K+ GW+SK+L+ Sbjct: 286 TSNGIIDNIKRITGFSMKNSPINYLGCPLYIGGQRIIYFFEVVDKVIKKISGWQSKILNF 345 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL SIP++ ++ + PKT ++ A FFWG + K H +W + Sbjct: 346 GGKITLIKHVLQSIPIHTLAAISPPKTTINHIKKLMADFFWGIDKEGKKYHWASWDTMAY 405 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112 P EG +G+R + ++ K K W T+ LW FL KY + + N+ S + Sbjct: 406 PTNEGGIGVRLLDDICKAFQYKHWWDFRTKHSLWSNFLMSKYCQRAHPVAKKYNTGDSLM 465 Query: 111 WRSI-RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7 WR + R+ ++ + + + +SSL+ NW G+ Sbjct: 466 WRYLTRNRIEVEVHIRWHI--QSGTSSLWWDNWTGN 499 >ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum lycopersicum] Length = 1531 Score = 174 bits (442), Expect = 3e-41 Identities = 94/278 (33%), Positives = 154/278 (55%), Gaps = 4/278 (1%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 FS+ K G +HL FADD IIFT +SL+ I ++ YE+ +Q+++ KS F V + Sbjct: 961 FSMEKNGPQTNHLSFADDCIIFTSTDRRSLTLIMRIIDDYERVFDQKVNKDKSFFMVTRK 1020 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 + +E +K++ GF + P NYLG PL++G +V K+++++ GW+SK+L+ Sbjct: 1021 TSHEIIEDIKVVTGFGMKNSPINYLGCPLYIGGQRIIYFSEVVEKVIKRISGWQSKILNF 1080 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL ++P++ ++++ PKT ++ A FFWG + K H +W + Sbjct: 1081 GGKVTLVKHVLQAMPIHTLAVMSPPKTTLNYIKRAIADFFWGVDKDGKKYHWASWDTLAY 1140 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGSKL- 112 P EG +G+R + ++ K K W T+K LW +FLK KY + N G L Sbjct: 1141 PTNEGGIGVRLLDDICKAFQYKHWWEFRTKKSLWSQFLKAKYCQRANPVAKKYDTGDSLV 1200 Query: 111 WRSI---RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7 WR + RS ++ IR + I G +S + NW G+ Sbjct: 1201 WRYLTRNRSEMEAYIRWN---INSG-TSKFWWDNWLGN 1234 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 172 bits (436), Expect = 1e-40 Identities = 97/269 (36%), Positives = 144/269 (53%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT + +L I FL+ YE+ S Q+I+ +KS F V+ R +++ Sbjct: 1730 SHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQRINAQKSCFVTHTNVSSSRRQIIA 1789 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF LP YLG PL+ G + LV+KI ++ GW++K+LS GGR+TLL V Sbjct: 1790 QTTGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLKSV 1849 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 LTS+P+YL +L P + + F F WG + K H +W K+ P EG L IR Sbjct: 1850 LTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSAASKKIHWTSWAKISLPVKEGGLDIR 1909 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRSFLQ 85 +++EV + MK+ WR T LW F++ KY R G T + S+ W+ + + Sbjct: 1910 SLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR-GQLPMHTQPKLHDSQTWKRMVASSA 1968 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 T + + +G G + F H+ W G P Sbjct: 1969 ITEQNMRWRVGQG--NLFFWHDCWMGETP 1995 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 171 bits (434), Expect = 2e-40 Identities = 82/212 (38%), Positives = 125/212 (58%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT + L I FL+ YE+ S Q+++ +KS F A + R +++ Sbjct: 644 SHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQKSCFVTANNMPSSRRQIIS 703 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 +GF +LP YLG PLF G + L++KI ++ GW++K+LS GGR+TLL V Sbjct: 704 QTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIRERITGWENKILSPGGRITLLRSV 763 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P + +E F F WGS+ T+ H AWH + P EG LGIR Sbjct: 764 LSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDSTRIHWTAWHNITFPSSEGGLGIR 823 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163 ++ + K+ WR T + LW+ +++ KY Sbjct: 824 SLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKY 855 >ref|XP_004237272.1| PREDICTED: uncharacterized protein LOC101266714 [Solanum lycopersicum] Length = 584 Score = 170 bits (431), Expect = 5e-40 Identities = 82/227 (36%), Positives = 136/227 (59%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 FS+ KKG +HL FADD+IIFT +SL+ I +E YEK S+Q+++ KS F V + Sbjct: 72 FSMEKKGPQINHLSFADDIIIFTSTDRRSLNLIMRIIEDYEKVSDQKVNKDKSFFMVTSK 131 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 + + +K++ F + P +YLG PL++G +V K+++++ GW+SK+L+ Sbjct: 132 TSQYIIGDIKLVTSFCMKNSPIHYLGCPLYIGGQRIIYFSEVVEKVIKRISGWQSKILNY 191 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL ++P+++++ + PKT + ++ A FFWG + K H +W+ + Sbjct: 192 GGKVTLVKHVLQAMPIHILAAMSPPKTTLMYIKREIAAFFWGVDKDGKKYHWASWNTLDY 251 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN 148 P MEG +G+R + +V K K W T+ LW +FLK KY + N Sbjct: 252 PTMEGGIGVRLLDDVCKAFQYKHWWEFRTKGSLWSQFLKAKYCQRAN 298 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 170 bits (430), Expect = 7e-40 Identities = 97/269 (36%), Positives = 149/269 (55%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT +L I FL+ YE+ S QQ++ +KS F A R +++ Sbjct: 1263 SHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQQVNHQKSCFITANGCPMTRRQIIA 1322 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF +LP YLG PL G + L++KI ++ GW++K LS GGR+TLL V Sbjct: 1323 HTTGFQHKTLPVIYLGAPLHKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRITLLRSV 1382 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P + +E F F WG + + H AWHK+ P EG L IR Sbjct: 1383 LSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSEGGLDIR 1442 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85 ++++ +K+ WR +T +GLW +FLK KY + ++ +P S++W+ + + Sbjct: 1443 RLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHP-KLHDSQVWKRMVRGRE 1501 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 I+ ++ IG G S F H+ W G P Sbjct: 1502 VAIQNTRWRIGKG--SLFFWHDCWMGDQP 1528 >ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 955 Score = 170 bits (430), Expect = 7e-40 Identities = 98/275 (35%), Positives = 143/275 (52%), Gaps = 1/275 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F + G +HL FA+D+IIFT +SL I +E YE S+QQ++ KS F V + Sbjct: 239 FQMDSNGPQINHLSFANDIIIFTSTDRQSLQLIVKTIEEYELISDQQVNKDKSFFMVTTK 298 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 + +K+ GF + P YLG PL+VG G+V KI+RK+ GW +K+L+ Sbjct: 299 TNQAIINSIKIETGFGIQNSPITYLGCPLYVGGQRIIYFSGIVEKIIRKISGWHAKILNF 358 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL SIP++L++ + PKT +++ A FFWG + K H +W + Sbjct: 359 GGKITLVKHVLQSIPIHLLAAVSPPKTTLKYIKNVIADFFWGMDKDGKKYHWASWETLAY 418 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGSKL- 112 P EG +G+R + +V K W T+ LW +FLK KY + N G+ L Sbjct: 419 PTNEGGIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYCKRANPVAKKYDTGNSLV 478 Query: 111 WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7 WR Q K I G SSS + NW G+ Sbjct: 479 WRYFTRNRQAVESYIKWNIHSG-SSSFWWDNWLGN 512 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 168 bits (426), Expect = 2e-39 Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT +L I FL+ YE+ S QQ++ +KS F A R +++ Sbjct: 1524 SHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQQVNHQKSCFITANGCPLSRRQIIA 1583 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 + GF +LP YLG PL G + L+SKI ++ GW++K+LS G R+TLL V Sbjct: 1584 QVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSV 1643 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P + +E F F WG + + H AW+K+ P EG L IR Sbjct: 1644 LSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKINFPCSEGGLDIR 1703 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85 + +V +K+ WR T LW FLK KY R ++ P S +W+ I Sbjct: 1704 NLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQP-KIHSSSIWKRITGGRD 1762 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 TI+ ++ IG G F H+ W G P Sbjct: 1763 VTIQNTRWKIGRG--ELFFWHDCWMGDQP 1789 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 168 bits (425), Expect = 3e-39 Identities = 94/269 (34%), Positives = 144/269 (53%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT + +L I FL+ YE+ S Q+I+ +KS F + + R +++ Sbjct: 1560 SHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIA 1619 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF LP YLG PL+ G + LV+KI ++ GW++K+LS GGR+TLL V Sbjct: 1620 QATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSV 1679 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L S+P+YL+ +L P + + F F WG + + H +W K+ P EG L IR Sbjct: 1680 LASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIR 1739 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRSFLQ 85 +++EV + MK+ WR T LW F++ KY R G T + S+ W+ + + Sbjct: 1740 SLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR-GQLPMQTQPKLHDSQTWKRMLTSST 1798 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 T + + +G G + F H+ W G P Sbjct: 1799 ITEQHMRWRVGQG--NVFFWHDCWMGEAP 1825 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 166 bits (421), Expect = 8e-39 Identities = 98/269 (36%), Positives = 140/269 (52%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT SL I FL+ YE+ S QQ++ +KS F A A R +++ Sbjct: 1260 SHLSFADDIVIFTNGCRSSLQKILNFLQEYEQVSGQQVNHQKSCFITANGCALSRRQIIS 1319 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF +LP YLG PL G + L+SKI ++ GW++K+LS GGR+TLL V Sbjct: 1320 HTTGFHHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGGRITLLRSV 1379 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S P+YL+ +L P T+ +E F F WG + K H W K+ P EG L IR Sbjct: 1380 LSSQPMYLLQVLKPPVTVIEKIERIFNSFLWGDSNDGKKLHWTVWSKITFPVSEGGLDIR 1439 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85 + +V + +K+ WR T LW +FL+ KY R ++ P S++W+ R + Sbjct: 1440 NLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLGRIPHFVQP-KLHDSQVWK--RMIVG 1496 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 + L G F H+ W G P Sbjct: 1497 RDVALQNIRWRIGKGELFFWHDCWMGDQP 1525 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 165 bits (417), Expect = 2e-38 Identities = 96/269 (35%), Positives = 143/269 (53%), Gaps = 3/269 (1%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT +L I FL+ YE+ S Q+++ +KS F A + R +++ Sbjct: 1437 SHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKSCFITANGCSLSRRQIIS 1496 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF +LP YLG PL G + L+SKI ++ GW++K+LS GGR+TLL V Sbjct: 1497 HTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKIRDRISGWENKILSPGGRITLLRSV 1556 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P T+ ++ F F WG + K H W K+ P EG LGIR Sbjct: 1557 LSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDSTECKKMHWAEWAKISFPCAEGGLGIR 1616 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85 + +V +K+ WR T LW +FL+ KY R + P S +W+ + S + Sbjct: 1617 KLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQP-KLHDSHVWKRMISGRE 1675 Query: 84 PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 ++ + IG G F H+ W G P Sbjct: 1676 MALQNIRWKIGKG--DLFFWHDCWMGDKP 1702 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 165 bits (417), Expect = 2e-38 Identities = 92/265 (34%), Positives = 141/265 (53%), Gaps = 1/265 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F + G +HL FADD+IIF+ + SL+ I ++ YE+ S+Q+++ KS F V Sbjct: 739 FHMESNGPKINHLSFADDIIIFSSTDNNSLNLIMKTIDQYEEVSDQKVNKDKSFFMVTSN 798 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 + +E + I GF + + P NYLG PL+VG +V K+++K+ GW K+L+ Sbjct: 799 TSHDIIEEISRITGFSRKNSPINYLGCPLYVGGQRIIYYSEIVEKVIKKIAGWHLKILNF 858 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL S+P++ +S + PKTI S++ A FFWG + K H +W+ + Sbjct: 859 GGKVTLVKHVLQSMPIHTLSAISPPKTILNSIKKVIADFFWGIEKDGKKYHWSSWNNMAF 918 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112 P EG +G+R I ++ K W T LW +FLK KY + N N+ S + Sbjct: 919 PTNEGGIGVRLIEDMCTAFQYKQWWAFRTNNSLWSKFLKAKYNQRANPVAKKYNTGDSIV 978 Query: 111 WRSIRSFLQPTIRLSKRLIGDGASS 37 WR + Q L K I G S Sbjct: 979 WRYLTRNRQKVESLIKWHIQSGTCS 1003 >gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1231 Score = 165 bits (417), Expect = 2e-38 Identities = 94/276 (34%), Positives = 154/276 (55%), Gaps = 6/276 (2%) Frame = -2 Query: 825 SVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERV 646 +VS G SH+ FADDLI+F +A+ + I+ LE + +AS Q++SL+KS F + V Sbjct: 531 AVSCGGSKLSHVCFADDLILFAEASVAQIRIIRRVLERFCEASGQKVSLEKSKIFFSHNV 590 Query: 645 ADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEG 466 + +L+ G YLG+P+ ++ +E ++ ++ ++ GWK + LS Sbjct: 591 SREMEQLISEESGIGCTKELGKYLGMPILQKRMNKETFGEVLERVSARLAGWKGRSLSLA 650 Query: 465 GRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKP 286 GR+TL VL+SIPV+++S + +P + +L+ F WGST K K HL++W K+CKP Sbjct: 651 GRITLTKAVLSSIPVHVMSAILLPVSTLDTLDRYSRTFLWGSTMEKKKQHLLSWRKICKP 710 Query: 285 KMEGELGIRTISEVHKTGLMKMCWRLTTEK-GLWIEFLKKKY----ARDGNWWNPTNSRG 121 K EG +G+R+ +++K + K+ WRL +K LW ++KKY +D +W P R Sbjct: 711 KAEGGIGLRSARDMNKALVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKP-QPRW 769 Query: 120 SKLWRSIRSFLQPTIRLSKRLI-GDGASSSLFLHNW 16 S WRS+ L+ + + GDG + +L W Sbjct: 770 SSTWRSVAVGLREVVVKGVGWVPGDGCTIRFWLDRW 805 >ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobroma cacao] gi|508778193|gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao] Length = 1245 Score = 164 bits (415), Expect = 4e-38 Identities = 85/212 (40%), Positives = 120/212 (56%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD++IFT +L I FL+ YE S QQ++ +KS F + R +++ Sbjct: 1012 SHLAFADDIVIFTNGCRPALQKILIFLQEYEAVSGQQVNHQKSCFITSNGCPMTRRQIIA 1071 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF +LP YLG PL G + L++KI ++ GW++K LS GGR+TLL V Sbjct: 1072 HTTGFQHKTLPVIYLGAPLHKGPKKVALFDSLITKIRDRISGWENKTLSPGGRITLLRSV 1131 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P + +E F F WG + + H VAWHK+ P EG + IR Sbjct: 1132 LSSMPMYLLQVLKPPVVVIEKIERLFNSFLWGDSTTDKRMHWVAWHKLTFPCSEGGIDIR 1191 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163 +++V MK+ WR T GLW FLK KY Sbjct: 1192 RLNDVSDAFTMKLWWRFQTCDGLWTNFLKTKY 1223 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 164 bits (414), Expect = 5e-38 Identities = 94/272 (34%), Positives = 144/272 (52%), Gaps = 3/272 (1%) Frame = -2 Query: 807 VAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVE 628 ++ SHL FADD++IFT + +L I FL+ Y++ S Q+I+++KS F V+ R + Sbjct: 1555 ISVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTNVSSSRRQ 1614 Query: 627 LVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLL 448 ++ GF L YLG PL+ G + LV+KI ++ GW++K+LS GGR+TLL Sbjct: 1615 IIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLL 1674 Query: 447 SHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGEL 268 VL S+P+YL+ +L P + + F F WG + K H +W K+ P EG L Sbjct: 1675 RSVLASLPIYLLQVLKPPICVLERVNRIFNSFLWGGSAASKKIHWASWAKISLPIKEGGL 1734 Query: 267 GIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRS 94 IR ++EV + MK+ WR T LW F++ KY R G T + S+ W+ + + Sbjct: 1735 DIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCR-GQLPMHTQPKLHDSQTWKRMVA 1793 Query: 93 FLQPTIRLSKRLIGDGASSSLFLHN-WRGSCP 1 T + + +G G F H+ W G P Sbjct: 1794 NSAITEQNMRWRVGQG--KLFFWHDCWMGETP 1823 >ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum lycopersicum] Length = 1246 Score = 164 bits (414), Expect = 5e-38 Identities = 92/272 (33%), Positives = 141/272 (51%), Gaps = 1/272 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F + + G +HL FADD+IIF S SL I +E YE+ S+QQ++ KS F V Sbjct: 545 FHMERNGPKINHLSFADDIIIFASTDSNSLHLIMKTIELYEEVSDQQVNKHKSFFMVTSN 604 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 +E +K G+ + + P NYLG PL++G +V K+++++ GW SK+L+ Sbjct: 605 TGHDIIEEIKRATGYSRKNSPINYLGCPLYIGGQRIIYYSEVVEKVIKRIAGWHSKILNF 664 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL SIP++ ++ + PKT ++ A FFWG + K H +W + Sbjct: 665 GGKITLVKHVLQSIPIHTLAAISPPKTTLNCIKKLIADFFWGIDKDGKKYHWSSWENMAY 724 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112 P EG +G+R + +V W T+ LW +FLK KY + N +S S + Sbjct: 725 PTSEGGIGVRLLEDVCTAFQYMQWWDFRTKNSLWSQFLKAKYCQRANPLAKKYDSGDSLV 784 Query: 111 WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNW 16 WR + L K I G +SS + NW Sbjct: 785 WRYLTRNRLKVESLIKWQIHSG-TSSFWWDNW 815 >ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 1010 Score = 164 bits (414), Expect = 5e-38 Identities = 94/274 (34%), Positives = 146/274 (53%), Gaps = 1/274 (0%) Frame = -2 Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649 F+++KKG +HL FADD+IIFT KSL I ++ YE S+Q+++ KS Sbjct: 294 FNMNKKGPQINHLSFADDIIIFTSTDLKSLQLIMHTIKEYEGVSDQRVNKDKSFCMATVN 353 Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469 E+VK + G+ + P NYLG PL++G + LV K+++++ GW+SK+L+ Sbjct: 354 TRTDIQEIVKSVTGYHMKTSPINYLGCPLYIGGKSIIYYSELVDKVIKRITGWQSKILNF 413 Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289 GG++TL+ HVL SIP++ ++ + PKTI ++ A FFWGS K H + + Sbjct: 414 GGKITLVKHVLQSIPIHTLATISPPKTIIKNINKVIADFFWGSDSVGKKYHWASLETMAY 473 Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGS-KL 112 P EG +G+R + +V ++ K W T+ LW +FLK KY + N G + Sbjct: 474 PISEGGIGVRLLDDVCRSFQYKHWWEFRTKDTLWSKFLKAKYCQRSNIVAKKFDTGDYVV 533 Query: 111 WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNWRG 10 WR + Q + K I G + S + NW G Sbjct: 534 WRYLTRIRQEVEKYIKWNIHTG-NCSFWWDNWIG 566 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 162 bits (411), Expect = 1e-37 Identities = 81/212 (38%), Positives = 119/212 (56%) Frame = -2 Query: 798 SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619 SHL FADD+IIF + +L I FL+ YE+ S Q+I+ +KS +A R +++ Sbjct: 2811 SHLAFADDVIIFANGSKSALQRILAFLQEYEELSGQRINPQKSCVVTHTNMASSRRQIIL 2870 Query: 618 MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439 GF LP YLG PLF G + LV+KI ++ GW++K+LS GGR+TLL Sbjct: 2871 QATGFSHRPLPITYLGAPLFKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRST 2930 Query: 438 LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259 L+S+P+YL+ +L P + + F F WG + + H +W K+ P EG L IR Sbjct: 2931 LSSLPIYLLQVLKPPIIVLERINRLFNNFLWGGSASSKRIHWASWGKIALPIAEGGLDIR 2990 Query: 258 TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163 + +V K MK+ WR T LW++F++ KY Sbjct: 2991 NLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKY 3022 Score = 155 bits (393), Expect = 1e-35 Identities = 81/206 (39%), Positives = 115/206 (55%) Frame = -2 Query: 780 DDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVKMILGFP 601 DD++IFT SL I FL+ YE+ S QQ++ +KS F A R +++ GF Sbjct: 1023 DDIVIFTNGCRSSLQKILNFLQEYEQVSGQQVNHQKSCFITTNGCALSRRQIISHTTGFH 1082 Query: 600 KGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHVLTSIPV 421 +LP YLG PL G+ + L+SKI ++ GW++K+LS GGR+TLL VL+S P+ Sbjct: 1083 HKTLPVTYLGAPLHKGQKKVILFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSQPM 1142 Query: 420 YLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIRTISEVH 241 YL+ +L P T+ +E F F WG + K H AW K+ P EG L IR + +V Sbjct: 1143 YLLQVLKPPVTVIEKIERLFNSFLWGDSCDGKKLHWTAWSKITFPVSEGGLDIRNLRDVF 1202 Query: 240 KTGLMKMCWRLTTEKGLWIEFLKKKY 163 + +K+ WR T LW FL+ KY Sbjct: 1203 EAFSLKLWWRFQTCNSLWTRFLRTKY 1228