BLASTX nr result
ID: Mentha23_contig00017151
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017151
         (1532 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus...   674   0.0
gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlise...   612   e-172
ref|XP_007048823.1| BED zinc finger,hAT family dimerization doma...   466   e-129
ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prun...   447   e-123
ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phas...   439   e-120
ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Popu...   424   e-116
ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Popu...   424   e-116
emb|CBI20108.3| unnamed protein product [Vitis vinifera]              422   e-115
emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera]   421   e-115
ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prun...   414   e-113
ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phas...   400   e-109
ref|XP_007021998.1| BED zinc finger,hAT family dimerization doma...   398   e-108
ref|XP_007022001.1| BED zinc finger,hAT family dimerization doma...   395   e-107
ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutr...   392   e-106
gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsi...   389   e-105
ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Caps...   384   e-104
dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thalian...   380   e-102
ref|XP_007022002.1| BED zinc finger,hAT family dimerization doma...   378   e-102
ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutr...
373 e-100 ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A... 338 5e-90 >gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus guttatus] Length = 656 Score = 674 bits (1740), Expect = 0.0 Identities = 325/376 (86%), Positives = 352/376 (93%), Gaps = 1/376 (0%) Frame = +3 Query: 6 VCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRG 185 VCRIRDQLCQHRFLMCEGQLFDVRC ASTVKLLVQDVLETSR+ITNKVRETI+Y+KG++ Sbjct: 276 VCRIRDQLCQHRFLMCEGQLFDVRCAASTVKLLVQDVLETSREITNKVRETIRYVKGSQA 335 Query: 186 TQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCPS 365 TQEKFNEIVQLVGI+ QK LSVDNPFQWNST +MLEAALEYKEAFPQLQE+DP FSMCPS Sbjct: 336 TQEKFNEIVQLVGINCQKSLSVDNPFQWNSTCMMLEAALEYKEAFPQLQEHDPGFSMCPS 395 Query: 366 GIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSL 545 IDWDRLR+ITSI KFFHEVSNVF GRKH+T+NSYF EICDIHLQLI WCQKSD+FISSL Sbjct: 396 DIDWDRLRAITSIFKFFHEVSNVFAGRKHITSNSYFNEICDIHLQLIGWCQKSDEFISSL 455 Query: 546 ALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKAL 725 ALKLKSKFDEYWKKCSLIMAIAAILDPR+KM+LVEYYYPQIYGDSAPDCIDIV NCMKAL Sbjct: 456 ALKLKSKFDEYWKKCSLIMAIAAILDPRYKMQLVEYYYPQIYGDSAPDCIDIVKNCMKAL 515 Query: 726 YSGHAIYSPLAAHGQNSSSESN-GIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEPLFP 902 YSGHAIYSPL+AHGQ+S+SES+ I KD+L+GFDRFLHETSVSQNTKSDLDKYLEEPLFP Sbjct: 516 YSGHAIYSPLSAHGQSSASESSVSIVKDKLTGFDRFLHETSVSQNTKSDLDKYLEEPLFP 575 Query: 903 RSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEKSDT 1082 R S+LNWWKVHEPRYPVLSMMARNILGIPISKVA+ESLFDTG+RALDH W T KSDT Sbjct: 576 RKNVISVLNWWKVHEPRYPVLSMMARNILGIPISKVAVESLFDTGERALDHCWSTMKSDT 635 Query: 1083 LQALMCSQDWMRNELE 1130 LQALMCS+DW+ ++ E Sbjct: 636 LQALMCSRDWISSDFE 651 >gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlisea aurea] Length = 647 Score = 612 bits (1578), Expect = e-172 Identities = 289/376 (76%), Positives = 335/376 (89%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV +IRDQLCQHRFLMCEGQLFDVRC STV++LVQ+VLETSR++T KVRE ++Y+KG+R Sbjct: 275 
IVSKIRDQLCQHRFLMCEGQLFDVRCATSTVRVLVQEVLETSREMTKKVREIVRYVKGSR 334 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 EKFNEIV+L+G++ QK+LS+DNP +WNST MLEAALEYKE FPQLQE DP FS P Sbjct: 335 AAYEKFNEIVRLLGVNSQKVLSIDNPLKWNSTSTMLEAALEYKEVFPQLQELDPEFSTWP 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 SG+DWDRLR+I ILKFF EVS VFVG KH+TANS+FAEICDIHL+LI+WCQKSDDFISS Sbjct: 395 SGMDWDRLRAIAGILKFFIEVSEVFVGGKHITANSFFAEICDIHLKLIEWCQKSDDFISS 454 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LALKLKS FDEYWKKCSLIMA+AAILDPR+KMKLVEYYYPQIYGDSAP+CI+IVSNCMK+ Sbjct: 455 LALKLKSVFDEYWKKCSLIMAVAAILDPRYKMKLVEYYYPQIYGDSAPECIEIVSNCMKS 514 Query: 723 LYSGHAIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEPLFP 902 LY+GH IYSPLAAH +S AKDRL+GFDRFLHETSVSQNTKSDL+KYLE+PLFP Sbjct: 515 LYNGHIIYSPLAAH---ASENGGAAAKDRLTGFDRFLHETSVSQNTKSDLEKYLEDPLFP 571 Query: 903 RSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEKSDT 1082 R+ D +IL+WWKV+EPRYPVLSMMARNILGIPISKV+ +++FDTG++ +DH W T KS+T Sbjct: 572 RNNDLNILSWWKVNEPRYPVLSMMARNILGIPISKVSSDAVFDTGNKPIDHCWATLKSET 631 Query: 1083 LQALMCSQDWMRNELE 1130 LQALMCSQDW+ NELE Sbjct: 632 LQALMCSQDWLHNELE 647 >ref|XP_007048823.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao] gi|508701084|gb|EOX92980.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao] Length = 657 Score = 466 bits (1200), Expect = e-129 Identities = 224/384 (58%), Positives = 289/384 (75%), Gaps = 8/384 (2%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RIRD+L Q+RFL C GQLFDVRC + +VQD L+ ++T K+RE+I+Y+K + Sbjct: 273 IVDRIRDRLSQNRFLYCNGQLFDVRCAVDLLNRMVQDALDAVCEVTQKIRESIRYVKSSE 332 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDP-SFSMC 359 TQ F E+ V + QK L +DNP +WNST++MLE ALEY++ F LQ+ DP + Sbjct: 333 ATQSMFIELAHEVQVESQKCLRIDNPLKWNSTFLMLEVALEYRKVFCCLQDRDPVNMKFL 392 Query: 360 PSGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFIS 539 
PS ++WDR+ I S LK F EV+NVF K+ TAN +F EICDIHLQLI+WC+ DD+I+ Sbjct: 393 PSDLEWDRVSVIASFLKLFVEVTNVFTRSKYPTANIFFPEICDIHLQLIEWCKNPDDYIN 452 Query: 540 SLALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMK 719 SLA+K++ KF++YW KCSL +A+AA+LDPRFKMKL+EYYYPQ+YGDSA + ID V C+K Sbjct: 453 SLAVKMRKKFEDYWDKCSLGLAVAAMLDPRFKMKLLEYYYPQLYGDSASELIDDVFECIK 512 Query: 720 ALYSGHAIYSPLAAH-GQNSSSESNGI------AKDRLSGFDRFLHETSVSQNTKSDLDK 878 +LY+ H++ SPLA+ Q S + +GI ++DRL GFD+FLHETS S + SDLDK Sbjct: 513 SLYNEHSMVSPLASSLDQGLSWQVSGIPGSGKDSRDRLMGFDKFLHETSQSDGSNSDLDK 572 Query: 879 YLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHS 1058 YLE+PLFPR+VDF+ILNWWKVH P YP+LSMMA NILGIPISKVA ES FDTG R +DH+ Sbjct: 573 YLEDPLFPRNVDFNILNWWKVHTPSYPILSMMAHNILGIPISKVAAESTFDTGGRVVDHN 632 Query: 1059 WGTEKSDTLQALMCSQDWMRNELE 1130 W + T+QALMCSQDW+R+ELE Sbjct: 633 WSSLPPTTVQALMCSQDWIRSELE 656 >ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prunus persica] gi|462413140|gb|EMJ18189.1| hypothetical protein PRUPE_ppa002590mg [Prunus persica] Length = 655 Score = 447 bits (1150), Expect = e-123 Identities = 212/383 (55%), Positives = 281/383 (73%), Gaps = 7/383 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 +V RIRD+L Q++ L C+GQLFDVRC A+ + ++ QD LE ++T+K+R +I+Y+K ++ Sbjct: 272 VVFRIRDRLSQNKLLSCDGQLFDVRCAANVINMMSQDALEALCEMTDKIRGSIRYVKSSQ 331 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 QEKFN IV VG ++ L +DNP QWNSTYVM+E ALEY++AF LQE DP ++MCP Sbjct: 332 VIQEKFNSIVHQVGGESRRCLCLDNPLQWNSTYVMVEIALEYRDAFALLQENDPVYAMCP 391 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S ++WDR+ ITS LK F V+NVF K TAN YF E+C+++ QL +WC+ +DD+ISS Sbjct: 392 SDVEWDRVNIITSYLKLFVGVTNVFTRFKSPTANLYFPELCEVYSQLNEWCKNADDYISS 451 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LALK++SKF+EYW +CSL +A+A +LDPRFKMK V+YYY Q +G AP I V C+K Sbjct: 452 LALKMRSKFEEYWMRCSLSLAVAVMLDPRFKMKPVDYYYAQFFGSGAPGRISDVFECVKT 511 
Query: 723 LYSGHA-----IYSPLA--AHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKY 881 LY+ H+ + LA G + S +DRL+GFD+FLHET+ TKSDLDKY Sbjct: 512 LYNEHSTCLAYVDQGLAWQVGGSSRLPGSGRDLRDRLTGFDKFLHETTEIDGTKSDLDKY 571 Query: 882 LEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSW 1061 LEEPLFPR+ +F ILNWWKVH PRYP+LSMMARN+LGIP+SKV ++S F+TG R LD W Sbjct: 572 LEEPLFPRNAEFDILNWWKVHAPRYPILSMMARNVLGIPVSKVPIDSTFNTGGRVLDRDW 631 Query: 1062 GTEKSDTLQALMCSQDWMRNELE 1130 + T+QALMC+QDW+R+ELE Sbjct: 632 SSMNPATIQALMCAQDWIRSELE 654 >ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phaseolus vulgaris] gi|561019590|gb|ESW18361.1| hypothetical protein PHAVU_006G034500g [Phaseolus vulgaris] Length = 663 Score = 439 bits (1130), Expect = e-120 Identities = 220/387 (56%), Positives = 280/387 (72%), Gaps = 10/387 (2%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 I RI D+L Q+RFL C GQLFD+RC A+ + +VQ L +I K+RETI YIK ++ Sbjct: 277 IAVRIGDRLLQNRFLYCNGQLFDIRCAANVINAMVQHALGAVSEIVIKIRETIGYIKSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 KFNE+ + VGI QK L +DN QWNSTY MLE ALE+K+ LQE D ++ + Sbjct: 337 IILAKFNEMAKEVGILSQKGLCLDNASQWNSTYSMLEVALEFKDVLILLQENDAAYKVYL 396 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S ++W+R+ ++TS LK F EV NVF K+ TAN YF E+CD+ L LI+WC+ SD++ISS Sbjct: 397 SDVEWERVTAVTSYLKLFVEVINVFTKNKYPTANIYFPELCDVKLHLIEWCKNSDEYISS 456 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA +L+SKFDEYW+KCSL +A+AA+LDPRFKMKLV+YYYPQIYG + I+ V + +KA Sbjct: 457 LASRLRSKFDEYWEKCSLGLAVAAMLDPRFKMKLVDYYYPQIYGSMSASRIEEVFDGVKA 516 Query: 723 LYSGHAIYSPLAAHGQ-------NSSSESNGIAK---DRLSGFDRFLHETSVSQNTKSDL 872 LY+ H+I SPLA+H Q N G AK DRL GFD+FLHETS + TKSDL Sbjct: 517 LYNEHSIGSPLASHDQGLAWQVGNGPLLLQGSAKDSRDRLMGFDKFLHETSQGEGTKSDL 576 Query: 873 DKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALD 1052 DKYLEEPLFPR+VDF+ILNWW+VH PRYPVLSMMARN+LGIP++KVA E F+ R LD Sbjct: 577 
DKYLEEPLFPRNVDFNILNWWRVHTPRYPVLSMMARNVLGIPMAKVAPELAFNHSGRVLD 636 Query: 1053 HSWGTEKSDTLQALMCSQDWMRNELED 1133 W + T+QAL+CSQDW+R+ELE+ Sbjct: 637 RDWSSLNPATVQALVCSQDWIRSELEN 663 >ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Populus trichocarpa] gi|550328098|gb|ERP55512.1| hypothetical protein POPTR_0011s10500g [Populus trichocarpa] Length = 673 Score = 424 bits (1090), Expect = e-116 Identities = 211/395 (53%), Positives = 283/395 (71%), Gaps = 5/395 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D++ Q+R L+ GQLFDVR A + L+VQD +ET R++T KVR +++Y+K ++ Sbjct: 279 IVLRIKDRISQNRPLLSNGQLFDVRSAAHVLNLIVQDAMETIREVTEKVRGSVRYVKSSQ 338 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI + +GIS QK L +D P +WNSTY MLE + YK AF LQE DP+++ Sbjct: 339 VIQGKFNEIAEQIGISSQKNLVLDLPTRWNSTYFMLETVIGYKSAFCFLQERDPAYTSAL 398 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 + +W+ SIT LK F E++N+F G K TAN YF EICD+H+QLI+WC+ DDF+SS Sbjct: 399 TDTEWEWASSITGYLKLFVEITNIFSGDKCPTANIYFPEICDVHIQLIEWCKNPDDFLSS 458 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 +A K+K+KFD YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I VS+ +K Sbjct: 459 MASKMKAKFDRYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 518 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S L G +S ++ ++DRL GFD+FLHE+S Q+ SDLDKYLEE Sbjct: 519 LFNAYSICSTLVDQGSTLPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSAISDLDKYLEE 578 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 P+FPR+ DF+ILNWWKVH PRYP+LSMMAR+ILG P+S +A E F G R LD + Sbjct: 579 PVFPRNCDFNILNWWKVHTPRYPILSMMARDILGTPMSTIAPELAFGVGGRVLDSYRSSL 638 Query: 1071 KSDTLQALMCSQDWMRNELED-SKTPAFALHSDAN 1172 DT QAL+C++DW++ E ED + + A AL+ +AN Sbjct: 639 NPDTRQALICTRDWLQVESEDHNPSSALALYVEAN 673 >ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Populus trichocarpa] gi|550349246|gb|ERP66636.1| hypothetical protein 
POPTR_0001s39240g [Populus trichocarpa] Length = 673 Score = 424 bits (1089), Expect = e-116 Identities = 209/395 (52%), Positives = 286/395 (72%), Gaps = 5/395 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D++ Q+R L+ GQLFDVR + L+V+D +ET +++T KVR ++ Y+K ++ Sbjct: 279 IVLRIKDRISQNRPLLSNGQLFDVRSAVHVLNLIVKDAMETLQEVTEKVRGSVSYVKSSQ 338 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFN+I Q +GIS Q+ L +D+ +WNSTY MLE + YK AF LQE+DP+++ Sbjct: 339 VIQGKFNDIAQQIGISSQRNLVLDSSTRWNSTYSMLETVIGYKSAFCFLQEHDPAYTSAL 398 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S I+W+ +SIT LK F E++N+F G K TAN YF EICD+H+QLI+WC+ DDF+SS Sbjct: 399 SDIEWEWAKSITGYLKLFVEITNIFSGDKCPTANRYFPEICDVHIQLIEWCKNPDDFLSS 458 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 +A K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I VS+ +K Sbjct: 459 IASKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 518 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S L G +S ++ ++DRL GFD+FLHE+S Q++ SDLDKYLEE Sbjct: 519 LFNAYSICSTLVDQGSALPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSSISDLDKYLEE 578 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 P+FPR+ DF+ILNWWKVH PRYP+LSMMAR+ILG P+S V+ E F G R LD + Sbjct: 579 PVFPRNCDFNILNWWKVHTPRYPILSMMARDILGTPMSTVSPELAFGVGGRVLDSYRSSL 638 Query: 1071 KSDTLQALMCSQDWMRNELED-SKTPAFALHSDAN 1172 DT QAL+C++DW+R E ED + + A AL+ +AN Sbjct: 639 NPDTRQALICTRDWLRVESEDHNPSSALALYVEAN 673 >emb|CBI20108.3| unnamed protein product [Vitis vinifera] Length = 677 Score = 422 bits (1085), Expect = e-115 Identities = 212/404 (52%), Positives = 281/404 (69%), Gaps = 14/404 (3%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 + R+++ Q R L+ GQL DVRC + L+VQD +E R++T+K+RE+++Y+K ++ Sbjct: 275 VALRVKEHFSQDRPLLGSGQLLDVRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQ 334 Query: 183 
GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 T KFNEI Q VGI+ Q+ L +D P QWNSTY+ML+ LEYK AF LQE+DP +++ Sbjct: 335 ATLGKFNEIAQQVGINSQQNLFLDCPTQWNSTYLMLDRVLEYKGAFSLLQEHDPGYTVAL 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ SITS +K E+ V K TAN YF EICDIH+QLI+WC+ DDFISS Sbjct: 395 SDTEWEWASSITSYMKLLLEIIAVLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISS 454 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LALK+K+KFD+YW KCSL +A+A ILDPRFKMKLVEYYYPQIYG A D I VS+ +K Sbjct: 455 LALKMKAKFDKYWSKCSLALAVAVILDPRFKMKLVEYYYPQIYGTDAADRIKDVSDGIKE 514 Query: 723 LYSGH-----AIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887 L++ + +++ +A G + S SN ++DRL GFD+F+HETS +QN SDLDKYLE Sbjct: 515 LFNVYCSTSASLHQGVALPGSSLPSTSND-SRDRLKGFDKFIHETSQNQNIVSDLDKYLE 573 Query: 888 EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067 EP+FPR+ DF ILNWWKV +PRYP+LSMM R++LGIP+S VA E +F TG R LDH + Sbjct: 574 EPVFPRNCDFHILNWWKVQKPRYPILSMMVRDVLGIPMSTVAPEVVFSTGARVLDHYRSS 633 Query: 1068 EKSDTLQALMCSQDWMRNELED---------SKTPAFALHSDAN 1172 DT QAL+C+QDW++ LE+ S PA L +AN Sbjct: 634 LNPDTRQALICTQDWLQTGLEEPNQSSPHQTSPHPAIPLAIEAN 677 >emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera] Length = 667 Score = 421 bits (1083), Expect = e-115 Identities = 206/381 (54%), Positives = 274/381 (71%), Gaps = 5/381 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 + R+++ Q R L+ GQL DVRC + L+VQD +E R++T+K+RE+++Y+K ++ Sbjct: 275 VALRVKEHFSQDRPLLGSGQLLDVRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQ 334 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 T KFNEI Q VGI+ Q+ L +D P QWNSTY+ML+ LEYK AF LQE+DP +++ Sbjct: 335 ATLGKFNEIAQQVGINSQQNLFLDCPTQWNSTYLMLDTVLEYKGAFSLLQEHDPGYTVAL 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ SITS +K E+ V K TAN YF EICDIH+QLI+WC+ DDFISS Sbjct: 395 SDTEWEWASSITSYMKLLLEIIAVLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISS 454 
Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LALK+K+KFD+YW KCSL +A+A ILDPRFKMKLVEYYYPQIYG+ A D I VS+ +K Sbjct: 455 LALKMKAKFDKYWSKCSLALAVAVILDPRFKMKLVEYYYPQIYGNDAADRIKDVSDGIKE 514 Query: 723 LYSGH-----AIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887 L++ + +++ +A G + S SN ++DRL GFD+F+HETS +QN SDLDKYLE Sbjct: 515 LFNVYCSTSASLHQGVALPGSSLPSTSND-SRDRLKGFDKFIHETSQNQNIVSDLDKYLE 573 Query: 888 EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067 EP+FPR+ DF ILNWWKV +PRYP+LSMM R++LGIP+S VA E +F TG R LDH + Sbjct: 574 EPVFPRNCDFHILNWWKVQKPRYPILSMMVRDVLGIPMSTVAPEVVFSTGARVLDHYRSS 633 Query: 1068 EKSDTLQALMCSQDWMRNELE 1130 DT QAL+C+QDW++ LE Sbjct: 634 LNPDTRQALICTQDWLQTGLE 654 >ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prunus persica] gi|462409466|gb|EMJ14800.1| hypothetical protein PRUPE_ppa002416mg [Prunus persica] Length = 675 Score = 414 bits (1063), Expect = e-113 Identities = 205/393 (52%), Positives = 276/393 (70%), Gaps = 4/393 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D++ Q R L GQLFD+R A + +VQDVLE R++ K+R + ++++ ++ Sbjct: 277 IVLRIKDRISQSRPLAGHGQLFDIRSAAHLLNSIVQDVLEALREVIQKIRGSFKHVRSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI Q VGI+ ++ L +D P +WNSTY+MLE ALEY+ AF LQE+DPS++ Sbjct: 337 VVQGKFNEIAQQVGINSERRLILDFPVRWNSTYIMLETALEYRGAFSLLQEHDPSYASSL 396 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 + +W+ +T LK E++NVF G K TA+ YF EIC +H+QLI+WC+ DDF+S Sbjct: 397 TDTEWEWTSFVTGYLKLLVEITNVFSGNKSPTASIYFPEICHVHIQLIEWCKSPDDFLSC 456 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 +ALK+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I VS+ +K Sbjct: 457 MALKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 516 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L+ ++I S + G +S ++ +DRL GFD+FL+ETS SQN SDLDKYLEE Sbjct: 517 
LFDAYSICSTMVDQGSALPGSSLPSTSSDTRDRLKGFDKFLYETSQSQNVISDLDKYLEE 576 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 P+FPR+ DF+ILNWWKVH PRYP+LSMMAR++LG P+S VA ES F G R LD + Sbjct: 577 PVFPRNCDFNILNWWKVHTPRYPILSMMARDVLGTPMSTVAPESAFSIGGRVLDQCRSSL 636 Query: 1071 KSDTLQALMCSQDWMRNELEDSKTPAFALHSDA 1169 D QAL+C+QDW++ EL+D F+ HS A Sbjct: 637 NPDIRQALVCTQDWLQVELKD--VNPFSSHSAA 667 >ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phaseolus vulgaris] gi|561006312|gb|ESW05306.1| hypothetical protein PHAVU_011G169000g [Phaseolus vulgaris] Length = 672 Score = 400 bits (1029), Expect = e-109 Identities = 194/381 (50%), Positives = 267/381 (70%), Gaps = 4/381 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 I RI++++ + R + QL D+R A + + QD +E +++ K+RE+I+Y++ ++ Sbjct: 277 ITLRIKERVSEKRPFLSTRQLLDIRSAAHLINSIAQDAMEALQEVIQKIRESIRYVRSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI Q I+ QK+L +D P QW STY+MLE A+EY+ AF Q++DPS+S Sbjct: 337 VVQAKFNEIAQHATINTQKVLFLDFPVQWKSTYLMLETAVEYRSAFSLFQDHDPSYSSTL 396 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ S+T LK E++NVF G K TAN YF EICD H+QLI WC+ SD F+S Sbjct: 397 SDEEWEWATSVTGYLKLLVEITNVFSGNKFPTANVYFPEICDAHIQLIDWCRSSDSFLSP 456 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 +A+K+K+KFD+YW KCSL +A+AA+LDPRFKMKLVEYYY IYG +A + I VS+ +K Sbjct: 457 MAMKMKAKFDKYWGKCSLALALAAVLDPRFKMKLVEYYYSLIYGSTALERIKEVSDGIKE 516 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S + G +S ++ ++DRL GFDRFLHETS SQ+ SDLDKYLEE Sbjct: 517 LFNAYSICSTMIDQGSALPGSSLPSTSCSSRDRLKGFDRFLHETSQSQSMTSDLDKYLEE 576 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 P+FPR+ DF+ILNWWKVH PRYP+LSMMAR++LG P+S +A E F TG R LD S + Sbjct: 577 PIFPRNSDFNILNWWKVHMPRYPILSMMARDVLGTPMSTLAPELAFTTGGRVLDSSRSSL 636 Query: 1071 KSDTLQALMCSQDWMRNELED 1133 DT +AL+C+QDW+RNE D 
Sbjct: 637 NPDTREALICTQDWLRNESGD 657 >ref|XP_007021998.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] gi|590611078|ref|XP_007021999.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] gi|590611082|ref|XP_007022000.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] gi|508721626|gb|EOY13523.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] gi|508721627|gb|EOY13524.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] gi|508721628|gb|EOY13525.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] Length = 672 Score = 398 bits (1022), Expect = e-108 Identities = 202/398 (50%), Positives = 275/398 (69%), Gaps = 8/398 (2%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI++Q+ ++R + GQL DVR A + LVQD +E + + K+R +++Y+K ++ Sbjct: 277 IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI Q GI QK L +D P +WNSTYVMLE A+EY+ AF L E DP ++ Sbjct: 337 SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ S+T LK F E+ NVF G K TAN YF EIC +H+QLI+WC+ D+F+SS Sbjct: 395 SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I VS+ +K Sbjct: 455 LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S L G +S S+ ++DRL GFD+FLHET+ SQ+ SDL+KYLEE Sbjct: 515 LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD + Sbjct: 575 
AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLDSCRSSL 634 Query: 1071 KSDTLQALMCSQDWMRNELED----SKTPAFALHSDAN 1172 +DT QAL+C++DW+ + +D S A L+ +AN Sbjct: 635 TADTRQALICTRDWLWMQSDDPSPSSSHYALPLYVEAN 672 >ref|XP_007022001.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma cacao] gi|590611092|ref|XP_007022003.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma cacao] gi|508721629|gb|EOY13526.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma cacao] gi|508721631|gb|EOY13528.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma cacao] Length = 689 Score = 395 bits (1016), Expect = e-107 Identities = 196/375 (52%), Positives = 265/375 (70%), Gaps = 4/375 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI++Q+ ++R + GQL DVR A + LVQD +E + + K+R +++Y+K ++ Sbjct: 277 IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI Q GI QK L +D P +WNSTYVMLE A+EY+ AF L E DP ++ Sbjct: 337 SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ S+T LK F E+ NVF G K TAN YF EIC +H+QLI+WC+ D+F+SS Sbjct: 395 SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I VS+ +K Sbjct: 455 LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514 Query: 723 LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S L G +S S+ ++DRL GFD+FLHET+ SQ+ SDL+KYLEE Sbjct: 515 LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD + Sbjct: 575 AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLDSCRSSL 634 Query: 
1071 KSDTLQALMCSQDWM 1115 +DT QAL+C++DW+ Sbjct: 635 TADTRQALICTRDWL 649 >ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutrema salsugineum] gi|557108189|gb|ESQ48496.1| hypothetical protein EUTSA_v10020233mg [Eutrema salsugineum] Length = 662 Score = 392 bits (1007), Expect = e-106 Identities = 194/382 (50%), Positives = 265/382 (69%), Gaps = 6/382 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D + Q ++ GQL++++ + LVQD LE R + K+R +++Y+K ++ Sbjct: 276 IVLRIKDHMSQSSPILINGQLYELKSANHLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQ 335 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 TQ +FNEI QL GI+ +K+L +D+ WNSTY MLE LEY+ AF L+++D F Sbjct: 336 STQARFNEIAQLAGINSEKILVLDSLGTWNSTYAMLETVLEYQGAFCHLRDHDHGFDSSL 395 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 + +W+ RS+T LK E++ F G + TAN YFAE+CDIH+QLI+WC+ D F+SS Sbjct: 396 TDEEWEWTRSVTGYLKLVFEIAADFSGNRCPTANVYFAEMCDIHIQLIEWCKNQDSFLSS 455 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA K+K+KFDEYW KCSL++AIAAILDPRFKMKLVEYYY +IYG A D I VSN +K Sbjct: 456 LAAKMKAKFDEYWNKCSLVLAIAAILDPRFKMKLVEYYYSKIYGSVALDRIKEVSNGVKE 515 Query: 723 LYSGHAIYSPLAAHGQNSSSESNGIA------KDRLSGFDRFLHETSVSQNTKSDLDKYL 884 L +++ S + G++SS +G+A +DRL GFD+FLHETS +QNT SDLDKYL Sbjct: 516 LLDAYSMCSSI--DGEDSSFSGSGLARGSMDTRDRLKGFDKFLHETSQNQNTTSDLDKYL 573 Query: 885 EEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWG 1064 EP+FPRS +F+ILN+WKVH PRYP+LSMMAR+ILG P+S +A +S F++G +D S Sbjct: 574 SEPIFPRSGEFNILNYWKVHTPRYPILSMMARDILGTPMSILAPDSTFNSGRPVIDESKS 633 Query: 1065 TEKSDTLQALMCSQDWMRNELE 1130 + D QAL C+ DW+ E E Sbjct: 634 SLSPDIRQALFCAHDWLSTEAE 655 >gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsis thaliana] Length = 676 Score = 389 bits (999), Expect = e-105 Identities = 191/392 (48%), Positives = 267/392 (68%), Gaps = 14/392 (3%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 + 
+IRD+L Q++FL C GQLFDV C + + +VQD LE N +RE+I+Y+K + Sbjct: 285 VASKIRDRLSQNKFLYCYGQLFDVSCGVNVINEMVQDSLEACCDTINIIRESIRYVKSSE 344 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q++FN+ + G ++ L +D+P +W+ST MLE ALE K AF + E+DP +CP Sbjct: 345 SIQDRFNQWIVETGAVSERNLCIDDPMRWDSTCTMLENALEQKSAFSLMNEHDPDSVLCP 404 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S ++W+RL +I LK F EV N F + AN YF E+CDIHL+LI+W + DDFISS Sbjct: 405 SDLEWERLGTIVEFLKVFVEVINAFTKSSCLPANMYFPEVCDIHLRLIEWSKNPDDFISS 464 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 L + ++ KFD++W K L++AIA ILDPRFKMKLVEYYYP YG SA + I+ +S C+K Sbjct: 465 LVVNMRKKFDDFWDKNYLVLAIATILDPRFKMKLVEYYYPLFYGTSASELIEDISECIKL 524 Query: 723 LYSGHAIYSPLAAHG-----QNSSSESNGIA-----KDRLSGFDRFLHETSVS--QNTKS 866 LY H++ S LA+ QN SNG+A DRL+ FDR+++ET+ + Q++KS Sbjct: 525 LYDEHSVGSLLASSNQALDWQNHHHRSNGVAHGKEPDDRLTEFDRYINETTTTPGQDSKS 584 Query: 867 DLDKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVAL-ESLFDTGD- 1040 DL+KYLEEPLFPR+ DF ILNWWKVH P+YP+LSMMARN+L +P+ V+ E F+T Sbjct: 585 DLEKYLEEPLFPRNSDFDILNWWKVHTPKYPILSMMARNVLAVPMLNVSSEEDAFETCQR 644 Query: 1041 RALDHSWGTEKSDTLQALMCSQDWMRNELEDS 1136 R + +W + + T+QALMC+QDW+++ELE S Sbjct: 645 RRVSETWRSLRPSTVQALMCAQDWIQSELESS 676 >ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Capsella rubella] gi|565479004|ref|XP_006297142.1| hypothetical protein CARUB_v10013145mg [Capsella rubella] gi|482565850|gb|EOA30039.1| hypothetical protein CARUB_v10013145mg [Capsella rubella] gi|482565851|gb|EOA30040.1| hypothetical protein CARUB_v10013145mg [Capsella rubella] Length = 667 Score = 384 bits (986), Expect = e-104 Identities = 191/382 (50%), Positives = 262/382 (68%), Gaps = 5/382 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D + Q ++ GQLF+++ A + LVQD LE R + K+R +++Y+K ++ Sbjct: 276 IVLRIKDHMSQSSQILINGQLFELKSAAHLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQ 335 Query: 183 
GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q +FNEI QL GI+ K+L +D+ NSTYVMLE LEYK AF L+++D F Sbjct: 336 SAQVRFNEIAQLAGINSHKILVLDSLVNSNSTYVMLETVLEYKGAFCHLRDHDHGFDSSL 395 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 + +W+ R +T LK ++++ F G K TAN YF E+CDIH+QLI+WC+ D+F+SS Sbjct: 396 TDEEWEWTRYVTGYLKLVFDIASDFSGNKCPTANVYFPEMCDIHIQLIEWCKNQDNFLSS 455 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA +K+KFDEYW KCSL++AIAAILDPR+KMKLVEYYY +IYG +A D I VSN +K Sbjct: 456 LAASMKAKFDEYWNKCSLVLAIAAILDPRYKMKLVEYYYSKIYGSTALDRIKEVSNGVKE 515 Query: 723 LYSGHAIYSPLAAHGQNSSSESNGI-----AKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887 L +++ S + G++SS +G+ +DRL GFD+FLHETS +QNT SDLDKYL Sbjct: 516 LLDAYSMCSAIV--GEDSSFSGSGLGRAMDTRDRLKGFDKFLHETSQNQNTTSDLDKYLS 573 Query: 888 EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067 EP FPRS +F+ILN+WKVH PRYP+LSMMAR+ILG PIS +A +S F++G + S + Sbjct: 574 EPNFPRSGEFNILNYWKVHTPRYPILSMMARDILGTPISIIAPDSTFNSGTPMIADSQSS 633 Query: 1068 EKSDTLQALMCSQDWMRNELED 1133 D QAL C+ DW+ E E+ Sbjct: 634 LNPDIRQALFCAHDWLSTETEE 655 >dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thaliana] gi|18176330|gb|AAL60024.1| unknown protein [Arabidopsis thaliana] gi|20465375|gb|AAM20091.1| unknown protein [Arabidopsis thaliana] Length = 662 Score = 380 bits (975), Expect = e-102 Identities = 186/379 (49%), Positives = 260/379 (68%), Gaps = 3/379 (0%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI+D + Q ++ GQLF+++ A + LV+D LE R + K+R +++Y+K ++ Sbjct: 277 IVLRIKDHMSQSSQILINGQLFELKSAAHLLNSLVEDCLEAMRDVIQKIRGSVRYVKSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 TQ +FNEI QL GI+ QK+L +D+ NST+VMLE LEYK AF L+++D SF Sbjct: 337 STQVRFNEIAQLAGINSQKILVLDSIVNSNSTFVMLETVLEYKGAFCHLRDHDHSFDSSL 396 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 + +W+ R +T LK ++++ F K TAN YFAE+CDIH+QL++WC+ D+F+SS Sbjct: 397 
TDEEWEWTRYVTGYLKLVFDIASDFSANKCPTANVYFAEMCDIHIQLVEWCKNQDNFLSS 456 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA +K+KFDEYW KCSL++AIAAILDPRFKMKLVEYYY +IYG +A D I VSN +K Sbjct: 457 LAANMKAKFDEYWNKCSLVLAIAAILDPRFKMKLVEYYYSKIYGSTALDRIKEVSNGVKE 516 Query: 723 LYSGHAIYSPLAAHGQNSSS---ESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEP 893 L +++ S + S S ++ +DRL GFD+FLHETS +QNT +DLDKYL EP Sbjct: 517 LLDAYSMCSAIVGEDSFSGSGLGRASMDTRDRLKGFDKFLHETSQNQNTTTDLDKYLSEP 576 Query: 894 LFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEK 1073 +FPRS +F+ILN+WKVH PRYP+LS++AR+ILG P+S A +S F++G + S + Sbjct: 577 IFPRSGEFNILNYWKVHTPRYPILSLLARDILGTPMSICAPDSTFNSGTPVISDSQSSLN 636 Query: 1074 SDTLQALMCSQDWMRNELE 1130 D QAL C+ DW+ E E Sbjct: 637 PDIRQALFCAHDWLSTETE 655 >ref|XP_007022002.1| BED zinc finger,hAT family dimerization domain isoform 5 [Theobroma cacao] gi|508721630|gb|EOY13527.1| BED zinc finger,hAT family dimerization domain isoform 5 [Theobroma cacao] Length = 639 Score = 378 bits (970), Expect = e-102 Identities = 188/354 (53%), Positives = 251/354 (70%), Gaps = 4/354 (1%) Frame = +3 Query: 3 IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182 IV RI++Q+ ++R + GQL DVR A + LVQD +E + + K+R +++Y+K ++ Sbjct: 277 IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336 Query: 183 GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362 Q KFNEI Q GI QK L +D P +WNSTYVMLE A+EY+ AF L E DP ++ Sbjct: 337 SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394 Query: 363 SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542 S +W+ S+T LK F E+ NVF G K TAN YF EIC +H+QLI+WC+ D+F+SS Sbjct: 395 SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454 Query: 543 LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722 LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I VS+ +K Sbjct: 455 LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514 Query: 723 
LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 L++ ++I S L G +S S+ ++DRL GFD+FLHET+ SQ+ SDL+KYLEE Sbjct: 515 LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALD 1052 +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD Sbjct: 575 AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLD 628 >ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutrema salsugineum] gi|557087376|gb|ESQ28228.1| hypothetical protein EUTSA_v10018229mg [Eutrema salsugineum] Length = 674 Score = 373 bits (958), Expect = e-100 Identities = 185/391 (47%), Positives = 261/391 (66%), Gaps = 16/391 (4%) Frame = +3 Query: 12 RIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRGTQ 191 +IRD+L Q++FL C GQLFDV C + + QD L+T + +K+R I+Y+K + Q Sbjct: 284 KIRDRLSQNKFLYCNGQLFDVSCGVYVINQMAQDSLQTCCETIDKIRNCIRYVKSSESIQ 343 Query: 192 EKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSM-CPSG 368 E FN+ G +K L +D+ +W++T MLE LE K F ++E DP + CPS Sbjct: 344 ESFNQWRAEAGAESEKDLCIDDSTRWDTTCSMLEIVLEQKNVFLLMKERDPDSCLPCPSD 403 Query: 369 IDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSLA 548 ++W+RL +I LK F EV+N F +TAN YF EICDIHL+LI+W + +DDFISS+A Sbjct: 404 LEWERLETIVGFLKVFVEVANAFTKSSCLTANIYFPEICDIHLRLIEWSKNTDDFISSVA 463 Query: 549 LKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKALY 728 + ++ FDE+W K +L++AIA ILDPRFKMKLVEYYYP Y SA + I+ +S C+KALY Sbjct: 464 VNMRKLFDEFWDKNNLVLAIATILDPRFKMKLVEYYYPLFYDSSASELIEDISECIKALY 523 Query: 729 SGHAIYSPLAAHG-----QNSSSESNGIA-----KDRLSGFDRFLHETSVS---QNTKSD 869 + H++ S LA+ Q + + NG+ +RL FDR++H+T+ + Q+++SD Sbjct: 524 NEHSVRSLLASSDQALDWQENHHQPNGVVHGIEPDNRLIEFDRYIHDTTTTTQGQDSRSD 583 Query: 870 LDKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALE--SLFDTGDR 1043 LDKYLEEPLFPR+ DF ILNWWKVH PRYP+LS MARN+L +P+S V+ E + R Sbjct: 584 LDKYLEEPLFPRNTDFDILNWWKVHTPRYPILSTMARNVLAVPMSNVSSEEDAFKSCPRR 643 Query: 1044 ALDHSWGTEKSDTLQALMCSQDWMRNELEDS 1136 + +W + + T+QALMC+QDW+R+ELE S 
Sbjct: 644 QISETWWSLRPSTVQALMCAQDWIRSELESS 674 >ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] gi|548861481|gb|ERN18855.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] Length = 685 Score = 338 bits (866), Expect = 5e-90 Identities = 165/384 (42%), Positives = 257/384 (66%), Gaps = 8/384 (2%) Frame = +3 Query: 15 IRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRGTQE 194 +R +L ++ L EG++F + C + V L+VQD LE +++ K+RE+I+Y+K + QE Sbjct: 296 LRSRLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQE 355 Query: 195 KFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCPSGID 374 +FNEI+ +GI ++ + +D P +WNSTY ML+ LE +EAF + D +M PS + Sbjct: 356 RFNEIINQLGIQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSCFAQCDSMCNMVPSEDE 415 Query: 375 WDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSLALK 554 W+R++ I LK F++++N F+G K+ TAN YF E+ +HL+L++W + ISS+A+K Sbjct: 416 WERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIK 475 Query: 555 LKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKALYSG 734 +K KFD+YWK +L++AIA ++DPRFK+K VEY Y QIYG+ A I +V + L + Sbjct: 476 MKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRMVRQGVYDLCNE 535 Query: 735 HAIYSPLAAHGQNS-----SSESNGI---AKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890 + PLA++ ++S S+ S G+ K F++F+ E+S +Q KS+LD+YLEE Sbjct: 536 YESKEPLASNSESSLAVSASTSSGGVDTHGKLWAMEFEKFVRESSSNQARKSELDRYLEE 595 Query: 891 PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070 P+FPR++DF+I NWW+++ PR+P LS MAR+ILGIP+S V +S FD G + LD + Sbjct: 596 PIFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVSTVTSDSTFDIGGQVLDQYRSSL 655 Query: 1071 KSDTLQALMCSQDWMRNELEDSKT 1142 +T+QALMC+QDW+ NEL+ K+ Sbjct: 656 LPETIQALMCAQDWLWNELKGGKS 679