BLASTX nr result

ID: Mentha23_contig00017151

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017151
         (1532 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus...   674   0.0  
gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlise...   612   e-172
ref|XP_007048823.1| BED zinc finger,hAT family dimerization doma...   466   e-129
ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prun...   447   e-123
ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phas...   439   e-120
ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Popu...   424   e-116
ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Popu...   424   e-116
emb|CBI20108.3| unnamed protein product [Vitis vinifera]              422   e-115
emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera]   421   e-115
ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prun...   414   e-113
ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phas...   400   e-109
ref|XP_007021998.1| BED zinc finger,hAT family dimerization doma...   398   e-108
ref|XP_007022001.1| BED zinc finger,hAT family dimerization doma...   395   e-107
ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutr...   392   e-106
gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsi...   389   e-105
ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Caps...   384   e-104
dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thalian...   380   e-102
ref|XP_007022002.1| BED zinc finger,hAT family dimerization doma...   378   e-102
ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutr...   373   e-100
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...   338   5e-90

>gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus guttatus]
          Length = 656

 Score =  674 bits (1740), Expect = 0.0
 Identities = 325/376 (86%), Positives = 352/376 (93%), Gaps = 1/376 (0%)
 Frame = +3

Query: 6    VCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRG 185
            VCRIRDQLCQHRFLMCEGQLFDVRC ASTVKLLVQDVLETSR+ITNKVRETI+Y+KG++ 
Sbjct: 276  VCRIRDQLCQHRFLMCEGQLFDVRCAASTVKLLVQDVLETSREITNKVRETIRYVKGSQA 335

Query: 186  TQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCPS 365
            TQEKFNEIVQLVGI+ QK LSVDNPFQWNST +MLEAALEYKEAFPQLQE+DP FSMCPS
Sbjct: 336  TQEKFNEIVQLVGINCQKSLSVDNPFQWNSTCMMLEAALEYKEAFPQLQEHDPGFSMCPS 395

Query: 366  GIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSL 545
             IDWDRLR+ITSI KFFHEVSNVF GRKH+T+NSYF EICDIHLQLI WCQKSD+FISSL
Sbjct: 396  DIDWDRLRAITSIFKFFHEVSNVFAGRKHITSNSYFNEICDIHLQLIGWCQKSDEFISSL 455

Query: 546  ALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKAL 725
            ALKLKSKFDEYWKKCSLIMAIAAILDPR+KM+LVEYYYPQIYGDSAPDCIDIV NCMKAL
Sbjct: 456  ALKLKSKFDEYWKKCSLIMAIAAILDPRYKMQLVEYYYPQIYGDSAPDCIDIVKNCMKAL 515

Query: 726  YSGHAIYSPLAAHGQNSSSESN-GIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEPLFP 902
            YSGHAIYSPL+AHGQ+S+SES+  I KD+L+GFDRFLHETSVSQNTKSDLDKYLEEPLFP
Sbjct: 516  YSGHAIYSPLSAHGQSSASESSVSIVKDKLTGFDRFLHETSVSQNTKSDLDKYLEEPLFP 575

Query: 903  RSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEKSDT 1082
            R    S+LNWWKVHEPRYPVLSMMARNILGIPISKVA+ESLFDTG+RALDH W T KSDT
Sbjct: 576  RKNVISVLNWWKVHEPRYPVLSMMARNILGIPISKVAVESLFDTGERALDHCWSTMKSDT 635

Query: 1083 LQALMCSQDWMRNELE 1130
            LQALMCS+DW+ ++ E
Sbjct: 636  LQALMCSRDWISSDFE 651


>gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlisea aurea]
          Length = 647

 Score =  612 bits (1578), Expect = e-172
 Identities = 289/376 (76%), Positives = 335/376 (89%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV +IRDQLCQHRFLMCEGQLFDVRC  STV++LVQ+VLETSR++T KVRE ++Y+KG+R
Sbjct: 275  IVSKIRDQLCQHRFLMCEGQLFDVRCATSTVRVLVQEVLETSREMTKKVREIVRYVKGSR 334

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
               EKFNEIV+L+G++ QK+LS+DNP +WNST  MLEAALEYKE FPQLQE DP FS  P
Sbjct: 335  AAYEKFNEIVRLLGVNSQKVLSIDNPLKWNSTSTMLEAALEYKEVFPQLQELDPEFSTWP 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            SG+DWDRLR+I  ILKFF EVS VFVG KH+TANS+FAEICDIHL+LI+WCQKSDDFISS
Sbjct: 395  SGMDWDRLRAIAGILKFFIEVSEVFVGGKHITANSFFAEICDIHLKLIEWCQKSDDFISS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LALKLKS FDEYWKKCSLIMA+AAILDPR+KMKLVEYYYPQIYGDSAP+CI+IVSNCMK+
Sbjct: 455  LALKLKSVFDEYWKKCSLIMAVAAILDPRYKMKLVEYYYPQIYGDSAPECIEIVSNCMKS 514

Query: 723  LYSGHAIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEPLFP 902
            LY+GH IYSPLAAH   +S      AKDRL+GFDRFLHETSVSQNTKSDL+KYLE+PLFP
Sbjct: 515  LYNGHIIYSPLAAH---ASENGGAAAKDRLTGFDRFLHETSVSQNTKSDLEKYLEDPLFP 571

Query: 903  RSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEKSDT 1082
            R+ D +IL+WWKV+EPRYPVLSMMARNILGIPISKV+ +++FDTG++ +DH W T KS+T
Sbjct: 572  RNNDLNILSWWKVNEPRYPVLSMMARNILGIPISKVSSDAVFDTGNKPIDHCWATLKSET 631

Query: 1083 LQALMCSQDWMRNELE 1130
            LQALMCSQDW+ NELE
Sbjct: 632  LQALMCSQDWLHNELE 647


>ref|XP_007048823.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao]
            gi|508701084|gb|EOX92980.1| BED zinc finger,hAT family
            dimerization domain [Theobroma cacao]
          Length = 657

 Score =  466 bits (1200), Expect = e-129
 Identities = 224/384 (58%), Positives = 289/384 (75%), Gaps = 8/384 (2%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RIRD+L Q+RFL C GQLFDVRC    +  +VQD L+   ++T K+RE+I+Y+K + 
Sbjct: 273  IVDRIRDRLSQNRFLYCNGQLFDVRCAVDLLNRMVQDALDAVCEVTQKIRESIRYVKSSE 332

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDP-SFSMC 359
             TQ  F E+   V +  QK L +DNP +WNST++MLE ALEY++ F  LQ+ DP +    
Sbjct: 333  ATQSMFIELAHEVQVESQKCLRIDNPLKWNSTFLMLEVALEYRKVFCCLQDRDPVNMKFL 392

Query: 360  PSGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFIS 539
            PS ++WDR+  I S LK F EV+NVF   K+ TAN +F EICDIHLQLI+WC+  DD+I+
Sbjct: 393  PSDLEWDRVSVIASFLKLFVEVTNVFTRSKYPTANIFFPEICDIHLQLIEWCKNPDDYIN 452

Query: 540  SLALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMK 719
            SLA+K++ KF++YW KCSL +A+AA+LDPRFKMKL+EYYYPQ+YGDSA + ID V  C+K
Sbjct: 453  SLAVKMRKKFEDYWDKCSLGLAVAAMLDPRFKMKLLEYYYPQLYGDSASELIDDVFECIK 512

Query: 720  ALYSGHAIYSPLAAH-GQNSSSESNGI------AKDRLSGFDRFLHETSVSQNTKSDLDK 878
            +LY+ H++ SPLA+   Q  S + +GI      ++DRL GFD+FLHETS S  + SDLDK
Sbjct: 513  SLYNEHSMVSPLASSLDQGLSWQVSGIPGSGKDSRDRLMGFDKFLHETSQSDGSNSDLDK 572

Query: 879  YLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHS 1058
            YLE+PLFPR+VDF+ILNWWKVH P YP+LSMMA NILGIPISKVA ES FDTG R +DH+
Sbjct: 573  YLEDPLFPRNVDFNILNWWKVHTPSYPILSMMAHNILGIPISKVAAESTFDTGGRVVDHN 632

Query: 1059 WGTEKSDTLQALMCSQDWMRNELE 1130
            W +    T+QALMCSQDW+R+ELE
Sbjct: 633  WSSLPPTTVQALMCSQDWIRSELE 656


>ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prunus persica]
            gi|462413140|gb|EMJ18189.1| hypothetical protein
            PRUPE_ppa002590mg [Prunus persica]
          Length = 655

 Score =  447 bits (1150), Expect = e-123
 Identities = 212/383 (55%), Positives = 281/383 (73%), Gaps = 7/383 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            +V RIRD+L Q++ L C+GQLFDVRC A+ + ++ QD LE   ++T+K+R +I+Y+K ++
Sbjct: 272  VVFRIRDRLSQNKLLSCDGQLFDVRCAANVINMMSQDALEALCEMTDKIRGSIRYVKSSQ 331

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              QEKFN IV  VG   ++ L +DNP QWNSTYVM+E ALEY++AF  LQE DP ++MCP
Sbjct: 332  VIQEKFNSIVHQVGGESRRCLCLDNPLQWNSTYVMVEIALEYRDAFALLQENDPVYAMCP 391

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S ++WDR+  ITS LK F  V+NVF   K  TAN YF E+C+++ QL +WC+ +DD+ISS
Sbjct: 392  SDVEWDRVNIITSYLKLFVGVTNVFTRFKSPTANLYFPELCEVYSQLNEWCKNADDYISS 451

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LALK++SKF+EYW +CSL +A+A +LDPRFKMK V+YYY Q +G  AP  I  V  C+K 
Sbjct: 452  LALKMRSKFEEYWMRCSLSLAVAVMLDPRFKMKPVDYYYAQFFGSGAPGRISDVFECVKT 511

Query: 723  LYSGHA-----IYSPLA--AHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKY 881
            LY+ H+     +   LA    G +    S    +DRL+GFD+FLHET+    TKSDLDKY
Sbjct: 512  LYNEHSTCLAYVDQGLAWQVGGSSRLPGSGRDLRDRLTGFDKFLHETTEIDGTKSDLDKY 571

Query: 882  LEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSW 1061
            LEEPLFPR+ +F ILNWWKVH PRYP+LSMMARN+LGIP+SKV ++S F+TG R LD  W
Sbjct: 572  LEEPLFPRNAEFDILNWWKVHAPRYPILSMMARNVLGIPVSKVPIDSTFNTGGRVLDRDW 631

Query: 1062 GTEKSDTLQALMCSQDWMRNELE 1130
             +    T+QALMC+QDW+R+ELE
Sbjct: 632  SSMNPATIQALMCAQDWIRSELE 654


>ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phaseolus vulgaris]
            gi|561019590|gb|ESW18361.1| hypothetical protein
            PHAVU_006G034500g [Phaseolus vulgaris]
          Length = 663

 Score =  439 bits (1130), Expect = e-120
 Identities = 220/387 (56%), Positives = 280/387 (72%), Gaps = 10/387 (2%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            I  RI D+L Q+RFL C GQLFD+RC A+ +  +VQ  L    +I  K+RETI YIK ++
Sbjct: 277  IAVRIGDRLLQNRFLYCNGQLFDIRCAANVINAMVQHALGAVSEIVIKIRETIGYIKSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
                KFNE+ + VGI  QK L +DN  QWNSTY MLE ALE+K+    LQE D ++ +  
Sbjct: 337  IILAKFNEMAKEVGILSQKGLCLDNASQWNSTYSMLEVALEFKDVLILLQENDAAYKVYL 396

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S ++W+R+ ++TS LK F EV NVF   K+ TAN YF E+CD+ L LI+WC+ SD++ISS
Sbjct: 397  SDVEWERVTAVTSYLKLFVEVINVFTKNKYPTANIYFPELCDVKLHLIEWCKNSDEYISS 456

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA +L+SKFDEYW+KCSL +A+AA+LDPRFKMKLV+YYYPQIYG  +   I+ V + +KA
Sbjct: 457  LASRLRSKFDEYWEKCSLGLAVAAMLDPRFKMKLVDYYYPQIYGSMSASRIEEVFDGVKA 516

Query: 723  LYSGHAIYSPLAAHGQ-------NSSSESNGIAK---DRLSGFDRFLHETSVSQNTKSDL 872
            LY+ H+I SPLA+H Q       N      G AK   DRL GFD+FLHETS  + TKSDL
Sbjct: 517  LYNEHSIGSPLASHDQGLAWQVGNGPLLLQGSAKDSRDRLMGFDKFLHETSQGEGTKSDL 576

Query: 873  DKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALD 1052
            DKYLEEPLFPR+VDF+ILNWW+VH PRYPVLSMMARN+LGIP++KVA E  F+   R LD
Sbjct: 577  DKYLEEPLFPRNVDFNILNWWRVHTPRYPVLSMMARNVLGIPMAKVAPELAFNHSGRVLD 636

Query: 1053 HSWGTEKSDTLQALMCSQDWMRNELED 1133
              W +    T+QAL+CSQDW+R+ELE+
Sbjct: 637  RDWSSLNPATVQALVCSQDWIRSELEN 663


>ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Populus trichocarpa]
            gi|550328098|gb|ERP55512.1| hypothetical protein
            POPTR_0011s10500g [Populus trichocarpa]
          Length = 673

 Score =  424 bits (1090), Expect = e-116
 Identities = 211/395 (53%), Positives = 283/395 (71%), Gaps = 5/395 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D++ Q+R L+  GQLFDVR  A  + L+VQD +ET R++T KVR +++Y+K ++
Sbjct: 279  IVLRIKDRISQNRPLLSNGQLFDVRSAAHVLNLIVQDAMETIREVTEKVRGSVRYVKSSQ 338

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI + +GIS QK L +D P +WNSTY MLE  + YK AF  LQE DP+++   
Sbjct: 339  VIQGKFNEIAEQIGISSQKNLVLDLPTRWNSTYFMLETVIGYKSAFCFLQERDPAYTSAL 398

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            +  +W+   SIT  LK F E++N+F G K  TAN YF EICD+H+QLI+WC+  DDF+SS
Sbjct: 399  TDTEWEWASSITGYLKLFVEITNIFSGDKCPTANIYFPEICDVHIQLIEWCKNPDDFLSS 458

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            +A K+K+KFD YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I  VS+ +K 
Sbjct: 459  MASKMKAKFDRYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 518

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S L   G     +S   ++  ++DRL GFD+FLHE+S  Q+  SDLDKYLEE
Sbjct: 519  LFNAYSICSTLVDQGSTLPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSAISDLDKYLEE 578

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
            P+FPR+ DF+ILNWWKVH PRYP+LSMMAR+ILG P+S +A E  F  G R LD    + 
Sbjct: 579  PVFPRNCDFNILNWWKVHTPRYPILSMMARDILGTPMSTIAPELAFGVGGRVLDSYRSSL 638

Query: 1071 KSDTLQALMCSQDWMRNELED-SKTPAFALHSDAN 1172
              DT QAL+C++DW++ E ED + + A AL+ +AN
Sbjct: 639  NPDTRQALICTRDWLQVESEDHNPSSALALYVEAN 673


>ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Populus trichocarpa]
            gi|550349246|gb|ERP66636.1| hypothetical protein
            POPTR_0001s39240g [Populus trichocarpa]
          Length = 673

 Score =  424 bits (1089), Expect = e-116
 Identities = 209/395 (52%), Positives = 286/395 (72%), Gaps = 5/395 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D++ Q+R L+  GQLFDVR     + L+V+D +ET +++T KVR ++ Y+K ++
Sbjct: 279  IVLRIKDRISQNRPLLSNGQLFDVRSAVHVLNLIVKDAMETLQEVTEKVRGSVSYVKSSQ 338

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFN+I Q +GIS Q+ L +D+  +WNSTY MLE  + YK AF  LQE+DP+++   
Sbjct: 339  VIQGKFNDIAQQIGISSQRNLVLDSSTRWNSTYSMLETVIGYKSAFCFLQEHDPAYTSAL 398

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S I+W+  +SIT  LK F E++N+F G K  TAN YF EICD+H+QLI+WC+  DDF+SS
Sbjct: 399  SDIEWEWAKSITGYLKLFVEITNIFSGDKCPTANRYFPEICDVHIQLIEWCKNPDDFLSS 458

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            +A K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I  VS+ +K 
Sbjct: 459  IASKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 518

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S L   G     +S   ++  ++DRL GFD+FLHE+S  Q++ SDLDKYLEE
Sbjct: 519  LFNAYSICSTLVDQGSALPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSSISDLDKYLEE 578

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
            P+FPR+ DF+ILNWWKVH PRYP+LSMMAR+ILG P+S V+ E  F  G R LD    + 
Sbjct: 579  PVFPRNCDFNILNWWKVHTPRYPILSMMARDILGTPMSTVSPELAFGVGGRVLDSYRSSL 638

Query: 1071 KSDTLQALMCSQDWMRNELED-SKTPAFALHSDAN 1172
              DT QAL+C++DW+R E ED + + A AL+ +AN
Sbjct: 639  NPDTRQALICTRDWLRVESEDHNPSSALALYVEAN 673


>emb|CBI20108.3| unnamed protein product [Vitis vinifera]
          Length = 677

 Score =  422 bits (1085), Expect = e-115
 Identities = 212/404 (52%), Positives = 281/404 (69%), Gaps = 14/404 (3%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            +  R+++   Q R L+  GQL DVRC    + L+VQD +E  R++T+K+RE+++Y+K ++
Sbjct: 275  VALRVKEHFSQDRPLLGSGQLLDVRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQ 334

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
             T  KFNEI Q VGI+ Q+ L +D P QWNSTY+ML+  LEYK AF  LQE+DP +++  
Sbjct: 335  ATLGKFNEIAQQVGINSQQNLFLDCPTQWNSTYLMLDRVLEYKGAFSLLQEHDPGYTVAL 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   SITS +K   E+  V    K  TAN YF EICDIH+QLI+WC+  DDFISS
Sbjct: 395  SDTEWEWASSITSYMKLLLEIIAVLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LALK+K+KFD+YW KCSL +A+A ILDPRFKMKLVEYYYPQIYG  A D I  VS+ +K 
Sbjct: 455  LALKMKAKFDKYWSKCSLALAVAVILDPRFKMKLVEYYYPQIYGTDAADRIKDVSDGIKE 514

Query: 723  LYSGH-----AIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887
            L++ +     +++  +A  G +  S SN  ++DRL GFD+F+HETS +QN  SDLDKYLE
Sbjct: 515  LFNVYCSTSASLHQGVALPGSSLPSTSND-SRDRLKGFDKFIHETSQNQNIVSDLDKYLE 573

Query: 888  EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067
            EP+FPR+ DF ILNWWKV +PRYP+LSMM R++LGIP+S VA E +F TG R LDH   +
Sbjct: 574  EPVFPRNCDFHILNWWKVQKPRYPILSMMVRDVLGIPMSTVAPEVVFSTGARVLDHYRSS 633

Query: 1068 EKSDTLQALMCSQDWMRNELED---------SKTPAFALHSDAN 1172
               DT QAL+C+QDW++  LE+         S  PA  L  +AN
Sbjct: 634  LNPDTRQALICTQDWLQTGLEEPNQSSPHQTSPHPAIPLAIEAN 677


>emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera]
          Length = 667

 Score =  421 bits (1083), Expect = e-115
 Identities = 206/381 (54%), Positives = 274/381 (71%), Gaps = 5/381 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            +  R+++   Q R L+  GQL DVRC    + L+VQD +E  R++T+K+RE+++Y+K ++
Sbjct: 275  VALRVKEHFSQDRPLLGSGQLLDVRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQ 334

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
             T  KFNEI Q VGI+ Q+ L +D P QWNSTY+ML+  LEYK AF  LQE+DP +++  
Sbjct: 335  ATLGKFNEIAQQVGINSQQNLFLDCPTQWNSTYLMLDTVLEYKGAFSLLQEHDPGYTVAL 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   SITS +K   E+  V    K  TAN YF EICDIH+QLI+WC+  DDFISS
Sbjct: 395  SDTEWEWASSITSYMKLLLEIIAVLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LALK+K+KFD+YW KCSL +A+A ILDPRFKMKLVEYYYPQIYG+ A D I  VS+ +K 
Sbjct: 455  LALKMKAKFDKYWSKCSLALAVAVILDPRFKMKLVEYYYPQIYGNDAADRIKDVSDGIKE 514

Query: 723  LYSGH-----AIYSPLAAHGQNSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887
            L++ +     +++  +A  G +  S SN  ++DRL GFD+F+HETS +QN  SDLDKYLE
Sbjct: 515  LFNVYCSTSASLHQGVALPGSSLPSTSND-SRDRLKGFDKFIHETSQNQNIVSDLDKYLE 573

Query: 888  EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067
            EP+FPR+ DF ILNWWKV +PRYP+LSMM R++LGIP+S VA E +F TG R LDH   +
Sbjct: 574  EPVFPRNCDFHILNWWKVQKPRYPILSMMVRDVLGIPMSTVAPEVVFSTGARVLDHYRSS 633

Query: 1068 EKSDTLQALMCSQDWMRNELE 1130
               DT QAL+C+QDW++  LE
Sbjct: 634  LNPDTRQALICTQDWLQTGLE 654


>ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prunus persica]
            gi|462409466|gb|EMJ14800.1| hypothetical protein
            PRUPE_ppa002416mg [Prunus persica]
          Length = 675

 Score =  414 bits (1063), Expect = e-113
 Identities = 205/393 (52%), Positives = 276/393 (70%), Gaps = 4/393 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D++ Q R L   GQLFD+R  A  +  +VQDVLE  R++  K+R + ++++ ++
Sbjct: 277  IVLRIKDRISQSRPLAGHGQLFDIRSAAHLLNSIVQDVLEALREVIQKIRGSFKHVRSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI Q VGI+ ++ L +D P +WNSTY+MLE ALEY+ AF  LQE+DPS++   
Sbjct: 337  VVQGKFNEIAQQVGINSERRLILDFPVRWNSTYIMLETALEYRGAFSLLQEHDPSYASSL 396

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            +  +W+    +T  LK   E++NVF G K  TA+ YF EIC +H+QLI+WC+  DDF+S 
Sbjct: 397  TDTEWEWTSFVTGYLKLLVEITNVFSGNKSPTASIYFPEICHVHIQLIEWCKSPDDFLSC 456

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            +ALK+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A D I  VS+ +K 
Sbjct: 457  MALKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKE 516

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L+  ++I S +   G     +S   ++   +DRL GFD+FL+ETS SQN  SDLDKYLEE
Sbjct: 517  LFDAYSICSTMVDQGSALPGSSLPSTSSDTRDRLKGFDKFLYETSQSQNVISDLDKYLEE 576

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
            P+FPR+ DF+ILNWWKVH PRYP+LSMMAR++LG P+S VA ES F  G R LD    + 
Sbjct: 577  PVFPRNCDFNILNWWKVHTPRYPILSMMARDVLGTPMSTVAPESAFSIGGRVLDQCRSSL 636

Query: 1071 KSDTLQALMCSQDWMRNELEDSKTPAFALHSDA 1169
              D  QAL+C+QDW++ EL+D     F+ HS A
Sbjct: 637  NPDIRQALVCTQDWLQVELKD--VNPFSSHSAA 667


>ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phaseolus vulgaris]
            gi|561006312|gb|ESW05306.1| hypothetical protein
            PHAVU_011G169000g [Phaseolus vulgaris]
          Length = 672

 Score =  400 bits (1029), Expect = e-109
 Identities = 194/381 (50%), Positives = 267/381 (70%), Gaps = 4/381 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            I  RI++++ + R  +   QL D+R  A  +  + QD +E  +++  K+RE+I+Y++ ++
Sbjct: 277  ITLRIKERVSEKRPFLSTRQLLDIRSAAHLINSIAQDAMEALQEVIQKIRESIRYVRSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI Q   I+ QK+L +D P QW STY+MLE A+EY+ AF   Q++DPS+S   
Sbjct: 337  VVQAKFNEIAQHATINTQKVLFLDFPVQWKSTYLMLETAVEYRSAFSLFQDHDPSYSSTL 396

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   S+T  LK   E++NVF G K  TAN YF EICD H+QLI WC+ SD F+S 
Sbjct: 397  SDEEWEWATSVTGYLKLLVEITNVFSGNKFPTANVYFPEICDAHIQLIDWCRSSDSFLSP 456

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            +A+K+K+KFD+YW KCSL +A+AA+LDPRFKMKLVEYYY  IYG +A + I  VS+ +K 
Sbjct: 457  MAMKMKAKFDKYWGKCSLALALAAVLDPRFKMKLVEYYYSLIYGSTALERIKEVSDGIKE 516

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S +   G     +S   ++  ++DRL GFDRFLHETS SQ+  SDLDKYLEE
Sbjct: 517  LFNAYSICSTMIDQGSALPGSSLPSTSCSSRDRLKGFDRFLHETSQSQSMTSDLDKYLEE 576

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
            P+FPR+ DF+ILNWWKVH PRYP+LSMMAR++LG P+S +A E  F TG R LD S  + 
Sbjct: 577  PIFPRNSDFNILNWWKVHMPRYPILSMMARDVLGTPMSTLAPELAFTTGGRVLDSSRSSL 636

Query: 1071 KSDTLQALMCSQDWMRNELED 1133
              DT +AL+C+QDW+RNE  D
Sbjct: 637  NPDTREALICTQDWLRNESGD 657


>ref|XP_007021998.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma
            cacao] gi|590611078|ref|XP_007021999.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|590611082|ref|XP_007022000.1| BED
            zinc finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721626|gb|EOY13523.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721627|gb|EOY13524.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721628|gb|EOY13525.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao]
          Length = 672

 Score =  398 bits (1022), Expect = e-108
 Identities = 202/398 (50%), Positives = 275/398 (69%), Gaps = 8/398 (2%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI++Q+ ++R  +  GQL DVR  A  +  LVQD +E  + +  K+R +++Y+K ++
Sbjct: 277  IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI Q  GI  QK L +D P +WNSTYVMLE A+EY+ AF  L E DP  ++  
Sbjct: 337  SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   S+T  LK F E+ NVF G K  TAN YF EIC +H+QLI+WC+  D+F+SS
Sbjct: 395  SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I  VS+ +K 
Sbjct: 455  LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S L   G     +S   S+  ++DRL GFD+FLHET+ SQ+  SDL+KYLEE
Sbjct: 515  LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
             +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD    + 
Sbjct: 575  AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLDSCRSSL 634

Query: 1071 KSDTLQALMCSQDWMRNELED----SKTPAFALHSDAN 1172
             +DT QAL+C++DW+  + +D    S   A  L+ +AN
Sbjct: 635  TADTRQALICTRDWLWMQSDDPSPSSSHYALPLYVEAN 672


>ref|XP_007022001.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma
            cacao] gi|590611092|ref|XP_007022003.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao] gi|508721629|gb|EOY13526.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao] gi|508721631|gb|EOY13528.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao]
          Length = 689

 Score =  395 bits (1016), Expect = e-107
 Identities = 196/375 (52%), Positives = 265/375 (70%), Gaps = 4/375 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI++Q+ ++R  +  GQL DVR  A  +  LVQD +E  + +  K+R +++Y+K ++
Sbjct: 277  IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI Q  GI  QK L +D P +WNSTYVMLE A+EY+ AF  L E DP  ++  
Sbjct: 337  SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   S+T  LK F E+ NVF G K  TAN YF EIC +H+QLI+WC+  D+F+SS
Sbjct: 395  SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I  VS+ +K 
Sbjct: 455  LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S L   G     +S   S+  ++DRL GFD+FLHET+ SQ+  SDL+KYLEE
Sbjct: 515  LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
             +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD    + 
Sbjct: 575  AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLDSCRSSL 634

Query: 1071 KSDTLQALMCSQDWM 1115
             +DT QAL+C++DW+
Sbjct: 635  TADTRQALICTRDWL 649


>ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutrema salsugineum]
            gi|557108189|gb|ESQ48496.1| hypothetical protein
            EUTSA_v10020233mg [Eutrema salsugineum]
          Length = 662

 Score =  392 bits (1007), Expect = e-106
 Identities = 194/382 (50%), Positives = 265/382 (69%), Gaps = 6/382 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D + Q   ++  GQL++++     +  LVQD LE  R +  K+R +++Y+K ++
Sbjct: 276  IVLRIKDHMSQSSPILINGQLYELKSANHLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQ 335

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
             TQ +FNEI QL GI+ +K+L +D+   WNSTY MLE  LEY+ AF  L+++D  F    
Sbjct: 336  STQARFNEIAQLAGINSEKILVLDSLGTWNSTYAMLETVLEYQGAFCHLRDHDHGFDSSL 395

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            +  +W+  RS+T  LK   E++  F G +  TAN YFAE+CDIH+QLI+WC+  D F+SS
Sbjct: 396  TDEEWEWTRSVTGYLKLVFEIAADFSGNRCPTANVYFAEMCDIHIQLIEWCKNQDSFLSS 455

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA K+K+KFDEYW KCSL++AIAAILDPRFKMKLVEYYY +IYG  A D I  VSN +K 
Sbjct: 456  LAAKMKAKFDEYWNKCSLVLAIAAILDPRFKMKLVEYYYSKIYGSVALDRIKEVSNGVKE 515

Query: 723  LYSGHAIYSPLAAHGQNSSSESNGIA------KDRLSGFDRFLHETSVSQNTKSDLDKYL 884
            L   +++ S +   G++SS   +G+A      +DRL GFD+FLHETS +QNT SDLDKYL
Sbjct: 516  LLDAYSMCSSI--DGEDSSFSGSGLARGSMDTRDRLKGFDKFLHETSQNQNTTSDLDKYL 573

Query: 885  EEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWG 1064
             EP+FPRS +F+ILN+WKVH PRYP+LSMMAR+ILG P+S +A +S F++G   +D S  
Sbjct: 574  SEPIFPRSGEFNILNYWKVHTPRYPILSMMARDILGTPMSILAPDSTFNSGRPVIDESKS 633

Query: 1065 TEKSDTLQALMCSQDWMRNELE 1130
            +   D  QAL C+ DW+  E E
Sbjct: 634  SLSPDIRQALFCAHDWLSTEAE 655


>gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsis thaliana]
          Length = 676

 Score =  389 bits (999), Expect = e-105
 Identities = 191/392 (48%), Positives = 267/392 (68%), Gaps = 14/392 (3%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            +  +IRD+L Q++FL C GQLFDV C  + +  +VQD LE      N +RE+I+Y+K + 
Sbjct: 285  VASKIRDRLSQNKFLYCYGQLFDVSCGVNVINEMVQDSLEACCDTINIIRESIRYVKSSE 344

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q++FN+ +   G   ++ L +D+P +W+ST  MLE ALE K AF  + E+DP   +CP
Sbjct: 345  SIQDRFNQWIVETGAVSERNLCIDDPMRWDSTCTMLENALEQKSAFSLMNEHDPDSVLCP 404

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S ++W+RL +I   LK F EV N F     + AN YF E+CDIHL+LI+W +  DDFISS
Sbjct: 405  SDLEWERLGTIVEFLKVFVEVINAFTKSSCLPANMYFPEVCDIHLRLIEWSKNPDDFISS 464

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            L + ++ KFD++W K  L++AIA ILDPRFKMKLVEYYYP  YG SA + I+ +S C+K 
Sbjct: 465  LVVNMRKKFDDFWDKNYLVLAIATILDPRFKMKLVEYYYPLFYGTSASELIEDISECIKL 524

Query: 723  LYSGHAIYSPLAAHG-----QNSSSESNGIA-----KDRLSGFDRFLHETSVS--QNTKS 866
            LY  H++ S LA+       QN    SNG+A      DRL+ FDR+++ET+ +  Q++KS
Sbjct: 525  LYDEHSVGSLLASSNQALDWQNHHHRSNGVAHGKEPDDRLTEFDRYINETTTTPGQDSKS 584

Query: 867  DLDKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVAL-ESLFDTGD- 1040
            DL+KYLEEPLFPR+ DF ILNWWKVH P+YP+LSMMARN+L +P+  V+  E  F+T   
Sbjct: 585  DLEKYLEEPLFPRNSDFDILNWWKVHTPKYPILSMMARNVLAVPMLNVSSEEDAFETCQR 644

Query: 1041 RALDHSWGTEKSDTLQALMCSQDWMRNELEDS 1136
            R +  +W + +  T+QALMC+QDW+++ELE S
Sbjct: 645  RRVSETWRSLRPSTVQALMCAQDWIQSELESS 676


>ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Capsella rubella]
            gi|565479004|ref|XP_006297142.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
            gi|482565850|gb|EOA30039.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
            gi|482565851|gb|EOA30040.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
          Length = 667

 Score =  384 bits (986), Expect = e-104
 Identities = 191/382 (50%), Positives = 262/382 (68%), Gaps = 5/382 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D + Q   ++  GQLF+++  A  +  LVQD LE  R +  K+R +++Y+K ++
Sbjct: 276  IVLRIKDHMSQSSQILINGQLFELKSAAHLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQ 335

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q +FNEI QL GI+  K+L +D+    NSTYVMLE  LEYK AF  L+++D  F    
Sbjct: 336  SAQVRFNEIAQLAGINSHKILVLDSLVNSNSTYVMLETVLEYKGAFCHLRDHDHGFDSSL 395

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            +  +W+  R +T  LK   ++++ F G K  TAN YF E+CDIH+QLI+WC+  D+F+SS
Sbjct: 396  TDEEWEWTRYVTGYLKLVFDIASDFSGNKCPTANVYFPEMCDIHIQLIEWCKNQDNFLSS 455

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA  +K+KFDEYW KCSL++AIAAILDPR+KMKLVEYYY +IYG +A D I  VSN +K 
Sbjct: 456  LAASMKAKFDEYWNKCSLVLAIAAILDPRYKMKLVEYYYSKIYGSTALDRIKEVSNGVKE 515

Query: 723  LYSGHAIYSPLAAHGQNSSSESNGI-----AKDRLSGFDRFLHETSVSQNTKSDLDKYLE 887
            L   +++ S +   G++SS   +G+      +DRL GFD+FLHETS +QNT SDLDKYL 
Sbjct: 516  LLDAYSMCSAIV--GEDSSFSGSGLGRAMDTRDRLKGFDKFLHETSQNQNTTSDLDKYLS 573

Query: 888  EPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGT 1067
            EP FPRS +F+ILN+WKVH PRYP+LSMMAR+ILG PIS +A +S F++G   +  S  +
Sbjct: 574  EPNFPRSGEFNILNYWKVHTPRYPILSMMARDILGTPISIIAPDSTFNSGTPMIADSQSS 633

Query: 1068 EKSDTLQALMCSQDWMRNELED 1133
               D  QAL C+ DW+  E E+
Sbjct: 634  LNPDIRQALFCAHDWLSTETEE 655


>dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thaliana]
            gi|18176330|gb|AAL60024.1| unknown protein [Arabidopsis
            thaliana] gi|20465375|gb|AAM20091.1| unknown protein
            [Arabidopsis thaliana]
          Length = 662

 Score =  380 bits (975), Expect = e-102
 Identities = 186/379 (49%), Positives = 260/379 (68%), Gaps = 3/379 (0%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI+D + Q   ++  GQLF+++  A  +  LV+D LE  R +  K+R +++Y+K ++
Sbjct: 277  IVLRIKDHMSQSSQILINGQLFELKSAAHLLNSLVEDCLEAMRDVIQKIRGSVRYVKSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
             TQ +FNEI QL GI+ QK+L +D+    NST+VMLE  LEYK AF  L+++D SF    
Sbjct: 337  STQVRFNEIAQLAGINSQKILVLDSIVNSNSTFVMLETVLEYKGAFCHLRDHDHSFDSSL 396

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            +  +W+  R +T  LK   ++++ F   K  TAN YFAE+CDIH+QL++WC+  D+F+SS
Sbjct: 397  TDEEWEWTRYVTGYLKLVFDIASDFSANKCPTANVYFAEMCDIHIQLVEWCKNQDNFLSS 456

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA  +K+KFDEYW KCSL++AIAAILDPRFKMKLVEYYY +IYG +A D I  VSN +K 
Sbjct: 457  LAANMKAKFDEYWNKCSLVLAIAAILDPRFKMKLVEYYYSKIYGSTALDRIKEVSNGVKE 516

Query: 723  LYSGHAIYSPLAAHGQNSSS---ESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEEP 893
            L   +++ S +      S S    ++   +DRL GFD+FLHETS +QNT +DLDKYL EP
Sbjct: 517  LLDAYSMCSAIVGEDSFSGSGLGRASMDTRDRLKGFDKFLHETSQNQNTTTDLDKYLSEP 576

Query: 894  LFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTEK 1073
            +FPRS +F+ILN+WKVH PRYP+LS++AR+ILG P+S  A +S F++G   +  S  +  
Sbjct: 577  IFPRSGEFNILNYWKVHTPRYPILSLLARDILGTPMSICAPDSTFNSGTPVISDSQSSLN 636

Query: 1074 SDTLQALMCSQDWMRNELE 1130
             D  QAL C+ DW+  E E
Sbjct: 637  PDIRQALFCAHDWLSTETE 655


>ref|XP_007022002.1| BED zinc finger,hAT family dimerization domain isoform 5 [Theobroma
            cacao] gi|508721630|gb|EOY13527.1| BED zinc finger,hAT
            family dimerization domain isoform 5 [Theobroma cacao]
          Length = 639

 Score =  378 bits (970), Expect = e-102
 Identities = 188/354 (53%), Positives = 251/354 (70%), Gaps = 4/354 (1%)
 Frame = +3

Query: 3    IVCRIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTR 182
            IV RI++Q+ ++R  +  GQL DVR  A  +  LVQD +E  + +  K+R +++Y+K ++
Sbjct: 277  IVLRIKEQISENRPRLSNGQLLDVRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQ 336

Query: 183  GTQEKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCP 362
              Q KFNEI Q  GI  QK L +D P +WNSTYVMLE A+EY+ AF  L E DP  ++  
Sbjct: 337  SIQGKFNEIAQQTGIISQKSLVLDCPIRWNSTYVMLETAVEYRNAFCHLPELDPDLAL-- 394

Query: 363  SGIDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISS 542
            S  +W+   S+T  LK F E+ NVF G K  TAN YF EIC +H+QLI+WC+  D+F+SS
Sbjct: 395  SDDEWEWASSVTGYLKLFIEIINVFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSS 454

Query: 543  LALKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKA 722
            LA K+K+KFD+YW KCSL +A+AAILDPRFKMKLVEYYY QIYG +A + I  VS+ +K 
Sbjct: 455  LAAKMKAKFDKYWSKCSLALAVAAILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKE 514

Query: 723  LYSGHAIYSPLAAHGQ----NSSSESNGIAKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            L++ ++I S L   G     +S   S+  ++DRL GFD+FLHET+ SQ+  SDL+KYLEE
Sbjct: 515  LFNAYSICSTLIDEGTALPGSSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEE 574

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALD 1052
             +FPR+ DF+ILNWW+VH PRYP+LSMMAR++LG P+S VA ES F+ G R LD
Sbjct: 575  AVFPRNCDFNILNWWRVHTPRYPILSMMARDVLGTPMSTVAQESAFNAGGRVLD 628


>ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutrema salsugineum]
            gi|557087376|gb|ESQ28228.1| hypothetical protein
            EUTSA_v10018229mg [Eutrema salsugineum]
          Length = 674

 Score =  373 bits (958), Expect = e-100
 Identities = 185/391 (47%), Positives = 261/391 (66%), Gaps = 16/391 (4%)
 Frame = +3

Query: 12   RIRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRGTQ 191
            +IRD+L Q++FL C GQLFDV C    +  + QD L+T  +  +K+R  I+Y+K +   Q
Sbjct: 284  KIRDRLSQNKFLYCNGQLFDVSCGVYVINQMAQDSLQTCCETIDKIRNCIRYVKSSESIQ 343

Query: 192  EKFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSM-CPSG 368
            E FN+     G   +K L +D+  +W++T  MLE  LE K  F  ++E DP   + CPS 
Sbjct: 344  ESFNQWRAEAGAESEKDLCIDDSTRWDTTCSMLEIVLEQKNVFLLMKERDPDSCLPCPSD 403

Query: 369  IDWDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSLA 548
            ++W+RL +I   LK F EV+N F     +TAN YF EICDIHL+LI+W + +DDFISS+A
Sbjct: 404  LEWERLETIVGFLKVFVEVANAFTKSSCLTANIYFPEICDIHLRLIEWSKNTDDFISSVA 463

Query: 549  LKLKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKALY 728
            + ++  FDE+W K +L++AIA ILDPRFKMKLVEYYYP  Y  SA + I+ +S C+KALY
Sbjct: 464  VNMRKLFDEFWDKNNLVLAIATILDPRFKMKLVEYYYPLFYDSSASELIEDISECIKALY 523

Query: 729  SGHAIYSPLAAHG-----QNSSSESNGIA-----KDRLSGFDRFLHETSVS---QNTKSD 869
            + H++ S LA+       Q +  + NG+       +RL  FDR++H+T+ +   Q+++SD
Sbjct: 524  NEHSVRSLLASSDQALDWQENHHQPNGVVHGIEPDNRLIEFDRYIHDTTTTTQGQDSRSD 583

Query: 870  LDKYLEEPLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALE--SLFDTGDR 1043
            LDKYLEEPLFPR+ DF ILNWWKVH PRYP+LS MARN+L +P+S V+ E  +      R
Sbjct: 584  LDKYLEEPLFPRNTDFDILNWWKVHTPRYPILSTMARNVLAVPMSNVSSEEDAFKSCPRR 643

Query: 1044 ALDHSWGTEKSDTLQALMCSQDWMRNELEDS 1136
             +  +W + +  T+QALMC+QDW+R+ELE S
Sbjct: 644  QISETWWSLRPSTVQALMCAQDWIRSELESS 674


>ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda]
            gi|548861481|gb|ERN18855.1| hypothetical protein
            AMTR_s00067p00136180 [Amborella trichopoda]
          Length = 685

 Score =  338 bits (866), Expect = 5e-90
 Identities = 165/384 (42%), Positives = 257/384 (66%), Gaps = 8/384 (2%)
 Frame = +3

Query: 15   IRDQLCQHRFLMCEGQLFDVRCTASTVKLLVQDVLETSRQITNKVRETIQYIKGTRGTQE 194
            +R +L ++  L  EG++F + C +  V L+VQD LE  +++  K+RE+I+Y+K +   QE
Sbjct: 296  LRSRLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQE 355

Query: 195  KFNEIVQLVGISGQKLLSVDNPFQWNSTYVMLEAALEYKEAFPQLQEYDPSFSMCPSGID 374
            +FNEI+  +GI  ++ + +D P +WNSTY ML+  LE +EAF    + D   +M PS  +
Sbjct: 356  RFNEIINQLGIQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSCFAQCDSMCNMVPSEDE 415

Query: 375  WDRLRSITSILKFFHEVSNVFVGRKHVTANSYFAEICDIHLQLIQWCQKSDDFISSLALK 554
            W+R++ I   LK F++++N F+G K+ TAN YF E+  +HL+L++W    +  ISS+A+K
Sbjct: 416  WERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIK 475

Query: 555  LKSKFDEYWKKCSLIMAIAAILDPRFKMKLVEYYYPQIYGDSAPDCIDIVSNCMKALYSG 734
            +K KFD+YWK  +L++AIA ++DPRFK+K VEY Y QIYG+ A   I +V   +  L + 
Sbjct: 476  MKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRMVRQGVYDLCNE 535

Query: 735  HAIYSPLAAHGQNS-----SSESNGI---AKDRLSGFDRFLHETSVSQNTKSDLDKYLEE 890
            +    PLA++ ++S     S+ S G+    K     F++F+ E+S +Q  KS+LD+YLEE
Sbjct: 536  YESKEPLASNSESSLAVSASTSSGGVDTHGKLWAMEFEKFVRESSSNQARKSELDRYLEE 595

Query: 891  PLFPRSVDFSILNWWKVHEPRYPVLSMMARNILGIPISKVALESLFDTGDRALDHSWGTE 1070
            P+FPR++DF+I NWW+++ PR+P LS MAR+ILGIP+S V  +S FD G + LD    + 
Sbjct: 596  PIFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVSTVTSDSTFDIGGQVLDQYRSSL 655

Query: 1071 KSDTLQALMCSQDWMRNELEDSKT 1142
              +T+QALMC+QDW+ NEL+  K+
Sbjct: 656  LPETIQALMCAQDWLWNELKGGKS 679

