BLASTX nr result

ID: Sinomenium21_contig00019247 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00019247
         (2184 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI20108.3| unnamed protein product [Vitis vinifera]              723   0.0  
emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera]   721   0.0  
ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Popu...   721   0.0  
ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Popu...   711   0.0  
ref|XP_007022001.1| BED zinc finger,hAT family dimerization doma...   711   0.0  
ref|XP_007021998.1| BED zinc finger,hAT family dimerization doma...   708   0.0  
ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phas...   694   0.0  
ref|XP_007048823.1| BED zinc finger,hAT family dimerization doma...   693   0.0  
ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phas...   691   0.0  
ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prun...   690   0.0  
ref|XP_007022002.1| BED zinc finger,hAT family dimerization doma...   684   0.0  
ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prun...   682   0.0  
ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutr...   645   0.0  
dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thalian...   634   e-179
ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Caps...   633   e-179
gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus...   622   e-175
gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlise...   611   e-172
gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsi...   578   e-162
ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutr...   573   e-161
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...   554   e-155

>emb|CBI20108.3| unnamed protein product [Vitis vinifera]
          Length = 677

 Score =  723 bits (1866), Expect = 0.0
 Identities = 370/660 (56%), Positives = 477/660 (72%), Gaps = 14/660 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            ME+ S  S I+K KRLTS VWN FERV++ D+  A+C HC K+L        +HLRNHL 
Sbjct: 1    MEI-SNESAIKKPKRLTSVVWNHFERVRKADICYAVCIHCNKRLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQNQFDP-- 1801
            RCL+R+N DV+Q+L  + +K+  A  L    YD+   +        + FDQEQ + +P  
Sbjct: 60   RCLKRSNYDVSQLLAAKRRKKEGALSLTAINYDEGQRKEENIKPTILKFDQEQKKDEPIN 119

Query: 1800 ----KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                +FDQE+SR DLARMIILH YPLAMV HVGFK FV++LQPLF + S+   + DC++I
Sbjct: 120  LGSIRFDQERSRLDLARMIILHGYPLAMVNHVGFKVFVKDLQPLFEVNSA--IELDCMEI 177

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEKQKVY+++ +  GRI+L V  WTS + + YLCLT+HYI E W L+K+ILNFV +DP
Sbjct: 178  YGKEKQKVYEVMSRSHGRINLAVDMWTSPEQAEYLCLTAHYIDEDWKLQKKILNFVSLDP 237

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            S TE  LSE II CLM+W++  KLFS+TF  C++N++  LR+K+  SQ+R LL +GQL  
Sbjct: 238  SHTEDMLSEVIIKCLMEWEVGHKLFSMTFHDCATNDDVALRVKEHFSQDRPLLGSGQLLD 297

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            V C  HVLN+I+QD  +A+ E+ +KIRES+RYVK+S+A   KFNE+A QV  N++++L L
Sbjct: 298  VRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQATLGKFNEIAQQVGINSQQNLFL 357

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            DCPTQW STY ML+  +EYK AFS LQE D GYT+A S+ EWE A+ ITSY+K+  E+  
Sbjct: 358  DCPTQWNSTYLMLDRVLEYKGAFSLLQEHDPGYTVALSDTEWEWASSITSYMKLLLEIIA 417

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            VLSS KC T+N YF E+ DIH+ LIEWCK   + ISS+A+K K KFD YW+ CSL LA+A
Sbjct: 418  VLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISSLALKMKAKFDKYWSKCSLALAVA 477

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
             +LDPRFKMKLVEYYY QIYG++AAD+IK VS  I++L+N Y  CST ASL QG+A    
Sbjct: 478  VILDPRFKMKLVEYYYPQIYGTDAADRIKDVSDGIKELFNVY--CSTSASLHQGVALP-G 534

Query: 558  NGVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSP 379
            + +  + N   DRL+GFDKF+HETS +  + ++LDKYLEEPVFPRN DF+IL+WWKV  P
Sbjct: 535  SSLPSTSNDSRDRLKGFDKFIHETSQNQNIVSDLDKYLEEPVFPRNCDFHILNWWKVQKP 594

Query: 378  KFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            ++PILSMM RDVLGIPMS  V+P     TG R LD  +SSL+PD  QALICT DWLQ+ L
Sbjct: 595  RYPILSMMVRDVLGIPMS-TVAPEVVFSTGARVLDHYRSSLNPDTRQALICTQDWLQTGL 653


>emb|CAN60218.1| hypothetical protein VITISV_006612 [Vitis vinifera]
          Length = 667

 Score =  721 bits (1862), Expect = 0.0
 Identities = 369/660 (55%), Positives = 477/660 (72%), Gaps = 14/660 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            ME+ S  S I+K KRLTS VWN FERV++ D+  A+C HC K+L        +HLRNHL 
Sbjct: 1    MEI-SNESAIKKPKRLTSVVWNHFERVRKADICYAVCIHCNKRLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQNQFDP-- 1801
            RCL+R+N DV+Q+L  + +K+  A  L    YD+   +        + FDQEQ + +P  
Sbjct: 60   RCLKRSNYDVSQLLAAKRRKKEGALSLTAINYDEGQRKEENIKPTILKFDQEQKKDEPIN 119

Query: 1800 ----KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                +FDQE+SR DLARMIILH YPLAMV HVGFK FV++LQPLF + S+   + DC++I
Sbjct: 120  LGSIRFDQERSRLDLARMIILHGYPLAMVNHVGFKVFVKDLQPLFEVNSA--IELDCMEI 177

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEKQKVY+++ +  GRI+L V  WTS + + YLCLT+HYI E W L+K+ILNF+ +DP
Sbjct: 178  YGKEKQKVYEVMSRSHGRINLAVDMWTSPEQAEYLCLTAHYIDEDWKLQKKILNFLSLDP 237

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            S TE  LSE II CLM+W++  KLFS+TF  C++N++  LR+K+  SQ+R LL +GQL  
Sbjct: 238  SHTEDMLSEFIIKCLMEWEVGHKLFSMTFHDCATNDDVALRVKEHFSQDRPLLGSGQLLD 297

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            V C  HVLN+I+QD  +A+ E+ +KIRES+RYVK+S+A   KFNE+A QV  N++++L L
Sbjct: 298  VRCVGHVLNLIVQDCIEALREVTHKIRESVRYVKTSQATLGKFNEIAQQVGINSQQNLFL 357

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            DCPTQW STY ML+  +EYK AFS LQE D GYT+A S+ EWE A+ ITSY+K+  E+  
Sbjct: 358  DCPTQWNSTYLMLDTVLEYKGAFSLLQEHDPGYTVALSDTEWEWASSITSYMKLLLEIIA 417

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            VLSS KC T+N YF E+ DIH+ LIEWCK   + ISS+A+K K KFD YW+ CSL LA+A
Sbjct: 418  VLSSNKCPTANIYFPEICDIHIQLIEWCKSPDDFISSLALKMKAKFDKYWSKCSLALAVA 477

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
             +LDPRFKMKLVEYYY QIYG++AAD+IK VS  I++L+N Y  CST ASL QG+A    
Sbjct: 478  VILDPRFKMKLVEYYYPQIYGNDAADRIKDVSDGIKELFNVY--CSTSASLHQGVALP-G 534

Query: 558  NGVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSP 379
            + +  + N   DRL+GFDKF+HETS +  + ++LDKYLEEPVFPRN DF+IL+WWKV  P
Sbjct: 535  SSLPSTSNDSRDRLKGFDKFIHETSQNQNIVSDLDKYLEEPVFPRNCDFHILNWWKVQKP 594

Query: 378  KFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            ++PILSMM RDVLGIPMS  V+P     TG R LD  +SSL+PD  QALICT DWLQ+ L
Sbjct: 595  RYPILSMMVRDVLGIPMS-TVAPEVVFSTGARVLDHYRSSLNPDTRQALICTQDWLQTGL 653


>ref|XP_006377715.1| hypothetical protein POPTR_0011s10500g [Populus trichocarpa]
            gi|550328098|gb|ERP55512.1| hypothetical protein
            POPTR_0011s10500g [Populus trichocarpa]
          Length = 673

 Score =  721 bits (1861), Expect = 0.0
 Identities = 367/662 (55%), Positives = 479/662 (72%), Gaps = 17/662 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            MEV S  S I+K KRLTS VWN F+R+++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEV-SNESAIKKPKRLTSVVWNHFQRIRKADVCYAVCVHCDKKLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ--------LAVTFDQEQNQFDP 1801
            RCL+R+N DV+Q+L  ++KK+  +  + N   + + +Q          + FD EQ + + 
Sbjct: 60   RCLKRSNYDVSQLLAAKKKKKDTSLSIANVNANYDETQRKDEYIKPTIIKFDHEQRKDEI 119

Query: 1800 ------KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCL 1639
                  +FDQEQSR DLARMIILH YPL MVEHVGFK FV+NLQPLF  + + S +  C+
Sbjct: 120  ISLGSCRFDQEQSRLDLARMIILHGYPLTMVEHVGFKIFVKNLQPLFEFVPNSSIEVSCI 179

Query: 1638 QIYEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLV 1459
            +IY KEKQKVY+M++++ GRI+L V  W+S +++ YLCL +HYI E W L+++ILNFV +
Sbjct: 180  EIYMKEKQKVYEMINRLHGRINLAVEMWSSPENAEYLCLIAHYIDEDWKLQQKILNFVTL 239

Query: 1458 DPS-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQL 1282
            D S TE  LSE II+CLM+WD++ KLF++TFD C ++++ VLRIKDR+SQNR LLSNGQL
Sbjct: 240  DSSHTEDMLSEVIINCLMEWDVECKLFAMTFDDCFADDDIVLRIKDRISQNRPLLSNGQL 299

Query: 1281 FHVCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSL 1102
            F V  A HVLN+I+QDA + I E+  K+R S+RYVKSS+ +Q KFNE+A Q+  +++K+L
Sbjct: 300  FDVRSAAHVLNLIVQDAMETIREVTEKVRGSVRYVKSSQVIQGKFNEIAEQIGISSQKNL 359

Query: 1101 RLDCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEV 922
             LD PT+W STY MLE  + YKSAF  LQE D  YT A ++ EWE A+ IT YLK+F E+
Sbjct: 360  VLDLPTRWNSTYFMLETVIGYKSAFCFLQERDPAYTSALTDTEWEWASSITGYLKLFVEI 419

Query: 921  NKVLSSVKCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLA 745
              + S  KC T+N YF E+ D+H+ LIEWCK   + +SS+A K K KFD YW+ CSL LA
Sbjct: 420  TNIFSGDKCPTANIYFPEICDVHIQLIEWCKNPDDFLSSMASKMKAKFDRYWSKCSLALA 479

Query: 744  IAAVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACE 565
            +AA+LDPRFKMKLVEYYY+QIYGS A D+IK VS  I++L+N YSICSTL  +DQG    
Sbjct: 480  VAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKELFNAYSICSTL--VDQG--ST 535

Query: 564  VQNGVFRSGNSDT-DRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKV 388
            +      S ++D+ DRL+GFDKFLHE+S      ++LDKYLEEPVFPRN DFNIL+WWKV
Sbjct: 536  LPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSAISDLDKYLEEPVFPRNCDFNILNWWKV 595

Query: 387  NSPKFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQ 208
            ++P++PILSMMARD+LG PMS  ++P  A   GGR LDS +SSL+PD  QALICT DWLQ
Sbjct: 596  HTPRYPILSMMARDILGTPMS-TIAPELAFGVGGRVLDSYRSSLNPDTRQALICTRDWLQ 654

Query: 207  SE 202
             E
Sbjct: 655  VE 656


>ref|XP_006370067.1| hypothetical protein POPTR_0001s39240g [Populus trichocarpa]
            gi|550349246|gb|ERP66636.1| hypothetical protein
            POPTR_0001s39240g [Populus trichocarpa]
          Length = 673

 Score =  711 bits (1835), Expect = 0.0
 Identities = 361/662 (54%), Positives = 477/662 (72%), Gaps = 17/662 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            MEV S    I+K KRLTS VWN F+R+++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEV-SNELAIKKPKRLTSVVWNHFQRIRKADVCYAVCVHCDKKLSGSSNSGTTHLRNHLL 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ--------LAVTFDQEQNQFDP 1801
            RCL+R+N DV+Q+LV ++KK+  +  L N     + +Q          +  D EQ + + 
Sbjct: 60   RCLKRSNYDVSQLLVAKKKKKDTSLSLANVNVSYDEAQRKDEYIKPTVMKSDLEQRKDEV 119

Query: 1800 ------KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCL 1639
                  +FDQE+S+ DLARMIILH YPL MVEHVGFKRFV+NLQPLF  + + S +  C+
Sbjct: 120  ISLGSCRFDQERSQLDLARMIILHGYPLTMVEHVGFKRFVKNLQPLFEFVPNSSIEVSCM 179

Query: 1638 QIYEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLV 1459
            + Y KEKQKVY+M++++ GRI+L +  W+S +++ Y+CL +HYI E W L+++ILNFV +
Sbjct: 180  EFYLKEKQKVYEMINRLHGRINLAIEMWSSPENAEYMCLIAHYIDEDWKLQQKILNFVTL 239

Query: 1458 DPS-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQL 1282
            D S TE  LSE II+CLM+WD++ KLF++TFD CS++++ VLRIKDR+SQNR LLSNGQL
Sbjct: 240  DSSHTEDVLSEVIINCLMEWDVEYKLFAMTFDDCSADDDIVLRIKDRISQNRPLLSNGQL 299

Query: 1281 FHVCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSL 1102
            F V  A HVLN+I++DA + + E+  K+R S+ YVKSS+ +Q KFN++A Q+  +++++L
Sbjct: 300  FDVRSAVHVLNLIVKDAMETLQEVTEKVRGSVSYVKSSQVIQGKFNDIAQQIGISSQRNL 359

Query: 1101 RLDCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEV 922
             LD  T+W STY+MLE  + YKSAF  LQE D  YT A S+ EWE A  IT YLK+F E+
Sbjct: 360  VLDSSTRWNSTYSMLETVIGYKSAFCFLQEHDPAYTSALSDIEWEWAKSITGYLKLFVEI 419

Query: 921  NKVLSSVKCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLA 745
              + S  KC T+N YF E+ D+H+ LIEWCK   + +SS+A K K KFD YW+ CSL LA
Sbjct: 420  TNIFSGDKCPTANRYFPEICDVHIQLIEWCKNPDDFLSSIASKMKAKFDKYWSKCSLALA 479

Query: 744  IAAVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACE 565
            +AA+LDPRFKMKLVEYYY+QIYGS A D+IK VS  I++L+N YSICSTL  +DQG A  
Sbjct: 480  VAAILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKELFNAYSICSTL--VDQGSA-- 535

Query: 564  VQNGVFRSGNSDT-DRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKV 388
            +      S ++D+ DRL+GFDKFLHE+S      ++LDKYLEEPVFPRN DFNIL+WWKV
Sbjct: 536  LPGSSLPSTSTDSRDRLKGFDKFLHESSQGQSSISDLDKYLEEPVFPRNCDFNILNWWKV 595

Query: 387  NSPKFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQ 208
            ++P++PILSMMARD+LG PMS  VSP  A   GGR LDS +SSL+PD  QALICT DWL+
Sbjct: 596  HTPRYPILSMMARDILGTPMS-TVSPELAFGVGGRVLDSYRSSLNPDTRQALICTRDWLR 654

Query: 207  SE 202
             E
Sbjct: 655  VE 656


>ref|XP_007022001.1| BED zinc finger,hAT family dimerization domain isoform 4 [Theobroma
            cacao] gi|590611092|ref|XP_007022003.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao] gi|508721629|gb|EOY13526.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao] gi|508721631|gb|EOY13528.1| BED zinc
            finger,hAT family dimerization domain isoform 4
            [Theobroma cacao]
          Length = 689

 Score =  711 bits (1834), Expect = 0.0
 Identities = 371/680 (54%), Positives = 485/680 (71%), Gaps = 16/680 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            MEV ++ S I+K KRLTS VWN FERV++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEVANE-SAIKKPKRLTSVVWNHFERVRKADVCYAVCVHCNKKLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQ------N 1813
            RCL+R+N DV+Q+L  + +K+     + N  YD+   +        V ++Q+Q      N
Sbjct: 60   RCLKRSNYDVSQLLAAKRRKKDNTLTIANISYDEGQRKEDYIKPTIVKYEQDQRKDEVFN 119

Query: 1812 QFDPKFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                +FDQE+SR DLARMIILH YPLAMVEHVGFK FV+NLQPLF ++ + + +  C++I
Sbjct: 120  LGSSRFDQERSRLDLARMIILHGYPLAMVEHVGFKVFVKNLQPLFDLVPNSTIELFCMEI 179

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEKQKVYDML K+ GRI+L V  W+S ++S YLCLT+HYI + W L+K+ILNFV +D 
Sbjct: 180  YGKEKQKVYDMLSKLQGRINLAVEMWSSPENSNYLCLTAHYIDDDWKLQKKILNFVTLDS 239

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            S TE  LSE I+ CLMDWDI+ KLF++TFD CS+N++ VLRIK+++S+NR  LSNGQL  
Sbjct: 240  SHTEDLLSEVIMKCLMDWDIECKLFAMTFDDCSTNDDIVLRIKEQISENRPRLSNGQLLD 299

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            V  A H+LN ++QDA +A+  ++ KIR S+RYVKSS+++Q KFNE+A Q    ++KSL L
Sbjct: 300  VRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQSIQGKFNEIAQQTGIISQKSLVL 359

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            DCP +W STY MLE AVEY++AF HL E D    +A S+ EWE A+ +T YLK+F E+  
Sbjct: 360  DCPIRWNSTYVMLETAVEYRNAFCHLPELDP--DLALSDDEWEWASSVTGYLKLFIEIIN 417

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            V S  KC T+N YF E+  +H+ LIEWCK   N +SS+A K K KFD YW+ CSL LA+A
Sbjct: 418  VFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSSLAAKMKAKFDKYWSKCSLALAVA 477

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
            A+LDPRFKMKLVEYYY+QIYGS A ++IK VS  I++L+N YSICSTL  +D+G A    
Sbjct: 478  AILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKELFNAYSICSTL--IDEGTALP-G 534

Query: 558  NGVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSP 379
            + +  S N   DRL+GFDKFLHET+ S    ++L+KYLEE VFPRN DFNIL+WW+V++P
Sbjct: 535  SSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEEAVFPRNCDFNILNWWRVHTP 594

Query: 378  KFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWL--QS 205
            ++PILSMMARDVLG PMS  V+  +A + GGR LDS +SSL+ D  QALICT DWL  QS
Sbjct: 595  RYPILSMMARDVLGTPMS-TVAQESAFNAGGRVLDSCRSSLTADTRQALICTRDWLWMQS 653

Query: 204  ELGQVI*N*VFSDFVICIIY 145
            +   +I +     F+I  IY
Sbjct: 654  DGACIIFDLFAQSFLIHYIY 673


>ref|XP_007021998.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma
            cacao] gi|590611078|ref|XP_007021999.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|590611082|ref|XP_007022000.1| BED
            zinc finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721626|gb|EOY13523.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721627|gb|EOY13524.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao] gi|508721628|gb|EOY13525.1| BED zinc
            finger,hAT family dimerization domain isoform 1
            [Theobroma cacao]
          Length = 672

 Score =  708 bits (1828), Expect = 0.0
 Identities = 364/656 (55%), Positives = 474/656 (72%), Gaps = 14/656 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            MEV ++ S I+K KRLTS VWN FERV++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEVANE-SAIKKPKRLTSVVWNHFERVRKADVCYAVCVHCNKKLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQ------N 1813
            RCL+R+N DV+Q+L  + +K+     + N  YD+   +        V ++Q+Q      N
Sbjct: 60   RCLKRSNYDVSQLLAAKRRKKDNTLTIANISYDEGQRKEDYIKPTIVKYEQDQRKDEVFN 119

Query: 1812 QFDPKFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                +FDQE+SR DLARMIILH YPLAMVEHVGFK FV+NLQPLF ++ + + +  C++I
Sbjct: 120  LGSSRFDQERSRLDLARMIILHGYPLAMVEHVGFKVFVKNLQPLFDLVPNSTIELFCMEI 179

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEKQKVYDML K+ GRI+L V  W+S ++S YLCLT+HYI + W L+K+ILNFV +D 
Sbjct: 180  YGKEKQKVYDMLSKLQGRINLAVEMWSSPENSNYLCLTAHYIDDDWKLQKKILNFVTLDS 239

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            S TE  LSE I+ CLMDWDI+ KLF++TFD CS+N++ VLRIK+++S+NR  LSNGQL  
Sbjct: 240  SHTEDLLSEVIMKCLMDWDIECKLFAMTFDDCSTNDDIVLRIKEQISENRPRLSNGQLLD 299

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            V  A H+LN ++QDA +A+  ++ KIR S+RYVKSS+++Q KFNE+A Q    ++KSL L
Sbjct: 300  VRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQSIQGKFNEIAQQTGIISQKSLVL 359

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            DCP +W STY MLE AVEY++AF HL E D    +A S+ EWE A+ +T YLK+F E+  
Sbjct: 360  DCPIRWNSTYVMLETAVEYRNAFCHLPELDP--DLALSDDEWEWASSVTGYLKLFIEIIN 417

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            V S  KC T+N YF E+  +H+ LIEWCK   N +SS+A K K KFD YW+ CSL LA+A
Sbjct: 418  VFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSSLAAKMKAKFDKYWSKCSLALAVA 477

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
            A+LDPRFKMKLVEYYY+QIYGS A ++IK VS  I++L+N YSICSTL  +D+G A    
Sbjct: 478  AILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKELFNAYSICSTL--IDEGTALP-G 534

Query: 558  NGVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSP 379
            + +  S N   DRL+GFDKFLHET+ S    ++L+KYLEE VFPRN DFNIL+WW+V++P
Sbjct: 535  SSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEEAVFPRNCDFNILNWWRVHTP 594

Query: 378  KFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWL 211
            ++PILSMMARDVLG PMS  V+  +A + GGR LDS +SSL+ D  QALICT DWL
Sbjct: 595  RYPILSMMARDVLGTPMS-TVAQESAFNAGGRVLDSCRSSLTADTRQALICTRDWL 649


>ref|XP_007146367.1| hypothetical protein PHAVU_006G034500g [Phaseolus vulgaris]
            gi|561019590|gb|ESW18361.1| hypothetical protein
            PHAVU_006G034500g [Phaseolus vulgaris]
          Length = 663

 Score =  694 bits (1792), Expect = 0.0
 Identities = 356/657 (54%), Positives = 470/657 (71%), Gaps = 18/657 (2%)
 Frame = -1

Query: 2115 SVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLRRTN 1936
            +VI K+ RL S VWNDF+R+K+GD  VA+C+HCKKKL        SHLRNHL RC RR++
Sbjct: 6    AVIVKSSRLKSVVWNDFDRIKKGDTCVAVCRHCKKKLSGSSTSGTSHLRNHLIRCQRRSS 65

Query: 1935 RDVTQVLVVREKKRHVATELENFKYDQEHSQ-------LAVTFDQEQNQFDP------KF 1795
              + Q +  REK++     + NF  DQ+ ++       + + F+Q Q + D        F
Sbjct: 66   HGIAQYISAREKRKEGTLAIANFNIDQDTNKDDNTLSLVNIKFEQTQLKDDTVNTGTSNF 125

Query: 1794 DQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEKQ 1615
            DQ +SRFDLARMIILH YPLAMVEHVGF+ FV+NLQPLF ++S +  +ADC++IYE+EK+
Sbjct: 126  DQRRSRFDLARMIILHGYPLAMVEHVGFRAFVKNLQPLFELVSLNRVEADCIEIYEREKK 185

Query: 1614 KVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEHA 1438
            KV +MLDK+PG+ISL+   W +  D+ YLCLTS+YI E+W L +RILNF+ +DPS TE  
Sbjct: 186  KVNEMLDKLPGKISLSADVWNAVGDAEYLCLTSNYIDESWQLRRRILNFIRIDPSHTEDM 245

Query: 1437 LSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAKH 1258
            +SE I++CLM WDIDRKLFS+  DSCS+ +N  +RI DRL QNR L  NGQLF + CA +
Sbjct: 246  VSEAIMNCLMYWDIDRKLFSMILDSCSTCDNIAVRIGDRLLQNRFLYCNGQLFDIRCAAN 305

Query: 1257 VLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQW 1078
            V+N ++Q A  A+ EI+ KIRE+I Y+KSS+ +  KFNEMA +V   ++K L LD  +QW
Sbjct: 306  VINAMVQHALGAVSEIVIKIRETIGYIKSSQIILAKFNEMAKEVGILSQKGLCLDNASQW 365

Query: 1077 MSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSVK 898
             STY+MLE A+E+K     LQE D+ Y +  S+ EWER   +TSYLK+F EV  V +  K
Sbjct: 366  NSTYSMLEVALEFKDVLILLQENDAAYKVYLSDVEWERVTAVTSYLKLFVEVINVFTKNK 425

Query: 897  CLTSNHYFSEMSDIHLNLIEWCKISN-CISSVAMKSKCKFDAYWNICSLKLAIAAVLDPR 721
              T+N YF E+ D+ L+LIEWCK S+  ISS+A + + KFD YW  CSL LA+AA+LDPR
Sbjct: 426  YPTANIYFPELCDVKLHLIEWCKNSDEYISSLASRLRSKFDEYWEKCSLGLAVAAMLDPR 485

Query: 720  FKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNG---V 550
            FKMKLV+YYY QIYGS +A +I+ V   ++ LYNE+SI S LAS DQGLA +V NG   +
Sbjct: 486  FKMKLVDYYYPQIYGSMSASRIEEVFDGVKALYNEHSIGSPLASHDQGLAWQVGNGPLLL 545

Query: 549  FRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFP 370
              S     DRL GFDKFLHETS      ++LDKYLEEP+FPRN+DFNIL+WW+V++P++P
Sbjct: 546  QGSAKDSRDRLMGFDKFLHETSQGEGTKSDLDKYLEEPLFPRNVDFNILNWWRVHTPRYP 605

Query: 369  ILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            +LSMMAR+VLGIPM+  V+P  A +  GR LD D SSL+P  +QAL+C+ DW++SEL
Sbjct: 606  VLSMMARNVLGIPMA-KVAPELAFNHSGRVLDRDWSSLNPATVQALVCSQDWIRSEL 661


>ref|XP_007048823.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao]
            gi|508701084|gb|EOX92980.1| BED zinc finger,hAT family
            dimerization domain [Theobroma cacao]
          Length = 657

 Score =  693 bits (1788), Expect = 0.0
 Identities = 363/653 (55%), Positives = 471/653 (72%), Gaps = 14/653 (2%)
 Frame = -1

Query: 2115 SVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLRRTN 1936
            +V+  + RL S VWNDF+RVK+GD  VAIC+HCKKKL        SHLRNHL RC RR+N
Sbjct: 6    AVVANSSRLKSIVWNDFDRVKKGDTFVAICRHCKKKLSGSSTSGTSHLRNHLIRCQRRSN 65

Query: 1935 RDVTQVLVVREKKRH----VATELENFKYDQEHSQLAVTFDQEQNQFDP------KFDQE 1786
              + Q    REKK+     V T  +  K D+  S + + ++QEQ + +P        DQ 
Sbjct: 66   HGIAQYFSGREKKKEGSLAVVTIDQEQKKDEVLSLVNLRYEQEQIKNEPVTIGNSSLDQR 125

Query: 1785 QSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEKQKVY 1606
            +S+FDLARMIILH YPL MV+HVGFK FV NLQPLF +++ +  +ADC++IY KEKQ+VY
Sbjct: 126  RSQFDLARMIILHNYPLDMVDHVGFKIFVRNLQPLFELVTYNKVEADCMEIYAKEKQRVY 185

Query: 1605 DMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEHALSE 1429
            ++LDK PG+IS+T   WT+S DS YL LT+HYI E W L+KR LNFV +DPS TE   SE
Sbjct: 186  EVLDKFPGKISVTADVWTASDDSAYLSLTAHYIDEDWQLKKRTLNFVTIDPSHTEDMHSE 245

Query: 1428 TIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAKHVLN 1249
             I++CLMDWDIDRKLFS+ FDS +S EN V RI+DRLSQNR L  NGQLF V CA  +LN
Sbjct: 246  VIMTCLMDWDIDRKLFSMIFDSYTS-ENIVDRIRDRLSQNRFLYCNGQLFDVRCAVDLLN 304

Query: 1248 VIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQWMST 1069
             ++QDA DA+ E+  KIRESIRYVKSSEA Q  F E+AH+VQ  ++K LR+D P +W ST
Sbjct: 305  RMVQDALDAVCEVTQKIRESIRYVKSSEATQSMFIELAHEVQVESQKCLRIDNPLKWNST 364

Query: 1068 YTMLEAAVEYKSAFSHLQECDS-GYTMAPSEREWERANDITSYLKMFDEVNKVLSSVKCL 892
            + MLE A+EY+  F  LQ+ D       PS+ EW+R + I S+LK+F EV  V +  K  
Sbjct: 365  FLMLEVALEYRKVFCCLQDRDPVNMKFLPSDLEWDRVSVIASFLKLFVEVTNVFTRSKYP 424

Query: 891  TSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAVLDPRFK 715
            T+N +F E+ DIHL LIEWCK   + I+S+A+K + KF+ YW+ CSL LA+AA+LDPRFK
Sbjct: 425  TANIFFPEICDIHLQLIEWCKNPDDYINSLAVKMRKKFEDYWDKCSLGLAVAAMLDPRFK 484

Query: 714  MKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLA-SLDQGLACEVQNGVFRSG 538
            MKL+EYYY Q+YG +A++ I  V + I+ LYNE+S+ S LA SLDQGL+ +V +G+  SG
Sbjct: 485  MKLLEYYYPQLYGDSASELIDDVFECIKSLYNEHSMVSPLASSLDQGLSWQV-SGIPGSG 543

Query: 537  NSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPILSM 358
                DRL GFDKFLHETS S   N++LDKYLE+P+FPRN+DFNIL+WWKV++P +PILSM
Sbjct: 544  KDSRDRLMGFDKFLHETSQSDGSNSDLDKYLEDPLFPRNVDFNILNWWKVHTPSYPILSM 603

Query: 357  MARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            MA ++LGIP+S  V+  +  DTGGR +D + SSL P  +QAL+C+ DW++SEL
Sbjct: 604  MAHNILGIPIS-KVAAESTFDTGGRVVDHNWSSLPPTTVQALMCSQDWIRSEL 655


>ref|XP_007133312.1| hypothetical protein PHAVU_011G169000g [Phaseolus vulgaris]
            gi|561006312|gb|ESW05306.1| hypothetical protein
            PHAVU_011G169000g [Phaseolus vulgaris]
          Length = 672

 Score =  691 bits (1783), Expect = 0.0
 Identities = 350/659 (53%), Positives = 460/659 (69%), Gaps = 14/659 (2%)
 Frame = -1

Query: 2124 SQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLR 1945
            S  S  +K KRLTS VWN FERV++ D+  A+C HC K+L        +HLRNHL RCL+
Sbjct: 4    SNDSGTKKPKRLTSVVWNHFERVRKADICYAVCVHCNKRLSGSSNSGTTHLRNHLMRCLK 63

Query: 1944 RTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQNQFD------P 1801
            R+N DV+Q+L  + +K+     L N  +D+   +        + F+QE  + D       
Sbjct: 64   RSNFDVSQLLAAKRRKKDNTISLANISFDEGQRKEEYVKPTIIKFEQEHKKDDIINFGSS 123

Query: 1800 KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKE 1621
            KFDQE+S+ DLARMIILH YPL++VE VGFK FV+NLQPLF  M + + +  C+ IY +E
Sbjct: 124  KFDQERSQHDLARMIILHGYPLSLVEQVGFKVFVKNLQPLFEFMPNGAVEVSCIDIYRRE 183

Query: 1620 KQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP-STE 1444
            K+KVYDM++++ GRI+L++  W+S+++  YLCL++HYI E W L+K+ILNFV +D   TE
Sbjct: 184  KEKVYDMINRLQGRINLSIEMWSSTENYSYLCLSAHYIDEEWTLQKKILNFVTLDSLHTE 243

Query: 1443 HALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCA 1264
              L E II CL +WDID KLF++T D CS +E+  LRIK+R+S+ R  LS  QL  +  A
Sbjct: 244  DLLPEVIIKCLNEWDIDGKLFALTLDDCSISEDITLRIKERVSEKRPFLSTRQLLDIRSA 303

Query: 1263 KHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPT 1084
             H++N I QDA +A+ E++ KIRESIRYV+SS+ VQ KFNE+A     NT+K L LD P 
Sbjct: 304  AHLINSIAQDAMEALQEVIQKIRESIRYVRSSQVVQAKFNEIAQHATINTQKVLFLDFPV 363

Query: 1083 QWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSS 904
            QW STY MLE AVEY+SAFS  Q+ D  Y+   S+ EWE A  +T YLK+  E+  V S 
Sbjct: 364  QWKSTYLMLETAVEYRSAFSLFQDHDPSYSSTLSDEEWEWATSVTGYLKLLVEITNVFSG 423

Query: 903  VKCLTSNHYFSEMSDIHLNLIEWCKISNC-ISSVAMKSKCKFDAYWNICSLKLAIAAVLD 727
             K  T+N YF E+ D H+ LI+WC+ S+  +S +AMK K KFD YW  CSL LA+AAVLD
Sbjct: 424  NKFPTANVYFPEICDAHIQLIDWCRSSDSFLSPMAMKMKAKFDKYWGKCSLALALAAVLD 483

Query: 726  PRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNGVF 547
            PRFKMKLVEYYY+ IYGS A ++IK VS  I++L+N YSICST+  +DQG A    + + 
Sbjct: 484  PRFKMKLVEYYYSLIYGSTALERIKEVSDGIKELFNAYSICSTM--IDQGSALP-GSSLP 540

Query: 546  RSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPI 367
             +  S  DRL+GFD+FLHETS S  M ++LDKYLEEP+FPRN DFNIL+WWKV+ P++PI
Sbjct: 541  STSCSSRDRLKGFDRFLHETSQSQSMTSDLDKYLEEPIFPRNSDFNILNWWKVHMPRYPI 600

Query: 366  LSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSELGQV 190
            LSMMARDVLG PMS  ++P  A  TGGR LDS +SSL+PD  +ALICT DWL++E G +
Sbjct: 601  LSMMARDVLGTPMS-TLAPELAFTTGGRVLDSSRSSLNPDTREALICTQDWLRNESGDL 658


>ref|XP_007213601.1| hypothetical protein PRUPE_ppa002416mg [Prunus persica]
            gi|462409466|gb|EMJ14800.1| hypothetical protein
            PRUPE_ppa002416mg [Prunus persica]
          Length = 675

 Score =  690 bits (1780), Expect = 0.0
 Identities = 353/664 (53%), Positives = 471/664 (70%), Gaps = 15/664 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            ME+P + S I+K KRLTS VWN FERV++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEIPIE-SAIKKPKRLTSIVWNHFERVRKADICYAVCVHCNKKLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQNQFD--- 1804
            RCL+R+N DV+Q+L  + +K+     L N   D+   +        + FDQ+  + D   
Sbjct: 60   RCLKRSNFDVSQLLAAKRRKKDNTVGLANINCDEAQRKDEYMKPALIKFDQDLKKDDIVT 119

Query: 1803 ---PKFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                KFD ++SR DLARMIILH YPL MV+HVGFK FV+NLQPLF ++ ++  +  C++I
Sbjct: 120  IASGKFDNDRSRLDLARMIILHGYPLTMVDHVGFKVFVKNLQPLFEVVPNNDVEHFCMEI 179

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEK++VY  ++ + GRI+L+V  W+S ++  YLCLT+HYI E W L+K++LNFV +DP
Sbjct: 180  YRKEKRQVYQAINSLQGRINLSVEMWSSPENVEYLCLTAHYIDEDWKLQKKVLNFVTLDP 239

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            + TE +LSE I  CLMDWDI  KLF+ T D CS++++ VLRIKDR+SQ+R L  +GQLF 
Sbjct: 240  THTEDSLSEVISKCLMDWDIHSKLFAFTLDDCSTDDDIVLRIKDRISQSRPLAGHGQLFD 299

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            +  A H+LN I+QD  +A+ E++ KIR S ++V+SS+ VQ KFNE+A QV  N+++ L L
Sbjct: 300  IRSAAHLLNSIVQDVLEALREVIQKIRGSFKHVRSSQVVQGKFNEIAQQVGINSERRLIL 359

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            D P +W STY MLE A+EY+ AFS LQE D  Y  + ++ EWE  + +T YLK+  E+  
Sbjct: 360  DFPVRWNSTYIMLETALEYRGAFSLLQEHDPSYASSLTDTEWEWTSFVTGYLKLLVEITN 419

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            V S  K  T++ YF E+  +H+ LIEWCK   + +S +A+K K KFD YW+ CSL LA+A
Sbjct: 420  VFSGNKSPTASIYFPEICHVHIQLIEWCKSPDDFLSCMALKMKAKFDKYWSKCSLALAVA 479

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
            A+LDPRFKMKLVEYYY+QIYGS A D+IK VS  I++L++ YSICST+  +DQG A  + 
Sbjct: 480  AILDPRFKMKLVEYYYSQIYGSTALDRIKEVSDGIKELFDAYSICSTM--VDQGSA--LP 535

Query: 558  NGVFRSGNSDT-DRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNS 382
                 S +SDT DRL+GFDKFL+ETS S  + ++LDKYLEEPVFPRN DFNIL+WWKV++
Sbjct: 536  GSSLPSTSSDTRDRLKGFDKFLYETSQSQNVISDLDKYLEEPVFPRNCDFNILNWWKVHT 595

Query: 381  PKFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSE 202
            P++PILSMMARDVLG PMS  V+P +A   GGR LD  +SSL+PDI QAL+CT DWLQ E
Sbjct: 596  PRYPILSMMARDVLGTPMS-TVAPESAFSIGGRVLDQCRSSLNPDIRQALVCTQDWLQVE 654

Query: 201  LGQV 190
            L  V
Sbjct: 655  LKDV 658


>ref|XP_007022002.1| BED zinc finger,hAT family dimerization domain isoform 5 [Theobroma
            cacao] gi|508721630|gb|EOY13527.1| BED zinc finger,hAT
            family dimerization domain isoform 5 [Theobroma cacao]
          Length = 639

 Score =  684 bits (1765), Expect = 0.0
 Identities = 352/639 (55%), Positives = 461/639 (72%), Gaps = 14/639 (2%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            MEV ++ S I+K KRLTS VWN FERV++ D+  A+C HC KKL        +HLRNHL 
Sbjct: 1    MEVANE-SAIKKPKRLTSVVWNHFERVRKADVCYAVCVHCNKKLSGSSNSGTTHLRNHLM 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ------LAVTFDQEQ------N 1813
            RCL+R+N DV+Q+L  + +K+     + N  YD+   +        V ++Q+Q      N
Sbjct: 60   RCLKRSNYDVSQLLAAKRRKKDNTLTIANISYDEGQRKEDYIKPTIVKYEQDQRKDEVFN 119

Query: 1812 QFDPKFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQI 1633
                +FDQE+SR DLARMIILH YPLAMVEHVGFK FV+NLQPLF ++ + + +  C++I
Sbjct: 120  LGSSRFDQERSRLDLARMIILHGYPLAMVEHVGFKVFVKNLQPLFDLVPNSTIELFCMEI 179

Query: 1632 YEKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDP 1453
            Y KEKQKVYDML K+ GRI+L V  W+S ++S YLCLT+HYI + W L+K+ILNFV +D 
Sbjct: 180  YGKEKQKVYDMLSKLQGRINLAVEMWSSPENSNYLCLTAHYIDDDWKLQKKILNFVTLDS 239

Query: 1452 S-TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFH 1276
            S TE  LSE I+ CLMDWDI+ KLF++TFD CS+N++ VLRIK+++S+NR  LSNGQL  
Sbjct: 240  SHTEDLLSEVIMKCLMDWDIECKLFAMTFDDCSTNDDIVLRIKEQISENRPRLSNGQLLD 299

Query: 1275 VCCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRL 1096
            V  A H+LN ++QDA +A+  ++ KIR S+RYVKSS+++Q KFNE+A Q    ++KSL L
Sbjct: 300  VRSAAHILNSLVQDAVEALQVVIQKIRGSVRYVKSSQSIQGKFNEIAQQTGIISQKSLVL 359

Query: 1095 DCPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNK 916
            DCP +W STY MLE AVEY++AF HL E D    +A S+ EWE A+ +T YLK+F E+  
Sbjct: 360  DCPIRWNSTYVMLETAVEYRNAFCHLPELDP--DLALSDDEWEWASSVTGYLKLFIEIIN 417

Query: 915  VLSSVKCLTSNHYFSEMSDIHLNLIEWCKI-SNCISSVAMKSKCKFDAYWNICSLKLAIA 739
            V S  KC T+N YF E+  +H+ LIEWCK   N +SS+A K K KFD YW+ CSL LA+A
Sbjct: 418  VFSGNKCPTANIYFPEICHVHIQLIEWCKSPDNFLSSLAAKMKAKFDKYWSKCSLALAVA 477

Query: 738  AVLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ 559
            A+LDPRFKMKLVEYYY+QIYGS A ++IK VS  I++L+N YSICSTL  +D+G A    
Sbjct: 478  AILDPRFKMKLVEYYYSQIYGSTALERIKEVSDGIKELFNAYSICSTL--IDEGTALP-G 534

Query: 558  NGVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSP 379
            + +  S N   DRL+GFDKFLHET+ S    ++L+KYLEE VFPRN DFNIL+WW+V++P
Sbjct: 535  SSLPSSSNDSRDRLKGFDKFLHETAQSQSAISDLEKYLEEAVFPRNCDFNILNWWRVHTP 594

Query: 378  KFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQS 262
            ++PILSMMARDVLG PMS  V+  +A + GGR LDS +S
Sbjct: 595  RYPILSMMARDVLGTPMS-TVAQESAFNAGGRVLDSCRS 632


>ref|XP_007216990.1| hypothetical protein PRUPE_ppa002590mg [Prunus persica]
            gi|462413140|gb|EMJ18189.1| hypothetical protein
            PRUPE_ppa002590mg [Prunus persica]
          Length = 655

 Score =  682 bits (1759), Expect = 0.0
 Identities = 354/652 (54%), Positives = 464/652 (71%), Gaps = 13/652 (1%)
 Frame = -1

Query: 2115 SVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLRRTN 1936
            +VI K+ RL S VWNDF+R+K+GD  +A+C+HCKKKL        SHLRNHL RC RR+N
Sbjct: 6    AVIVKSTRLKSVVWNDFDRIKKGDKCIAVCRHCKKKLSGSSTSGTSHLRNHLIRCQRRSN 65

Query: 1935 RDVTQVLVVREKKRHVATEL---ENFKYDQEHSQLAVTFDQEQNQFD------PKFDQEQ 1783
              + Q+   REKK+   T L   +  K D+  + + + F+QEQ + D        FDQ +
Sbjct: 66   LGIPQLFAAREKKKE-GTYLNLDQEQKKDEAFNLVNIRFEQEQTKDDIINYGSGNFDQRR 124

Query: 1782 SRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEKQKVYD 1603
            SRFDLARMIILH YPL MVEHVGF+ FV+NLQPLF +++S+  +ADC++IY KEKQKV D
Sbjct: 125  SRFDLARMIILHGYPLDMVEHVGFRVFVKNLQPLFELVTSERVEADCMEIYGKEKQKVKD 184

Query: 1602 MLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEHALSET 1426
            ML K+PG+ISLTV  W S   + YLCLT+HYI E+W L K+ILNF+++D S TE   SE 
Sbjct: 185  MLGKLPGKISLTVDMWASLDGTEYLCLTAHYIDESWQLNKKILNFIVIDSSHTEDKHSEI 244

Query: 1425 IISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAKHVLNV 1246
            I+  LMDWDIDR LFS+TFDS S+N+N V RI+DRLSQN+LL  +GQLF V CA +V+N+
Sbjct: 245  IMESLMDWDIDRNLFSMTFDSYSTNDNVVFRIRDRLSQNKLLSCDGQLFDVRCAANVINM 304

Query: 1245 IIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQWMSTY 1066
            + QDA +A+ E+  KIR SIRYVKSS+ +Q KFN + HQV   +++ L LD P QW STY
Sbjct: 305  MSQDALEALCEMTDKIRGSIRYVKSSQVIQEKFNSIVHQVGGESRRCLCLDNPLQWNSTY 364

Query: 1065 TMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSVKCLTS 886
             M+E A+EY+ AF+ LQE D  Y M PS+ EW+R N ITSYLK+F  V  V +  K  T+
Sbjct: 365  VMVEIALEYRDAFALLQENDPVYAMCPSDVEWDRVNIITSYLKLFVGVTNVFTRFKSPTA 424

Query: 885  NHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAVLDPRFKMK 709
            N YF E+ +++  L EWCK   + ISS+A+K + KF+ YW  CSL LA+A +LDPRFKMK
Sbjct: 425  NLYFPELCEVYSQLNEWCKNADDYISSLALKMRSKFEEYWMRCSLSLAVAVMLDPRFKMK 484

Query: 708  LVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEV--QNGVFRSGN 535
             V+YYY Q +GS A  +I  V + ++ LYNE+S C  LA +DQGLA +V   + +  SG 
Sbjct: 485  PVDYYYAQFFGSGAPGRISDVFECVKTLYNEHSTC--LAYVDQGLAWQVGGSSRLPGSGR 542

Query: 534  SDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPILSMM 355
               DRL GFDKFLHET+      ++LDKYLEEP+FPRN +F+IL+WWKV++P++PILSMM
Sbjct: 543  DLRDRLTGFDKFLHETTEIDGTKSDLDKYLEEPLFPRNAEFDILNWWKVHAPRYPILSMM 602

Query: 354  ARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            AR+VLGIP+S  V   +  +TGGR LD D SS++P  +QAL+C  DW++SEL
Sbjct: 603  ARNVLGIPVS-KVPIDSTFNTGGRVLDRDWSSMNPATIQALMCAQDWIRSEL 653


>ref|XP_006407043.1| hypothetical protein EUTSA_v10020233mg [Eutrema salsugineum]
            gi|557108189|gb|ESQ48496.1| hypothetical protein
            EUTSA_v10020233mg [Eutrema salsugineum]
          Length = 662

 Score =  645 bits (1664), Expect = 0.0
 Identities = 319/654 (48%), Positives = 447/654 (68%), Gaps = 13/654 (1%)
 Frame = -1

Query: 2124 SQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLR 1945
            S   +++K+KRLTS VWN FERV++ D+  A+C  C KKL        +HLRNHL RCL+
Sbjct: 4    SNEIILQKSKRLTSVVWNYFERVRKADVCYAVCIQCNKKLSGSSNSGTTHLRNHLMRCLK 63

Query: 1944 RTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ---LAVTFDQEQNQFD--------PK 1798
            RTN D++Q+L  + +K+     +    +D+   +   L   FDQE    +         +
Sbjct: 64   RTNHDMSQLLTPKRRKKENPVTVATINFDEAQGKDDYLRPKFDQEPRSNELVLSRGSGGR 123

Query: 1797 FDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEK 1618
            F QE+S+ DLARMIILH YPLAMV+HVGFK F  NLQPLF  + + + +  C++IY +EK
Sbjct: 124  FSQERSQIDLARMIILHGYPLAMVDHVGFKVFARNLQPLFEAVPNSTIEESCMEIYIREK 183

Query: 1617 QKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEH 1441
            Q+V   L+ + G+I+L+V  W+S  ++ Y+CL SHYI E W L++ +LNF+ +DPS TE 
Sbjct: 184  QRVQHTLNNLYGKINLSVEMWSSKDNANYVCLASHYIDEEWRLQRNVLNFITLDPSHTED 243

Query: 1440 ALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAK 1261
             LSE II CLM+W ++ KLF++TFD+ S N+  VLRIKD +SQ+  +L NGQL+ +  A 
Sbjct: 244  MLSEVIIRCLMEWSLETKLFAVTFDNFSVNDEIVLRIKDHMSQSSPILINGQLYELKSAN 303

Query: 1260 HVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQ 1081
            H+LN ++QD  +A+ +++ KIR S+RYVKSS++ Q +FNE+A     N++K L LD    
Sbjct: 304  HLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQSTQARFNEIAQLAGINSEKILVLDSLGT 363

Query: 1080 WMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSV 901
            W STY MLE  +EY+ AF HL++ D G+  + ++ EWE    +T YLK+  E+    S  
Sbjct: 364  WNSTYAMLETVLEYQGAFCHLRDHDHGFDSSLTDEEWEWTRSVTGYLKLVFEIAADFSGN 423

Query: 900  KCLTSNHYFSEMSDIHLNLIEWCKISNC-ISSVAMKSKCKFDAYWNICSLKLAIAAVLDP 724
            +C T+N YF+EM DIH+ LIEWCK  +  +SS+A K K KFD YWN CSL LAIAA+LDP
Sbjct: 424  RCPTANVYFAEMCDIHIQLIEWCKNQDSFLSSLAAKMKAKFDEYWNKCSLVLAIAAILDP 483

Query: 723  RFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNGVFR 544
            RFKMKLVEYYY++IYGS A D+IK VS  +++L + YS+CS++   D   +    +G+ R
Sbjct: 484  RFKMKLVEYYYSKIYGSVALDRIKEVSNGVKELLDAYSMCSSIDGEDSSFS---GSGLAR 540

Query: 543  SGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPIL 364
                  DRL+GFDKFLHETS +    ++LDKYL EP+FPR+ +FNIL++WKV++P++PIL
Sbjct: 541  GSMDTRDRLKGFDKFLHETSQNQNTTSDLDKYLSEPIFPRSGEFNILNYWKVHTPRYPIL 600

Query: 363  SMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSE 202
            SMMARD+LG PMSI ++P +  ++G   +D  +SSLSPDI QAL C HDWL +E
Sbjct: 601  SMMARDILGTPMSI-LAPDSTFNSGRPVIDESKSSLSPDIRQALFCAHDWLSTE 653


>dbj|BAB02646.1| Ac transposase-like protein [Arabidopsis thaliana]
            gi|18176330|gb|AAL60024.1| unknown protein [Arabidopsis
            thaliana] gi|20465375|gb|AAM20091.1| unknown protein
            [Arabidopsis thaliana]
          Length = 662

 Score =  634 bits (1634), Expect = e-179
 Identities = 316/661 (47%), Positives = 446/661 (67%), Gaps = 13/661 (1%)
 Frame = -1

Query: 2124 SQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLR 1945
            S   +++K+KRLTS VWN FERV++ D+  A+C  C KKL        +HLRNHL RCL+
Sbjct: 5    SNEIILQKSKRLTSVVWNYFERVRKADVCYAVCIQCNKKLSGSSNSGTTHLRNHLMRCLK 64

Query: 1944 RTNRDVTQVLVVREKKRHVATELENFKYDQEHSQ---LAVTFDQEQNQFDP--------K 1798
            RTN D++Q+L  + +K+     +    +D   ++   L   FDQ+Q + +         +
Sbjct: 65   RTNHDMSQLLTPKRRKKENPVTVATINFDDGQAKEEYLRPKFDQDQRRDEVVLSRGSGGR 124

Query: 1797 FDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEK 1618
            F QE+S+ DLARMIILH YPLAMV+HVGFK F  NLQPLF  + + + +  C++IY +EK
Sbjct: 125  FSQERSQVDLARMIILHNYPLAMVDHVGFKVFARNLQPLFEAVPNSTIEDSCMEIYIREK 184

Query: 1617 QKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEH 1441
            Q+V   L+ + G+++L+V  W+S  +S Y+CL S+YI E W L + +LNF+ +DPS TE 
Sbjct: 185  QRVQHTLNHLYGKVNLSVEMWSSRDNSNYVCLASNYIDEEWRLHRNVLNFITLDPSHTED 244

Query: 1440 ALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAK 1261
             LSE II CL++W ++ KLF++TFDS S NE  VLRIKD +SQ+  +L NGQLF +  A 
Sbjct: 245  MLSEVIIRCLIEWSLENKLFAVTFDSVSVNEEIVLRIKDHMSQSSQILINGQLFELKSAA 304

Query: 1260 HVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQ 1081
            H+LN +++D  +A+ +++ KIR S+RYVKSS++ Q++FNE+A     N++K L LD    
Sbjct: 305  HLLNSLVEDCLEAMRDVIQKIRGSVRYVKSSQSTQVRFNEIAQLAGINSQKILVLDSIVN 364

Query: 1080 WMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSV 901
              ST+ MLE  +EYK AF HL++ D  +  + ++ EWE    +T YLK+  ++    S+ 
Sbjct: 365  SNSTFVMLETVLEYKGAFCHLRDHDHSFDSSLTDEEWEWTRYVTGYLKLVFDIASDFSAN 424

Query: 900  KCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAVLDP 724
            KC T+N YF+EM DIH+ L+EWCK   N +SS+A   K KFD YWN CSL LAIAA+LDP
Sbjct: 425  KCPTANVYFAEMCDIHIQLVEWCKNQDNFLSSLAANMKAKFDEYWNKCSLVLAIAAILDP 484

Query: 723  RFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNGVFR 544
            RFKMKLVEYYY++IYGS A D+IK VS  +++L + YS+CS +   D        +G+ R
Sbjct: 485  RFKMKLVEYYYSKIYGSTALDRIKEVSNGVKELLDAYSMCSAIVGEDSFSG----SGLGR 540

Query: 543  SGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPIL 364
            +     DRL+GFDKFLHETS +    T+LDKYL EP+FPR+ +FNIL++WKV++P++PIL
Sbjct: 541  ASMDTRDRLKGFDKFLHETSQNQNTTTDLDKYLSEPIFPRSGEFNILNYWKVHTPRYPIL 600

Query: 363  SMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSELGQVI* 184
            S++ARD+LG PMSI  +P +  ++G   +   QSSL+PDI QAL C HDWL +E    I 
Sbjct: 601  SLLARDILGTPMSI-CAPDSTFNSGTPVISDSQSSLNPDIRQALFCAHDWLSTETEGTIS 659

Query: 183  N 181
            N
Sbjct: 660  N 660


>ref|XP_006297141.1| hypothetical protein CARUB_v10013145mg [Capsella rubella]
            gi|565479004|ref|XP_006297142.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
            gi|482565850|gb|EOA30039.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
            gi|482565851|gb|EOA30040.1| hypothetical protein
            CARUB_v10013145mg [Capsella rubella]
          Length = 667

 Score =  633 bits (1633), Expect = e-179
 Identities = 318/659 (48%), Positives = 448/659 (67%), Gaps = 13/659 (1%)
 Frame = -1

Query: 2124 SQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKRCLR 1945
            S   +++K+KRLTS VWN FERV++ D+  A+C  C KKL        +HLRNHL RCL+
Sbjct: 4    SNEIILQKSKRLTSVVWNYFERVRKADVCYAVCIQCNKKLSGSSNSGTTHLRNHLMRCLK 63

Query: 1944 RTNRDVTQVLVVREKKRHVATELENFKYDQ---EHSQLAVTFDQEQNQFDP--------K 1798
            RTN D++Q+L  + +K+     +    +D+   +   L   FDQEQ + +         +
Sbjct: 64   RTNHDMSQLLTPKRRKKENPVTVATISFDEGQPKDEYLRPKFDQEQRRDEVVLSRGSGGR 123

Query: 1797 FDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEK 1618
            F QE+S+ DLARMII+H YPLAMV+HVGFK F  NLQPLF  + + + +  C++IY +EK
Sbjct: 124  FSQERSQVDLARMIIMHGYPLAMVDHVGFKVFARNLQPLFEAVPNSTIEDSCMEIYMREK 183

Query: 1617 QKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS-TEH 1441
            Q+V   L+ + G+I+L+V  W+S  ++ Y+CL SHYI E W L + +LNF+ +DPS TE 
Sbjct: 184  QRVQHTLNNLYGKINLSVEMWSSRDNANYVCLASHYIDEEWRLHRNVLNFITLDPSHTED 243

Query: 1440 ALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAK 1261
             LSE II CL++W ++ KLF++TFDS S NE  VLRIKD +SQ+  +L NGQLF +  A 
Sbjct: 244  MLSEVIIRCLIEWRLESKLFAVTFDSFSVNEEIVLRIKDHMSQSSQILINGQLFELKSAA 303

Query: 1260 HVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQ 1081
            H+LN ++QD  +A+ +++ KIR S+RYVKSS++ Q++FNE+A     N+ K L LD    
Sbjct: 304  HLLNSLVQDCLEAMRDVIQKIRGSVRYVKSSQSAQVRFNEIAQLAGINSHKILVLDSLVN 363

Query: 1080 WMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSV 901
              STY MLE  +EYK AF HL++ D G+  + ++ EWE    +T YLK+  ++    S  
Sbjct: 364  SNSTYVMLETVLEYKGAFCHLRDHDHGFDSSLTDEEWEWTRYVTGYLKLVFDIASDFSGN 423

Query: 900  KCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAVLDP 724
            KC T+N YF EM DIH+ LIEWCK   N +SS+A   K KFD YWN CSL LAIAA+LDP
Sbjct: 424  KCPTANVYFPEMCDIHIQLIEWCKNQDNFLSSLAASMKAKFDEYWNKCSLVLAIAAILDP 483

Query: 723  RFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNGVFR 544
            R+KMKLVEYYY++IYGS A D+IK VS  +++L + YS+CS +   D   +    +G+ R
Sbjct: 484  RYKMKLVEYYYSKIYGSTALDRIKEVSNGVKELLDAYSMCSAIVGEDSSFS---GSGLGR 540

Query: 543  SGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPIL 364
            + ++  DRL+GFDKFLHETS +    ++LDKYL EP FPR+ +FNIL++WKV++P++PIL
Sbjct: 541  AMDT-RDRLKGFDKFLHETSQNQNTTSDLDKYLSEPNFPRSGEFNILNYWKVHTPRYPIL 599

Query: 363  SMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSELGQVI 187
            SMMARD+LG P+SI ++P +  ++G   +   QSSL+PDI QAL C HDWL +E  +++
Sbjct: 600  SMMARDILGTPISI-IAPDSTFNSGTPMIADSQSSLNPDIRQALFCAHDWLSTETEEML 657


>gb|EYU28909.1| hypothetical protein MIMGU_mgv1a002591mg [Mimulus guttatus]
          Length = 656

 Score =  622 bits (1604), Expect = e-175
 Identities = 318/658 (48%), Positives = 438/658 (66%), Gaps = 13/658 (1%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            ME+P +  VI  + RL S VWNDF+RVK+G+   AIC+HCK+ L        SHLRNHL 
Sbjct: 1    MEIPEE-GVIVNSSRLKSVVWNDFDRVKKGETFAAICRHCKRILSGSSTSGTSHLRNHLI 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRHVATELENFKYDQEHSQLAVTFDQEQNQFDP-------- 1801
            RC RR+N D+TQ+L  R K++     + +F Y+Q   +  +      N  +         
Sbjct: 60   RCRRRSNHDITQLLT-RGKRKQNTLAITSFSYNQSPIKNEIVTVASMNMEEGVKVGNNNT 118

Query: 1800 ---KFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIY 1630
                 D  +S+ DLARMII+H YPL MVE +GFK FV NLQPLF ++++   + DC++IY
Sbjct: 119  GVLNLDHRRSQLDLARMIIMHGYPLGMVEDIGFKIFVRNLQPLFDLVTASGVEDDCIEIY 178

Query: 1629 EKEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS 1450
             KE+QKVY+ LDK+PG++SL+   W+++  + YLCL +HYI ++W L+K+ILNF+++DP 
Sbjct: 179  NKERQKVYEELDKLPGKVSLSADRWSTNGGTEYLCLIAHYIDDSWELKKKILNFLVIDPD 238

Query: 1449 -TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHV 1273
              E  LSE I++ L  WDIDRKLFS+T D+ ++ E  V RI+D+L Q+R L+  GQLF V
Sbjct: 239  QAEETLSELIMTSLRKWDIDRKLFSLTIDNRATYEKTVCRIRDQLCQHRFLMCEGQLFDV 298

Query: 1272 CCAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLD 1093
             CA   + +++QD  +   EI  K+RE+IRYVK S+A Q KFNE+   V  N +KSL +D
Sbjct: 299  RCAASTVKLLVQDVLETSREITNKVRETIRYVKGSQATQEKFNEIVQLVGINCQKSLSVD 358

Query: 1092 CPTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKV 913
             P QW ST  MLEAA+EYK AF  LQE D G++M PS+ +W+R   ITS  K F EV+ V
Sbjct: 359  NPFQWNSTCMMLEAALEYKEAFPQLQEHDPGFSMCPSDIDWDRLRAITSIFKFFHEVSNV 418

Query: 912  LSSVKCLTSNHYFSEMSDIHLNLIEWC-KISNCISSVAMKSKCKFDAYWNICSLKLAIAA 736
             +  K +TSN YF+E+ DIHL LI WC K    ISS+A+K K KFD YW  CSL +AIAA
Sbjct: 419  FAGRKHITSNSYFNEICDIHLQLIGWCQKSDEFISSLALKLKSKFDEYWKKCSLIMAIAA 478

Query: 735  VLDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQN 556
            +LDPR+KM+LVEYYY QIYG +A D I +V   ++ LY+ ++I S L++  Q  A E   
Sbjct: 479  ILDPRYKMQLVEYYYPQIYGDSAPDCIDIVKNCMKALYSGHAIYSPLSAHGQSSASESSV 538

Query: 555  GVFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPK 376
             + +      D+L GFD+FLHETS S    ++LDKYLEEP+FPR    ++L+WWKV+ P+
Sbjct: 539  SIVK------DKLTGFDRFLHETSVSQNTKSDLDKYLEEPLFPRKNVISVLNWWKVHEPR 592

Query: 375  FPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSE 202
            +P+LSMMAR++LGIP+S  V+  +  DTG R LD   S++  D LQAL+C+ DW+ S+
Sbjct: 593  YPVLSMMARNILGIPIS-KVAVESLFDTGERALDHCWSTMKSDTLQALMCSRDWISSD 649


>gb|EPS60750.1| hypothetical protein M569_14050, partial [Genlisea aurea]
          Length = 647

 Score =  611 bits (1576), Expect = e-172
 Identities = 318/658 (48%), Positives = 433/658 (65%), Gaps = 12/658 (1%)
 Frame = -1

Query: 2136 MEVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLK 1957
            ME+P + +VI  T RL S VWNDF+RVK+GD  VAIC+HCK+ L        SHLRNHL 
Sbjct: 1    MELPEE-AVIVNTSRLKSVVWNDFDRVKKGDTFVAICRHCKRILSGSSSSGTSHLRNHLI 59

Query: 1956 RCLRRTNRDVTQVLVVREKKRH---------VATELENFKYDQEHSQL-AVTFDQEQNQF 1807
            RC RR N D+TQ L   ++K+           A  ++N      HS    V         
Sbjct: 60   RCRRRLNHDITQYLTRGKRKQQQQSTTHPQSAAAAVKNEIVTVAHSNYEGVKAGNVNVGG 119

Query: 1806 DPKFDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYE 1627
               FD  +S+ DLARMIILH YPL +V+ +GFK FV NLQP F +++    +A CL+IY+
Sbjct: 120  SLNFDCRRSQLDLARMIILHGYPLNLVDDIGFKAFVRNLQPFFDLLTVGGVEAHCLEIYK 179

Query: 1626 KEKQKVYDMLDKVPGRISLTVGTWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPS- 1450
            +EKQKVY+ LDK+PG++SL++  W ++  + YLC  +HYI ++W L+K+ILNF++++PS 
Sbjct: 180  REKQKVYEELDKLPGKVSLSIDRWVTNAGTEYLCPVAHYIDDSWELKKKILNFLVIEPSQ 239

Query: 1449 TEHALSETIISCLMDWDIDRKLFSITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVC 1270
             E  LSE  ++CL  WDIDRKLFS+T D CSS ++ V +I+D+L Q+R L+  GQLF V 
Sbjct: 240  AEEMLSELTMTCLRSWDIDRKLFSLTIDGCSSYDHIVSKIRDQLCQHRFLMCEGQLFDVR 299

Query: 1269 CAKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDC 1090
            CA   + V++Q+  +   E+  K+RE +RYVK S A   KFNE+   +  N++K L +D 
Sbjct: 300  CATSTVRVLVQEVLETSREMTKKVREIVRYVKGSRAAYEKFNEIVRLLGVNSQKVLSIDN 359

Query: 1089 PTQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVL 910
            P +W ST TMLEAA+EYK  F  LQE D  ++  PS  +W+R   I   LK F EV++V 
Sbjct: 360  PLKWNSTSTMLEAALEYKEVFPQLQELDPEFSTWPSGMDWDRLRAIAGILKFFIEVSEVF 419

Query: 909  SSVKCLTSNHYFSEMSDIHLNLIEWC-KISNCISSVAMKSKCKFDAYWNICSLKLAIAAV 733
               K +T+N +F+E+ DIHL LIEWC K  + ISS+A+K K  FD YW  CSL +A+AA+
Sbjct: 420  VGGKHITANSFFAEICDIHLKLIEWCQKSDDFISSLALKLKSVFDEYWKKCSLIMAVAAI 479

Query: 732  LDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNG 553
            LDPR+KMKLVEYYY QIYG +A + I++VS  ++ LYN + I S LA      A   +NG
Sbjct: 480  LDPRYKMKLVEYYYPQIYGDSAPECIEIVSNCMKSLYNGHIIYSPLA------AHASENG 533

Query: 552  VFRSGNSDTDRLRGFDKFLHETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKF 373
                G +  DRL GFD+FLHETS S    ++L+KYLE+P+FPRN D NILSWWKVN P++
Sbjct: 534  ----GAAAKDRLTGFDRFLHETSVSQNTKSDLEKYLEDPLFPRNNDLNILSWWKVNEPRY 589

Query: 372  PILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            P+LSMMAR++LGIP+S  VS     DTG + +D   ++L  + LQAL+C+ DWL +EL
Sbjct: 590  PVLSMMARNILGIPIS-KVSSDAVFDTGNKPIDHCWATLKSETLQALMCSQDWLHNEL 646


>gb|AAG52564.1|AC010675_12 unknown protein; 6859-4829 [Arabidopsis thaliana]
          Length = 676

 Score =  578 bits (1491), Expect = e-162
 Identities = 315/665 (47%), Positives = 442/665 (66%), Gaps = 20/665 (3%)
 Frame = -1

Query: 2133 EVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKR 1954
            E+    +VI K+ RL S VWNDF+RV++G+  +AIC+HCKK+L        SHLRNHL R
Sbjct: 13   EMDLSDAVIVKSGRLKSVVWNDFDRVRKGETYIAICRHCKKRLSGSSASGTSHLRNHLIR 72

Query: 1953 CLRRTNRDVTQVL--VVREKKRHVATELENFKYDQEHSQLAVTFDQEQNQFDPK------ 1798
            C RRTN +   V    V+ KK+ +A E    K ++  S + V ++ E+ + +        
Sbjct: 73   CRRRTNGNNNGVAQYFVKGKKKELANE--RIKDEEVLSVVNVRYEHEKEEHEDVNVVSMG 130

Query: 1797 FDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEK 1618
             DQ + RFDLARMIILH YPL+MVE VGF+ F+ NLQPLF +++ +  ++DC++IY KEK
Sbjct: 131  LDQRRCRFDLARMIILHGYPLSMVEDVGFRMFIGNLQPLFELVAFERVESDCMEIYAKEK 190

Query: 1617 QKVYDMLDKVPGRISLTVGTWTSSQDS-RYLCLTSHYISEAWLLEKRILNFVLVDPS-TE 1444
             K+++ LDK+PG+IS++V  W+ S DS  +LCL +HYI E W L+KR+LNF +VDPS + 
Sbjct: 191  HKIFEALDKLPGKISISVDVWSGSGDSDEFLCLAAHYIDEGWELKKRVLNFFMVDPSHSG 250

Query: 1443 HALSETIISCLMDWDIDRKLFSITFDSCSS-NENAVLRIKDRLSQNRLLLSNGQLFHVCC 1267
              L+E I++CLM+WDIDRKLFS+        +EN   +I+DRLSQN+ L   GQLF V C
Sbjct: 251  EMLAEVIMTCLMEWDIDRKLFSMASSHAPPFSENVASKIRDRLSQNKFLYCYGQLFDVSC 310

Query: 1266 AKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCP 1087
              +V+N ++QD+ +A  + +  IRESIRYVKSSE++Q +FN+   +    ++++L +D P
Sbjct: 311  GVNVINEMVQDSLEACCDTINIIRESIRYVKSSESIQDRFNQWIVETGAVSERNLCIDDP 370

Query: 1086 TQWMSTYTMLEAAVEYKSAFSHLQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLS 907
             +W ST TMLE A+E KSAFS + E D    + PS+ EWER   I  +LK+F EV    +
Sbjct: 371  MRWDSTCTMLENALEQKSAFSLMNEHDPDSVLCPSDLEWERLGTIVEFLKVFVEVINAFT 430

Query: 906  SVKCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAVL 730
               CL +N YF E+ DIHL LIEW K   + ISS+ +  + KFD +W+   L LAIA +L
Sbjct: 431  KSSCLPANMYFPEVCDIHLRLIEWSKNPDDFISSLVVNMRKKFDDFWDKNYLVLAIATIL 490

Query: 729  DPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQNGV 550
            DPRFKMKLVEYYY   YG++A++ I+ +S+ I+ LY+E+S+ S LAS +Q L  + QN  
Sbjct: 491  DPRFKMKLVEYYYPLFYGTSASELIEDISECIKLLYDEHSVGSLLASSNQAL--DWQNHH 548

Query: 549  FRS-----GNSDTDRLRGFDKFLHETSNS--SQMNTELDKYLEEPVFPRNMDFNILSWWK 391
             RS     G    DRL  FD++++ET+ +      ++L+KYLEEP+FPRN DF+IL+WWK
Sbjct: 549  HRSNGVAHGKEPDDRLTEFDRYINETTTTPGQDSKSDLEKYLEEPLFPRNSDFDILNWWK 608

Query: 390  VNSPKFPILSMMARDVLGIPMSINVSPGTACDTGGRGLDSDQ-SSLSPDILQALICTHDW 214
            V++PK+PILSMMAR+VL +PM    S   A +T  R   S+   SL P  +QAL+C  DW
Sbjct: 609  VHTPKYPILSMMARNVLAVPMLNVSSEEDAFETCQRRRVSETWRSLRPSTVQALMCAQDW 668

Query: 213  LQSEL 199
            +QSEL
Sbjct: 669  IQSEL 673


>ref|XP_006390942.1| hypothetical protein EUTSA_v10018229mg [Eutrema salsugineum]
            gi|557087376|gb|ESQ28228.1| hypothetical protein
            EUTSA_v10018229mg [Eutrema salsugineum]
          Length = 674

 Score =  573 bits (1478), Expect = e-161
 Identities = 311/666 (46%), Positives = 433/666 (65%), Gaps = 21/666 (3%)
 Frame = -1

Query: 2133 EVPSQPSVIRKTKRLTSAVWNDFERVKRGDMMVAICQHCKKKLXXXXXXXXSHLRNHLKR 1954
            E+    +VI K+ +L SAVWNDF+RV++G+  VAIC+HCKK+L        SHLRNHL R
Sbjct: 7    EMDLSDAVIVKSGKLKSAVWNDFDRVRKGETYVAICRHCKKRLSGSSASGTSHLRNHLIR 66

Query: 1953 CLRRT---NRDVTQVLVVREKKRH-----VATELENFKYDQEHSQLAVTFDQEQNQFDPK 1798
            C R+T   N  V+Q  V  +KK+      VA  +++  ++Q   +L    D         
Sbjct: 67   CRRKTTSSNGVVSQCFVRGKKKKEERLEEVANVVDDDDHEQRKDELVTGHDASVTVVSAG 126

Query: 1797 FDQEQSRFDLARMIILHEYPLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEK 1618
             DQ +SRFDLARM+ILH YPL MVE VGF+ F+ NLQPLF ++S +  ++DC++IY KEK
Sbjct: 127  LDQRRSRFDLARMMILHGYPLTMVEDVGFRVFIRNLQPLFELVSFERVESDCMEIYAKEK 186

Query: 1617 QKVYDMLDKVPGRISLTVGTWTSSQDS-RYLCLTSHYISEAWLLEKRILNFVLVDPS-TE 1444
             K+++ LDK+PG+IS++V  W+ S DS ++LCL +HYI E W L KR+LNF +VDPS  +
Sbjct: 187  HKIFEDLDKLPGKISISVDVWSGSDDSDQFLCLAAHYIDETWELRKRVLNFFMVDPSHND 246

Query: 1443 HALSETIISCLMDWDIDRKLFSI-TFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCC 1267
              L+E II+CLM+WDIDRKLFS+ +  S    EN   +I+DRLSQN+ L  NGQLF V C
Sbjct: 247  EMLAEVIITCLMEWDIDRKLFSMASSHSPPFGENVANKIRDRLSQNKFLYCNGQLFDVSC 306

Query: 1266 AKHVLNVIIQDAFDAIHEILYKIRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCP 1087
              +V+N + QD+     E + KIR  IRYVKSSE++Q  FN+   +    ++K L +D  
Sbjct: 307  GVYVINQMAQDSLQTCCETIDKIRNCIRYVKSSESIQESFNQWRAEAGAESEKDLCIDDS 366

Query: 1086 TQWMSTYTMLEAAVEYKSAFSHLQECDSGYTM-APSEREWERANDITSYLKMFDEVNKVL 910
            T+W +T +MLE  +E K+ F  ++E D    +  PS+ EWER   I  +LK+F EV    
Sbjct: 367  TRWDTTCSMLEIVLEQKNVFLLMKERDPDSCLPCPSDLEWERLETIVGFLKVFVEVANAF 426

Query: 909  SSVKCLTSNHYFSEMSDIHLNLIEWCK-ISNCISSVAMKSKCKFDAYWNICSLKLAIAAV 733
            +   CLT+N YF E+ DIHL LIEW K   + ISSVA+  +  FD +W+  +L LAIA +
Sbjct: 427  TKSSCLTANIYFPEICDIHLRLIEWSKNTDDFISSVAVNMRKLFDEFWDKNNLVLAIATI 486

Query: 732  LDPRFKMKLVEYYYTQIYGSNAADQIKVVSKAIRDLYNEYSICSTLASLDQGLACEVQ-- 559
            LDPRFKMKLVEYYY   Y S+A++ I+ +S+ I+ LYNE+S+ S LAS DQ L  +    
Sbjct: 487  LDPRFKMKLVEYYYPLFYDSSASELIEDISECIKALYNEHSVRSLLASSDQALDWQENHH 546

Query: 558  --NGVFRSGNSDTDRLRGFDKFLHETSNSSQ---MNTELDKYLEEPVFPRNMDFNILSWW 394
              NGV   G    +RL  FD+++H+T+ ++Q     ++LDKYLEEP+FPRN DF+IL+WW
Sbjct: 547  QPNGVVH-GIEPDNRLIEFDRYIHDTTTTTQGQDSRSDLDKYLEEPLFPRNTDFDILNWW 605

Query: 393  KVNSPKFPILSMMARDVLGIPMS-INVSPGTACDTGGRGLDSDQSSLSPDILQALICTHD 217
            KV++P++PILS MAR+VL +PMS ++           R +     SL P  +QAL+C  D
Sbjct: 606  KVHTPRYPILSTMARNVLAVPMSNVSSEEDAFKSCPRRQISETWWSLRPSTVQALMCAQD 665

Query: 216  WLQSEL 199
            W++SEL
Sbjct: 666  WIRSEL 671


>ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda]
            gi|548861481|gb|ERN18855.1| hypothetical protein
            AMTR_s00067p00136180 [Amborella trichopoda]
          Length = 685

 Score =  554 bits (1427), Expect = e-155
 Identities = 288/639 (45%), Positives = 415/639 (64%), Gaps = 6/639 (0%)
 Frame = -1

Query: 2097 KRLTSAVWNDFERVKRGDMMV-AICQHCKKKLXXXXXXXXSHLRNHLKRCLRRTNRDVTQ 1921
            ++  S+VW++FE+V+  D  V A C+HC + L        SHL+ HL RC +R +    Q
Sbjct: 63   RKTISSVWDEFEKVRSEDGSVKAACKHCHRNLVGSSAHGTSHLKRHLGRCAKRVHIGSGQ 122

Query: 1920 VLVVREKKRHVATELENFKYDQEHSQLAVTFDQEQNQFDPKFDQEQSRFDLARMIILHEY 1741
             LVV   K+  A+ + NFK                      FDQ +SR+DLA+MI+LHEY
Sbjct: 123  QLVVTCIKKGEASSV-NFK----------------------FDQGRSRYDLAKMILLHEY 159

Query: 1740 PLAMVEHVGFKRFVENLQPLFHIMSSDSAKADCLQIYEKEKQKVYDMLDKVPGRISLTVG 1561
            P +MVEH  F+ FV NLQPLF ++S  + ++D ++IY+KEK+K+Y+ L+K+P RISL+  
Sbjct: 160  PSSMVEHTTFRTFVRNLQPLFSMVSPSTIESDIIEIYKKEKKKLYEELEKIPSRISLSAN 219

Query: 1560 TWTSSQDSRYLCLTSHYISEAWLLEKRILNFVLVDPSTEHALSETIISCLMDWDIDRKLF 1381
             W+S Q+  YLCL +HYI +AW+L+K+IL+FV +   T  A++E ++  L  W++D+KLF
Sbjct: 220  IWSSCQNLEYLCLIAHYIDDAWVLQKQILSFVNLPSRTGGAIAEVLLDLLSQWNVDKKLF 279

Query: 1380 SITFDSCSSNENAVLRIKDRLSQNRLLLSNGQLFHVCCAKHVLNVIIQDAFDAIHEILYK 1201
            SIT +S S N+ A   ++ RLS+N  L   G++FH+CC  HV+N+++QD  + I E+L K
Sbjct: 280  SITLNSASYNDVAASSLRSRLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQK 339

Query: 1200 IRESIRYVKSSEAVQLKFNEMAHQVQTNTKKSLRLDCPTQWMSTYTMLEAAVEYKSAFSH 1021
            IRESI+YVK+S   Q +FNE+ +Q+   +K+++ LD PT+W STY ML+  +E + AFS 
Sbjct: 340  IRESIKYVKTSHVRQERFNEIINQLGIQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSC 399

Query: 1020 LQECDSGYTMAPSEREWERANDITSYLKMFDEVNKVLSSVKCLTSNHYFSEMSDIHLNLI 841
              +CDS   M PSE EWER  +I   LK+F ++       K  T+N YF E+  +HL L+
Sbjct: 400  FAQCDSMCNMVPSEDEWERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLV 459

Query: 840  EW-CKISNCISSVAMKSKCKFDAYWNICSLKLAIAAVLDPRFKMKLVEYYYTQIYGSNAA 664
            EW   ++  ISS+A+K K KFD YW I +L LAIA V+DPRFK+K VEY Y+QIYG++A 
Sbjct: 460  EWSMSLNKHISSMAIKMKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAE 519

Query: 663  DQIKVVSKAIRDLYNEYSICSTLAS-LDQGLACEVQNGVFRSGNSDTDR---LRGFDKFL 496
              I++V + + DL NEY     LAS  +  LA         SG  DT        F+KF+
Sbjct: 520  HHIRMVRQGVYDLCNEYESKEPLASNSESSLAVSASTS---SGGVDTHGKLWAMEFEKFV 576

Query: 495  HETSNSSQMNTELDKYLEEPVFPRNMDFNILSWWKVNSPKFPILSMMARDVLGIPMSINV 316
             E+S++    +ELD+YLEEP+FPRN+DFNI +WW++N+P+FP LS MARD+LGIP+S  V
Sbjct: 577  RESSSNQARKSELDRYLEEPIFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVS-TV 635

Query: 315  SPGTACDTGGRGLDSDQSSLSPDILQALICTHDWLQSEL 199
            +  +  D GG+ LD  +SSL P+ +QAL+C  DWL +EL
Sbjct: 636  TSDSTFDIGGQVLDQYRSSLLPETIQALMCAQDWLWNEL 674

