BLASTX nr result

ID: Akebia25_contig00020815 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00020815
         (2306 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633578.1| PREDICTED: uncharacterized protein LOC100852...   664   0.0  
ref|XP_007049744.1| DNA binding protein, putative isoform 1 [The...   648   0.0  
emb|CBI24867.3| unnamed protein product [Vitis vinifera]              644   0.0  
ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609...   607   e-171
ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609...   607   e-171
ref|XP_007049745.1| DNA binding protein, putative isoform 2 [The...   602   e-169
ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Popu...   590   e-165
ref|XP_004303006.1| PREDICTED: uncharacterized protein LOC101299...   576   e-161
ref|XP_006416501.1| hypothetical protein EUTSA_v10006802mg [Eutr...   557   e-156
ref|XP_004149225.1| PREDICTED: uncharacterized protein LOC101210...   557   e-156
ref|XP_006304420.1| hypothetical protein CARUB_v10010997mg [Caps...   552   e-154
ref|XP_004168803.1| PREDICTED: uncharacterized LOC101210135 [Cuc...   545   e-152
ref|NP_564086.1| transducin/WD-40 repeat-containing protein [Ara...   543   e-151
ref|XP_002533545.1| DNA binding protein, putative [Ricinus commu...   535   e-149
ref|XP_002890343.1| predicted protein [Arabidopsis lyrata subsp....   531   e-148
ref|XP_006588830.1| PREDICTED: uncharacterized protein LOC100816...   525   e-146
ref|XP_007133414.1| hypothetical protein PHAVU_011G176600g [Phas...   517   e-143
ref|XP_007145328.1| hypothetical protein PHAVU_007G229800g [Phas...   513   e-142
ref|XP_007203691.1| hypothetical protein PRUPE_ppa024767mg [Prun...   510   e-142
ref|XP_004515185.1| PREDICTED: uncharacterized protein LOC101510...   508   e-141

>ref|XP_003633578.1| PREDICTED: uncharacterized protein LOC100852537 [Vitis vinifera]
          Length = 942

 Score =  664 bits (1712), Expect = 0.0
 Identities = 349/632 (55%), Positives = 431/632 (68%), Gaps = 30/632 (4%)
 Frame = +2

Query: 104  EPLDDFHGGSQCVVALSAELLEDS----SVTGADKNTQELAMLKSHDKDIVLETATHEGC 271
            E LD     +Q +  L+ +  E+S    ++ G   ++ E ++ +  +K         +G 
Sbjct: 323  ESLDGLDCENQLLQPLAVQFPENSCKSFAIDGLSTSSHEYSVQECANKQ-------EKGF 375

Query: 272  IHAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANST 451
               +  C ++  TPT +R+ K K R  +Y++  S  L TQN+++ES  A   TH  +   
Sbjct: 376  NQVMAACNSAPKTPTERRRSKRKTRVVNYSDESSLPLSTQNKNKESSPANFQTHINSEEH 435

Query: 452  PLENITD-----SF-------HLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKH 595
            P+ +  D     SF        +PNDV+LPR+VLCLAHNGKVAWDVKWRP +  DLECKH
Sbjct: 436  PMMSSDDMPQNSSFGISSANDSIPNDVALPRIVLCLAHNGKVAWDVKWRPSSMSDLECKH 495

Query: 596  RMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQS 775
            RMGYLAVLLGNGSLEVWEVPS  TIK  +SS +K+G DPRF+KL+PVF+CS LK GDRQS
Sbjct: 496  RMGYLAVLLGNGSLEVWEVPSLHTIKVIYSSSKKEGTDPRFIKLKPVFRCSNLKYGDRQS 555

Query: 776  IPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWA 955
            IPLTVEWS   PHDLI+AGCHDGTVALWKF+A+GS +  DTRPLLCFSADTVPIRALAWA
Sbjct: 556  IPLTVEWSAFSPHDLIVAGCHDGTVALWKFSANGSFE--DTRPLLCFSADTVPIRALAWA 613

Query: 956  PDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDD 1135
            P E+DPES N IVTAGH G++FWD+RDP RPLW++N VRR+IY VDWLPDPRC+ILSFDD
Sbjct: 614  PVETDPESANIIVTAGHAGVKFWDIRDPFRPLWEINPVRRVIYSVDWLPDPRCIILSFDD 673

Query: 1136 GTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGS 1315
            GTLRI SL + A DVPVTGKPF GTQQ GL  Y CS FPIWSVQ SR TG+ AYCS+DG+
Sbjct: 674  GTLRIFSLAKIANDVPVTGKPFSGTQQPGLICYSCSPFPIWSVQVSRATGLAAYCSADGT 733

Query: 1316 VLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPR 1495
            V +FQLT KAV+KD SRN+APHFLCGSLTE++  LT+ TP+  IPF +KK+LN+W DTPR
Sbjct: 734  VRQFQLTIKAVEKD-SRNKAPHFLCGSLTEDNSVLTINTPLSTIPFVVKKALNQWGDTPR 792

Query: 1496 SIRGFLSDMNQAKRGSEPMSDNHTMALCYGD---------DPSLHIVSGDTSAPXXXXXX 1648
            SIRG +S+ NQAKR +   S++  + LC  D         D S+ +     +A       
Sbjct: 793  SIRG-ISESNQAKRVNNQKSNDQPLDLCEDDDDDDDDDDNDSSIEVSGSTKAASKRKQKT 851

Query: 1649 XXXXXXXXXPDADVACXXXXXXXXXXXXKKSKA-----TVEVFPSKMVAMHRVRWNMNKG 1813
                     P  D A             K+ +       +EVFPSK+VA+HRVRWNMNKG
Sbjct: 852  KSKSSSKKNPKKDQAALCSYEEAENLENKEDRKEEGGNEIEVFPSKIVALHRVRWNMNKG 911

Query: 1814 SERWLCYGGAAGIVRCQEISASSVADRSLSKK 1909
            SE WLCYGGAAGIVRCQ+I+A  V  + L K+
Sbjct: 912  SEGWLCYGGAAGIVRCQKITA-GVLKKDLVKR 942


>ref|XP_007049744.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
            gi|508702005|gb|EOX93901.1| DNA binding protein, putative
            isoform 1 [Theobroma cacao]
          Length = 868

 Score =  648 bits (1671), Expect = 0.0
 Identities = 338/608 (55%), Positives = 408/608 (67%), Gaps = 31/608 (5%)
 Frame = +2

Query: 176  SVTGADKNTQELAMLKSHDKDIVLETATHEGCIHAVRTCKASLVTPTHKRKMKDKARAKS 355
            ++  A  NTQE A  KSH +    E    EG      T  A+  T    RK+K K +AK+
Sbjct: 269  AIDSALGNTQENAPNKSHHEK---EKGEKEGAF----TSDATPTTSVQSRKLKSKVQAKT 321

Query: 356  YNNSVSPLLLTQNEDRES-------------QDAMT--------STHEVANSTPLENITD 472
              +     LLTQNE+  S             Q+AM         S+    +S P +N ++
Sbjct: 322  NTHGKCLPLLTQNEETRSSSTINKQIHYNSGQEAMVHNNILDSNSSETPGSSIPRDNSSE 381

Query: 473  S--FHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSLEVW 646
            +    +P D+ LPR VLCLAHNGKVAWDVKW+P +  D EC  RMGYLAVLLGNGSLEVW
Sbjct: 382  TPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVW 441

Query: 647  EVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHDLIL 826
            EVP P  I   +SS  K G DPRFVKLEPVFKCSKLKCGD QSIPLTVEWS S PH+ +L
Sbjct: 442  EVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPHNYLL 501

Query: 827  AGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVTAGH 1006
            AGCHDG VALWKF+ASGS   TDTRPLLCFSADTVPIR++AWAP  SD ES N ++TAGH
Sbjct: 502  AGCHDGMVALWKFSASGSP--TDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTAGH 559

Query: 1007 EGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYDVPV 1186
             GL+FWD+RDP  PLWD++   + IY +DWLP+PRCVILSFDDGT+++LSL+++A DVPV
Sbjct: 560  GGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPV 619

Query: 1187 TGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKDASR 1366
            TGKPF GT+QQGL  Y CSSF IW+VQ SRLTGM+AYC +DG+V RFQLTSKAVDKD SR
Sbjct: 620  TGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSR 679

Query: 1367 NRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDMNQAKRGSE 1546
            NRAPHF+CGSLTEE+  + V TP+P+IP  +KK  N++ + PRS+R FL++ NQAK   +
Sbjct: 680  NRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAKNAKD 739

Query: 1547 -----PMSDNHTMALCYGDDPSLHIVSGDT---SAPXXXXXXXXXXXXXXXPDADVACXX 1702
                 P  D  T+ALCYG+DP +   S +T   +A                   D A   
Sbjct: 740  NKAKVPTPDKQTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQALAV 799

Query: 1703 XXXXXXXXXXKKSKATVEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASS 1882
                      +++   +EVFP K+VAMHRVRWNMNKGSERWLCYGGAAGIVRCQEI    
Sbjct: 800  RINEPANTQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPD 859

Query: 1883 VADRSLSK 1906
            VA +S  K
Sbjct: 860  VAKKSARK 867


>emb|CBI24867.3| unnamed protein product [Vitis vinifera]
          Length = 834

 Score =  644 bits (1660), Expect = 0.0
 Identities = 333/558 (59%), Positives = 396/558 (70%), Gaps = 24/558 (4%)
 Frame = +2

Query: 308  TPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPLENITD----- 472
            TPT +R+ K K R  +Y++  S  L TQN+++ES  A   TH  +   P+ +  D     
Sbjct: 296  TPTERRRSKRKTRVVNYSDESSLPLSTQNKNKESSPANFQTHINSEEHPMMSSDDMPQNS 355

Query: 473  SF-------HLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNG 631
            SF        +PNDV+LPR+VLCLAHNGKVAWDVKWRP +  DLECKHRMGYLAVLLGNG
Sbjct: 356  SFGISSANDSIPNDVALPRIVLCLAHNGKVAWDVKWRPSSMSDLECKHRMGYLAVLLGNG 415

Query: 632  SLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFP 811
            SLEVWEVPS  TIK  +SS +K+G DPRF+KL+PVF+CS LK GDRQSIPLTVEWS   P
Sbjct: 416  SLEVWEVPSLHTIKVIYSSSKKEGTDPRFIKLKPVFRCSNLKYGDRQSIPLTVEWSAFSP 475

Query: 812  HDLILAGCHDGTVALWKFAASGSSQ-------CTDTRPLLCFSADTVPIRALAWAPDESD 970
            HDLI+AGCHDGTVALWKF+A+GS +        +DTRPLLCFSADTVPIRALAWAP E+D
Sbjct: 476  HDLIVAGCHDGTVALWKFSANGSFEGSGTMQVTSDTRPLLCFSADTVPIRALAWAPVETD 535

Query: 971  PESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRI 1150
            PES N IVTAGH G++FWD+RDP RPLW++N VRR+IY VDWLPDPRC+ILSFDDGTLRI
Sbjct: 536  PESANIIVTAGHAGVKFWDIRDPFRPLWEINPVRRVIYSVDWLPDPRCIILSFDDGTLRI 595

Query: 1151 LSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQ 1330
             SL + A DVPVTGKPF GTQQ GL  Y CS FPIWSVQ SR TG+ AYCS+DG+V +FQ
Sbjct: 596  FSLAKIANDVPVTGKPFSGTQQPGLICYSCSPFPIWSVQVSRATGLAAYCSADGTVRQFQ 655

Query: 1331 LTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGF 1510
            LT KAV+KD SRN+APHFLCGSLTE++  LT+ TP+  IPF +KK+LN+W DTPRSIRG 
Sbjct: 656  LTIKAVEKD-SRNKAPHFLCGSLTEDNSVLTINTPLSTIPFVVKKALNQWGDTPRSIRG- 713

Query: 1511 LSDMNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADV 1690
            +S+ NQAKR +   S++  + L           S                     P  D 
Sbjct: 714  ISESNQAKRVNNQKSNDQPLDLSSKRKQKTKSKSSSKK----------------NPKKDQ 757

Query: 1691 ACXXXXXXXXXXXXKKSKA-----TVEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIV 1855
            A             K+ +       +EVFPSK+VA+HRVRWNMNKGSE WLCYGGAAGIV
Sbjct: 758  AALCSYEEAENLENKEDRKEEGGNEIEVFPSKIVALHRVRWNMNKGSEGWLCYGGAAGIV 817

Query: 1856 RCQEISASSVADRSLSKK 1909
            RCQ+I+A  V  + L K+
Sbjct: 818  RCQKITA-GVLKKDLVKR 834


>ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609984 isoform X3 [Citrus
            sinensis]
          Length = 801

 Score =  607 bits (1566), Expect = e-171
 Identities = 309/563 (54%), Positives = 385/563 (68%), Gaps = 25/563 (4%)
 Frame = +2

Query: 296  ASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPLEN-ITD 472
            +SL TP   RK+K KAR + +++ +   L   NED E     T+ H++ + +  ++ + D
Sbjct: 244  SSLKTPVRSRKLKSKARVEKHSHDICQPLSNVNEDEEPP---TANHQIYHGSERDSAVCD 300

Query: 473  ------------SFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAV 616
                        S  +P D++LPR+VLCLAHNGKVAWDVKW+P N  D +CK R+GYLAV
Sbjct: 301  VLGDFLSKPSLVSCPIPKDIALPRVVLCLAHNGKVAWDVKWKPYNAVDCKCKQRLGYLAV 360

Query: 617  LLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEW 796
            LLGNGSLEVWEVP  RT+K  + S  K+G DPRFVKLEPVF+CS LKCG  QSIPLT+EW
Sbjct: 361  LLGNGSLEVWEVPLLRTMKAIYLSSMKEGTDPRFVKLEPVFRCSMLKCGGTQSIPLTMEW 420

Query: 797  SPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPE 976
            S S PHD +LAGCHDGTVALWKF AS SS   D+RPLLCFSADT+PIRA++WAP ESD +
Sbjct: 421  STSPPHDYLLAGCHDGTVALWKFVASDSS--IDSRPLLCFSADTLPIRAVSWAPAESDSD 478

Query: 977  SVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILS 1156
            S N I+TAGH GL+FWD+RDP RPLWD++   + IYG+DWLPDP CVILSFDDG +RI+S
Sbjct: 479  SANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILSFDDGAMRIVS 538

Query: 1157 LLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLT 1336
            LL++AYDVP TGKPF GT+QQGL    CSSF IWSVQ SRLTGM+AYCS+DG+V RFQLT
Sbjct: 539  LLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSADGTVHRFQLT 598

Query: 1337 SKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLS 1516
            +KAV+KD SRNR  HFLCGS+TE++  +TV TP+ N P P+KK++++  +  RS+R FL 
Sbjct: 599  AKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE--RSMRSFLI 656

Query: 1517 DMNQAKRGSEP------MSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXP 1678
            + N +K  ++        SDN  +ALCYG++P        T A                 
Sbjct: 657  ESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPKSRSSSKKK 716

Query: 1679 DADVACXXXXXXXXXXXXKKSKAT------VEVFPSKMVAMHRVRWNMNKGSERWLCYGG 1840
            + D                K  A       +EV P K+VAMHRVRWNMNKGSERWLCYGG
Sbjct: 717  EEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKGSERWLCYGG 776

Query: 1841 AAGIVRCQEISASSVADRSLSKK 1909
            A GI+RCQEI    + D+ + KK
Sbjct: 777  AGGIIRCQEIRVPDI-DKKMGKK 798


>ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609984 isoform X1 [Citrus
            sinensis] gi|568856485|ref|XP_006481814.1| PREDICTED:
            uncharacterized protein LOC102609984 isoform X2 [Citrus
            sinensis]
          Length = 911

 Score =  607 bits (1566), Expect = e-171
 Identities = 309/563 (54%), Positives = 385/563 (68%), Gaps = 25/563 (4%)
 Frame = +2

Query: 296  ASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPLEN-ITD 472
            +SL TP   RK+K KAR + +++ +   L   NED E     T+ H++ + +  ++ + D
Sbjct: 354  SSLKTPVRSRKLKSKARVEKHSHDICQPLSNVNEDEEPP---TANHQIYHGSERDSAVCD 410

Query: 473  ------------SFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAV 616
                        S  +P D++LPR+VLCLAHNGKVAWDVKW+P N  D +CK R+GYLAV
Sbjct: 411  VLGDFLSKPSLVSCPIPKDIALPRVVLCLAHNGKVAWDVKWKPYNAVDCKCKQRLGYLAV 470

Query: 617  LLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEW 796
            LLGNGSLEVWEVP  RT+K  + S  K+G DPRFVKLEPVF+CS LKCG  QSIPLT+EW
Sbjct: 471  LLGNGSLEVWEVPLLRTMKAIYLSSMKEGTDPRFVKLEPVFRCSMLKCGGTQSIPLTMEW 530

Query: 797  SPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPE 976
            S S PHD +LAGCHDGTVALWKF AS SS   D+RPLLCFSADT+PIRA++WAP ESD +
Sbjct: 531  STSPPHDYLLAGCHDGTVALWKFVASDSS--IDSRPLLCFSADTLPIRAVSWAPAESDSD 588

Query: 977  SVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILS 1156
            S N I+TAGH GL+FWD+RDP RPLWD++   + IYG+DWLPDP CVILSFDDG +RI+S
Sbjct: 589  SANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILSFDDGAMRIVS 648

Query: 1157 LLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLT 1336
            LL++AYDVP TGKPF GT+QQGL    CSSF IWSVQ SRLTGM+AYCS+DG+V RFQLT
Sbjct: 649  LLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSADGTVHRFQLT 708

Query: 1337 SKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLS 1516
            +KAV+KD SRNR  HFLCGS+TE++  +TV TP+ N P P+KK++++  +  RS+R FL 
Sbjct: 709  AKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE--RSMRSFLI 766

Query: 1517 DMNQAKRGSEP------MSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXP 1678
            + N +K  ++        SDN  +ALCYG++P        T A                 
Sbjct: 767  ESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPKSRSSSKKK 826

Query: 1679 DADVACXXXXXXXXXXXXKKSKAT------VEVFPSKMVAMHRVRWNMNKGSERWLCYGG 1840
            + D                K  A       +EV P K+VAMHRVRWNMNKGSERWLCYGG
Sbjct: 827  EEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKGSERWLCYGG 886

Query: 1841 AAGIVRCQEISASSVADRSLSKK 1909
            A GI+RCQEI    + D+ + KK
Sbjct: 887  AGGIIRCQEIRVPDI-DKKMGKK 908


>ref|XP_007049745.1| DNA binding protein, putative isoform 2 [Theobroma cacao]
            gi|508702006|gb|EOX93902.1| DNA binding protein, putative
            isoform 2 [Theobroma cacao]
          Length = 846

 Score =  602 bits (1552), Expect = e-169
 Identities = 324/608 (53%), Positives = 390/608 (64%), Gaps = 31/608 (5%)
 Frame = +2

Query: 176  SVTGADKNTQELAMLKSHDKDIVLETATHEGCIHAVRTCKASLVTPTHKRKMKDKARAKS 355
            ++  A  NTQE A  KSH +    E    EG      T  A+  T    RK+K K +AK+
Sbjct: 269  AIDSALGNTQENAPNKSHHEK---EKGEKEGAF----TSDATPTTSVQSRKLKSKVQAKT 321

Query: 356  YNNSVSPLLLTQNEDRES-------------QDAMT--------STHEVANSTPLENITD 472
              +     LLTQNE+  S             Q+AM         S+    +S P +N ++
Sbjct: 322  NTHGKCLPLLTQNEETRSSSTINKQIHYNSGQEAMVHNNILDSNSSETPGSSIPRDNSSE 381

Query: 473  S--FHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSLEVW 646
            +    +P D+ LPR VLCLAHNGKVAWDVKW+P +  D EC  RMGYLAVLLGNGSLEVW
Sbjct: 382  TPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVW 441

Query: 647  EVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHDLIL 826
            EVP P  I   +SS  K G DPRFVKLEPVFKCSKLKCGD QSIPLTVEWS S PH+ +L
Sbjct: 442  EVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPHNYLL 501

Query: 827  AGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVTAGH 1006
            AGCHDG VALWKF+ASGS   TDTRPLLCFSADTVPIR++AWAP  SDP           
Sbjct: 502  AGCHDGMVALWKFSASGSP--TDTRPLLCFSADTVPIRSVAWAPSGSDP----------- 548

Query: 1007 EGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYDVPV 1186
                         PLWD++   + IY +DWLP+PRCVILSFDDGT+++LSL+++A DVPV
Sbjct: 549  -----------FLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPV 597

Query: 1187 TGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKDASR 1366
            TGKPF GT+QQGL  Y CSSF IW+VQ SRLTGM+AYC +DG+V RFQLTSKAVDKD SR
Sbjct: 598  TGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSR 657

Query: 1367 NRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDMNQAKRGSE 1546
            NRAPHF+CGSLTEE+  + V TP+P+IP  +KK  N++ + PRS+R FL++ NQAK   +
Sbjct: 658  NRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAKNAKD 717

Query: 1547 -----PMSDNHTMALCYGDDPSLHIVSGDT---SAPXXXXXXXXXXXXXXXPDADVACXX 1702
                 P  D  T+ALCYG+DP +   S +T   +A                   D A   
Sbjct: 718  NKAKVPTPDKQTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQALAV 777

Query: 1703 XXXXXXXXXXKKSKATVEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASS 1882
                      +++   +EVFP K+VAMHRVRWNMNKGSERWLCYGGAAGIVRCQEI    
Sbjct: 778  RINEPANTQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPD 837

Query: 1883 VADRSLSK 1906
            VA +S  K
Sbjct: 838  VAKKSARK 845


>ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Populus trichocarpa]
            gi|550333546|gb|EEE89192.2| hypothetical protein
            POPTR_0008s20540g [Populus trichocarpa]
          Length = 813

 Score =  590 bits (1520), Expect = e-165
 Identities = 321/629 (51%), Positives = 389/629 (61%), Gaps = 27/629 (4%)
 Frame = +2

Query: 104  EPLDDFHGGSQCVVALSAELLEDS----SVTGADKNTQELAMLKSHDKDIVLETATHEGC 271
            E LD     +Q V ALS E  +DS    S+ G  +N+Q+ A                   
Sbjct: 229  ESLDSLDSSNQYVQALSVEYPQDSPGLLSIEGISQNSQDEA------------------- 269

Query: 272  IHAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANST 451
                                K K +     +   PLLL  NED      + ST    N  
Sbjct: 270  --------------------KQKHKGSDSGDVACPLLLIHNEDDNVSLDINSTSSTVNYQ 309

Query: 452  PLEN------------------ITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFG 577
              EN                  I  +  +P D  LPR+VLCLAHNGKVAWDVKW+PCN  
Sbjct: 310  THENSGLNTAMPAYGSDNVSLDINPTSSIPKDADLPRVVLCLAHNGKVAWDVKWQPCNAP 369

Query: 578  DLECKHRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLK 757
              + +HRMGYLAVLLGNGSLEVW+VP P  +K  +SS   +G DPRFVK++PVF+CS LK
Sbjct: 370  PSKFQHRMGYLAVLLGNGSLEVWDVPLPHAMKSVYSSSNLEGTDPRFVKIKPVFRCSTLK 429

Query: 758  CGDRQSIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPI 937
            CG  QSIPL VEWS S+PHD +LAGCHDGTVALWKF+ASG+S   DTRPLLCFSADTVPI
Sbjct: 430  CGGIQSIPLAVEWSTSYPHDYLLAGCHDGTVALWKFSASGAS--GDTRPLLCFSADTVPI 487

Query: 938  RALAWAPDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCV 1117
            RA+AW P ESD ES N I+TAGH GL+FWD+RDP RPLWDL+   ++IY +DWLPDPRC+
Sbjct: 488  RAIAWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCI 547

Query: 1118 ILSFDDGTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAY 1297
            ILSFDDGT+R+LSL R+AYD  V GKP VG +Q G+    CSSF IWSVQ SRLTGM+AY
Sbjct: 548  ILSFDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAY 607

Query: 1298 CSSDGSVLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNE 1477
            CS+DG+V RFQLT+KAV+KD SR+RAPHF CGSL+E++  + V TP+P+ P P+KK +N+
Sbjct: 608  CSADGTVCRFQLTTKAVEKDPSRHRAPHFGCGSLSEDESAIIVGTPLPDTPLPLKKPVND 667

Query: 1478 WSDTPRSIRGFLSDMNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDT---SAPXXXXXX 1648
              + P+S +  LS  N+A +   P SD+  +ALCYGDDP +   S +T   +        
Sbjct: 668  VGNNPKS-KQRLSVSNKAAK--IPTSDDPPLALCYGDDPGMDHGSDETLTATKSKRKPKS 724

Query: 1649 XXXXXXXXXPDADVACXXXXXXXXXXXXKKSKA--TVEVFPSKMVAMHRVRWNMNKGSER 1822
                      D  + C             K  A   VE  P KMVAMHRVRWNMNKGSER
Sbjct: 725  KSGSKQMEGEDQALVCIDDEQDVKQKGGGKEGAGNVVESIPPKMVAMHRVRWNMNKGSER 784

Query: 1823 WLCYGGAAGIVRCQEISASSVADRSLSKK 1909
            WLC GGAAGIVRCQEI     AD  L++K
Sbjct: 785  WLCSGGAAGIVRCQEIKMFD-ADICLARK 812


>ref|XP_004303006.1| PREDICTED: uncharacterized protein LOC101299208 [Fragaria vesca
            subsp. vesca]
          Length = 1076

 Score =  576 bits (1485), Expect = e-161
 Identities = 296/543 (54%), Positives = 359/543 (66%), Gaps = 37/543 (6%)
 Frame = +2

Query: 392  NEDRESQDAMTSTHEVANSTPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCN 571
            ++ ++      +   V+N+   EN + S+ +P DV+LPR++ CLAH+GKVAWDVKWRP N
Sbjct: 549  HDGKQESSGWKTNDIVSNNDYAENGSTSYSVPKDVALPRIIFCLAHHGKVAWDVKWRPLN 608

Query: 572  FGDLECKHRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSK 751
              D  CKHRMGYLAVLLGNGSLEVWEVP PR I+  +SS   +G DPRFVKL PVF+CS 
Sbjct: 609  EYDSRCKHRMGYLAVLLGNGSLEVWEVPVPRAIEVIYSSSSGEGTDPRFVKLAPVFRCSM 668

Query: 752  LKCGDRQSIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTV 931
            LK GD++SIPLTVEWS S PHD ++AGCHDGTVA+WKF+AS +SQ  DTRPLLCFSADT 
Sbjct: 669  LKSGDKKSIPLTVEWSASPPHDYLIAGCHDGTVAMWKFSASNASQ--DTRPLLCFSADTN 726

Query: 932  PIRALAWAPDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPR 1111
            PIRAL+WAP ES+ +  N I TAGH GL+FWDLRDP RPLWD++ + R IY +DWLPDPR
Sbjct: 727  PIRALSWAPVESNSDGANVIATAGHGGLKFWDLRDPFRPLWDIDHIPRFIYSLDWLPDPR 786

Query: 1112 CVILSFDDGTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMI 1291
            C++LSFDDGT+R+LSL + A D P TGKPF GT+QQGL +  C  F IWSVQ SRLTGM+
Sbjct: 787  CLLLSFDDGTMRLLSLTKVASDAPTTGKPFTGTKQQGLHNLGCLPFAIWSVQVSRLTGMV 846

Query: 1292 AYCSSDGSVLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSL 1471
            AYC +DG+VLRFQLTSKAV+KDA RNRAPHFLC SLTEED  +T+ TP+ N PFP+K S 
Sbjct: 847  AYCGADGTVLRFQLTSKAVEKDAIRNRAPHFLCVSLTEEDSVVTINTPVLNNPFPLKTSR 906

Query: 1472 NEWSDTPRSIRGFLSDMNQAKRGSEPM---SDNHTMALCYGDD-----------PSLHI- 1606
                          ++ N+ KR  + M   S++  +ALCYGDD           PSL   
Sbjct: 907  K-------------AEPNKVKREHDKMATASEDKVLALCYGDDPVVELESGKEAPSLRSK 953

Query: 1607 --VSGDTSAPXXXXXXXXXXXXXXXPDADVACXXXXXXXXXXXXK--------------- 1735
               SGD  A                 +   +             K               
Sbjct: 954  PRTSGDDQALACMDHEPFNTLEEEIGEKGASLKSIVKQKSKSSKKTEDEQELVCRDEELN 1013

Query: 1736 -----KSKATVEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASSVADRSL 1900
                 K     EVFPSK++AMHRVRWNMNKGSERWLCYGGAAG+VRCQEI+ S +  +  
Sbjct: 1014 NMQREKIGTEYEVFPSKLIAMHRVRWNMNKGSERWLCYGGAAGLVRCQEIALSEIDTKWA 1073

Query: 1901 SKK 1909
             KK
Sbjct: 1074 RKK 1076


>ref|XP_006416501.1| hypothetical protein EUTSA_v10006802mg [Eutrema salsugineum]
            gi|557094272|gb|ESQ34854.1| hypothetical protein
            EUTSA_v10006802mg [Eutrema salsugineum]
          Length = 828

 Score =  557 bits (1436), Expect = e-156
 Identities = 298/606 (49%), Positives = 382/606 (63%), Gaps = 13/606 (2%)
 Frame = +2

Query: 98   AEEPLDDFHGGSQCVVALSAELLEDSSVTGADKNTQELAMLKSHDKDIVLET-ATHEGCI 274
            +E P++   G    V ALS    E+S              L+S  +  V ET + +E   
Sbjct: 243  SELPIEQLDGDVLYVEALSVRYPEESVAPETP--------LRSLRETSVTETKSNNESSE 294

Query: 275  HAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTP 454
              + +  A++  P  +++ K                 TQ+ +   +  ++   E   + P
Sbjct: 295  QVLSSENANIKLPVRRKRQK-----------------TQHTEETCKPVLSEGSEALGNVP 337

Query: 455  LENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGS 634
             E  +D   +  D+SLPR+VLCLAHNGKVAWD+KWRP +  D   KHRMGYLAVLLGNGS
Sbjct: 338  GELSSD---VSEDISLPRVVLCLAHNGKVAWDMKWRPSSADDSLNKHRMGYLAVLLGNGS 394

Query: 635  LEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPH 814
            LEVW+VP P+ I   + S +KD  DPRFVKL P+FKCS LKCGD QSIPLTVEWS     
Sbjct: 395  LEVWDVPMPQAISAVYLSSKKDATDPRFVKLAPIFKCSNLKCGDTQSIPLTVEWSAFGNP 454

Query: 815  DLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIV 994
            D +LAGCHDGTVALWKF+ + SS+  DTRPLL FSADT PIRA+AWAP +SDPES N + 
Sbjct: 455  DFLLAGCHDGTVALWKFSTTKSSE--DTRPLLVFSADTAPIRAVAWAPVDSDPESANVVA 512

Query: 995  TAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAY 1174
            TAGH GL+FWDLRDP RPLW+L+ V R IY +DWL DP+CV+LSF+DGT+RILSL++ AY
Sbjct: 513  TAGHAGLKFWDLRDPFRPLWELHPVPRFIYSIDWLQDPKCVLLSFEDGTIRILSLVKVAY 572

Query: 1175 DVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDK 1354
            DVP TGKP+  ++QQG   Y CSSFPIWS++ SRLTGM AYC++DGSV  F+LT+KAV+K
Sbjct: 573  DVPATGKPYRNSKQQGFSVYNCSSFPIWSIRVSRLTGMAAYCTADGSVFHFELTTKAVEK 632

Query: 1355 DASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNE-WSDTPRSIRGFLSDMNQA 1531
            D SRNR PHFLCG  T  D   TV++P+PNIP  +KK ++E   +  R +R  +++    
Sbjct: 633  D-SRNRTPHFLCGRFTMNDSTFTVHSPLPNIPIFLKKPVSETGGEKQRCLRSLVNE--TP 689

Query: 1532 KRGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDAD---VACXX 1702
            KR + P+SD   +A  + +DP L   +  T                   D +   + C  
Sbjct: 690  KRYASPVSDVQPLAFAHDEDPGLECETEGTKNKKRSKIKAKIGENTIEEDENRGALVCVQ 749

Query: 1703 XXXXXXXXXXKKSKAT--------VEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVR 1858
                      ++ +A+         EVFP KMVAMHRVRWNMNKGSER LCYGGAAGIVR
Sbjct: 750  EDGDEEEEGRRRKEASNSSSSGVKAEVFPPKMVAMHRVRWNMNKGSERLLCYGGAAGIVR 809

Query: 1859 CQEISA 1876
            CQEI++
Sbjct: 810  CQEIAS 815


>ref|XP_004149225.1| PREDICTED: uncharacterized protein LOC101210135 [Cucumis sativus]
          Length = 952

 Score =  557 bits (1435), Expect = e-156
 Identities = 295/624 (47%), Positives = 392/624 (62%), Gaps = 38/624 (6%)
 Frame = +2

Query: 152  SAELLEDSSVTGADKNTQELAMLKSHDKDIVLETATHEGCIHAVRTCKASLVTPTHKRKM 331
            S+ LLE   + G  KNT+   +L+++   +  E++T    +  V TC +    P  KR++
Sbjct: 343  SSNLLE---IDGVPKNTENFVLLENN---VERESST----LQEVSTCHSEDEVPAKKRRV 392

Query: 332  KDKARAKSYNNSVSPLLLTQNED-------RESQDAMTSTH---------EVANSTPLEN 463
            + K + ++  + V  L L + ++        E+ + + S +         +++ +  L+ 
Sbjct: 393  RRKVKPRNLVDDVGVLSLAEYQEDGSIANNHEANENVKSEYSGEDNLLCKDISENVVLDA 452

Query: 464  ITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSLEV 643
             +  F +P  V+LPR+VLCLAHNGKVAWD+KW+P N     CKHRMGYLAVLLGNGSLEV
Sbjct: 453  SSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPMNACTDNCKHRMGYLAVLLGNGSLEV 512

Query: 644  WEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHDLI 823
            WEVP P  +K  +S    +G DPRF+KL+P+F+CS+L+  + QSIPLTVEWS + P+D +
Sbjct: 513  WEVPFPHAVKAIYSKFNGEGTDPRFMKLKPIFRCSRLRTTNTQSIPLTVEWSRTPPYDYL 572

Query: 824  LAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVTAG 1003
            LAGCHDGTVALWKF+A+ S  C DTRPLL FSADTVPIRA+AWAP ESD ES N I+TAG
Sbjct: 573  LAGCHDGTVALWKFSANSS--CEDTRPLLRFSADTVPIRAVAWAPSESDLESANVILTAG 630

Query: 1004 HEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYDVP 1183
            H GL+FWDLRDP RPLWDL+   RIIY +DWLP+PRCV LSFDDGTLR+LSLL++A DVP
Sbjct: 631  HGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAANDVP 690

Query: 1184 VTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKDAS 1363
             TG+PF   +Q+GL +Y CSS+ IWS+Q SR TGM+AYC +DG+V+RFQLT+KA DK+ S
Sbjct: 691  ATGRPFTAIKQKGLHTYICSSYAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAADKENS 750

Query: 1364 RNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSD---MNQAK 1534
            R+R PH++C  LTEE+  +T  +P PN+P P+KK  N+ S+ P S+R  LSD    N+ K
Sbjct: 751  RHRTPHYVCEYLTEEESIITFRSPPPNVPIPLKKLSNK-SEHPLSMRAILSDSVQSNEDK 809

Query: 1535 RGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADVACXXXXXX 1714
              +    +N    +C   D  +   S DT  P                  ++ C      
Sbjct: 810  PATASTLENEA-TICSDVDVRVESGSEDTLTPTKKKNRTQPKCKEGVEKLELECSDEPKD 868

Query: 1715 XXXXXXKKSKAT-------------------VEVFPSKMVAMHRVRWNMNKGSERWLCYG 1837
                       T                    E  P K VAMHRVRWNMN GSE WLCYG
Sbjct: 869  DAHMDADVDAQTDAVLEAQMDADALPTSGDHFENLPPKSVAMHRVRWNMNIGSEEWLCYG 928

Query: 1838 GAAGIVRCQEISASSVADRSLSKK 1909
            GAAGI+RC+EI  S++  + + KK
Sbjct: 929  GAAGILRCREIVLSALDMKLMKKK 952


>ref|XP_006304420.1| hypothetical protein CARUB_v10010997mg [Capsella rubella]
            gi|482573131|gb|EOA37318.1| hypothetical protein
            CARUB_v10010997mg [Capsella rubella]
          Length = 822

 Score =  552 bits (1422), Expect = e-154
 Identities = 302/599 (50%), Positives = 377/599 (62%), Gaps = 10/599 (1%)
 Frame = +2

Query: 116  DFHGGSQCVVALSAELLEDSSVTGADKNTQELAMLKSHDKDIVLETATHEGCIHAVRTCK 295
            +  GG   V ALS    EDS V+        L +L+  +  +      +EG    + +  
Sbjct: 252  ELDGGVLYVEALSVRYPEDSVVSATP-----LRILQ--ETPVTEPKVNNEGSEQILSSEN 304

Query: 296  ASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPLENITD- 472
            A++  P  +++ K K   +S      P+LL                   NS  + NI   
Sbjct: 305  ANIKLPVRRKRQKTKGTEES----CKPMLLE------------------NSEVVGNILGE 342

Query: 473  -SFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSLEVWE 649
             S  +  D++LPR+VLCLAHNGKVAWD+KWRP +  D   KHRMGYLAVLLGNGSLEVW+
Sbjct: 343  PSSGISEDIALPRVVLCLAHNGKVAWDMKWRPSHGDDSLNKHRMGYLAVLLGNGSLEVWD 402

Query: 650  VPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHDLILA 829
            VP PRT    + S +K   DPRFVKL PVFKCS LKCGD QSIPLT+EWS S   D +LA
Sbjct: 403  VPLPRTTSAVYLSSKKAATDPRFVKLAPVFKCSNLKCGDMQSIPLTLEWSTSKNPDFLLA 462

Query: 830  GCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVTAGHE 1009
            GCHDGTVALWKF+ + SS+  DTRPLL FSADT PIRA+AWAP ESD ESVN + TAGH 
Sbjct: 463  GCHDGTVALWKFSTTTSSE--DTRPLLFFSADTAPIRAVAWAPGESDQESVNIVATAGHG 520

Query: 1010 GLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYDVPVT 1189
            GL+FWDLRDP RPLWDL+ V R IY +DW+ DPRCV+L FDDGTLRILSL++ AYDVP T
Sbjct: 521  GLKFWDLRDPFRPLWDLHPVPRFIYSLDWVQDPRCVLLPFDDGTLRILSLVKVAYDVPAT 580

Query: 1190 GKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKDASRN 1369
            G P+  T+QQGL  Y  S+FPIWS+Q SRLTG+ AYC++DGS+  FQLT+KAV+KD +RN
Sbjct: 581  GNPYPNTKQQGLSVYNLSTFPIWSIQVSRLTGIAAYCTADGSIFHFQLTTKAVEKD-TRN 639

Query: 1370 RAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDMNQAKRGSEP 1549
            R+PHFLCG LT +D    V++P+P+IP  +KK + E  +  R +R  L++     R +  
Sbjct: 640  RSPHFLCGKLTMKDSTFIVHSPVPDIPIVLKKPVGETGEKQRCLRSLLNE--SPNRYASN 697

Query: 1550 MSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADV-ACXXXXXXXXXX 1726
            +SD   +A  + +D  L    G T                   D +  A           
Sbjct: 698  VSDVRPLAFAHEEDQDLEPEFGGTDNKGPKFKAKKGKNNIGEVDENSRALVCVSEDGDEG 757

Query: 1727 XXKKSKAT-------VEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASS 1882
              +++KA+        E FP KMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEI+ +S
Sbjct: 758  EERRNKASNGSIGMKTEGFPPKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIAPTS 816


>ref|XP_004168803.1| PREDICTED: uncharacterized LOC101210135 [Cucumis sativus]
          Length = 983

 Score =  545 bits (1404), Expect = e-152
 Identities = 281/571 (49%), Positives = 366/571 (64%), Gaps = 38/571 (6%)
 Frame = +2

Query: 311  PTHKRKMKDKARAKSYNNSVSPLLLTQNED-------RESQDAMTSTH---------EVA 442
            P  KR+++ K + ++  + V  L L + ++        E+ + + S +         +++
Sbjct: 417  PAKKRRVRRKVKPRNLVDDVGVLSLAEYQEDGSIANNHEANENVKSEYSGEDNLLCKDIS 476

Query: 443  NSTPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLL 622
             +  L+  +  F +P  V+LPR+VLCLAHNGKVAWD+KW+P N     CKHRMGYLAVLL
Sbjct: 477  ENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPMNACTDNCKHRMGYLAVLL 536

Query: 623  GNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSP 802
            GNGSLEVWEVP P  +K  +S    +G DPRF+KL+P+F+CS+L+  + QSIPLTVEWS 
Sbjct: 537  GNGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFMKLKPIFRCSRLRTTNTQSIPLTVEWSR 596

Query: 803  SFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESV 982
            + P+D +LAGCHDGTVALWKF+A+ S  C DTRPLL FSADTVPIRA+AWAP ESD ES 
Sbjct: 597  TPPYDYLLAGCHDGTVALWKFSANSS--CEDTRPLLRFSADTVPIRAVAWAPSESDLESA 654

Query: 983  NAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLL 1162
            N I+TAGH GL+FWDLRDP RPLWDL+   RIIY +DWLP+PRCV LSFDDGTLR+LSLL
Sbjct: 655  NVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLL 714

Query: 1163 RSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSK 1342
            ++A DVP TG+PF   +Q+GL +Y CSS+ IWS+Q SR TGM+AYC +DG+V+RFQLT+K
Sbjct: 715  KAANDVPATGRPFTAIKQKGLHTYICSSYAIWSIQVSRQTGMVAYCGADGAVVRFQLTTK 774

Query: 1343 AVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSD- 1519
            A DK+ SR+R PH++C  LTEE+  +T  +P PN+P P+KK  N+ S+ P S+R  LSD 
Sbjct: 775  AADKENSRHRTPHYVCEYLTEEESIITFRSPPPNVPIPLKKLSNK-SEHPLSMRAILSDS 833

Query: 1520 --MNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADVA 1693
               N+ K  +    +N    +C   D  +   S DT  P                  ++ 
Sbjct: 834  VQSNEDKTATASTLENEA-TICSDVDVRVESGSEDTLTPTKKKNRTQPKCKEGVEKLELE 892

Query: 1694 CXXXXXXXXXXXXKKSKAT-------------------VEVFPSKMVAMHRVRWNMNKGS 1816
            C                 T                    E  P K VAMHRVRWNMN GS
Sbjct: 893  CSDEPKDDAHMDADVDAQTDAVLEAQMDADALPTSGDHFENLPPKSVAMHRVRWNMNIGS 952

Query: 1817 ERWLCYGGAAGIVRCQEISASSVADRSLSKK 1909
            E WLCYGGAAGI+RC+EI  S++  + + KK
Sbjct: 953  EEWLCYGGAAGILRCREIVLSALDMKLMKKK 983


>ref|NP_564086.1| transducin/WD-40 repeat-containing protein [Arabidopsis thaliana]
            gi|334182693|ref|NP_001185037.1| transducin/WD-40
            repeat-containing protein [Arabidopsis thaliana]
            gi|332191737|gb|AEE29858.1| transducin/WD-40
            repeat-containing protein [Arabidopsis thaliana]
            gi|332191738|gb|AEE29859.1| transducin/WD-40
            repeat-containing protein [Arabidopsis thaliana]
          Length = 815

 Score =  543 bits (1399), Expect = e-151
 Identities = 288/559 (51%), Positives = 359/559 (64%), Gaps = 12/559 (2%)
 Frame = +2

Query: 242  VLET-ATHEGCIHAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDA 418
            V ET   +EG    + +  A++  P  +++ K     KS   S +P++L  +E       
Sbjct: 277  VTETKVNNEGSGQVLSSDNANIKLPVRRKRQK----TKSTEESCTPMILEYSE------- 325

Query: 419  MTSTHEVAN--STPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECK 592
                  V N  S P   I++       V+LPR+VLCLAHNGKV WD+KWRP   GD   K
Sbjct: 326  -----AVGNVPSKPSSGISEDI-----VALPRVVLCLAHNGKVVWDMKWRPSYAGDSLNK 375

Query: 593  HRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQ 772
            H MGYLAVLLGNGSLEVW+VP P+     + S +K   DPRFVKL PVFKCS LKCGD +
Sbjct: 376  HSMGYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKCGDTK 435

Query: 773  SIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAW 952
            SIPLTVEWS     D +LAGCHDGTVALWKF+ + SS+  DTRPLL FSADT PIRA+AW
Sbjct: 436  SIPLTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSE--DTRPLLFFSADTAPIRAVAW 493

Query: 953  APDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFD 1132
            AP ESD ES N + TAGH GL+FWDLRDP RPLWDL+ V R IY +DWL DP CV+LSFD
Sbjct: 494  APGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLSFD 553

Query: 1133 DGTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDG 1312
            DGTLRILSL++ AYDVP TG+P+  T+QQGL  Y CS+FPIWS+Q SRLTG+ AYC++DG
Sbjct: 554  DGTLRILSLVKVAYDVPATGRPYPNTKQQGLSVYNCSTFPIWSIQVSRLTGIAAYCTADG 613

Query: 1313 SVLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTP 1492
            S+  F+LT+KAV+KD +RNR PH+LCG LT +D    V++P+P+IP  +KK + E  +  
Sbjct: 614  SIFHFELTTKAVEKD-TRNRTPHYLCGQLTMKDSTFIVHSPVPDIPIVLKKPVGETGEKQ 672

Query: 1493 RSIRGFLSDMNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXX 1672
            R +R  L++     R +  +SD   +A  + +DP L   S  T+                
Sbjct: 673  RCLRSLLNE--SPSRYASNVSDVQPLAFAHVEDPGLESESEGTNNKAAKSKAKKGKNNAR 730

Query: 1673 XPDAD----VACXXXXXXXXXXXXKKSK-----ATVEVFPSKMVAMHRVRWNMNKGSERW 1825
              + +    + C            K +         E FP KMVAMHRVRWNMNKGSERW
Sbjct: 731  AEEDENSRALVCVKEDGGEEEGRRKAASNNSNGMKAEGFPPKMVAMHRVRWNMNKGSERW 790

Query: 1826 LCYGGAAGIVRCQEISASS 1882
            LCYGGAAGIVRCQEI+ +S
Sbjct: 791  LCYGGAAGIVRCQEIAPTS 809


>ref|XP_002533545.1| DNA binding protein, putative [Ricinus communis]
            gi|223526581|gb|EEF28835.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 847

 Score =  535 bits (1377), Expect = e-149
 Identities = 300/620 (48%), Positives = 379/620 (61%), Gaps = 16/620 (2%)
 Frame = +2

Query: 98   AEEPLDDFHGGSQCVVALSAELLEDSS----VTGADKNTQELAMLKSHDKDIVLETATHE 265
            A + LD+    +Q V AL+ E  EDSS    + G  +NTQ   + K+  K          
Sbjct: 280  ANDDLDNIFCNNQYVQALAVEYPEDSSQVLAIEGISENTQRQIIGKNKGK---------- 329

Query: 266  GCIHAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVAN 445
                              KRK   +A  +       P +L    D  S +  T    +  
Sbjct: 330  ------------------KRKSCTEAFVQD------PAVLNCGLDNVSGEINTGFCSI-- 363

Query: 446  STPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLG 625
                         P DV+LPR+VLC+AH+ KV WDVKW+PC   D +C+HRMGYLAVLLG
Sbjct: 364  -------------PKDVALPRVVLCIAHDAKVVWDVKWQPCYGSDSKCQHRMGYLAVLLG 410

Query: 626  NGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPS 805
            NG LEVW+VP P   K  +SS  ++G DPR+VKL+PVF+ S  K G+ QSIPLTVEWS S
Sbjct: 411  NGFLEVWDVPLPHVTKVIYSSSNREGTDPRYVKLKPVFRGSIAKRGEIQSIPLTVEWSTS 470

Query: 806  FPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVN 985
            +PHD +LAGCHDGTVALWKF+ASG S   DTRPLLCFSADTV IRA+AWAP  SD ES N
Sbjct: 471  YPHDYLLAGCHDGTVALWKFSASGLS--GDTRPLLCFSADTVAIRAVAWAPAGSDQESDN 528

Query: 986  AIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLR 1165
             IVT GH GL+FWD+RDP RPLWDL+   + IY +DWLPDPRC+ILSFDDGTLR+LSL++
Sbjct: 529  VIVTGGHGGLKFWDIRDPFRPLWDLHPAPKFIYSLDWLPDPRCIILSFDDGTLRLLSLVK 588

Query: 1166 SAYDVPVTGKPFVGTQQQGLDSYF-CSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSK 1342
            +AYD  V G+P VG +QQG+ + F  SSF IWSVQ SR TG+ AY S+DG+V RFQLT+K
Sbjct: 589  AAYDAHVNGQPSVGPKQQGIQNIFNFSSFAIWSVQVSRKTGLAAYSSADGTVCRFQLTTK 648

Query: 1343 AVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDM 1522
            AV+K  SR+R PHF+ GSL++++  +TV  P+P+ P  +KK +N   D PRS+R  L + 
Sbjct: 649  AVEKSPSRHRTPHFMVGSLSKDEAAITVNIPLPDTPLTLKKPVNTVGDNPRSMRSLL-ES 707

Query: 1523 NQAKR-----GSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDAD 1687
            NQ KR      +   +DN  +ALC  +DP +   S ++ A                   +
Sbjct: 708  NQTKRANINKANALAADNQLLALCDVNDPGVQSESDESLAAFRSRTKSKSKSISKKMTGE 767

Query: 1688 VACXXXXXXXXXXXXKKS--KATV----EVFPSKMVAMHRVRWNMNKGSERWLCYGGAAG 1849
                           +K   KA V    EV P K++AMHRVRWN+NKGSERWLC GGAAG
Sbjct: 768  DLALVCIDEGQNNRRQKEIVKAEVANEIEVIPPKIIAMHRVRWNINKGSERWLCSGGAAG 827

Query: 1850 IVRCQEISASSVADRSLSKK 1909
            IVRCQEI  S   D+ L++K
Sbjct: 828  IVRCQEIILSD-TDKLLARK 846


>ref|XP_002890343.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336185|gb|EFH66602.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1262

 Score =  531 bits (1369), Expect = e-148
 Identities = 288/587 (49%), Positives = 358/587 (60%), Gaps = 52/587 (8%)
 Frame = +2

Query: 278  AVRTCKASLVTPTHKRKMKDK--ARAKSYNNSVSPLLLTQNED-----RESQDAMTSTHE 436
            +VR  + S+V  T  R +++      K  N     +L ++N +     R  +     T E
Sbjct: 675  SVRYPENSVVPATPLRILRETPVTETKVNNEGSGQVLSSENANIKLPVRRKRQKTKGTEE 734

Query: 437  VANSTPLENITDSFHLPND--------VSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECK 592
                  LEN     ++P +        ++LPR+VLCLAHNGKVAWD+KWRP    D   K
Sbjct: 735  SCKPMLLENSEAVGNVPGEPSPGISQGIALPRVVLCLAHNGKVAWDMKWRPLYANDSLKK 794

Query: 593  HRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQ 772
            HRMGYLAVLLGNGSLEVW+VP P+     + S +K   DPRFVKL PVFKCS LKCGD +
Sbjct: 795  HRMGYLAVLLGNGSLEVWDVPMPQATSTLYLSSKKAATDPRFVKLAPVFKCSNLKCGDTK 854

Query: 773  SIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAW 952
            SIPLTVEWS S   D +LAGCHDGTVALWKF+ + SS+  DTRPLL FSADT PIRA+AW
Sbjct: 855  SIPLTVEWSTSGNPDFLLAGCHDGTVALWKFSTTKSSE--DTRPLLFFSADTAPIRAVAW 912

Query: 953  APDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDP-------- 1108
            AP ESD ES N + TAGH GL+FWDLRDP RPLWDL+ V R IY +DWL DP        
Sbjct: 913  APGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPKYQSLLYP 972

Query: 1109 -----------------------RCVILSFDDGTLRILSLLRSAYDVPVTGKPFVGTQQQ 1219
                                   RCV+LSFDDGTLRILSL++ AYDVP TG+P+  T+QQ
Sbjct: 973  QLIIQSLDQWFEVLIKYGVLNICRCVLLSFDDGTLRILSLVKVAYDVPATGRPYPNTKQQ 1032

Query: 1220 GLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKDASRNRAPHFLCGSL 1399
            GL  Y CS+FPIWS+Q SRLTG+ AYC+ DGS+  F+LT+KAV+KD +RNR PHFLCG L
Sbjct: 1033 GLSVYNCSTFPIWSIQVSRLTGIAAYCTGDGSIFHFELTTKAVEKD-TRNRTPHFLCGQL 1091

Query: 1400 TEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDMNQAKRGSEPMSDNHTMALC 1579
            T +D    V++P+P+IP  +KK + E  +  R +R  L++     R +  +SD   +A  
Sbjct: 1092 TMKDSTFIVHSPVPDIPIVLKKPVGETGEKQRCLRSLLNE--SPNRYASNVSDVQPLAFG 1149

Query: 1580 YGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADVACXXXXXXXXXXXXKKSKAT--- 1750
            + +DP L      T+                  D +               +K  +    
Sbjct: 1150 HEEDPGLESEFEGTNNKAPKSKSKKGTKNIGEEDENSRALVCVKEDGGEGRRKEASNNNN 1209

Query: 1751 ---VEVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASS 1882
               VE FP K+VAMHRVRWNMNKGSERWLCYGGAAGIVRCQEI+ +S
Sbjct: 1210 GTKVEGFPPKLVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIAPTS 1256


>ref|XP_006588830.1| PREDICTED: uncharacterized protein LOC100816953 [Glycine max]
          Length = 1121

 Score =  525 bits (1351), Expect = e-146
 Identities = 276/526 (52%), Positives = 339/526 (64%), Gaps = 18/526 (3%)
 Frame = +2

Query: 386  TQNEDRESQDAMT-------STHEVANSTPLENITDSFHLPNDVSLPRLVLCLAHNGKVA 544
            T+ E R SQD          + H+   S  LE    +  +P DV+LPR+V CLAHNGKVA
Sbjct: 599  TEMEGRYSQDRSQPLQYENEANHQPHWSFELEAPPATCSIPEDVTLPRVVSCLAHNGKVA 658

Query: 545  WDVKWRPCNFGDLECKHRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVK 724
            WDVKWRP N  D  CKHRMG+LAVLLGNGSLEVWEVP P  ++  +    K+G DPRF+K
Sbjct: 659  WDVKWRPTNISDSFCKHRMGHLAVLLGNGSLEVWEVPLPHVLRAIYM--HKEGTDPRFIK 716

Query: 725  LEPVFKCSKLKCGDRQSIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRP 904
            LEPVFKCS LK G  QSIPLTVEWS + PHD +LAGCHDGTVALWKF  S SS+C DT+P
Sbjct: 717  LEPVFKCSMLKRGGLQSIPLTVEWSVTPPHDYLLAGCHDGTVALWKFCTSSSSKCDDTKP 776

Query: 905  LLCFSADTVPIRALAWAPDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIY 1084
            +L F  DTVPIR +AWAP E DPES N IVTAGHEGL+FWDLR+P RPL  LN + RIIY
Sbjct: 777  VLIFGGDTVPIRTVAWAPFEGDPESSNIIVTAGHEGLKFWDLRNPFRPLRSLNPMPRIIY 836

Query: 1085 GVDWLPDPRCVILSFDDGTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSV 1264
             +DWL +P C+I+SF+DGT+R +SL+++A D+PVTG+ + G +Q GL     SSF IWSV
Sbjct: 837  SLDWLSNPSCIIMSFEDGTMRTISLVKAANDLPVTGEIYSGKKQPGLHGSAYSSFAIWSV 896

Query: 1265 QASRLTGMIAYCSSDGSVLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPN 1444
            Q SRLTGM+AYC  DG+V+RFQLT+K+V+ D SRNR+  FLCGS+TEED  L + TP+ +
Sbjct: 897  QVSRLTGMVAYCGVDGAVIRFQLTTKSVETDHSRNRSRRFLCGSVTEEDSTLIINTPLSD 956

Query: 1445 IPFPMKKSLNEWSDTPRSIRGFLSDMNQAKRGSEPMS-----DNHTMALCYGDDPSLHIV 1609
             PF  KK   E      S R  L+  N  +  S  M+     D+ T+A+  G+D  L   
Sbjct: 957  APFQWKKP-PEKGRCAESFRDLLAKSNPFRSASNQMAETSNPDSQTLAIGAGEDVGLESG 1015

Query: 1610 SGDT--SAPXXXXXXXXXXXXXXXPDADVACXXXXXXXXXXXXKKSKATV----EVFPSK 1771
            S +   S                     + C               K+      E FP K
Sbjct: 1016 SEEALCSVKQPKRPKLNSGRKKKPEGLALVCGDDDAPPITPEADNEKSDFGNIPETFPPK 1075

Query: 1772 MVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISASSVADRSLSKK 1909
            + A+HRVRWNMNKGSERWLC+GGA G+VRCQEI  S++  R   KK
Sbjct: 1076 VAALHRVRWNMNKGSERWLCFGGACGLVRCQEIVYSNIDKRWALKK 1121


>ref|XP_007133414.1| hypothetical protein PHAVU_011G176600g [Phaseolus vulgaris]
            gi|561006414|gb|ESW05408.1| hypothetical protein
            PHAVU_011G176600g [Phaseolus vulgaris]
          Length = 1092

 Score =  517 bits (1331), Expect = e-143
 Identities = 270/551 (49%), Positives = 345/551 (62%), Gaps = 7/551 (1%)
 Frame = +2

Query: 278  AVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPL 457
            AV  C     T      +K   R   Y+  +S  L   NE         + +++ +++ L
Sbjct: 571  AVSACNTFSTTVVKNSGLKINHREGRYDQDISEPLQYDNE---------ANYQLCSTSEL 621

Query: 458  ENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSL 637
            E  +    +P DV+LPR+V CLAHNGKVAWDVKWRP N  D   KHRMGYLAVLLGNGSL
Sbjct: 622  EACS----VPEDVTLPRVVSCLAHNGKVAWDVKWRPTNISDSSYKHRMGYLAVLLGNGSL 677

Query: 638  EVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHD 817
            EVWEVP P  ++  +    ++G DPRF+KLEPVFKCS LK    QSIPLTVEWS + PHD
Sbjct: 678  EVWEVPLPHVLRTIYM--HREGTDPRFIKLEPVFKCSMLKRRGIQSIPLTVEWSVTPPHD 735

Query: 818  LILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVT 997
             +LAGCHDGTVALWKF  + SS+C DT P+LCF  DTVPIR +AWAP E DPES N IVT
Sbjct: 736  YLLAGCHDGTVALWKFCINSSSKCDDTMPVLCFGGDTVPIRTVAWAPFEGDPESSNIIVT 795

Query: 998  AGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYD 1177
            AGHEGL+FWDLR+P RPL +L+ + RIIY +DWL  P C+I+SF+DGT++ +SL ++A D
Sbjct: 796  AGHEGLKFWDLRNPFRPLRNLHPLPRIIYSLDWLSKPSCIIMSFEDGTMKTISLAKAAND 855

Query: 1178 VPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKD 1357
            +PVTG+ + G +Q GL     +SF IWSVQ SR+TGM+AYC +DG+V RFQLT+K+V+ D
Sbjct: 856  LPVTGEIYSGKKQPGLHGSLYASFAIWSVQVSRITGMLAYCGADGTVFRFQLTTKSVEAD 915

Query: 1358 ASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKK------SLNEWSDTPRSIRGFLSD 1519
             +RN AP FLCGS+TEE+ NL + TP+ N P P KK          + D       + + 
Sbjct: 916  HARNSAPRFLCGSVTEENSNLFINTPLSNAPVPWKKPPVKGRCAESFRDLLSKTNPYKNA 975

Query: 1520 MNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDTSAPXXXXXXXXXXXXXXXPDADVACX 1699
            +NQ    S    D+ T+A+C G++  L + SG   A                        
Sbjct: 976  LNQVPETSIFDCDSQTLAICGGENVDL-LESGSEEASYSMKQPKGPKLNKGSKKKQ---- 1030

Query: 1700 XXXXXXXXXXXKKSKATV-EVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEISA 1876
                       K     + E FP KM A+H+VRWNMNKGSERWLC+GGA G+VRCQEI  
Sbjct: 1031 ---------DEKSDFGNIPETFPPKMAALHKVRWNMNKGSERWLCFGGACGLVRCQEIVY 1081

Query: 1877 SSVADRSLSKK 1909
            S + ++   KK
Sbjct: 1082 SDIDNKLALKK 1092


>ref|XP_007145328.1| hypothetical protein PHAVU_007G229800g [Phaseolus vulgaris]
            gi|561018518|gb|ESW17322.1| hypothetical protein
            PHAVU_007G229800g [Phaseolus vulgaris]
          Length = 1104

 Score =  513 bits (1321), Expect = e-142
 Identities = 272/559 (48%), Positives = 343/559 (61%), Gaps = 15/559 (2%)
 Frame = +2

Query: 278  AVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDRESQDAMTSTHEVANSTPL 457
            AV  C     T      +K   R   Y+  +S  L   N+         + +++ +++ L
Sbjct: 558  AVSACNTFSTTVVKSDGLKINHREGRYDQDISQPLQYDNQ---------ANYQLCSTSEL 608

Query: 458  ENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMGYLAVLLGNGSL 637
            E    +  +P DV+LPR+V CLAHNGKVAWDVKWRP N  D   KHRMGYLAVLLGNGSL
Sbjct: 609  EAPPTTSSIPEDVTLPRVVSCLAHNGKVAWDVKWRPTNISDSSYKHRMGYLAVLLGNGSL 668

Query: 638  EVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPLTVEWSPSFPHD 817
            EVWEVP P  ++  +    ++G DPRF+KLEPVF+CS LK    QSIPLTVEWS + PHD
Sbjct: 669  EVWEVPLPHVVRAIYM--HREGTDPRFIKLEPVFRCSMLKRRGIQSIPLTVEWSVTPPHD 726

Query: 818  LILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDESDPESVNAIVT 997
             +LAGCHDGTVALWKF  + SS+C DT PLLCF  DTVPIR +AWAP E DPES N IVT
Sbjct: 727  YLLAGCHDGTVALWKFCINSSSKCDDTMPLLCFGGDTVPIRTVAWAPFEGDPESSNIIVT 786

Query: 998  AGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTLRILSLLRSAYD 1177
            AGHEGL+FWDLR+P RPL  L+   RIIY +DWL  P C+I+SF+DGT+R +SL ++A D
Sbjct: 787  AGHEGLKFWDLRNPFRPLRSLHPAPRIIYSLDWLSKPSCIIMSFEDGTIRTISLAKAAND 846

Query: 1178 VPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLRFQLTSKAVDKD 1357
            +PVTG+ + G +Q GL     +SF IWSVQ SR+TGM+A+C +DG+V RFQLT+K+V+ D
Sbjct: 847  LPVTGEIYSGKKQPGLHGSLYASFAIWSVQVSRITGMLAFCGADGTVFRFQLTTKSVETD 906

Query: 1358 ASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIRGFLSDMNQAKR 1537
             +RNRA  FLCGS+TEE+ NL + TP+ N PF  KK L        S R  LS  N  K 
Sbjct: 907  HARNRARRFLCGSVTEENSNLVINTPVSNAPFLCKK-LPVKGRCAESFRDLLSKTNPYKN 965

Query: 1538 GSEPMS-----DNHTMALCYGDDPSLHIV-SGDTSA-----PXXXXXXXXXXXXXXXPDA 1684
                +      D  +  L  G D ++ ++ SG   A                      + 
Sbjct: 966  ALNKVPETSSFDFDSQTLAIGADENVDLLESGSEEALYSMKQPKRTKLNNGSKKKPEENL 1025

Query: 1685 DVACXXXXXXXXXXXXKKSKATV----EVFPSKMVAMHRVRWNMNKGSERWLCYGGAAGI 1852
            DV C               K+      E FP KM A+H+VRWNMNKGSE+WLC+GGA G+
Sbjct: 1026 DVVCKDGDVPLITTEADNEKSDFGNIPETFPPKMAALHKVRWNMNKGSEKWLCFGGACGL 1085

Query: 1853 VRCQEISASSVADRSLSKK 1909
            VRCQEI  S +  +   KK
Sbjct: 1086 VRCQEIVYSDIDKKWALKK 1104


>ref|XP_007203691.1| hypothetical protein PRUPE_ppa024767mg [Prunus persica]
            gi|462399222|gb|EMJ04890.1| hypothetical protein
            PRUPE_ppa024767mg [Prunus persica]
          Length = 1070

 Score =  510 bits (1314), Expect = e-142
 Identities = 277/525 (52%), Positives = 339/525 (64%), Gaps = 16/525 (3%)
 Frame = +2

Query: 101  EEPLDDFHGGSQCVVALSAELLEDS----SVTGADKNTQELAMLKSHDKDIVLETATHEG 268
            EE +D+  G S  V ALS +  E S    S      NTQE      H K         + 
Sbjct: 351  EESVDNLDGSSNYVEALSIQHPEGSPELHSTGCVPANTQE------HGKK-------RKN 397

Query: 269  CIHAVRTCKASLVTPTHKRKMKDKARAKSYNNSVSPLLLTQNEDR------------ESQ 412
              HA   C  +L +   +RK+ D   A + NN   P LL QNE++              Q
Sbjct: 398  YNHAASECNPTLKSYARRRKLNDMESAGTNNNHTCPPLLNQNEEKGPLVSDYHIQQSSGQ 457

Query: 413  DAMTSTHEVANSTPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECK 592
            D  TS +   N  P    T    +P DV+LPR+V CLAH+GKVAWDVKWRP +  D +CK
Sbjct: 458  DPQTSNNVQDNDYPKIGSTRC-SVPEDVALPRIVSCLAHHGKVAWDVKWRPPSEHDSKCK 516

Query: 593  HRMGYLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQ 772
            HRMGYLAVL GNGSLEVW+VP P  I+  +SS  ++G DPRF+KL PVF+CS LKCG  +
Sbjct: 517  HRMGYLAVLSGNGSLEVWDVPLPHAIEVIYSSSCREGTDPRFIKLAPVFRCSMLKCGSEK 576

Query: 773  SIPLTVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAW 952
            SIPLTVEWS S  HD +LAGCHDGTVALWKF+AS +SQ  DTRPLLCFSADT PIRALAW
Sbjct: 577  SIPLTVEWSASPAHDYLLAGCHDGTVALWKFSASNASQ--DTRPLLCFSADTNPIRALAW 634

Query: 953  APDESDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFD 1132
            AP +S  E  N I TAGH GL+FWDLRDP RPLWDL+ + + IY +DWLPDPRCVILSFD
Sbjct: 635  APVDSSSEGANVIATAGHGGLKFWDLRDPFRPLWDLDHLPKFIYSLDWLPDPRCVILSFD 694

Query: 1133 DGTLRILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDG 1312
            DGT++++SL+++A D PVTG    GT+Q GL +  C  F IWSV  SRLTGM AYC +DG
Sbjct: 695  DGTMKVISLVKAASDDPVTG--MAGTKQPGLHNLSCLPFAIWSVHVSRLTGMAAYCGADG 752

Query: 1313 SVLRFQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTP 1492
            +VLRFQLTSK+V+KD  R+RAPHFLC SLT E+  +T+ T + N PFP+K   N     P
Sbjct: 753  TVLRFQLTSKSVEKDPRRHRAPHFLCVSLTMEESAVTINTTVSNTPFPLKVVRN----NP 808

Query: 1493 RSIRGFLSDMNQAKRGSEPMSDNHTMALCYGDDPSLHIVSGDTSA 1627
             S +  +   N  KR  +  S++ T+ALCYG DP +   SG+  A
Sbjct: 809  ESNK--VKSAND-KRAKDSASEDQTLALCYGVDPDIQSESGEKVA 850



 Score = 85.1 bits (209), Expect = 1e-13
 Identities = 49/135 (36%), Positives = 64/135 (47%), Gaps = 3/135 (2%)
 Frame = +2

Query: 1484 DTPRSIRGFLSDMNQAKRGSEPMSDNHTMALCYGDDPSLHIVS---GDTSAPXXXXXXXX 1654
            D    +    S   Q  R S+   ++     C G++   + +    G  S          
Sbjct: 928  DEETGVASLKSKKTQKSRSSKKNPNDDRGLACIGEEEPTNTLEEEIGVASPESKKKQKTR 987

Query: 1655 XXXXXXXPDADVACXXXXXXXXXXXXKKSKATVEVFPSKMVAMHRVRWNMNKGSERWLCY 1834
                    D D+AC            ++    +E+FP K+VAMHRVRWNMNKGSERWLCY
Sbjct: 988  SSKKKPNDDQDLACIDEVPINTQE--EEDGKELEIFPDKIVAMHRVRWNMNKGSERWLCY 1045

Query: 1835 GGAAGIVRCQEISAS 1879
            GGAAG+VRCQEI  S
Sbjct: 1046 GGAAGLVRCQEIVLS 1060


>ref|XP_004515185.1| PREDICTED: uncharacterized protein LOC101510901 [Cicer arietinum]
          Length = 981

 Score =  508 bits (1308), Expect = e-141
 Identities = 262/507 (51%), Positives = 334/507 (65%), Gaps = 12/507 (2%)
 Frame = +2

Query: 425  STHEVANSTPLENITDSFHLPNDVSLPRLVLCLAHNGKVAWDVKWRPCNFGDLECKHRMG 604
            + H+   S+ LE    +  +P +V+LPR+V CLAHNGKVAWDVKWRP N  D   KHRMG
Sbjct: 478  ANHQPHGSSVLEPPASTCSVPGNVALPRVVSCLAHNGKVAWDVKWRPLNNFDSSTKHRMG 537

Query: 605  YLAVLLGNGSLEVWEVPSPRTIKFSFSSCQKDGIDPRFVKLEPVFKCSKLKCGDRQSIPL 784
            YLAVLLGNGSLEVWEVP P  ++  ++  Q++G DPRF+KLEPVFKCS LK G  QSIPL
Sbjct: 538  YLAVLLGNGSLEVWEVPLPHVLRSIYT--QREGTDPRFIKLEPVFKCSMLKRGSLQSIPL 595

Query: 785  TVEWSPSFPHDLILAGCHDGTVALWKFAASGSSQCTDTRPLLCFSADTVPIRALAWAPDE 964
            TVEWS + PHD ILAGCHDGTVALWKF+ + SS+C DT+P+LCF  DTVPIRA+AWAP E
Sbjct: 596  TVEWSVTPPHDYILAGCHDGTVALWKFSTNSSSKCDDTKPMLCFGGDTVPIRAVAWAPFE 655

Query: 965  SDPESVNAIVTAGHEGLRFWDLRDPHRPLWDLNSVRRIIYGVDWLPDPRCVILSFDDGTL 1144
             DPE  N IVTAGHEGL+FWDLR+P RPL +L   +RIIY +DWL  P C+I+SF+DGT+
Sbjct: 656  GDPEISNIIVTAGHEGLKFWDLRNPFRPLRNLQPSQRIIYSLDWLSKPSCIIMSFEDGTM 715

Query: 1145 RILSLLRSAYDVPVTGKPFVGTQQQGLDSYFCSSFPIWSVQASRLTGMIAYCSSDGSVLR 1324
            + +SL+++A D+PVTG  + G +Q  L     SS+ IWSV  SR TGM+AYC +DGS +R
Sbjct: 716  KTVSLVKAASDLPVTGTIYTGKKQPWLHGTTYSSYAIWSVHVSRETGMVAYCGADGSAVR 775

Query: 1325 FQLTSKAVDKDASRNRAPHFLCGSLTEEDMNLTVYTPMPNIPFPMKKSLNEWSDTPRSIR 1504
            FQLT+KAV+ D S NR P FLCGS+ EE+  + V TP+ N PFP+KK+ +E      S R
Sbjct: 776  FQLTTKAVETDHSHNRLPFFLCGSVCEEESTIIVNTPVSNSPFPLKKT-HEKGPQVNSFR 834

Query: 1505 GFLSDMNQAK---RGSEPMSDNHTMALCYGDDPSLHIVSGD----TSAPXXXXXXXXXXX 1663
              LS  N ++     +   S+N +  L   D  +  + SG     +SA            
Sbjct: 835  DLLSKENLSRSVINQTTKASNNDSEILALYDVDNFDLESGYEEALSSAEQPKRPKLSCSS 894

Query: 1664 XXXXPDADVACXXXXXXXXXXXXKKSK-----ATVEVFPSKMVAMHRVRWNMNKGSERWL 1828
                 ++                 K K        EVFP K+VA+H+VRWNMNKGSE+WL
Sbjct: 895  KKKPRESTSLVRRDGALTNKPGVDKEKLDSGNIIPEVFPHKLVALHKVRWNMNKGSEKWL 954

Query: 1829 CYGGAAGIVRCQEISASSVADRSLSKK 1909
            C+GGA+G+VRCQEI  S +  +   K+
Sbjct: 955  CFGGASGLVRCQEIVFSDIDKKRALKR 981

