BLASTX nr result

ID: Akebia26_contig00012298 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00012298
         (1923 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007029794.1| P-loop containing nucleoside triphosphate hy...  1010   0.0  
ref|XP_007029793.1| P-loop containing nucleoside triphosphate hy...  1010   0.0  
ref|XP_007029795.1| P-loop containing nucleoside triphosphate hy...  1001   0.0  
ref|XP_002264216.1| PREDICTED: DNA-binding protein SMUBP-2 [Viti...   988   0.0  
ref|XP_006437411.1| hypothetical protein CICLE_v10030616mg [Citr...   981   0.0  
ref|XP_006484692.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   980   0.0  
ref|XP_004143639.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   980   0.0  
ref|XP_002524012.1| DNA-binding protein smubp-2, putative [Ricin...   972   0.0  
gb|EXB79398.1| DNA-binding protein SMUBP-2 [Morus notabilis]          964   0.0  
ref|XP_002319231.2| hypothetical protein POPTR_0013s07150g [Popu...   962   0.0  
ref|XP_002870460.1| hypothetical protein ARALYDRAFT_493645 [Arab...   952   0.0  
ref|XP_004514995.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   951   0.0  
ref|XP_007145941.1| hypothetical protein PHAVU_007G280900g [Phas...   949   0.0  
ref|NP_198446.3| P-loop containing nucleoside triphosphate hydro...   949   0.0  
ref|XP_006588516.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   947   0.0  
ref|XP_006283073.1| hypothetical protein CARUB_v10004066mg [Caps...   946   0.0  
ref|XP_006574496.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   940   0.0  
ref|XP_006574495.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   940   0.0  
ref|XP_006574494.1| PREDICTED: DNA-binding protein SMUBP-2-like ...   940   0.0  
ref|XP_006878575.1| hypothetical protein AMTR_s00011p00245550 [A...   927   0.0  

>ref|XP_007029794.1| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein isoform 2 [Theobroma cacao]
            gi|508718399|gb|EOY10296.1| P-loop containing nucleoside
            triphosphate hydrolases superfamily protein isoform 2
            [Theobroma cacao]
          Length = 953

 Score = 1010 bits (2611), Expect = 0.0
 Identities = 505/640 (78%), Positives = 556/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DF +AE+QGEFLE+ QRMGPGLTFVIQ+QPY NAIP+PLGLE++CLKACTHYPTLFDHFQ
Sbjct: 184  DFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTLFDHFQ 243

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELR++LQ+LQ+ SV++DWRET+SWKLLKELANSAQHRAI RK+ QPK V   LGMD+EK
Sbjct: 244  RELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLGMDLEK 303

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KA+Q RIDEF K MS+LLRIERDAELEFTQEEL+AVP PD  SDSSKPIE+LVS GQ Q
Sbjct: 304  AKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVSHGQAQ 363

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+AVS+STGLGGMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 364  QELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 423

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            CMQGFV+NLGEDGCSI VALESRHGDPTFSK FGK VRIDRI GLAD LTYERNCEALML
Sbjct: 424  CMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNCEALML 483

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSIA+VATLFGD+EDVTWLE+N   DW+  +LDG+L+ G +DDSQ +AIA
Sbjct: 484  LQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQQRAIA 543

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+L+VQGPP           IAL+V+QGERVLV APTNAAVDN+VEKLSNIGL
Sbjct: 544  LGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKLSNIGL 603

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSL EIVNSKL+ +  E+ERKK+DLRKDLR CLKDDSLAAGIR
Sbjct: 604  NIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSLAAGIR 663

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRR+ TFDLVVIDEA QAIEP
Sbjct: 664  QLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAGQAIEP 723

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+T+ EGVLATML TQYR
Sbjct: 724  SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATMLTTQYR 783

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY+G L SSP V SHLL+DSPFVK TWI
Sbjct: 784  MNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWI 823


>ref|XP_007029793.1| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508718398|gb|EOY10295.1| P-loop containing nucleoside
            triphosphate hydrolases superfamily protein isoform 1
            [Theobroma cacao]
          Length = 1008

 Score = 1010 bits (2611), Expect = 0.0
 Identities = 505/640 (78%), Positives = 556/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DF +AE+QGEFLE+ QRMGPGLTFVIQ+QPY NAIP+PLGLE++CLKACTHYPTLFDHFQ
Sbjct: 184  DFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTLFDHFQ 243

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELR++LQ+LQ+ SV++DWRET+SWKLLKELANSAQHRAI RK+ QPK V   LGMD+EK
Sbjct: 244  RELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLGMDLEK 303

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KA+Q RIDEF K MS+LLRIERDAELEFTQEEL+AVP PD  SDSSKPIE+LVS GQ Q
Sbjct: 304  AKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVSHGQAQ 363

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+AVS+STGLGGMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 364  QELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 423

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            CMQGFV+NLGEDGCSI VALESRHGDPTFSK FGK VRIDRI GLAD LTYERNCEALML
Sbjct: 424  CMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNCEALML 483

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSIA+VATLFGD+EDVTWLE+N   DW+  +LDG+L+ G +DDSQ +AIA
Sbjct: 484  LQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQQRAIA 543

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+L+VQGPP           IAL+V+QGERVLV APTNAAVDN+VEKLSNIGL
Sbjct: 544  LGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKLSNIGL 603

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSL EIVNSKL+ +  E+ERKK+DLRKDLR CLKDDSLAAGIR
Sbjct: 604  NIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSLAAGIR 663

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRR+ TFDLVVIDEA QAIEP
Sbjct: 664  QLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAGQAIEP 723

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+T+ EGVLATML TQYR
Sbjct: 724  SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATMLTTQYR 783

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY+G L SSP V SHLL+DSPFVK TWI
Sbjct: 784  MNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWI 823


>ref|XP_007029795.1| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein isoform 3 [Theobroma cacao]
            gi|508718400|gb|EOY10297.1| P-loop containing nucleoside
            triphosphate hydrolases superfamily protein isoform 3
            [Theobroma cacao]
          Length = 951

 Score = 1001 bits (2588), Expect = 0.0
 Identities = 503/640 (78%), Positives = 554/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DF +AE+QGEFLE+ QRMGPGLTFVIQ+QPY NAIP+PLGLE++CLKACTHYPTLFDHFQ
Sbjct: 184  DFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTLFDHFQ 243

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELR++LQ+LQ+ SV++DWRET+SWKLLKELANSAQHRAI RK+ QPK V   LGMD+EK
Sbjct: 244  RELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLGMDLEK 303

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KA+Q RIDEF K MS+LLRIERDAELEFTQEEL+AVP PD  SDSSKPIE+LVS GQ Q
Sbjct: 304  AKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVSHGQAQ 363

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+AVS+STG  GMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 364  QELCDTICNLNAVSTSTG--GMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 421

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            CMQGFV+NLGEDGCSI VALESRHGDPTFSK FGK VRIDRI GLAD LTYERNCEALML
Sbjct: 422  CMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNCEALML 481

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSIA+VATLFGD+EDVTWLE+N   DW+  +LDG+L+ G +DDSQ +AIA
Sbjct: 482  LQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQQRAIA 541

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+L+VQGPP           IAL+V+QGERVLV APTNAAVDN+VEKLSNIGL
Sbjct: 542  LGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKLSNIGL 601

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSL EIVNSKL+ +  E+ERKK+DLRKDLR CLKDDSLAAGIR
Sbjct: 602  NIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSLAAGIR 661

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRR+ TFDLVVIDEA QAIEP
Sbjct: 662  QLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAGQAIEP 721

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+T+ EGVLATML TQYR
Sbjct: 722  SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATMLTTQYR 781

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY+G L SSP V SHLL+DSPFVK TWI
Sbjct: 782  MNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWI 821


>ref|XP_002264216.1| PREDICTED: DNA-binding protein SMUBP-2 [Vitis vinifera]
          Length = 953

 Score =  988 bits (2554), Expect = 0.0
 Identities = 501/640 (78%), Positives = 546/640 (85%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFASAE+QGEF E+ QRMGPGL+FVIQ+QPY NAIPMPLG E++CLKACTHYPTLFDHFQ
Sbjct: 129  DFASAELQGEFAELRQRMGPGLSFVIQAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQ 188

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQD QRKS   DWRET SW+LLKELANSAQHRAI RKV QPK +   LGM+++K
Sbjct: 189  RELRDVLQDHQRKSQFQDWRETQSWQLLKELANSAQHRAISRKVSQPKPLKGVLGMELDK 248

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KAIQSRIDEF K MS+LL+IERD+ELEFTQEEL+AVP PD +SDSSKPIE+LVS GQ Q
Sbjct: 249  AKAIQSRIDEFTKRMSELLQIERDSELEFTQEELNAVPTPDESSDSSKPIEFLVSHGQAQ 308

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+AVS+  GLGGMHLVLFKVEG+HRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 309  QELCDTICNLNAVSTFIGLGGMHLVLFKVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 368

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            CMQGFV++LG+DGCSI VALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNCEALML
Sbjct: 369  CMQGFVDSLGKDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALML 428

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSIA+VATLFGD+EDV WLE+N LVDW+   LD +LE G YDDSQ +AIA
Sbjct: 429  LQKNGLQKKNPSIAVVATLFGDKEDVAWLEENDLVDWAEVGLDELLESGAYDDSQRRAIA 488

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+LI+QGPP           IAL+V+QGERVLVTAPTNAAVDN+VEKLSNIG+
Sbjct: 489  LGLNKKRPILIIQGPPGTGKTVLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGV 548

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSLGEIVNSKL  F  E+ERKK+DLRKDLR CLKDDSLAAGIR
Sbjct: 549  NIVRVGNPARISSAVASKSLGEIVNSKLENFLTEFERKKSDLRKDLRHCLKDDSLAAGIR 608

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADP+IRRL  FDLV+IDEA QAIEP
Sbjct: 609  QLLKQLGKALKKKEKETVKEVLSSAQVVLATNTGAADPVIRRLDAFDLVIIDEAGQAIEP 668

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCI+AGDQCQLAPV+LSR+ALEGGLG+SLLERA+TL E VLAT L TQYR
Sbjct: 669  SCWIPILQGKRCIIAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEEVLATKLTTQYR 728

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY G L SS  V SHLL+DSPFVK  WI
Sbjct: 729  MNDAIASWASKEMYGGSLKSSSSVFSHLLVDSPFVKPAWI 768


>ref|XP_006437411.1| hypothetical protein CICLE_v10030616mg [Citrus clementina]
            gi|557539607|gb|ESR50651.1| hypothetical protein
            CICLE_v10030616mg [Citrus clementina]
          Length = 1010

 Score =  981 bits (2537), Expect = 0.0
 Identities = 499/640 (77%), Positives = 552/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFASAEIQGEF E+ QRMGPGLTFVI++QPY NAIPMP+GLE++CLKA THYPTLFDHFQ
Sbjct: 187  DFASAEIQGEFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQ 246

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQ+LQ+K ++ DW ET+SWKLLKELANSAQHRAIVRKV QPK V   LGMD+E+
Sbjct: 247  RELRDVLQELQQKLLVQDWHETESWKLLKELANSAQHRAIVRKVTQPKPVQGVLGMDLER 306

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VK IQSR+DEF + MS+LLRIERDAELEFTQEEL+AVP PD NSDSSKPIE+LVS G+  
Sbjct: 307  VKTIQSRLDEFTQRMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGRAP 366

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL AVS+STGLGGMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGA ATS
Sbjct: 367  QELCDTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATS 426

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C+QGFV+NLGEDGC+I VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEALML
Sbjct: 427  CIQGFVHNLGEDGCTISVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEALML 486

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGL K+NPSIA V TLFGD+EDVTWLE+N L DWS  +LDG++ +  +DDSQ KAIA
Sbjct: 487  LQKNGLHKRNPSIAAVVTLFGDKEDVTWLEENDLADWSEVKLDGIMGK-TFDDSQKKAIA 545

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+LI+QGPP           IA +V+QGERVLVTAPTNAAVDN+VEKLS++GL
Sbjct: 546  LGLNKKRPLLIIQGPPGTGKTGLLKEIIARAVQQGERVLVTAPTNAAVDNMVEKLSDVGL 605

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARIS AVASKSLGEIV SKL++F  E+ERKK+DLRKDLRQCLKDDSLAAGIR
Sbjct: 606  NIVRVGNPARISPAVASKSLGEIVKSKLASFVAEFERKKSDLRKDLRQCLKDDSLAAGIR 665

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRRL TFDLVVIDEA+QAIEP
Sbjct: 666  QLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAAQAIEP 725

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SC IPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+TL EGVLAT L TQYR
Sbjct: 726  SCLIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLATKLTTQYR 785

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY G L+SS  V+SHLL+D+PFVK TWI
Sbjct: 786  MNDAIASWASKEMYGGSLISSSTVASHLLVDTPFVKPTWI 825


>ref|XP_006484692.1| PREDICTED: DNA-binding protein SMUBP-2-like [Citrus sinensis]
          Length = 1010

 Score =  980 bits (2533), Expect = 0.0
 Identities = 498/640 (77%), Positives = 551/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFASAEIQGEF E+ QRMGPGLTFVI++QPY NAIPMP+GLE++CLKA THYPTLFDHFQ
Sbjct: 187  DFASAEIQGEFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQ 246

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQ+LQ+K ++ DW ET+SWKLLKELANSAQHRAIVRKV QPK V   LGMD+E+
Sbjct: 247  RELRDVLQELQQKLLVQDWHETESWKLLKELANSAQHRAIVRKVTQPKPVQGVLGMDLER 306

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VK IQSR+DEF + MS+LLRIERDAELEFTQEEL+AVP PD NSDSSKPIE+LVS G+  
Sbjct: 307  VKTIQSRLDEFTQRMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGRAP 366

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL  VS+STGLGGMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGA ATS
Sbjct: 367  QELCDTICNLFVVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATS 426

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C+QGFV+NLGEDGC+I VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEALML
Sbjct: 427  CIQGFVHNLGEDGCTISVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEALML 486

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGL K+NPSIA V TLFGD+EDVTWLE+N L DWS  +LDG++ +  +DDSQ KAIA
Sbjct: 487  LQKNGLHKRNPSIAAVVTLFGDKEDVTWLEENDLADWSEVKLDGIMGK-TFDDSQKKAIA 545

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP+LI+QGPP           IA +V+QGERVLVTAPTNAAVDN+VEKLS++GL
Sbjct: 546  LGLNKKRPLLIIQGPPGTGKTGLLKEIIARAVQQGERVLVTAPTNAAVDNMVEKLSDVGL 605

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARIS AVASKSLGEIV SKL++F  E+ERKK+DLRKDLRQCLKDDSLAAGIR
Sbjct: 606  NIVRVGNPARISPAVASKSLGEIVKSKLASFVAEFERKKSDLRKDLRQCLKDDSLAAGIR 665

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRRL TFDLVVIDEA+QAIEP
Sbjct: 666  QLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAAQAIEP 725

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SC IPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+TL EGVLAT L TQYR
Sbjct: 726  SCLIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLATKLTTQYR 785

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY G L+SS  V+SHLL+D+PFVK TWI
Sbjct: 786  MNDAIASWASKEMYGGSLISSSTVASHLLVDTPFVKPTWI 825


>ref|XP_004143639.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cucumis sativus]
            gi|449527761|ref|XP_004170878.1| PREDICTED: DNA-binding
            protein SMUBP-2-like [Cucumis sativus]
          Length = 957

 Score =  980 bits (2533), Expect = 0.0
 Identities = 490/640 (76%), Positives = 551/640 (86%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFA+AE+QG+F E+ QRMG GLTFVIQ+QPY NA+PMPLGLE++CLKA THYPTLFDHFQ
Sbjct: 133  DFAAAEVQGDFPELQQRMGQGLTFVIQAQPYLNAVPMPLGLEAVCLKASTHYPTLFDHFQ 192

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQDLQR+S+  DWRET SWKLLK+LA+S QH+AI RK+ +PK V   LGMD++K
Sbjct: 193  RELRDVLQDLQRQSLFLDWRETQSWKLLKKLAHSVQHKAIARKISEPKVVQGALGMDLKK 252

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KAIQ+RIDEFA  MS+LLRIERD+ELEFTQEEL+AVP PD +SD+SKPIE+LVS GQ Q
Sbjct: 253  AKAIQNRIDEFANRMSELLRIERDSELEFTQEELNAVPTPDESSDNSKPIEFLVSHGQAQ 312

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+AVS+STGLGGMHLVLF+VEGSHRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 313  QELCDTICNLNAVSTSTGLGGMHLVLFRVEGSHRLPPTTLSPGDMVCVRVCDSRGAGATS 372

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            CMQGFVNNLG+DGCSI VALESRHGDPTFSKLFGK VRIDRIPGLADTLTYERNCEALML
Sbjct: 373  CMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKTVRIDRIPGLADTLTYERNCEALML 432

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGL KKNPSIA+VATLFGD+ED+ W+E N+L+  + T LDG++  G +DDSQ  AI+
Sbjct: 433  LQKNGLHKKNPSIAVVATLFGDKEDIKWMEDNNLIGLADTNLDGIVFNGDFDDSQKSAIS 492

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
              LNKKRP+LI+QGPP           IAL+V+QGERVLVTAPTNAAVDN+VEKLSNIG+
Sbjct: 493  RALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGI 552

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISS+VASKSL EIVNS+LS+FR + ERKKADLRKDLRQCLKDDSLAAGIR
Sbjct: 553  NIVRVGNPARISSSVASKSLAEIVNSELSSFRTDIERKKADLRKDLRQCLKDDSLAAGIR 612

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  S+AQVVL TNTGAADPLIR+L  FDLVVIDEA QAIEP
Sbjct: 613  QLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGAADPLIRKLEKFDLVVIDEAGQAIEP 672

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            +CWIPILQG+RCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+TL EG L TML  QYR
Sbjct: 673  ACWIPILQGRRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGALTTMLTIQYR 732

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY+G+L SSP VSSHLL++SPFVK TWI
Sbjct: 733  MNDAIASWASKEMYDGILESSPTVSSHLLVNSPFVKPTWI 772


>ref|XP_002524012.1| DNA-binding protein smubp-2, putative [Ricinus communis]
            gi|223536739|gb|EEF38380.1| DNA-binding protein smubp-2,
            putative [Ricinus communis]
          Length = 989

 Score =  973 bits (2514), Expect = 0.0
 Identities = 492/642 (76%), Positives = 543/642 (84%), Gaps = 2/642 (0%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMG--PGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDH 175
            DFASAE QGEFLE+ QRM    GLTFVIQ+QPY NA+P+PLG E+LCLKAC HYPTLFDH
Sbjct: 163  DFASAETQGEFLELRQRMDLEAGLTFVIQAQPYINAVPIPLGFEALCLKACIHYPTLFDH 222

Query: 176  FQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDV 355
            FQRELRDVLQDLQRK ++ DW+ T+SWKLLKELANS QHRA+ RKV +PK +   LGM++
Sbjct: 223  FQRELRDVLQDLQRKGLVQDWQNTESWKLLKELANSVQHRAVARKVSKPKPLQGVLGMNL 282

Query: 356  EKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQ 535
            +K KAIQSRIDEF K MS+LL+IERD+ELEFTQEEL+AVP PD NSD SKPIE+LVS GQ
Sbjct: 283  DKAKAIQSRIDEFTKTMSELLQIERDSELEFTQEELNAVPTPDENSDPSKPIEFLVSHGQ 342

Query: 536  DQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGA 715
             QQELCDTICNL+AVS+STGLGGMHLVLF+VEG+HRLPPT LSPGDMVCVR CDSRGAGA
Sbjct: 343  AQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGA 402

Query: 716  TSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEAL 895
            TSCMQGFVNNLGEDGCSI VALESRHGDPTFSKLFGKGVRIDRI GLAD LTYERNCEAL
Sbjct: 403  TSCMQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKGVRIDRIHGLADALTYERNCEAL 462

Query: 896  MLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKA 1075
            MLLQKNGLQKKNPSIAIVATLFGD ED+ WLE+  L +W+  ++DG      +DDSQ +A
Sbjct: 463  MLLQKNGLQKKNPSIAIVATLFGDSEDLAWLEEKDLAEWNEADMDGCFGSERFDDSQRRA 522

Query: 1076 IALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNI 1255
            +ALGLN+KRP+LI+QGPP           I  +V QGERVLVTAPTNAAVDN+VEKLSNI
Sbjct: 523  MALGLNQKRPLLIIQGPPGTGKSGLLKELIVRAVHQGERVLVTAPTNAAVDNMVEKLSNI 582

Query: 1256 GLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAG 1435
            GL+IVRVGNPARISSAVASKSL EIVNSKL+TFR E+ERKK+DLRKDLR CL+DDSLAAG
Sbjct: 583  GLDIVRVGNPARISSAVASKSLSEIVNSKLATFRMEFERKKSDLRKDLRHCLEDDSLAAG 642

Query: 1436 IRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAI 1615
            IR                      SSAQVVL TNTGAADPLIRRL TFDLVVIDEA QAI
Sbjct: 643  IRQLLKQLGKTMKKKEKESVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAGQAI 702

Query: 1616 EPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQ 1795
            EPSCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA+TL +GVLA  L TQ
Sbjct: 703  EPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHDGVLALQLTTQ 762

Query: 1796 YRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            YRMN AIA WASKEMY GLL SS +V+SHLL+ SPFVK TWI
Sbjct: 763  YRMNDAIASWASKEMYGGLLKSSSKVASHLLVHSPFVKPTWI 804


>gb|EXB79398.1| DNA-binding protein SMUBP-2 [Morus notabilis]
          Length = 978

 Score =  964 bits (2492), Expect = 0.0
 Identities = 487/645 (75%), Positives = 545/645 (84%), Gaps = 5/645 (0%)
 Frame = +2

Query: 2    DFASAEI----QGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLF 169
            DFAS E+    + +F E+ Q+MGPGLTFVIQ+QPY NA+PMP GLE++CLKACTHYPTLF
Sbjct: 149  DFASTEVGAGEESDFSELQQQMGPGLTFVIQAQPYLNAVPMPPGLEAVCLKACTHYPTLF 208

Query: 170  DHFQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVP-QPKSVLRGLG 346
            DHFQRELRDVLQDLQR+SV+ +W ET SWKLLKELA S QHRA+ RK P  PKS L  LG
Sbjct: 209  DHFQRELRDVLQDLQRRSVVSNWCETCSWKLLKELAGSVQHRAVARKAPGPPKSALSVLG 268

Query: 347  MDVEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVS 526
            M++EK KAIQSRID+F   MS+LLRIERDAELEFTQEELDAVPMPD +SDSSKPIE+LVS
Sbjct: 269  MEMEKAKAIQSRIDKFTNGMSELLRIERDAELEFTQEELDAVPMPDQSSDSSKPIEFLVS 328

Query: 527  RGQDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRG 706
             GQ QQELCDTICNL+AVS+STGLGGMHLV FKVEG+H+LPPTTLSPGDMVCVR+CDSRG
Sbjct: 329  HGQAQQELCDTICNLNAVSTSTGLGGMHLVQFKVEGNHKLPPTTLSPGDMVCVRSCDSRG 388

Query: 707  AGATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNC 886
            AGATSCMQGFVNN  EDGCSI +ALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNC
Sbjct: 389  AGATSCMQGFVNNFEEDGCSISIALESRHGDPTFSKLFGKNVRIDRIYGLADVLTYERNC 448

Query: 887  EALMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQ 1066
            EALMLLQKNGLQKKNPS+A+VATLFGD+EDV WLEQN+ VDW+  EL G       D+SQ
Sbjct: 449  EALMLLQKNGLQKKNPSVAVVATLFGDKEDVKWLEQNNFVDWTEQELSGHFTNENLDESQ 508

Query: 1067 LKAIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKL 1246
             +AIALGLNKK+P+L++QGPP           IAL+V+QGERVLVTAPTNAAVDN+V+KL
Sbjct: 509  RRAIALGLNKKQPILVIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVDKL 568

Query: 1247 SNIGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSL 1426
            S IGLNIVRVGNPARIS +VASKSLG+IVNSKL+ F+ E ERKK+DLRKDLR CLKDDSL
Sbjct: 569  SEIGLNIVRVGNPARISPSVASKSLGQIVNSKLANFKAELERKKSDLRKDLRHCLKDDSL 628

Query: 1427 AAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEAS 1606
            AAGIR                      S+A+VVL TNTGAADPLIR+L TFDLVVIDEA+
Sbjct: 629  AAGIRQLLKQLGKTLKKEEKQAVREVLSNARVVLATNTGAADPLIRKLDTFDLVVIDEAA 688

Query: 1607 QAIEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATML 1786
            QAIEP+CWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERA++L  G+L T L
Sbjct: 689  QAIEPACWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAASLHGGLLTTKL 748

Query: 1787 MTQYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
             TQYRMN AIA WASKEMY+GLL SSP VSSHLL+DSPFVK TWI
Sbjct: 749  TTQYRMNDAIASWASKEMYDGLLKSSPTVSSHLLVDSPFVKPTWI 793


>ref|XP_002319231.2| hypothetical protein POPTR_0013s07150g [Populus trichocarpa]
            gi|550325174|gb|EEE95154.2| hypothetical protein
            POPTR_0013s07150g [Populus trichocarpa]
          Length = 983

 Score =  962 bits (2488), Expect = 0.0
 Identities = 490/640 (76%), Positives = 539/640 (84%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            +FASAE QGEF E+ QRMGPGLTFVIQ+QPY NA+PMPLGLE++CLKACTHYPTLFDHFQ
Sbjct: 160  EFASAEAQGEFTELRQRMGPGLTFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDHFQ 219

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELR+VLQDL+RK ++ DW++T+SWKLLKELANSAQHRAI RK  Q K +   LGM++EK
Sbjct: 220  RELREVLQDLKRKGLVQDWQKTESWKLLKELANSAQHRAIARKATQSKPLQGVLGMNLEK 279

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
             KAIQ RI+EF   MS+LLRIERDAELEFTQEEL+AVP  D +SDSSKPIE+LVS GQ Q
Sbjct: 280  AKAIQGRINEFTNQMSELLRIERDAELEFTQEELNAVPTLDESSDSSKPIEFLVSHGQGQ 339

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL AVS+STGLGGMHLVLF+VEG+HRLPPTTLSPGDMVCVR CDSRGAGATS
Sbjct: 340  QELCDTICNLYAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 399

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
             +QGFVNNLGEDGCSI VALESRHGDPTFSKL GK VRIDRI GLAD +TYERNCEALML
Sbjct: 400  SLQGFVNNLGEDGCSISVALESRHGDPTFSKLSGKSVRIDRIHGLADAVTYERNCEALML 459

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQK GL KKNPSIA+VATLFGD+EDV WLE+N L  W   + D  L +  +DDSQ +AI 
Sbjct: 460  LQKKGLHKKNPSIAVVATLFGDKEDVAWLEENDLASWDEADFDEHLGKP-FDDSQRRAIT 518

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRP LI+QGPP           IAL+V +GERVLVTAPTNAAVDN+VEKLSNIGL
Sbjct: 519  LGLNKKRPFLIIQGPPGTGKSGLLKELIALAVGKGERVLVTAPTNAAVDNMVEKLSNIGL 578

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSLG+IVNSKL+ FR E+ERKK+DLRKDL  CLKDDSLAAGIR
Sbjct: 579  NIVRVGNPARISSAVASKSLGDIVNSKLAAFRTEFERKKSDLRKDLSHCLKDDSLAAGIR 638

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRRL  FDLVV+DEA QAIEP
Sbjct: 639  QLLKQLGKTLKKKEKETVREVLSSAQVVLATNTGAADPLIRRLDAFDLVVMDEAGQAIEP 698

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLG+SLLERASTL EGVLAT L TQYR
Sbjct: 699  SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHEGVLATKLTTQYR 758

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AIA WASKEMY+GLL SS  V+SHLL+D+PFVK TWI
Sbjct: 759  MNDAIASWASKEMYSGLLKSSSTVASHLLVDTPFVKPTWI 798


>ref|XP_002870460.1| hypothetical protein ARALYDRAFT_493645 [Arabidopsis lyrata subsp.
            lyrata] gi|297316296|gb|EFH46719.1| hypothetical protein
            ARALYDRAFT_493645 [Arabidopsis lyrata subsp. lyrata]
          Length = 979

 Score =  952 bits (2461), Expect = 0.0
 Identities = 479/640 (74%), Positives = 532/640 (83%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFA+AE+QGEF E+ Q +G GLTFVIQ+QPY NAIPMPLG E +CLKACTHYPTLFDHFQ
Sbjct: 155  DFANAEVQGEFSELRQNVGSGLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQ 214

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQDL+RK+++++W+ET+SWKLLKE+ANSAQHR + RK  Q K V  G GM  EK
Sbjct: 215  RELRDVLQDLERKNIMENWKETESWKLLKEIANSAQHREVARKAAQAKPVQGGFGMSSEK 274

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VKAIQ+RIDEF  HMS LL++ERD ELE TQEELD +P PD +SDSSKPIE+LV  G   
Sbjct: 275  VKAIQARIDEFTSHMSQLLQVERDTELEVTQEELDVIPTPDESSDSSKPIEFLVRHGDAP 334

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL AVS+STGLGGMHLVLFKV G+HRLPPTTLSPGDMVC+R CDSRGAGAT+
Sbjct: 335  QELCDTICNLYAVSTSTGLGGMHLVLFKVGGNHRLPPTTLSPGDMVCIRVCDSRGAGATA 394

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C QGFV+NLGEDGCSI VALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNCEALML
Sbjct: 395  CTQGFVHNLGEDGCSIGVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALML 454

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSI++VATLFGDEED+TWLEQN  VDWS  EL       L+D SQ +AIA
Sbjct: 455  LQKNGLQKKNPSISVVATLFGDEEDITWLEQNDYVDWSEAELSDEPVSKLFDSSQRRAIA 514

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LG+NKKRPV+IVQGPP           I L+V+QGERVLVTAPTNAAVDN+VEKL ++GL
Sbjct: 515  LGVNKKRPVMIVQGPPGTGKTGMLKEVITLAVQQGERVLVTAPTNAAVDNMVEKLLHLGL 574

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSLGEIVNSKL++FR E ERKK+DLRKDLRQCL+DD LAAGIR
Sbjct: 575  NIVRVGNPARISSAVASKSLGEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIR 634

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  S+A VV  TN GAADPLIRRL TFDLVVIDEA Q+IEP
Sbjct: 635  QLLKQLGKTLKKKEKETVKEILSNAHVVFATNIGAADPLIRRLETFDLVVIDEAGQSIEP 694

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCIL+GD CQLAPVVLSR+ALEGGLG+SLLERA++L +GVLAT L TQYR
Sbjct: 695  SCWIPILQGKRCILSGDPCQLAPVVLSRKALEGGLGVSLLERAASLHDGVLATKLTTQYR 754

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN  IA WASKEMY G L S+P V+SHLLIDSPFVK TWI
Sbjct: 755  MNDVIAGWASKEMYGGWLKSAPSVASHLLIDSPFVKPTWI 794


>ref|XP_004514995.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cicer arietinum]
          Length = 962

 Score =  951 bits (2457), Expect = 0.0
 Identities = 479/642 (74%), Positives = 532/642 (82%), Gaps = 2/642 (0%)
 Frame = +2

Query: 2    DFASAEIQGE--FLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDH 175
            DFASAE+QG+  F E+ Q+MGPGLTFVIQ+QPY NA+PMPLGLE +CLKACTHYPTLFDH
Sbjct: 135  DFASAELQGDNDFFEMKQKMGPGLTFVIQAQPYLNAVPMPLGLEVMCLKACTHYPTLFDH 194

Query: 176  FQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDV 355
            FQRELRDVLQD++ K ++ DWRET SWKLLKELANSAQHRA+ RK+ QPK V   LGMD+
Sbjct: 195  FQRELRDVLQDMESKLLVQDWRETQSWKLLKELANSAQHRAVARKITQPKIVQGVLGMDI 254

Query: 356  EKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQ 535
            E+VK IQ RIDEF  +MS+LL IERD ELEFTQEELDAVP PD  SD SKPIE+LVS  Q
Sbjct: 255  ERVKVIQHRIDEFTNNMSELLNIERDVELEFTQEELDAVPKPDDTSDPSKPIEFLVSHSQ 314

Query: 536  DQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGA 715
             QQELCDTICNL A+S+STGLGGMHLVLFK+EG+HRLPPTTLSPG+MVCVRTCDS+GA  
Sbjct: 315  PQQELCDTICNLQAISTSTGLGGMHLVLFKIEGNHRLPPTTLSPGEMVCVRTCDSKGAVT 374

Query: 716  TSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEAL 895
            TSCMQG V+NLG+DG SI VALE RHGDPTFSKLFGK VRIDRI GLADTLTYERNCEAL
Sbjct: 375  TSCMQGVVDNLGDDGYSITVALELRHGDPTFSKLFGKNVRIDRIQGLADTLTYERNCEAL 434

Query: 896  MLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKA 1075
            MLLQKNGL+KKNPSI++VATLFGD ED+ WLE+N L D++  + +  L    YD +Q +A
Sbjct: 435  MLLQKNGLRKKNPSISVVATLFGDGEDIAWLEKNDLADFAEEKTNETLGSESYDKTQQRA 494

Query: 1076 IALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNI 1255
            IALGLNKKRP+L++QGPP           IA +V+QGERVLVTAPTNAAVDN+VEKLSN+
Sbjct: 495  IALGLNKKRPLLVIQGPPGTGKTGLLKQLIACAVEQGERVLVTAPTNAAVDNMVEKLSNV 554

Query: 1256 GLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAG 1435
            GLNIVRVGNPARIS  V SKSLGEIVN+KL++FR+EYERKK+DLRKDLR CLKDDSLAAG
Sbjct: 555  GLNIVRVGNPARISKTVGSKSLGEIVNAKLASFREEYERKKSDLRKDLRHCLKDDSLAAG 614

Query: 1436 IRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAI 1615
            IR                      SSAQVVL TNTGAADPLIRRL  FDLVVIDEA QAI
Sbjct: 615  IRQLLKQLARSLKKKEKQTINEVLSSAQVVLATNTGAADPLIRRLDAFDLVVIDEAGQAI 674

Query: 1616 EPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQ 1795
            EPSCWIPILQ KRCILAGDQCQLAPV+ SR+ALE GLGISLLERA+TL EGVL T L TQ
Sbjct: 675  EPSCWIPILQAKRCILAGDQCQLAPVIFSRKALESGLGISLLERAATLHEGVLTTRLTTQ 734

Query: 1796 YRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            YRMN AIA WASKEMY GLL SS  V SHLL+DSPFVK TWI
Sbjct: 735  YRMNDAIASWASKEMYGGLLKSSKSVFSHLLVDSPFVKPTWI 776


>ref|XP_007145941.1| hypothetical protein PHAVU_007G280900g [Phaseolus vulgaris]
            gi|561019131|gb|ESW17935.1| hypothetical protein
            PHAVU_007G280900g [Phaseolus vulgaris]
          Length = 813

 Score =  949 bits (2454), Expect = 0.0
 Identities = 478/640 (74%), Positives = 535/640 (83%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            D A+AEI GE +E+ + MGPGLTF++Q+QPY NA+PMP+GLE +CLKACTHYPTLFDHFQ
Sbjct: 119  DLAAAEILGE-MELWELMGPGLTFIMQAQPYLNAVPMPIGLEGVCLKACTHYPTLFDHFQ 177

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELR VLQDLQ  + I DWR+T SWKLLK+LANSAQHRA+VRK+ QPKSV   LGMD+EK
Sbjct: 178  RELRAVLQDLQNDNPIQDWRDTKSWKLLKQLANSAQHRAVVRKIAQPKSVQGVLGMDLEK 237

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VKAIQ RIDEF  HMS+LLR+ERDAELEFTQEELDAVP PD  SDSSKPI++LVS  Q +
Sbjct: 238  VKAIQHRIDEFTNHMSELLRVERDAELEFTQEELDAVPKPDDASDSSKPIDFLVSHSQPE 297

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL+A+S+STGLGGMHLVLFKVEG+HRLPPT LSPGDMVCVRT DSRGA  TS
Sbjct: 298  QELCDTICNLNAISTSTGLGGMHLVLFKVEGNHRLPPTALSPGDMVCVRTYDSRGAITTS 357

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C+QGFVN+ G+DG SI +ALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEALML
Sbjct: 358  CIQGFVNSFGDDGYSITIALESRHGDPTFSKLFGKNVRIDRIQGLADTLTYERNCEALML 417

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGL+KKNPSI++VATLFGD EDV WLE+NH  DW+  + D +L    +DD+Q +AIA
Sbjct: 418  LQKNGLRKKNPSISVVATLFGDGEDVAWLEKNHFADWAEEKSDAILGSESFDDTQRRAIA 477

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LGLNKKRPVL+VQGPP           IA +V+QGERVLVTAPTNAAVDN+VEKLSN+ L
Sbjct: 478  LGLNKKRPVLVVQGPPGTGKTGLLKHLIACAVQQGERVLVTAPTNAAVDNMVEKLSNVRL 537

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            N+VRVGNPARIS  V SKSL EIVN KL++FR+EYERKK+DLRKDLR CL+DDSLAAGIR
Sbjct: 538  NVVRVGNPARISKTVGSKSLEEIVNGKLASFREEYERKKSDLRKDLRHCLRDDSLAAGIR 597

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  SSAQVVL TNTGAADPLIRRL TFDLVVIDEA QAIEP
Sbjct: 598  QLLKQLGRSLKKKEKQTVNEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAGQAIEP 657

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGL ISLLERA+TL EG+L T L TQYR
Sbjct: 658  SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLRISLLERAATLHEGILTTRLTTQYR 717

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN AI+ WASKEMY GLL SS  V SHLL+DSPFVK +WI
Sbjct: 718  MNDAISSWASKEMYGGLLKSSETVFSHLLVDSPFVKPSWI 757


>ref|NP_198446.3| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein [Arabidopsis thaliana]
            gi|332006651|gb|AED94034.1| P-loop containing nucleoside
            triphosphate hydrolases superfamily protein [Arabidopsis
            thaliana]
          Length = 961

 Score =  949 bits (2452), Expect = 0.0
 Identities = 480/640 (75%), Positives = 531/640 (82%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFA+AE+QGEF E+ Q +G GLTFVIQ+QPY NAIPMPLG E +CLKACTHYPTLFDHFQ
Sbjct: 137  DFATAEVQGEFSELRQNVGSGLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQ 196

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQDL+RK++++ W+E++SWKLLKE+ANSAQHR + RK  Q K V   LGMD EK
Sbjct: 197  RELRDVLQDLERKNIMESWKESESWKLLKEIANSAQHREVARKAAQAKPVQGVLGMDSEK 256

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VKAIQ RIDEF   MS LL++ERD ELE TQEELD VP PD +SDSSKPIE+LV  G   
Sbjct: 257  VKAIQERIDEFTSQMSQLLQVERDTELEVTQEELDVVPTPDESSDSSKPIEFLVRHGDAP 316

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL AVS+STGLGGMHLVLFKV G+HRLPPTTLSPGDMVC+R CDSRGAGAT+
Sbjct: 317  QELCDTICNLYAVSTSTGLGGMHLVLFKVGGNHRLPPTTLSPGDMVCIRVCDSRGAGATA 376

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C QGFV+NLGEDGCSI VALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNCEALML
Sbjct: 377  CTQGFVHNLGEDGCSIGVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALML 436

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSI++VATLFGD ED+TWLEQN  VDWS  EL       L+D SQ +AIA
Sbjct: 437  LQKNGLQKKNPSISVVATLFGDGEDITWLEQNDYVDWSEAELSDEPVSKLFDSSQRRAIA 496

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LG+NKKRPV+IVQGPP           I L+V+QGERVLVTAPTNAAVDN+VEKL ++GL
Sbjct: 497  LGVNKKRPVMIVQGPPGTGKTGMLKEVITLAVQQGERVLVTAPTNAAVDNMVEKLLHLGL 556

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSLGEIVNSKL++FR E ERKK+DLRKDLRQCL+DD LAAGIR
Sbjct: 557  NIVRVGNPARISSAVASKSLGEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIR 616

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  S+AQVV  TN GAADPLIRRL TFDLVVIDEA Q+IEP
Sbjct: 617  QLLKQLGKTLKKKEKETVKEILSNAQVVFATNIGAADPLIRRLETFDLVVIDEAGQSIEP 676

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCIL+GD CQLAPVVLSR+ALEGGLG+SLLERA++L +GVLAT L TQYR
Sbjct: 677  SCWIPILQGKRCILSGDPCQLAPVVLSRKALEGGLGVSLLERAASLHDGVLATKLTTQYR 736

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN  IA WASKEMY G L S+P V+SHLLIDSPFVKATWI
Sbjct: 737  MNDVIAGWASKEMYGGWLKSAPSVASHLLIDSPFVKATWI 776


>ref|XP_006588516.1| PREDICTED: DNA-binding protein SMUBP-2-like [Glycine max]
          Length = 949

 Score =  947 bits (2448), Expect = 0.0
 Identities = 482/644 (74%), Positives = 538/644 (83%), Gaps = 4/644 (0%)
 Frame = +2

Query: 2    DFASAEIQG---EFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFD 172
            D A+AE++G   EF E+ + MGPGLTF++ +QPY NA+PMP+GLE LCLKACTHYPTLFD
Sbjct: 122  DLAAAELEGGEGEF-ELWELMGPGLTFIMLAQPYLNAVPMPIGLEGLCLKACTHYPTLFD 180

Query: 173  HFQRELRDVLQDLQRK-SVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGM 349
            HFQRELR VL+DLQ+  S I DWR+T SWKLLK+LANSAQHRA+VRK+ QPKSV   LGM
Sbjct: 181  HFQRELRQVLRDLQQSNSFIQDWRDTKSWKLLKDLANSAQHRAVVRKITQPKSVQGVLGM 240

Query: 350  DVEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSR 529
            D EKVKA+Q RIDEF  HMS+LLRIERDAELEFTQEELDAVP PD  SDSSK I++LVS 
Sbjct: 241  DFEKVKALQHRIDEFTTHMSELLRIERDAELEFTQEELDAVPKPDDTSDSSKTIDFLVSH 300

Query: 530  GQDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGA 709
             Q QQELCDTICNL+A+S+STGLGGMHLVLFKVEG+HRLPPTTLSPGDMVCVRT DS GA
Sbjct: 301  SQPQQELCDTICNLNAISTSTGLGGMHLVLFKVEGNHRLPPTTLSPGDMVCVRTYDSMGA 360

Query: 710  GATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCE 889
              TSC+QGFVN+ G+DG SI VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCE
Sbjct: 361  ITTSCIQGFVNSFGDDGYSITVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCE 420

Query: 890  ALMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQL 1069
            ALMLLQKNGL+KKNPSI++VATLFGD EDV WLE+NHL DW+  +LDG L    +DDSQ 
Sbjct: 421  ALMLLQKNGLRKKNPSISVVATLFGDGEDVAWLEKNHLADWAEEKLDGRLGNETFDDSQW 480

Query: 1070 KAIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLS 1249
            +AIA+GLNKKRPVL++QGPP           IA +V+QGERVLVTAPTNAAVDN+VEKLS
Sbjct: 481  RAIAMGLNKKRPVLVIQGPPGTGKTGLLKQLIACAVQQGERVLVTAPTNAAVDNMVEKLS 540

Query: 1250 NIGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLA 1429
            N+GLNIVRVGNPARIS  V SKSL EIVN+KL++FR+EYERKK+DLRKDLR CL+DDSLA
Sbjct: 541  NVGLNIVRVGNPARISKTVGSKSLEEIVNAKLASFREEYERKKSDLRKDLRHCLRDDSLA 600

Query: 1430 AGIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQ 1609
            +GIR                      SSAQVV+ TNTGAADPL+RRL TFDLVVIDEA Q
Sbjct: 601  SGIRQLLKQLGRSLKKKEKQTVIEVLSSAQVVVATNTGAADPLVRRLDTFDLVVIDEAGQ 660

Query: 1610 AIEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLM 1789
            AIEPSCWIPILQGKRCILAGDQCQLAPV+LSR+ALE GLGISLLERA+TL EG+L T L 
Sbjct: 661  AIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEVGLGISLLERAATLHEGILTTRLT 720

Query: 1790 TQYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            TQYRMN AIA WASKEMY GLL SS  V SHLL+DSPFVK TWI
Sbjct: 721  TQYRMNDAIASWASKEMYGGLLKSSETVFSHLLVDSPFVKPTWI 764


>ref|XP_006283073.1| hypothetical protein CARUB_v10004066mg [Capsella rubella]
            gi|482551778|gb|EOA15971.1| hypothetical protein
            CARUB_v10004066mg [Capsella rubella]
          Length = 984

 Score =  946 bits (2446), Expect = 0.0
 Identities = 479/640 (74%), Positives = 529/640 (82%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            DFA+AE+QGEFLE+ Q +G GLTFVIQ+QPY NAIPMPLG E +CLKACTHYPTLFDHFQ
Sbjct: 160  DFATAEVQGEFLELRQTVGSGLTFVIQAQPYLNAIPMPLGSEVVCLKACTHYPTLFDHFQ 219

Query: 182  RELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMDVEK 361
            RELRDVLQDL+RK+V+++W+ET+SWKLLKE+ANSAQHR + RK  QPK V    G+D EK
Sbjct: 220  RELRDVLQDLERKNVMENWKETESWKLLKEIANSAQHREVARKAAQPKPVQGVFGLDSEK 279

Query: 362  VKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRGQDQ 541
            VKAIQ RIDEF   MS LL++ERD ELE TQEELD +P PD  SDSSKPIE+LV  G   
Sbjct: 280  VKAIQGRIDEFTSQMSQLLQVERDTELEVTQEELDVIPTPDERSDSSKPIEFLVRHGDAP 339

Query: 542  QELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAGATS 721
            QELCDTICNL AVS+STGLGGMHLVLFKV G+HRLPPTTLSPGDMVC+R CDSRGAGAT+
Sbjct: 340  QELCDTICNLYAVSTSTGLGGMHLVLFKVGGNHRLPPTTLSPGDMVCIRICDSRGAGATA 399

Query: 722  CMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEALML 901
            C QGFV+NLGEDGCSI VALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNCEALML
Sbjct: 400  CTQGFVHNLGEDGCSIGVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALML 459

Query: 902  LQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLKAIA 1081
            LQKNGLQKKNPSI++VATLFGD ED+ WLEQ   VDWS  EL       L+DDSQ +AIA
Sbjct: 460  LQKNGLQKKNPSISVVATLFGDGEDIEWLEQKDYVDWSEAELSDEPVGKLFDDSQRRAIA 519

Query: 1082 LGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSNIGL 1261
            LG+NKKRPV+IVQGPP           I L+V+QGERVLVTAPTNAAVDN+VEKL ++GL
Sbjct: 520  LGVNKKRPVMIVQGPPGTGKTGMLKEVITLAVQQGERVLVTAPTNAAVDNMVEKLLHLGL 579

Query: 1262 NIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAAGIR 1441
            NIVRVGNPARISSAVASKSLGEIVNSKL++FR E ERKK+DLRKDLRQCL+DD LAAGIR
Sbjct: 580  NIVRVGNPARISSAVASKSLGEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIR 639

Query: 1442 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQAIEP 1621
                                  ++AQVV  TN GAADPLIRRL TFDLVVIDEA QAIEP
Sbjct: 640  QLLKQLGKTLKKKEKETVKEILANAQVVFATNIGAADPLIRRLETFDLVVIDEAGQAIEP 699

Query: 1622 SCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMTQYR 1801
            SCWIPILQGKRCIL+GD CQLAPVVLSR+ALEGGLG+SLLERA++L  GVLAT L TQYR
Sbjct: 700  SCWIPILQGKRCILSGDPCQLAPVVLSRKALEGGLGVSLLERAASLHNGVLATKLTTQYR 759

Query: 1802 MNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            MN  IA WASKEMY G L S+P V+SHLLIDSPFVK TWI
Sbjct: 760  MNDVIAGWASKEMYGGWLKSAPSVASHLLIDSPFVKPTWI 799


>ref|XP_006574496.1| PREDICTED: DNA-binding protein SMUBP-2-like isoform X3 [Glycine max]
          Length = 840

 Score =  940 bits (2430), Expect = 0.0
 Identities = 480/643 (74%), Positives = 531/643 (82%), Gaps = 3/643 (0%)
 Frame = +2

Query: 2    DFASAEIQG---EFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFD 172
            D A+AE++G   EF E+ +RMGPGLTF++ +QPY NA+PMP+GLE LCLK CTHYPTLFD
Sbjct: 106  DLAAAELEGGEGEF-ELWERMGPGLTFIMLAQPYLNAVPMPIGLEGLCLKVCTHYPTLFD 164

Query: 173  HFQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMD 352
            HFQRELR VL+D    S I DWR+T SWKLLK+LANSAQHRA+VRK+ QPKSV   LGMD
Sbjct: 165  HFQRELRQVLRD----SFIQDWRDTKSWKLLKDLANSAQHRAVVRKITQPKSVQGVLGMD 220

Query: 353  VEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRG 532
             EKVK IQ RIDEF  HMS+LLRIERDAELEFTQEELDAVP PD  SDSSKPI++LVS  
Sbjct: 221  FEKVKTIQHRIDEFTSHMSELLRIERDAELEFTQEELDAVPKPDDTSDSSKPIDFLVSHS 280

Query: 533  QDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAG 712
            Q QQELCDTICNL+A+S+S GLGGMHLVLFKVEG+HRLPPT LSPGDMVCVRT DS GA 
Sbjct: 281  QPQQELCDTICNLNAISTSRGLGGMHLVLFKVEGNHRLPPTALSPGDMVCVRTYDSTGAI 340

Query: 713  ATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEA 892
             TSC+QGFVN+ G+DG SI VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEA
Sbjct: 341  TTSCIQGFVNSFGDDGYSITVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEA 400

Query: 893  LMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLK 1072
            LMLLQKNGL+KKNPSI++VATLFGD EDV WLE+N LVDW+   LD  L    +DDSQ +
Sbjct: 401  LMLLQKNGLRKKNPSISVVATLFGDGEDVAWLEKNQLVDWAEENLDARLGNETFDDSQQR 460

Query: 1073 AIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSN 1252
            AIA+GLNKKRPVL++QGPP           I  +V+QGERVLVTAPTNAAVDN+VEKLSN
Sbjct: 461  AIAMGLNKKRPVLVIQGPPGTGKTGLLKQLIVCAVQQGERVLVTAPTNAAVDNMVEKLSN 520

Query: 1253 IGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAA 1432
            +GLNIVRVGNPARIS  V SKSL EIVN+KL++FR+EYERKK+DLRKDLR CLKDDSLA+
Sbjct: 521  VGLNIVRVGNPARISKTVGSKSLEEIVNAKLASFREEYERKKSDLRKDLRHCLKDDSLAS 580

Query: 1433 GIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQA 1612
            GIR                      SSAQVVL TNTGAADPLIRRL TFDLVVIDEA QA
Sbjct: 581  GIRQLLKQLGRSLKKKEKQTVVEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAGQA 640

Query: 1613 IEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMT 1792
            IEPSCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLGISLLERA+TL EG+L T L T
Sbjct: 641  IEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGILTTRLTT 700

Query: 1793 QYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            QYRMN AIA WASKEMY GLL SS  V SHLL++SPFVK TWI
Sbjct: 701  QYRMNDAIASWASKEMYGGLLKSSETVFSHLLVNSPFVKPTWI 743


>ref|XP_006574495.1| PREDICTED: DNA-binding protein SMUBP-2-like isoform X2 [Glycine max]
          Length = 851

 Score =  940 bits (2430), Expect = 0.0
 Identities = 480/643 (74%), Positives = 531/643 (82%), Gaps = 3/643 (0%)
 Frame = +2

Query: 2    DFASAEIQG---EFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFD 172
            D A+AE++G   EF E+ +RMGPGLTF++ +QPY NA+PMP+GLE LCLK CTHYPTLFD
Sbjct: 106  DLAAAELEGGEGEF-ELWERMGPGLTFIMLAQPYLNAVPMPIGLEGLCLKVCTHYPTLFD 164

Query: 173  HFQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMD 352
            HFQRELR VL+D    S I DWR+T SWKLLK+LANSAQHRA+VRK+ QPKSV   LGMD
Sbjct: 165  HFQRELRQVLRD----SFIQDWRDTKSWKLLKDLANSAQHRAVVRKITQPKSVQGVLGMD 220

Query: 353  VEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRG 532
             EKVK IQ RIDEF  HMS+LLRIERDAELEFTQEELDAVP PD  SDSSKPI++LVS  
Sbjct: 221  FEKVKTIQHRIDEFTSHMSELLRIERDAELEFTQEELDAVPKPDDTSDSSKPIDFLVSHS 280

Query: 533  QDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAG 712
            Q QQELCDTICNL+A+S+S GLGGMHLVLFKVEG+HRLPPT LSPGDMVCVRT DS GA 
Sbjct: 281  QPQQELCDTICNLNAISTSRGLGGMHLVLFKVEGNHRLPPTALSPGDMVCVRTYDSTGAI 340

Query: 713  ATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEA 892
             TSC+QGFVN+ G+DG SI VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEA
Sbjct: 341  TTSCIQGFVNSFGDDGYSITVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEA 400

Query: 893  LMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLK 1072
            LMLLQKNGL+KKNPSI++VATLFGD EDV WLE+N LVDW+   LD  L    +DDSQ +
Sbjct: 401  LMLLQKNGLRKKNPSISVVATLFGDGEDVAWLEKNQLVDWAEENLDARLGNETFDDSQQR 460

Query: 1073 AIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSN 1252
            AIA+GLNKKRPVL++QGPP           I  +V+QGERVLVTAPTNAAVDN+VEKLSN
Sbjct: 461  AIAMGLNKKRPVLVIQGPPGTGKTGLLKQLIVCAVQQGERVLVTAPTNAAVDNMVEKLSN 520

Query: 1253 IGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAA 1432
            +GLNIVRVGNPARIS  V SKSL EIVN+KL++FR+EYERKK+DLRKDLR CLKDDSLA+
Sbjct: 521  VGLNIVRVGNPARISKTVGSKSLEEIVNAKLASFREEYERKKSDLRKDLRHCLKDDSLAS 580

Query: 1433 GIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQA 1612
            GIR                      SSAQVVL TNTGAADPLIRRL TFDLVVIDEA QA
Sbjct: 581  GIRQLLKQLGRSLKKKEKQTVVEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAGQA 640

Query: 1613 IEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMT 1792
            IEPSCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLGISLLERA+TL EG+L T L T
Sbjct: 641  IEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGILTTRLTT 700

Query: 1793 QYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            QYRMN AIA WASKEMY GLL SS  V SHLL++SPFVK TWI
Sbjct: 701  QYRMNDAIASWASKEMYGGLLKSSETVFSHLLVNSPFVKPTWI 743


>ref|XP_006574494.1| PREDICTED: DNA-binding protein SMUBP-2-like isoform X1 [Glycine max]
          Length = 928

 Score =  940 bits (2430), Expect = 0.0
 Identities = 480/643 (74%), Positives = 531/643 (82%), Gaps = 3/643 (0%)
 Frame = +2

Query: 2    DFASAEIQG---EFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFD 172
            D A+AE++G   EF E+ +RMGPGLTF++ +QPY NA+PMP+GLE LCLK CTHYPTLFD
Sbjct: 106  DLAAAELEGGEGEF-ELWERMGPGLTFIMLAQPYLNAVPMPIGLEGLCLKVCTHYPTLFD 164

Query: 173  HFQRELRDVLQDLQRKSVIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRGLGMD 352
            HFQRELR VL+D    S I DWR+T SWKLLK+LANSAQHRA+VRK+ QPKSV   LGMD
Sbjct: 165  HFQRELRQVLRD----SFIQDWRDTKSWKLLKDLANSAQHRAVVRKITQPKSVQGVLGMD 220

Query: 353  VEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNSDSSKPIEYLVSRG 532
             EKVK IQ RIDEF  HMS+LLRIERDAELEFTQEELDAVP PD  SDSSKPI++LVS  
Sbjct: 221  FEKVKTIQHRIDEFTSHMSELLRIERDAELEFTQEELDAVPKPDDTSDSSKPIDFLVSHS 280

Query: 533  QDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGAG 712
            Q QQELCDTICNL+A+S+S GLGGMHLVLFKVEG+HRLPPT LSPGDMVCVRT DS GA 
Sbjct: 281  QPQQELCDTICNLNAISTSRGLGGMHLVLFKVEGNHRLPPTALSPGDMVCVRTYDSTGAI 340

Query: 713  ATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCEA 892
             TSC+QGFVN+ G+DG SI VALESRHGDPTFSKLFGK VRIDRI GLADTLTYERNCEA
Sbjct: 341  TTSCIQGFVNSFGDDGYSITVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEA 400

Query: 893  LMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSVTELDGVLERGLYDDSQLK 1072
            LMLLQKNGL+KKNPSI++VATLFGD EDV WLE+N LVDW+   LD  L    +DDSQ +
Sbjct: 401  LMLLQKNGLRKKNPSISVVATLFGDGEDVAWLEKNQLVDWAEENLDARLGNETFDDSQQR 460

Query: 1073 AIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKLSN 1252
            AIA+GLNKKRPVL++QGPP           I  +V+QGERVLVTAPTNAAVDN+VEKLSN
Sbjct: 461  AIAMGLNKKRPVLVIQGPPGTGKTGLLKQLIVCAVQQGERVLVTAPTNAAVDNMVEKLSN 520

Query: 1253 IGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSLAA 1432
            +GLNIVRVGNPARIS  V SKSL EIVN+KL++FR+EYERKK+DLRKDLR CLKDDSLA+
Sbjct: 521  VGLNIVRVGNPARISKTVGSKSLEEIVNAKLASFREEYERKKSDLRKDLRHCLKDDSLAS 580

Query: 1433 GIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEASQA 1612
            GIR                      SSAQVVL TNTGAADPLIRRL TFDLVVIDEA QA
Sbjct: 581  GIRQLLKQLGRSLKKKEKQTVVEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAGQA 640

Query: 1613 IEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATMLMT 1792
            IEPSCWIPILQGKRCILAGDQCQLAPV+LSR+ALEGGLGISLLERA+TL EG+L T L T
Sbjct: 641  IEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGILTTRLTT 700

Query: 1793 QYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
            QYRMN AIA WASKEMY GLL SS  V SHLL++SPFVK TWI
Sbjct: 701  QYRMNDAIASWASKEMYGGLLKSSETVFSHLLVNSPFVKPTWI 743


>ref|XP_006878575.1| hypothetical protein AMTR_s00011p00245550 [Amborella trichopoda]
            gi|548831918|gb|ERM94720.1| hypothetical protein
            AMTR_s00011p00245550 [Amborella trichopoda]
          Length = 922

 Score =  927 bits (2397), Expect = 0.0
 Identities = 473/645 (73%), Positives = 532/645 (82%), Gaps = 5/645 (0%)
 Frame = +2

Query: 2    DFASAEIQGEFLEISQRMGPGLTFVIQSQPYFNAIPMPLGLESLCLKACTHYPTLFDHFQ 181
            D   AEI GEF EI Q MG GLTFV Q+QPY +A+PMP G+ESLCLKA THYPTL DHFQ
Sbjct: 93   DLVCAEINGEFSEIQQSMGRGLTFVTQAQPYLSAVPMPKGMESLCLKASTHYPTLLDHFQ 152

Query: 182  RELRDVLQDLQRKS--VIDDWRETDSWKLLKELANSAQHRAIVRKVPQPKSVLRG-LGMD 352
            REL++VLQ+ Q +   V+DDWR+T+SWKLLKE +N AQHR IVRKV   K  L G LGM+
Sbjct: 153  RELKEVLQEFQGRKLLVVDDWRQTESWKLLKEFSNCAQHRVIVRKVSPVKRALHGALGME 212

Query: 353  VEKVKAIQSRIDEFAKHMSDLLRIERDAELEFTQEELDAVPMPDVNS-DSSKPIEYLVSR 529
            +EKV+A+QS ID+FA+HMS LLRIERD+ELE TQEEL+AVPMPD NS DS KPIEYLVS 
Sbjct: 213  LEKVQAMQSHIDDFARHMSGLLRIERDSELEATQEELNAVPMPDENSGDSLKPIEYLVSH 272

Query: 530  GQDQQELCDTICNLSAVSSSTGLGGMHLVLFKVEGSHRLPPTTLSPGDMVCVRTCDSRGA 709
            GQ QQE CDTICNL AVS STGLGGMHLVLF+VEG+HRLPP +LSPGDMVCVR CDSRGA
Sbjct: 273  GQAQQEQCDTICNLYAVSCSTGLGGMHLVLFRVEGNHRLPPISLSPGDMVCVRACDSRGA 332

Query: 710  GATSCMQGFVNNLGEDGCSIVVALESRHGDPTFSKLFGKGVRIDRIPGLADTLTYERNCE 889
            GATSCMQGFV+NLGEDGCSI VALESRHGDPTFSKLFGK VRIDRI GLAD LTYERNCE
Sbjct: 333  GATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKNVRIDRIHGLADALTYERNCE 392

Query: 890  ALMLLQKNGLQKKNPSIAIVATLFGDEEDVTWLEQNHLVDWSV-TELDGVLERGLYDDSQ 1066
            ALMLLQKNGL K+NPSIA+VATLFG  ED++W+EQNHLV+W+    +  +L RG +D SQ
Sbjct: 393  ALMLLQKNGLHKRNPSIAVVATLFGTNEDISWMEQNHLVEWNEDPTISELLPRGPFDKSQ 452

Query: 1067 LKAIALGLNKKRPVLIVQGPPXXXXXXXXXXXIALSVKQGERVLVTAPTNAAVDNVVEKL 1246
            L+AIA+GLNKKRP+L++QGPP           I L+V++GERVLVTAPTNAAVDN+VE+L
Sbjct: 453  LRAIAVGLNKKRPLLVIQGPPGTGKSGLLKELITLAVERGERVLVTAPTNAAVDNMVERL 512

Query: 1247 SNIGLNIVRVGNPARISSAVASKSLGEIVNSKLSTFRKEYERKKADLRKDLRQCLKDDSL 1426
            +N+GLNIVRVGNP RIS +VASKSL  IVN KL+TFRKE ERK+ADLRKDLR CLKDDSL
Sbjct: 513  TNVGLNIVRVGNPVRISPSVASKSLASIVNDKLATFRKEQERKRADLRKDLRHCLKDDSL 572

Query: 1427 AAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLCTNTGAADPLIRRLGTFDLVVIDEAS 1606
            AAGIR                      SSAQVVL TNTGAADP+IRRL  FDLVVIDEA 
Sbjct: 573  AAGIRQLLKQLGKALKKKEKETVKEVLSSAQVVLSTNTGAADPIIRRLDCFDLVVIDEAG 632

Query: 1607 QAIEPSCWIPILQGKRCILAGDQCQLAPVVLSRRALEGGLGISLLERASTLQEGVLATML 1786
            QAIEPSCWIPILQGKR ILAGDQCQLAPV+LSR+ALEGGLG+SL+ERAS L EG+LAT L
Sbjct: 633  QAIEPSCWIPILQGKRTILAGDQCQLAPVILSRKALEGGLGVSLMERASKLHEGILATRL 692

Query: 1787 MTQYRMNHAIACWASKEMYNGLLLSSPRVSSHLLIDSPFVKATWI 1921
              QYRMN  IA WASKEMY+GLL SSP V+SHLL+DSPF+KATWI
Sbjct: 693  TIQYRMNDKIASWASKEMYDGLLNSSPTVASHLLVDSPFIKATWI 737

