BLASTX nr result

ID: Rauwolfia21_contig00002153 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00002153
         (2124 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006342186.1| PREDICTED: AT-rich interactive domain-contai...   498   e-138
ref|XP_004238465.1| PREDICTED: AT-rich interactive domain-contai...   471   e-130
ref|XP_002510735.1| transcription factor, putative [Ricinus comm...   401   e-109
ref|XP_006435461.1| hypothetical protein CICLE_v10001099mg [Citr...   393   e-106
ref|XP_006435458.1| hypothetical protein CICLE_v10001099mg [Citr...   393   e-106
ref|XP_006473863.1| PREDICTED: AT-rich interactive domain-contai...   392   e-106
gb|EOY15870.1| Transcription factor, putative isoform 1 [Theobro...   389   e-105
gb|EMJ10413.1| hypothetical protein PRUPE_ppa006712mg [Prunus pe...   389   e-105
ref|XP_002528526.1| transcription factor, putative [Ricinus comm...   387   e-105
ref|XP_002300618.2| hypothetical protein POPTR_0002s00550g [Popu...   385   e-104
gb|AEO22030.1| ARID and Hsp20 domains containing protein [Lotus ...   385   e-104
gb|EOY15871.1| ARID/BRIGHT DNA-binding domain-containing protein...   382   e-103
gb|ESW07981.1| hypothetical protein PHAVU_009G008800g [Phaseolus...   381   e-103
ref|XP_004500824.1| PREDICTED: AT-rich interactive domain-contai...   381   e-103
gb|EOY15872.1| ARID/BRIGHT DNA-binding domain-containing protein...   381   e-103
ref|XP_006577925.1| PREDICTED: AT-rich interactive domain-contai...   380   e-102
ref|XP_006342525.1| PREDICTED: AT-rich interactive domain-contai...   380   e-102
ref|XP_004299960.1| PREDICTED: AT-rich interactive domain-contai...   379   e-102
ref|XP_003527249.1| PREDICTED: AT-rich interactive domain-contai...   379   e-102
gb|EOY15876.1| ARID/BRIGHT DNA-binding domain-containing protein...   374   e-101

>ref|XP_006342186.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            [Solanum tuberosum]
          Length = 604

 Score =  498 bits (1282), Expect = e-138
 Identities = 283/604 (46%), Positives = 373/604 (61%), Gaps = 22/604 (3%)
 Frame = +2

Query: 377  MDKSDVEMEDAEKESQGGPDNVKESRDGTVDAKMEPEDQKQHSGANADCQKSTEVEAKNE 556
            MDK+DVEMEDAE +     ++V ES D ++  +      +Q + +  +CQ++ +   +++
Sbjct: 1    MDKTDVEMEDAENKVHDVQNSVVESIDSSLKPEGPAVASQQSTESLLECQENAKTGTEDQ 60

Query: 557  NDVETVCEE----AKKSNAETE---VQEQTNAKSEVLELSNREGMKDDQTNATMEAEVQS 715
            N+     ++    A+  N E     V+ + N   + +  ++ + + +DQ NA  E     
Sbjct: 61   NNHPESGDKSPIPAENGNGEQNSNVVEGEGNVNDDTMNQNDSKAVVEDQRNAASETATIG 120

Query: 716  MASSEAEDQKEDQSNVTVKNEMKDATDPQNAPDEQNPSESGKTSINPVESKTEEKKAGHT 895
               ++     +DQ+N T   E +   + +   D        K           E+  G+ 
Sbjct: 121  QPETQGNTAVDDQNNATGGTEGETIIEEKKEGDPDADVALKK-----------EEPIGNV 169

Query: 896  GEEEKSAVLNGQLPFQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTIDE 1075
               +   V N Q+  +LPA+ +    KE V E +Q +        +   H       +DE
Sbjct: 170  AHHDTRGV-NMQVHEELPANEVGDSGKE-VKEAHQAENS------IKDVHQNGIDMMVDE 221

Query: 1076 PKEPENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYST-RS 1252
             K+ E+K+   A+ ++D +    E +    TK + ++T++    + ATP+L VK  T  +
Sbjct: 222  QKKVEHKTNT-ASVISDRDDRSNELSE---TKDDVKNTAIAKMPEPATPNLSVKCDTANT 277

Query: 1253 GGHSGETLKNAFDDAKMPE--------NDDGTPEEQAAFMKELESFYRERAVEFKPPKFY 1408
            G H+GE     FD++KM +        NDDG+PE+QAAFM++LE FYRERA+EFKPPKFY
Sbjct: 278  GQHTGEASNKIFDESKMADDEDEDEDWNDDGSPEDQAAFMRDLEIFYRERAIEFKPPKFY 337

Query: 1409 GQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYE 1588
            G PLNCLKLWR+VIRLGGYDRVTGSK+WRQVGESF+PPKTCTTVSWTFRIFYEK+LLEYE
Sbjct: 338  GIPLNCLKLWRSVIRLGGYDRVTGSKLWRQVGESFNPPKTCTTVSWTFRIFYEKALLEYE 397

Query: 1589 RYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE- 1765
            R++ Q+G+LQLP+ ALPE + VDNEGNG Q               MQGWH QRL G GE 
Sbjct: 398  RHKTQSGKLQLPIAALPE-AGVDNEGNGNQTPGSGRARRDAAARAMQGWHEQRLLGCGEV 456

Query: 1766 -----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPA 1930
                 +DKN NN PKRE   KSIGS+K KR +E+E P K  RTETSKQLV  VVD+G PA
Sbjct: 457  GEPIVKDKNCNNTPKREKNFKSIGSIKHKRPNEVEHPSKVARTETSKQLVTTVVDLGPPA 516

Query: 1931 DWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVV 2110
            DWVKINVRET+D FE+YALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGIT FKKVV
Sbjct: 517  DWVKINVRETRDCFEIYALVPGLLREEVRVQSDPAGRLVITGQPEQLDNPWGITAFKKVV 576

Query: 2111 SLPA 2122
            SLPA
Sbjct: 577  SLPA 580


>ref|XP_004238465.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            [Solanum lycopersicum]
          Length = 568

 Score =  471 bits (1212), Expect = e-130
 Identities = 283/600 (47%), Positives = 352/600 (58%), Gaps = 18/600 (3%)
 Frame = +2

Query: 377  MDKSDVEMEDAEKESQGGPDNVKESRDGTVDAKMEPEDQKQHS---GANADCQKSTEVEA 547
            MDK+DVEM+D +       + V ES D +      P    QHS    A    Q   E  A
Sbjct: 1    MDKTDVEMDDVQ-------NTVVESIDSS------PALPGQHSTSENAKIGTQNHPETGA 47

Query: 548  KNENDVETVCEEAKKSNAETEVQEQTNAKSEVLELSNREGMKDDQTNATMEAEVQSMASS 727
            K     E         N   +  +QT+ K+ V          +D  NA  E        +
Sbjct: 48   KTPIPAE---------NGNGDTMKQTDPKAAV----------EDHKNAATETATIGQPET 88

Query: 728  EAEDQKEDQSNVTVKNEMKDATDPQNAPDEQNPSESGKTSINPVESKTEEKKAGHTGEEE 907
            +     +DQ+N T   E +     +   D        K           E+  G+    +
Sbjct: 89   QGNTALDDQNNATGGTEGETTIQDKKEGDPDADVALKK-----------EEPIGNVARHD 137

Query: 908  KSAVLNGQLPFQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTIDEPKEP 1087
             + V N Q+  +LP + +    KE V E +Q +        +   H       +DE K+ 
Sbjct: 138  TTGV-NMQVLEELPTNEVGDSGKE-VKEAHQAENS------IKDVHQNGNDMMVDEQKKV 189

Query: 1088 ENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYST-RSGGHS 1264
            E+K+   A+ ++D +    E +    TK   ++T++    + ATP+L VK  T  +G H+
Sbjct: 190  EHKANT-ASVISDRDDRSNELSE---TKDAVKNTAIAKMPEPATPNLSVKCDTANTGQHT 245

Query: 1265 GETLKNAFDDAKMPE--------NDDGTPEEQAAFMKELESFYRERAVEFKPPKFYGQPL 1420
            GE     FD++KM +        NDDG+PE+Q AFM+ELE+FYRERA+EFKPPKFYG PL
Sbjct: 246  GEASNKIFDESKMADDEDEDEDWNDDGSPEDQTAFMRELENFYRERAMEFKPPKFYGIPL 305

Query: 1421 NCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQR 1600
            NCLKLWR+VIRLGGYDRVTGSK+WRQVGESF+PPKTCTTVSWTFRIFYEK+LLEYER++ 
Sbjct: 306  NCLKLWRSVIRLGGYDRVTGSKLWRQVGESFNPPKTCTTVSWTFRIFYEKALLEYERHKM 365

Query: 1601 QNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE----- 1765
            Q+G+LQLP+ ALPE + VDNEGNG Q               MQGWH QRL G GE     
Sbjct: 366  QSGKLQLPIAALPE-AGVDNEGNGNQTPGSGRARRDAAARAMQGWHEQRLLGCGEVGEPI 424

Query: 1766 -EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVK 1942
             +DKN  + PKRE   KSIGS+K KR +E+E P K  RTETSKQLV  VVD+G PADWVK
Sbjct: 425  VKDKNAKHTPKREKNFKSIGSIKHKRPNEMEHPSKVARTETSKQLVTTVVDLGPPADWVK 484

Query: 1943 INVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            INVRETKD FE+YALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGIT FKKVVSLPA
Sbjct: 485  INVRETKDCFEIYALVPGLLREEVRVQSDPAGRLVITGQPEQLDNPWGITAFKKVVSLPA 544


>ref|XP_002510735.1| transcription factor, putative [Ricinus communis]
            gi|223551436|gb|EEF52922.1| transcription factor,
            putative [Ricinus communis]
          Length = 449

 Score =  401 bits (1031), Expect = e-109
 Identities = 222/422 (52%), Positives = 276/422 (65%), Gaps = 28/422 (6%)
 Frame = +2

Query: 941  QLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTIDEPKEPENKSILPAAAL 1120
            ++P+  +P    ++   D     E  K+   ++ +  + G T     E +    LP+   
Sbjct: 5    EMPSQDLPAGTTDVSLVDGVVSTEQPKE---NERNPSENGNTPAASSEGDKTDTLPSDVY 61

Query: 1121 TDENINLIE-KTASKATK------------HEPRDTSVGN-------KRQSATPDLPVKY 1240
              EN  L E KT + AT              E +D   G        +  + TP  P KY
Sbjct: 62   MSENQVLPESKTTTTATTGTNVNDANSIKVRESKDDGDGGTIVHDQPEALAVTPLAPRKY 121

Query: 1241 STRSGGH-SGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYGQ 1414
            ST    H SG   K  + D +M E D+ GTPEE+AAFM+ELE+F++E A+EFKPPKFYG+
Sbjct: 122  STPRAKHESGAKSKGVWTDVEMGEADESGTPEERAAFMRELETFHKENALEFKPPKFYGE 181

Query: 1415 PLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERY 1594
            PLNCLKLWR+VIRLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE++
Sbjct: 182  PLNCLKLWRSVIRLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKH 241

Query: 1595 QRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE--- 1765
            +RQ+GELQLP +   +P+SV+ E +GYQ               MQGWH QRL G GE   
Sbjct: 242  KRQSGELQLPSSPPHQPASVEKEVSGYQAPGSGRARRDAAARAMQGWHAQRLLGHGEVSE 301

Query: 1766 ---EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPADW 1936
               +D+++N+ P+RE  LKSIG  KQK  + +EL  K    ET K+L   +VD+G PADW
Sbjct: 302  PIIKDRSVNSAPRREKPLKSIGLHKQK--NNLELAEKHANIETDKELDMEIVDVGPPADW 359

Query: 1937 VKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSL 2116
            VKINVRE+KD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKV+SL
Sbjct: 360  VKINVRESKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQLDNPWGITPFKKVISL 419

Query: 2117 PA 2122
            P+
Sbjct: 420  PS 421


>ref|XP_006435461.1| hypothetical protein CICLE_v10001099mg [Citrus clementina]
            gi|557537583|gb|ESR48701.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
          Length = 456

 Score =  393 bits (1010), Expect = e-106
 Identities = 203/387 (52%), Positives = 259/387 (66%), Gaps = 12/387 (3%)
 Frame = +2

Query: 998  QTDKEGLKKCMVDQAHNRDTGTTIDE--PKEPENKSILPAAALTDENINLIEKTASKATK 1171
            Q  + G   C        +T  ++DE  P++ + ++       +       ++     +K
Sbjct: 32   QPSENGQSSCRETAEDKPETLPSVDEVFPEKSDAEATAGVNTKSGGGGAGTDELPESTSK 91

Query: 1172 HEPRDTSVGNKRQSATPDLPVKY----STRSGGHSGETLKNAFDDAKMPENDDGTPEEQA 1339
                DT V N+ ++ T   P+ +    ++++   S    KN  +D +M E D+GTPEEQA
Sbjct: 92   TNGDDTHVENEPKTNTSVTPIPHGESSTSKAEDESVRKSKNWLNDIEMGEADEGTPEEQA 151

Query: 1340 AFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHP 1519
             FMKE+ESFYRE A+EFKPPKFYG+PLNCLKLWRAV+RLGGY+ VT SK+WRQVGESFHP
Sbjct: 152  EFMKEIESFYRENALEFKPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLWRQVGESFHP 211

Query: 1520 PKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXX 1699
            PKTCTTVSWTFRIFYEK+LLEYE+++R +GELQLP ++ P+P++   E +GYQ       
Sbjct: 212  PKTCTTVSWTFRIFYEKALLEYEKHKRLSGELQLPASSFPQPTNAGKEASGYQTPGSSRA 271

Query: 1700 XXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSLKQKRSSEIELP 1861
                    MQGWH QRL G GE      +DK+L +  +RE +LK+IG  K +     E  
Sbjct: 272  RRDAAARAMQGWHAQRLLGHGEVAEPIIKDKSLPSPARREKQLKNIGLPKNRTLDSAE-- 329

Query: 1862 VKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGR 2041
             KA  TE  KQ++  +VD+G PADWVKINVRE KD +EVYALVPGLLREEVRVQSDPAGR
Sbjct: 330  -KAALTEADKQIITEIVDVGPPADWVKINVREAKDCYEVYALVPGLLREEVRVQSDPAGR 388

Query: 2042 LVITGQPEQPDNPWGITPFKKVVSLPA 2122
            LVITG+PEQ DNPWGITPFKKVV LP+
Sbjct: 389  LVITGEPEQVDNPWGITPFKKVVILPS 415


>ref|XP_006435458.1| hypothetical protein CICLE_v10001099mg [Citrus clementina]
            gi|567885801|ref|XP_006435459.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
            gi|567885803|ref|XP_006435460.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
            gi|557537580|gb|ESR48698.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
            gi|557537581|gb|ESR48699.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
            gi|557537582|gb|ESR48700.1| hypothetical protein
            CICLE_v10001099mg [Citrus clementina]
          Length = 442

 Score =  393 bits (1010), Expect = e-106
 Identities = 203/387 (52%), Positives = 259/387 (66%), Gaps = 12/387 (3%)
 Frame = +2

Query: 998  QTDKEGLKKCMVDQAHNRDTGTTIDE--PKEPENKSILPAAALTDENINLIEKTASKATK 1171
            Q  + G   C        +T  ++DE  P++ + ++       +       ++     +K
Sbjct: 32   QPSENGQSSCRETAEDKPETLPSVDEVFPEKSDAEATAGVNTKSGGGGAGTDELPESTSK 91

Query: 1172 HEPRDTSVGNKRQSATPDLPVKY----STRSGGHSGETLKNAFDDAKMPENDDGTPEEQA 1339
                DT V N+ ++ T   P+ +    ++++   S    KN  +D +M E D+GTPEEQA
Sbjct: 92   TNGDDTHVENEPKTNTSVTPIPHGESSTSKAEDESVRKSKNWLNDIEMGEADEGTPEEQA 151

Query: 1340 AFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHP 1519
             FMKE+ESFYRE A+EFKPPKFYG+PLNCLKLWRAV+RLGGY+ VT SK+WRQVGESFHP
Sbjct: 152  EFMKEIESFYRENALEFKPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLWRQVGESFHP 211

Query: 1520 PKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXX 1699
            PKTCTTVSWTFRIFYEK+LLEYE+++R +GELQLP ++ P+P++   E +GYQ       
Sbjct: 212  PKTCTTVSWTFRIFYEKALLEYEKHKRLSGELQLPASSFPQPTNAGKEASGYQTPGSSRA 271

Query: 1700 XXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSLKQKRSSEIELP 1861
                    MQGWH QRL G GE      +DK+L +  +RE +LK+IG  K +     E  
Sbjct: 272  RRDAAARAMQGWHAQRLLGHGEVAEPIIKDKSLPSPARREKQLKNIGLPKNRTLDSAE-- 329

Query: 1862 VKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGR 2041
             KA  TE  KQ++  +VD+G PADWVKINVRE KD +EVYALVPGLLREEVRVQSDPAGR
Sbjct: 330  -KAALTEADKQIITEIVDVGPPADWVKINVREAKDCYEVYALVPGLLREEVRVQSDPAGR 388

Query: 2042 LVITGQPEQPDNPWGITPFKKVVSLPA 2122
            LVITG+PEQ DNPWGITPFKKVV LP+
Sbjct: 389  LVITGEPEQVDNPWGITPFKKVVILPS 415


>ref|XP_006473863.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            isoform X1 [Citrus sinensis]
            gi|568839798|ref|XP_006473864.1| PREDICTED: AT-rich
            interactive domain-containing protein 5-like isoform X2
            [Citrus sinensis]
          Length = 457

 Score =  392 bits (1008), Expect = e-106
 Identities = 196/336 (58%), Positives = 242/336 (72%), Gaps = 10/336 (2%)
 Frame = +2

Query: 1145 EKTASKATKHEPRDTSVGNKRQSATPDLPVKY----STRSGGHSGETLKNAFDDAKMPEN 1312
            ++     +K    DT V N+ ++ T   P+ +    ++++   S    KN  +D +M E 
Sbjct: 84   DELPESTSKTNGDDTHVENEPKTNTSVTPIPHGESSTSKAEDESVRKSKNWLNDIEMGEA 143

Query: 1313 DDGTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMW 1492
            D+GTPEEQA FMKE+ESFYRE A+EFKPPKFYG+PLNCLKLWRAV+RLGGY+ VT SK+W
Sbjct: 144  DEGTPEEQAEFMKEIESFYRENALEFKPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLW 203

Query: 1493 RQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNG 1672
            RQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+++R +GELQLP ++ P+P++   E +G
Sbjct: 204  RQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHKRLSGELQLPASSFPQPTNAGKEASG 263

Query: 1673 YQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSLKQ 1834
            YQ               MQGWH QRL G GE      +DK+L +  +RE +LK+IG  K 
Sbjct: 264  YQTPGSSRARRDAAARAMQGWHAQRLLGHGEVAEPIIKDKSLPSPARREKQLKNIGLPKN 323

Query: 1835 KRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEV 2014
            +     E   KA  TE  KQ++  +VD+G PADWVKINVRE KD +EVYALVPGLLREEV
Sbjct: 324  RTLDSAE---KAALTEADKQIITEIVDVGPPADWVKINVREAKDCYEVYALVPGLLREEV 380

Query: 2015 RVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            RVQSDPAGRLVITG+PEQ DNPWGITPFKKVV LP+
Sbjct: 381  RVQSDPAGRLVITGEPEQVDNPWGITPFKKVVILPS 416


>gb|EOY15870.1| Transcription factor, putative isoform 1 [Theobroma cacao]
            gi|508723976|gb|EOY15873.1| Transcription factor,
            putative isoform 1 [Theobroma cacao]
            gi|508723977|gb|EOY15874.1| Transcription factor,
            putative isoform 1 [Theobroma cacao]
          Length = 437

 Score =  389 bits (998), Expect = e-105
 Identities = 224/423 (52%), Positives = 279/423 (65%), Gaps = 17/423 (4%)
 Frame = +2

Query: 905  EKSAVLNGQLP----FQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTID 1072
            E + +L  QLP      L  S +  ++  +  ED  T          +  H+  +GT  D
Sbjct: 5    EDTEMLEQQLPEASKVNLVDSGVQQQQSSLATEDQDT---------TETRHSPHSGTADD 55

Query: 1073 EPKE-PENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPD---LP-VK 1237
            +    P + ++    AL D+     +KT++ A  +     SV    + ++ D   LP  +
Sbjct: 56   KALTLPTDVNMSDNPALPDKPD---KKTSNDANTNARDAASVERLEKKSSGDAAPLPCAE 112

Query: 1238 YSTRSGGH-SGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYG 1411
            + T    H S +  KN   D +M E D+ GT EE+AAFMKELESFY++R++EFKPPKFYG
Sbjct: 113  FLTPKSQHGSVKKSKNWLLDPEMGEADEAGTQEERAAFMKELESFYKDRSLEFKPPKFYG 172

Query: 1412 QPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYER 1591
            +PLNCLKLWRAVIRLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+
Sbjct: 173  EPLNCLKLWRAVIRLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEK 232

Query: 1592 YQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE-- 1765
            Y+R+NGE+QLP ++LP     + E +GYQ               MQGWH QR  G GE  
Sbjct: 233  YKRENGEIQLPASSLPHTVG-EKESSGYQASGSGRARRDAAARAMQGWHAQRSVGYGEIT 291

Query: 1766 ----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPAD 1933
                +DK+L++ PK+++ LK+IG  KQK     E P +    E +KQLV  VVD+GAPAD
Sbjct: 292  EPIIKDKSLSSTPKQKH-LKTIGLQKQKTPISTE-PAEKSAHEPNKQLVTEVVDVGAPAD 349

Query: 1934 WVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVS 2113
            WVKINVRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKVV+
Sbjct: 350  WVKINVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQVDNPWGITPFKKVVT 409

Query: 2114 LPA 2122
            LPA
Sbjct: 410  LPA 412


>gb|EMJ10413.1| hypothetical protein PRUPE_ppa006712mg [Prunus persica]
          Length = 399

 Score =  389 bits (998), Expect = e-105
 Identities = 206/362 (56%), Positives = 247/362 (68%), Gaps = 11/362 (3%)
 Frame = +2

Query: 1067 IDEPKEPENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYST 1246
            + +P +PE    +   A+ D+              +E   + V  + ++A  +  VK   
Sbjct: 14   LGQPPQPETNDQVMRDAIQDKPATHEGNVGDDVKVNEALLSDVPIENKAAGSNATVKNQL 73

Query: 1247 RS-GGHSGE-TLKNAFDDAKMPENDD---GTPEEQAAFMKELESFYRERAVEFKPPKFYG 1411
            +S   H G   LKN  +D +M E DD   GTP +QAAFMKE+ESFY+E  +EFK PKFYG
Sbjct: 74   KSVDKHVGVGELKNGSNDTEMAEADDTGDGTPSQQAAFMKEVESFYKENTLEFKAPKFYG 133

Query: 1412 QPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYER 1591
            +PLNCLKLWRAV RLGGYD VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+
Sbjct: 134  EPLNCLKLWRAVTRLGGYDVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEK 193

Query: 1592 YQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE-- 1765
            ++RQ GEL+LPV  + +  +V+ E +G+Q               MQGWH QRL G GE  
Sbjct: 194  HKRQTGELRLPVGPVTQSMTVEKEASGHQTPGSGRARRDAAARAMQGWHAQRLVGYGEVA 253

Query: 1766 ----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPAD 1933
                +DKNL++  KRE  LKSIGS+K +  + +E        E  KQ+V  VVD+G PAD
Sbjct: 254  EPIIKDKNLSSTSKREKNLKSIGSIKHRAPTNLE--HATANIEADKQVVTTVVDLGPPAD 311

Query: 1934 WVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVS 2113
            WVKINVRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKVVS
Sbjct: 312  WVKINVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQVDNPWGITPFKKVVS 371

Query: 2114 LP 2119
            LP
Sbjct: 372  LP 373


>ref|XP_002528526.1| transcription factor, putative [Ricinus communis]
            gi|223532028|gb|EEF33838.1| transcription factor,
            putative [Ricinus communis]
          Length = 656

 Score =  387 bits (995), Expect = e-105
 Identities = 253/613 (41%), Positives = 336/613 (54%), Gaps = 17/613 (2%)
 Frame = +2

Query: 335  SGTTGPLRQDTVLQMDKSDVEMEDAEK-ESQGGPDNVKESRDGTVDAKMEPEDQKQHSGA 511
            S     L   + +Q  +++ ++ D +K E++  PD            ++ P D  +  G 
Sbjct: 77   SSINNTLESKSTMQQSQTEADLGDPQKLETEAAPDTNSAQLKTPSQDEIVPNDNIE--GL 134

Query: 512  NADCQKSTEVEAKNEN-DVETVCEEAKKSNAETEVQEQTNAKSEVLELSNREGMKDDQTN 688
             AD Q   E      N +++TV              +QT  K+ V    N E     QT 
Sbjct: 135  RADTQPQIESALNGINMELDTV-------------HQQTQNKA-VSSYDNVEFKTSPQTE 180

Query: 689  ATMEAEVQSMASSEAEDQKE---DQSNVTVKNEMKDATDPQNAPDEQNPSESGKTSI-NP 856
            A  + +   + SS  +   E   + +NV +K+  +   +  ++P    PS+  KTS+ + 
Sbjct: 181  AAADDKDMDLKSSPQKHPTEAGLNDNNVDLKSGPQQPLEGSSSPA---PSDDSKTSLKSE 237

Query: 857  VESKTEEKKAGHTGEEEKSAVLNGQLPFQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVD 1036
             E  T+EK  G+   + K                  P+   +    N T      K   D
Sbjct: 238  TEPPTKEKTTGNENSDCK------------------PETCGVDGTTNSTSNMETAKLTAD 279

Query: 1037 QAHNRDTGTTIDEPKE-PENKSILPAAALTDENINLIE--KTASKATKHEPR-DTSVGNK 1204
                        EP E P++KS        + +   I   +T+S  T+HE + +   GN 
Sbjct: 280  SK---------SEPSEVPQSKSGHADTVTKEHSEPAIPHAETSSIKTEHENKQELKNGNS 330

Query: 1205 RQSATPDLPVKYSTRSGGHSGETLKNAFDDAKMPENDDGTPEEQAAFMKELESFYRERAV 1384
                 P    K +  S   S   L+N  D +     + G+ EEQ AFMKELE+F+RER+ 
Sbjct: 331  EMDVAP----KSNGNSASKSSFLLENYHDGS-----ESGSEEEQLAFMKELENFFRERST 381

Query: 1385 EFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFY 1564
            EFKPPKFYG+ LNCLKLWRAV+RLGGYD+VT  K+WRQVGESF PPKTCTTVSWTFR FY
Sbjct: 382  EFKPPKFYGEGLNCLKLWRAVMRLGGYDKVTTCKLWRQVGESFKPPKTCTTVSWTFRGFY 441

Query: 1565 EKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQ 1744
            EK+LL+YER++   GEL LP+T+ PEP  VDN+  G                 MQGWH Q
Sbjct: 442  EKALLDYERHKTNGGELNLPLTSNPEPVIVDNQTPG-----SGRARRDAAARAMQGWHSQ 496

Query: 1745 RLFGLGE------EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSK-QLVA 1903
            RL G GE      +DKN   M KRE +LK++G +K+K+ S +E  VKA R +TSK QL  
Sbjct: 497  RLLGNGEVSDAIIKDKNSVPMQKREKQLKNLGIVKRKKPSYMEHAVKAARAKTSKPQLDV 556

Query: 1904 NVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPW 2083
             VVD+G+PADWVKINV++TKD FEVYALVPGLLREEVRVQSDPAGRLVI+G+PE PDNPW
Sbjct: 557  EVVDLGSPADWVKINVQKTKDCFEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPW 616

Query: 2084 GITPFKKVVSLPA 2122
            G+TPFKKVVSLP+
Sbjct: 617  GVTPFKKVVSLPS 629


>ref|XP_002300618.2| hypothetical protein POPTR_0002s00550g [Populus trichocarpa]
            gi|550343983|gb|EEE79891.2| hypothetical protein
            POPTR_0002s00550g [Populus trichocarpa]
          Length = 444

 Score =  385 bits (990), Expect = e-104
 Identities = 215/467 (46%), Positives = 282/467 (60%), Gaps = 7/467 (1%)
 Frame = +2

Query: 740  QKEDQSNVTVKNEMKDATDPQNAPDEQNPSESGKTSINPVESKTEEKKAGHTGEEEKSAV 919
            + ED   VTV++ + D+ D +  P E +  E  +T+   VE K+ E       +++ +  
Sbjct: 3    EMEDSEMVTVQDLLVDSEDKK--PSEASVKEQEETADASVEQKSNENGQTSVADDDHTVT 60

Query: 920  LNGQLPFQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTIDEPKEPENKS 1099
            L   +P      A+P +K +                                        
Sbjct: 61   LASDVPMS-DTQALPNEKND---------------------------------------- 79

Query: 1100 ILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYST-RSGGHSGETL 1276
                   TDENIN   + A +          V N+ Q+ATP  P +++T ++   S    
Sbjct: 80   -------TDENIN---QQAGEEKTDGDDGGCVQNQPQTATPSTPRRHATPKAKQDSAAKS 129

Query: 1277 KNAFDDAKMPEND-DGTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIR 1453
            KN + D KM E D  GTPEE+AAFMKELE+FY++  ++FKPPKFYG+PLNCLKLWR+VI+
Sbjct: 130  KNVWTDIKMGEADVAGTPEERAAFMKELETFYKQNTMDFKPPKFYGEPLNCLKLWRSVIK 189

Query: 1454 LGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTA 1633
            LGGY+ VT +K+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+++++ GELQLP + 
Sbjct: 190  LGGYEVVTANKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHKKETGELQLPSSP 249

Query: 1634 LPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGEEDKNLN-----NMPKR 1798
            L + +SV+ E +GYQ               MQGWH QR  G GE  + +      N  +R
Sbjct: 250  LHQATSVEKEASGYQAPGSGRARRDAAARAMQGWHAQRHLGHGEVSEPIAKVKSLNFARR 309

Query: 1799 ENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEV 1978
            E  LKSIG L +++++ +EL  +    E  K++ A + DIG PADWVKINVRE+KD +E+
Sbjct: 310  EKPLKSIG-LHRQKTTNLELAERPMNAEPDKEVDAEIADIGPPADWVKINVRESKDCYEI 368

Query: 1979 YALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLP 2119
            YALVPGLLREEVRVQSDP GRLVITGQPEQ DNPWGITPFKKVVSLP
Sbjct: 369  YALVPGLLREEVRVQSDPVGRLVITGQPEQLDNPWGITPFKKVVSLP 415


>gb|AEO22030.1| ARID and Hsp20 domains containing protein [Lotus japonicus]
          Length = 425

 Score =  385 bits (988), Expect = e-104
 Identities = 200/337 (59%), Positives = 236/337 (70%), Gaps = 8/337 (2%)
 Frame = +2

Query: 1136 NLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYSTRSGGHSGETLKNAFDDAKMPEND 1315
            N++E       + E  D      ++  TP        +S   +   +K+  +D ++ + D
Sbjct: 69   NMLEVKTGSENQLELEDVKTPLHQELVTP--------KSRERNVREMKSVLNDTEVVDYD 120

Query: 1316 D-GTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMW 1492
            + G   E+ AFMKELE+FYRER++EFKPPKFYG+PLNCLKLWRAVIRLGGYD VTGSK+W
Sbjct: 121  EPGASLEREAFMKELENFYRERSLEFKPPKFYGEPLNCLKLWRAVIRLGGYDVVTGSKLW 180

Query: 1493 RQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNG 1672
            RQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+++R+ GELQLPV   P+PSSV+ E   
Sbjct: 181  RQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHKREIGELQLPVGVFPQPSSVEKETTV 240

Query: 1673 YQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSL-K 1831
            YQ               MQGWH QRL G GE      +DKN N   KRE  LKSIG++ K
Sbjct: 241  YQAPGSGRARRDAAARAMQGWHAQRLLGYGEVAEPVIKDKNFNPTTKREKNLKSIGAINK 300

Query: 1832 QKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREE 2011
            Q+  S +E   KA   +  +QLV  VVD+G PADWVKINVRETKD FEVYALVPGLLREE
Sbjct: 301  QRTPSVLEHVEKAANIDGDRQLVTAVVDVGPPADWVKINVRETKDCFEVYALVPGLLREE 360

Query: 2012 VRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            VRVQSDP GRLVITG PE  DNPWGITPFKKVV+LPA
Sbjct: 361  VRVQSDPVGRLVITGMPEHIDNPWGITPFKKVVNLPA 397


>gb|EOY15871.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 2, partial
            [Theobroma cacao]
          Length = 411

 Score =  382 bits (981), Expect = e-103
 Identities = 223/423 (52%), Positives = 278/423 (65%), Gaps = 17/423 (4%)
 Frame = +2

Query: 905  EKSAVLNGQLP----FQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTID 1072
            E + +L  QLP      L  S +  ++  +  ED  T          +  H+  +GT  D
Sbjct: 5    EDTEMLEQQLPEASKVNLVDSGVQQQQSSLATEDQDT---------TETRHSPHSGTADD 55

Query: 1073 EPKE-PENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPD---LP-VK 1237
            +    P + ++    AL D+     +KT++ A  +     SV    + ++ D   LP  +
Sbjct: 56   KALTLPTDVNMSDNPALPDKPD---KKTSNDANTNARDAASVERLEKKSSGDAAPLPCAE 112

Query: 1238 YSTRSGGH-SGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYG 1411
            + T    H S +  KN   D +M E D+ GT EE+AAFMKELESFY++R++EFKPPKFYG
Sbjct: 113  FLTPKSQHGSVKKSKNWLLDPEMGEADEAGTQEERAAFMKELESFYKDRSLEFKPPKFYG 172

Query: 1412 QPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYER 1591
            +PLNCLKLWRAVIRLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+
Sbjct: 173  EPLNCLKLWRAVIRLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEK 232

Query: 1592 YQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE-- 1765
            Y+R+NGE+QLP ++LP     + E +GYQ               MQGWH QR  G GE  
Sbjct: 233  YKRENGEIQLPASSLPHTVG-EKESSGYQASGSGRARRDAAARAMQGWHAQRSVGYGEIT 291

Query: 1766 ----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPAD 1933
                +DK+L++ PK+++ LK+IG  KQK     E P +    E +K LV  VVD+GAPAD
Sbjct: 292  EPIIKDKSLSSTPKQKH-LKTIGLQKQKTPISTE-PAEKSAHEPNK-LVTEVVDVGAPAD 348

Query: 1934 WVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVS 2113
            WVKINVRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKVV+
Sbjct: 349  WVKINVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQVDNPWGITPFKKVVT 408

Query: 2114 LPA 2122
            LPA
Sbjct: 409  LPA 411


>gb|ESW07981.1| hypothetical protein PHAVU_009G008800g [Phaseolus vulgaris]
            gi|561009075|gb|ESW07982.1| hypothetical protein
            PHAVU_009G008800g [Phaseolus vulgaris]
          Length = 419

 Score =  381 bits (979), Expect = e-103
 Identities = 198/337 (58%), Positives = 239/337 (70%), Gaps = 8/337 (2%)
 Frame = +2

Query: 1136 NLIEKTASKATKHEPRDTSVGNKRQSATPDLPVKYSTRSGGHSGETLKNAFDDAKMPEND 1315
            N+ E +     + E  D+   +  +  TP  P + + R        +KN  +D +M E D
Sbjct: 63   NVPETSVISENQFELEDSKTVSHHELITPK-PKEKNIRE-------MKNVLNDTEMTEYD 114

Query: 1316 D-GTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMW 1492
            + GTP ++  FMKE+E+FYRER++EFKPPKFYG+PLNCLKLWRAVIRLGGYD VTGSK+W
Sbjct: 115  EYGTPLDRDTFMKEIENFYRERSLEFKPPKFYGEPLNCLKLWRAVIRLGGYDVVTGSKLW 174

Query: 1493 RQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNG 1672
            RQVGESF+PPKTCTTVSWTFRIFYEK+LLEYE+++R+ GELQLPV +  + S+V+ E   
Sbjct: 175  RQVGESFNPPKTCTTVSWTFRIFYEKALLEYEKHKRETGELQLPVGSFHQSSNVEKETTV 234

Query: 1673 YQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSL-K 1831
            YQ               MQGWH QRL G GE      +DKN ++ PKRE  LKSIG + K
Sbjct: 235  YQAPGSGRARRDAAARAMQGWHTQRLLGYGEVAEPATKDKNFSSTPKREKNLKSIGMINK 294

Query: 1832 QKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREE 2011
            Q+  + ++   KA   E  +QL+  VVDIG PADWVKINVRETKD FEVYALVPGLLREE
Sbjct: 295  QRTQAGLDHAEKAANIEGDRQLITAVVDIGPPADWVKINVRETKDCFEVYALVPGLLREE 354

Query: 2012 VRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            VRVQSDP GRLVITG PE  DNPWGITPFKKVV+LPA
Sbjct: 355  VRVQSDPVGRLVITGLPEHIDNPWGITPFKKVVNLPA 391


>ref|XP_004500824.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            isoform X1 [Cicer arietinum]
            gi|502130949|ref|XP_004500825.1| PREDICTED: AT-rich
            interactive domain-containing protein 5-like isoform X2
            [Cicer arietinum]
          Length = 401

 Score =  381 bits (979), Expect = e-103
 Identities = 192/291 (65%), Positives = 223/291 (76%), Gaps = 8/291 (2%)
 Frame = +2

Query: 1274 LKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVI 1450
            + +  + A+M   D+ GTP ++ AFMKELE+FYRER++EFKPPKFYG PLNCLKLWRAVI
Sbjct: 83   MNHVLNSAEMTSYDESGTPADREAFMKELENFYRERSLEFKPPKFYGVPLNCLKLWRAVI 142

Query: 1451 RLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVT 1630
            RLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+Y+R+ GELQLPV 
Sbjct: 143  RLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKYKREIGELQLPVG 202

Query: 1631 ALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMP 1792
            +L +PSSV+ E   YQ               MQGWH QRL G GE      +DKN ++ P
Sbjct: 203  SLHQPSSVEKETAVYQAPGSGRARRDAAARAMQGWHAQRLLGYGEAAEPTVKDKNFSSTP 262

Query: 1793 KRENKLKSIGSL-KQKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDS 1969
            KRE  LK+IG + KQ+  S ++   KA   E  +QL+A VVD+G PADWVKINVRETKD 
Sbjct: 263  KREKNLKNIGVINKQRTPSSMDHADKAANIEGDRQLIAAVVDLGPPADWVKINVRETKDC 322

Query: 1970 FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            FEVYALVPGLLREEVRVQSDP GRLVITG PE  DNPWGITPFKKVV+LPA
Sbjct: 323  FEVYALVPGLLREEVRVQSDPVGRLVITGLPEHVDNPWGITPFKKVVNLPA 373


>gb|EOY15872.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3
            [Theobroma cacao]
          Length = 465

 Score =  381 bits (978), Expect = e-103
 Identities = 220/418 (52%), Positives = 274/418 (65%), Gaps = 17/418 (4%)
 Frame = +2

Query: 905  EKSAVLNGQLP----FQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTID 1072
            E + +L  QLP      L  S +  ++  +  ED  T          +  H+  +GT  D
Sbjct: 5    EDTEMLEQQLPEASKVNLVDSGVQQQQSSLATEDQDT---------TETRHSPHSGTADD 55

Query: 1073 EPKE-PENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPD---LP-VK 1237
            +    P + ++    AL D+     +KT++ A  +     SV    + ++ D   LP  +
Sbjct: 56   KALTLPTDVNMSDNPALPDKPD---KKTSNDANTNARDAASVERLEKKSSGDAAPLPCAE 112

Query: 1238 YSTRSGGH-SGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYG 1411
            + T    H S +  KN   D +M E D+ GT EE+AAFMKELESFY++R++EFKPPKFYG
Sbjct: 113  FLTPKSQHGSVKKSKNWLLDPEMGEADEAGTQEERAAFMKELESFYKDRSLEFKPPKFYG 172

Query: 1412 QPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYER 1591
            +PLNCLKLWRAVIRLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+
Sbjct: 173  EPLNCLKLWRAVIRLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEK 232

Query: 1592 YQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE-- 1765
            Y+R+NGE+QLP ++LP     + E +GYQ               MQGWH QR  G GE  
Sbjct: 233  YKRENGEIQLPASSLPHTVG-EKESSGYQASGSGRARRDAAARAMQGWHAQRSVGYGEIT 291

Query: 1766 ----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPAD 1933
                +DK+L++ PK+++ LK+IG  KQK     E P +    E +KQLV  VVD+GAPAD
Sbjct: 292  EPIIKDKSLSSTPKQKH-LKTIGLQKQKTPISTE-PAEKSAHEPNKQLVTEVVDVGAPAD 349

Query: 1934 WVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKV 2107
            WVKINVRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKV
Sbjct: 350  WVKINVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQVDNPWGITPFKKV 407


>ref|XP_006577925.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            isoform X1 [Glycine max] gi|571448689|ref|XP_006577926.1|
            PREDICTED: AT-rich interactive domain-containing protein
            5-like isoform X2 [Glycine max]
          Length = 414

 Score =  380 bits (976), Expect = e-102
 Identities = 216/424 (50%), Positives = 266/424 (62%), Gaps = 8/424 (1%)
 Frame = +2

Query: 872  EEKKAGHTGEEEKSAVLNGQLPFQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNR 1051
            +EK  G++       V +  L  Q  AS+  P +++ V              MV+Q H+ 
Sbjct: 4    DEKPTGNSHGIADMDVADAALQGQRVASSPAPLQQDQV--------------MVEQGHDI 49

Query: 1052 DTGTTIDEPKEPENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPDLP 1231
              G +          + LP A++ + N            + E  D    + ++  TP  P
Sbjct: 50   KNGDS--------GPAHLPEASVINNN------------QFEVEDAQTVSHQELTTPK-P 88

Query: 1232 VKYSTRSGGHSGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFY 1408
             + + R        +KN  +D +M + D+ GTP ++  FMKELE+FYRER++EFKPPKFY
Sbjct: 89   KEKNVRE-------MKNVLNDTEMTDYDEYGTPLDRETFMKELETFYRERSLEFKPPKFY 141

Query: 1409 GQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYE 1588
            G+PLNCLKLWRAVIRLGGYD VTGSK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE
Sbjct: 142  GEPLNCLKLWRAVIRLGGYDVVTGSKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYE 201

Query: 1589 RYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE- 1765
            R++R+ GELQLPV    + S+V+ E   YQ               MQGWH QRL G GE 
Sbjct: 202  RHKREIGELQLPVGPFHQSSNVEKEPAVYQTPGSGRARRDSAARAMQGWHAQRLLGYGEV 261

Query: 1766 -----EDKNLNNMPKRENKLKSIGSL-KQKRSSEIELPVKAQRTETSKQLVANVVDIGAP 1927
                 +DKN ++  KRE  LKSIG + KQ+  S +E   K+   E  +QLV  VVD+G P
Sbjct: 262  AEPVIKDKNFSSTQKREKNLKSIGMINKQRTLSGLEHAEKSANIEGDQQLVTAVVDVGPP 321

Query: 1928 ADWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKV 2107
            ADWVKINVRETKD FEVYALVPGLLREEVRVQSDP GRLVITG PE  DNPWGITPFKKV
Sbjct: 322  ADWVKINVRETKDGFEVYALVPGLLREEVRVQSDPVGRLVITGVPEHLDNPWGITPFKKV 381

Query: 2108 VSLP 2119
            V+LP
Sbjct: 382  VNLP 385


>ref|XP_006342525.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            [Solanum tuberosum]
          Length = 418

 Score =  380 bits (975), Expect = e-102
 Identities = 209/393 (53%), Positives = 257/393 (65%), Gaps = 13/393 (3%)
 Frame = +2

Query: 983  VNEDNQTDKEGLKKCMVDQAHNRDTGTTIDEPKEPENKSILPAA-----ALTDENINL-I 1144
            VN+ +Q++K      M+ +         +D+  +P+ K+          A +DE  NL I
Sbjct: 14   VNDVHQSEK------MIYEVIPNGNNMLVDKSVDPQMKTNATTVTTDGDARSDEFPNLTI 67

Query: 1145 EKTASKATKHEPRDTSVGNKRQSATPDLPVKYSTRSGG-HSGETLKNAFDDAKMPENDDG 1321
            E T          D+S+G   + A+P   +  ST + G HSGE   N +      E+D+G
Sbjct: 68   EVT----------DSSIGKAPEPASPMFLINPSTANAGQHSGEASANIYAMMADGEDDEG 117

Query: 1322 TPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMWRQV 1501
            +PE+QAAF+ +L +FYRE+A+EFK PKFYG PLNCLKLWR+VIRLGGYDRVTG K+WRQV
Sbjct: 118  SPEDQAAFIGKLGTFYREKAMEFKLPKFYGHPLNCLKLWRSVIRLGGYDRVTGYKLWRQV 177

Query: 1502 GESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNGYQX 1681
            G+SF+PPKTCTTVSWTFR FYEK LL+YER++ QN ELQLP+   P  S VDNEG+GYQ 
Sbjct: 178  GDSFNPPKTCTTVSWTFRGFYEKLLLQYERHRTQNRELQLPIPPPPGSSGVDNEGSGYQV 237

Query: 1682 XXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSLKQKRS 1843
                            GW  Q L G GE      +D++ NNMPKR   LK+ GSLK +  
Sbjct: 238  SASGRAVRDSAARCRLGWQEQHLLGYGEVAEPIVKDRSANNMPKRAKSLKTSGSLKHQGQ 297

Query: 1844 SEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEVRVQ 2023
            +E+E P+KA  TET K L   VVD+G PADWVKI  RET DSFEVYALVPGL REEV+VQ
Sbjct: 298  NEVEHPMKAAETETFKLLDVQVVDVGPPADWVKITARETNDSFEVYALVPGLSREEVQVQ 357

Query: 2024 SDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            SDPAGRLVITGQP Q DN WG T FKKVV+LPA
Sbjct: 358  SDPAGRLVITGQPNQLDNLWGATAFKKVVTLPA 390


>ref|XP_004299960.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            [Fragaria vesca subsp. vesca]
          Length = 414

 Score =  379 bits (974), Expect = e-102
 Identities = 191/320 (59%), Positives = 234/320 (73%), Gaps = 8/320 (2%)
 Frame = +2

Query: 1184 DTSVGNKRQSATPDLPVKYST--RSGGHSGETLKNAFDDAKMPENDDGTPEEQAAFMKEL 1357
            + SVG K   ++  +  K  T  +  G S    K+   D++  +++DGTP +QAAFM+EL
Sbjct: 70   EVSVGRKANGSSARVANKLKTVDKLVGES----KSWLSDSEGEDSEDGTPLQQAAFMREL 125

Query: 1358 ESFYRERAVEFKPPKFYGQPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTT 1537
            ESF++ER++EFK PKFYG+PLNCLKLWRAVI+ GGYD VT SK+WRQVGESFHPPKTCTT
Sbjct: 126  ESFHKERSLEFKAPKFYGEPLNCLKLWRAVIKAGGYDVVTTSKLWRQVGESFHPPKTCTT 185

Query: 1538 VSWTFRIFYEKSLLEYERYQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXX 1717
            VSWTFRIFYEK+LLEYE+++R+ GE+Q  + +L + ++   E + +Q             
Sbjct: 186  VSWTFRIFYEKALLEYEKHKRKTGEIQTAIASLTQCTTPVKEASSHQAPGSGRARRDAAA 245

Query: 1718 XXMQGWHVQRLFGLGE------EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRT 1879
              MQGWH QRL G GE      +DKNL+  PKRE  LKSIGS+K K  + ++   K    
Sbjct: 246  RAMQGWHAQRLVGYGEIAEPIVKDKNLSTTPKREKNLKSIGSIKHKTPTFLDHAEKTAFV 305

Query: 1880 ETSKQLVANVVDIGAPADWVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQ 2059
            E  KQ+V N+VD+G PADWVKI+VRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQ
Sbjct: 306  EADKQVVTNIVDLGPPADWVKISVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQ 365

Query: 2060 PEQPDNPWGITPFKKVVSLP 2119
            PE PDNPWGITPFKKVVSLP
Sbjct: 366  PEHPDNPWGITPFKKVVSLP 385


>ref|XP_003527249.1| PREDICTED: AT-rich interactive domain-containing protein 5-like
            [Glycine max]
          Length = 404

 Score =  379 bits (972), Expect = e-102
 Identities = 191/291 (65%), Positives = 222/291 (76%), Gaps = 8/291 (2%)
 Frame = +2

Query: 1274 LKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYGQPLNCLKLWRAVI 1450
            +KN  +D +M + D+ GTP ++  FMKELE+FYRER++EFKPPKFYG+PLNCLKLWRAVI
Sbjct: 86   MKNVLNDTEMTDYDEYGTPLDRETFMKELETFYRERSLEFKPPKFYGEPLNCLKLWRAVI 145

Query: 1451 RLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYERYQRQNGELQLPVT 1630
            RLGGYD VTGSK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+++R+ GELQLPV 
Sbjct: 146  RLGGYDVVTGSKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHKREIGELQLPVG 205

Query: 1631 ALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE------EDKNLNNMP 1792
               + S+V+ E   YQ               MQGWH QRL G GE      +DKN ++ P
Sbjct: 206  PFHQSSNVEKEPAVYQTPGSGRARRDAAARAMQGWHAQRLLGYGEVAEPVIKDKNFSSTP 265

Query: 1793 KRENKLKSIG-SLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPADWVKINVRETKDS 1969
            KRE  LKSIG + KQ+  S +E   K+   E  +QLV  VVD+G PADWVKINVRE+KD 
Sbjct: 266  KREKNLKSIGMNNKQRTLSGLEHAEKSANIEGDRQLVTAVVDVGPPADWVKINVRESKDC 325

Query: 1970 FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKVVSLPA 2122
            FEVYALVPGLLREEVRVQSDP GRLVITG PE  DNPWGITPFKKVV+LPA
Sbjct: 326  FEVYALVPGLLREEVRVQSDPVGRLVITGVPEHIDNPWGITPFKKVVNLPA 376


>gb|EOY15876.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 7
            [Theobroma cacao]
          Length = 464

 Score =  374 bits (961), Expect = e-101
 Identities = 219/418 (52%), Positives = 273/418 (65%), Gaps = 17/418 (4%)
 Frame = +2

Query: 905  EKSAVLNGQLP----FQLPASAIPPKKKEMVNEDNQTDKEGLKKCMVDQAHNRDTGTTID 1072
            E + +L  QLP      L  S +  ++  +  ED  T          +  H+  +GT  D
Sbjct: 5    EDTEMLEQQLPEASKVNLVDSGVQQQQSSLATEDQDT---------TETRHSPHSGTADD 55

Query: 1073 EPKE-PENKSILPAAALTDENINLIEKTASKATKHEPRDTSVGNKRQSATPD---LP-VK 1237
            +    P + ++    AL D+     +KT++ A  +     SV    + ++ D   LP  +
Sbjct: 56   KALTLPTDVNMSDNPALPDKPD---KKTSNDANTNARDAASVERLEKKSSGDAAPLPCAE 112

Query: 1238 YSTRSGGH-SGETLKNAFDDAKMPENDD-GTPEEQAAFMKELESFYRERAVEFKPPKFYG 1411
            + T    H S +  KN   D +M E D+ GT EE+AAFMKELESFY++R++EFKPPKFYG
Sbjct: 113  FLTPKSQHGSVKKSKNWLLDPEMGEADEAGTQEERAAFMKELESFYKDRSLEFKPPKFYG 172

Query: 1412 QPLNCLKLWRAVIRLGGYDRVTGSKMWRQVGESFHPPKTCTTVSWTFRIFYEKSLLEYER 1591
            +PLNCLKLWRAVIRLGGY+ VT SK+WRQVGESFHPPKTCTTVSWTFRIFYEK+LLEYE+
Sbjct: 173  EPLNCLKLWRAVIRLGGYEVVTASKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEK 232

Query: 1592 YQRQNGELQLPVTALPEPSSVDNEGNGYQXXXXXXXXXXXXXXXMQGWHVQRLFGLGE-- 1765
            Y+R+NGE+QLP ++LP     + E +GYQ               MQGWH QR  G GE  
Sbjct: 233  YKRENGEIQLPASSLPHTVG-EKESSGYQASGSGRARRDAAARAMQGWHAQRSVGYGEIT 291

Query: 1766 ----EDKNLNNMPKRENKLKSIGSLKQKRSSEIELPVKAQRTETSKQLVANVVDIGAPAD 1933
                +DK+L++ PK+++ LK+IG  KQK     E P +    E +K LV  VVD+GAPAD
Sbjct: 292  EPIIKDKSLSSTPKQKH-LKTIGLQKQKTPISTE-PAEKSAHEPNK-LVTEVVDVGAPAD 348

Query: 1934 WVKINVRETKDSFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQPDNPWGITPFKKV 2107
            WVKINVRETKD FEVYALVPGLLREEVRVQSDPAGRLVITGQPEQ DNPWGITPFKKV
Sbjct: 349  WVKINVRETKDCFEVYALVPGLLREEVRVQSDPAGRLVITGQPEQVDNPWGITPFKKV 406


Top