BLASTX nr result

ID: Perilla23_contig00024226 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00024226
         (859 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011090808.1| PREDICTED: uncharacterized protein LOC105171...   344   4e-92
ref|XP_011093201.1| PREDICTED: uncharacterized protein LOC105173...   320   7e-85
ref|XP_009604923.1| PREDICTED: putative DNA-binding protein ESCA...   228   3e-57
ref|XP_009789955.1| PREDICTED: uncharacterized protein LOC104237...   224   5e-56
ref|XP_006339435.1| PREDICTED: uncharacterized protein LOC102580...   223   2e-55
ref|XP_010277390.1| PREDICTED: putative DNA-binding protein ESCA...   211   6e-52
ref|XP_004229817.1| PREDICTED: uncharacterized protein LOC101253...   211   7e-52
emb|CDP17777.1| unnamed protein product [Coffea canephora]            210   1e-51
ref|XP_012489770.1| PREDICTED: AT-hook motif nuclear-localized p...   207   8e-51
ref|XP_012489769.1| PREDICTED: AT-hook motif nuclear-localized p...   207   8e-51
ref|XP_010276424.1| PREDICTED: uncharacterized protein LOC104611...   207   8e-51
ref|XP_010277387.1| PREDICTED: putative DNA-binding protein ESCA...   201   8e-49
ref|XP_002511726.1| DNA binding protein, putative [Ricinus commu...   201   8e-49
ref|XP_008438154.1| PREDICTED: uncharacterized protein LOC103483...   199   3e-48
ref|XP_011034408.1| PREDICTED: uncharacterized protein LOC105132...   198   4e-48
ref|XP_010092838.1| hypothetical protein L484_022433 [Morus nota...   198   5e-48
ref|XP_002302537.2| hypothetical protein POPTR_0002s14950g [Popu...   198   5e-48
ref|XP_002320727.1| hypothetical protein POPTR_0014s06550g [Popu...   197   6e-48
ref|XP_008238665.1| PREDICTED: putative DNA-binding protein ESCA...   197   8e-48
ref|XP_007040013.1| AT-hook motif nuclear-localized protein 1 is...   197   8e-48

>ref|XP_011090808.1| PREDICTED: uncharacterized protein LOC105171401 [Sesamum indicum]
           gi|747044791|ref|XP_011090815.1| PREDICTED:
           uncharacterized protein LOC105171401 [Sesamum indicum]
           gi|747044793|ref|XP_011090826.1| PREDICTED:
           uncharacterized protein LOC105171401 [Sesamum indicum]
          Length = 313

 Score =  344 bits (883), Expect = 4e-92
 Identities = 181/255 (70%), Positives = 207/255 (81%), Gaps = 1/255 (0%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PDESS+R FSSVQ+SSSAPPA+GKAY E+ K+ V R           IGTEKLDDW++CS
Sbjct: 60  PDESSSRTFSSVQVSSSAPPASGKAYTEEDKLNVARPMNSEKKHKSKIGTEKLDDWVDCS 119

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
           TGSSFLPHVITVN GEDIS KIMEFSR+GPRAVCIISGSG VS +T+RHP+SS GI TYE
Sbjct: 120 TGSSFLPHVITVNTGEDISTKIMEFSREGPRAVCIISGSGTVSTLTIRHPSSSAGITTYE 179

Query: 498 GLFEILSFSGSFTPAEMPD-KYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           GLFEILSFSGSFTP EMPD + G SG MTITLSGADGRVVGGL++GLT+AASPVK+VVAS
Sbjct: 180 GLFEILSFSGSFTPMEMPDPRSGTSGRMTITLSGADGRVVGGLIAGLTLAASPVKVVVAS 239

Query: 321 FLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQTAE 142
           FL+G+  ELKPKK FTV+         SN +KR S ++QG   S S DNPT+WAA+QTAE
Sbjct: 240 FLLGSPHELKPKKHFTVDALGPNGAAASNAEKRSSDNVQGPGYSIS-DNPTSWAAMQTAE 298

Query: 141 KSRKAAADINISLQG 97
           +SRK+ ADINISLQG
Sbjct: 299 RSRKSKADINISLQG 313


>ref|XP_011093201.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090980|ref|XP_011093202.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090982|ref|XP_011093203.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090984|ref|XP_011093204.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090986|ref|XP_011093205.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090988|ref|XP_011093207.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
           gi|747090990|ref|XP_011093208.1| PREDICTED:
           uncharacterized protein LOC105173223 [Sesamum indicum]
          Length = 318

 Score =  320 bits (821), Expect = 7e-85
 Identities = 170/260 (65%), Positives = 197/260 (75%), Gaps = 6/260 (2%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PDES++RV S V LSSSAPPA GK+Y E+ K    R           +G EKLDDW +C 
Sbjct: 60  PDESTSRVLSPVPLSSSAPPATGKSYVEEKKPTPARPVSSEKKHRSKVGAEKLDDWGDCF 119

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
           TGSSFLPHVIT+N+GEDIS KIMEFS QGPR VC+ISGSG VSNVT+RHP+SSGG LTYE
Sbjct: 120 TGSSFLPHVITINSGEDISTKIMEFSLQGPRTVCVISGSGTVSNVTIRHPSSSGGTLTYE 179

Query: 498 GLFEILSFSGSFTPAEMPD-KYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           GLFEILSFSGSFTP EMPD K+GRSGMM I+LSGADGRVVGGL++GLT+AASPVK+VVAS
Sbjct: 180 GLFEILSFSGSFTPVEMPDSKFGRSGMMAISLSGADGRVVGGLIAGLTIAASPVKVVVAS 239

Query: 321 FLVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAA 157
           FL+G+    K +KQ     FTV+         +N+DKR+    Q  SCS S +NPT WAA
Sbjct: 240 FLLGSPFVPKSRKQKTEAAFTVDASGPNAAPATNVDKRVPNETQRPSCSAS-ENPTNWAA 298

Query: 156 IQTAEKSRKAAADINISLQG 97
            Q AE+SRK+ ADINISLQG
Sbjct: 299 TQVAERSRKSTADINISLQG 318


>ref|XP_009604923.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nicotiana
           tomentosiformis]
          Length = 332

 Score =  228 bits (582), Expect = 3e-57
 Identities = 131/269 (48%), Positives = 163/269 (60%), Gaps = 15/269 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD + TR  S + +S+SAPP +G    E V +  P             G E L +WI CS
Sbjct: 66  PDGAVTRTLSPMPISASAPPTSGSFLSEKVSVARPASEKKPRNKV---GAENLGEWISCS 122

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
           TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS  G +SNVTLR PN+SGG LTYE
Sbjct: 123 TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNTSGGTLTYE 182

Query: 498 GLFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           G FEILS SGSFTP E    +  R+G M+I+L+  DGRVVGG ++GL +AASPV++VV S
Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVGS 242

Query: 321 FLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKR--------ISGSIQGTSCSTS 184
           FL  N  E KPKKQ                   SN+D R        I G+      S+S
Sbjct: 243 FLPSNYQEAKPKKQKAEPKAIPYATVSPAAPHSSNMDPRSSNALTVNIPGAGNQNIISSS 302

Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97
                 W A+ T + SRK+A DINISLQG
Sbjct: 303 TMQTNHWTAMPTVQDSRKSATDINISLQG 331


>ref|XP_009789955.1| PREDICTED: uncharacterized protein LOC104237497 [Nicotiana
           sylvestris]
          Length = 332

 Score =  224 bits (572), Expect = 5e-56
 Identities = 129/269 (47%), Positives = 162/269 (60%), Gaps = 15/269 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD + TR  S + +S+SAPP +G    E V +  P             G E L +WI CS
Sbjct: 66  PDGAVTRTLSPMPISASAPPTSGSFLPEKVSVARPASEKKPRNKV---GAENLGEWISCS 122

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
           TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS  G +SNVTLR PN+SGG LTYE
Sbjct: 123 TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNTSGGTLTYE 182

Query: 498 GLFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           G FEILS SGSFTP E    +  R+G M+I+L+  DGRVVGG ++GL +AASPV++VV S
Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVGS 242

Query: 321 FLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKR--------ISGSIQGTSCSTS 184
           FL  N  E KPKKQ                   SN++ R        I G+      S+S
Sbjct: 243 FLPSNYQEAKPKKQKAEPKAIPYATVSPAAPHSSNMEPRSSNALTVNIPGAGNQNIISSS 302

Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97
                 W  + T + SRK+A DINISLQG
Sbjct: 303 TIQTNHWTTMPTVQDSRKSATDINISLQG 331


>ref|XP_006339435.1| PREDICTED: uncharacterized protein LOC102580329 [Solanum tuberosum]
          Length = 332

 Score =  223 bits (567), Expect = 2e-55
 Identities = 128/269 (47%), Positives = 162/269 (60%), Gaps = 15/269 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD +  R  S + +S+SAPP +G    E V +  P             G E L +WI CS
Sbjct: 66  PDGAVARTISPMPISASAPPTSGNFLSEKVSVARPASEKKPRNKV---GAENLGEWISCS 122

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
           TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS  G +SNVTLR PNSSGG LTYE
Sbjct: 123 TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNSSGGTLTYE 182

Query: 498 GLFEILSFSGSFTPAEMPDK--YGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVA 325
           G FEILS SGSFTP E        R+G M+I+L+  DGRVVGG ++GL +AASPV++VV 
Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVG 242

Query: 324 SFLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKRISGS------IQGT-SCSTS 184
           SFL  N  E KPKKQ                   SN++ R S +        GT +  +S
Sbjct: 243 SFLPSNYQEAKPKKQKAEPKAIAYGTLSPAAPHSSNMEPRSSNAHTVNVPAAGTQNVISS 302

Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97
           +  P  W  + + + SRK+  DINISLQG
Sbjct: 303 SIQPNHWTTMPSVQDSRKSTTDINISLQG 331


>ref|XP_010277390.1| PREDICTED: putative DNA-binding protein ESCAROLA isoform X2
           [Nelumbo nucifera]
          Length = 330

 Score =  211 bits (537), Expect = 6e-52
 Identities = 124/266 (46%), Positives = 158/266 (59%), Gaps = 12/266 (4%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           P  + +   S + +SSSAPP          K G  R              E L DW++CS
Sbjct: 66  PGGTVSLALSPIPISSSAPPVVSNFSAG--KRGRGRPVGLINREQPKFEVENLGDWVKCS 123

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F PHVITV AGEDI+MKI+ FS+QGPRA+CI+S +G +SNVTLR P+S GG LTYE
Sbjct: 124 VGANFTPHVITVAAGEDITMKIISFSQQGPRAICILSANGVISNVTLRQPDSCGGTLTYE 183

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P+E      RSG M+++LS  DGRVVGG V+GL VAASPV++VV SF
Sbjct: 184 GRFEILSLSGSFMPSETGGTRSRSGGMSVSLSSPDGRVVGGGVAGLLVAASPVQVVVGSF 243

Query: 318 LVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQGTS-------CSTSADN 175
           L    LE KPKKQ      TV          + L +  +G  Q  S        S+S+  
Sbjct: 244 LPSTQLEHKPKKQKIEVTSTVTPTTAIPVPNAELQEGYNGQGQQNSATPKPNLASSSSFR 303

Query: 174 PTTWAAIQTAEKSRKAAADINISLQG 97
              W+++Q+  +SR +A DINISL G
Sbjct: 304 ADNWSSLQSMPESRNSATDINISLPG 329


>ref|XP_004229817.1| PREDICTED: uncharacterized protein LOC101253722 [Solanum
           lycopersicum] gi|723660675|ref|XP_010325755.1|
           PREDICTED: uncharacterized protein LOC101253722 [Solanum
           lycopersicum] gi|723660680|ref|XP_010325760.1|
           PREDICTED: uncharacterized protein LOC101253722 [Solanum
           lycopersicum]
          Length = 318

 Score =  211 bits (536), Expect = 7e-52
 Identities = 117/221 (52%), Positives = 146/221 (66%), Gaps = 15/221 (6%)
 Frame = -2

Query: 714 GTEKLDDWIECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLR 535
           G E L +WI CSTG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS  G +SNVTLR
Sbjct: 97  GAENLGEWISCSTGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLR 156

Query: 534 HPNSSGGILTYEGLFEILSFSGSFTPAEMPDK--YGRSGMMTITLSGADGRVVGGLVSGL 361
            PNSSGG LTYEG FEILS SGSFTP E        R+G M+I+L+  DGRVVGG ++GL
Sbjct: 157 QPNSSGGTLTYEGRFEILSLSGSFTPTEFGGSRTTSRTGGMSISLASPDGRVVGGTLAGL 216

Query: 360 TVAASPVKIVVASFLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKRISGS---- 211
            +AASPV++VV SFL  N  E+KPKKQ       T           SN++ R S +    
Sbjct: 217 LIAASPVQVVVGSFLPSNYQEVKPKKQKAELKAITYGTLSPAAPHSSNMEPRSSNAHTVN 276

Query: 210 --IQGT-SCSTSADNPTTWAAIQTAEKSRKAAADINISLQG 97
               GT +  +S+  P  W A+ + + SRK+  DINISLQG
Sbjct: 277 VPAAGTQNVISSSIQPNHWTAMPSVQDSRKSTTDINISLQG 317


>emb|CDP17777.1| unnamed protein product [Coffea canephora]
          Length = 334

 Score =  210 bits (534), Expect = 1e-51
 Identities = 126/271 (46%), Positives = 160/271 (59%), Gaps = 18/271 (6%)
 Frame = -2

Query: 855 DESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECST 676
           D  ++R  S + +SSSAP  AG    +    G                 E L +W+ CST
Sbjct: 67  DGPNSRPLSPMPISSSAPAVAGNFLADKASAG---RRPYTSEKKHKPKVENLGEWVACST 123

Query: 675 GSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYEG 496
           G SFLPH+ITVNAGED+S KI+ F + GPRA+C+IS  G +SNVTLR PNSSGG LTYEG
Sbjct: 124 GGSFLPHMITVNAGEDVSKKIVSFCQNGPRAICVISAVGLISNVTLRQPNSSGGTLTYEG 183

Query: 495 LFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
            FEILS SGSFTP E+   +  R+G M+I+L+  DGRVVGG ++GL VAASPV++VV SF
Sbjct: 184 RFEILSLSGSFTPTELGGSRVARTGGMSISLASPDGRVVGGTLAGLLVAASPVQVVVGSF 243

Query: 318 LVGNSLELKPK------KQFTVNXXXXXXXXXSNL--DKRISGSIQGTS---------CS 190
           L  N  ELKPK      K              +N+  + RIS ++QG +          +
Sbjct: 244 LPSNHNELKPKKHKYEHKSLAAAGSAAAAPRTNNMLVEHRIS-TVQGLNNVISDNQGMVA 302

Query: 189 TSADNPTTWAAIQTAEKSRKAAADINISLQG 97
           +S      WA I + E SRK+  DINISLQG
Sbjct: 303 SSTLQTANWANISSMEDSRKSNTDINISLQG 333


>ref|XP_012489770.1| PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform
           X2 [Gossypium raimondii] gi|763773974|gb|KJB41097.1|
           hypothetical protein B456_007G091400 [Gossypium
           raimondii] gi|763773975|gb|KJB41098.1| hypothetical
           protein B456_007G091400 [Gossypium raimondii]
          Length = 312

 Score =  207 bits (527), Expect = 8e-51
 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 16/268 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD +  R  S + +SSS PP+ G+      K G  R           +  E L +W   S
Sbjct: 44  PDGTMARALSPMPISSSVPPSGGEFSSGGGKRGRGRGSGYQIKHQKGMDLENLGEWAATS 103

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            GSSF PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 104 VGSSFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 163

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E      RSG M+++L+ ADGRVVGG V+GL +AASPV++VV SF
Sbjct: 164 GRFEILSLSGSFMPTETQGTRSRSGGMSVSLASADGRVVGGGVAGLLIAASPVQVVVGSF 223

Query: 318 LVGNSLELKPKKQ-------FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPT--- 169
           L GN  + KPKKQ                    SN +K     +     +++A  P+   
Sbjct: 224 LPGNQHDQKPKKQKIESIPATVAPNPSIVAAPASNAEKEDGIDVVSPQQNSNALKPSLTG 283

Query: 168 ------TWAAIQTAEKSRKAAADINISL 103
                  WAA  T ++ R +A DINISL
Sbjct: 284 ATFRRENWAA--TMQEPRNSATDINISL 309


>ref|XP_012489769.1| PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform
           X1 [Gossypium raimondii] gi|763773973|gb|KJB41096.1|
           hypothetical protein B456_007G091400 [Gossypium
           raimondii] gi|763773976|gb|KJB41099.1| hypothetical
           protein B456_007G091400 [Gossypium raimondii]
          Length = 331

 Score =  207 bits (527), Expect = 8e-51
 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 16/268 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD +  R  S + +SSS PP+ G+      K G  R           +  E L +W   S
Sbjct: 63  PDGTMARALSPMPISSSVPPSGGEFSSGGGKRGRGRGSGYQIKHQKGMDLENLGEWAATS 122

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            GSSF PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 123 VGSSFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 182

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E      RSG M+++L+ ADGRVVGG V+GL +AASPV++VV SF
Sbjct: 183 GRFEILSLSGSFMPTETQGTRSRSGGMSVSLASADGRVVGGGVAGLLIAASPVQVVVGSF 242

Query: 318 LVGNSLELKPKKQ-------FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPT--- 169
           L GN  + KPKKQ                    SN +K     +     +++A  P+   
Sbjct: 243 LPGNQHDQKPKKQKIESIPATVAPNPSIVAAPASNAEKEDGIDVVSPQQNSNALKPSLTG 302

Query: 168 ------TWAAIQTAEKSRKAAADINISL 103
                  WAA  T ++ R +A DINISL
Sbjct: 303 ATFRRENWAA--TMQEPRNSATDINISL 328


>ref|XP_010276424.1| PREDICTED: uncharacterized protein LOC104611170 [Nelumbo nucifera]
           gi|719972052|ref|XP_010276433.1| PREDICTED:
           uncharacterized protein LOC104611170 [Nelumbo nucifera]
          Length = 330

 Score =  207 bits (527), Expect = 8e-51
 Identities = 122/266 (45%), Positives = 156/266 (58%), Gaps = 12/266 (4%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD + +   S + +SSSAPPA  +      K G  R              E L +W+ CS
Sbjct: 66  PDGTVSLALSPIPISSSAPPAVSEFSAG--KRGRGRPTGLINKQQPKFEIENLGEWVACS 123

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F PHV+TV  GED++MKI+ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 124 VGANFTPHVLTVATGEDVTMKIISFSQQGPRAICILSANGAISNVTLRQPDSSGGTLTYE 183

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P+E      RSG M+++L+  DGRVVGG V+GL VAASPV++VV SF
Sbjct: 184 GRFEILSLSGSFMPSESGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 243

Query: 318 LVGNSLELKPKK---QFTVNXXXXXXXXXSNLDKRISGSIQG---------TSCSTSADN 175
           L    LE KPKK   + T           SN +     S QG            S S+  
Sbjct: 244 LPTTQLEHKPKKPKTEVTSTATPTTAIPISNAEMEEGYSDQGQRNSATPKPNLASASSFR 303

Query: 174 PTTWAAIQTAEKSRKAAADINISLQG 97
              W+ IQ+  +SR +A DINISL G
Sbjct: 304 GENWSTIQSVPESRNSATDINISLPG 329


>ref|XP_010277387.1| PREDICTED: putative DNA-binding protein ESCAROLA isoform X1
           [Nelumbo nucifera] gi|720069279|ref|XP_010277388.1|
           PREDICTED: putative DNA-binding protein ESCAROLA isoform
           X1 [Nelumbo nucifera]
          Length = 346

 Score =  201 bits (510), Expect = 8e-49
 Identities = 124/282 (43%), Positives = 158/282 (56%), Gaps = 28/282 (9%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           P  + +   S + +SSSAPP          K G  R              E L DW++CS
Sbjct: 66  PGGTVSLALSPIPISSSAPPVVSNFSAG--KRGRGRPVGLINREQPKFEVENLGDWVKCS 123

Query: 678 TGSSFLPHVITVNAGE----------------DISMKIMEFSRQGPRAVCIISGSGRVSN 547
            G++F PHVITV AGE                DI+MKI+ FS+QGPRA+CI+S +G +SN
Sbjct: 124 VGANFTPHVITVAAGEVYVKKKYSFVSSEICQDITMKIISFSQQGPRAICILSANGVISN 183

Query: 546 VTLRHPNSSGGILTYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVS 367
           VTLR P+S GG LTYEG FEILS SGSF P+E      RSG M+++LS  DGRVVGG V+
Sbjct: 184 VTLRQPDSCGGTLTYEGRFEILSLSGSFMPSETGGTRSRSGGMSVSLSSPDGRVVGGGVA 243

Query: 366 GLTVAASPVKIVVASFLVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQG 202
           GL VAASPV++VV SFL    LE KPKKQ      TV          + L +  +G  Q 
Sbjct: 244 GLLVAASPVQVVVGSFLPSTQLEHKPKKQKIEVTSTVTPTTAIPVPNAELQEGYNGQGQQ 303

Query: 201 TS-------CSTSADNPTTWAAIQTAEKSRKAAADINISLQG 97
            S        S+S+     W+++Q+  +SR +A DINISL G
Sbjct: 304 NSATPKPNLASSSSFRADNWSSLQSMPESRNSATDINISLPG 345


>ref|XP_002511726.1| DNA binding protein, putative [Ricinus communis]
           gi|223548906|gb|EEF50395.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 324

 Score =  201 bits (510), Expect = 8e-49
 Identities = 116/260 (44%), Positives = 153/260 (58%), Gaps = 8/260 (3%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD +  R  S + +SSSAPP    + G+  K+               +G E   DW   S
Sbjct: 66  PDGTVARALSPMPISSSAPPGGDFSSGKPGKVW---SGGFEKKKYKKMGMENSGDWASGS 122

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 123 VGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 182

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E      RSG M+++L+  DGRVVGG V+GL VAASPV++VV SF
Sbjct: 183 GRFEILSLSGSFMPTESQGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 242

Query: 318 LVGNSLELKPKK--------QFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTW 163
           L GN  + KPKK          T           +N ++  S    G   ++S+     W
Sbjct: 243 LPGNHQDQKPKKIKIDPVPASITPAQTIAIPIPVTNAERDDSMGGHGLQ-NSSSFRRENW 301

Query: 162 AAIQTAEKSRKAAADINISL 103
             +Q  ++ R +  DINISL
Sbjct: 302 TTMQPVQEMRTSGTDINISL 321


>ref|XP_008438154.1| PREDICTED: uncharacterized protein LOC103483349 [Cucumis melo]
          Length = 344

 Score =  199 bits (505), Expect = 3e-48
 Identities = 123/278 (44%), Positives = 161/278 (57%), Gaps = 24/278 (8%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD + T   S + LSSSAP A G +     K G  R           +G E + +W  CS
Sbjct: 70  PDGTVTMALSPLPLSSSAPAAGGFSI---TKRGKGRLGGSEFKHHKKMGMEYIGEWNACS 126

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F+PH+ITVNAGED++MKI+ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 127 VGTNFMPHIITVNAGEDVTMKIISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 186

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E      RSG M+++L+  DGRVVGG V+GL +AASPV++VV SF
Sbjct: 187 GRFEILSLSGSFMPTENQGTRSRSGGMSVSLASPDGRVVGGGVAGLLIAASPVQVVVGSF 246

Query: 318 LVGNSLELKPKKQ------------FTVNXXXXXXXXXSNLDKRIS---------GSIQG 202
           L  +  E K KKQ               +         SN D   +         GS++ 
Sbjct: 247 LPTSQQEQKVKKQKPPESVPTAAPGSVPSTAPATAMPASNADTEDNLNGNGVQNPGSLKP 306

Query: 201 TSCSTS---ADNPTTWAAIQTAEKSRKAAADINISLQG 97
              + S    DN  T AA+ + ++ R +A DINISL G
Sbjct: 307 AGFAPSPFQRDNWGTNAAVHSLQEPRNSATDINISLPG 344


>ref|XP_011034408.1| PREDICTED: uncharacterized protein LOC105132539 [Populus
           euphratica] gi|743788764|ref|XP_011034415.1| PREDICTED:
           uncharacterized protein LOC105132539 [Populus
           euphratica] gi|743788766|ref|XP_011034423.1| PREDICTED:
           uncharacterized protein LOC105132539 [Populus
           euphratica] gi|743788770|ref|XP_011034428.1| PREDICTED:
           uncharacterized protein LOC105132539 [Populus
           euphratica]
          Length = 323

 Score =  198 bits (504), Expect = 4e-48
 Identities = 115/258 (44%), Positives = 154/258 (59%), Gaps = 6/258 (2%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD +  R  S + +S+SAP   G    +  K+               +G E L +W   S
Sbjct: 68  PDGAVARALSPMPISASAPHTGGDYSAKPGKVW---PGSYEKKKYKKMGMENLGEWAANS 124

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE
Sbjct: 125 VGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 184

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E+     RSG M+++L+  DGRVVGG V+GL VAASPV++VV SF
Sbjct: 185 GRFEILSLSGSFMPTEIQGSRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVVGSF 244

Query: 318 LVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRI------SGSIQGTSCSTSADNPTTWAA 157
           L GN  E KPKK   ++           +   I      +G+ QG   ++S      WA 
Sbjct: 245 LPGNHQEQKPKKP-KIDSIPATFAPAPAIPASIAEREESAGTPQGQQ-NSSPFQRENWAT 302

Query: 156 IQTAEKSRKAAADINISL 103
           + + +  R +  DINISL
Sbjct: 303 MHSMQDVRSSGTDINISL 320


>ref|XP_010092838.1| hypothetical protein L484_022433 [Morus notabilis]
           gi|587862871|gb|EXB52656.1| hypothetical protein
           L484_022433 [Morus notabilis]
          Length = 500

 Score =  198 bits (503), Expect = 5e-48
 Identities = 117/268 (43%), Positives = 156/268 (58%), Gaps = 14/268 (5%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLD-DWIEC 682
           PD + T   S + +SSSAPP+   + G   K G  R           +G +    +W  C
Sbjct: 66  PDGTVTMALSPMPISSSAPPSGEFSSG---KRGKARSSGFEYKQHKKVGLDHFSGEWNSC 122

Query: 681 STGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTY 502
           S G++F+PH+ITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR  +SSGG LTY
Sbjct: 123 SLGTNFMPHIITVNAGEDVTMKVISFSQQGPRAICILSANGLISNVTLRQHDSSGGTLTY 182

Query: 501 EGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           EG FEILS SGSF P E      R G M+++L+  DGRVVGG V+GL VAASPV++VV S
Sbjct: 183 EGRFEILSLSGSFMPTETQGTRSRQGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGS 242

Query: 321 FLVGNSLELKPKK----QFTVNXXXXXXXXXSNLDK---------RISGSIQGTSCSTSA 181
           FL  N  E KPKK      TV          +  D+         + S + +    S++ 
Sbjct: 243 FLPSNQQEPKPKKLRTEHMTVTPGISMVPPVAEKDQDGMSHGHGHQNSSAPRPNLASSAP 302

Query: 180 DNPTTWAAIQTAEKSRKAAADINISLQG 97
                W A+ +   SR +A DINISL G
Sbjct: 303 FQRENWPAMNSMHDSRNSATDINISLPG 330


>ref|XP_002302537.2| hypothetical protein POPTR_0002s14950g [Populus trichocarpa]
           gi|550345046|gb|EEE81810.2| hypothetical protein
           POPTR_0002s14950g [Populus trichocarpa]
          Length = 325

 Score =  198 bits (503), Expect = 5e-48
 Identities = 115/261 (44%), Positives = 152/261 (58%), Gaps = 9/261 (3%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXI---GTEKLDDWI 688
           PD +  R  S + +S+SAP   G     D   G P                G E L +W 
Sbjct: 68  PDGAVARALSPMPISASAPSPGG-----DYSAGKPGKVWPGSYEKKKYKKLGMENLGEWA 122

Query: 687 ECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGIL 508
             S G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG L
Sbjct: 123 ANSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTL 182

Query: 507 TYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVV 328
           TYEG FEILS SGSF P E      RSG M+++L+  DGRVVGG V+GL VAASPV++VV
Sbjct: 183 TYEGRFEILSLSGSFMPTESQGTRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVV 242

Query: 327 ASFLVGNSLELKPKK------QFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTT 166
            SFL GN  + KPKK        T           +  ++ + G+  G   ++S+     
Sbjct: 243 GSFLAGNHQDQKPKKPKIDSIPATFAPAPVIPVSIAEREESV-GTPHGQQQNSSSFQREN 301

Query: 165 WAAIQTAEKSRKAAADINISL 103
           WA + + +  R +  DINISL
Sbjct: 302 WATMHSMQDVRNSVTDINISL 322


>ref|XP_002320727.1| hypothetical protein POPTR_0014s06550g [Populus trichocarpa]
           gi|222861500|gb|EEE99042.1| hypothetical protein
           POPTR_0014s06550g [Populus trichocarpa]
          Length = 324

 Score =  197 bits (502), Expect = 6e-48
 Identities = 117/261 (44%), Positives = 153/261 (58%), Gaps = 9/261 (3%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXI---GTEKLDDWI 688
           PD +  R  S + +S+SAP   G     D   G P                G E L +W 
Sbjct: 68  PDGAVARALSPMPISASAPHTGG-----DYSAGKPGKVWPGSYEKKKYKKMGMENLGEWA 122

Query: 687 ECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGIL 508
             S G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG L
Sbjct: 123 ANSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTL 182

Query: 507 TYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVV 328
           TYEG FEILS SGSF P E+     RSG M+++L+  DGRVVGG V+GL VAASPV++VV
Sbjct: 183 TYEGRFEILSLSGSFMPTEIQGTRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVV 242

Query: 327 ASFLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRI------SGSIQGTSCSTSADNPTT 166
            SFL GN  E KPKK   ++           +   I      +G+ QG   ++S      
Sbjct: 243 GSFLPGNHQEQKPKKP-KIDSIPATFAPAPAIPASIAEREESAGTPQGQQ-NSSPFQREN 300

Query: 165 WAAIQTAEKSRKAAADINISL 103
           WA + + +  R +  DINISL
Sbjct: 301 WATMHSMQDVRNSGTDINISL 321


>ref|XP_008238665.1| PREDICTED: putative DNA-binding protein ESCAROLA [Prunus mume]
          Length = 318

 Score =  197 bits (501), Expect = 8e-48
 Identities = 119/257 (46%), Positives = 157/257 (61%), Gaps = 3/257 (1%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679
           PD S T   S   +SSSAPP       E  K G  +              E L +W+ CS
Sbjct: 72  PDGSVTMALSPKPISSSAPPPVIDFSAE--KRG--KVKPTSSVSKTKYEVENLGEWVACS 127

Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499
            G++F PH+ITVN+GED+ MKI+ FS+QGPRA+C++S +G +S+VTLR P+SSGG LTYE
Sbjct: 128 VGANFTPHIITVNSGEDVMMKIISFSQQGPRAICVLSANGVISSVTLRQPDSSGGTLTYE 187

Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319
           G FEILS SGSF P E      RSG M+++L+  DGRVVGG V+GL VAASPV++VV SF
Sbjct: 188 GRFEILSLSGSFMPNETGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 247

Query: 318 LVGNSLELKPKKQ---FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQT 148
           L GN  E KPKKQ   +  N         S++D + + S   +S S   DN   W+++ +
Sbjct: 248 LSGNQHEQKPKKQKHDYISNATPTMAVPISSVDPKPNFS---SSTSFRGDN---WSSLPS 301

Query: 147 AEKSRKAAADINISLQG 97
             K++    DIN+SL G
Sbjct: 302 DPKTK---TDINVSLPG 315


>ref|XP_007040013.1| AT-hook motif nuclear-localized protein 1 isoform 2 [Theobroma
           cacao] gi|508777258|gb|EOY24514.1| AT-hook motif
           nuclear-localized protein 1 isoform 2 [Theobroma cacao]
          Length = 331

 Score =  197 bits (501), Expect = 8e-48
 Identities = 114/253 (45%), Positives = 155/253 (61%), Gaps = 1/253 (0%)
 Frame = -2

Query: 858 PDESSTRVFSSVQLSSSAPPAA-GKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIEC 682
           PD S T   S   +S++APP     + G+  K+  P               E L +W+ C
Sbjct: 87  PDGSVTMALSPKPISTAAPPPLIDFSAGKRGKVKSPTSVSKAKYEL-----ENLGEWVAC 141

Query: 681 STGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTY 502
           S G++F PH+ITVNAGED++MKI+ FS+QGPRA+CI+S +G +S+VTLR P+SSGG LTY
Sbjct: 142 SVGANFTPHIITVNAGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTY 201

Query: 501 EGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322
           EG FEILS SGSF P++      RSG M+++L+  DGRVVGG V+GL VAASPV++VV S
Sbjct: 202 EGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGS 261

Query: 321 FLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQTAE 142
           FL GN  E KPKKQ                   +S +   ++ STS+    +W+++ +  
Sbjct: 262 FLAGNQHEQKPKKQ----KHEPISAATPMAAIPVSSADPKSNLSTSSFRGDSWSSLPS-- 315

Query: 141 KSRKAAADINISL 103
            SR    DIN+SL
Sbjct: 316 DSRNKPTDINVSL 328

