BLASTX nr result
ID: Perilla23_contig00024226
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00024226
         (859 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_011090808.1| PREDICTED: uncharacterized protein LOC105171...  344   4e-92
ref|XP_011093201.1| PREDICTED: uncharacterized protein LOC105173...  320   7e-85
ref|XP_009604923.1| PREDICTED: putative DNA-binding protein ESCA...  228   3e-57
ref|XP_009789955.1| PREDICTED: uncharacterized protein LOC104237...  224   5e-56
ref|XP_006339435.1| PREDICTED: uncharacterized protein LOC102580...  223   2e-55
ref|XP_010277390.1| PREDICTED: putative DNA-binding protein ESCA...  211   6e-52
ref|XP_004229817.1| PREDICTED: uncharacterized protein LOC101253...  211   7e-52
emb|CDP17777.1| unnamed protein product [Coffea canephora]           210   1e-51
ref|XP_012489770.1| PREDICTED: AT-hook motif nuclear-localized p...  207   8e-51
ref|XP_012489769.1| PREDICTED: AT-hook motif nuclear-localized p...  207   8e-51
ref|XP_010276424.1| PREDICTED: uncharacterized protein LOC104611...  207   8e-51
ref|XP_010277387.1| PREDICTED: putative DNA-binding protein ESCA...  201   8e-49
ref|XP_002511726.1| DNA binding protein, putative [Ricinus commu...  201   8e-49
ref|XP_008438154.1| PREDICTED: uncharacterized protein LOC103483...  199   3e-48
ref|XP_011034408.1| PREDICTED: uncharacterized protein LOC105132...  198   4e-48
ref|XP_010092838.1| hypothetical protein L484_022433 [Morus nota...  198   5e-48
ref|XP_002302537.2| hypothetical protein POPTR_0002s14950g [Popu...  198   5e-48
ref|XP_002320727.1| hypothetical protein POPTR_0014s06550g [Popu...  197   6e-48
ref|XP_008238665.1| PREDICTED: putative DNA-binding protein ESCA...
197 8e-48 ref|XP_007040013.1| AT-hook motif nuclear-localized protein 1 is... 197 8e-48 >ref|XP_011090808.1| PREDICTED: uncharacterized protein LOC105171401 [Sesamum indicum] gi|747044791|ref|XP_011090815.1| PREDICTED: uncharacterized protein LOC105171401 [Sesamum indicum] gi|747044793|ref|XP_011090826.1| PREDICTED: uncharacterized protein LOC105171401 [Sesamum indicum] Length = 313 Score = 344 bits (883), Expect = 4e-92 Identities = 181/255 (70%), Positives = 207/255 (81%), Gaps = 1/255 (0%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PDESS+R FSSVQ+SSSAPPA+GKAY E+ K+ V R IGTEKLDDW++CS Sbjct: 60 PDESSSRTFSSVQVSSSAPPASGKAYTEEDKLNVARPMNSEKKHKSKIGTEKLDDWVDCS 119 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 TGSSFLPHVITVN GEDIS KIMEFSR+GPRAVCIISGSG VS +T+RHP+SS GI TYE Sbjct: 120 TGSSFLPHVITVNTGEDISTKIMEFSREGPRAVCIISGSGTVSTLTIRHPSSSAGITTYE 179 Query: 498 GLFEILSFSGSFTPAEMPD-KYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 GLFEILSFSGSFTP EMPD + G SG MTITLSGADGRVVGGL++GLT+AASPVK+VVAS Sbjct: 180 GLFEILSFSGSFTPMEMPDPRSGTSGRMTITLSGADGRVVGGLIAGLTLAASPVKVVVAS 239 Query: 321 FLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQTAE 142 FL+G+ ELKPKK FTV+ SN +KR S ++QG S S DNPT+WAA+QTAE Sbjct: 240 FLLGSPHELKPKKHFTVDALGPNGAAASNAEKRSSDNVQGPGYSIS-DNPTSWAAMQTAE 298 Query: 141 KSRKAAADINISLQG 97 +SRK+ ADINISLQG Sbjct: 299 RSRKSKADINISLQG 313 >ref|XP_011093201.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] gi|747090980|ref|XP_011093202.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] gi|747090982|ref|XP_011093203.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] gi|747090984|ref|XP_011093204.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] gi|747090986|ref|XP_011093205.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] gi|747090988|ref|XP_011093207.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] 
gi|747090990|ref|XP_011093208.1| PREDICTED: uncharacterized protein LOC105173223 [Sesamum indicum] Length = 318 Score = 320 bits (821), Expect = 7e-85 Identities = 170/260 (65%), Positives = 197/260 (75%), Gaps = 6/260 (2%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PDES++RV S V LSSSAPPA GK+Y E+ K R +G EKLDDW +C Sbjct: 60 PDESTSRVLSPVPLSSSAPPATGKSYVEEKKPTPARPVSSEKKHRSKVGAEKLDDWGDCF 119 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 TGSSFLPHVIT+N+GEDIS KIMEFS QGPR VC+ISGSG VSNVT+RHP+SSGG LTYE Sbjct: 120 TGSSFLPHVITINSGEDISTKIMEFSLQGPRTVCVISGSGTVSNVTIRHPSSSGGTLTYE 179 Query: 498 GLFEILSFSGSFTPAEMPD-KYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 GLFEILSFSGSFTP EMPD K+GRSGMM I+LSGADGRVVGGL++GLT+AASPVK+VVAS Sbjct: 180 GLFEILSFSGSFTPVEMPDSKFGRSGMMAISLSGADGRVVGGLIAGLTIAASPVKVVVAS 239 Query: 321 FLVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAA 157 FL+G+ K +KQ FTV+ +N+DKR+ Q SCS S +NPT WAA Sbjct: 240 FLLGSPFVPKSRKQKTEAAFTVDASGPNAAPATNVDKRVPNETQRPSCSAS-ENPTNWAA 298 Query: 156 IQTAEKSRKAAADINISLQG 97 Q AE+SRK+ ADINISLQG Sbjct: 299 TQVAERSRKSTADINISLQG 318 >ref|XP_009604923.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nicotiana tomentosiformis] Length = 332 Score = 228 bits (582), Expect = 3e-57 Identities = 131/269 (48%), Positives = 163/269 (60%), Gaps = 15/269 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + TR S + +S+SAPP +G E V + P G E L +WI CS Sbjct: 66 PDGAVTRTLSPMPISASAPPTSGSFLSEKVSVARPASEKKPRNKV---GAENLGEWISCS 122 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS G +SNVTLR PN+SGG LTYE Sbjct: 123 TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNTSGGTLTYE 182 Query: 498 GLFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 G FEILS SGSFTP E + R+G M+I+L+ DGRVVGG ++GL +AASPV++VV S Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVGS 242 
Query: 321 FLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKR--------ISGSIQGTSCSTS 184 FL N E KPKKQ SN+D R I G+ S+S Sbjct: 243 FLPSNYQEAKPKKQKAEPKAIPYATVSPAAPHSSNMDPRSSNALTVNIPGAGNQNIISSS 302 Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97 W A+ T + SRK+A DINISLQG Sbjct: 303 TMQTNHWTAMPTVQDSRKSATDINISLQG 331 >ref|XP_009789955.1| PREDICTED: uncharacterized protein LOC104237497 [Nicotiana sylvestris] Length = 332 Score = 224 bits (572), Expect = 5e-56 Identities = 129/269 (47%), Positives = 162/269 (60%), Gaps = 15/269 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + TR S + +S+SAPP +G E V + P G E L +WI CS Sbjct: 66 PDGAVTRTLSPMPISASAPPTSGSFLPEKVSVARPASEKKPRNKV---GAENLGEWISCS 122 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS G +SNVTLR PN+SGG LTYE Sbjct: 123 TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNTSGGTLTYE 182 Query: 498 GLFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 G FEILS SGSFTP E + R+G M+I+L+ DGRVVGG ++GL +AASPV++VV S Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVGS 242 Query: 321 FLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKR--------ISGSIQGTSCSTS 184 FL N E KPKKQ SN++ R I G+ S+S Sbjct: 243 FLPSNYQEAKPKKQKAEPKAIPYATVSPAAPHSSNMEPRSSNALTVNIPGAGNQNIISSS 302 Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97 W + T + SRK+A DINISLQG Sbjct: 303 TIQTNHWTTMPTVQDSRKSATDINISLQG 331 >ref|XP_006339435.1| PREDICTED: uncharacterized protein LOC102580329 [Solanum tuberosum] Length = 332 Score = 223 bits (567), Expect = 2e-55 Identities = 128/269 (47%), Positives = 162/269 (60%), Gaps = 15/269 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + R S + +S+SAPP +G E V + P G E L +WI CS Sbjct: 66 PDGAVARTISPMPISASAPPTSGNFLSEKVSVARPASEKKPRNKV---GAENLGEWISCS 122 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 TG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS G +SNVTLR PNSSGG LTYE Sbjct: 123 
TGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLRQPNSSGGTLTYE 182 Query: 498 GLFEILSFSGSFTPAEMPDK--YGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVA 325 G FEILS SGSFTP E R+G M+I+L+ DGRVVGG ++GL +AASPV++VV Sbjct: 183 GRFEILSLSGSFTPTEFGGSRTTSRTGGMSISLASPDGRVVGGTLAGLLIAASPVQVVVG 242 Query: 324 SFLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKRISGS------IQGT-SCSTS 184 SFL N E KPKKQ SN++ R S + GT + +S Sbjct: 243 SFLPSNYQEAKPKKQKAEPKAIAYGTLSPAAPHSSNMEPRSSNAHTVNVPAAGTQNVISS 302 Query: 183 ADNPTTWAAIQTAEKSRKAAADINISLQG 97 + P W + + + SRK+ DINISLQG Sbjct: 303 SIQPNHWTTMPSVQDSRKSTTDINISLQG 331 >ref|XP_010277390.1| PREDICTED: putative DNA-binding protein ESCAROLA isoform X2 [Nelumbo nucifera] Length = 330 Score = 211 bits (537), Expect = 6e-52 Identities = 124/266 (46%), Positives = 158/266 (59%), Gaps = 12/266 (4%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 P + + S + +SSSAPP K G R E L DW++CS Sbjct: 66 PGGTVSLALSPIPISSSAPPVVSNFSAG--KRGRGRPVGLINREQPKFEVENLGDWVKCS 123 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F PHVITV AGEDI+MKI+ FS+QGPRA+CI+S +G +SNVTLR P+S GG LTYE Sbjct: 124 VGANFTPHVITVAAGEDITMKIISFSQQGPRAICILSANGVISNVTLRQPDSCGGTLTYE 183 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P+E RSG M+++LS DGRVVGG V+GL VAASPV++VV SF Sbjct: 184 GRFEILSLSGSFMPSETGGTRSRSGGMSVSLSSPDGRVVGGGVAGLLVAASPVQVVVGSF 243 Query: 318 LVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQGTS-------CSTSADN 175 L LE KPKKQ TV + L + +G Q S S+S+ Sbjct: 244 LPSTQLEHKPKKQKIEVTSTVTPTTAIPVPNAELQEGYNGQGQQNSATPKPNLASSSSFR 303 Query: 174 PTTWAAIQTAEKSRKAAADINISLQG 97 W+++Q+ +SR +A DINISL G Sbjct: 304 ADNWSSLQSMPESRNSATDINISLPG 329 >ref|XP_004229817.1| PREDICTED: uncharacterized protein LOC101253722 [Solanum lycopersicum] gi|723660675|ref|XP_010325755.1| PREDICTED: uncharacterized protein LOC101253722 [Solanum lycopersicum] gi|723660680|ref|XP_010325760.1| PREDICTED: uncharacterized protein LOC101253722 [Solanum 
lycopersicum] Length = 318 Score = 211 bits (536), Expect = 7e-52 Identities = 117/221 (52%), Positives = 146/221 (66%), Gaps = 15/221 (6%) Frame = -2 Query: 714 GTEKLDDWIECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLR 535 G E L +WI CSTG +FLPH+ITV AGED++MKI+ FS+QGPRA+CIIS G +SNVTLR Sbjct: 97 GAENLGEWISCSTGGNFLPHMITVEAGEDVTMKIISFSQQGPRAICIISAVGLISNVTLR 156 Query: 534 HPNSSGGILTYEGLFEILSFSGSFTPAEMPDK--YGRSGMMTITLSGADGRVVGGLVSGL 361 PNSSGG LTYEG FEILS SGSFTP E R+G M+I+L+ DGRVVGG ++GL Sbjct: 157 QPNSSGGTLTYEGRFEILSLSGSFTPTEFGGSRTTSRTGGMSISLASPDGRVVGGTLAGL 216 Query: 360 TVAASPVKIVVASFLVGNSLELKPKKQ------FTVNXXXXXXXXXSNLDKRISGS---- 211 +AASPV++VV SFL N E+KPKKQ T SN++ R S + Sbjct: 217 LIAASPVQVVVGSFLPSNYQEVKPKKQKAELKAITYGTLSPAAPHSSNMEPRSSNAHTVN 276 Query: 210 --IQGT-SCSTSADNPTTWAAIQTAEKSRKAAADINISLQG 97 GT + +S+ P W A+ + + SRK+ DINISLQG Sbjct: 277 VPAAGTQNVISSSIQPNHWTAMPSVQDSRKSTTDINISLQG 317 >emb|CDP17777.1| unnamed protein product [Coffea canephora] Length = 334 Score = 210 bits (534), Expect = 1e-51 Identities = 126/271 (46%), Positives = 160/271 (59%), Gaps = 18/271 (6%) Frame = -2 Query: 855 DESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECST 676 D ++R S + +SSSAP AG + G E L +W+ CST Sbjct: 67 DGPNSRPLSPMPISSSAPAVAGNFLADKASAG---RRPYTSEKKHKPKVENLGEWVACST 123 Query: 675 GSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYEG 496 G SFLPH+ITVNAGED+S KI+ F + GPRA+C+IS G +SNVTLR PNSSGG LTYEG Sbjct: 124 GGSFLPHMITVNAGEDVSKKIVSFCQNGPRAICVISAVGLISNVTLRQPNSSGGTLTYEG 183 Query: 495 LFEILSFSGSFTPAEM-PDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 FEILS SGSFTP E+ + R+G M+I+L+ DGRVVGG ++GL VAASPV++VV SF Sbjct: 184 RFEILSLSGSFTPTELGGSRVARTGGMSISLASPDGRVVGGTLAGLLVAASPVQVVVGSF 243 Query: 318 LVGNSLELKPK------KQFTVNXXXXXXXXXSNL--DKRISGSIQGTS---------CS 190 L N ELKPK K +N+ + RIS ++QG + + Sbjct: 244 LPSNHNELKPKKHKYEHKSLAAAGSAAAAPRTNNMLVEHRIS-TVQGLNNVISDNQGMVA 302 Query: 189 TSADNPTTWAAIQTAEKSRKAAADINISLQG 97 +S WA I + E SRK+ DINISLQG Sbjct: 303 
SSTLQTANWANISSMEDSRKSNTDINISLQG 333 >ref|XP_012489770.1| PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform X2 [Gossypium raimondii] gi|763773974|gb|KJB41097.1| hypothetical protein B456_007G091400 [Gossypium raimondii] gi|763773975|gb|KJB41098.1| hypothetical protein B456_007G091400 [Gossypium raimondii] Length = 312 Score = 207 bits (527), Expect = 8e-51 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 16/268 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + R S + +SSS PP+ G+ K G R + E L +W S Sbjct: 44 PDGTMARALSPMPISSSVPPSGGEFSSGGGKRGRGRGSGYQIKHQKGMDLENLGEWAATS 103 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 GSSF PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 104 VGSSFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 163 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E RSG M+++L+ ADGRVVGG V+GL +AASPV++VV SF Sbjct: 164 GRFEILSLSGSFMPTETQGTRSRSGGMSVSLASADGRVVGGGVAGLLIAASPVQVVVGSF 223 Query: 318 LVGNSLELKPKKQ-------FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPT--- 169 L GN + KPKKQ SN +K + +++A P+ Sbjct: 224 LPGNQHDQKPKKQKIESIPATVAPNPSIVAAPASNAEKEDGIDVVSPQQNSNALKPSLTG 283 Query: 168 ------TWAAIQTAEKSRKAAADINISL 103 WAA T ++ R +A DINISL Sbjct: 284 ATFRRENWAA--TMQEPRNSATDINISL 309 >ref|XP_012489769.1| PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform X1 [Gossypium raimondii] gi|763773973|gb|KJB41096.1| hypothetical protein B456_007G091400 [Gossypium raimondii] gi|763773976|gb|KJB41099.1| hypothetical protein B456_007G091400 [Gossypium raimondii] Length = 331 Score = 207 bits (527), Expect = 8e-51 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 16/268 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + R S + +SSS PP+ G+ K G R + E L +W S Sbjct: 63 PDGTMARALSPMPISSSVPPSGGEFSSGGGKRGRGRGSGYQIKHQKGMDLENLGEWAATS 122 Query: 678 
TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 GSSF PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 123 VGSSFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 182 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E RSG M+++L+ ADGRVVGG V+GL +AASPV++VV SF Sbjct: 183 GRFEILSLSGSFMPTETQGTRSRSGGMSVSLASADGRVVGGGVAGLLIAASPVQVVVGSF 242 Query: 318 LVGNSLELKPKKQ-------FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPT--- 169 L GN + KPKKQ SN +K + +++A P+ Sbjct: 243 LPGNQHDQKPKKQKIESIPATVAPNPSIVAAPASNAEKEDGIDVVSPQQNSNALKPSLTG 302 Query: 168 ------TWAAIQTAEKSRKAAADINISL 103 WAA T ++ R +A DINISL Sbjct: 303 ATFRRENWAA--TMQEPRNSATDINISL 328 >ref|XP_010276424.1| PREDICTED: uncharacterized protein LOC104611170 [Nelumbo nucifera] gi|719972052|ref|XP_010276433.1| PREDICTED: uncharacterized protein LOC104611170 [Nelumbo nucifera] Length = 330 Score = 207 bits (527), Expect = 8e-51 Identities = 122/266 (45%), Positives = 156/266 (58%), Gaps = 12/266 (4%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + + S + +SSSAPPA + K G R E L +W+ CS Sbjct: 66 PDGTVSLALSPIPISSSAPPAVSEFSAG--KRGRGRPTGLINKQQPKFEIENLGEWVACS 123 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F PHV+TV GED++MKI+ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 124 VGANFTPHVLTVATGEDVTMKIISFSQQGPRAICILSANGAISNVTLRQPDSSGGTLTYE 183 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P+E RSG M+++L+ DGRVVGG V+GL VAASPV++VV SF Sbjct: 184 GRFEILSLSGSFMPSESGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 243 Query: 318 LVGNSLELKPKK---QFTVNXXXXXXXXXSNLDKRISGSIQG---------TSCSTSADN 175 L LE KPKK + T SN + S QG S S+ Sbjct: 244 LPTTQLEHKPKKPKTEVTSTATPTTAIPISNAEMEEGYSDQGQRNSATPKPNLASASSFR 303 Query: 174 PTTWAAIQTAEKSRKAAADINISLQG 97 W+ IQ+ +SR +A DINISL G Sbjct: 304 GENWSTIQSVPESRNSATDINISLPG 329 >ref|XP_010277387.1| PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 
[Nelumbo nucifera] gi|720069279|ref|XP_010277388.1| PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 [Nelumbo nucifera] Length = 346 Score = 201 bits (510), Expect = 8e-49 Identities = 124/282 (43%), Positives = 158/282 (56%), Gaps = 28/282 (9%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 P + + S + +SSSAPP K G R E L DW++CS Sbjct: 66 PGGTVSLALSPIPISSSAPPVVSNFSAG--KRGRGRPVGLINREQPKFEVENLGDWVKCS 123 Query: 678 TGSSFLPHVITVNAGE----------------DISMKIMEFSRQGPRAVCIISGSGRVSN 547 G++F PHVITV AGE DI+MKI+ FS+QGPRA+CI+S +G +SN Sbjct: 124 VGANFTPHVITVAAGEVYVKKKYSFVSSEICQDITMKIISFSQQGPRAICILSANGVISN 183 Query: 546 VTLRHPNSSGGILTYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVS 367 VTLR P+S GG LTYEG FEILS SGSF P+E RSG M+++LS DGRVVGG V+ Sbjct: 184 VTLRQPDSCGGTLTYEGRFEILSLSGSFMPSETGGTRSRSGGMSVSLSSPDGRVVGGGVA 243 Query: 366 GLTVAASPVKIVVASFLVGNSLELKPKKQ-----FTVNXXXXXXXXXSNLDKRISGSIQG 202 GL VAASPV++VV SFL LE KPKKQ TV + L + +G Q Sbjct: 244 GLLVAASPVQVVVGSFLPSTQLEHKPKKQKIEVTSTVTPTTAIPVPNAELQEGYNGQGQQ 303 Query: 201 TS-------CSTSADNPTTWAAIQTAEKSRKAAADINISLQG 97 S S+S+ W+++Q+ +SR +A DINISL G Sbjct: 304 NSATPKPNLASSSSFRADNWSSLQSMPESRNSATDINISLPG 345 >ref|XP_002511726.1| DNA binding protein, putative [Ricinus communis] gi|223548906|gb|EEF50395.1| DNA binding protein, putative [Ricinus communis] Length = 324 Score = 201 bits (510), Expect = 8e-49 Identities = 116/260 (44%), Positives = 153/260 (58%), Gaps = 8/260 (3%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + R S + +SSSAPP + G+ K+ +G E DW S Sbjct: 66 PDGTVARALSPMPISSSAPPGGDFSSGKPGKVW---SGGFEKKKYKKMGMENSGDWASGS 122 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 123 VGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 182 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E RSG M+++L+ DGRVVGG V+GL 
VAASPV++VV SF Sbjct: 183 GRFEILSLSGSFMPTESQGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 242 Query: 318 LVGNSLELKPKK--------QFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTW 163 L GN + KPKK T +N ++ S G ++S+ W Sbjct: 243 LPGNHQDQKPKKIKIDPVPASITPAQTIAIPIPVTNAERDDSMGGHGLQ-NSSSFRRENW 301 Query: 162 AAIQTAEKSRKAAADINISL 103 +Q ++ R + DINISL Sbjct: 302 TTMQPVQEMRTSGTDINISL 321 >ref|XP_008438154.1| PREDICTED: uncharacterized protein LOC103483349 [Cucumis melo] Length = 344 Score = 199 bits (505), Expect = 3e-48 Identities = 123/278 (44%), Positives = 161/278 (57%), Gaps = 24/278 (8%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + T S + LSSSAP A G + K G R +G E + +W CS Sbjct: 70 PDGTVTMALSPLPLSSSAPAAGGFSI---TKRGKGRLGGSEFKHHKKMGMEYIGEWNACS 126 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F+PH+ITVNAGED++MKI+ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 127 VGTNFMPHIITVNAGEDVTMKIISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 186 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E RSG M+++L+ DGRVVGG V+GL +AASPV++VV SF Sbjct: 187 GRFEILSLSGSFMPTENQGTRSRSGGMSVSLASPDGRVVGGGVAGLLIAASPVQVVVGSF 246 Query: 318 LVGNSLELKPKKQ------------FTVNXXXXXXXXXSNLDKRIS---------GSIQG 202 L + E K KKQ + SN D + GS++ Sbjct: 247 LPTSQQEQKVKKQKPPESVPTAAPGSVPSTAPATAMPASNADTEDNLNGNGVQNPGSLKP 306 Query: 201 TSCSTS---ADNPTTWAAIQTAEKSRKAAADINISLQG 97 + S DN T AA+ + ++ R +A DINISL G Sbjct: 307 AGFAPSPFQRDNWGTNAAVHSLQEPRNSATDINISLPG 344 >ref|XP_011034408.1| PREDICTED: uncharacterized protein LOC105132539 [Populus euphratica] gi|743788764|ref|XP_011034415.1| PREDICTED: uncharacterized protein LOC105132539 [Populus euphratica] gi|743788766|ref|XP_011034423.1| PREDICTED: uncharacterized protein LOC105132539 [Populus euphratica] gi|743788770|ref|XP_011034428.1| PREDICTED: uncharacterized protein LOC105132539 [Populus euphratica] Length = 323 Score = 198 bits (504), Expect = 4e-48 Identities = 115/258 (44%), Positives 
= 154/258 (59%), Gaps = 6/258 (2%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD + R S + +S+SAP G + K+ +G E L +W S Sbjct: 68 PDGAVARALSPMPISASAPHTGGDYSAKPGKVW---PGSYEKKKYKKMGMENLGEWAANS 124 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG LTYE Sbjct: 125 VGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYE 184 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E+ RSG M+++L+ DGRVVGG V+GL VAASPV++VV SF Sbjct: 185 GRFEILSLSGSFMPTEIQGSRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVVGSF 244 Query: 318 LVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRI------SGSIQGTSCSTSADNPTTWAA 157 L GN E KPKK ++ + I +G+ QG ++S WA Sbjct: 245 LPGNHQEQKPKKP-KIDSIPATFAPAPAIPASIAEREESAGTPQGQQ-NSSPFQRENWAT 302 Query: 156 IQTAEKSRKAAADINISL 103 + + + R + DINISL Sbjct: 303 MHSMQDVRSSGTDINISL 320 >ref|XP_010092838.1| hypothetical protein L484_022433 [Morus notabilis] gi|587862871|gb|EXB52656.1| hypothetical protein L484_022433 [Morus notabilis] Length = 500 Score = 198 bits (503), Expect = 5e-48 Identities = 117/268 (43%), Positives = 156/268 (58%), Gaps = 14/268 (5%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLD-DWIEC 682 PD + T S + +SSSAPP+ + G K G R +G + +W C Sbjct: 66 PDGTVTMALSPMPISSSAPPSGEFSSG---KRGKARSSGFEYKQHKKVGLDHFSGEWNSC 122 Query: 681 STGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTY 502 S G++F+PH+ITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR +SSGG LTY Sbjct: 123 SLGTNFMPHIITVNAGEDVTMKVISFSQQGPRAICILSANGLISNVTLRQHDSSGGTLTY 182 Query: 501 EGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 EG FEILS SGSF P E R G M+++L+ DGRVVGG V+GL VAASPV++VV S Sbjct: 183 EGRFEILSLSGSFMPTETQGTRSRQGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGS 242 Query: 321 FLVGNSLELKPKK----QFTVNXXXXXXXXXSNLDK---------RISGSIQGTSCSTSA 181 FL N E KPKK TV + D+ + S + + S++ Sbjct: 243 
FLPSNQQEPKPKKLRTEHMTVTPGISMVPPVAEKDQDGMSHGHGHQNSSAPRPNLASSAP 302 Query: 180 DNPTTWAAIQTAEKSRKAAADINISLQG 97 W A+ + SR +A DINISL G Sbjct: 303 FQRENWPAMNSMHDSRNSATDINISLPG 330 >ref|XP_002302537.2| hypothetical protein POPTR_0002s14950g [Populus trichocarpa] gi|550345046|gb|EEE81810.2| hypothetical protein POPTR_0002s14950g [Populus trichocarpa] Length = 325 Score = 198 bits (503), Expect = 5e-48 Identities = 115/261 (44%), Positives = 152/261 (58%), Gaps = 9/261 (3%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXI---GTEKLDDWI 688 PD + R S + +S+SAP G D G P G E L +W Sbjct: 68 PDGAVARALSPMPISASAPSPGG-----DYSAGKPGKVWPGSYEKKKYKKLGMENLGEWA 122 Query: 687 ECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGIL 508 S G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG L Sbjct: 123 ANSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTL 182 Query: 507 TYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVV 328 TYEG FEILS SGSF P E RSG M+++L+ DGRVVGG V+GL VAASPV++VV Sbjct: 183 TYEGRFEILSLSGSFMPTESQGTRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVV 242 Query: 327 ASFLVGNSLELKPKK------QFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTT 166 SFL GN + KPKK T + ++ + G+ G ++S+ Sbjct: 243 GSFLAGNHQDQKPKKPKIDSIPATFAPAPVIPVSIAEREESV-GTPHGQQQNSSSFQREN 301 Query: 165 WAAIQTAEKSRKAAADINISL 103 WA + + + R + DINISL Sbjct: 302 WATMHSMQDVRNSVTDINISL 322 >ref|XP_002320727.1| hypothetical protein POPTR_0014s06550g [Populus trichocarpa] gi|222861500|gb|EEE99042.1| hypothetical protein POPTR_0014s06550g [Populus trichocarpa] Length = 324 Score = 197 bits (502), Expect = 6e-48 Identities = 117/261 (44%), Positives = 153/261 (58%), Gaps = 9/261 (3%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXI---GTEKLDDWI 688 PD + R S + +S+SAP G D G P G E L +W Sbjct: 68 PDGAVARALSPMPISASAPHTGG-----DYSAGKPGKVWPGSYEKKKYKKMGMENLGEWA 122 Query: 687 ECSTGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGIL 508 S G++F PHVITVNAGED++MK++ FS+QGPRA+CI+S +G +SNVTLR P+SSGG L Sbjct: 
123 ANSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTL 182 Query: 507 TYEGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVV 328 TYEG FEILS SGSF P E+ RSG M+++L+ DGRVVGG V+GL VAASPV++VV Sbjct: 183 TYEGRFEILSLSGSFMPTEIQGTRSRSGGMSVSLASPDGRVVGGSVAGLLVAASPVQVVV 242 Query: 327 ASFLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRI------SGSIQGTSCSTSADNPTT 166 SFL GN E KPKK ++ + I +G+ QG ++S Sbjct: 243 GSFLPGNHQEQKPKKP-KIDSIPATFAPAPAIPASIAEREESAGTPQGQQ-NSSPFQREN 300 Query: 165 WAAIQTAEKSRKAAADINISL 103 WA + + + R + DINISL Sbjct: 301 WATMHSMQDVRNSGTDINISL 321 >ref|XP_008238665.1| PREDICTED: putative DNA-binding protein ESCAROLA [Prunus mume] Length = 318 Score = 197 bits (501), Expect = 8e-48 Identities = 119/257 (46%), Positives = 157/257 (61%), Gaps = 3/257 (1%) Frame = -2 Query: 858 PDESSTRVFSSVQLSSSAPPAAGKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIECS 679 PD S T S +SSSAPP E K G + E L +W+ CS Sbjct: 72 PDGSVTMALSPKPISSSAPPPVIDFSAE--KRG--KVKPTSSVSKTKYEVENLGEWVACS 127 Query: 678 TGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTYE 499 G++F PH+ITVN+GED+ MKI+ FS+QGPRA+C++S +G +S+VTLR P+SSGG LTYE Sbjct: 128 VGANFTPHIITVNSGEDVMMKIISFSQQGPRAICVLSANGVISSVTLRQPDSSGGTLTYE 187 Query: 498 GLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVASF 319 G FEILS SGSF P E RSG M+++L+ DGRVVGG V+GL VAASPV++VV SF Sbjct: 188 GRFEILSLSGSFMPNETGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF 247 Query: 318 LVGNSLELKPKKQ---FTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQT 148 L GN E KPKKQ + N S++D + + S +S S DN W+++ + Sbjct: 248 LSGNQHEQKPKKQKHDYISNATPTMAVPISSVDPKPNFS---SSTSFRGDN---WSSLPS 301 Query: 147 AEKSRKAAADINISLQG 97 K++ DIN+SL G Sbjct: 302 DPKTK---TDINVSLPG 315 >ref|XP_007040013.1| AT-hook motif nuclear-localized protein 1 isoform 2 [Theobroma cacao] gi|508777258|gb|EOY24514.1| AT-hook motif nuclear-localized protein 1 isoform 2 [Theobroma cacao] Length = 331 Score = 197 bits (501), Expect = 8e-48 Identities = 114/253 (45%), Positives = 155/253 (61%), Gaps = 1/253 (0%) Frame = -2 Query: 858 
PDESSTRVFSSVQLSSSAPPAA-GKAYGEDVKIGVPRXXXXXXXXXXXIGTEKLDDWIEC 682 PD S T S +S++APP + G+ K+ P E L +W+ C Sbjct: 87 PDGSVTMALSPKPISTAAPPPLIDFSAGKRGKVKSPTSVSKAKYEL-----ENLGEWVAC 141 Query: 681 STGSSFLPHVITVNAGEDISMKIMEFSRQGPRAVCIISGSGRVSNVTLRHPNSSGGILTY 502 S G++F PH+ITVNAGED++MKI+ FS+QGPRA+CI+S +G +S+VTLR P+SSGG LTY Sbjct: 142 SVGANFTPHIITVNAGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTY 201 Query: 501 EGLFEILSFSGSFTPAEMPDKYGRSGMMTITLSGADGRVVGGLVSGLTVAASPVKIVVAS 322 EG FEILS SGSF P++ RSG M+++L+ DGRVVGG V+GL VAASPV++VV S Sbjct: 202 EGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGS 261 Query: 321 FLVGNSLELKPKKQFTVNXXXXXXXXXSNLDKRISGSIQGTSCSTSADNPTTWAAIQTAE 142 FL GN E KPKKQ +S + ++ STS+ +W+++ + Sbjct: 262 FLAGNQHEQKPKKQ----KHEPISAATPMAAIPVSSADPKSNLSTSSFRGDSWSSLPS-- 315 Query: 141 KSRKAAADINISL 103 SR DIN+SL Sbjct: 316 DSRNKPTDINVSL 328