BLASTX nr result
ID: Ephedra27_contig00002693
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00002693
         (1366 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006402494.1| hypothetical protein EUTSA_v10006069mg [Eutr...  165  5e-38
gb|ADE76494.1| unknown [Picea sitchensis]                            162  2e-37
ref|XP_006291414.1| hypothetical protein CARUB_v10017552mg [Caps...  157  8e-36
ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204...  157  1e-35
ref|NP_191690.2| AT hook motif DNA-binding family protein [Arabi...  153  1e-34
emb|CAB71061.1| putative DNA-binding protein [Arabidopsis thaliana]  153  1e-34
ref|XP_002878374.1| hypothetical protein ARALYDRAFT_324562 [Arab...  152  2e-34
gb|ADE76393.1| unknown [Picea sitchensis]                            152  2e-34
ref|XP_006847725.1| hypothetical protein AMTR_s00149p00085280 [A...  151  5e-34
ref|XP_004253116.1| PREDICTED: uncharacterized protein LOC101247...  150  1e-33
gb|EMJ19482.1| hypothetical protein PRUPE_ppa008388mg [Prunus pe...  150  2e-33
ref|XP_006849966.1| hypothetical protein AMTR_s00022p00149810 [A...  149  2e-33
gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guin...  148  6e-33
ref|XP_002275328.1| PREDICTED: uncharacterized protein LOC100263...  147  1e-32
emb|CAN64876.1| hypothetical protein VITISV_030792 [Vitis vinifera]  147  1e-32
ref|XP_006291415.1| hypothetical protein CARUB_v10017552mg [Caps...  147  1e-32
ref|NP_182109.1| AT hook motif DNA-binding family protein [Arabi...  147  1e-32
ref|XP_003524712.2| PREDICTED: putative DNA-binding protein ESCA...  146  2e-32
gb|EXB56269.1| Putative DNA-binding protein ESCAROLA [Morus nota...
146 2e-32 ref|XP_006294414.1| hypothetical protein CARUB_v10023431mg [Caps... 146 2e-32 >ref|XP_006402494.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] gi|567182785|ref|XP_006402495.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] gi|567182788|ref|XP_006402496.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] gi|557103593|gb|ESQ43947.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] gi|557103594|gb|ESQ43948.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] gi|557103595|gb|ESQ43949.1| hypothetical protein EUTSA_v10006069mg [Eutrema salsugineum] Length = 350 Score = 165 bits (417), Expect = 5e-38 Identities = 134/363 (36%), Positives = 174/363 (47%), Gaps = 19/363 (5%) Frame = +3 Query: 156 PGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNN---NINGYFSSPNTINDD 326 PGS P QP + QG H + +N + +P+ + N +G+ S P I Sbjct: 22 PGSGPPPPQTQPTFHGSQGFHHFS-NSNSPFGSNTNPNPNPNPGGGSSGFVSPPLPIESS 80 Query: 327 GDEEGVQGAKLSS-----KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXX 491 + SS KRKRGRPRKYG D Sbjct: 81 PADSSAAAPPPSSGETSVKRKRGRPRKYGQDGSVSLALSPSVSTM--------------- 125 Query: 492 XXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASKI 656 P RGRGRP GS GKK++LA+ G L G FTPHVI+V GED+ASK+ Sbjct: 126 ------SPNSNKRGRGRPPGS-GKKQRLASIGDLMPSSSGMSFTPHVIVVSVGEDIASKV 178 Query: 657 LSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI-- 830 +SFSQQGP C+LSA GA+S AT Q P T ++GRFELISLS S Sbjct: 179 ISFSQQGPRAICVLSASGAVSTATLLQPSPSHGAIT--------YEGRFELISLSTSYLN 230 Query: 831 SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFI--IDKKDLKKLHN 1001 + +ND RT L+ SLA+ D +VI GG+ G LIAAS VQV+VGSFI I K LKK Sbjct: 231 ATDNDYPNRTGNLAVSLASPDGRVIGGGIGGPLIAASPVQVIVGSFIWAIPKAKLKK--R 288 Query: 1002 DVDSTALANSSATNHQVSQVENHSSFQLNSNFIPLPS-NSCQTSNHNYYASPTSRSYHGC 1178 D S + ++ A +EN+ + S +P S N Q+ + P S H Sbjct: 289 DETSEDVQDTEA-------LENNENIAATSPPVPQQSQNLVQSPVGIWSTGPRSMDLHHA 341 Query: 1179 NME 1187 +++ Sbjct: 342 HID 344 >gb|ADE76494.1| unknown [Picea sitchensis] 
Length = 302 Score = 162 bits (411), Expect = 2e-37 Identities = 107/230 (46%), Positives = 134/230 (58%), Gaps = 3/230 (1%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPSRGRGRP 545 KRKRGRPRKYG D RGRGRP Sbjct: 37 KRKRGRPRKYGPDGSMALALSPFSALPGMTGSSSQ------------------KRGRGRP 78 Query: 546 NGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIGAISEA 725 G+ G+K++LAA G+ G GFTPHVI + GEDVA+KI+SFSQQGP CILSA GAIS Sbjct: 79 PGT-GRKQQLAALGSAGVGFTPHVITIAAGEDVATKIMSFSQQGPRAVCILSANGAISNV 137 Query: 726 TFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQTRTSRLSASLATNDSQV 902 T +Q P + GT +++GRF+++SLS S + EN+G RT LS SLA D +V Sbjct: 138 TVRQ--PAASGGT------VTYEGRFDIVSLSGSFLLMENNGARRTGGLSISLAGPDGRV 189 Query: 903 IGGVV-GMLIAASTVQVVVGSFIID-KKDLKKLHNDVDSTALANSSATNH 1046 +GGVV GML+AAS VQV+ GSFI+D KK K N V S+ L + +A+ H Sbjct: 190 VGGVVAGMLMAASPVQVIAGSFILDSKKGQGKPENPVSSSGLPHVAASGH 239 >ref|XP_006291414.1| hypothetical protein CARUB_v10017552mg [Capsella rubella] gi|482560121|gb|EOA24312.1| hypothetical protein CARUB_v10017552mg [Capsella rubella] Length = 350 Score = 157 bits (398), Expect = 8e-36 Identities = 123/318 (38%), Positives = 158/318 (49%), Gaps = 23/318 (7%) Frame = +3 Query: 156 PGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDD-GD 332 PGS P QP + QG H + +N + P+P+ + G+ S P + D Sbjct: 22 PGSGPPPPQTQPTFHGSQGFHHFS-NSNSPFGSNPNPNPGGGSA-GFVSPPLQVESSPAD 79 Query: 333 EEGVQGAKL--------SSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXX 488 GA + S KRKRGRPRKYG D Sbjct: 80 SSATAGAAVAPPPSGDTSVKRKRGRPRKYGQDGSVSLALSPSVSN--------------- 124 Query: 489 XXXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASK 653 P RGRGRP GS G+K++L G L G FTPHVI+V GED+ASK Sbjct: 125 ------VSPNSNKRGRGRPPGS-GRKQRLTTVGELMPSSSGMSFTPHVIVVSIGEDIASK 177 Query: 654 ILSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI- 830 ++SFS QGP C+LSA GA+S AT Q P + GT +++GRFELISLS S Sbjct: 178 VISFSHQGPRAICVLSASGAVSTATLLQPPP--SHGTI------TYEGRFELISLSTSYL 229 Query: 831 
-SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFII-----DKKDLK 989 + +ND RT L+ SLA+ D +VI GG+ G LIAAS+VQV+VGSFI K + Sbjct: 230 NTTDNDYPNRTGNLAVSLASPDGRVIGGGIGGPLIAASSVQVIVGSFIWAAPKGKTKKRE 289 Query: 990 KLHNDV-DSTALANSSAT 1040 + DV D+ AL N+ T Sbjct: 290 ETSEDVQDTDALDNNDNT 307 >ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204243 [Cucumis sativus] gi|449511145|ref|XP_004163876.1| PREDICTED: uncharacterized LOC101204243 [Cucumis sativus] Length = 362 Score = 157 bits (396), Expect = 1e-35 Identities = 105/281 (37%), Positives = 142/281 (50%), Gaps = 2/281 (0%) Frame = +3 Query: 150 HQPGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDDG 329 + P S +P + P Q A+ + S M ++ N Y S + + G Sbjct: 34 NMPNSNNTSPLINPNSAAAQMMSSASRFPFNSMMGSSSKPSESPNAASYDGSQSELRTGG 93 Query: 330 DEEGVQGAKLSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXA 509 S K+KRGRPRKY D Sbjct: 94 FNID------SGKKKRGRPRKYSPDGNIALGLSPTPITSSAVPADSAGMHSP-------- 139 Query: 510 EPRMPSRGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLF 689 +PR P + RGRP G+ K+++ A GT G GFTPHVI+V PGED+ASK+++FSQQGP Sbjct: 140 DPR-PKKNRGRPPGT--GKRQMDALGTGGVGFTPHVILVKPGEDIASKVMAFSQQGPRTV 196 Query: 690 CILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS--ISAENDGQTRTS 863 CILSA GA+ T + G+ S++GR+E+ISLS S IS N ++R+ Sbjct: 197 CILSAHGAVCNVTLQPALSSGSV---------SYEGRYEIISLSGSFLISENNGNRSRSG 247 Query: 864 RLSASLATNDSQVIGGVVGMLIAASTVQVVVGSFIIDKKDL 986 LS SLA+ D QV+GG+ ML AASTVQV+VGSF++D K L Sbjct: 248 GLSVSLASADGQVLGGITNMLTAASTVQVIVGSFLVDGKKL 288 >ref|NP_191690.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana] gi|22136014|gb|AAM91589.1| putative DNA-binding protein [Arabidopsis thaliana] gi|31711840|gb|AAP68276.1| At3g61310 [Arabidopsis thaliana] gi|119657366|tpd|FAA00282.1| TPA: AT-hook motif nuclear localized protein 11 [Arabidopsis thaliana] gi|332646665|gb|AEE80186.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana] Length = 354 Score = 153 bits (387), Expect = 1e-34 Identities = 121/348 (34%), 
Positives = 162/348 (46%), Gaps = 20/348 (5%) Frame = +3 Query: 156 PGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDDGDE 335 PGS P QP + QG H + + P+P+ + ++ F SP D Sbjct: 22 PGSGPPPPQTQPTFHGSQGFHHFT-NSISPFGSNPNPNPNPGGVSTGFVSPPLPVDSSPA 80 Query: 336 EGVQGAK----------LSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXX 485 + A S KRKRGRPRKYG D Sbjct: 81 DSSAAAAGALVAPPSGDTSVKRKRGRPRKYGQDGGSVSLALSPSISN------------- 127 Query: 486 XXXXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVAS 650 P RGRGRP GS GKK++L++ G + G FTPHVI+V GED+AS Sbjct: 128 -------VSPNSNKRGRGRPPGS-GKKQRLSSIGEMMPSSTGMSFTPHVIVVSIGEDIAS 179 Query: 651 KILSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI 830 K++SFS QGP C+LSA GA+S AT Q P + GT ++G FELISLS S Sbjct: 180 KVISFSHQGPRAICVLSASGAVSTATLLQ--PAPSHGTI------IYEGLFELISLSTSY 231 Query: 831 --SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFI--IDKKDLKKL 995 + +ND RT L+ SLA+ D +VI GG+ G LIAAS VQV+VGSFI I K +KK Sbjct: 232 LNTTDNDYPNRTGSLAVSLASPDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKR 291 Query: 996 HNDVDSTALANSSATNHQVSQVENHSSFQLNSNFIPLPSNSCQTSNHN 1139 + ++ N+ + + Q + N + P T + + Sbjct: 292 EETSEDVQDTDALENNNDNTAATSPPVPQQSQNIVQTPVGIWSTGSRS 339 >emb|CAB71061.1| putative DNA-binding protein [Arabidopsis thaliana] Length = 348 Score = 153 bits (387), Expect = 1e-34 Identities = 121/348 (34%), Positives = 162/348 (46%), Gaps = 20/348 (5%) Frame = +3 Query: 156 PGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDDGDE 335 PGS P QP + QG H + + P+P+ + ++ F SP D Sbjct: 16 PGSGPPPPQTQPTFHGSQGFHHFT-NSISPFGSNPNPNPNPGGVSTGFVSPPLPVDSSPA 74 Query: 336 EGVQGAK----------LSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXX 485 + A S KRKRGRPRKYG D Sbjct: 75 DSSAAAAGALVAPPSGDTSVKRKRGRPRKYGQDGGSVSLALSPSISN------------- 121 Query: 486 XXXXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVAS 650 P RGRGRP GS GKK++L++ G + G FTPHVI+V GED+AS Sbjct: 122 -------VSPNSNKRGRGRPPGS-GKKQRLSSIGEMMPSSTGMSFTPHVIVVSIGEDIAS 173 Query: 651 
KILSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI 830 K++SFS QGP C+LSA GA+S AT Q P + GT ++G FELISLS S Sbjct: 174 KVISFSHQGPRAICVLSASGAVSTATLLQ--PAPSHGTI------IYEGLFELISLSTSY 225 Query: 831 --SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFI--IDKKDLKKL 995 + +ND RT L+ SLA+ D +VI GG+ G LIAAS VQV+VGSFI I K +KK Sbjct: 226 LNTTDNDYPNRTGSLAVSLASPDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKR 285 Query: 996 HNDVDSTALANSSATNHQVSQVENHSSFQLNSNFIPLPSNSCQTSNHN 1139 + ++ N+ + + Q + N + P T + + Sbjct: 286 EETSEDVQDTDALENNNDNTAATSPPVPQQSQNIVQTPVGIWSTGSRS 333 >ref|XP_002878374.1| hypothetical protein ARALYDRAFT_324562 [Arabidopsis lyrata subsp. lyrata] gi|297324212|gb|EFH54633.1| hypothetical protein ARALYDRAFT_324562 [Arabidopsis lyrata subsp. lyrata] Length = 346 Score = 152 bits (385), Expect = 2e-34 Identities = 126/327 (38%), Positives = 160/327 (48%), Gaps = 26/327 (7%) Frame = +3 Query: 156 PGSAGA-APPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDD-G 329 PGS AP QP + QG H + N + P + G+ P + Sbjct: 22 PGSGPPPAPQTQPTFHGSQGFHH---FTNSN---SPFGSNPGGVSTGFVPPPLPVESSPA 75 Query: 330 DEEGVQGAKL-------SSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXX 488 D GA + S KRKRGRPRKYG D Sbjct: 76 DSSAAAGAVVVPPSGDTSLKRKRGRPRKYGQDGSVSLALSPSVSN--------------- 120 Query: 489 XXXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASK 653 P RGRGRP GS GKK++L++ G + G FTPHVI+V GED+ASK Sbjct: 121 ------VSPNSNKRGRGRPPGS-GKKQRLSSIGEMMPSSSGMSFTPHVIVVSIGEDIASK 173 Query: 654 ILSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI- 830 ++SFS QGP C+LSA GA+S AT Q P + GT +++G FELISLS S Sbjct: 174 VISFSHQGPRAICVLSASGAVSTATLLQ--PAPSHGTI------TYEGLFELISLSTSYL 225 Query: 831 -SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFI--IDKKDLKK-- 992 + +ND RT L+ SLA++D +VI GG+ G LIAAS VQV+VGSFI I K +KK Sbjct: 226 NTTDNDYPNRTGSLAVSLASSDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKRE 285 Query: 993 -LHNDVDSTALA----NSSATNHQVSQ 1058 DV TA N++AT+ V Q Sbjct: 286 ETSEDVQDTAALDNNDNTAATSPPVPQ 312 >gb|ADE76393.1| unknown [Picea sitchensis] Length = 
302 Score = 152 bits (385), Expect = 2e-34 Identities = 110/245 (44%), Positives = 136/245 (55%), Gaps = 9/245 (3%) Frame = +3 Query: 348 GAKLSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPS 527 G S KRKRGRPRKYG D + Sbjct: 14 GGTDSMKRKRGRPRKYGPDGSMALALAPLSASAPGAPFSP-----------------LQK 56 Query: 528 RGRGRPNGSLGKKKKLAAFG-----TLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFC 692 RGRGRP GS GKK++LAA G + G GFTPHVI + GEDVASKI+SFSQQGP C Sbjct: 57 RGRGRPPGS-GKKQRLAALGEWVVGSAGIGFTPHVITIAAGEDVASKIMSFSQQGPRAVC 115 Query: 693 ILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSISAENDG--QTRTSR 866 ILSA GAIS T +Q P + GT +++GRFE++SLS S +G ++RT Sbjct: 116 ILSANGAISNVTLRQ--PATSGGT------LTYEGRFEILSLSGSFMLTENGGARSRTGG 167 Query: 867 LSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFIID-KKDLKKLHNDVDSTALANSSAT 1040 LS SLA+ D +V+ GGV GML+AAS VQVVVGSFI + +KD K S LA ++A+ Sbjct: 168 LSVSLASPDGRVVGGGVAGMLMAASPVQVVVGSFISNGQKDPPKPAKPEPSIGLAQAAAS 227 Query: 1041 NHQVS 1055 V+ Sbjct: 228 GGPVA 232 >ref|XP_006847725.1| hypothetical protein AMTR_s00149p00085280 [Amborella trichopoda] gi|548850994|gb|ERN09306.1| hypothetical protein AMTR_s00149p00085280 [Amborella trichopoda] Length = 346 Score = 151 bits (382), Expect = 5e-34 Identities = 107/248 (43%), Positives = 133/248 (53%), Gaps = 5/248 (2%) Frame = +3 Query: 348 GAKLSS--KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRM 521 GA +S K+KRGRPRKYG D Sbjct: 68 GASMSEPIKKKRGRPRKYGPDGSVSLALASPISSVPGYSTTPSY---------------- 111 Query: 522 PSRGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILS 701 R RGRP G+ G+K+++AA GT G GFTPH+I + GEDVASKI+SFSQQGP CILS Sbjct: 112 -KRNRGRPAGAGGRKQQMAALGTAGVGFTPHIIAIMAGEDVASKIMSFSQQGPRAICILS 170 Query: 702 AIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDG-QTRTSRLSA 875 A GAIS T +Q G T ++GRFE+ISLS S + E DG +RT LS Sbjct: 171 ANGAISNVTLRQAATSGGTVT--------YEGRFEIISLSGSYLLTERDGILSRTGGLSV 222 Query: 876 SLATNDSQVI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQV 1052 SLA D +V+ GGV G+L+AA+ VQVVVGSFI + K K D L+ S+ +Q Sbjct: 223 
SLAGPDGRVLGGGVAGLLVAATPVQVVVGSFIAEGKKPKPKPQIRD--PLSASAFEPNQS 280 Query: 1053 SQVENHSS 1076 S +H S Sbjct: 281 SSPHSHGS 288 >ref|XP_004253116.1| PREDICTED: uncharacterized protein LOC101247708 [Solanum lycopersicum] Length = 357 Score = 150 bits (379), Expect = 1e-33 Identities = 98/222 (44%), Positives = 118/222 (53%), Gaps = 10/222 (4%) Frame = +3 Query: 366 KRKRGRPRKYGID-------NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMP 524 KRKRGRPRKYG D N P Sbjct: 82 KRKRGRPRKYGPDGSMALGLNPAASPVGGGSLGGSLSPRDPGNSAGAQMHSGGPGSPNSS 141 Query: 525 SRGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSA 704 +GRGRP GS GKK+++ G+ G GFTPH+I V PGEDVA KI+SFSQ GP CILSA Sbjct: 142 KKGRGRPPGS-GKKQQMDNLGSTGFGFTPHIIAVKPGEDVAYKIMSFSQNGPRAVCILSA 200 Query: 705 IGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSISAENDG--QTRTSRLSAS 878 GAIS T KQ G T ++GRFE++SLS S + G Q+RT LS S Sbjct: 201 SGAISYVTLKQTATSGGTAT--------YEGRFEILSLSGSFMLSDIGGQQSRTGGLSVS 252 Query: 879 LATNDSQVIGG-VVGMLIAASTVQVVVGSFIIDKKDLKKLHN 1001 LA +D +++GG V G+L AAS VQV+VGSFI D + K N Sbjct: 253 LAGSDGRILGGCVAGVLTAASPVQVIVGSFIADGRKEPKTSN 294 >gb|EMJ19482.1| hypothetical protein PRUPE_ppa008388mg [Prunus persica] Length = 333 Score = 150 bits (378), Expect = 2e-33 Identities = 108/263 (41%), Positives = 136/263 (51%), Gaps = 24/263 (9%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRM----PSRG 533 KRKRGRPRKYG D A P M P RG Sbjct: 83 KRKRGRPRKYGPDGTVSLALSPSSS----------------------ANPGMVTSTPKRG 120 Query: 534 RGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCIL 698 RGRP GS GKK++LA+ G L G GFTPH+I + GED+A+KI+SFSQQGP CIL Sbjct: 121 RGRPPGS-GKKQQLASLGELLSGSAGMGFTPHIITIAMGEDIATKIMSFSQQGPRALCIL 179 Query: 699 SAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS--ISAENDGQTRTSRLS 872 SA GA+S T +Q G T ++GRFE+I LS S ++ + RT LS Sbjct: 180 SANGAVSTVTLRQPSTSGGTVT--------YEGRFEIICLSGSYLLTESGGSRNRTGGLS 231 Query: 873 ASLATNDSQVI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKL------------HNDVDS 1013 SLA+ D +VI GGV GMLIAAS VQV+VGSFI K H VD+ Sbjct: 232 
VSLASPDGRVIGGGVGGMLIAASPVQVIVGSFIWGSSKTKSKKREAVEGATDLDHQTVDN 291 Query: 1014 TALANSSATNHQVSQVENHSSFQ 1082 + NS + + +SQ + +++Q Sbjct: 292 SVALNSISQDQSLSQSASLAAWQ 314 >ref|XP_006849966.1| hypothetical protein AMTR_s00022p00149810 [Amborella trichopoda] gi|548853564|gb|ERN11547.1| hypothetical protein AMTR_s00022p00149810 [Amborella trichopoda] Length = 335 Score = 149 bits (377), Expect = 2e-33 Identities = 120/295 (40%), Positives = 146/295 (49%), Gaps = 9/295 (3%) Frame = +3 Query: 153 QPGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDDGD 332 Q S PP+ P D AA+Y + P+ D ++ T+ G Sbjct: 28 QKSSVQPPPPVAPNMRLAFSSDGAAVYKPVTGNSPPYQGDTSS----------TMVQHGG 77 Query: 333 EEGVQGAKLSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAE 512 G L KRKRGRPRKYG D Sbjct: 78 INMNMGEPL--KRKRGRPRKYGPDGTMALALTPATPVSAGF------------------- 116 Query: 513 PRMPS-----RGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQG 677 P PS + RGRP GS GKK++LAA G G GF PHVI V GEDVASKI+SFSQQG Sbjct: 117 PGSPSSSSLKKARGRPPGS-GKKQQLAALGAAGVGFMPHVITVKTGEDVASKIMSFSQQG 175 Query: 678 PHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQ- 851 P CILSA GAIS T +Q G T ++GRFE++SLS S + +E+ GQ Sbjct: 176 PRAVCILSANGAISNVTLRQAATSGGTVT--------YEGRFEILSLSGSFLLSESGGQR 227 Query: 852 TRTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFIID-KKDLKKLHNDVD 1010 +RT LS SLA D +V+ GGV G+L+AA+ VQVVVGSFI D +K K N D Sbjct: 228 SRTGGLSVSLAGPDGRVLGGGVAGLLMAATPVQVVVGSFISDGRKSDSKTPNQQD 282 >gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guineensis] Length = 362 Score = 148 bits (373), Expect = 6e-33 Identities = 112/282 (39%), Positives = 138/282 (48%), Gaps = 5/282 (1%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--AEPRMPSRGRG 539 KRKRGRPRKYG D A + RG Sbjct: 93 KRKRGRPRKYGPDGTMSLALTTVSPTAAVSPGSGGFSPSSAGAGNPASSASAEAMKKARG 152 Query: 540 RPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIGAIS 719 RP GS GKK++LAA G+ G GFTPHVI V GEDV+SKI+SFSQ GP CILSA GAIS Sbjct: 153 
RPPGS-GKKQQLAALGSAGIGFTPHVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAIS 211 Query: 720 EATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQ-TRTSRLSASLATND 893 T +Q G T ++GRFE++SLS S + +E+ GQ +RT LS SLA D Sbjct: 212 NVTLRQAATSGGTVT--------YEGRFEILSLSGSFLLSESGGQRSRTGGLSVSLAGPD 263 Query: 894 SQVI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQVSQVENH 1070 +V+ GGV G+L AAS VQVVVGSFI D K K D T A + + Sbjct: 264 GRVLGGGVAGLLTAASPVQVVVGSFIADGKKEPKHTAPSDPTLAPGKLAAGGAAAGANS- 322 Query: 1071 SSFQLNSNFIPLPSNSCQTSNHNYYASPTSRSYHGCNMEDQR 1196 P + S+ SP ++S CN +Q+ Sbjct: 323 ----------PPSRGTLSESSGGGPGSPLNQSTGTCNNSNQQ 354 >ref|XP_002275328.1| PREDICTED: uncharacterized protein LOC100263332 [Vitis vinifera] gi|297745600|emb|CBI40765.3| unnamed protein product [Vitis vinifera] Length = 353 Score = 147 bits (371), Expect = 1e-32 Identities = 111/275 (40%), Positives = 141/275 (51%), Gaps = 3/275 (1%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPSRGRGRP 545 KRKRGRPRKYG D A P + RGRP Sbjct: 86 KRKRGRPRKYGPDGTMALALSPAPSGVNVSQSGGAFSSPPASAGS--ASPSSLKKARGRP 143 Query: 546 NGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIGAISEA 725 GS KK+++ A G+ G GFTPHVI V GEDV+SKI+SFSQ GP CILSA GAIS Sbjct: 144 PGS-SKKQQMEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNV 202 Query: 726 TFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQ-TRTSRLSASLATNDSQ 899 T +Q P + GT +++GRFE++SLS S + +EN GQ +RT LS SL+ D + Sbjct: 203 TLRQ--PATSGGT------VTYEGRFEILSLSGSFLLSENGGQRSRTGGLSVSLSGPDGR 254 Query: 900 VI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQVSQVENHSS 1076 V+ GGV G+L AAS VQVVVGSFI D + K + V+ ++ A V SS Sbjct: 255 VLGGGVAGLLTAASPVQVVVGSFIADGRKESKSASQVEPSSAPPKIAPVGGGGGVTGTSS 314 Query: 1077 FQLNSNFIPLPSNSCQTSNHNYYASPTSRSYHGCN 1181 PS + + SP ++S CN Sbjct: 315 ---------PPSRGTLSESSGGPGSPLNQSTGACN 340 >emb|CAN64876.1| hypothetical protein VITISV_030792 [Vitis vinifera] Length = 390 Score = 147 bits (371), Expect = 1e-32 Identities = 111/275 (40%), Positives = 141/275 (51%), Gaps = 3/275 (1%) Frame = +3 
Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPSRGRGRP 545 KRKRGRPRKYG D A P + RGRP Sbjct: 86 KRKRGRPRKYGPDGTMALALSPAPSGVNVSQSGGAFSSPPASAGS--ASPSSLKKARGRP 143 Query: 546 NGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIGAISEA 725 GS KK+++ A G+ G GFTPHVI V GEDV+SKI+SFSQ GP CILSA GAIS Sbjct: 144 PGS-SKKQQMEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNV 202 Query: 726 TFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQ-TRTSRLSASLATNDSQ 899 T +Q P + GT +++GRFE++SLS S + +EN GQ +RT LS SL+ D + Sbjct: 203 TLRQ--PATSGGT------VTYEGRFEILSLSGSFLLSENGGQRSRTGGLSVSLSGPDGR 254 Query: 900 VI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQVSQVENHSS 1076 V+ GGV G+L AAS VQVVVGSFI D + K + V+ ++ A V SS Sbjct: 255 VLGGGVAGLLTAASPVQVVVGSFIADGRKESKSASQVEPSSAPPKIAPVGGGGGVTGTSS 314 Query: 1077 FQLNSNFIPLPSNSCQTSNHNYYASPTSRSYHGCN 1181 PS + + SP ++S CN Sbjct: 315 ---------PPSRGTLSESSGGPGSPLNQSTGACN 340 >ref|XP_006291415.1| hypothetical protein CARUB_v10017552mg [Capsella rubella] gi|482560122|gb|EOA24313.1| hypothetical protein CARUB_v10017552mg [Capsella rubella] Length = 274 Score = 147 bits (370), Expect = 1e-32 Identities = 110/282 (39%), Positives = 140/282 (49%), Gaps = 17/282 (6%) Frame = +3 Query: 156 PGSAGAAPPLQPRDYTDQGHDHAALYANQSYMLQPHPDDDNNNINGYFSSPNTINDD-GD 332 PGS P QP + QG H + +N + P+P+ + G+ S P + D Sbjct: 22 PGSGPPPPQTQPTFHGSQGFHHFS-NSNSPFGSNPNPNPGGGSA-GFVSPPLQVESSPAD 79 Query: 333 EEGVQGAKL--------SSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXX 488 GA + S KRKRGRPRKYG D Sbjct: 80 SSATAGAAVAPPPSGDTSVKRKRGRPRKYGQDGSVSLALSPSVSN--------------- 124 Query: 489 XXXXXXAEPRMPSRGRGRPNGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASK 653 P RGRGRP GS G+K++L G L G FTPHVI+V GED+ASK Sbjct: 125 ------VSPNSNKRGRGRPPGS-GRKQRLTTVGELMPSSSGMSFTPHVIVVSIGEDIASK 177 Query: 654 ILSFSQQGPHLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI- 830 ++SFS QGP C+LSA GA+S AT Q P + GT +++GRFELISLS S Sbjct: 178 VISFSHQGPRAICVLSASGAVSTATLLQPPP--SHGTI------TYEGRFELISLSTSYL 229 Query: 831 
-SAENDGQTRTSRLSASLATNDSQVI-GGVVGMLIAASTVQV 950 + +ND RT L+ SLA+ D +VI GG+ G LIAAS+VQV Sbjct: 230 NTTDNDYPNRTGNLAVSLASPDGRVIGGGIGGPLIAASSVQV 271 >ref|NP_182109.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana] gi|30690145|ref|NP_850442.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana] gi|14194131|gb|AAK56260.1|AF367271_1 At2g45850/F4I18.17 [Arabidopsis thaliana] gi|3386609|gb|AAC28539.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana] gi|16323338|gb|AAL15382.1| At2g45850/F4I18.17 [Arabidopsis thaliana] gi|17065246|gb|AAL32777.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana] gi|21387187|gb|AAM47997.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana] gi|119657362|tpd|FAA00280.1| TPA: AT-hook motif nuclear localized protein 9 [Arabidopsis thaliana] gi|330255515|gb|AEC10609.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana] gi|330255516|gb|AEC10610.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana] Length = 348 Score = 147 bits (370), Expect = 1e-32 Identities = 111/294 (37%), Positives = 149/294 (50%), Gaps = 9/294 (3%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPSRGRGRP 545 KRKRGRPRKYG D RGRGRP Sbjct: 98 KRKRGRPRKYGQDGSVSLALSSSSVSTITPNNSN-------------------KRGRGRP 138 Query: 546 NGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIG 710 GS GKK+++A+ G L G FTPHVI V GED+ASK+++FSQQGP C+LSA G Sbjct: 139 PGS-GKKQRMASVGELMPSSSGMSFTPHVIAVSIGEDIASKVIAFSQQGPRAICVLSASG 197 Query: 711 AISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSISAENDG--QTRTSRLSASLA 884 A+S AT I P + G + ++GRFE+++LS S DG + RT LS SLA Sbjct: 198 AVSTATL--IQPSASPGAIK------YEGRFEILALSTSYIVATDGSFRNRTGNLSVSLA 249 Query: 885 TNDSQVIGGVV-GMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQVSQV 1061 + D +VIGG + G LIAAS VQV+VGSFI +K + +++ + V + Sbjct: 250 SPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKSKKREEEASEV---------VQET 300 Query: 1062 ENHSSFQLNSNFI-PLPSNSCQTSNHNYYASPTSRSYHGCNMEDQRADAEFDMV 1220 ++H N+N I 
P+P Q N N S SR M+ + A A+ D++ Sbjct: 301 DDHHVLDNNNNTISPVPQ---QQPNQNLIWSTGSR-----QMDMRHAHADIDLM 346 >ref|XP_003524712.2| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max] Length = 362 Score = 146 bits (369), Expect = 2e-32 Identities = 112/289 (38%), Positives = 145/289 (50%), Gaps = 5/289 (1%) Frame = +3 Query: 324 DGDEEGVQGAKLSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 503 DG ++ L+ K+KRGRPRKY D Sbjct: 69 DGSSSPMKACSLA-KKKRGRPRKYSPDGNIALRLAPTHASPPAAASGGGGGGDSAGMASA 127 Query: 504 XAEPRMPSRGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGPH 683 A + + RGRP GS KK+L A G G GFTPHVI+V GED+ +KI++FSQQGP Sbjct: 128 DAPAK---KHRGRPPGS--GKKQLDALGAGGVGFTPHVILVESGEDITAKIMAFSQQGPR 182 Query: 684 LFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSI--SAENDGQTR 857 CILSAIGAI T +Q G T ++GRFE+ISLS S+ S N ++R Sbjct: 183 TVCILSAIGAIGNVTLQQSAMTGGIAT--------YEGRFEIISLSGSLQQSENNSERSR 234 Query: 858 TSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSS 1034 T L+ +LA +D +V+ GGV G LIAASTVQV+VGSFI D K S AL + S Sbjct: 235 TCTLNVTLAGSDGRVLGGGVAGTLIAASTVQVIVGSFIADAK-------KSSSNALKSGS 287 Query: 1035 ATNHQVSQVENHSSFQLNSNFIPLPS-NSCQTSNHN-YYASPTSRSYHG 1175 ++ + SS NS PS S + +H+ + P S HG Sbjct: 288 SSAPPPQMLTFGSSMTPNSPTSQGPSTESSEEQDHSPFCRGPGPGSGHG 336 >gb|EXB56269.1| Putative DNA-binding protein ESCAROLA [Morus notabilis] Length = 351 Score = 146 bits (368), Expect = 2e-32 Identities = 102/230 (44%), Positives = 126/230 (54%), Gaps = 9/230 (3%) Frame = +3 Query: 339 GVQGAKLSSKRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAE-- 512 G G + KRKRGRPRKYG D Sbjct: 68 GGGGGEPMVKRKRGRPRKYGPDGTMALGLSPNPPSVGVTQSSGGGFSSPPPTAAISGGGG 127 Query: 513 --PRMPS--RGRGRPNGSLGKKKKLAAFGTLGTGFTPHVIIVPPGEDVASKILSFSQQGP 680 P S + RGRP GS GKK++ AFG+ G GFTPHVI V GEDV+SKI+SFSQ GP Sbjct: 128 GGPTSASLKKARGRPPGSTGKKQQFDAFGSAGFGFTPHVITVKAGEDVSSKIMSFSQHGP 187 Query: 681 HLFCILSAIGAISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCS-ISAENDGQ-T 854 C+LSA GAIS T +Q P + GT +++GR+E++SLS S + +EN GQ + Sbjct: 188 
RAVCVLSANGAISNVTLRQ--PATSGGT------VTYEGRYEILSLSGSFLLSENGGQRS 239 Query: 855 RTSRLSASLATNDSQVI-GGVVGMLIAASTVQVVVGSFIIDKKDLKKLHN 1001 RT LS SL+ D +V+ GGV G+L AAS VQVVVGSFI D + K N Sbjct: 240 RTGGLSVSLSGTDGRVLGGGVAGLLTAASPVQVVVGSFIADGRKEPKSAN 289 >ref|XP_006294414.1| hypothetical protein CARUB_v10023431mg [Capsella rubella] gi|482563122|gb|EOA27312.1| hypothetical protein CARUB_v10023431mg [Capsella rubella] Length = 379 Score = 146 bits (368), Expect = 2e-32 Identities = 110/294 (37%), Positives = 146/294 (49%), Gaps = 9/294 (3%) Frame = +3 Query: 366 KRKRGRPRKYGIDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEPRMPSRGRGRP 545 KRKRGRPRKYG D RGRGRP Sbjct: 126 KRKRGRPRKYGQDGSVSLALSSSSVSTITPNNSN-------------------KRGRGRP 166 Query: 546 NGSLGKKKKLAAFGTL-----GTGFTPHVIIVPPGEDVASKILSFSQQGPHLFCILSAIG 710 GS GKK++ A+ G L G FTPHVI V GED+ASK++SFSQQGP C+LSA G Sbjct: 167 PGS-GKKQRSASIGELMPSSSGMSFTPHVIAVSIGEDIASKVISFSQQGPRAVCVLSASG 225 Query: 711 AISEATFKQIDPYGNCGTSECAEVKSFKGRFELISLSCSISAENDG--QTRTSRLSASLA 884 A+S AT Q P + G + ++GRFE+++LS S DG + RT LS SLA Sbjct: 226 AVSTATLLQ--PSASPGAIK------YEGRFEILALSTSFIVATDGSFRNRTGNLSVSLA 277 Query: 885 TNDSQVIGGVV-GMLIAASTVQVVVGSFIIDKKDLKKLHNDVDSTALANSSATNHQVSQV 1061 + D +VIGG + G LIAA+ VQV+VGSFI +K + +++ + V Sbjct: 278 SPDGRVIGGAIGGPLIAATPVQVIVGSFIWAAPKIKSKKREEEASEV---------VQDT 328 Query: 1062 ENHSSFQLNSNFI-PLPSNSCQTSNHNYYASPTSRSYHGCNMEDQRADAEFDMV 1220 ++H NSN I P+P + N S SR M+ + A A+ D++ Sbjct: 329 DDHQVLDNNSNTISPVPQQPQSQPSQNLIWSTGSR-----QMDMRHAHADIDLM 377