BLASTX nr result

ID: Rehmannia22_contig00000159 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00000159
         (1077 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277078.1| PREDICTED: uncharacterized protein LOC100246...   335   2e-89
emb|CAN67639.1| hypothetical protein VITISV_044258 [Vitis vinifera]   333   9e-89
ref|XP_006435487.1| hypothetical protein CICLE_v10002292mg [Citr...   332   2e-88
gb|EOY15860.1| PLATZ transcription factor family protein isoform...   331   3e-88
gb|EOY15859.1| PLATZ transcription factor family protein isoform...   331   3e-88
gb|ABK96587.1| unknown [Populus trichocarpa x Populus deltoides]      331   3e-88
ref|XP_002301890.1| zinc-binding family protein [Populus trichoc...   330   5e-88
ref|XP_002510750.1| protein with unknown function [Ricinus commu...   328   2e-87
ref|XP_006416340.1| hypothetical protein EUTSA_v10008598mg [Eutr...   320   5e-85
ref|NP_564128.1| PLATZ transcription factor family protein [Arab...   319   1e-84
ref|XP_006390173.1| hypothetical protein EUTSA_v10019067mg [Eutr...   318   2e-84
ref|XP_006416339.1| hypothetical protein EUTSA_v10008598mg [Eutr...   318   3e-84
ref|XP_002890421.1| zinc-binding family protein [Arabidopsis lyr...   318   3e-84
ref|XP_006305316.1| hypothetical protein CARUB_v10009696mg [Caps...   317   4e-84
gb|EOY15862.1| PLATZ transcription factor family protein isoform...   317   5e-84
gb|AAM61414.1| unknown [Arabidopsis thaliana]                         317   5e-84
ref|XP_006383802.1| hypothetical protein POPTR_0005s28040g [Popu...   316   9e-84
ref|NP_001117322.1| PLATZ transcription factor family protein [A...   316   9e-84
gb|EOY15858.1| PLATZ transcription factor family protein isoform...   313   6e-83
gb|EPS61450.1| hypothetical protein M569_13347, partial [Genlise...   311   2e-82

>ref|XP_002277078.1| PREDICTED: uncharacterized protein LOC100246080 [Vitis vinifera]
          Length = 247

 Score =  335 bits (859), Expect = 2e-89
 Identities = 166/247 (67%), Positives = 186/247 (75%), Gaps = 7/247 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VS I RM  +  LGP WLKPML+A YF+PC +H DSNK ECNMFCLDCMG+A CSYCLIH
Sbjct: 2   VSSIGRMGNDH-LGPPWLKPMLRASYFVPCGIHGDSNKSECNMFCLDCMGDALCSYCLIH 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           HKDH V+QIRRSSYHNVVRVNEIQKYIDISC+QTYVINSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HKDHCVVQIRRSSYHNVVRVNEIQKYIDISCVQTYVINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHK 285
           CEIC R L DSFRFCSLGCKL AM+RGD +LTF +K KH RE F  S+S E STPRK  +
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGAMKRGDPDLTFWLKLKHGRETFHGSESDESSTPRKFQR 180

Query: 284 RNVLNRFW-DGP------XXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKG 126
            ++ +R   DGP                          +NN+SPATPP+FNH+N+ RRKG
Sbjct: 181 THLFSRLMIDGPTISLDGHHDATVAADKSTASSSGDETINNISPATPPIFNHSNARRRKG 240

Query: 125 IPHRAPF 105
           IPHRAPF
Sbjct: 241 IPHRAPF 247


>emb|CAN67639.1| hypothetical protein VITISV_044258 [Vitis vinifera]
          Length = 240

 Score =  333 bits (853), Expect = 9e-89
 Identities = 161/235 (68%), Positives = 180/235 (76%), Gaps = 7/235 (2%)
 Frame = -2

Query: 788 LGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVLQIRRS 609
           LGP WLKPML+A YF+PC +H DSNK ECNMFCLDCMG+A CSYCLIHHKDH V+QIRRS
Sbjct: 6   LGPPWLKPMLRASYFVPCGIHGDSNKSECNMFCLDCMGDALCSYCLIHHKDHCVVQIRRS 65

Query: 608 SYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARGLPDSF 429
           SYHNVVRVNEIQKYIDISC+QTYVINSAKIVFLNERPQPRPGKGVT TCEIC R L DSF
Sbjct: 66  SYHNVVRVNEIQKYIDISCVQTYVINSAKIVFLNERPQPRPGKGVTNTCEICCRSLLDSF 125

Query: 428 RFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHKRNVLNRFW-DGP 252
           RFCSLGCKL AM+RGD +LTF +K KH RE F  S+S E STPRK  + ++ +R   DGP
Sbjct: 126 RFCSLGCKLGAMKRGDPDLTFWLKLKHGRETFHGSESDESSTPRKFQRTHLFSRLMIDGP 185

Query: 251 ------XXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAPF 105
                                     +NN+SPATPP+FNH+N+ RRKGIPHRAPF
Sbjct: 186 TISLDGHHDATVAADKSTASSSGDETINNISPATPPIFNHSNARRRKGIPHRAPF 240


>ref|XP_006435487.1| hypothetical protein CICLE_v10002292mg [Citrus clementina]
           gi|567885859|ref|XP_006435488.1| hypothetical protein
           CICLE_v10002292mg [Citrus clementina]
           gi|557537609|gb|ESR48727.1| hypothetical protein
           CICLE_v10002292mg [Citrus clementina]
           gi|557537610|gb|ESR48728.1| hypothetical protein
           CICLE_v10002292mg [Citrus clementina]
          Length = 244

 Score =  332 bits (850), Expect = 2e-88
 Identities = 161/244 (65%), Positives = 185/244 (75%), Gaps = 4/244 (1%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VSPI RM+ ++ +GP+WLKPML+A YFIPC VH DSNK ECNMFCLDCMGNAFCSYCLI+
Sbjct: 2   VSPIGRMEDDD-MGPQWLKPMLRASYFIPCVVHGDSNKSECNMFCLDCMGNAFCSYCLIN 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           HKDHRV+QIRRSSYHNVVRVNEIQK+IDISC+QTY+INSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HKDHRVVQIRRSSYHNVVRVNEIQKFIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHK 285
           CEIC R L DSFRFCSLGCKL AM+RGD++LTF+++ KH       S+S E STP+KI +
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGAMKRGDLDLTFTLRVKHKDGFHGGSESDESSTPKKIRR 180

Query: 284 RNVLNRFWDG----PXXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPH 117
               NR  +G                            LSPATPP++NH N+ RRKGIPH
Sbjct: 181 TPNFNRLMEGLTIYRHSHHNTNEGAERSCSSGDEATTKLSPATPPIYNHGNARRRKGIPH 240

Query: 116 RAPF 105
           RAPF
Sbjct: 241 RAPF 244


>gb|EOY15860.1| PLATZ transcription factor family protein isoform 3, partial
           [Theobroma cacao]
          Length = 325

 Score =  331 bits (849), Expect = 3e-88
 Identities = 163/245 (66%), Positives = 190/245 (77%), Gaps = 3/245 (1%)
 Frame = -2

Query: 830 MQVSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCL 651
           + VSPI RM++++ +GP WL PML+A YFIPC +H D+NK ECN+FCLDCM NA CSYCL
Sbjct: 82  VMVSPIGRMEEDD-MGPPWLVPMLRASYFIPCPIHGDANKSECNLFCLDCMRNALCSYCL 140

Query: 650 IHHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVT 471
           I+HKDHRV+QIRRSSYHNVVRV+EIQK+IDISC+QTY+INSAKIVFLNERPQPRPGKGVT
Sbjct: 141 INHKDHRVVQIRRSSYHNVVRVSEIQKFIDISCVQTYIINSAKIVFLNERPQPRPGKGVT 200

Query: 470 YTCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMF-DESDSIELSTPRK 294
            TCEIC R L DSFRFCSLGCKL AM+RGD +LTF++KAKH R+ F   S+S E STP+K
Sbjct: 201 NTCEICCRSLLDSFRFCSLGCKLGAMKRGDPDLTFTLKAKHTRDSFYGGSESDESSTPKK 260

Query: 293 IHKRNVLNRFWDG-PXXXXXXXXXXXXXXXXXXXGVNN-LSPATPPLFNHTNSSRRKGIP 120
           I K  + NR  DG P                     NN +SPATPP+FNH N+ RRKGIP
Sbjct: 261 IRKTPLFNRMMDGLPLSSDSHKNDGRERYSSSGDEANNTISPATPPIFNHHNARRRKGIP 320

Query: 119 HRAPF 105
           HRAPF
Sbjct: 321 HRAPF 325


>gb|EOY15859.1| PLATZ transcription factor family protein isoform 2 [Theobroma
           cacao]
          Length = 243

 Score =  331 bits (848), Expect = 3e-88
 Identities = 163/243 (67%), Positives = 189/243 (77%), Gaps = 3/243 (1%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VSPI RM++++ +GP WL PML+A YFIPC +H D+NK ECN+FCLDCM NA CSYCLI+
Sbjct: 2   VSPIGRMEEDD-MGPPWLVPMLRASYFIPCPIHGDANKSECNLFCLDCMRNALCSYCLIN 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           HKDHRV+QIRRSSYHNVVRV+EIQK+IDISC+QTY+INSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HKDHRVVQIRRSSYHNVVRVSEIQKFIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMF-DESDSIELSTPRKIH 288
           CEIC R L DSFRFCSLGCKL AM+RGD +LTF++KAKH R+ F   S+S E STP+KI 
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGAMKRGDPDLTFTLKAKHTRDSFYGGSESDESSTPKKIR 180

Query: 287 KRNVLNRFWDG-PXXXXXXXXXXXXXXXXXXXGVNN-LSPATPPLFNHTNSSRRKGIPHR 114
           K  + NR  DG P                     NN +SPATPP+FNH N+ RRKGIPHR
Sbjct: 181 KTPLFNRMMDGLPLSSDSHKNDGRERYSSSGDEANNTISPATPPIFNHHNARRRKGIPHR 240

Query: 113 APF 105
           APF
Sbjct: 241 APF 243


>gb|ABK96587.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 238

 Score =  331 bits (848), Expect = 3e-88
 Identities = 159/229 (69%), Positives = 178/229 (77%), Gaps = 1/229 (0%)
 Frame = -2

Query: 788 LGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVLQIRRS 609
           +GP WL PML+A YFIPC VH +SNK ECNMFCLDCMGNAFCSYCLI+HKDHRV+QIRRS
Sbjct: 13  MGPPWLIPMLRASYFIPCGVHGESNKSECNMFCLDCMGNAFCSYCLIYHKDHRVVQIRRS 72

Query: 608 SYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARGLPDSF 429
           SYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPRPGKGVT TCEIC R L DSF
Sbjct: 73  SYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNTCEICCRSLLDSF 132

Query: 428 RFCSLGCKLNAMERGDIELTFSVKAKHNRE-MFDESDSIELSTPRKIHKRNVLNRFWDGP 252
           RFCSLGCKL  M+RGD +LTF+VK KHNR+  F  S+S E STP+KI + +  NR  +G 
Sbjct: 133 RFCSLGCKLGGMKRGDPDLTFAVKLKHNRDPFFGGSESDESSTPKKIRRTHAFNRLMEG- 191

Query: 251 XXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAPF 105
                                 N+SPATPP+FNH N+ RRKGIPHRAPF
Sbjct: 192 --LSIYSSNNDGAESSGDDAATNISPATPPIFNHRNARRRKGIPHRAPF 238


>ref|XP_002301890.1| zinc-binding family protein [Populus trichocarpa]
           gi|222843616|gb|EEE81163.1| zinc-binding family protein
           [Populus trichocarpa]
          Length = 238

 Score =  330 bits (847), Expect = 5e-88
 Identities = 162/241 (67%), Positives = 183/241 (75%), Gaps = 1/241 (0%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VSP  +M+  +  GP WL PML+A YFIPC VH +SNK ECNMFCLDCMGNAFCSYCLI+
Sbjct: 2   VSPFGQMRNHDT-GPPWLIPMLRASYFIPCAVHGESNKSECNMFCLDCMGNAFCSYCLIY 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           H+DHRV+QIRRSSYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HRDHRVVQIRRSSYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNRE-MFDESDSIELSTPRKIH 288
           CEIC R L DSFRFCSLGCKL  M+RGD +LTF++K K NR+  F  S+S E STP+KI 
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGGMKRGDPDLTFALKLKQNRDPFFGGSESDESSTPKKIR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAP 108
           + +  NR  DG                       N+SPATPPLFNH N+ RRKGIPHRAP
Sbjct: 181 RTHAFNRLMDG---LSIYSSNNDGAESSGDDAATNISPATPPLFNHRNARRRKGIPHRAP 237

Query: 107 F 105
           F
Sbjct: 238 F 238


>ref|XP_002510750.1| protein with unknown function [Ricinus communis]
           gi|223551451|gb|EEF52937.1| protein with unknown
           function [Ricinus communis]
          Length = 235

 Score =  328 bits (842), Expect = 2e-87
 Identities = 158/241 (65%), Positives = 183/241 (75%), Gaps = 1/241 (0%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VSP  +M   + LGP WL+PML+A YF+PC  H DSNK ECN+FCLDCMGNA CSYCLI+
Sbjct: 2   VSPFGQMVHHD-LGPPWLRPMLRASYFVPCSFHGDSNKSECNLFCLDCMGNALCSYCLIN 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           HKDHR++QIRRSSYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HKDHRIVQIRRSSYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMF-DESDSIELSTPRKIH 288
           CEIC R L DSFRFCSLGCKL  M+RGD +LTF+++ KHNR+ F   S+S E STP+KI 
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGGMKRGDPDLTFTLRMKHNRDPFLGGSESDESSTPKKIR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAP 108
           K +  NR  DG                       N+SP+TPP++NH N+ RRKGIPHRAP
Sbjct: 181 KTHAFNRLMDG------LSIYSSDGHSSGDEATTNISPSTPPIYNHRNARRRKGIPHRAP 234

Query: 107 F 105
           F
Sbjct: 235 F 235


>ref|XP_006416340.1| hypothetical protein EUTSA_v10008598mg [Eutrema salsugineum]
           gi|557094111|gb|ESQ34693.1| hypothetical protein
           EUTSA_v10008598mg [Eutrema salsugineum]
          Length = 246

 Score =  320 bits (821), Expect = 5e-85
 Identities = 154/246 (62%), Positives = 175/246 (71%), Gaps = 6/246 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEE-ILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           + P++R ++EE  + P WL PML+  YF+PC +H DSNK ECN+FCLDC GNAFCSYCL+
Sbjct: 1   MGPMIRTEEEEDCMSPPWLMPMLRGSYFVPCSIHADSNKNECNLFCLDCAGNAFCSYCLV 60

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
            HKDHRV+QIRRSSYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPR GKGVT 
Sbjct: 61  KHKDHRVVQIRRSSYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRIGKGVTN 120

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIH 288
           TCEIC R L DSFRFCSLGCKL  M+RGD  LTFS K KH RE    S+S E +TP K+ 
Sbjct: 121 TCEICCRSLLDSFRFCSLGCKLGGMKRGDQSLTFSFKGKHGREYQGGSESDEATTPTKMR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGI 123
           K N  NR   G                            N SP TPP++NH NSSRRKG+
Sbjct: 181 KTNAFNRLMSGLSISTVRFDDYGPGGDQRSSSSGDEGGFNFSPGTPPIYNHRNSSRRKGV 240

Query: 122 PHRAPF 105
           PHRAPF
Sbjct: 241 PHRAPF 246


>ref|NP_564128.1| PLATZ transcription factor family protein [Arabidopsis thaliana]
           gi|14030627|gb|AAK52988.1|AF375404_1 At1g21000/F9H16_1
           [Arabidopsis thaliana]
           gi|16226407|gb|AAL16160.1|AF428392_1 At1g21000/F9H16_1
           [Arabidopsis thaliana] gi|22136542|gb|AAM91057.1|
           At1g21000/F9H16_1 [Arabidopsis thaliana]
           gi|332191931|gb|AEE30052.1| PLATZ transcription factor
           family protein [Arabidopsis thaliana]
          Length = 246

 Score =  319 bits (817), Expect = 1e-84
 Identities = 151/246 (61%), Positives = 176/246 (71%), Gaps = 6/246 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEE-ILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           + P++R ++EE    P WL PML+  YF+PC +H+DSNK ECN+FCLDC GNAFCSYCL+
Sbjct: 1   MGPMIRTEEEEDYTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLV 60

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
            HKDHRV+QIRRSSYHNVVRVNEIQK+IDI+C+QTY+INSAKIVFLNERPQPR GKGVT 
Sbjct: 61  KHKDHRVVQIRRSSYHNVVRVNEIQKFIDIACVQTYIINSAKIVFLNERPQPRIGKGVTN 120

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIH 288
           TCEIC R L DSFRFCSLGCKL  M RGD+ LTFS+K KH RE    S+S E +TP K+ 
Sbjct: 121 TCEICCRSLLDSFRFCSLGCKLGGMRRGDLSLTFSLKGKHGREYLGGSESDEATTPTKMR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGI 123
           K N  NR   G                            + SP TPP++NH NSSRRKG+
Sbjct: 181 KTNAFNRLMSGLSISTVRFDDYGPNGDQRSSSSGDEGGFSFSPGTPPIYNHRNSSRRKGV 240

Query: 122 PHRAPF 105
           PHRAPF
Sbjct: 241 PHRAPF 246


>ref|XP_006390173.1| hypothetical protein EUTSA_v10019067mg [Eutrema salsugineum]
           gi|312282733|dbj|BAJ34232.1| unnamed protein product
           [Thellungiella halophila] gi|557086607|gb|ESQ27459.1|
           hypothetical protein EUTSA_v10019067mg [Eutrema
           salsugineum]
          Length = 240

 Score =  318 bits (816), Expect = 2e-84
 Identities = 152/233 (65%), Positives = 170/233 (72%)
 Frame = -2

Query: 803 QKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVL 624
           +++  L P WL PML+ADYF+PC +H DSNK ECNMFCLDC  NAFC YCLI HK+HRVL
Sbjct: 8   EEDNYLSPPWLIPMLRADYFVPCSIHADSNKSECNMFCLDCTSNAFCPYCLIDHKNHRVL 67

Query: 623 QIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARG 444
           QIRRSSYHNVVRVNEIQKYIDISC+QTY+INSA+IVFLNERPQPR GKGVT TCEIC R 
Sbjct: 68  QIRRSSYHNVVRVNEIQKYIDISCVQTYIINSARIVFLNERPQPRIGKGVTNTCEICCRS 127

Query: 443 LPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHKRNVLNRF 264
           L DSFRFCSLGCKL  M+RG+  LTFS+K KH RE    S+S E +TP KI K    NR 
Sbjct: 128 LLDSFRFCSLGCKLGGMKRGNQSLTFSLKGKHGREYQGGSESDEATTPTKIRKTCAFNRL 187

Query: 263 WDGPXXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAPF 105
             G                       NLSP TPP++NH NSSRRKG+PHRAPF
Sbjct: 188 MSGLSISTVKSDYFSGSSSSGDDSGFNLSPGTPPIYNHRNSSRRKGVPHRAPF 240


>ref|XP_006416339.1| hypothetical protein EUTSA_v10008598mg [Eutrema salsugineum]
           gi|557094110|gb|ESQ34692.1| hypothetical protein
           EUTSA_v10008598mg [Eutrema salsugineum]
          Length = 243

 Score =  318 bits (814), Expect = 3e-84
 Identities = 151/238 (63%), Positives = 170/238 (71%), Gaps = 5/238 (2%)
 Frame = -2

Query: 803 QKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVL 624
           ++E+ + P WL PML+  YF+PC +H DSNK ECN+FCLDC GNAFCSYCL+ HKDHRV+
Sbjct: 6   EEEDCMSPPWLMPMLRGSYFVPCSIHADSNKNECNLFCLDCAGNAFCSYCLVKHKDHRVV 65

Query: 623 QIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARG 444
           QIRRSSYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPR GKGVT TCEIC R 
Sbjct: 66  QIRRSSYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRIGKGVTNTCEICCRS 125

Query: 443 LPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHKRNVLNRF 264
           L DSFRFCSLGCKL  M+RGD  LTFS K KH RE    S+S E +TP K+ K N  NR 
Sbjct: 126 LLDSFRFCSLGCKLGGMKRGDQSLTFSFKGKHGREYQGGSESDEATTPTKMRKTNAFNRL 185

Query: 263 WDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGIPHRAPF 105
             G                            N SP TPP++NH NSSRRKG+PHRAPF
Sbjct: 186 MSGLSISTVRFDDYGPGGDQRSSSSGDEGGFNFSPGTPPIYNHRNSSRRKGVPHRAPF 243


>ref|XP_002890421.1| zinc-binding family protein [Arabidopsis lyrata subsp. lyrata]
           gi|297336263|gb|EFH66680.1| zinc-binding family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 246

 Score =  318 bits (814), Expect = 3e-84
 Identities = 152/246 (61%), Positives = 176/246 (71%), Gaps = 6/246 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEE-ILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           + P++R ++EE    P WL PML+  YF+PC +H+DSNK ECN+FCLDC GNAFCSYCL+
Sbjct: 1   MGPMIRTEEEEDYTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLV 60

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
            HKDHRV+QIRRSSYHNVVRVNEIQK+IDISC+QTY+INSAKIVFLNERPQPR GKGVT 
Sbjct: 61  KHKDHRVVQIRRSSYHNVVRVNEIQKFIDISCVQTYIINSAKIVFLNERPQPRIGKGVTN 120

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIH 288
           TCEIC R L DSFRFCSLGCKL  M+RGD  LTFS+K KH RE    S+S E +TP K+ 
Sbjct: 121 TCEICCRSLLDSFRFCSLGCKLGGMKRGDSSLTFSLKGKHGREYQGGSESDEATTPTKMR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGI 123
           K N  NR   G                            + SP TPP++NH NSSRRKG+
Sbjct: 181 KTNAFNRLMSGLSISTVRFDDYGPGGDQRSSSSGDEGGFSFSPGTPPIYNHRNSSRRKGV 240

Query: 122 PHRAPF 105
           PHRAPF
Sbjct: 241 PHRAPF 246


>ref|XP_006305316.1| hypothetical protein CARUB_v10009696mg [Capsella rubella]
           gi|482574027|gb|EOA38214.1| hypothetical protein
           CARUB_v10009696mg [Capsella rubella]
          Length = 331

 Score =  317 bits (813), Expect = 4e-84
 Identities = 152/246 (61%), Positives = 176/246 (71%), Gaps = 6/246 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEE-ILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           + P++R ++EE    P WL PML+  YF+PC +H+DSNK ECN+FCLDC GNAFCSYCL+
Sbjct: 86  MGPMIRTEEEEDCTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLV 145

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
            HKDHRV+QIRRSSYHNVVRVNEIQK+IDISC+QTY+INSAKIVFLNERPQPR GKGVT 
Sbjct: 146 KHKDHRVVQIRRSSYHNVVRVNEIQKFIDISCVQTYIINSAKIVFLNERPQPRIGKGVTN 205

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIH 288
           TCEIC R L DSFRFCSLGCKL  M+RGD  LTFS+K KH RE    S+S E +TP K+ 
Sbjct: 206 TCEICCRSLLDSFRFCSLGCKLGGMKRGDSTLTFSLKGKHGREYQGGSESDEATTPTKMR 265

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGI 123
           K N  NR   G                            + SP TPP++NH NSSRRKG+
Sbjct: 266 KTNAFNRLMSGLSISTVRFDDYGQGGDQRSSSSGDEGGFSFSPGTPPIYNHRNSSRRKGV 325

Query: 122 PHRAPF 105
           PHRAPF
Sbjct: 326 PHRAPF 331


>gb|EOY15862.1| PLATZ transcription factor family protein isoform 5, partial
           [Theobroma cacao]
          Length = 251

 Score =  317 bits (812), Expect = 5e-84
 Identities = 157/237 (66%), Positives = 183/237 (77%), Gaps = 3/237 (1%)
 Frame = -2

Query: 827 QVSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           QVSPI RM++++ +GP WL PML+A YFIPC +H D+NK ECN+FCLDCM NA CSYCLI
Sbjct: 16  QVSPIGRMEEDD-MGPPWLVPMLRASYFIPCPIHGDANKSECNLFCLDCMRNALCSYCLI 74

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
           +HKDHRV+QIRRSSYHNVVRV+EIQK+IDISC+QTY+INSAKIVFLNERPQPRPGKGVT 
Sbjct: 75  NHKDHRVVQIRRSSYHNVVRVSEIQKFIDISCVQTYIINSAKIVFLNERPQPRPGKGVTN 134

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMF-DESDSIELSTPRKI 291
           TCEIC R L DSFRFCSLGCKL AM+RGD +LTF++KAKH R+ F   S+S E STP+KI
Sbjct: 135 TCEICCRSLLDSFRFCSLGCKLGAMKRGDPDLTFTLKAKHTRDSFYGGSESDESSTPKKI 194

Query: 290 HKRNVLNRFWDG-PXXXXXXXXXXXXXXXXXXXGVNN-LSPATPPLFNHTNSSRRKG 126
            K  + NR  DG P                     NN +SPATPP+FNH N+ RRKG
Sbjct: 195 RKTPLFNRMMDGLPLSSDSHKNDGRERYSSSGDEANNTISPATPPIFNHHNARRRKG 251


>gb|AAM61414.1| unknown [Arabidopsis thaliana]
          Length = 246

 Score =  317 bits (812), Expect = 5e-84
 Identities = 150/246 (60%), Positives = 176/246 (71%), Gaps = 6/246 (2%)
 Frame = -2

Query: 824 VSPIVRMQKEE-ILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLI 648
           + P++R ++EE    P WL PML+  YF+PC +H+DSNK ECN+FCLDC GNAFCSYCL+
Sbjct: 1   MGPMIRTEEEEDYTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLV 60

Query: 647 HHKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTY 468
            HKDHRV+QIRRSSYHNVVRVNEIQK+IDI+C+QT++INSAKIVFLNERPQPR GKGVT 
Sbjct: 61  KHKDHRVVQIRRSSYHNVVRVNEIQKFIDIACVQTHIINSAKIVFLNERPQPRIGKGVTN 120

Query: 467 TCEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIH 288
           TCEIC R L DSFRFCSLGCKL  M RGD+ LTFS+K KH RE    S+S E +TP K+ 
Sbjct: 121 TCEICCRSLLDSFRFCSLGCKLGGMRRGDLSLTFSLKGKHGREYLGGSESDEATTPTKMR 180

Query: 287 KRNVLNRFWDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGI 123
           K N  NR   G                            + SP TPP++NH NSSRRKG+
Sbjct: 181 KTNAFNRLMSGLSISTVRFDDYGPNGDQRSSSSGDEGGFSFSPGTPPIYNHRNSSRRKGV 240

Query: 122 PHRAPF 105
           PHRAPF
Sbjct: 241 PHRAPF 246


>ref|XP_006383802.1| hypothetical protein POPTR_0005s28040g [Populus trichocarpa]
           gi|566173661|ref|XP_006383803.1| hypothetical protein
           POPTR_0005s28040g [Populus trichocarpa]
           gi|566173663|ref|XP_002307009.2| zinc-binding family
           protein [Populus trichocarpa]
           gi|550339909|gb|ERP61599.1| hypothetical protein
           POPTR_0005s28040g [Populus trichocarpa]
           gi|550339910|gb|ERP61600.1| hypothetical protein
           POPTR_0005s28040g [Populus trichocarpa]
           gi|550339911|gb|EEE94005.2| zinc-binding family protein
           [Populus trichocarpa]
          Length = 234

 Score =  316 bits (810), Expect = 9e-84
 Identities = 155/229 (67%), Positives = 174/229 (75%), Gaps = 1/229 (0%)
 Frame = -2

Query: 788 LGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVLQIRRS 609
           +GP WL PML+A YFIPC VH +SNK ECNMFCLDCMGNAFCSYCLI+HKDHRV+QIRRS
Sbjct: 13  MGPPWLIPMLRASYFIPCGVHGESNKSECNMFCLDCMGNAFCSYCLIYHKDHRVVQIRRS 72

Query: 608 SYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARGLPDSF 429
           SYHNVVRVNEIQKYIDISC+QTY+INSAKIVFLNERPQPRPGKGVT TCEIC R L DSF
Sbjct: 73  SYHNVVRVNEIQKYIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNTCEICCRSLLDSF 132

Query: 428 RFCSLGCKLNAMERGDIELTFSVKAKHNRE-MFDESDSIELSTPRKIHKRNVLNRFWDGP 252
           RFCSLGCKL  M+RGD +LTF+VK KHNR+  F  S+S E STP+KI + +  NR  +G 
Sbjct: 133 RFCSLGCKLGGMKRGDPDLTFAVKLKHNRDPFFGGSESDESSTPKKIRRTHAFNRLMEG- 191

Query: 251 XXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAPF 105
                                 N+S    P+FNH N+ RRKGIPHRAPF
Sbjct: 192 --LSIYSSNNDGAESSGDDAATNIS----PIFNHRNARRRKGIPHRAPF 234


>ref|NP_001117322.1| PLATZ transcription factor family protein [Arabidopsis thaliana]
           gi|4836888|gb|AAD30591.1|AC007369_1 Unknown protein
           [Arabidopsis thaliana]
           gi|13877713|gb|AAK43934.1|AF370615_1 Unknown protein
           [Arabidopsis thaliana] gi|332191932|gb|AEE30053.1| PLATZ
           transcription factor family protein [Arabidopsis
           thaliana]
          Length = 243

 Score =  316 bits (810), Expect = 9e-84
 Identities = 148/238 (62%), Positives = 171/238 (71%), Gaps = 5/238 (2%)
 Frame = -2

Query: 803 QKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVL 624
           ++E+   P WL PML+  YF+PC +H+DSNK ECN+FCLDC GNAFCSYCL+ HKDHRV+
Sbjct: 6   EEEDYTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLVKHKDHRVV 65

Query: 623 QIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARG 444
           QIRRSSYHNVVRVNEIQK+IDI+C+QTY+INSAKIVFLNERPQPR GKGVT TCEIC R 
Sbjct: 66  QIRRSSYHNVVRVNEIQKFIDIACVQTYIINSAKIVFLNERPQPRIGKGVTNTCEICCRS 125

Query: 443 LPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMFDESDSIELSTPRKIHKRNVLNRF 264
           L DSFRFCSLGCKL  M RGD+ LTFS+K KH RE    S+S E +TP K+ K N  NR 
Sbjct: 126 LLDSFRFCSLGCKLGGMRRGDLSLTFSLKGKHGREYLGGSESDEATTPTKMRKTNAFNRL 185

Query: 263 WDGPXXXXXXXXXXXXXXXXXXXGVN-----NLSPATPPLFNHTNSSRRKGIPHRAPF 105
             G                            + SP TPP++NH NSSRRKG+PHRAPF
Sbjct: 186 MSGLSISTVRFDDYGPNGDQRSSSSGDEGGFSFSPGTPPIYNHRNSSRRKGVPHRAPF 243


>gb|EOY15858.1| PLATZ transcription factor family protein isoform 1 [Theobroma
           cacao]
          Length = 256

 Score =  313 bits (803), Expect = 6e-83
 Identities = 156/240 (65%), Positives = 183/240 (76%), Gaps = 3/240 (1%)
 Frame = -2

Query: 824 VSPIVRMQKEEILGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIH 645
           VSPI RM++++ +GP WL PML+A YFIPC +H D+NK ECN+FCLDCM NA CSYCLI+
Sbjct: 2   VSPIGRMEEDD-MGPPWLVPMLRASYFIPCPIHGDANKSECNLFCLDCMRNALCSYCLIN 60

Query: 644 HKDHRVLQIRRSSYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYT 465
           HKDHRV+QIRRSSYHNVVRV+EIQK+IDISC+QTY+INSAKIVFLNERPQPRPGKGVT T
Sbjct: 61  HKDHRVVQIRRSSYHNVVRVSEIQKFIDISCVQTYIINSAKIVFLNERPQPRPGKGVTNT 120

Query: 464 CEICARGLPDSFRFCSLGCKLNAMERGDIELTFSVKAKHNREMF-DESDSIELSTPRKIH 288
           CEIC R L DSFRFCSLGCKL AM+RGD +LTF++KAKH R+ F   S+S E STP+KI 
Sbjct: 121 CEICCRSLLDSFRFCSLGCKLGAMKRGDPDLTFTLKAKHTRDSFYGGSESDESSTPKKIR 180

Query: 287 KRNVLNRFWDG-PXXXXXXXXXXXXXXXXXXXGVNN-LSPATPPLFNHTNSSRRKGIPHR 114
           K  + NR  DG P                     NN +SPATPP+FNH N+ RRK +  R
Sbjct: 181 KTPLFNRMMDGLPLSSDSHKNDGRERYSSSGDEANNTISPATPPIFNHHNARRRKVLTDR 240


>gb|EPS61450.1| hypothetical protein M569_13347, partial [Genlisea aurea]
          Length = 220

 Score =  311 bits (798), Expect = 2e-82
 Identities = 157/236 (66%), Positives = 176/236 (74%), Gaps = 9/236 (3%)
 Frame = -2

Query: 788 LGPRWLKPMLKADYFIPCEVHLDSNKCECNMFCLDCMGNAFCSYCLIHHKDHRVLQIRRS 609
           +GP WLKPML+ DYFIPC VHLDSNK ECN FCLDCMGNAFCSYCL++H+DHRVLQIRRS
Sbjct: 1   MGPPWLKPMLRGDYFIPCSVHLDSNKNECNFFCLDCMGNAFCSYCLVNHRDHRVLQIRRS 60

Query: 608 SYHNVVRVNEIQKYIDISCIQTYVINSAKIVFLNERPQPRPGKGVTYTCEICARGLPDSF 429
           SYHNVVRVNEIQKYIDIS IQTYVINSAKIVFLNERPQPRPGKG+T+TCEICARGLPD F
Sbjct: 61  SYHNVVRVNEIQKYIDISSIQTYVINSAKIVFLNERPQPRPGKGITFTCEICARGLPDCF 120

Query: 428 RFCSLGCKLNAM----ERGDIE-LTFSVKAKHN----REMFDESDSIELSTPRKIHKRNV 276
           RFCS+GCKLN M    E GD + +TFS+K K       + FD+SD  E STP+K  +  V
Sbjct: 121 RFCSIGCKLNGMRSEGEEGDADGITFSLKNKSGGGGILDGFDDSD--EFSTPKKALRSGV 178

Query: 275 LNRFWDGPXXXXXXXXXXXXXXXXXXXGVNNLSPATPPLFNHTNSSRRKGIPHRAP 108
            NR  DG                      + + P TPP FNH+NSSRRKG+PHRAP
Sbjct: 179 FNRVLDGGGSSSSGD--------------DCVCPETPPPFNHSNSSRRKGVPHRAP 220


Top