BLASTX nr result

ID: Paeonia23_contig00000816 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00000816
         (2056 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI30576.3| unnamed protein product [Vitis vinifera]              397   e-107
emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]   382   e-103
ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261...   377   e-102
ref|XP_007027055.1| Uncharacterized protein isoform 2 [Theobroma...   293   2e-76
ref|XP_007027054.1| Uncharacterized protein isoform 1 [Theobroma...   293   2e-76
ref|XP_007027057.1| Uncharacterized protein isoform 4 [Theobroma...   287   1e-74
ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292...   257   1e-65
ref|XP_007027056.1| Uncharacterized protein isoform 3 [Theobroma...   253   3e-64
ref|XP_007205809.1| hypothetical protein PRUPE_ppa010713mg [Prun...   223   2e-55
gb|EYU41709.1| hypothetical protein MIMGU_mgv1a025324mg, partial...   221   1e-54
ref|XP_007027058.1| Uncharacterized protein isoform 5 [Theobroma...   187   2e-44
ref|XP_006406611.1| hypothetical protein EUTSA_v10021077mg [Eutr...    63   5e-07
ref|XP_004511250.1| PREDICTED: uncharacterized protein LOC101500...    63   6e-07
ref|XP_006298048.1| hypothetical protein CARUB_v10014092mg [Caps...    63   6e-07
ref|XP_006298047.1| hypothetical protein CARUB_v10014092mg [Caps...    63   6e-07
ref|NP_974333.1| sequence-specific DNA binding transcription fac...    63   6e-07
ref|NP_188467.2| sequence-specific DNA binding transcription fac...    63   6e-07
dbj|BAB01104.1| unnamed protein product [Arabidopsis thaliana]         63   6e-07
ref|NP_001189923.1| sequence-specific DNA binding transcription ...    63   6e-07
ref|XP_002885254.1| sequence-specific DNA binding protein [Arabi...    63   6e-07

>emb|CBI30576.3| unnamed protein product [Vitis vinifera]
          Length = 693

 Score =  397 bits (1020), Expect = e-107
 Identities = 259/593 (43%), Positives = 347/593 (58%), Gaps = 24/593 (4%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLR 1783
            M +GT    VELEAMRK+DSSWHPC+VSLSSTG GLIV+F +QDLE++I ++EE L  LR
Sbjct: 124  MGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQDLEDIISNEEEALARLR 183

Query: 1782 VRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWL 1603
            +RS+PL+G+DC  +E+GE VLATH S FK+L FDA VEKA RVRHS R+ CRCTF+IKWL
Sbjct: 184  IRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFVIKWL 243

Query: 1602 NQDLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMD 1423
            +QDL+  T  VPSSS+M+L+T+SI +HP +AAFL+ +K+L+ S A  FST+F+D+DCE+D
Sbjct: 244  HQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVDCEVD 303

Query: 1422 LNKLVEMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSG 1243
            L+KL+E Q+E I NL  D  +K I ED LFG K D  +QM    VA S ++ SH  VP  
Sbjct: 304  LHKLLEKQIEEISNLA-DASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQVPHE 362

Query: 1242 Q-NXXXXXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALASLVS---K 1075
            Q N          KL+V MEVK+P    SSIQ+ELSE R++L+PLA RAALAS++S   +
Sbjct: 363  QENHFKRSTRSSSKLRVNMEVKDPLPPDSSIQKELSENRAYLSPLASRAALASIMSNLPQ 422

Query: 1074 QLELSNFHEEK-GFAHASDVHAKSRNGTTAFTSID----NNISEDYLSSELGPAXXXXXX 910
            +LE S +HEE+ GFA A D      N T    ++D        +D LSSE+  A      
Sbjct: 423  KLEFSIYHEEENGFACAPD------NITNKHVTMDLLNGTKPVKDKLSSEIEAAFIPAEI 476

Query: 909  XXXXXXKLPKSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVANSSASIASSLTEIKLS 730
                      +T + +S    LVE+ S I      N        ++S S++  + E +L 
Sbjct: 477  FKSLI-----TTEKGASRRPLLVEASSEIANPKSQN--------DASPSLSGLIEERELR 523

Query: 729  QPTNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTSTTARRMTCSAVDQEQG------ 568
            QP   +R T  +++K A     + +MKT AE+ K   + T +R+T SAV +++       
Sbjct: 524  QPAKESRFTSSAIQKHAVSSTSNAEMKTHAEEIK-SVALTNKRLTRSAVHKQEENLAMEV 582

Query: 567  ------NKSAQTSESDSSEGNARXXXXXXXXXXXXXXXXXXXXXXXXDADINFXXXXXXX 406
                  N SAQ  ES+SSEGN                            + N        
Sbjct: 583  KQRSEVNNSAQDIESNSSEGNVTIPDRKAPKKKKPVSLPPAAQSSPVTEERN------KK 636

Query: 405  XXXXSTVETSQHSEGNGASNGG---KKRRKSIPFKNQELRCSPRLKSLGRTRS 256
                S VET+  +EG  + NGG    ++ KS   K QELR SPRL+ L RTRS
Sbjct: 637  RKMPSAVETASKTEGKVSRNGGNSESQKSKSTSSKKQELRFSPRLRFLPRTRS 689


>emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]
          Length = 1508

 Score =  382 bits (981), Expect = e-103
 Identities = 260/624 (41%), Positives = 347/624 (55%), Gaps = 46/624 (7%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLR 1783
            M +GT    VELEAMRK+DSSWHPC+VSLSSTG GLIV+F +QDLE++I ++EE L  LR
Sbjct: 29   MGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQDLEDIISNEEEALARLR 88

Query: 1782 VRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEK--------------------- 1666
            +RS+PL+G+DC  +E+GE VLATH S FK+L FDA VEK                     
Sbjct: 89   IRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKEMSHEFXIECDLIDWGIXVNV 148

Query: 1665 -AQRVRHSKRVYCRCTFLIKWLNQDLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVK 1489
             A RVRHS R+ CRCTF+IKWL+QDL+  T  VPSSS+M+L+T+SI +HP +AAFL+ +K
Sbjct: 149  VALRVRHSTRISCRCTFVIKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIK 208

Query: 1488 SLSFSGASPFSTIFDDMDCEMDLNKLVEMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNK 1309
            +L+ S A  FST+F+D+DCE+DL+KL+E Q+E I NL  D  +K I ED LFG K D  +
Sbjct: 209  TLNCSAAPSFSTVFEDVDCEVDLHKLLEKQIEEISNLA-DASKKEISEDILFGIKADIKE 267

Query: 1308 QMQHKTVAASTVSISHVGVPSGQ-NXXXXXXXXXXKLQVQMEVKEPPSSASSIQEELSEI 1132
            QM    VA S ++ SH  VP  Q N          KL+V MEVK+P    SSIQEELSE 
Sbjct: 268  QMDCSPVAESKITSSHFQVPHEQENHFKRSTRSSSKLRVNMEVKDPLPPDSSIQEELSEN 327

Query: 1131 RSHLTPLACRAALASLVS---KQLELSNFHEEK-GFAHASDVHAKSRNGTTAFTSID--- 973
            R++L+PLA RAALAS++S   ++LE S  HEE+ GFA A D      N T    ++D   
Sbjct: 328  RAYLSPLASRAALASIMSNLPQKLEFSIXHEEENGFACAPD------NITNKHVTMDLLN 381

Query: 972  -NNISEDYLSSELGPAXXXXXXXXXXXXKLPKSTRRYSSVNTGLVESYSGIPTKDMNNRN 796
                 +D LSSE+  A                +T + +S    LVE+ S I      N  
Sbjct: 382  GTKPVKDKLSSEIEAAFIPAEIFKSLI-----TTEKGASRRPLLVEASSEIANPKSQN-- 434

Query: 795  KTAEVANSSASIASSLTEIKLSQPTNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTS 616
                  ++S S++  + E +L QP   +R T  +++K A     + +MKT AE+ K   +
Sbjct: 435  ------DASPSLSGLIEERELRQPAKESRFTSSAIQKHAVSSTSNAEMKTHAEEIK-SVA 487

Query: 615  TTARRMTCSAVDQEQG------------NKSAQTSESDSSEGNARXXXXXXXXXXXXXXX 472
               +R+T SAV +++             N SAQ  ES+SSEGN                 
Sbjct: 488  LXNKRLTRSAVHKQEENLAMEVKQRSEVNNSAQDIESNSSEGNVTIPDRKAPKKKKPVSL 547

Query: 471  XXXXXXXXXDADINFXXXXXXXXXXXSTVETSQHSEGNGASNGG---KKRRKSIPFKNQE 301
                       +              S VET+  +EG  + NGG    ++ KS   K QE
Sbjct: 548  PPAAQTSSPVTE-----ERNKKRKMPSAVETASKTEGKVSRNGGNSESQKSKSTSSKKQE 602

Query: 300  LRCSPRLKSLGRTRSQVE*KDKPN 229
            LR SPRL+ L RTRS      KP+
Sbjct: 603  LRFSPRLRFLPRTRSANNCDSKPH 626


>ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261386 [Vitis vinifera]
          Length = 552

 Score =  377 bits (969), Expect = e-102
 Identities = 233/501 (46%), Positives = 314/501 (62%), Gaps = 21/501 (4%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLR 1783
            M +GT    VELEAMRK+DSSWHPC+VSLSSTG GLIV+F +QDLE++I ++EE L  LR
Sbjct: 1    MGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQDLEDIISNEEEALARLR 60

Query: 1782 VRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWL 1603
            +RS+PL+G+DC  +E+GE VLATH S FK+L FDA VEKA RVRHS R+ CRCTF+IKWL
Sbjct: 61   IRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFVIKWL 120

Query: 1602 NQDLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMD 1423
            +QDL+  T  VPSSS+M+L+T+SI +HP +AAFL+ +K+L+ S A  FST+F+D+DCE+D
Sbjct: 121  HQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVDCEVD 180

Query: 1422 LNKLVEMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSG 1243
            L+KL+E Q+E I NL  D  +K I ED LFG K D  +QM    VA S ++ SH  VP  
Sbjct: 181  LHKLLEKQIEEISNLA-DASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQVPHE 239

Query: 1242 Q-NXXXXXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALASLVS---K 1075
            Q N          KL+V MEVK+P    SSIQ+ELSE R++L+PLA RAALAS++S   +
Sbjct: 240  QENHFKRSTRSSSKLRVNMEVKDPLPPDSSIQKELSENRAYLSPLASRAALASIMSNLPQ 299

Query: 1074 QLELSNFHEEK-GFAHASDVHAKSRNGTTAFTSID----NNISEDYLSSELGPAXXXXXX 910
            +LE S +HEE+ GFA A D      N T    ++D        +D LSSE+  A      
Sbjct: 300  KLEFSIYHEEENGFACAPD------NITNKHVTMDLLNGTKPVKDKLSSEIEAAFIPAEI 353

Query: 909  XXXXXXKLPKSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVANSSASIASSLTEIKLS 730
                      +T + +S    LVE+ S I      N        ++S S++  + E +L 
Sbjct: 354  FKSLI-----TTEKGASRRPLLVEASSEIANPKSQN--------DASPSLSGLIEERELR 400

Query: 729  QPTNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTSTTARRMTCSAVDQEQG------ 568
            QP   +R T  +++K A     + +MKT AE+ K   + T +R+T SAV +++       
Sbjct: 401  QPAKESRFTSSAIQKHAVSSTSNAEMKTHAEEIK-SVALTNKRLTRSAVHKQEENLAMEV 459

Query: 567  ------NKSAQTSESDSSEGN 523
                  N SAQ  ES+SSEGN
Sbjct: 460  KQRSEVNNSAQDIESNSSEGN 480


>ref|XP_007027055.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508715660|gb|EOY07557.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 611

 Score =  293 bits (751), Expect = 2e-76
 Identities = 204/517 (39%), Positives = 275/517 (53%), Gaps = 42/517 (8%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            + ++VELEA RKEDSSWHPC+V LSS+G  LIV F  Q+L++M+L  EEVL  LR RS+P
Sbjct: 7    SDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRSMP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+ DDC  +E+GE VLA   S+FK LF DA V K  RVRHSKR  CRCTF+IKWL+QDLE
Sbjct: 67   LQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKRG-CRCTFMIKWLDQDLE 125

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLNKLV 1408
              TFT+PSSS+M+L+TKSI+ HP I   L+  K    S +SP  TI +  D E+DLNKL+
Sbjct: 126  GQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLL 185

Query: 1407 EMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQNXXX 1228
            + Q+E I NL D   +K IPED  +  K     Q  HK  A S   +    V    N   
Sbjct: 186  QKQIEQISNLAD-ASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVP--AVADHHNHLK 242

Query: 1227 XXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALAS--LVSKQ---LEL 1063
                   KLQ+ +E +       S++E   + RSHL+PLA RAALAS  L +K+   ++L
Sbjct: 243  RTTRSTRKLQINIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKKCLDMDL 302

Query: 1062 SNFHEEKGFAHASDVHAKSRNGTTAFTSIDNNISEDYLSSELGPAXXXXXXXXXXXXKLP 883
            S+       +  + +  K ++ +         +SE   S E+ P                
Sbjct: 303  SS-------SMTASMFMKGKDSSDILAVSIPLVSE--ASHEISPHI-------------- 339

Query: 882  KSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVANSSAS-----------------IAS 754
             ST+  +S      +  S IPTK   N NKT++  N +A                  +A+
Sbjct: 340  -STQGDASCEPQPTKPSSCIPTKGWENENKTSDEINCTAEQRTYSPVKITAESVTSGVAT 398

Query: 753  SLTEIKLSQP-------------TNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTST 613
            S  E+ +S+              T   R+TR + RKGA IPN  VQ+K    DTK + S 
Sbjct: 399  STAELPISRAKKSLVHANFNASSTAPIRLTRSATRKGAVIPNNCVQVKICVNDTKRRMSG 458

Query: 612  TARRMTCSAVDQ-------EQGNKSAQTSESDSSEGN 523
               +++ SAV Q       E+ N S    +SDSSEGN
Sbjct: 459  NKNQLSRSAVFQGNENLANEEENNSTHIIDSDSSEGN 495


>ref|XP_007027054.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508715659|gb|EOY07556.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 567

 Score =  293 bits (751), Expect = 2e-76
 Identities = 204/517 (39%), Positives = 275/517 (53%), Gaps = 42/517 (8%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            + ++VELEA RKEDSSWHPC+V LSS+G  LIV F  Q+L++M+L  EEVL  LR RS+P
Sbjct: 7    SDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRSMP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+ DDC  +E+GE VLA   S+FK LF DA V K  RVRHSKR  CRCTF+IKWL+QDLE
Sbjct: 67   LQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKRG-CRCTFMIKWLDQDLE 125

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLNKLV 1408
              TFT+PSSS+M+L+TKSI+ HP I   L+  K    S +SP  TI +  D E+DLNKL+
Sbjct: 126  GQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLL 185

Query: 1407 EMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQNXXX 1228
            + Q+E I NL D   +K IPED  +  K     Q  HK  A S   +    V    N   
Sbjct: 186  QKQIEQISNLAD-ASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVP--AVADHHNHLK 242

Query: 1227 XXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALAS--LVSKQ---LEL 1063
                   KLQ+ +E +       S++E   + RSHL+PLA RAALAS  L +K+   ++L
Sbjct: 243  RTTRSTRKLQINIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKKCLDMDL 302

Query: 1062 SNFHEEKGFAHASDVHAKSRNGTTAFTSIDNNISEDYLSSELGPAXXXXXXXXXXXXKLP 883
            S+       +  + +  K ++ +         +SE   S E+ P                
Sbjct: 303  SS-------SMTASMFMKGKDSSDILAVSIPLVSE--ASHEISPHI-------------- 339

Query: 882  KSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVANSSAS-----------------IAS 754
             ST+  +S      +  S IPTK   N NKT++  N +A                  +A+
Sbjct: 340  -STQGDASCEPQPTKPSSCIPTKGWENENKTSDEINCTAEQRTYSPVKITAESVTSGVAT 398

Query: 753  SLTEIKLSQP-------------TNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTST 613
            S  E+ +S+              T   R+TR + RKGA IPN  VQ+K    DTK + S 
Sbjct: 399  STAELPISRAKKSLVHANFNASSTAPIRLTRSATRKGAVIPNNCVQVKICVNDTKRRMSG 458

Query: 612  TARRMTCSAVDQ-------EQGNKSAQTSESDSSEGN 523
               +++ SAV Q       E+ N S    +SDSSEGN
Sbjct: 459  NKNQLSRSAVFQGNENLANEEENNSTHIIDSDSSEGN 495


>ref|XP_007027057.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508715662|gb|EOY07559.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 565

 Score =  287 bits (735), Expect = 1e-74
 Identities = 203/517 (39%), Positives = 274/517 (52%), Gaps = 42/517 (8%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            + ++VELEA RKEDSSWHPC+V LSS+G  LIV F  Q+L++M+L  EEVL  LR RS+P
Sbjct: 7    SDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRSMP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+ DDC  +E+GE VLA   S+FK LF DA V    RVRHSKR  CRCTF+IKWL+QDLE
Sbjct: 67   LQVDDCFHIEEGERVLADRKSQFKILFHDAVV--VDRVRHSKRG-CRCTFMIKWLDQDLE 123

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLNKLV 1408
              TFT+PSSS+M+L+TKSI+ HP I   L+  K    S +SP  TI +  D E+DLNKL+
Sbjct: 124  GQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLL 183

Query: 1407 EMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQNXXX 1228
            + Q+E I NL D   +K IPED  +  K     Q  HK  A S   +    V    N   
Sbjct: 184  QKQIEQISNLAD-ASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVP--AVADHHNHLK 240

Query: 1227 XXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALAS--LVSKQ---LEL 1063
                   KLQ+ +E +       S++E   + RSHL+PLA RAALAS  L +K+   ++L
Sbjct: 241  RTTRSTRKLQINIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKKCLDMDL 300

Query: 1062 SNFHEEKGFAHASDVHAKSRNGTTAFTSIDNNISEDYLSSELGPAXXXXXXXXXXXXKLP 883
            S+       +  + +  K ++ +         +SE   S E+ P                
Sbjct: 301  SS-------SMTASMFMKGKDSSDILAVSIPLVSE--ASHEISPHI-------------- 337

Query: 882  KSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVANSSAS-----------------IAS 754
             ST+  +S      +  S IPTK   N NKT++  N +A                  +A+
Sbjct: 338  -STQGDASCEPQPTKPSSCIPTKGWENENKTSDEINCTAEQRTYSPVKITAESVTSGVAT 396

Query: 753  SLTEIKLSQP-------------TNRTRITRISVRKGAEIPNEDVQMKTSAEDTKLKTST 613
            S  E+ +S+              T   R+TR + RKGA IPN  VQ+K    DTK + S 
Sbjct: 397  STAELPISRAKKSLVHANFNASSTAPIRLTRSATRKGAVIPNNCVQVKICVNDTKRRMSG 456

Query: 612  TARRMTCSAVDQ-------EQGNKSAQTSESDSSEGN 523
               +++ SAV Q       E+ N S    +SDSSEGN
Sbjct: 457  NKNQLSRSAVFQGNENLANEEENNSTHIIDSDSSEGN 493


>ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292719 [Fragaria vesca
            subsp. vesca]
          Length = 580

 Score =  257 bits (657), Expect = 1e-65
 Identities = 204/633 (32%), Positives = 293/633 (46%), Gaps = 68/633 (10%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            A++A ELEA+ K+DSSW+PC VSLSST   LIV+F  Q+LE+M+L+ +E L  LR RS P
Sbjct: 7    AENATELEALCKQDSSWYPCHVSLSSTEDSLIVDFGRQELEDMVLNKDEALMRLRFRSGP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+GDDC  +E GEHVLA H S FKS  +DA+VEK  RVRHS RVYCRC+F+I WL+ D +
Sbjct: 67   LQGDDCSHIE-GEHVLAIHKSPFKSYLYDAKVEKVTRVRHSTRVYCRCSFMILWLHPDFK 125

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLNKLV 1408
                T+ SSS+M+L++KSIN HPT+AA  +SVK +    A     + +D+D E DLNKL+
Sbjct: 126  GQMVTITSSSIMKLASKSINSHPTVAALFKSVKQMGLYTAPLLPIMHEDIDVEFDLNKLL 185

Query: 1407 EMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQNXXX 1228
              Q+E I N+  +     I  D + G K D++  +    +  S   +SH      Q+   
Sbjct: 186  GKQIEEI-NISANRVTNEITVDIIEGVKADSSGHVTESKIGTSKAQVSH-----DQDQLK 239

Query: 1227 XXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALASLVSKQLELSNFHE 1048
                    L+V  E ++P     S QEE SE R H++PLA RAALASLVS    L++ H 
Sbjct: 240  SVANRSGNLEVNKEDEDPHPPFLSKQEEHSEHRCHISPLAARAALASLVS----LTHKH- 294

Query: 1047 EKGFAHASDVHAKSRNGTTAFTSIDNNISEDYLSSEL--GPAXXXXXXXXXXXXKLPKST 874
                         + +GT  F S D+      +SS+    P                +  
Sbjct: 295  ------------IAISGTELFKSSDSTDLSIKVSSDRTESPKNGNANLGSGARTTRSRGL 342

Query: 873  RRYSSVNTGLVESYSGIPTKDMNNR----------------------NKTAEVANSSASI 760
            + +   N+ L +S   I  + + NR                      ++ +E A S+ S 
Sbjct: 343  KGFEKQNSDLHDSAEAIKLRAVTNRGWLTRSAVKEEKDISSVASKHGSEESESAQSTESY 402

Query: 759  ASSLTEIKLSQP--TNRTRITRISVRKGAE-----------------IPNEDVQMKTSAE 637
            +S  T+I       T +  I++ +V                      I +  VQ KT A+
Sbjct: 403  SSDGTDIVHGNKVLTKKNGISKKAVSSPLHSESNGHKENLTSGDLGVIQDAYVQTKTCAK 462

Query: 636  DTKLKTSTTARRMT--------------CSAVDQEQ--------GNKSAQTSESDSSEGN 523
            DT    ST  RR+T              C AV++E         G+ S+Q   +   +GN
Sbjct: 463  DTNSSVSTNLRRLTRSRVSCQDNLIVPECHAVEKENRESKKKKAGSASSQNYSTSGEDGN 522

Query: 522  ARXXXXXXXXXXXXXXXXXXXXXXXXDADINFXXXXXXXXXXXSTVETSQHSEGNGASNG 343
             +                                           V  S+ +EG  + +G
Sbjct: 523  RQ--------------------------------------HNSGVVRNSRQTEGKMSGSG 544

Query: 342  GK---KRRKSIPFKNQELRCSPRLKSLGRTRSQ 253
                 ++RKS     QE + SP+L+ L RTRSQ
Sbjct: 545  DNSQGRKRKSNSSSRQEQQFSPQLRFLPRTRSQ 577


>ref|XP_007027056.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508715661|gb|EOY07558.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 409

 Score =  253 bits (645), Expect = 3e-64
 Identities = 144/287 (50%), Positives = 182/287 (63%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            + ++VELEA RKEDSSWHPC+V LSS+G  LIV F  Q+L++M+L  EEVL  LR RS+P
Sbjct: 7    SDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRSMP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+ DDC  +E+GE VLA   S+FK LF DA V K  RVRHSKR  CRCTF+IKWL+QDLE
Sbjct: 67   LQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLE 125

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLNKLV 1408
              TFT+PSSS+M+L+TKSI+ HP I   L+  K    S +SP  TI +  D E+DLNKL+
Sbjct: 126  GQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLL 185

Query: 1407 EMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQNXXX 1228
            + Q+E I NL  D  +K IPED  +  K     Q  HK  A S   +    V    N   
Sbjct: 186  QKQIEQISNLA-DASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVP--AVADHHNHLK 242

Query: 1227 XXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALAS 1087
                   KLQ+ +E +       S++E   + RSHL+PLA RAALAS
Sbjct: 243  RTTRSTRKLQINIEAENQSGHTISMKEAFIQSRSHLSPLASRAALAS 289


>ref|XP_007205809.1| hypothetical protein PRUPE_ppa010713mg [Prunus persica]
            gi|462401451|gb|EMJ07008.1| hypothetical protein
            PRUPE_ppa010713mg [Prunus persica]
          Length = 238

 Score =  223 bits (569), Expect = 2e-55
 Identities = 119/210 (56%), Positives = 151/210 (71%), Gaps = 2/210 (0%)
 Frame = -2

Query: 1947 AQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVRSIP 1768
            A++  ELEAM KEDSSWHPCQVSLSST   LIV+F  Q+LE+M+L+ +E L  LR R  P
Sbjct: 7    AENVTELEAMCKEDSSWHPCQVSLSSTKDSLIVDFGGQELEDMVLNTDEALTRLRFRCAP 66

Query: 1767 LKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQDLE 1588
            L+GDDC  +E GEHVLA + S+ KS FFDA+VEK  RVRHS RVYCRCTF+IKWL+QDL+
Sbjct: 67   LQGDDCTRIE-GEHVLAINKSQSKSHFFDAKVEKVLRVRHSTRVYCRCTFMIKWLHQDLK 125

Query: 1587 KGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGAS--PFSTIFDDMDCEMDLNK 1414
                TVPSSS+M+L+ K+IN+HPT++AFL+SVK +    AS  P     +D   E+DLNK
Sbjct: 126  GQMVTVPSSSIMKLTGKNINVHPTVSAFLKSVKQMGLDSASSVPVMLEVEDFAVELDLNK 185

Query: 1413 LVEMQVEGIGNLVDDVYRKGIPEDTLFGGK 1324
             +E Q+E I  +  + +RK I  D L G K
Sbjct: 186  FLEKQIEDI-TVSANEFRKAITIDILEGVK 214


>gb|EYU41709.1| hypothetical protein MIMGU_mgv1a025324mg, partial [Mimulus guttatus]
          Length = 317

 Score =  221 bits (563), Expect = 1e-54
 Identities = 134/302 (44%), Positives = 185/302 (61%), Gaps = 3/302 (0%)
 Frame = -2

Query: 1956 SGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGLIVEFENQDLENMILDDEEVLKCLRVR 1777
            + T  D V+LEAMRK+  SWHPC+VSL S G+GLI++F +  +E +I D +EV+  +RVR
Sbjct: 8    NSTGNDVVQLEAMRKDSFSWHPCKVSLCSRGLGLILQFGDNYMEEIITDQQEVMARIRVR 67

Query: 1776 SIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRCTFLIKWLNQ 1597
            S PL+GDDC S+ QG+ VLAT +S  KS+F DA VE+A RVRHSKR++CRCTF IKWL+Q
Sbjct: 68   STPLQGDDCSSLRQGDRVLATRSSHAKSVFCDALVEEAMRVRHSKRIHCRCTFKIKWLHQ 127

Query: 1596 DLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLSFSGASPFSTIFDDMDCEMDLN 1417
            +    T TVP+ ++M+LST+SIN+HPTI+ +   ++S +    SP+S   D  + EMD+N
Sbjct: 128  E---ETLTVPAGAIMKLSTESINLHPTISTYFSMLESSNDLDKSPYSIAADITNLEMDIN 184

Query: 1416 KLVEMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQHKTVAASTVSISHVGVPSGQN 1237
             L+E Q+E I N  +    + I +D + G +VD   Q     + AS +    V +P   N
Sbjct: 185  VLLEKQIEEIRNSTN--VSQKISKDFVLGLEVDLGGQSHGWEIDAS-LKEPCVTIPFPNN 241

Query: 1236 XXXXXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHLTPLACRAALASLVS---KQLE 1066
                               E P      QEE +  RS L+PLA RAALASL S   + +E
Sbjct: 242  IKAYTGSGTE--HTAKTTTEIP------QEEFNGSRSLLSPLAARAALASLRSNFPQSVE 293

Query: 1065 LS 1060
            LS
Sbjct: 294  LS 295


>ref|XP_007027058.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508715663|gb|EOY07560.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 468

 Score =  187 bits (475), Expect = 2e-44
 Identities = 148/421 (35%), Positives = 205/421 (48%), Gaps = 42/421 (9%)
 Frame = -2

Query: 1659 RVRHSKRVYCRCTFLIKWLNQDLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLS 1480
            RVRHSKR  CRCTF+IKWL+QDLE  TFT+PSSS+M+L+TKSI+ HP I   L+  K   
Sbjct: 4    RVRHSKRG-CRCTFMIKWLDQDLEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRG 62

Query: 1479 FSGASPFSTIFDDMDCEMDLNKLVEMQVEGIGNLVDDVYRKGIPEDTLFGGKVDTNKQMQ 1300
             S +SP  TI +  D E+DLNKL++ Q+E I NL D   +K IPED  +  K     Q  
Sbjct: 63   LSYSSPLLTILEGTDSEIDLNKLLQKQIEQISNLAD-ASKKDIPEDIPWRNKGVNKGQSP 121

Query: 1299 HKTVAASTVSISHVGVPSGQNXXXXXXXXXXKLQVQMEVKEPPSSASSIQEELSEIRSHL 1120
            HK  A S   +    V    N          KLQ+ +E +       S++E   + RSHL
Sbjct: 122  HKPTAESNACVP--AVADHHNHLKRTTRSTRKLQINIEAENQSGHTISMKEAFIQSRSHL 179

Query: 1119 TPLACRAALAS--LVSKQ---LELSNFHEEKGFAHASDVHAKSRNGTTAFTSIDNNISED 955
            +PLA RAALAS  L +K+   ++LS+       +  + +  K ++ +         +SE 
Sbjct: 180  SPLASRAALASSLLTAKKCLDMDLSS-------SMTASMFMKGKDSSDILAVSIPLVSE- 231

Query: 954  YLSSELGPAXXXXXXXXXXXXKLPKSTRRYSSVNTGLVESYSGIPTKDMNNRNKTAEVAN 775
              S E+ P                 ST+  +S      +  S IPTK   N NKT++  N
Sbjct: 232  -ASHEISPHI---------------STQGDASCEPQPTKPSSCIPTKGWENENKTSDEIN 275

Query: 774  SSAS-----------------IASSLTEIKLSQP-------------TNRTRITRISVRK 685
             +A                  +A+S  E+ +S+              T   R+TR + RK
Sbjct: 276  CTAEQRTYSPVKITAESVTSGVATSTAELPISRAKKSLVHANFNASSTAPIRLTRSATRK 335

Query: 684  GAEIPNEDVQMKTSAEDTKLKTSTTARRMTCSAVDQ-------EQGNKSAQTSESDSSEG 526
            GA IPN  VQ+K    DTK + S    +++ SAV Q       E+ N S    +SDSSEG
Sbjct: 336  GAVIPNNCVQVKICVNDTKRRMSGNKNQLSRSAVFQGNENLANEEENNSTHIIDSDSSEG 395

Query: 525  N 523
            N
Sbjct: 396  N 396


>ref|XP_006406611.1| hypothetical protein EUTSA_v10021077mg [Eutrema salsugineum]
            gi|557107757|gb|ESQ48064.1| hypothetical protein
            EUTSA_v10021077mg [Eutrema salsugineum]
          Length = 341

 Score = 63.2 bits (152), Expect = 5e-07
 Identities = 45/144 (31%), Positives = 70/144 (48%), Gaps = 7/144 (4%)
 Frame = -2

Query: 1995 PVEFQTFSPL*MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQD 1831
            P      +P  M SG     +E EA    D +W+  Q  L+   + +      V F   +
Sbjct: 125  PAPSDILAPGVMRSGPDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFE 184

Query: 1830 LENMILDDE--EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQR 1657
            +E    +DE   V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR
Sbjct: 185  VE----EDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQR 240

Query: 1656 VRHSKRVYCRCTFLIKWLNQDLEK 1585
             RH  R  CRC FL+++ +   E+
Sbjct: 241  RRHDVR-GCRCRFLVRYSHDQSEE 263


>ref|XP_004511250.1| PREDICTED: uncharacterized protein LOC101500707 [Cicer arietinum]
          Length = 377

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 54/186 (29%), Positives = 85/186 (45%), Gaps = 8/186 (4%)
 Frame = -2

Query: 1995 PVEFQTFSPL*MASGTAQDAV-ELEAMRKEDSSWHPCQVSLS-----STGVGLIVEFENQ 1834
            P+   + S    A GT +++V E EA    D +W+     LS     S+   ++V F   
Sbjct: 108  PIXAPSVSVQTTAKGTPENSVMEFEAKSGRDGAWYDVANFLSYRHLESSDPEVLVRFAGF 167

Query: 1833 DLENMILDDE--EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQ 1660
              E    +DE   V K +R RS+P +  +C +V  G+ +L     + ++L+FDA V  AQ
Sbjct: 168  GSE----EDEWINVRKNVRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQ 223

Query: 1659 RVRHSKRVYCRCTFLIKWLNQDLEKGTFTVPSSSLMRLSTKSINIHPTIAAFLESVKSLS 1480
            R RH  R  CRC FL+++   D ++    VP   + R       +H   A    +  +  
Sbjct: 224  RRRHDVR-GCRCRFLVRY---DHDQSEEIVPLRKVCRRPETDYRLHQLHAVHDSAAPADQ 279

Query: 1479 FSGASP 1462
             SG  P
Sbjct: 280  KSGMDP 285


>ref|XP_006298048.1| hypothetical protein CARUB_v10014092mg [Capsella rubella]
            gi|482566757|gb|EOA30946.1| hypothetical protein
            CARUB_v10014092mg [Capsella rubella]
          Length = 346

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 131  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 186

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 187  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 245

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 246  RFLVRYSHDQSEQ 258


>ref|XP_006298047.1| hypothetical protein CARUB_v10014092mg [Capsella rubella]
            gi|482566756|gb|EOA30945.1| hypothetical protein
            CARUB_v10014092mg [Capsella rubella]
          Length = 345

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 131  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 186

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 187  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 245

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 246  RFLVRYSHDQSEE 258


>ref|NP_974333.1| sequence-specific DNA binding transcription factor [Arabidopsis
            thaliana] gi|332642568|gb|AEE76089.1| sequence-specific
            DNA binding transcription factor [Arabidopsis thaliana]
          Length = 349

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 134  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 189

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 190  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 248

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 249  RFLVRYSHDQSEQ 261


>ref|NP_188467.2| sequence-specific DNA binding transcription factor [Arabidopsis
            thaliana] gi|75330703|sp|Q8RWJ7.1|SHH2_ARATH RecName:
            Full=Protein SAWADEE HOMEODOMAIN HOMOLOG 2; AltName:
            Full=Probable DNA-binding transcription factor 2
            gi|20260286|gb|AAM13041.1| unknown protein [Arabidopsis
            thaliana] gi|28059773|gb|AAO30091.1| unknown protein
            [Arabidopsis thaliana] gi|332642567|gb|AEE76088.1|
            sequence-specific DNA binding transcription factor
            [Arabidopsis thaliana]
          Length = 348

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 134  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 189

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 190  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 248

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 249  RFLVRYSHDQSEE 261


>dbj|BAB01104.1| unnamed protein product [Arabidopsis thaliana]
          Length = 323

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 109  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 164

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 165  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 223

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 224  RFLVRYSHDQSEE 236


>ref|NP_001189923.1| sequence-specific DNA binding transcription factor [Arabidopsis
            thaliana] gi|332642569|gb|AEE76090.1| sequence-specific
            DNA binding transcription factor [Arabidopsis thaliana]
          Length = 346

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 131  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 186

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 187  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 245

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 246  RFLVRYSHDQSEQ 258


>ref|XP_002885254.1| sequence-specific DNA binding protein [Arabidopsis lyrata subsp.
            lyrata] gi|297331094|gb|EFH61513.1| sequence-specific DNA
            binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 349

 Score = 62.8 bits (151), Expect = 6e-07
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 7/133 (5%)
 Frame = -2

Query: 1962 MASGTAQDAVELEAMRKEDSSWHPCQVSLSSTGVGL-----IVEFENQDLENMILDDE-- 1804
            M SG+    +E EA    D +W+  Q  L+   + +      V F   ++E    +DE  
Sbjct: 134  MRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVE----EDEWI 189

Query: 1803 EVLKCLRVRSIPLKGDDCCSVEQGEHVLATHNSRFKSLFFDAEVEKAQRVRHSKRVYCRC 1624
             V K +R RS+P +  +C +V  G+ VL     + ++L+FDA V  AQR RH  R  CRC
Sbjct: 190  NVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVR-GCRC 248

Query: 1623 TFLIKWLNQDLEK 1585
             FL+++ +   E+
Sbjct: 249  RFLVRYSHDQSEQ 261


Top