BLASTX nr result

ID: Achyranthes22_contig00016389 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00016389
         (1926 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268...   404   e-110
emb|CBI27315.3| unnamed protein product [Vitis vinifera]              402   e-109
ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628...   374   e-100
gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus pe...   351   6e-94
ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Popu...   350   1e-93
gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis]     347   8e-93
ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, part...   344   7e-92
ref|XP_002512056.1| conserved hypothetical protein [Ricinus comm...   342   3e-91
gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative...   337   8e-90
gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative...   335   5e-89
ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802...   322   3e-85
ref|XP_006599178.1| PREDICTED: uncharacterized protein LOC100802...   322   4e-85
gb|ESW28388.1| hypothetical protein PHAVU_003G282800g [Phaseolus...   322   5e-85
ref|XP_004509752.1| PREDICTED: uncharacterized protein LOC101515...   320   1e-84
ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305...   317   9e-84
ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp....   310   1e-81
ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutr...   309   3e-81
gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thal...   306   2e-80
gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis ...   306   2e-80
ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana] g...   306   2e-80

>ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268093 [Vitis vinifera]
          Length = 1242

 Score =  404 bits (1037), Expect = e-110
 Identities = 207/479 (43%), Positives = 294/479 (61%), Gaps = 4/479 (0%)
 Frame = +2

Query: 86   VHVRHSDGYIEKDMTGNQNSDLMYQSVDQGRDS----SRLHIREEKTATSSSCEEKQELN 253
            + V     YI+K +   QN   + +SV +G  S    +  +  E +  T+     K E+ 
Sbjct: 760  LRVESCQAYIDKKLVEQQNLVKLNRSVQKGGTSFGENNMSNAEEVQAGTNLKAHIKMEVK 819

Query: 254  DEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQG 433
             +++G  + +GCY HPM + SV+L     EI+ICV CG L D+   LF+Y +T KE    
Sbjct: 820  HDLVGNTELVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQ 879

Query: 434  NPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCP 613
            +P  VG+T + LP+LKD  G EVA+D+  LQ TPDG+ LVL++SI+ PYCRE+ + CLC 
Sbjct: 880  SPTFVGYTPIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCS 939

Query: 614  QCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMN 793
             C+  CFE+NA+KIV +KLG++S+V KL T   V C+LVCEP HL+A++ESGR+++W+MN
Sbjct: 940  ACKLECFEENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMN 999

Query: 794  STWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXX 973
            STWSVQTE ++IP+Y+ +   IV+LKRIPK A +VVGH+G+GEFSLWDI +R        
Sbjct: 1000 STWSVQTEDFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAM 1059

Query: 974  XXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDV 1153
                  +F+P+SLFS+ S+         +  + K+  AT  WFS+H+E Y   P+ GE +
Sbjct: 1060 PSISIFEFIPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSKHNENYTFLPLGGESI 1119

Query: 1154 AVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXX 1333
            AVWL + T SDS  Q     ++      G W L L++K++VILG+ALD            
Sbjct: 1120 AVWLLVSTLSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGH 1179

Query: 1334 XXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVYLH 1510
                     VY+W+L+ G+K  +LH  +G  +  IATDD  S V A+A DGGQL VYLH
Sbjct: 1180 GIIGTHDGLVYMWELSTGTKLGSLHYFKG-GVSCIATDDSRSDVFAVAGDGGQLLVYLH 1237


>emb|CBI27315.3| unnamed protein product [Vitis vinifera]
          Length = 1177

 Score =  402 bits (1034), Expect = e-109
 Identities = 206/471 (43%), Positives = 292/471 (61%), Gaps = 4/471 (0%)
 Frame = +2

Query: 110  YIEKDMTGNQNSDLMYQSVDQGRDS----SRLHIREEKTATSSSCEEKQELNDEIIGTMK 277
            YI+K +   QN   + +SV +G  S    +  +  E +  T+     K E+  +++G  +
Sbjct: 703  YIDKKLVEQQNLVKLNRSVQKGGTSFGENNMSNAEEVQAGTNLKAHIKMEVKHDLVGNTE 762

Query: 278  FIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHT 457
             +GCY HPM + SV+L     EI+ICV CG L D+   LF+Y +T KE    +P  VG+T
Sbjct: 763  LVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQSPTFVGYT 822

Query: 458  SMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFE 637
             + LP+LKD  G EVA+D+  LQ TPDG+ LVL++SI+ PYCRE+ + CLC  C+  CFE
Sbjct: 823  PIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCSACKLECFE 882

Query: 638  KNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTE 817
            +NA+KIV +KLG++S+V KL T   V C+LVCEP HL+A++ESGR+++W+MNSTWSVQTE
Sbjct: 883  ENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMNSTWSVQTE 942

Query: 818  LYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDF 997
             ++IP+Y+ +   IV+LKRIPK A +VVGH+G+GEFSLWDI +R              +F
Sbjct: 943  DFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAMPSISIFEF 1002

Query: 998  LPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRT 1177
            +P+SLFS+ S+         +  + K+  AT  WFS+H+E Y   P+ GE +AVWL + T
Sbjct: 1003 IPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSKHNENYTFLPLGGESIAVWLLVST 1062

Query: 1178 SSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXX 1357
             SDS  Q     ++      G W L L++K++VILG+ALD                    
Sbjct: 1063 LSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGHGIIGTHDG 1122

Query: 1358 QVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVYLH 1510
             VY+W+L+ G+K  +LH  +G  +  IATDD  S V A+A DGGQL VYLH
Sbjct: 1123 LVYMWELSTGTKLGSLHYFKG-GVSCIATDDSRSDVFAVAGDGGQLLVYLH 1172


>ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628159 [Citrus sinensis]
          Length = 1252

 Score =  374 bits (959), Expect = e-100
 Identities = 200/461 (43%), Positives = 270/461 (58%), Gaps = 4/461 (0%)
 Frame = +2

Query: 140  NSDLMYQSVD-QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISS 316
            NS ++ Q  +  G + +  + +E + ++    ++  E  +E+ GT   +GCY  P+ I S
Sbjct: 788  NSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLMGCYFFPLPILS 847

Query: 317  VMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGR 496
            V+L     +IY+CVSCG L D+KR LF+YT+  +E   GNP  VGHTS+ LP LKD FGR
Sbjct: 848  VLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSVMLPFLKDNFGR 907

Query: 497  EVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGY 676
            E+A+++S    TPDG+ LVL+DS++ PYCRE    CLC  C S   ++NAVKIV VK GY
Sbjct: 908  EIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDENAVKIVKVKPGY 967

Query: 677  VSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSR 856
            VS+V KL T   V C+LVCEP+HLIA+ ESG++++W MNS+WS Q E  +IP  + +   
Sbjct: 968  VSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEECIIPINDCIYPC 1027

Query: 857  IVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGL 1036
            IV++KRIPK A +VVGHNG+GEF +WDI KR               F P++LFSW   G 
Sbjct: 1028 IVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFPINLFSWQRNG- 1086

Query: 1037 TKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSS 1216
                V  +  +     AT S FS+HSE    CP  GED A+WL + T SDS  Q  C S 
Sbjct: 1087 ---SVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDAQHNCMSR 1143

Query: 1217 NNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKR 1396
            +        W L L++K+ VILG+ LD                     VY W+L+ G+K 
Sbjct: 1144 DCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWELSSGNKL 1203

Query: 1397 EALHDLEGHTILRIATDDLTSSVVAIA---RDGGQLCVYLH 1510
              LH  +G T+  IATDD     +A+A    DGGQL VYLH
Sbjct: 1204 GILHHFKGGTVSCIATDDSGLQALAVAGDGPDGGQLLVYLH 1244


>gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus persica]
          Length = 1170

 Score =  351 bits (901), Expect = 6e-94
 Identities = 195/477 (40%), Positives = 277/477 (58%), Gaps = 3/477 (0%)
 Frame = +2

Query: 92   VRHSDGYIEKDMTGNQNS-DLMYQSVDQGRDSSRLHIREEKTATSSSCEEKQELNDEIIG 268
            V   + +++KD+ G++N  +       Q + +  +H       +S S     ELN+E+ G
Sbjct: 699  VSRLENHVDKDVVGHENLLEPNDTETSQKQGTGLMHDPNSVPHSSDSKPHSMELNNELTG 758

Query: 269  TMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMV 448
            +++F+G Y H   + SV+L     EIY+CV CG L D+   LF+Y +  +E   G P  V
Sbjct: 759  SLEFVGRYSHQNPVLSVLLSAKGTEIYVCVLCGPLVDKDGSLFIYKVAIEEPRVGCPSFV 818

Query: 449  GHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESF 628
            GHTS++LP  KD FGR +A+++S+LQ TPDG+ LVL+DSI+ PYCR+ ++HCLC  C S 
Sbjct: 819  GHTSVTLPIRKDYFGR-IALERSSLQFTPDGQYLVLLDSIKTPYCRQGSIHCLCSTCTSN 877

Query: 629  CFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSV 808
            C E+N VKIV V+LGYVS V  L     + C+LVCEP +L+A+ ESGR+++W+MNSTWS 
Sbjct: 878  CSEENTVKIVQVRLGYVSKVASLKAVDSLECILVCEPNNLVAVGESGRLHLWVMNSTWSA 937

Query: 809  QTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXX 988
            Q E +V+P+ + +   IV+LKRIP    +VVGHNG+GEFSLWDI K              
Sbjct: 938  QIENFVLPAEDCISPGIVELKRIPNCTHIVVGHNGFGEFSLWDISKCILVSRFSAASSSI 997

Query: 989  LDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATM-SWFSRHSEAYISCPVEGEDVAVWL 1165
              F+PVSLF+W  K         E  + +L  AT  + FS          +EGED+AVWL
Sbjct: 998  CQFVPVSLFTWRIKCPVSSYSDIEEHINELVAATSNNQFS----------LEGEDIAVWL 1047

Query: 1166 FIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXX 1345
             + +SSDS  Q    S +      G W L LM+K++VI G+ALD                
Sbjct: 1048 LVSSSSDSDAQQDYVSDDCDSNPMGRWRLALMVKNMVIFGSALDPRAAVIGASAGQGICG 1107

Query: 1346 XXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDG-GQLCVYLHT 1513
                 VY+W+L+ G+K  A+H  +G ++  IATDD   S  A+A  G  QL V+LH+
Sbjct: 1108 TCDGLVYMWELSTGNKFGAMHHFKGGSVSCIATDDSRPSPGAVAVAGDNQLLVFLHS 1164


>ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Populus trichocarpa]
            gi|222852110|gb|EEE89657.1| hypothetical protein
            POPTR_0008s09730g [Populus trichocarpa]
          Length = 1312

 Score =  350 bits (898), Expect = 1e-93
 Identities = 176/436 (40%), Positives = 255/436 (58%)
 Frame = +2

Query: 197  IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376
            ++E +T +        + N+E+ G  + +GCY HPM + S+++     EI +C  CG L 
Sbjct: 872  VKEVQTNSDLKLHRNLKHNNELEGNFELVGCYLHPMPVLSLLVVTKGDEINVCALCGHLV 931

Query: 377  DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556
            D+ R LFLY L  +E+  GNP  VGHTS++ P   D FGRE A+++S LQLTPDG+ LVL
Sbjct: 932  DKNRTLFLYKLAIEETRTGNPSFVGHTSVTFPFSTDIFGRETALERSGLQLTPDGQNLVL 991

Query: 557  VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736
            + S++ PYCRE    CLC  C   C E++ VKIV VK GYVS++VKL+T   + C+LVCE
Sbjct: 992  LGSMKTPYCREGRTDCLCSTCSLNCSEQSTVKIVQVKTGYVSVLVKLSTFDSMQCILVCE 1051

Query: 737  PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916
            P HLIA  ESGR+++W MNS WS  TE ++I + + +   IV+LKR+P  AS+VVG+NG+
Sbjct: 1052 PNHLIAAGESGRLHLWTMNSAWSAPTEEFIISANDCISPCIVELKRVPNCASVVVGNNGF 1111

Query: 917  GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096
            GEF++WD+ +R               F P+S F+W            E  +  + +AT  
Sbjct: 1112 GEFTVWDVSRRMFMARVSSPSASACQFFPISSFTWQRVVHGFHYSTVEEQIDGIVDATKL 1171

Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276
            WFS +SE Y   P++GED+A+WL + T  +   Q    SS+  +   G W L L++K+++
Sbjct: 1172 WFSENSEYYSLPPLDGEDIAIWLLVSTIPELDTQEDYISSDCGINPVGWWRLALLVKNML 1231

Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456
            ILG ALD                     VY+W+   G++   LH  EG ++  IATD+  
Sbjct: 1232 ILGKALDPRAAAIGSSSGNGIIGTFDGLVYMWEFTTGTRLGTLHHFEGESVSCIATDNSK 1291

Query: 1457 SSVVAIARDGGQLCVY 1504
              V+++A D GQL VY
Sbjct: 1292 PGVISVAGDKGQLLVY 1307


>gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis]
          Length = 1147

 Score =  347 bits (891), Expect = 8e-93
 Identities = 182/427 (42%), Positives = 251/427 (58%)
 Frame = +2

Query: 209  KTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKR 388
            +T    S +EK    D  +G  + IGCY HP+ + S+++     +I+ICV CG   ++ R
Sbjct: 709  ETVEMGSSDEKSHTKD--LGLGELIGCYLHPLPVLSLLVCTTGEDIHICVLCGLRVNKDR 766

Query: 389  DLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSI 568
             LF+Y + T+E   G P  VGHTS++LPSLKD FG+E+A+++S LQ TP G+ LVL+D I
Sbjct: 767  TLFIYKIATQEPRVGYPSFVGHTSVTLPSLKDYFGKEIALERSGLQYTPGGQYLVLLDCI 826

Query: 569  RMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHL 748
            R PYCR+  + CLCP C S  FE++AVKIV VKLGYVS+VVKL T   + CVLVCEP HL
Sbjct: 827  RTPYCRQGTIPCLCPACASGSFEEDAVKIVEVKLGYVSVVVKLKTLESLQCVLVCEPNHL 886

Query: 749  IALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFS 928
            +A+ ESGR+++W+MN  WS QTE +++P+ + +   IV+LKRIPK   +VVGHNG+GEFS
Sbjct: 887  VAVGESGRLHLWVMNPAWSAQTEQFILPANDLVSPGIVELKRIPKCVRLVVGHNGFGEFS 946

Query: 929  LWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSR 1108
            L                    +F PV+LF W  KG +       G + ++  AT  WFS 
Sbjct: 947  L-------------------CEFFPVALFGWKKKGHSFGDCNVHGHVNRMMAATNMWFSE 987

Query: 1109 HSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGA 1288
             +    S P+  E++AVWL +   SDS       S + H  S G W L L++K++VILG 
Sbjct: 988  QTND-DSLPLLEEEIAVWLLVSVPSDSDDHHDYTSGDYHTKSVGWWRLALLVKNMVILGG 1046

Query: 1289 ALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVV 1468
            ALD                     VYIW+++ G+K   LH   G ++  IATDD     V
Sbjct: 1047 ALDPSAEAIGASAGHGIIGTCDGLVYIWEMSTGTKLGTLHHFRGSSVSCIATDDSKKGAV 1106

Query: 1469 AIARDGG 1489
            AI+   G
Sbjct: 1107 AISGGEG 1113


>ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, partial [Citrus clementina]
            gi|557540080|gb|ESR51124.1| hypothetical protein
            CICLE_v10033741mg, partial [Citrus clementina]
          Length = 1177

 Score =  344 bits (883), Expect = 7e-92
 Identities = 181/424 (42%), Positives = 247/424 (58%), Gaps = 1/424 (0%)
 Frame = +2

Query: 140  NSDLMYQSVD-QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISS 316
            NS ++ Q  +  G + +  + +E + ++    ++  E  +E+ GT   +GCY  P+ I S
Sbjct: 755  NSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLMGCYFFPLPILS 814

Query: 317  VMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGR 496
            V+L     +IY+CVSCG L D+KR LF+YT+  +E   GNP  VGHTS+ LP LKD FGR
Sbjct: 815  VLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSVMLPFLKDNFGR 874

Query: 497  EVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGY 676
            E+A+++S    TPDG+ LVL+DS++ PYCRE    CLC  C S   ++NAVKIV V  GY
Sbjct: 875  EIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDENAVKIVKVNPGY 934

Query: 677  VSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSR 856
            VS+V KL T   V C+LVCEP+HLIA+ ESG++++W MNS+WS Q E  +IP  + +   
Sbjct: 935  VSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEECIIPINDCIYPC 994

Query: 857  IVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGL 1036
            IV++KRIPK A +VVGHNG+GEF +WDI KR               F P++LFSW   G 
Sbjct: 995  IVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFPINLFSWQRNG- 1053

Query: 1037 TKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSS 1216
                V  +  +     AT S FS+HSE    CP  GED A+WL + T SDS  Q  C S 
Sbjct: 1054 ---SVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDAQHNCMSR 1110

Query: 1217 NNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKR 1396
            +        W L L++K+ VILG+ LD                     VY W+L+ G+K 
Sbjct: 1111 DCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWELSSGNKL 1170

Query: 1397 EALH 1408
              LH
Sbjct: 1171 GILH 1174


>ref|XP_002512056.1| conserved hypothetical protein [Ricinus communis]
            gi|223549236|gb|EEF50725.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1246

 Score =  342 bits (877), Expect = 3e-91
 Identities = 182/419 (43%), Positives = 252/419 (60%), Gaps = 2/419 (0%)
 Frame = +2

Query: 245  ELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKES 424
            EL +E+ G ++F+GCY HPM + S++++R  +EIYICV CG L ++ R LFLY L  +  
Sbjct: 751  ELTNELDGIVEFLGCYFHPMPVLSLLVRRKGNEIYICVLCGLLVEKDRTLFLYKLAIEGP 810

Query: 425  SQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHC 604
              G PC +GHTS++ PS    FGRE++ ++S LQLTPDG+ LVL+ S R P CRE  L C
Sbjct: 811  RIGCPCFIGHTSVTWPSSTGIFGREISFERSGLQLTPDGQCLVLLGSTRAPCCREGRLEC 870

Query: 605  LCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIW 784
            LC  C S CF  N VKIV VK GYVS++VKL T   + C+LVCEP+HL+A  E+ R+++W
Sbjct: 871  LCSACASDCFGSNGVKIVQVKAGYVSVLVKLKTNDSLQCILVCEPDHLVAAGENSRLHLW 930

Query: 785  IMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXX 964
             MNS WS  TE + I S ++    I++LKRIPK  S+V+GH+G+GEF+LWDI KR     
Sbjct: 931  TMNSVWSAPTEEFTIQSNDYTSPCIMELKRIPKCTSLVIGHDGFGEFTLWDISKRIFVSK 990

Query: 965  XXXXXXXXLDFLPVSLFSWSSK--GLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPV 1138
                      F P+SLF W  +  GL+   V  E  + +L +AT   FS HS   I+  +
Sbjct: 991  FSSPSNSVHQFSPISLFHWQREVHGLSYSNV--EAHVNRLMDAT-KMFSGHS---INHSL 1044

Query: 1139 EGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXX 1318
              ED+A+W  + T+ DS       SS++ +   G W L L++K+ +ILG+ALD       
Sbjct: 1045 PHEDIAIWFLVSTAPDSDALHDYGSSHSQINPVGYWRLALLMKNSLILGSALDPRAAAIG 1104

Query: 1319 XXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQL 1495
                          VY+W+L  G K   LH  +G +   IATDD  S V+AIA D G++
Sbjct: 1105 TSAGHGIIGTLDGLVYMWELLTGKKLGTLHKFKGGSASCIATDDSGSGVLAIADDKGEI 1163


>gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative isoform 1
            [Theobroma cacao]
          Length = 1329

 Score =  337 bits (865), Expect = 8e-90
 Identities = 186/455 (40%), Positives = 258/455 (56%), Gaps = 10/455 (2%)
 Frame = +2

Query: 179  DSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICV 358
            D++R   RE + ++  +     ELN ++ G +  +G Y HP+ ISSV L    +EI+ICV
Sbjct: 873  DTNRSKAREVQGSSDVNHCRDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICV 932

Query: 359  SCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSA------ 520
             CG L D+ R LFLY ++ +E S G P  VG+TS++L   +  FG  +  + SA      
Sbjct: 933  LCGLLVDKDRTLFLYRVSIEEPSIGCPSFVGYTSVTLTFSEVSFGGRICCNSSAIFIIDS 992

Query: 521  ----LQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIV 688
                LQ TPDG+ LVL+D I+ PYCRE  + C+C  C S C  +N VKIV V  GYVS+V
Sbjct: 993  ERCGLQFTPDGQCLVLLDGIKTPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLV 1052

Query: 689  VKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDL 868
             KL T   V C+LVCE  +L+A   SGR+++W+MNSTWS  TE +++P+ + L   +V+L
Sbjct: 1053 AKLETVESVQCILVCENNYLVAAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVEL 1112

Query: 869  KRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDK 1048
            KRIPK A +V+GHNG GEF +WDI+KR               FLP+SLFSW       D 
Sbjct: 1113 KRIPKCARLVIGHNGIGEFVVWDILKRLILSRFSASGNPIKQFLPISLFSWQPVFSYAD- 1171

Query: 1049 VITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHL 1228
                G + ++   T   FS H + +   P+EGED+A+WL + T SD   Q     SN   
Sbjct: 1172 --MNGRIDEIFTTTKILFSEHKDCFFP-PLEGEDIALWLLLSTVSDFEDQYERLPSNCQA 1228

Query: 1229 ISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALH 1408
                 W L L++KD VILG+ LD                     VY+W+L+ G++   LH
Sbjct: 1229 NPARSWRLALLVKDRVILGSTLDPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLH 1288

Query: 1409 DLEGHTILRIATDDLTSSVVAIARDGGQLCVYLHT 1513
              +G ++  IATDDL   VVA+A D GQL +YLH+
Sbjct: 1289 HFKGGSVSCIATDDLRPDVVAVAADDGQLLIYLHS 1323


>gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative isoform 2
            [Theobroma cacao] gi|508709744|gb|EOY01641.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            2 [Theobroma cacao]
          Length = 1128

 Score =  335 bits (858), Expect = 5e-89
 Identities = 183/445 (41%), Positives = 255/445 (57%)
 Frame = +2

Query: 179  DSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICV 358
            D++R   RE + ++  +     ELN ++ G +  +G Y HP+ ISSV L    +EI+ICV
Sbjct: 688  DTNRSKAREVQGSSDVNHCRDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICV 747

Query: 359  SCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPD 538
             CG L D+ R LFLY ++ +E S G P  VG+TS++L         E+  ++  LQ TPD
Sbjct: 748  LCGLLVDKDRTLFLYRVSIEEPSIGCPSFVGYTSVTLTF------SEIDSERCGLQFTPD 801

Query: 539  GRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVH 718
            G+ LVL+D I+ PYCRE  + C+C  C S C  +N VKIV V  GYVS+V KL T   V 
Sbjct: 802  GQCLVLLDGIKTPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQ 861

Query: 719  CVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMV 898
            C+LVCE  +L+A   SGR+++W+MNSTWS  TE +++P+ + L   +V+LKRIPK A +V
Sbjct: 862  CILVCENNYLVAAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLV 921

Query: 899  VGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKL 1078
            +GHNG GEF +WDI+KR               FLP+SLFSW       D     G + ++
Sbjct: 922  IGHNGIGEFVVWDILKRLILSRFSASGNPIKQFLPISLFSWQPVFSYAD---MNGRIDEI 978

Query: 1079 QEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGL 1258
               T   FS H + +   P+EGED+A+WL + T SD   Q     SN        W L L
Sbjct: 979  FTTTKILFSEHKDCFFP-PLEGEDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLAL 1037

Query: 1259 MIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRI 1438
            ++KD VILG+ LD                     VY+W+L+ G++   LH  +G ++  I
Sbjct: 1038 LVKDRVILGSTLDPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCI 1097

Query: 1439 ATDDLTSSVVAIARDGGQLCVYLHT 1513
            ATDDL   VVA+A D GQL +YLH+
Sbjct: 1098 ATDDLRPDVVAVAADDGQLLIYLHS 1122


>ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802319 isoform X2 [Glycine
            max]
          Length = 1115

 Score =  322 bits (826), Expect = 3e-85
 Identities = 194/550 (35%), Positives = 289/550 (52%), Gaps = 43/550 (7%)
 Frame = +2

Query: 2    LSQNCE--FQEKTIDFTINKKDLQIVE---GCYVHVRHSDGYI-----------EKDMTG 133
            + QNC+    E  +D  ++ KDL I E      +HV+ +  ++            +D TG
Sbjct: 566  MPQNCDVCIPESVLD-DMSPKDLIIYERSDDACLHVKENPAHVFLSSVQKDLPTAQDFTG 624

Query: 134  NQNSDLMYQS--------------VDQGRDSSR---LHIREEKTATSSSCE--------E 238
            +  + L  Q+              VD    SS+   L   E K   +   +        +
Sbjct: 625  DDTAGLCVQTPQIRSDVLGGHSNLVDPNPTSSQNLTLFADENKCFGTKEVQLISEPMPLQ 684

Query: 239  KQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTK 418
             QEL + +  ++KF+G Y HPM +SS+ L     EI++CV CG L  + R LF Y +   
Sbjct: 685  NQELKNNLGSSVKFVGRYLHPMPVSSLFLSTREDEIHVCVLCGYLTGQYRTLFTYKVAIA 744

Query: 419  ESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNL 598
            E + G P ++ H+S+ LP  K  F +E  V++S +QLTP G+ +VL+ SI+ P CRE  +
Sbjct: 745  EPTLGCPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYVVLIGSIKTPNCREGKI 804

Query: 599  HCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIY 778
             C C  C+S C EKNA+KIV V+ GYVS+V  L T   VHC+LVCEP  L+++ ESG++ 
Sbjct: 805  DCHCSTCKSVCSEKNALKIVQVEHGYVSVVTTLETVDNVHCILVCEPNRLVSVGESGKLQ 864

Query: 779  IWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXX 958
            +W+MNS WS + E ++IP+   +   I++LKR+PK   +VVGHN  GEFSLWDI K    
Sbjct: 865  VWVMNSKWSEKIEYFIIPADGSVSPGIMELKRVPKCTHLVVGHNSRGEFSLWDIAKCNCV 924

Query: 959  XXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPV 1138
                       +F P+SLF W +KG     V  E    KL EAT  W+S   +     P+
Sbjct: 925  TSFSALKSPVNEFFPISLFQWQTKGSGFSNVNIEEQADKLLEATNLWYSEQRDICWFSPI 984

Query: 1139 EGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXX 1312
            E EDVA+WLF+ T+S  DS       SS+  + +   W L L++K+ +I G+ LD     
Sbjct: 985  E-EDVAMWLFVSTTSDLDSCHNHVSTSSSYDIHTARSWRLALLMKNSIIFGSPLDLRTSG 1043

Query: 1313 XXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQ 1492
                            VY+W+L++GSK + LH  +   +  +ATDD +   + +A   G+
Sbjct: 1044 NGVSCGYGIISTSDGVVYMWELSKGSKLDTLHHFQDGNVTCVATDD-SRGALGVAGGRGE 1102

Query: 1493 LCVYLHT*DL 1522
            L +YLH  +L
Sbjct: 1103 LLLYLHDPEL 1112


>ref|XP_006599178.1| PREDICTED: uncharacterized protein LOC100802319 isoform X1 [Glycine
            max]
          Length = 1217

 Score =  322 bits (825), Expect = 4e-85
 Identities = 170/431 (39%), Positives = 248/431 (57%), Gaps = 2/431 (0%)
 Frame = +2

Query: 236  EKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTT 415
            + QEL + +  ++KF+G Y HPM +SS+ L     EI++CV CG L  + R LF Y +  
Sbjct: 786  QNQELKNNLGSSVKFVGRYLHPMPVSSLFLSTREDEIHVCVLCGYLTGQYRTLFTYKVAI 845

Query: 416  KESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKN 595
             E + G P ++ H+S+ LP  K  F +E  V++S +QLTP G+ +VL+ SI+ P CRE  
Sbjct: 846  AEPTLGCPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYVVLIGSIKTPNCREGK 905

Query: 596  LHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRI 775
            + C C  C+S C EKNA+KIV V+ GYVS+V  L T   VHC+LVCEP  L+++ ESG++
Sbjct: 906  IDCHCSTCKSVCSEKNALKIVQVEHGYVSVVTTLETVDNVHCILVCEPNRLVSVGESGKL 965

Query: 776  YIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXX 955
             +W+MNS WS + E ++IP+   +   I++LKR+PK   +VVGHN  GEFSLWDI K   
Sbjct: 966  QVWVMNSKWSEKIEYFIIPADGSVSPGIMELKRVPKCTHLVVGHNSRGEFSLWDIAKCNC 1025

Query: 956  XXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCP 1135
                        +F P+SLF W +KG     V  E    KL EAT  W+S   +     P
Sbjct: 1026 VTSFSALKSPVNEFFPISLFQWQTKGSGFSNVNIEEQADKLLEATNLWYSEQRDICWFSP 1085

Query: 1136 VEGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXX 1309
            +E EDVA+WLF+ T+S  DS       SS+  + +   W L L++K+ +I G+ LD    
Sbjct: 1086 IE-EDVAMWLFVSTTSDLDSCHNHVSTSSSYDIHTARSWRLALLMKNSIIFGSPLDLRTS 1144

Query: 1310 XXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGG 1489
                             VY+W+L++GSK + LH  +   +  +ATDD +   + +A   G
Sbjct: 1145 GNGVSCGYGIISTSDGVVYMWELSKGSKLDTLHHFQDGNVTCVATDD-SRGALGVAGGRG 1203

Query: 1490 QLCVYLHT*DL 1522
            +L +YLH  +L
Sbjct: 1204 ELLLYLHDPEL 1214


>gb|ESW28388.1| hypothetical protein PHAVU_003G282800g [Phaseolus vulgaris]
          Length = 1211

 Score =  322 bits (824), Expect = 5e-85
 Identities = 174/435 (40%), Positives = 253/435 (58%), Gaps = 6/435 (1%)
 Frame = +2

Query: 236  EKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTT 415
            + +EL   +  ++KF+GCY HPM +SS+ L     E++ICV CG L D+ R LF Y +  
Sbjct: 780  QNEELKSNLGSSVKFVGCYLHPMPVSSLFLSTKEDEVHICVLCGHLTDQYRTLFTYKVAI 839

Query: 416  KESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKN 595
             E + G P ++ H+S+ LP  K  F +E  V++S +QLTP G+ +VL+ SI+ P CRE  
Sbjct: 840  TEPTLGYPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYIVLIGSIKAPNCREGK 899

Query: 596  LHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRI 775
            + C C  C S  +EKNA+KIV V+ GYVS+V  L T   VHC+LVCEP  L+++ ESG++
Sbjct: 900  IDCSCSTCTSVFYEKNALKIVQVEHGYVSVVTTLETADNVHCILVCEPNRLVSVGESGKL 959

Query: 776  YIWIMNSTWSVQTELYVIPSYEFLPS-RIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRX 952
             +W+MNS WS +TE ++IP+ +   S  IV+LK++PKS  +VVGHN YGEFSLWDI K  
Sbjct: 960  EVWVMNSKWSEKTEHFIIPTDDGSASPGIVELKKVPKSTHLVVGHNSYGEFSLWDIAKCN 1019

Query: 953  XXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISC 1132
                         +F P+SLF W +KG        E    KL +AT SW+S+  E     
Sbjct: 1020 CVARFSAIKSPINEFFPISLFQWQTKGSGFSYASMEEQADKLLKATNSWYSQQRETSWPS 1079

Query: 1133 PVEGEDVAVWLFIRTSSDSYPQSACY-----SSNNHLISNGCWWLGLMIKDVVILGAALD 1297
            P+E E+VA+WLF+ T SD   Q  C+     SS+  + +   W L LM+K+ +  G+ L+
Sbjct: 1080 PLE-ENVAMWLFVSTYSD---QDCCHNPTSTSSSFDIHTARSWRLALMMKNSINFGSPLN 1135

Query: 1298 XXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIA 1477
                                 VY+W+L++GSK   LH  +   +  +ATD+ +   + +A
Sbjct: 1136 LRTCGIGVSSGYGIIGTTEGVVYMWELSKGSKLYTLHQFQDGNVACVATDN-SRGALGVA 1194

Query: 1478 RDGGQLCVYLHT*DL 1522
              GGQL +YLH  +L
Sbjct: 1195 -GGGQLLLYLHIPEL 1208


>ref|XP_004509752.1| PREDICTED: uncharacterized protein LOC101515165 [Cicer arietinum]
          Length = 1239

 Score =  320 bits (820), Expect = 1e-84
 Identities = 171/436 (39%), Positives = 242/436 (55%), Gaps = 5/436 (1%)
 Frame = +2

Query: 218  TSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLF 397
            T  +CE K  LN  +    KF+G Y HPM +SS++++    EI+ICV CG L  ++R LF
Sbjct: 802  TQRNCELKNNLNSNV----KFVGRYMHPMPVSSLLIRTREDEIHICVICGLLMSQQRTLF 857

Query: 398  LYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMP 577
             Y +  KES+ G P ++ H+ + LP     F RE  V+ + ++LTPDG+ +VL+ SIR P
Sbjct: 858  TYKVAIKESNFGFPSVMAHSPIILPDPNHNFIRETMVESTGVELTPDGQYIVLIGSIRTP 917

Query: 578  YCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIAL 757
             CRE  + C C  C S C EK+A+KIVHV+ GYVS++  L     VHC+LVCEP  L+++
Sbjct: 918  NCREGKIDCCCSTCTSVCSEKSALKIVHVQCGYVSLMATLEVIDDVHCILVCEPNRLVSV 977

Query: 758  DESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWD 937
             ESGR+++W+MNSTWS   E ++IP    +   IV+LK++PK A +VVG N  GEFSLWD
Sbjct: 978  GESGRLHVWVMNSTWSEMVEYFIIPPDGSMSPGIVELKKVPKCAHLVVGRNICGEFSLWD 1037

Query: 938  IVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSE 1117
            I K               +F P+SLF    K +       E    KL EAT  W S   E
Sbjct: 1038 ITKLNCVSSFSASKYPINEFSPISLFHLQRKDVGFSYASIEEKAEKLLEATKLWHSEQRE 1097

Query: 1118 AYISCPVEGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAA 1291
              +  P   +DVA+W  + T S  D        SS++ + S   W L L++++ ++ G+ 
Sbjct: 1098 TSVFLP--SQDVAIWFLVSTPSDVDCCQNHVSTSSHHDVHSARSWRLALLVENSIVFGSP 1155

Query: 1292 LDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSS--- 1462
            LD                     VY W+L+ GSK + LH  E  T+  +ATD+  S+   
Sbjct: 1156 LDPRATAIGVSGGYGISSTSDGVVYTWELSRGSKVDTLHRFEDGTVTSLATDESNSNSRG 1215

Query: 1463 VVAIARDGGQLCVYLH 1510
             V +A DGGQL +YLH
Sbjct: 1216 AVGVAGDGGQLLLYLH 1231


>ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305752 [Fragaria vesca
            subsp. vesca]
          Length = 1259

 Score =  317 bits (813), Expect = 9e-84
 Identities = 178/472 (37%), Positives = 257/472 (54%), Gaps = 1/472 (0%)
 Frame = +2

Query: 92   VRHSDGYIEKDMTGNQNSDLMYQSVDQGRDSSRLHIREEKTATSSSCE-EKQELNDEIIG 268
            V H +  ++K + GN+N      S    +              SS+ +  K+E N+ + G
Sbjct: 792  VSHLENQVDKKVVGNENLLQFIDSETSHKQGPSFSYDPNSIPFSSNTKPHKKEHNNGLAG 851

Query: 269  TMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMV 448
             ++F+GCY  P+ + SV+L      IY+ V CG L  +   LF+Y +  +E   G+  +V
Sbjct: 852  ILEFVGCYTQPVPVLSVLLSTKGRYIYVSVLCGLLVGKDVSLFIYKVAIEEPMVGHSSLV 911

Query: 449  GHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESF 628
            GHTS++LP L D +   +A+++  LQ  PDG+ LVL+D IR P+CR+   HCLC  C S 
Sbjct: 912  GHTSLTLPDLTD-YSNGMALERFCLQFIPDGQCLVLLDKIRTPFCRQGKTHCLCTTCASS 970

Query: 629  CFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSV 808
            C E++AVKIV VKLGYVS+V +L       C+LVCEP +L+++ +SGR+++W+M+STWS 
Sbjct: 971  CSEEDAVKIVQVKLGYVSLVTRLKAAQSQRCILVCEPNNLVSVGKSGRLHLWVMDSTWSA 1030

Query: 809  QTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXX 988
            Q E  V+PS + +   +VDLKRIP    ++VGHNGYGEFSLWDI K              
Sbjct: 1031 QMEYIVMPSEDCISPGVVDLKRIPNCTHLIVGHNGYGEFSLWDITKCIFVSRFSAPSGSI 1090

Query: 989  LDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLF 1168
              F+P+SLF+W            E  + ++    M+  S+   +Y     EGEDVA+ L 
Sbjct: 1091 CQFVPISLFAWQMNFHASSHFEMEEHVNQM----MASISKTLSSY-----EGEDVAICLL 1141

Query: 1169 IRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXX 1348
            +  SSDS  Q      N H    G W L LM+K++VILG ALD                 
Sbjct: 1142 V-LSSDSDAQHDYELGNCHPNPVGRWRLALMVKNIVILGTALDSRASVIGASAGQGICGT 1200

Query: 1349 XXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVY 1504
                VY W+L+ G+K   +H  +G ++  I+ DD  S  VAIA D  Q+ VY
Sbjct: 1201 CDGLVYTWELSSGTKLGTMHHFKGGSVSCISNDDSRSGAVAIAGD-NQVLVY 1251


>ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297339249|gb|EFH69666.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1194

 Score =  310 bits (795), Expect = 1e-81
 Identities = 170/447 (38%), Positives = 245/447 (54%)
 Frame = +2

Query: 170  QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIY 349
            +     R  ++E   +++       ++N+E+  T++ +GCY HPM +SSV+LK   +EIY
Sbjct: 755  ENTSEKRTSVQEFPASSNLEINRDVKINNEMGKTVELLGCYFHPMPVSSVLLKSAGNEIY 814

Query: 350  ICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQL 529
            ICV   A EDR R LF+Y ++ K  S+G P ++GHT   LP + D+ G    ++ S L  
Sbjct: 815  ICVLSFATEDRVRTLFMYKMSAKAPSKGFPSIIGHTPAILPIVDDKSGGNRTLEISNLHF 874

Query: 530  TPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTF 709
            TPDG  L+L+ +I+ PYCR++   C C  C S CFE+NAV+IV VK G+VS+V KL    
Sbjct: 875  TPDGLHLILIGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADD 934

Query: 710  PVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSA 889
             V CV+VC+P +LIA  +SG + +W MNS WS  TE  VI +   + S I++LK+IPK  
Sbjct: 935  SVQCVVVCDPNNLIAAVKSGNLIVWAMNSHWSGSTEESVILANPCISSCIMELKKIPKCP 994

Query: 890  SMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCM 1069
             +V+GHNG GEF++WDI KR              +F+P SLF+W            E  +
Sbjct: 995  HLVIGHNGIGEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDHV 1051

Query: 1070 RKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWW 1249
              +  AT  WFS+        P E +D A+WL + T  +S  +     S        CW 
Sbjct: 1052 DMILAATKLWFSKGINNKTLVPAEVKDTAIWLLVSTDLESDAKCDRVESPAR-----CWR 1106

Query: 1250 LGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTI 1429
            L L++K+ +ILG  LD                     VY+WDL+ G+K  +LHD +G  +
Sbjct: 1107 LALLVKNQLILGNQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGAKLGSLHDFKGQRV 1166

Query: 1430 LRIATDDLTSSVVAIARDGGQLCVYLH 1510
              I+TDD  S  + IA + GQL VY H
Sbjct: 1167 SCISTDD--SRNICIASEDGQLLVYCH 1191


>ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutrema salsugineum]
            gi|557093683|gb|ESQ34265.1| hypothetical protein
            EUTSA_v10006590mg [Eutrema salsugineum]
          Length = 1207

 Score =  309 bits (791), Expect = 3e-81
 Identities = 165/422 (39%), Positives = 240/422 (56%)
 Frame = +2

Query: 245  ELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKES 424
            ++N+E+  T++ +G Y HPM +S+V L+   +EIYICV   A EDR   LF+Y ++ K  
Sbjct: 786  KINNEMEKTVELLGYYFHPMPVSTVSLQYVGNEIYICVLSFATEDRVSTLFMYKISAKSP 845

Query: 425  SQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHC 604
            ++G P +VGHT   LP + D+ GR   +++S L  TPDG+ L+   +I+ PYCR++ + C
Sbjct: 846  TRGFPSVVGHTPAILPIVDDKSGRNRTLERSYLHFTPDGQHLIFTGNIKTPYCRQREIDC 905

Query: 605  LCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIW 784
            LC  C S  FE+NAV+IV VK GYVS+V KL     V CV+VC+P +LIA+ +SG +  W
Sbjct: 906  LCLTCTSASFEENAVRIVEVKAGYVSLVTKLQAVDSVQCVVVCDPNYLIAVVKSGNLIAW 965

Query: 785  IMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXX 964
             MNS W   TE +VI +   + S IV+LK+IPK   +++GHNG GEF++WDI KR     
Sbjct: 966  AMNSDWRGSTEEFVILANPCISSCIVELKKIPKCPHLIIGHNGIGEFTIWDISKRSLVSR 1025

Query: 965  XXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEG 1144
                     +F+P SLF+W +     +    E  +  +  AT  WFS+        P E 
Sbjct: 1026 FVSPSNLIFEFIPTSLFAWHT---VHNHSTIEDHVDVILAATKLWFSKGVNNKTLVPAEV 1082

Query: 1145 EDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXX 1324
            ED A+WL + T  D  P + C    +      CW L L++++ VILG+ LD         
Sbjct: 1083 EDTAIWLLVSTDPD--PDAICDRVES---PARCWRLALLVRNQVILGSQLDPRADVAGTV 1137

Query: 1325 XXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVY 1504
                        VY+WDL+ G+K  +LHD +G  +  I++DD  S  + IA + GQL VY
Sbjct: 1138 SGHGVAGTLDGHVYMWDLSTGTKLGSLHDFKGQGVSCISSDD--SGNICIASEDGQLLVY 1195

Query: 1505 LH 1510
             H
Sbjct: 1196 CH 1197


>gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thaliana]
          Length = 1196

 Score =  306 bits (785), Expect = 2e-80
 Identities = 168/438 (38%), Positives = 244/438 (55%)
 Frame = +2

Query: 197  IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376
            ++E   +++       ++N+E+  T++ +GCY HPM +SSV+L+   +EIYI V   A E
Sbjct: 766  VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 825

Query: 377  DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556
            DR R LF+Y ++ +  S+G P ++GHT   LP + D+      ++ S L  TPDG  L+L
Sbjct: 826  DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 885

Query: 557  VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736
              +I+ PYCR++   C C  C S CFE+NAV+IV VK G+VS+V KL     V CV+VC+
Sbjct: 886  TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 945

Query: 737  PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916
            P +LIA  +SG + +W MNS WS  TE YVI +   + S I++LK+IPK   +V+GHNG 
Sbjct: 946  PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 1005

Query: 917  GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096
            GEF++WDI KR              +F+P SLF+W            E  +  +  AT  
Sbjct: 1006 GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 1062

Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276
            WFS+        P E +D A+WL + T  DS   + C    + +    CW L L++KD +
Sbjct: 1063 WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 1117

Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456
            ILG+ LD                     VY+WDL+ G+K  +LHD +G  +  I+TDD  
Sbjct: 1118 ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 1175

Query: 1457 SSVVAIARDGGQLCVYLH 1510
            S  + IA + GQL VY H
Sbjct: 1176 SRNICIASEDGQLLVYCH 1193


>gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis thaliana]
          Length = 554

 Score =  306 bits (785), Expect = 2e-80
 Identities = 168/438 (38%), Positives = 244/438 (55%)
 Frame = +2

Query: 197  IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376
            ++E   +++       ++N+E+  T++ +GCY HPM +SSV+L+   +EIYI V   A E
Sbjct: 124  VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 183

Query: 377  DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556
            DR R LF+Y ++ +  S+G P ++GHT   LP + D+      ++ S L  TPDG  L+L
Sbjct: 184  DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 243

Query: 557  VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736
              +I+ PYCR++   C C  C S CFE+NAV+IV VK G+VS+V KL     V CV+VC+
Sbjct: 244  TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 303

Query: 737  PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916
            P +LIA  +SG + +W MNS WS  TE YVI +   + S I++LK+IPK   +V+GHNG 
Sbjct: 304  PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 363

Query: 917  GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096
            GEF++WDI KR              +F+P SLF+W            E  +  +  AT  
Sbjct: 364  GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 420

Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276
            WFS+        P E +D A+WL + T  DS   + C    + +    CW L L++KD +
Sbjct: 421  WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 475

Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456
            ILG+ LD                     VY+WDL+ G+K  +LHD +G  +  I+TDD  
Sbjct: 476  ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 533

Query: 1457 SSVVAIARDGGQLCVYLH 1510
            S  + IA + GQL VY H
Sbjct: 534  SRNICIASEDGQLLVYCH 551


>ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana]
            gi|332192557|gb|AEE30678.1| DNA binding protein
            [Arabidopsis thaliana]
          Length = 1194

 Score =  306 bits (785), Expect = 2e-80
 Identities = 168/438 (38%), Positives = 244/438 (55%)
 Frame = +2

Query: 197  IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376
            ++E   +++       ++N+E+  T++ +GCY HPM +SSV+L+   +EIYI V   A E
Sbjct: 764  VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 823

Query: 377  DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556
            DR R LF+Y ++ +  S+G P ++GHT   LP + D+      ++ S L  TPDG  L+L
Sbjct: 824  DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 883

Query: 557  VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736
              +I+ PYCR++   C C  C S CFE+NAV+IV VK G+VS+V KL     V CV+VC+
Sbjct: 884  TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 943

Query: 737  PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916
            P +LIA  +SG + +W MNS WS  TE YVI +   + S I++LK+IPK   +V+GHNG 
Sbjct: 944  PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 1003

Query: 917  GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096
            GEF++WDI KR              +F+P SLF+W            E  +  +  AT  
Sbjct: 1004 GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 1060

Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276
            WFS+        P E +D A+WL + T  DS   + C    + +    CW L L++KD +
Sbjct: 1061 WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 1115

Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456
            ILG+ LD                     VY+WDL+ G+K  +LHD +G  +  I+TDD  
Sbjct: 1116 ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 1173

Query: 1457 SSVVAIARDGGQLCVYLH 1510
            S  + IA + GQL VY H
Sbjct: 1174 SRNICIASEDGQLLVYCH 1191


Top