BLASTX nr result

ID: Zingiber23_contig00020554 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00020554
         (1210 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   167   1e-38
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   160   7e-37
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   160   7e-37
ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citr...   160   7e-37
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   157   6e-36
emb|CBI21104.3| unnamed protein product [Vitis vinifera]              154   9e-35
ref|XP_002460648.1| hypothetical protein SORBIDRAFT_02g032470 [S...   152   2e-34
ref|XP_006660963.1| PREDICTED: uncharacterized protein LOC102721...   150   1e-33
ref|XP_006660962.1| PREDICTED: uncharacterized protein LOC102721...   150   1e-33
gb|EEE70205.1| hypothetical protein OsJ_30300 [Oryza sativa Japo...   149   2e-33
gb|EEC85039.1| hypothetical protein OsI_32352 [Oryza sativa Indi...   149   3e-33
tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea...   148   4e-33
ref|XP_004957609.1| PREDICTED: uncharacterized protein LOC101761...   147   7e-33
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   145   3e-32
gb|EOY29408.1| Uncharacterized protein isoform 9 [Theobroma cacao]    144   6e-32
gb|EOY29407.1| Uncharacterized protein isoform 8, partial [Theob...   144   6e-32
gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]    144   6e-32
gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma caca...   144   6e-32
tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea...   142   4e-31
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   137   1e-29

>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  167 bits (422), Expect = 1e-38
 Identities = 113/325 (34%), Positives = 164/325 (50%), Gaps = 27/325 (8%)
 Frame = +1

Query: 316  VVCGNXXXXXXXXT-DGDQKPAKIISLASILKRARKCNLTETSDTAVSHHSETSEDAKN- 489
            +VCGN        + +G QK AK++SL+SIL+RA++C   E  +   S  SET     N 
Sbjct: 1385 IVCGNLGIIANVNSAEGLQKAAKVVSLSSILRRAKRCT-NENQEMRFSSMSETQNKFSNR 1443

Query: 490  SAIFHRLEESCECLR-KNGEDLSSPSTAKAFHSGNNTGIRCHLHSMQSISNLRCKDICTC 666
            S   H    +   ++ K G D    S A  F     + I+ H    Q+ + +  K++   
Sbjct: 1444 SQGCHTTPCAASRVKDKEGHDSVETSAADWF-----SAIQMH----QTANAV--KEVRKY 1492

Query: 667  SPGKLAAKFRHHAKSTCSS---------TTEINECSKLTMAKDQL-------------NC 780
            S  +L  K +H  K  C +         + E N C +     D+L             +C
Sbjct: 1493 SLNELTQKGKHANKQACLNHLSRQEHLQSREKNLCPRSATQNDKLVDNLNEKQSRTPNSC 1552

Query: 781  SPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRS--LLNPD 954
            +      ++    +  ++  LE    T     P   +++    K S   R R   +L+ D
Sbjct: 1553 TRKNSICMQRSVFRTSEKLCLENVKET---QGPIDVSHEVKGKKSSTKCRKRKAFILDSD 1609

Query: 955  AFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCG 1134
             FCCVCG S++ D N +LEC  CLIKVHQACYGV K PKG WCCRPC+ + +DIVCVLCG
Sbjct: 1610 VFCCVCGGSDKDDFNCILECSQCLIKVHQACYGVLKAPKGRWCCRPCRADIKDIVCVLCG 1669

Query: 1135 YGDGAMTRAVKCQNIIKSLLKAWKV 1209
            Y  GAMTRA++ +NI+K+LL+ WK+
Sbjct: 1670 YSGGAMTRALRSRNIVKNLLQTWKI 1694


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  160 bits (406), Expect = 7e-37
 Identities = 111/316 (35%), Positives = 152/316 (48%), Gaps = 18/316 (5%)
 Frame = +1

Query: 316  VVCGNXXXXXXXXTDGDQKPAKIISLASILKRARKCNLTETSDTAVSHHSETSE------ 477
            VVCG              +PAKI+ L+ ILK +R+  L  T D+  +   E  +      
Sbjct: 1393 VVCGKYGEICNELIGDVSRPAKIVPLSRILKTSRRDTLPNTCDSKQTFPDELKKAIFCGS 1452

Query: 478  DAKNSAIFHRLEE------SCECLRKNGEDLSSPSTAKAFHSG---NNTGIRCHLH--SM 624
            DA  +   +  EE      S  C   N  DLS     K F +G    N+ +   L   S 
Sbjct: 1453 DAGYNGFSNLKEEKSAIHHSSICNEMN-VDLSLEEDEKMFTNGVDEENSMLEKKLDHKSK 1511

Query: 625  QSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGL 804
            ++ S L  K      P     + R   + T +     +E   L        C P    G 
Sbjct: 1512 KNCSKLNRKVFTKSKPKSKEIRKRSLCELTDNGKKSTSESFSLVKIS---KCMPKMEAG- 1567

Query: 805  EDQDNKLHQQKILEPASPTAIGSFP-FPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSS 981
                            S  A+GS      +++ ++ KL+   RS  +++ DAFCCVCG S
Sbjct: 1568 --------------KVSKNAVGSKQNIRASSEVNSEKLNPEHRSLYVMDSDAFCCVCGGS 1613

Query: 982  NQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRA 1161
            N+ + N L+EC  C IKVHQACYGVSK+PKG+W CRPC+ NS+DIVCVLCGYG GAMT A
Sbjct: 1614 NKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYGGGAMTCA 1673

Query: 1162 VKCQNIIKSLLKAWKV 1209
            ++ + I+K LLKAW +
Sbjct: 1674 LRSRTIVKGLLKAWNI 1689


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  160 bits (406), Expect = 7e-37
 Identities = 111/316 (35%), Positives = 152/316 (48%), Gaps = 18/316 (5%)
 Frame = +1

Query: 316  VVCGNXXXXXXXXTDGDQKPAKIISLASILKRARKCNLTETSDTAVSHHSETSE------ 477
            VVCG              +PAKI+ L+ ILK +R+  L  T D+  +   E  +      
Sbjct: 1394 VVCGKYGEICNELIGDVSRPAKIVPLSRILKTSRRDTLPNTCDSKQTFPDELKKAIFCGS 1453

Query: 478  DAKNSAIFHRLEE------SCECLRKNGEDLSSPSTAKAFHSG---NNTGIRCHLH--SM 624
            DA  +   +  EE      S  C   N  DLS     K F +G    N+ +   L   S 
Sbjct: 1454 DAGYNGFSNLKEEKSAIHHSSICNEMN-VDLSLEEDEKMFTNGVDEENSMLEKKLDHKSK 1512

Query: 625  QSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGL 804
            ++ S L  K      P     + R   + T +     +E   L        C P    G 
Sbjct: 1513 KNCSKLNRKVFTKSKPKSKEIRKRSLCELTDNGKKSTSESFSLVKIS---KCMPKMEAG- 1568

Query: 805  EDQDNKLHQQKILEPASPTAIGSFP-FPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSS 981
                            S  A+GS      +++ ++ KL+   RS  +++ DAFCCVCG S
Sbjct: 1569 --------------KVSKNAVGSKQNIRASSEVNSEKLNPEHRSLYVMDSDAFCCVCGGS 1614

Query: 982  NQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRA 1161
            N+ + N L+EC  C IKVHQACYGVSK+PKG+W CRPC+ NS+DIVCVLCGYG GAMT A
Sbjct: 1615 NKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYGGGAMTCA 1674

Query: 1162 VKCQNIIKSLLKAWKV 1209
            ++ + I+K LLKAW +
Sbjct: 1675 LRSRTIVKGLLKAWNI 1690


>ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citrus clementina]
            gi|557553575|gb|ESR63589.1| hypothetical protein
            CICLE_v10010421mg [Citrus clementina]
          Length = 765

 Score =  160 bits (406), Expect = 7e-37
 Identities = 111/316 (35%), Positives = 152/316 (48%), Gaps = 18/316 (5%)
 Frame = +1

Query: 316  VVCGNXXXXXXXXTDGDQKPAKIISLASILKRARKCNLTETSDTAVSHHSETSE------ 477
            VVCG              +PAKI+ L+ ILK +R+  L  T D+  +   E  +      
Sbjct: 39   VVCGKYGEICNELIGDVSRPAKIVPLSRILKTSRRDTLPNTCDSKQTFPDELKKTIFCGS 98

Query: 478  DAKNSAIFHRLEE------SCECLRKNGEDLSSPSTAKAFHSG---NNTGIRCHLH--SM 624
            DA  +   +  EE      S  C   N  DLS     K F +G    N+ +   L   S 
Sbjct: 99   DAGYNGFSNLKEEKSAIHHSSICNEMN-VDLSLEEDEKMFTNGFDEENSMLEKKLDHKSK 157

Query: 625  QSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGL 804
            ++ S L  K      P     + R   + T +     +E   L        C P    G 
Sbjct: 158  KNCSKLNRKVFTKSKPKSKEIRKRSLCELTDNGKKSTSESFSLVKIS---KCMPKMEAG- 213

Query: 805  EDQDNKLHQQKILEPASPTAIGSFP-FPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSS 981
                            S  A+GS      +++ ++ KL+   RS  +++ DAFCCVCG S
Sbjct: 214  --------------KVSKNAVGSKQNIRASSEVNSEKLNPEHRSLYVMDSDAFCCVCGGS 259

Query: 982  NQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRA 1161
            N+ + N L+EC  C IKVHQACYGVSK+PKG+W CRPC+ NS+DIVCVLCGYG GAMT A
Sbjct: 260  NKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYGGGAMTCA 319

Query: 1162 VKCQNIIKSLLKAWKV 1209
            ++ + I+K LLKAW +
Sbjct: 320  LRSRTIVKGLLKAWNI 335


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  157 bits (398), Expect = 6e-36
 Identities = 110/333 (33%), Positives = 162/333 (48%), Gaps = 15/333 (4%)
 Frame = +1

Query: 256  SLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQ-KPAKIISLASILKRARKCNL- 429
            SLNC AN             +VCG           G+  KPAKI+ L+ +L  AR+C L 
Sbjct: 1325 SLNCQANTRHCKSKP-----IVCGKYGELSDGELVGNMSKPAKIVPLSRVLMLARRCTLP 1379

Query: 430  -----TETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPSTAKAFHSGNN 594
                 T TS   +  HS+ ++       FHRL        +  ++  S   A +    N 
Sbjct: 1380 KNEKRTFTSIRGMKTHSDGADG------FHRL--------RTEKESRSHDAAVSGKLNNE 1425

Query: 595  TGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKD-- 768
            T +    +      +   +D+       +    RH  +  C     I      + +K+  
Sbjct: 1426 TFLEIMKNRCSGRDDKFAEDL------SMLEIERHENEKACGKEDSIAHARLKSRSKEIR 1479

Query: 769  QLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSR---- 936
            + +    A+ G    +  L   K  + +   + G+     N +D    L +V++      
Sbjct: 1480 KRSIYELAVDGEAPHNKTLSLSKASKCSPEVSKGTIL--GNGEDGTHGLCEVAQKSPDQI 1537

Query: 937  --SLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQ 1110
              SL   ++FCCVCGSS++ DTN LLEC+ CLIKVHQACYGVS+ PKG+W CRPC+ +S+
Sbjct: 1538 WSSLPVSESFCCVCGSSDKDDTNNLLECNICLIKVHQACYGVSRAPKGHWYCRPCRTSSR 1597

Query: 1111 DIVCVLCGYGDGAMTRAVKCQNIIKSLLKAWKV 1209
            +IVCVLCGYG GAMTRA++ + I+KSLL+ W V
Sbjct: 1598 NIVCVLCGYGGGAMTRALRSRTIVKSLLRVWNV 1630


>emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  154 bits (388), Expect = 9e-35
 Identities = 134/428 (31%), Positives = 197/428 (46%), Gaps = 40/428 (9%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKD----DDKQLQN----DDISLRRL---KRVG----- 177
            KR+RS LS  K   R    D +  D    D  Q Q+    + +S+  +   KR+G     
Sbjct: 319  KRRRSTLSSAKNFSRKRDVDKIYADREGEDGYQAQSKGKTEFLSIHEVSGAKRIGPDRTA 378

Query: 178  EKMKQGLVACSKHESRSGTAKPPKFMSLNCI--ANXXXXXXXXXXXXXVVCGNXXXXXXX 351
            E  +Q    C +  S +   K  K+ S+ C+  ++             VVCG        
Sbjct: 379  EAFRQ---FCMQEPSHT---KAVKYNSVGCVKESSCLKLDVSNRREKPVVCGKYGVISNG 432

Query: 352  XTDGD-QKPAKIISLASILKRARKCNLTETSD---TAVSHHSETSEDAKNSAIF------ 501
                D  KPAKI SL+ +LK AR+C L+   +   T++    +      N  +       
Sbjct: 433  KLAIDVPKPAKIFSLSRVLKTARRCTLSANDEPRLTSMRQLKKARLRGSNGCVNEISNLM 492

Query: 502  ----HRLEESCECLRKNGEDLSSPSTAKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCS 669
                + ++ +  C  +N  D S     KA  SG+       L S Q  +    KD    S
Sbjct: 493  KEKENEIQNATRCDERN-PDNSMEEAEKAVISGDTRCADELLMSKQEKAYGSKKDDSYHS 551

Query: 670  PGKLAAKFRHHAKSTC-------SSTTEINECSKLTMAKDQLNCSPSAICGLEDQDNKLH 828
              +L  K++   K +         S +  N   K+     Q     S   GLE+ ++  H
Sbjct: 552  T-RLKRKYKEIRKRSLYELTGKGKSPSSGNAFVKIPKHAPQ---KKSGSVGLENAEDSKH 607

Query: 829  QQKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRSLLNP-DAFCCVCGSSNQGDTNQL 1005
                               ++ + ++ K  +  R  S ++  DAFCCVCGSSN+ + N L
Sbjct: 608  SMS----------------ESYKVNSKKSIKEHRFESFISDTDAFCCVCGSSNKDEINCL 651

Query: 1006 LECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIK 1185
            LEC  CLI+VHQACYGVS++PKG W CRPC+ +S++IVCVLCGYG GAMTRA++ +NI+K
Sbjct: 652  LECSRCLIRVHQACYGVSRVPKGRWYCRPCRTSSKNIVCVLCGYGGGAMTRALRTRNIVK 711

Query: 1186 SLLKAWKV 1209
            SLLK W +
Sbjct: 712  SLLKVWNI 719


>ref|XP_002460648.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
            gi|241924025|gb|EER97169.1| hypothetical protein
            SORBIDRAFT_02g032470 [Sorghum bicolor]
          Length = 1658

 Score =  152 bits (385), Expect = 2e-34
 Identities = 114/396 (28%), Positives = 185/396 (46%), Gaps = 12/396 (3%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI-----SLRRLKRVGEKMKQGLVACS 210
            KRK  ++  NK  +R+  Q+   + D++     +      S  R ++V +          
Sbjct: 909  KRKHPIMHLNKPVKRLHSQNNFFESDEQPDAKGNFLGGLNSSDRKRQVEDMSTPDRTKHH 968

Query: 211  KHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKIIS 390
            +  SR+   K PK++SLNCI N                 +         D  + P KI+ 
Sbjct: 969  QEGSRAFVRKLPKYVSLNCIVNEPNTNSEGTCSGSGGIDSSLIATGITNDNRKSP-KIVP 1027

Query: 391  LASILKRARKCNLTETSDTAVSH-HSETSEDAKNSAIFHRLE----ESCECLRKNGEDLS 555
            L+ +LK+A++C+  +   T  +H + E S D   ++  + ++    +   C  +   +L 
Sbjct: 1028 LSLVLKKAKRCHAVKLCKTESTHLYEEKSSDCSVNSSDYSIDKYSVDDENCSPQAEYELQ 1087

Query: 556  SPSTAKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEI 735
                ++  +S N+  +R H+   +  S++  +D     P  L     +H   + SS    
Sbjct: 1088 DYKRSR--YSSND--LRSHVAHRKRTSSVIGED----GPLGLTDVETNHLSISSSSNGTK 1139

Query: 736  NECSKLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKL 915
            N  + +++                    ++ + K     S    GS      ++D+A   
Sbjct: 1140 NRRTSVSL-------------------TRIRRHKKFRSKSTCYSGS------DKDNAALA 1174

Query: 916  SQVSRSR--SLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCR 1089
             +V+ +R    LN DA CCVC  S+    N+L+EC  C IKVHQACYGV K+P+G W CR
Sbjct: 1175 QEVNATRYSGRLNSDASCCVCAISDLEPCNRLIECSKCYIKVHQACYGVLKVPRGQWFCR 1234

Query: 1090 PCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLLK 1197
            PCK N+ + VCVLCGYG GAMTRA+K +NI+KSLLK
Sbjct: 1235 PCKANTMNTVCVLCGYGGGAMTRALKTKNILKSLLK 1270


>ref|XP_006660963.1| PREDICTED: uncharacterized protein LOC102721579 isoform X2 [Oryza
            brachyantha]
          Length = 1706

 Score =  150 bits (378), Expect = 1e-33
 Identities = 119/395 (30%), Positives = 172/395 (43%), Gaps = 12/395 (3%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI------SLRRLKRVGEKMKQGLVAC 207
            KRK   +  NK  + +     V   DD++  N  I      S  R K+  +        C
Sbjct: 947  KRKHPPMRLNKHVKWLHKNYKVLDVDDERSDNKGILVGESNSSDREKQEDDVTTSARTKC 1006

Query: 208  SKHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKII 387
             +  SR    K PK++SLN I N             +   +        T+ ++K  KI+
Sbjct: 1007 QQQGSRLFARKLPKYVSLNGIVNEPNSEDACSGSASI---DSSLIATGITNDNRKSPKIV 1063

Query: 388  SLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
             L+ ILK+A++C   ++     + H+  SE+  +     +   S        E L SP  
Sbjct: 1064 PLSLILKKAKRCRTVKS--LGKTEHAHFSEEKSSDCSVDKSSSSNRSFSSQDE-LWSPKN 1120

Query: 568  AKAFHSGNNTGIRCH----LHSMQSISNLRCKDICTCSPGKLAAKFRHHAKS--TCSSTT 729
             +   + +   ++       H ++    L   DI T    +L+A      K+   C S  
Sbjct: 1121 NRYSCNASRPHVKSDHQNPCHVLEEDELLSLADIGT---SQLSASRSRGIKTRRACISLN 1177

Query: 730  EINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAG 909
             +  C + T      N S  + CG     +K    ++ E                     
Sbjct: 1178 RMERCEEFT------NESACSSCG-----DKHSAVQVCE--------------------A 1206

Query: 910  KLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCR 1089
            K  + ++  SL   DA CCVCG SN    NQL+EC  C IKVHQACYGV K+P+G W CR
Sbjct: 1207 KFERYAQRPSL---DASCCVCGISNLEPCNQLIECSKCFIKVHQACYGVLKVPRGQWFCR 1263

Query: 1090 PCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLL 1194
            PCK N  D VCV+CGYG GAMTRA+K +NI+KSLL
Sbjct: 1264 PCKINIHDTVCVICGYGGGAMTRALKAKNILKSLL 1298


>ref|XP_006660962.1| PREDICTED: uncharacterized protein LOC102721579 isoform X1 [Oryza
            brachyantha]
          Length = 1730

 Score =  150 bits (378), Expect = 1e-33
 Identities = 119/395 (30%), Positives = 172/395 (43%), Gaps = 12/395 (3%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI------SLRRLKRVGEKMKQGLVAC 207
            KRK   +  NK  + +     V   DD++  N  I      S  R K+  +        C
Sbjct: 971  KRKHPPMRLNKHVKWLHKNYKVLDVDDERSDNKGILVGESNSSDREKQEDDVTTSARTKC 1030

Query: 208  SKHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKII 387
             +  SR    K PK++SLN I N             +   +        T+ ++K  KI+
Sbjct: 1031 QQQGSRLFARKLPKYVSLNGIVNEPNSEDACSGSASI---DSSLIATGITNDNRKSPKIV 1087

Query: 388  SLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
             L+ ILK+A++C   ++     + H+  SE+  +     +   S        E L SP  
Sbjct: 1088 PLSLILKKAKRCRTVKS--LGKTEHAHFSEEKSSDCSVDKSSSSNRSFSSQDE-LWSPKN 1144

Query: 568  AKAFHSGNNTGIRCH----LHSMQSISNLRCKDICTCSPGKLAAKFRHHAKS--TCSSTT 729
             +   + +   ++       H ++    L   DI T    +L+A      K+   C S  
Sbjct: 1145 NRYSCNASRPHVKSDHQNPCHVLEEDELLSLADIGT---SQLSASRSRGIKTRRACISLN 1201

Query: 730  EINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAG 909
             +  C + T      N S  + CG     +K    ++ E                     
Sbjct: 1202 RMERCEEFT------NESACSSCG-----DKHSAVQVCE--------------------A 1230

Query: 910  KLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCR 1089
            K  + ++  SL   DA CCVCG SN    NQL+EC  C IKVHQACYGV K+P+G W CR
Sbjct: 1231 KFERYAQRPSL---DASCCVCGISNLEPCNQLIECSKCFIKVHQACYGVLKVPRGQWFCR 1287

Query: 1090 PCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLL 1194
            PCK N  D VCV+CGYG GAMTRA+K +NI+KSLL
Sbjct: 1288 PCKINIHDTVCVICGYGGGAMTRALKAKNILKSLL 1322


>gb|EEE70205.1| hypothetical protein OsJ_30300 [Oryza sativa Japonica Group]
          Length = 1792

 Score =  149 bits (377), Expect = 2e-33
 Identities = 116/390 (29%), Positives = 162/390 (41%), Gaps = 6/390 (1%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQ------DGVPKDDDKQLQNDDISLRRLKRVGEKMKQGLVAC 207
            KRK      NK  +R+         D    DD+     +  S  R K+           C
Sbjct: 1061 KRKHPPTHLNKHVKRLHSNCKVLNVDNERSDDEGIYVGESNSSDRKKQEDNMTTLDRTKC 1120

Query: 208  SKHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKII 387
             +  SR    K PK++SLNCI N             +   +        T+ ++K  KI+
Sbjct: 1121 QQQGSRLLVRKLPKYVSLNCIVNETNSEDACSGSASI---DSSLIATGITNDNRKSPKIV 1177

Query: 388  SLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
             L  ILK+A++C+  +      + H    + +  SA     ++S    R         S 
Sbjct: 1178 PLNLILKKAKRCHAIKPLSKTENIHFSEEKSSDGSA-----DKSSSGDRSFSPQDELWSP 1232

Query: 568  AKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECS 747
             K  +S N +                                R H K+ C S   + E  
Sbjct: 1233 KKNRYSSNVS--------------------------------RPHVKTDCQSPCCVLE-- 1258

Query: 748  KLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQVS 927
                               ED+   L      + ++  + GS      NQ     L+++ 
Sbjct: 1259 -------------------EDEPLSLADMGTSQLSASRSRGS-----KNQRACISLNRME 1294

Query: 928  RSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNS 1107
            R     + DA CCVCG SN   +NQL+EC  C IKVHQACYGV K+P+G W C+PCK N+
Sbjct: 1295 RYIQRPSLDASCCVCGISNLEPSNQLIECSKCFIKVHQACYGVLKVPRGQWFCKPCKINT 1354

Query: 1108 QDIVCVLCGYGDGAMTRAVKCQNIIKSLLK 1197
            QD VCVLCGYG GAMTRA+K QNI+KSLL+
Sbjct: 1355 QDTVCVLCGYGGGAMTRALKAQNILKSLLR 1384


>gb|EEC85039.1| hypothetical protein OsI_32352 [Oryza sativa Indica Group]
          Length = 1741

 Score =  149 bits (375), Expect = 3e-33
 Identities = 119/390 (30%), Positives = 162/390 (41%), Gaps = 6/390 (1%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQ------DGVPKDDDKQLQNDDISLRRLKRVGEKMKQGLVAC 207
            KRK      NK  +R+         D    DD+     +  S  R K+           C
Sbjct: 1010 KRKHPPTHLNKHVKRLHSNCKVLNVDNERSDDEGIYVGESNSSDRKKQEDNTTTLDRTKC 1069

Query: 208  SKHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKII 387
             +  SR    K PK++SLNCI N             +   +        T+ ++K  KI+
Sbjct: 1070 QQQGSRLLVRKLPKYVSLNCIVNETNSEDACSGSASI---DSSLIATGITNDNRKSPKIV 1126

Query: 388  SLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
             L  ILK+A++C+       A+   S+T          H  EE       +G    S S 
Sbjct: 1127 PLNLILKKAKRCH-------AIKPLSKTEN-------IHFSEEKSS----DGSTDKSSSG 1168

Query: 568  AKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECS 747
             ++F   +            ++S    K  C               +S C    E    S
Sbjct: 1169 DRSFSPQDELWSPKKNRYSSNVSRPHVKTDC---------------QSPCCVLEEDEPLS 1213

Query: 748  KLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQVS 927
               M   QL+ S S   G++                            NQ     L+++ 
Sbjct: 1214 LADMGTSQLSASRSR--GIK----------------------------NQRACISLNRME 1243

Query: 928  RSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNS 1107
            R     + DA CCVCG SN   +NQL+EC  C IKVHQACYGV K+P+G W C+PCK N+
Sbjct: 1244 RYIQRPSLDASCCVCGISNLEPSNQLIECSKCFIKVHQACYGVLKVPRGQWFCKPCKINT 1303

Query: 1108 QDIVCVLCGYGDGAMTRAVKCQNIIKSLLK 1197
            QD VCVLCGYG GAMTRA+K QNI+KSLL+
Sbjct: 1304 QDTVCVLCGYGGGAMTRALKAQNILKSLLR 1333


>tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1591

 Score =  148 bits (374), Expect = 4e-33
 Identities = 115/394 (29%), Positives = 186/394 (47%), Gaps = 10/394 (2%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI-----SLRRLKRVGEKMKQGLVACS 210
            KRK  ++  NK  +++  Q    + D++     +      S  R ++V +          
Sbjct: 845  KRKHPIMHLNKHVKQLHRQTKFFEGDEQPDAKGNFLGGLDSYDRKRQVEDMSTLDKTRHH 904

Query: 211  KHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKIIS 390
            +  SR+   K PK++SLNCI N                 +         D  + P KI+ 
Sbjct: 905  QEGSRAFVRKLPKYVSLNCIVNEPNTNSEGACSGSGGIDSSLIATGITNDNRKSP-KIVP 963

Query: 391  LASILKRARKCNLTETSDTAVSH-HSETSEDAKNSAIFHRLEESC----ECLRKNGEDLS 555
            L  +LK+A++CN  +   T  +H + E S D   ++  + +E+       C  +   +L 
Sbjct: 964  LNLVLKKAKRCNAVKLRKTESTHLYEEKSSDCSVNSSDYSIEKYSVDDENCSPQAEYELQ 1023

Query: 556  SPSTAKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEI 735
                ++  +S N+  +R H+   +  S +  +D    S G    +    + S+ S+ T+ 
Sbjct: 1024 DSKRSR--YSSND--LRSHVALHKRTSGVIGEDD---SLGLTDVEINCLSISSSSNGTK- 1075

Query: 736  NECSKLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKL 915
            N  + +++A+ +   S S      D+DN +   ++                N + ++G+L
Sbjct: 1076 NRRTSVSLARIKKFGSKSVCYSGSDKDNAVLAHEV----------------NARRYSGRL 1119

Query: 916  SQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPC 1095
            S  S           CCVCG S+    N+L+EC  C IKVHQACYGV K+P+G W CRPC
Sbjct: 1120 SSNSP----------CCVCGISDLEPCNRLIECSKCYIKVHQACYGVLKVPRGQWFCRPC 1169

Query: 1096 KCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLLK 1197
            K N+ D VCVLCGYG GAMTRA+K +NI+KSLL+
Sbjct: 1170 KNNTMDTVCVLCGYGGGAMTRALKTKNILKSLLQ 1203


>ref|XP_004957609.1| PREDICTED: uncharacterized protein LOC101761429 [Setaria italica]
          Length = 1886

 Score =  147 bits (372), Expect = 7e-33
 Identities = 118/393 (30%), Positives = 177/393 (45%), Gaps = 8/393 (2%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI------SLRRLKRVGEKMKQGLVAC 207
            KRK   +  NK  +++  Q+ V K D K             S  R K+V E    G    
Sbjct: 1137 KRKHPTMQLNKPVKQLHSQNKVFKGDGKLPDTKGNFFGGLDSFDRKKQV-EDTTPGRTKH 1195

Query: 208  SKHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKII 387
             +  SR+   K PK++SLNCI N             +   +         + ++K  KI+
Sbjct: 1196 HQEGSRAFVRKLPKYVSLNCIVNEPNSEDACSGSAGI---DSSLIATGMANDNRKSPKIV 1252

Query: 388  SLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
             L+ +LK+A++C+  +   T  +H  E                      K G D S  S+
Sbjct: 1253 PLSLVLKKAKRCHSVKLCKTESTHLYE----------------------KKGSDCSVNSS 1290

Query: 568  AKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECS 747
            +    S +   I     S Q+   ++       S   L + F  H K       E +   
Sbjct: 1291 SDC--SVDKCPIDDEGCSPQAEYEMQGSKRSRYSSNGLRSHFMAHCKRPSGVLGEDDPLG 1348

Query: 748  KLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQ-- 921
               M  ++L+ + S   G +++   +   +I +     A  S  +  + +++A    +  
Sbjct: 1349 LKDMETNRLSITSSRSNGTKNRRASVSLTRI-KRHKKFANKSACYSSSGKENAVLTHEEN 1407

Query: 922  VSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKC 1101
            V R    L+ DA CCVCG S+    NQ +EC  C IKVHQACYGV K+P+G W CRPCK 
Sbjct: 1408 VRRDSGRLSLDAPCCVCGISDPEPCNQFIECCKCYIKVHQACYGVLKVPRGQWFCRPCKT 1467

Query: 1102 NSQDIVCVLCGYGDGAMTRAVKCQNIIKSLLKA 1200
            N+ +  CVLCGYG GAMTRA+K +NI+KSLLK+
Sbjct: 1468 NTLNTACVLCGYGGGAMTRALKTKNILKSLLKS 1500


>ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
            gi|223540953|gb|EEF42511.1| mixed-lineage leukemia
            protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  145 bits (366), Expect = 3e-32
 Identities = 112/405 (27%), Positives = 178/405 (43%), Gaps = 23/405 (5%)
 Frame = +1

Query: 64   LSFNKFKERIGYQDGVPKDDDKQLQNDDISLRRLKRVGEKMKQGLVACS-------KHES 222
            LS N+   R+        +    + +DD S   L+ +G K  + + A         +  +
Sbjct: 312  LSRNRDLHRLYNAGDGEANPHNDINHDDNSCEVLEILGRKKFRSIHAADLSIQFQRQDCT 371

Query: 223  RSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGD-QKPAKIISLAS 399
            ++   K  K+ SL+ I               V CG          +GD  KPAKI+SL  
Sbjct: 372  QAVGEKAGKYDSLDRIKASSAQHLCHGKAKPVACGKYGEIVNGNLNGDVSKPAKIVSLDK 431

Query: 400  ILKRARKCNLTETSDTAVSHHSE--TSEDAKNSAI--FHRLEESCECLRKNGEDLSSPST 567
            +LK A+KC+L +     ++   E  T+    N+    F  L +  E  R         + 
Sbjct: 432  VLKTAQKCSLPKICKPGLTSSKEIGTNFSWSNACFGKFSNLTKEKEHGRNVALLCKDMNV 491

Query: 568  AKAFHSGNNTGIRCHLHSMQSISNLR---------CKDICTCSPGKLAAKFRHHAKSTCS 720
              +    +N+       S   +S L          C  + T +  +  +K+R   K +  
Sbjct: 492  RTSLEKRSNSFANYDEQSADEVSMLEKSEGKNGRGCVILDTIAHAQSRSKYRETRKRSLY 551

Query: 721  STTEINECS--KLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNN 894
              T   + S  K+   K      P    G   ++++       +   P            
Sbjct: 552  ELTLKGKSSSPKMVSRKKNFKYVPKMKLGKTLRNSEKSHDNGSQKVDPK----------- 600

Query: 895  QDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKG 1074
                 + ++  +  S+ + D+FC VC SSN+ + N LLEC  C I+VHQACYGVS++PKG
Sbjct: 601  -----RCAREQKHLSITDMDSFCSVCRSSNKDEVNCLLECRRCSIRVHQACYGVSRVPKG 655

Query: 1075 NWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLLKAWKV 1209
            +W CRPC+ +++DIVCVLCGYG GAMT A++ + I+K LLKAW +
Sbjct: 656  HWYCRPCRTSAKDIVCVLCGYGGGAMTLALRSRTIVKGLLKAWNL 700


>gb|EOY29408.1| Uncharacterized protein isoform 9 [Theobroma cacao]
          Length = 1619

 Score =  144 bits (364), Expect = 6e-32
 Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 33/426 (7%)
 Frame = +1

Query: 31   SAAIIKRKRSVLSFNKFKERIGYQDGVP--KDDDKQLQNDDISLRR-LKRVGEKMKQGLV 201
            SA I+ +KR +     + ++ G +D  P  K D +  +  ++S R+ LKR G       +
Sbjct: 923  SAKIVSQKRDL--HGVYNDQDGEEDYQPELKCDARFGKIPEVSGRKKLKRAGAFDSFESL 980

Query: 202  ACSKHESRSGTAKPPKFMSLNCIA--NXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQ-K 372
              SK   R+   K     +++CI   +             +VCG            D+ +
Sbjct: 981  GTSKSILRT-VEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1039

Query: 373  PAKIISLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDL 552
            PAKI+ L+ +LK   +C L ++     +      +    S ++  L+++ E     G   
Sbjct: 1040 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKKRRPKSTVYFDLKKAEE---NGGNQF 1096

Query: 553  SSPSTAKAFH--SGNNT---GIR--------------------CHLHS--MQSISNLRCK 651
            S        H   G  T   GI+                    C +      + SN+RCK
Sbjct: 1097 SVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCK 1156

Query: 652  DICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQ 831
            +I   S  +L  K     K + S +  + E SK         C P              +
Sbjct: 1157 EIRKRSLYELTGK----GKESGSDSHPLMEISK---------CMP--------------K 1189

Query: 832  QKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLE 1011
             K+ +    T        +++  +A K    +R  S+++ D FCCVCGSSN+ + N LLE
Sbjct: 1190 MKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDVFCCVCGSSNKDEFNCLLE 1249

Query: 1012 CHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSL 1191
            C  C I+VHQACYG+ K+P+G+W CRPC+ +S+D VCVLCGYG GAMT+A++ +  +K L
Sbjct: 1250 CSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGL 1309

Query: 1192 LKAWKV 1209
            LKAW +
Sbjct: 1310 LKAWNI 1315


>gb|EOY29407.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
          Length = 2068

 Score =  144 bits (364), Expect = 6e-32
 Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 33/426 (7%)
 Frame = +1

Query: 31   SAAIIKRKRSVLSFNKFKERIGYQDGVP--KDDDKQLQNDDISLRR-LKRVGEKMKQGLV 201
            SA I+ +KR +     + ++ G +D  P  K D +  +  ++S R+ LKR G       +
Sbjct: 1289 SAKIVSQKRDL--HGVYNDQDGEEDYQPELKCDARFGKIPEVSGRKKLKRAGAFDSFESL 1346

Query: 202  ACSKHESRSGTAKPPKFMSLNCIA--NXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQ-K 372
              SK   R+   K     +++CI   +             +VCG            D+ +
Sbjct: 1347 GTSKSILRT-VEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1405

Query: 373  PAKIISLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDL 552
            PAKI+ L+ +LK   +C L ++     +      +    S ++  L+++ E     G   
Sbjct: 1406 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKKRRPKSTVYFDLKKAEE---NGGNQF 1462

Query: 553  SSPSTAKAFH--SGNNT---GIR--------------------CHLHS--MQSISNLRCK 651
            S        H   G  T   GI+                    C +      + SN+RCK
Sbjct: 1463 SVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCK 1522

Query: 652  DICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQ 831
            +I   S  +L  K     K + S +  + E SK         C P              +
Sbjct: 1523 EIRKRSLYELTGK----GKESGSDSHPLMEISK---------CMP--------------K 1555

Query: 832  QKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLE 1011
             K+ +    T        +++  +A K    +R  S+++ D FCCVCGSSN+ + N LLE
Sbjct: 1556 MKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDVFCCVCGSSNKDEFNCLLE 1615

Query: 1012 CHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSL 1191
            C  C I+VHQACYG+ K+P+G+W CRPC+ +S+D VCVLCGYG GAMT+A++ +  +K L
Sbjct: 1616 CSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGL 1675

Query: 1192 LKAWKV 1209
            LKAW +
Sbjct: 1676 LKAWNI 1681


>gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  144 bits (364), Expect = 6e-32
 Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 33/426 (7%)
 Frame = +1

Query: 31   SAAIIKRKRSVLSFNKFKERIGYQDGVP--KDDDKQLQNDDISLRR-LKRVGEKMKQGLV 201
            SA I+ +KR +     + ++ G +D  P  K D +  +  ++S R+ LKR G       +
Sbjct: 1289 SAKIVSQKRDL--HGVYNDQDGEEDYQPELKCDARFGKIPEVSGRKKLKRAGAFDSFESL 1346

Query: 202  ACSKHESRSGTAKPPKFMSLNCIA--NXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQ-K 372
              SK   R+   K     +++CI   +             +VCG            D+ +
Sbjct: 1347 GTSKSILRT-VEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1405

Query: 373  PAKIISLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDL 552
            PAKI+ L+ +LK   +C L ++     +      +    S ++  L+++ E     G   
Sbjct: 1406 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKKRRPKSTVYFDLKKAEE---NGGNQF 1462

Query: 553  SSPSTAKAFH--SGNNT---GIR--------------------CHLHS--MQSISNLRCK 651
            S        H   G  T   GI+                    C +      + SN+RCK
Sbjct: 1463 SVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCK 1522

Query: 652  DICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQ 831
            +I   S  +L  K     K + S +  + E SK         C P              +
Sbjct: 1523 EIRKRSLYELTGK----GKESGSDSHPLMEISK---------CMP--------------K 1555

Query: 832  QKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLE 1011
             K+ +    T        +++  +A K    +R  S+++ D FCCVCGSSN+ + N LLE
Sbjct: 1556 MKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDVFCCVCGSSNKDEFNCLLE 1615

Query: 1012 CHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSL 1191
            C  C I+VHQACYG+ K+P+G+W CRPC+ +S+D VCVLCGYG GAMT+A++ +  +K L
Sbjct: 1616 CSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGL 1675

Query: 1192 LKAWKV 1209
            LKAW +
Sbjct: 1676 LKAWNI 1681


>gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score =  144 bits (364), Expect = 6e-32
 Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 33/426 (7%)
 Frame = +1

Query: 31   SAAIIKRKRSVLSFNKFKERIGYQDGVP--KDDDKQLQNDDISLRR-LKRVGEKMKQGLV 201
            SA I+ +KR +     + ++ G +D  P  K D +  +  ++S R+ LKR G       +
Sbjct: 923  SAKIVSQKRDL--HGVYNDQDGEEDYQPELKCDARFGKIPEVSGRKKLKRAGAFDSFESL 980

Query: 202  ACSKHESRSGTAKPPKFMSLNCIA--NXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQ-K 372
              SK   R+   K     +++CI   +             +VCG            D+ +
Sbjct: 981  GTSKSILRT-VEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1039

Query: 373  PAKIISLASILKRARKCNLTETSDTAVSHHSETSEDAKNSAIFHRLEESCECLRKNGEDL 552
            PAKI+ L+ +LK   +C L ++     +      +    S ++  L+++ E     G   
Sbjct: 1040 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKKRRPKSTVYFDLKKAEE---NGGNQF 1096

Query: 553  SSPSTAKAFH--SGNNT---GIR--------------------CHLHS--MQSISNLRCK 651
            S        H   G  T   GI+                    C +      + SN+RCK
Sbjct: 1097 SVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCK 1156

Query: 652  DICTCSPGKLAAKFRHHAKSTCSSTTEINECSKLTMAKDQLNCSPSAICGLEDQDNKLHQ 831
            +I   S  +L  K     K + S +  + E SK         C P              +
Sbjct: 1157 EIRKRSLYELTGK----GKESGSDSHPLMEISK---------CMP--------------K 1189

Query: 832  QKILEPASPTAIGSFPFPKNNQDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLE 1011
             K+ +    T        +++  +A K    +R  S+++ D FCCVCGSSN+ + N LLE
Sbjct: 1190 MKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDVFCCVCGSSNKDEFNCLLE 1249

Query: 1012 CHDCLIKVHQACYGVSKIPKGNWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSL 1191
            C  C I+VHQACYG+ K+P+G+W CRPC+ +S+D VCVLCGYG GAMT+A++ +  +K L
Sbjct: 1250 CSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGL 1309

Query: 1192 LKAWKV 1209
            LKAW +
Sbjct: 1310 LKAWNI 1315


>tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1566

 Score =  142 bits (357), Expect = 4e-31
 Identities = 111/390 (28%), Positives = 171/390 (43%), Gaps = 6/390 (1%)
 Frame = +1

Query: 46   KRKRSVLSFNKFKERIGYQDGVPKDDDKQLQNDDI-----SLRRLKRVGEKMKQGLVACS 210
            KRK  ++  NK  +++  Q    + D++     +      S  R ++V +          
Sbjct: 845  KRKHPIMHLNKHVKQLHRQTKFFEGDEQPDAKGNFLGGLDSYDRKRQVEDMSTLDKTRHH 904

Query: 211  KHESRSGTAKPPKFMSLNCIANXXXXXXXXXXXXXVVCGNXXXXXXXXTDGDQKPAKIIS 390
            +  SR+   K PK++SLNCI N                 +         D  + P KI+ 
Sbjct: 905  QEGSRAFVRKLPKYVSLNCIVNEPNTNSEGACSGSGGIDSSLIATGITNDNRKSP-KIVP 963

Query: 391  LASILKRARKCNLTETSDTAVSH-HSETSEDAKNSAIFHRLEESCECLRKNGEDLSSPST 567
            L  +LK+A++CN  +   T  +H + E S D   ++  + +E+         ++  SP  
Sbjct: 964  LNLVLKKAKRCNAVKLRKTESTHLYEEKSSDCSVNSSDYSIEKYSV-----DDENCSPQA 1018

Query: 568  AKAFHSGNNTGIRCHLHSMQSISNLRCKDICTCSPGKLAAKFRHHAKSTCSSTTEINECS 747
                             S  S ++LR                 H A    +S    N  +
Sbjct: 1019 EYELQDSKR--------SRYSSNDLRS----------------HVALHKRTSGGTKNRRT 1054

Query: 748  KLTMAKDQLNCSPSAICGLEDQDNKLHQQKILEPASPTAIGSFPFPKNNQDHAGKLSQVS 927
             +++A+ +   S S      D+DN +   ++                N + ++G+LS  S
Sbjct: 1055 SVSLARIKKFGSKSVCYSGSDKDNAVLAHEV----------------NARRYSGRLSSNS 1098

Query: 928  RSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKGNWCCRPCKCNS 1107
                       CCVCG S+    N+L+EC  C IKVHQACYGV K+P+G W CRPCK N+
Sbjct: 1099 P----------CCVCGISDLEPCNRLIECSKCYIKVHQACYGVLKVPRGQWFCRPCKNNT 1148

Query: 1108 QDIVCVLCGYGDGAMTRAVKCQNIIKSLLK 1197
             D VCVLCGYG GAMTRA+K +NI+KSLL+
Sbjct: 1149 MDTVCVLCGYGGGAMTRALKTKNILKSLLQ 1178


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  137 bits (344), Expect = 1e-29
 Identities = 60/105 (57%), Positives = 75/105 (71%)
 Frame = +1

Query: 895  QDHAGKLSQVSRSRSLLNPDAFCCVCGSSNQGDTNQLLECHDCLIKVHQACYGVSKIPKG 1074
            Q  A   +Q  R     + D  CCVCGSSNQ + N LLEC  C ++VHQACYGVSK+PKG
Sbjct: 1637 QHSAKNSTQEHRCHCNCDSDPICCVCGSSNQDEINILLECSQCSVRVHQACYGVSKVPKG 1696

Query: 1075 NWCCRPCKCNSQDIVCVLCGYGDGAMTRAVKCQNIIKSLLKAWKV 1209
             W CRPC+ +S+DIVCVLCGYG GAMT+A++ Q I  S+L+AW +
Sbjct: 1697 CWSCRPCRMSSKDIVCVLCGYGGGAMTQALRSQTIAVSILRAWNI 1741

