BLASTX nr result

ID: Ophiopogon21_contig00024839

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon21_contig00024839
         (1505 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010921925.1| PREDICTED: uncharacterized protein LOC105045...   408   e-111
ref|XP_010921923.1| PREDICTED: uncharacterized protein LOC105045...   408   e-111
ref|XP_008787592.1| PREDICTED: uncharacterized protein LOC103705...   393   e-106
ref|XP_009412876.1| PREDICTED: uncharacterized protein LOC103994...   341   9e-91
ref|XP_010273302.1| PREDICTED: uncharacterized protein LOC104608...   283   3e-73
ref|XP_004958593.1| PREDICTED: uncharacterized protein LOC101777...   261   1e-66
ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [S...   255   8e-65
gb|EMT20858.1| hypothetical protein F775_52258 [Aegilops tauschii]    253   4e-64
gb|KQK14817.1| hypothetical protein BRADI_1g18770 [Brachypodium ...   249   3e-63
ref|XP_010234427.1| PREDICTED: uncharacterized protein LOC100822...   249   3e-63
ref|XP_008653439.1| PREDICTED: uncharacterized protein LOC103633...   248   1e-62
tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea m...   248   1e-62
gb|KHG13465.1| Nucleosome-remodeling factor subunit [Gossypium a...   246   5e-62
gb|KDO50419.1| hypothetical protein CISIN_1g000462mg [Citrus sin...   242   6e-61
gb|KDO50418.1| hypothetical protein CISIN_1g000462mg [Citrus sin...   242   6e-61
gb|KJB09356.1| hypothetical protein B456_001G136300 [Gossypium r...   241   2e-60
gb|KJB09354.1| hypothetical protein B456_001G136300 [Gossypium r...   241   2e-60

>ref|XP_010921925.1| PREDICTED: uncharacterized protein LOC105045366 isoform X2 [Elaeis
            guineensis]
          Length = 1023

 Score =  408 bits (1049), Expect = e-111
 Identities = 238/509 (46%), Positives = 302/509 (59%), Gaps = 8/509 (1%)
 Frame = +3

Query: 3    VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173
            VNAIS  W VPV+AS ++N    + ++H+ ++     M+S+     K +   D    + P
Sbjct: 43   VNAISSYWKVPVNASNSSNHGHEIPNVHEVLD---ASMHSQHLALAKQEVSIDGIIENAP 99

Query: 174  HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353
             E +AS    EP     SD   LN + + +   +N PF+CS  +DE   A T    SQ+ 
Sbjct: 100  KEYSASPGCSEPNCLSASDLRQLNLMDSHQSAEINRPFACSESVDEMADATTCDQLSQQI 159

Query: 354  DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521
              +C      P K+ I  K  DL+V +E  + L G G     ITDR +   S LQSDPG 
Sbjct: 160  YNECSKNENVPDKEFISVKPVDLSVENEKYVELPGWGVGISLITDRWKGVDSRLQSDPGC 219

Query: 522  YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701
            Y+NYYTFGRIA S+  ELMHK+SE+ N E KK +ED+ + QLKAIS  S +   Y+ Q  
Sbjct: 220  YVNYYTFGRIAFSVAQELMHKASESGNKESKKPVEDMMSQQLKAISKNSIRFCWYSNQKL 279

Query: 702  SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881
            S+D QKE CGWC+SC++ N  DCLFKV  DK+LE SK                 +T G  
Sbjct: 280  SLDAQKEKCGWCYSCKSLNGSDCLFKVMDDKHLESSK----------------PRTAGLR 323

Query: 882  SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061
            SEK  +SHI +AMHHILSIE      LSG WE PH+S  WRKAV+KASD           
Sbjct: 324  SEKKKKSHILSAMHHILSIEDRVRCFLSGLWENPHYSNLWRKAVLKASDVASLKHLLLNL 383

Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238
                     S EW KPVD    + SAS +VT  +  SS+  GSRKQ KK +S +E   + 
Sbjct: 384  ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSLLVSSNNGGSRKQSKKTLSVSE---SV 440

Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418
            R      + WWRGGRLSRQ+FQ K+LPR LASKGG QAG +KIP+ILYPD S+ A+RSK+
Sbjct: 441  REPAAGSLFWWRGGRLSRQVFQWKILPRSLASKGGHQAGCKKIPNILYPDGSEFARRSKF 500

Query: 1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            VAWR AVEMSQSVAQLI+Q K+FDSNI+W
Sbjct: 501  VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 529


>ref|XP_010921923.1| PREDICTED: uncharacterized protein LOC105045366 isoform X1 [Elaeis
            guineensis] gi|743785508|ref|XP_010921924.1| PREDICTED:
            uncharacterized protein LOC105045366 isoform X1 [Elaeis
            guineensis]
          Length = 1619

 Score =  408 bits (1049), Expect = e-111
 Identities = 238/509 (46%), Positives = 302/509 (59%), Gaps = 8/509 (1%)
 Frame = +3

Query: 3    VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173
            VNAIS  W VPV+AS ++N    + ++H+ ++     M+S+     K +   D    + P
Sbjct: 639  VNAISSYWKVPVNASNSSNHGHEIPNVHEVLD---ASMHSQHLALAKQEVSIDGIIENAP 695

Query: 174  HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353
             E +AS    EP     SD   LN + + +   +N PF+CS  +DE   A T    SQ+ 
Sbjct: 696  KEYSASPGCSEPNCLSASDLRQLNLMDSHQSAEINRPFACSESVDEMADATTCDQLSQQI 755

Query: 354  DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521
              +C      P K+ I  K  DL+V +E  + L G G     ITDR +   S LQSDPG 
Sbjct: 756  YNECSKNENVPDKEFISVKPVDLSVENEKYVELPGWGVGISLITDRWKGVDSRLQSDPGC 815

Query: 522  YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701
            Y+NYYTFGRIA S+  ELMHK+SE+ N E KK +ED+ + QLKAIS  S +   Y+ Q  
Sbjct: 816  YVNYYTFGRIAFSVAQELMHKASESGNKESKKPVEDMMSQQLKAISKNSIRFCWYSNQKL 875

Query: 702  SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881
            S+D QKE CGWC+SC++ N  DCLFKV  DK+LE SK                 +T G  
Sbjct: 876  SLDAQKEKCGWCYSCKSLNGSDCLFKVMDDKHLESSK----------------PRTAGLR 919

Query: 882  SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061
            SEK  +SHI +AMHHILSIE      LSG WE PH+S  WRKAV+KASD           
Sbjct: 920  SEKKKKSHILSAMHHILSIEDRVRCFLSGLWENPHYSNLWRKAVLKASDVASLKHLLLNL 979

Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238
                     S EW KPVD    + SAS +VT  +  SS+  GSRKQ KK +S +E   + 
Sbjct: 980  ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSLLVSSNNGGSRKQSKKTLSVSE---SV 1036

Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418
            R      + WWRGGRLSRQ+FQ K+LPR LASKGG QAG +KIP+ILYPD S+ A+RSK+
Sbjct: 1037 REPAAGSLFWWRGGRLSRQVFQWKILPRSLASKGGHQAGCKKIPNILYPDGSEFARRSKF 1096

Query: 1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            VAWR AVEMSQSVAQLI+Q K+FDSNI+W
Sbjct: 1097 VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 1125


>ref|XP_008787592.1| PREDICTED: uncharacterized protein LOC103705599 [Phoenix dactylifera]
            gi|672128223|ref|XP_008787593.1| PREDICTED:
            uncharacterized protein LOC103705599 [Phoenix
            dactylifera]
          Length = 1634

 Score =  393 bits (1009), Expect = e-106
 Identities = 236/509 (46%), Positives = 298/509 (58%), Gaps = 8/509 (1%)
 Frame = +3

Query: 3    VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173
            V+AIS  W V V+AS ++N    + ++H+ + D + H  SE     K +   D      P
Sbjct: 640  VDAISSYWKVTVNASNSSNHGHEIPNVHE-VLDASVH--SEHLTLSKQEVSFDGIIEKAP 696

Query: 174  HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353
             + +AS    EP     SD   LN + + +   +N PF+ S   DE   AAT    S++ 
Sbjct: 697  KDYSASPGCSEPNRLSTSDLRQLNLMDSRQSAEINQPFAHSESADEMADAATCDPISRQI 756

Query: 354  DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521
              DC      P K+ I     +L+V +E  + L G G     ITDR +   S LQSDPG 
Sbjct: 757  YNDCSRNENVPDKEFISVNPVELSVDNEKYVELPGWGVGTSLITDRWKGADSRLQSDPGC 816

Query: 522  YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701
            Y+NYYTFGRIA S+  ELMHKSSE+ N E KK +ED+ + QLKAIS KS +      Q  
Sbjct: 817  YMNYYTFGRIAFSVAQELMHKSSESGNKESKKPVEDMMSQQLKAISKKSIRFCWCTNQKL 876

Query: 702  SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881
            S+D QKE CGWC SC+T N  +CLFK+  DK+LE SK                 + VG  
Sbjct: 877  SLDAQKEKCGWCHSCKTLNGSNCLFKIMDDKHLESSK----------------PRIVGLR 920

Query: 882  SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061
            SEK  +SHI +AMHHILSIE      LSGPWEKPH+S  WRKAV+KASD           
Sbjct: 921  SEKKKKSHILSAMHHILSIEDRLRCFLSGPWEKPHYSNLWRKAVLKASDVASLKHLLLTL 980

Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238
                     S EW KPVD    + SAS +VT  V  SS+   SRKQ KK++S +E   + 
Sbjct: 981  ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSVLMSSNNGSSRKQSKKSLSVSE---SV 1037

Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418
            R      + WWRGGRLSRQ+F  K+LPR LASKGGRQAG +KIP++LYPD S+ A+RSK+
Sbjct: 1038 RDPAAGSVFWWRGGRLSRQVFHWKILPRSLASKGGRQAGCKKIPNMLYPDGSEFARRSKF 1097

Query: 1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            VAWR AVEMSQSVAQLI+Q K+FDSNI+W
Sbjct: 1098 VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 1126


>ref|XP_009412876.1| PREDICTED: uncharacterized protein LOC103994272 [Musa acuminata
            subsp. malaccensis]
          Length = 1291

 Score =  341 bits (875), Expect = 9e-91
 Identities = 204/512 (39%), Positives = 286/512 (55%), Gaps = 11/512 (2%)
 Frame = +3

Query: 3    VNAISMCW-VPVDASYANNQSMNDI--HDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173
            VN IS  W + +D+  + +QS ++I   ++  D   ++ S  P     D + +       
Sbjct: 637  VNTISAQWGISLDSHSSISQSCHEIINRNEALDSQLNLLSSDPNVVNDDIVKNSK----- 691

Query: 174  HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353
             +N  +SE  +P ++  SD    N V+ D    M+  F  S   ++  HA      +Q+T
Sbjct: 692  -DNCTNSEHSDPISANASDLSQTNLVSLDHASGMSLLFVSSEPAEQLAHAVNYLQSTQQT 750

Query: 354  DTDCPMGSIAPSKQVIPSKHTDLTVASENC-----IGLQGRGCITDRNRFGAS--ELQSD 512
               C + +  P  +VI    T + ++++N        L G   I+++ +  A   +LQSD
Sbjct: 751  TDSCSIATDNPVDEVISV--TPVVISTDNSKHFAITDLGGTSFISEQVQKKAETCKLQSD 808

Query: 513  PGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAY 692
            P  YINYY FGR+ASS+  +LM KSSE+ N E KKS ED+   QLKAI  +  K   Y++
Sbjct: 809  PCGYINYYIFGRVASSVAEDLMIKSSESNNKEPKKSDEDMVVAQLKAIFKRCPKLSSYSF 868

Query: 693  QNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTV 872
               S+D+QKE CGWC SC+TS+  DC F V                   K++E+ +S  V
Sbjct: 869  LQQSLDIQKEKCGWCHSCKTSSSSDCAFVVND-----------------KHIEDMKSDAV 911

Query: 873  GPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXX 1052
            G  SEK  +SHI + MH ILSIE     LLSGPW+ PH+S  WRKAVMKASD        
Sbjct: 912  GLDSEKKKKSHIVSVMHDILSIEDHLNGLLSGPWDNPHYSSLWRKAVMKASDVASLKHML 971

Query: 1053 XXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFS 1229
                          +W KPVD A  + SAS I+   +   S+  GSRKQGK+  S +EF+
Sbjct: 972  LLLESNLRRVAMLSDWMKPVDFAHTVGSASHILIGSMDAFSNCGGSRKQGKRTTSGSEFN 1031

Query: 1230 FTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKR 1409
              S+ A  S +CWWRGGRLSR++F  KMLPR L SKGGRQAG +KI ++ YPD  + A+R
Sbjct: 1032 I-SQAAAASYVCWWRGGRLSRRVFHWKMLPRSLTSKGGRQAGCKKISNVFYPDVPEFARR 1090

Query: 1410 SKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            +K++ WR AVEMS++VAQL +  K+FDSNI+W
Sbjct: 1091 NKFITWRAAVEMSETVAQLAFLTKEFDSNIRW 1122


>ref|XP_010273302.1| PREDICTED: uncharacterized protein LOC104608880 [Nelumbo nucifera]
          Length = 1956

 Score =  283 bits (724), Expect = 3e-73
 Identities = 185/481 (38%), Positives = 248/481 (51%), Gaps = 36/481 (7%)
 Frame = +3

Query: 171  PHENAASSEQCEPRASQISDAV-HLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQ 347
            P+E +  SE    ++S+ISD++  LNS T ++ + M  P + S    +          SQ
Sbjct: 830  PNEGSVISEGLAHQSSKISDSISRLNSATVNQFMEMASPLASSEGSADISQVNAGKQTSQ 889

Query: 348  ETDTDCPMGSIAPSKQVIPSK-----------HTDLTVASEN-CIG-----------LQG 458
            +   DC    I  +   IP K             DL V  E   IG            Q 
Sbjct: 890  KNGADCSNKLIQSADSEIPVKLQSAIGEDLPNPADLGVKQEEGFIGEQLSKPADLNDKQE 949

Query: 459  RGC----------ITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNE 608
            +G           + +  R   S +Q + G+Y+N Y F + A+S+  EL+HKSSE  N +
Sbjct: 950  KGLAPAVPIHTSPVNNTKRVVPSPMQFESGSYVNCYIFAQTAASVAEELLHKSSERINED 1009

Query: 609  LKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTSNDF-DCLFKVA 785
               S+++I + QLK IS KSTK      QN   D+QKENCGWCFSC+   D  +CLF + 
Sbjct: 1010 PNSSVDEIVSAQLKVISKKSTKLCWSNIQNLYKDLQKENCGWCFSCKNPTDSGNCLFNMF 1069

Query: 786  GDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLS 965
              K+                 E  +S  VG HS+KN ++H+   +HHILSIE     LLS
Sbjct: 1070 NKKHPP---------------EGPKSGAVGLHSKKNRKNHLFDVIHHILSIEHRLSGLLS 1114

Query: 966  GPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS- 1142
            GPW+ P +S  WRK+V+KASD                    S EW K VD    + SAS 
Sbjct: 1115 GPWQNPLYSMQWRKSVLKASDIASVKRLLLILESSLRRIALSEEWLKQVDSVFTMGSASH 1174

Query: 1143 IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPR 1322
            ++T+ V   S +   RK+G+   S  + SF+S  A  S I WWRGGRLSRQ++    LP 
Sbjct: 1175 VLTTSVNLPSKHGIGRKRGR--FSDADSSFSSNTAG-SGIFWWRGGRLSRQVYHWMFLPH 1231

Query: 1323 PLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIK 1502
             LA K GRQAG  KIP ILYPD S+LAKRSKY+AWR A+EM  SV QL +Q ++ DSNI+
Sbjct: 1232 TLAYKAGRQAGCIKIPGILYPDGSELAKRSKYIAWRAALEMCISVPQLAFQVRELDSNIR 1291

Query: 1503 W 1505
            W
Sbjct: 1292 W 1292


>ref|XP_004958593.1| PREDICTED: uncharacterized protein LOC101777112 [Setaria italica]
            gi|944262782|gb|KQL27039.1| hypothetical protein
            SETIT_028659mg [Setaria italica]
          Length = 1696

 Score =  261 bits (666), Expect = 1e-66
 Identities = 145/336 (43%), Positives = 188/336 (55%), Gaps = 1/336 (0%)
 Frame = +3

Query: 501  LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680
            L SD   YINYY+FG+IA+S   EL HK SE  N E KK ++D  +  L+ I  K     
Sbjct: 700  LHSDLARYINYYSFGQIAASAAEELKHKLSE--NKEGKKPVQDALSFHLRTICKKYANIF 757

Query: 681  RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860
                Q  S+++ KE CGWC SC+ S   DC+F+V   K +EG+K                
Sbjct: 758  ALTDQKLSVELLKEKCGWCNSCQISGGVDCIFRVTDVKCMEGTK---------------- 801

Query: 861  SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040
               +G  +EKN ESHI  AMH+ILSIE     LL+GPW+ P +  +WRK V+KA+D    
Sbjct: 802  PHALGVEAEKNMESHIILAMHNILSIEERLNGLLTGPWQNPQYRIYWRKEVLKAADVSSL 861

Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217
                            S+EW KP D    + SA+ I+     +S S+  +RK G+K  S 
Sbjct: 862  KQPLLMLESSLRRVAISMEWQKPADSVEVVGSAAHILVRSSNKSLSHGTARKPGRKPSSN 921

Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397
             E    SR      + WWRGG+LSRQ+F  K LP+ L  K  RQAGRRKIP ILY D S 
Sbjct: 922  GELKVDSRNV---GVYWWRGGKLSRQVFHWKRLPQSLVYKAARQAGRRKIPTILYTDGSQ 978

Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             A+R KY+AWR AVEM+++VAQLI Q K+ + NIKW
Sbjct: 979  FARRFKYIAWRAAVEMAENVAQLILQIKELEWNIKW 1014


>ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3
            [Theobroma cacao] gi|590584387|ref|XP_007015164.1| DNA
            binding,zinc ion binding,DNA binding, putative isoform 3
            [Theobroma cacao] gi|508785526|gb|EOY32782.1| DNA
            binding,zinc ion binding,DNA binding, putative isoform 3
            [Theobroma cacao] gi|508785527|gb|EOY32783.1| DNA
            binding,zinc ion binding,DNA binding, putative isoform 3
            [Theobroma cacao]
          Length = 1859

 Score =  256 bits (654), Expect = 4e-65
 Identities = 181/520 (34%), Positives = 252/520 (48%), Gaps = 19/520 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161
            + AI   W   D +  +N + +++ D +N +   T M  + P     LPP   G +    
Sbjct: 762  LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817

Query: 162  GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341
                 +     ++    +  +   V  ++   D +     P+  S    E +   +  H 
Sbjct: 818  NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877

Query: 342  SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491
             Q+       GS   S Q  +P K ++L   S    GL         Q   C  +  R  
Sbjct: 878  FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931

Query: 492  ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671
            AS+ Q   G Y+NYY+F + AS +  ELM K SE  N +  KS+E+I  +Q+K I  KS 
Sbjct: 932  ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990

Query: 672  KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848
            +       N  +D +KENCGWCF CR   +D DCLFK+      E SK            
Sbjct: 991  RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038

Query: 849  EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028
                S+ VG  S+ N + H+   + H  SIE     LLSGPW  P + + W K+++KASD
Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094

Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205
                                S EW K VD A  + SAS +VT+  R S+ +  +RK+G+ 
Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154

Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385
            N    E + TS  A    ICWWRGGR+SRQLF  K+LPR LASK  RQ G +KIP ILYP
Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212

Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            +SSD A+RSK +AWR AVE S S+ QL  Q ++ DSNI+W
Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252


>ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, putative isoform 2
            [Theobroma cacao] gi|508785525|gb|EOY32781.1| DNA
            binding,zinc ion binding,DNA binding, putative isoform 2
            [Theobroma cacao]
          Length = 1647

 Score =  256 bits (654), Expect = 4e-65
 Identities = 181/520 (34%), Positives = 252/520 (48%), Gaps = 19/520 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161
            + AI   W   D +  +N + +++ D +N +   T M  + P     LPP   G +    
Sbjct: 762  LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817

Query: 162  GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341
                 +     ++    +  +   V  ++   D +     P+  S    E +   +  H 
Sbjct: 818  NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877

Query: 342  SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491
             Q+       GS   S Q  +P K ++L   S    GL         Q   C  +  R  
Sbjct: 878  FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931

Query: 492  ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671
            AS+ Q   G Y+NYY+F + AS +  ELM K SE  N +  KS+E+I  +Q+K I  KS 
Sbjct: 932  ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990

Query: 672  KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848
            +       N  +D +KENCGWCF CR   +D DCLFK+      E SK            
Sbjct: 991  RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038

Query: 849  EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028
                S+ VG  S+ N + H+   + H  SIE     LLSGPW  P + + W K+++KASD
Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094

Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205
                                S EW K VD A  + SAS +VT+  R S+ +  +RK+G+ 
Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154

Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385
            N    E + TS  A    ICWWRGGR+SRQLF  K+LPR LASK  RQ G +KIP ILYP
Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212

Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            +SSD A+RSK +AWR AVE S S+ QL  Q ++ DSNI+W
Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252


>ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1
            [Theobroma cacao] gi|508785524|gb|EOY32780.1| DNA
            binding,zinc ion binding,DNA binding, putative isoform 1
            [Theobroma cacao]
          Length = 1931

 Score =  256 bits (654), Expect = 4e-65
 Identities = 181/520 (34%), Positives = 252/520 (48%), Gaps = 19/520 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161
            + AI   W   D +  +N + +++ D +N +   T M  + P     LPP   G +    
Sbjct: 762  LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817

Query: 162  GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341
                 +     ++    +  +   V  ++   D +     P+  S    E +   +  H 
Sbjct: 818  NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877

Query: 342  SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491
             Q+       GS   S Q  +P K ++L   S    GL         Q   C  +  R  
Sbjct: 878  FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931

Query: 492  ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671
            AS+ Q   G Y+NYY+F + AS +  ELM K SE  N +  KS+E+I  +Q+K I  KS 
Sbjct: 932  ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990

Query: 672  KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848
            +       N  +D +KENCGWCF CR   +D DCLFK+      E SK            
Sbjct: 991  RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038

Query: 849  EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028
                S+ VG  S+ N + H+   + H  SIE     LLSGPW  P + + W K+++KASD
Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094

Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205
                                S EW K VD A  + SAS +VT+  R S+ +  +RK+G+ 
Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154

Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385
            N    E + TS  A    ICWWRGGR+SRQLF  K+LPR LASK  RQ G +KIP ILYP
Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212

Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            +SSD A+RSK +AWR AVE S S+ QL  Q ++ DSNI+W
Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252


>ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
            gi|241926713|gb|EER99857.1| hypothetical protein
            SORBIDRAFT_02g042000 [Sorghum bicolor]
          Length = 1688

 Score =  255 bits (651), Expect = 8e-65
 Identities = 140/337 (41%), Positives = 186/337 (55%), Gaps = 1/337 (0%)
 Frame = +3

Query: 498  ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677
            +L SDP  YINYY+FG+IA++   EL HK SE K+   KK ++D+ +  L+ I  K    
Sbjct: 661  QLHSDPARYINYYSFGQIAANAAEELKHKLSENKDG--KKPVQDVLSFHLRTICKKYANI 718

Query: 678  LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857
                 Q  S ++ KE CGWC SC+ S   DC+F+V   K +EG K               
Sbjct: 719  FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRVTDIKYMEGPK--------------- 763

Query: 858  RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037
               T+   +E N +SHI  AMH+ILSIE     LLSGPW+ P +S  WR+ V+KASD   
Sbjct: 764  -PHTLDLRAESNMDSHIILAMHNILSIEERLNGLLSGPWQNPQYSICWRETVLKASDVSS 822

Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIS 1214
                             + EW KP D    + SA+ I+     +S S+  +RK G+K   
Sbjct: 823  LKKPLLTLESSLRRVAITAEWQKPADSVEVVGSAAHILVRSSNKSLSHGSARKPGRKPSP 882

Query: 1215 TNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSS 1394
              E    SR      + WWRGG+LSRQ+F  K LP+ L +K  RQAGRRKIP ILY D S
Sbjct: 883  NGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQTLVNKAARQAGRRKIPTILYTDGS 939

Query: 1395 DLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
              A+R KY+AW+ AVEM+++ AQLI Q K+ + NIKW
Sbjct: 940  QFARRFKYIAWQAAVEMAENAAQLILQIKELEWNIKW 976


>gb|EMT20858.1| hypothetical protein F775_52258 [Aegilops tauschii]
          Length = 1851

 Score =  253 bits (645), Expect = 4e-64
 Identities = 146/336 (43%), Positives = 185/336 (55%), Gaps = 1/336 (0%)
 Frame = +3

Query: 501  LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680
            L+S    YINYY+FG+IA+S   EL HK SE  N E KK   D  + +LK I  K     
Sbjct: 632  LRSGNAMYINYYSFGQIAASAAEELKHKLSE--NEEGKKHGPDAVSFRLKTICKKYVNVF 689

Query: 681  RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860
                Q  S+++ KE CGWC SC+ S   DC+F+    K +E  K                
Sbjct: 690  ALTDQKLSVELLKEKCGWCNSCQISGGSDCIFRFTDVKCMESPK---------------- 733

Query: 861  SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040
               VGP SEKN ESHI  A H +LSIE     LLSGPW+ P +S +WRKAV+ ASD    
Sbjct: 734  PCAVGPLSEKNKESHIVLATHSMLSIEKRLNGLLSGPWQNPQYSMYWRKAVLMASDVSSL 793

Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217
                            S EW KP D    + SA+ I+     +S+ Y  +RK G+K ++ 
Sbjct: 794  KQPLLTLESSLRRVAFSGEWQKPADSVEVVGSAAHILVRTSNKSAGYAIARKPGRKPLAI 853

Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397
             E     R      + WWRGG LSRQ+F  K LP+ LA K  RQAGR+KIP I+YPD S 
Sbjct: 854  -ELKVDFRDV---GVYWWRGGTLSRQVFHWKRLPQSLACKSARQAGRKKIPTIVYPDGSQ 909

Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             A+RSKY+AWR AVEM+Q+V+QLI Q K+ + NIKW
Sbjct: 910  FARRSKYIAWRAAVEMAQNVSQLILQIKELELNIKW 945


>gb|KQK14817.1| hypothetical protein BRADI_1g18770 [Brachypodium distachyon]
            gi|944079466|gb|KQK14818.1| hypothetical protein
            BRADI_1g18770 [Brachypodium distachyon]
          Length = 1723

 Score =  249 bits (637), Expect = 3e-63
 Identities = 143/336 (42%), Positives = 188/336 (55%), Gaps = 1/336 (0%)
 Frame = +3

Query: 501  LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680
            L SDP  YINYY+FG+IA+S   EL HK SE  N E KK  +D  + +LK I  K     
Sbjct: 663  LHSDPTRYINYYSFGQIAASAARELKHKLSE--NEEGKKHGQDAVSFRLKTICKKYVNVF 720

Query: 681  RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860
                Q  S+++ KE CGWC SC+ S+  DC+F+V                     ++  +
Sbjct: 721  ALTDQKLSVELLKEKCGWCNSCQISSGTDCIFRV---------------------VDGLK 759

Query: 861  SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040
               +G  SEKN ESHI  AMH+ILSIE     LLSGPW+ P +S +WRKAV++ASD    
Sbjct: 760  PCNLGLLSEKNKESHIVLAMHNILSIEERLNGLLSGPWQNPQYSIYWRKAVLRASDLSSL 819

Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217
                              +W KP D    + SA+ I+     +S SY  +RK G+K  S 
Sbjct: 820  KQPLLMLESSLRRVAFFGDWQKPADSVEVVGSAAHILVRSSNKSKSYASARKPGRKP-SI 878

Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397
            +E    S       + WWRGG LSRQ+F  K LP+ LAS+  RQAGR+KI  I+YP+ S 
Sbjct: 879  DELKVDSPDV---GVYWWRGGTLSRQVFHWKRLPQSLASRAARQAGRKKISTIVYPEGSQ 935

Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             A+R KY+AWR AVEM+Q+V+QLI Q K+ + NIKW
Sbjct: 936  FARRLKYIAWRAAVEMAQNVSQLILQIKELELNIKW 971


>ref|XP_010234427.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
            distachyon]
          Length = 1748

 Score =  249 bits (637), Expect = 3e-63
 Identities = 143/336 (42%), Positives = 188/336 (55%), Gaps = 1/336 (0%)
 Frame = +3

Query: 501  LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680
            L SDP  YINYY+FG+IA+S   EL HK SE  N E KK  +D  + +LK I  K     
Sbjct: 688  LHSDPTRYINYYSFGQIAASAARELKHKLSE--NEEGKKHGQDAVSFRLKTICKKYVNVF 745

Query: 681  RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860
                Q  S+++ KE CGWC SC+ S+  DC+F+V                     ++  +
Sbjct: 746  ALTDQKLSVELLKEKCGWCNSCQISSGTDCIFRV---------------------VDGLK 784

Query: 861  SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040
               +G  SEKN ESHI  AMH+ILSIE     LLSGPW+ P +S +WRKAV++ASD    
Sbjct: 785  PCNLGLLSEKNKESHIVLAMHNILSIEERLNGLLSGPWQNPQYSIYWRKAVLRASDLSSL 844

Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217
                              +W KP D    + SA+ I+     +S SY  +RK G+K  S 
Sbjct: 845  KQPLLMLESSLRRVAFFGDWQKPADSVEVVGSAAHILVRSSNKSKSYASARKPGRKP-SI 903

Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397
            +E    S       + WWRGG LSRQ+F  K LP+ LAS+  RQAGR+KI  I+YP+ S 
Sbjct: 904  DELKVDSPDV---GVYWWRGGTLSRQVFHWKRLPQSLASRAARQAGRKKISTIVYPEGSQ 960

Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             A+R KY+AWR AVEM+Q+V+QLI Q K+ + NIKW
Sbjct: 961  FARRLKYIAWRAAVEMAQNVSQLILQIKELELNIKW 996


>ref|XP_008653439.1| PREDICTED: uncharacterized protein LOC103633535 [Zea mays]
            gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein
            ZEAMMB73_302261 [Zea mays] gi|414887992|tpg|DAA64006.1|
            TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
          Length = 1712

 Score =  248 bits (633), Expect = 1e-62
 Identities = 139/339 (41%), Positives = 186/339 (54%), Gaps = 3/339 (0%)
 Frame = +3

Query: 498  ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677
            +L SDP  YINYY+FG+IA+S   EL HK SE  N ++KK ++D+ +  L+ I  K    
Sbjct: 688  QLHSDPARYINYYSFGQIAASAAEELKHKLSE--NKDVKKPVQDVLSFHLRTICKKYANF 745

Query: 678  LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857
                 Q  S ++ KE CGWC SC+ S   DC+F++   K +EG K               
Sbjct: 746  FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRLTDIKYMEGPK--------------- 790

Query: 858  RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037
               T+   +E N ESHI  AM++ILS+E     LLSGPW+ P +S  WR AV+KASD   
Sbjct: 791  -PHTLDLGAENNMESHIILAMYNILSVEERLNGLLSGPWQNPQYSICWRNAVLKASDVSS 849

Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGS--RKQGKKN 1208
                             + EW K  D    + SA+ I+     +S S+V +  RK G+K 
Sbjct: 850  LKQPLLMLESSLRRVAITTEWQKAADSVEVVGSAAHILVRSSNKSLSHVSATARKPGRKP 909

Query: 1209 ISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPD 1388
                E    SR      + WWRGG+LSRQ+F  K LP+ L +K  RQAGRR+IP I Y D
Sbjct: 910  SPNGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQSLVNKAARQAGRRRIPTISYTD 966

Query: 1389 SSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             S  A+R KY+AWR AVEM+++ AQLI Q K+ + NIKW
Sbjct: 967  GSQFARRFKYIAWRAAVEMAENAAQLILQIKELEWNIKW 1005


>tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
          Length = 1679

 Score =  248 bits (633), Expect = 1e-62
 Identities = 139/339 (41%), Positives = 186/339 (54%), Gaps = 3/339 (0%)
 Frame = +3

Query: 498  ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677
            +L SDP  YINYY+FG+IA+S   EL HK SE  N ++KK ++D+ +  L+ I  K    
Sbjct: 655  QLHSDPARYINYYSFGQIAASAAEELKHKLSE--NKDVKKPVQDVLSFHLRTICKKYANF 712

Query: 678  LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857
                 Q  S ++ KE CGWC SC+ S   DC+F++   K +EG K               
Sbjct: 713  FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRLTDIKYMEGPK--------------- 757

Query: 858  RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037
               T+   +E N ESHI  AM++ILS+E     LLSGPW+ P +S  WR AV+KASD   
Sbjct: 758  -PHTLDLGAENNMESHIILAMYNILSVEERLNGLLSGPWQNPQYSICWRNAVLKASDVSS 816

Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGS--RKQGKKN 1208
                             + EW K  D    + SA+ I+     +S S+V +  RK G+K 
Sbjct: 817  LKQPLLMLESSLRRVAITTEWQKAADSVEVVGSAAHILVRSSNKSLSHVSATARKPGRKP 876

Query: 1209 ISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPD 1388
                E    SR      + WWRGG+LSRQ+F  K LP+ L +K  RQAGRR+IP I Y D
Sbjct: 877  SPNGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQSLVNKAARQAGRRRIPTISYTD 933

Query: 1389 SSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
             S  A+R KY+AWR AVEM+++ AQLI Q K+ + NIKW
Sbjct: 934  GSQFARRFKYIAWRAAVEMAENAAQLILQIKELEWNIKW 972


>gb|KHG13465.1| Nucleosome-remodeling factor subunit [Gossypium arboreum]
          Length = 867

 Score =  246 bits (627), Expect = 5e-62
 Identities = 146/369 (39%), Positives = 201/369 (54%), Gaps = 2/369 (0%)
 Frame = +3

Query: 405  SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584
            S   D    S+     Q   C+ +  R  AS+LQ   G Y+N+Y+F + AS +  EL+ K
Sbjct: 112  SNDLDARQESKTKFASQQTPCVLNVKRRDASQLQPGTG-YVNHYSFAQTASLVVEELLRK 170

Query: 585  SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761
             SE  N++  KSLE+I   Q+K I  KS +       N  +D +KENCGWCFSCR   +D
Sbjct: 171  PSEKTNDDSLKSLEEIIGNQMKVILKKSNRFRWPDIYNLYVDARKENCGWCFSCRYPVDD 230

Query: 762  FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941
             DCLF++               G +    E S+S+ +      N + H+   ++HI SIE
Sbjct: 231  TDCLFRITS-------------GCVP---EVSKSEMLDLQLRWNKKGHVIDVIYHIFSIE 274

Query: 942  GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121
                 LLSGPW  P + + W K+++ AS                     S +W K VD A
Sbjct: 275  NRLSGLLSGPWLNPQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 334

Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298
              + SAS +V +  R S+ +  +RK+G  N + NE + TS  A  + ICWWRGGR+SRQL
Sbjct: 335  VIMGSASHVVIASSRGSAKHGIARKRG--NCNDNESNPTSNPAVGASICWWRGGRVSRQL 392

Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478
            F  K+LP  L SK  RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q 
Sbjct: 393  FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 452

Query: 1479 KDFDSNIKW 1505
            ++ DSNI+W
Sbjct: 453  RELDSNIRW 461


>gb|KDO50419.1| hypothetical protein CISIN_1g000462mg [Citrus sinensis]
          Length = 1306

 Score =  242 bits (618), Expect = 6e-61
 Identities = 176/518 (33%), Positives = 254/518 (49%), Gaps = 17/518 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLPHEN 182
            +NAI   W   D + ++N   +++  +   ++ HM +E P   ++D     N   L  EN
Sbjct: 449  INAICKQW---DITVSSNGVRSNLALNTVSLSRHMKAEVPTISEID-----NEQKL-EEN 499

Query: 183  AASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALL-------------DEKVHA 323
              +     P  +    A  L+SVTA EL  ++   S                  D  + A
Sbjct: 500  FLAGYSNRPDNALSKSANLLDSVTAMELPNISSEGSAETTQMNSGFDNFQKEGPDNSIRA 559

Query: 324  ATSSHPSQETDTDCPMGSI-APSKQVIPSKHTDLT--VASENCIGLQGRGCITDRNRFGA 494
            A  S+ S+        G + AP    + S  +D+    AS  C         T+  +  A
Sbjct: 560  AEFSNQSEIA------GKLPAPGHNSMTSSTSDIKQKFASSGC-----NSSPTNSRKGDA 608

Query: 495  SELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTK 674
             +LQ +   Y+N Y+F + ASS+  ELMHKSS   + E   S E+I + Q+KAI  K  K
Sbjct: 609  LQLQPEIA-YMNRYSFAQTASSVAEELMHKSSNEISKEPINSNEEIISKQMKAILKKWDK 667

Query: 675  CLRYAYQNPSMDVQKENCGWCFSCRTS-NDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLE 851
                  Q  + D QKE CGWCFSC+++ +D DCLF +   + L  S+             
Sbjct: 668  FYWPNTQKLNADTQKEKCGWCFSCKSATDDMDCLFYMNNGRVLGSSE------------- 714

Query: 852  ESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDX 1031
               S+  G  S++N + H+   + HILSIE     LL GPW  PH+++ WRK+ +KA+D 
Sbjct: 715  ---SEVAGLLSKRNKKGHLVDVICHILSIEDRLLGLLLGPWLNPHYTKLWRKSALKAADM 771

Query: 1032 XXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSASIVTSPVRRSSSYVGSRKQGKKNI 1211
                               S EW K VD    + SAS +     R++S  G+ ++  ++ 
Sbjct: 772  ASVKHLLLTLEANLQHLALSAEWFKHVDPVVTVGSASHIVIASSRANSKAGAGRKKARDF 831

Query: 1212 STNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDS 1391
              N    +++ A    +CWWRGGRLS QLF  K LPR L SK  RQAG  KIP ILYP++
Sbjct: 832  DGNP---STKAAGGLSLCWWRGGRLSCQLFSWKRLPRSLVSKAARQAGCMKIPGILYPEN 888

Query: 1392 SDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            SD A+RS+ VAWR AVE S SV QL  Q ++FDSN++W
Sbjct: 889  SDFARRSRTVAWRAAVESSTSVEQLAIQVREFDSNVRW 926


>gb|KDO50418.1| hypothetical protein CISIN_1g000462mg [Citrus sinensis]
          Length = 1482

 Score =  242 bits (618), Expect = 6e-61
 Identities = 176/518 (33%), Positives = 254/518 (49%), Gaps = 17/518 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLPHEN 182
            +NAI   W   D + ++N   +++  +   ++ HM +E P   ++D     N   L  EN
Sbjct: 449  INAICKQW---DITVSSNGVRSNLALNTVSLSRHMKAEVPTISEID-----NEQKL-EEN 499

Query: 183  AASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALL-------------DEKVHA 323
              +     P  +    A  L+SVTA EL  ++   S                  D  + A
Sbjct: 500  FLAGYSNRPDNALSKSANLLDSVTAMELPNISSEGSAETTQMNSGFDNFQKEGPDNSIRA 559

Query: 324  ATSSHPSQETDTDCPMGSI-APSKQVIPSKHTDLT--VASENCIGLQGRGCITDRNRFGA 494
            A  S+ S+        G + AP    + S  +D+    AS  C         T+  +  A
Sbjct: 560  AEFSNQSEIA------GKLPAPGHNSMTSSTSDIKQKFASSGC-----NSSPTNSRKGDA 608

Query: 495  SELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTK 674
             +LQ +   Y+N Y+F + ASS+  ELMHKSS   + E   S E+I + Q+KAI  K  K
Sbjct: 609  LQLQPEIA-YMNRYSFAQTASSVAEELMHKSSNEISKEPINSNEEIISKQMKAILKKWDK 667

Query: 675  CLRYAYQNPSMDVQKENCGWCFSCRTS-NDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLE 851
                  Q  + D QKE CGWCFSC+++ +D DCLF +   + L  S+             
Sbjct: 668  FYWPNTQKLNADTQKEKCGWCFSCKSATDDMDCLFYMNNGRVLGSSE------------- 714

Query: 852  ESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDX 1031
               S+  G  S++N + H+   + HILSIE     LL GPW  PH+++ WRK+ +KA+D 
Sbjct: 715  ---SEVAGLLSKRNKKGHLVDVICHILSIEDRLLGLLLGPWLNPHYTKLWRKSALKAADM 771

Query: 1032 XXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSASIVTSPVRRSSSYVGSRKQGKKNI 1211
                               S EW K VD    + SAS +     R++S  G+ ++  ++ 
Sbjct: 772  ASVKHLLLTLEANLQHLALSAEWFKHVDPVVTVGSASHIVIASSRANSKAGAGRKKARDF 831

Query: 1212 STNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDS 1391
              N    +++ A    +CWWRGGRLS QLF  K LPR L SK  RQAG  KIP ILYP++
Sbjct: 832  DGNP---STKAAGGLSLCWWRGGRLSCQLFSWKRLPRSLVSKAARQAGCMKIPGILYPEN 888

Query: 1392 SDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            SD A+RS+ VAWR AVE S SV QL  Q ++FDSN++W
Sbjct: 889  SDFARRSRTVAWRAAVESSTSVEQLAIQVREFDSNVRW 926


>gb|KJB09356.1| hypothetical protein B456_001G136300 [Gossypium raimondii]
          Length = 1620

 Score =  241 bits (614), Expect = 2e-60
 Identities = 145/369 (39%), Positives = 199/369 (53%), Gaps = 2/369 (0%)
 Frame = +3

Query: 405  SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584
            S   D    S+  +  Q    + +  R  AS+L    G Y+N+Y+F + AS +  EL+HK
Sbjct: 865  SNDLDARQESKTKLASQQTPRVLNAKRGDASQLLPGTG-YVNHYSFAQTASLVVEELLHK 923

Query: 585  SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761
             SE  N++  KSLE+I  +Q+K I  KS +       N  +D +KENCGWCFSCR   +D
Sbjct: 924  PSEKTNDDSLKSLEEIIGIQMKVILKKSNRLHWPDIHNLYVDARKENCGWCFSCRYPVDD 983

Query: 762  FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941
             DCLF++               G +    E S+S+ V   S  N + H+   ++HI SIE
Sbjct: 984  TDCLFRITS-------------GCVP---EVSKSEMVDLQSRWNKKGHVIDVIYHIFSIE 1027

Query: 942  GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121
                 LLSGPW    + + W K+++ AS                     S +W K VD A
Sbjct: 1028 NRLSGLLSGPWLNLQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 1087

Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298
              + SAS +V +  R S+ +  +RK+G  N   NE + TS  A    ICWWRGGR+SRQL
Sbjct: 1088 VIMGSASHVVIASSRGSAKHGIARKRGSCN--DNESNPTSNPAVGPSICWWRGGRVSRQL 1145

Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478
            F  K+LP  L SK  RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q 
Sbjct: 1146 FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 1205

Query: 1479 KDFDSNIKW 1505
            ++  SNI+W
Sbjct: 1206 RELGSNIRW 1214


>gb|KJB09354.1| hypothetical protein B456_001G136300 [Gossypium raimondii]
          Length = 1653

 Score =  241 bits (614), Expect = 2e-60
 Identities = 145/369 (39%), Positives = 199/369 (53%), Gaps = 2/369 (0%)
 Frame = +3

Query: 405  SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584
            S   D    S+  +  Q    + +  R  AS+L    G Y+N+Y+F + AS +  EL+HK
Sbjct: 898  SNDLDARQESKTKLASQQTPRVLNAKRGDASQLLPGTG-YVNHYSFAQTASLVVEELLHK 956

Query: 585  SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761
             SE  N++  KSLE+I  +Q+K I  KS +       N  +D +KENCGWCFSCR   +D
Sbjct: 957  PSEKTNDDSLKSLEEIIGIQMKVILKKSNRLHWPDIHNLYVDARKENCGWCFSCRYPVDD 1016

Query: 762  FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941
             DCLF++               G +    E S+S+ V   S  N + H+   ++HI SIE
Sbjct: 1017 TDCLFRITS-------------GCVP---EVSKSEMVDLQSRWNKKGHVIDVIYHIFSIE 1060

Query: 942  GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121
                 LLSGPW    + + W K+++ AS                     S +W K VD A
Sbjct: 1061 NRLSGLLSGPWLNLQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 1120

Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298
              + SAS +V +  R S+ +  +RK+G  N   NE + TS  A    ICWWRGGR+SRQL
Sbjct: 1121 VIMGSASHVVIASSRGSAKHGIARKRGSCN--DNESNPTSNPAVGPSICWWRGGRVSRQL 1178

Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478
            F  K+LP  L SK  RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q 
Sbjct: 1179 FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 1238

Query: 1479 KDFDSNIKW 1505
            ++  SNI+W
Sbjct: 1239 RELGSNIRW 1247

