BLASTX nr result
ID: Ophiopogon21_contig00024839
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon21_contig00024839
         (1505 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_010921925.1| PREDICTED: uncharacterized protein LOC105045...   408   e-111
ref|XP_010921923.1| PREDICTED: uncharacterized protein LOC105045...   408   e-111
ref|XP_008787592.1| PREDICTED: uncharacterized protein LOC103705...   393   e-106
ref|XP_009412876.1| PREDICTED: uncharacterized protein LOC103994...   341   9e-91
ref|XP_010273302.1| PREDICTED: uncharacterized protein LOC104608...   283   3e-73
ref|XP_004958593.1| PREDICTED: uncharacterized protein LOC101777...   261   1e-66
ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, pu...   256   4e-65
ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [S...   255   8e-65
gb|EMT20858.1| hypothetical protein F775_52258 [Aegilops tauschii]    253   4e-64
gb|KQK14817.1| hypothetical protein BRADI_1g18770 [Brachypodium ...   249   3e-63
ref|XP_010234427.1| PREDICTED: uncharacterized protein LOC100822...   249   3e-63
ref|XP_008653439.1| PREDICTED: uncharacterized protein LOC103633...   248   1e-62
tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea m...   248   1e-62
gb|KHG13465.1| Nucleosome-remodeling factor subunit [Gossypium a...   246   5e-62
gb|KDO50419.1| hypothetical protein CISIN_1g000462mg [Citrus sin...   242   6e-61
gb|KDO50418.1| hypothetical protein CISIN_1g000462mg [Citrus sin...   242   6e-61
gb|KJB09356.1| hypothetical protein B456_001G136300 [Gossypium r...   241   2e-60
gb|KJB09354.1| hypothetical protein B456_001G136300 [Gossypium r...   241   2e-60

>ref|XP_010921925.1| PREDICTED: uncharacterized protein LOC105045366 isoform X2 [Elaeis guineensis]
          Length = 1023

 Score = 408 bits (1049), Expect = e-111
 Identities = 238/509 (46%), Positives = 302/509 (59%), Gaps = 8/509 (1%)
 Frame = +3

Query: 3    VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173
            VNAIS W VPV+AS ++N + ++H+ ++ M+S+ K + D + P
Sbjct: 43   VNAISSYWKVPVNASNSSNHGHEIPNVHEVLD---ASMHSQHLALAKQEVSIDGIIENAP 99

Query: 174  HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353
            E +AS EP SD LN + + + +N PF+CS +DE A T SQ+
Sbjct: 100  KEYSASPGCSEPNCLSASDLRQLNLMDSHQSAEINRPFACSESVDEMADATTCDQLSQQI 159

Query: 354  DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521
            +C P K+ I K DL+V +E + L G G ITDR + S LQSDPG
Sbjct: 160  YNECSKNENVPDKEFISVKPVDLSVENEKYVELPGWGVGISLITDRWKGVDSRLQSDPGC 219

Query: 522  YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701
            Y+NYYTFGRIA S+ ELMHK+SE+ N E KK +ED+ + QLKAIS S + Y+ Q
Sbjct: 220  YVNYYTFGRIAFSVAQELMHKASESGNKESKKPVEDMMSQQLKAISKNSIRFCWYSNQKL 279

Query: 702  SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881
            S+D QKE CGWC+SC++ N DCLFKV DK+LE SK +T G
Sbjct: 280  SLDAQKEKCGWCYSCKSLNGSDCLFKVMDDKHLESSK----------------PRTAGLR 323

Query: 882  SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061
            SEK +SHI +AMHHILSIE LSG WE PH+S WRKAV+KASD
Sbjct: 324  SEKKKKSHILSAMHHILSIEDRVRCFLSGLWENPHYSNLWRKAVLKASDVASLKHLLLNL 383

Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238
            S EW KPVD + SAS +VT + SS+ GSRKQ KK +S +E +
Sbjct: 384  ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSLLVSSNNGGSRKQSKKTLSVSE---SV 440

Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418
            R + WWRGGRLSRQ+FQ K+LPR LASKGG QAG +KIP+ILYPD S+ A+RSK+
Sbjct: 441  REPAAGSLFWWRGGRLSRQVFQWKILPRSLASKGGHQAGCKKIPNILYPDGSEFARRSKF 500

Query:
1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 VAWR AVEMSQSVAQLI+Q K+FDSNI+W Sbjct: 501 VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 529 >ref|XP_010921923.1| PREDICTED: uncharacterized protein LOC105045366 isoform X1 [Elaeis guineensis] gi|743785508|ref|XP_010921924.1| PREDICTED: uncharacterized protein LOC105045366 isoform X1 [Elaeis guineensis] Length = 1619 Score = 408 bits (1049), Expect = e-111 Identities = 238/509 (46%), Positives = 302/509 (59%), Gaps = 8/509 (1%) Frame = +3 Query: 3 VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173 VNAIS W VPV+AS ++N + ++H+ ++ M+S+ K + D + P Sbjct: 639 VNAISSYWKVPVNASNSSNHGHEIPNVHEVLD---ASMHSQHLALAKQEVSIDGIIENAP 695 Query: 174 HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353 E +AS EP SD LN + + + +N PF+CS +DE A T SQ+ Sbjct: 696 KEYSASPGCSEPNCLSASDLRQLNLMDSHQSAEINRPFACSESVDEMADATTCDQLSQQI 755 Query: 354 DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521 +C P K+ I K DL+V +E + L G G ITDR + S LQSDPG Sbjct: 756 YNECSKNENVPDKEFISVKPVDLSVENEKYVELPGWGVGISLITDRWKGVDSRLQSDPGC 815 Query: 522 YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701 Y+NYYTFGRIA S+ ELMHK+SE+ N E KK +ED+ + QLKAIS S + Y+ Q Sbjct: 816 YVNYYTFGRIAFSVAQELMHKASESGNKESKKPVEDMMSQQLKAISKNSIRFCWYSNQKL 875 Query: 702 SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881 S+D QKE CGWC+SC++ N DCLFKV DK+LE SK +T G Sbjct: 876 SLDAQKEKCGWCYSCKSLNGSDCLFKVMDDKHLESSK----------------PRTAGLR 919 Query: 882 SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061 SEK +SHI +AMHHILSIE LSG WE PH+S WRKAV+KASD Sbjct: 920 SEKKKKSHILSAMHHILSIEDRVRCFLSGLWENPHYSNLWRKAVLKASDVASLKHLLLNL 979 Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238 S EW KPVD + SAS +VT + SS+ GSRKQ KK +S +E + Sbjct: 980 ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSLLVSSNNGGSRKQSKKTLSVSE---SV 1036 Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418 R + WWRGGRLSRQ+FQ K+LPR LASKGG QAG +KIP+ILYPD S+ A+RSK+ Sbjct: 1037 
REPAAGSLFWWRGGRLSRQVFQWKILPRSLASKGGHQAGCKKIPNILYPDGSEFARRSKF 1096 Query: 1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 VAWR AVEMSQSVAQLI+Q K+FDSNI+W Sbjct: 1097 VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 1125 >ref|XP_008787592.1| PREDICTED: uncharacterized protein LOC103705599 [Phoenix dactylifera] gi|672128223|ref|XP_008787593.1| PREDICTED: uncharacterized protein LOC103705599 [Phoenix dactylifera] Length = 1634 Score = 393 bits (1009), Expect = e-106 Identities = 236/509 (46%), Positives = 298/509 (58%), Gaps = 8/509 (1%) Frame = +3 Query: 3 VNAISMCW-VPVDASYANNQS--MNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173 V+AIS W V V+AS ++N + ++H+ + D + H SE K + D P Sbjct: 640 VDAISSYWKVTVNASNSSNHGHEIPNVHE-VLDASVH--SEHLTLSKQEVSFDGIIEKAP 696 Query: 174 HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353 + +AS EP SD LN + + + +N PF+ S DE AAT S++ Sbjct: 697 KDYSASPGCSEPNRLSTSDLRQLNLMDSRQSAEINQPFAHSESADEMADAATCDPISRQI 756 Query: 354 DTDCPMGSIAPSKQVIPSKHTDLTVASENCIGLQGRGC----ITDRNRFGASELQSDPGN 521 DC P K+ I +L+V +E + L G G ITDR + S LQSDPG Sbjct: 757 YNDCSRNENVPDKEFISVNPVELSVDNEKYVELPGWGVGTSLITDRWKGADSRLQSDPGC 816 Query: 522 YINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNP 701 Y+NYYTFGRIA S+ ELMHKSSE+ N E KK +ED+ + QLKAIS KS + Q Sbjct: 817 YMNYYTFGRIAFSVAQELMHKSSESGNKESKKPVEDMMSQQLKAISKKSIRFCWCTNQKL 876 Query: 702 SMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPH 881 S+D QKE CGWC SC+T N +CLFK+ DK+LE SK + VG Sbjct: 877 SLDAQKEKCGWCHSCKTLNGSNCLFKIMDDKHLESSK----------------PRIVGLR 920 Query: 882 SEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXX 1061 SEK +SHI +AMHHILSIE LSGPWEKPH+S WRKAV+KASD Sbjct: 921 SEKKKKSHILSAMHHILSIEDRLRCFLSGPWEKPHYSNLWRKAVLKASDVASLKHLLLTL 980 Query: 1062 XXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTS 1238 S EW KPVD + SAS +VT V SS+ SRKQ KK++S +E + Sbjct: 981 ESNLRRVALSAEWLKPVDSVEIVGSASHVVTGSVLMSSNNGSSRKQSKKSLSVSE---SV 1037 Query: 1239 RRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKY 1418 R + WWRGGRLSRQ+F K+LPR 
LASKGGRQAG +KIP++LYPD S+ A+RSK+ Sbjct: 1038 RDPAAGSVFWWRGGRLSRQVFHWKILPRSLASKGGRQAGCKKIPNMLYPDGSEFARRSKF 1097 Query: 1419 VAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 VAWR AVEMSQSVAQLI+Q K+FDSNI+W Sbjct: 1098 VAWRAAVEMSQSVAQLIFQIKEFDSNIRW 1126 >ref|XP_009412876.1| PREDICTED: uncharacterized protein LOC103994272 [Musa acuminata subsp. malaccensis] Length = 1291 Score = 341 bits (875), Expect = 9e-91 Identities = 204/512 (39%), Positives = 286/512 (55%), Gaps = 11/512 (2%) Frame = +3 Query: 3 VNAISMCW-VPVDASYANNQSMNDI--HDDINDIATHMYSEPPLPPKLDGLSDCNAGSLP 173 VN IS W + +D+ + +QS ++I ++ D ++ S P D + + Sbjct: 637 VNTISAQWGISLDSHSSISQSCHEIINRNEALDSQLNLLSSDPNVVNDDIVKNSK----- 691 Query: 174 HENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQET 353 +N +SE +P ++ SD N V+ D M+ F S ++ HA +Q+T Sbjct: 692 -DNCTNSEHSDPISANASDLSQTNLVSLDHASGMSLLFVSSEPAEQLAHAVNYLQSTQQT 750 Query: 354 DTDCPMGSIAPSKQVIPSKHTDLTVASENC-----IGLQGRGCITDRNRFGAS--ELQSD 512 C + + P +VI T + ++++N L G I+++ + A +LQSD Sbjct: 751 TDSCSIATDNPVDEVISV--TPVVISTDNSKHFAITDLGGTSFISEQVQKKAETCKLQSD 808 Query: 513 PGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAY 692 P YINYY FGR+ASS+ +LM KSSE+ N E KKS ED+ QLKAI + K Y++ Sbjct: 809 PCGYINYYIFGRVASSVAEDLMIKSSESNNKEPKKSDEDMVVAQLKAIFKRCPKLSSYSF 868 Query: 693 QNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTV 872 S+D+QKE CGWC SC+TS+ DC F V K++E+ +S V Sbjct: 869 LQQSLDIQKEKCGWCHSCKTSSSSDCAFVVND-----------------KHIEDMKSDAV 911 Query: 873 GPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXX 1052 G SEK +SHI + MH ILSIE LLSGPW+ PH+S WRKAVMKASD Sbjct: 912 GLDSEKKKKSHIVSVMHDILSIEDHLNGLLSGPWDNPHYSSLWRKAVMKASDVASLKHML 971 Query: 1053 XXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFS 1229 +W KPVD A + SAS I+ + S+ GSRKQGK+ S +EF+ Sbjct: 972 LLLESNLRRVAMLSDWMKPVDFAHTVGSASHILIGSMDAFSNCGGSRKQGKRTTSGSEFN 1031 Query: 1230 FTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKR 1409 S+ A S +CWWRGGRLSR++F KMLPR L SKGGRQAG +KI ++ YPD + A+R Sbjct: 1032 
I-SQAAAASYVCWWRGGRLSRRVFHWKMLPRSLTSKGGRQAGCKKISNVFYPDVPEFARR 1090 Query: 1410 SKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 +K++ WR AVEMS++VAQL + K+FDSNI+W Sbjct: 1091 NKFITWRAAVEMSETVAQLAFLTKEFDSNIRW 1122 >ref|XP_010273302.1| PREDICTED: uncharacterized protein LOC104608880 [Nelumbo nucifera] Length = 1956 Score = 283 bits (724), Expect = 3e-73 Identities = 185/481 (38%), Positives = 248/481 (51%), Gaps = 36/481 (7%) Frame = +3 Query: 171 PHENAASSEQCEPRASQISDAV-HLNSVTADELVAMNCPFSCSALLDEKVHAATSSHPSQ 347 P+E + SE ++S+ISD++ LNS T ++ + M P + S + SQ Sbjct: 830 PNEGSVISEGLAHQSSKISDSISRLNSATVNQFMEMASPLASSEGSADISQVNAGKQTSQ 889 Query: 348 ETDTDCPMGSIAPSKQVIPSK-----------HTDLTVASEN-CIG-----------LQG 458 + DC I + IP K DL V E IG Q Sbjct: 890 KNGADCSNKLIQSADSEIPVKLQSAIGEDLPNPADLGVKQEEGFIGEQLSKPADLNDKQE 949 Query: 459 RGC----------ITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNE 608 +G + + R S +Q + G+Y+N Y F + A+S+ EL+HKSSE N + Sbjct: 950 KGLAPAVPIHTSPVNNTKRVVPSPMQFESGSYVNCYIFAQTAASVAEELLHKSSERINED 1009 Query: 609 LKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTSNDF-DCLFKVA 785 S+++I + QLK IS KSTK QN D+QKENCGWCFSC+ D +CLF + Sbjct: 1010 PNSSVDEIVSAQLKVISKKSTKLCWSNIQNLYKDLQKENCGWCFSCKNPTDSGNCLFNMF 1069 Query: 786 GDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLS 965 K+ E +S VG HS+KN ++H+ +HHILSIE LLS Sbjct: 1070 NKKHPP---------------EGPKSGAVGLHSKKNRKNHLFDVIHHILSIEHRLSGLLS 1114 Query: 966 GPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS- 1142 GPW+ P +S WRK+V+KASD S EW K VD + SAS Sbjct: 1115 GPWQNPLYSMQWRKSVLKASDIASVKRLLLILESSLRRIALSEEWLKQVDSVFTMGSASH 1174 Query: 1143 IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPR 1322 ++T+ V S + RK+G+ S + SF+S A S I WWRGGRLSRQ++ LP Sbjct: 1175 VLTTSVNLPSKHGIGRKRGR--FSDADSSFSSNTAG-SGIFWWRGGRLSRQVYHWMFLPH 1231 Query: 1323 PLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIK 1502 LA K GRQAG KIP ILYPD S+LAKRSKY+AWR A+EM SV QL +Q ++ DSNI+ Sbjct: 1232 TLAYKAGRQAGCIKIPGILYPDGSELAKRSKYIAWRAALEMCISVPQLAFQVRELDSNIR 1291 
Query: 1503 W 1505 W Sbjct: 1292 W 1292 >ref|XP_004958593.1| PREDICTED: uncharacterized protein LOC101777112 [Setaria italica] gi|944262782|gb|KQL27039.1| hypothetical protein SETIT_028659mg [Setaria italica] Length = 1696 Score = 261 bits (666), Expect = 1e-66 Identities = 145/336 (43%), Positives = 188/336 (55%), Gaps = 1/336 (0%) Frame = +3 Query: 501 LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680 L SD YINYY+FG+IA+S EL HK SE N E KK ++D + L+ I K Sbjct: 700 LHSDLARYINYYSFGQIAASAAEELKHKLSE--NKEGKKPVQDALSFHLRTICKKYANIF 757 Query: 681 RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860 Q S+++ KE CGWC SC+ S DC+F+V K +EG+K Sbjct: 758 ALTDQKLSVELLKEKCGWCNSCQISGGVDCIFRVTDVKCMEGTK---------------- 801 Query: 861 SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040 +G +EKN ESHI AMH+ILSIE LL+GPW+ P + +WRK V+KA+D Sbjct: 802 PHALGVEAEKNMESHIILAMHNILSIEERLNGLLTGPWQNPQYRIYWRKEVLKAADVSSL 861 Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217 S+EW KP D + SA+ I+ +S S+ +RK G+K S Sbjct: 862 KQPLLMLESSLRRVAISMEWQKPADSVEVVGSAAHILVRSSNKSLSHGTARKPGRKPSSN 921 Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397 E SR + WWRGG+LSRQ+F K LP+ L K RQAGRRKIP ILY D S Sbjct: 922 GELKVDSRNV---GVYWWRGGKLSRQVFHWKRLPQSLVYKAARQAGRRKIPTILYTDGSQ 978 Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 A+R KY+AWR AVEM+++VAQLI Q K+ + NIKW Sbjct: 979 FARRFKYIAWRAAVEMAENVAQLILQIKELEWNIKW 1014 >ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|590584387|ref|XP_007015164.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785526|gb|EOY32782.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785527|gb|EOY32783.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] Length = 1859 Score = 256 bits (654), Expect = 4e-65 Identities = 181/520 (34%), Positives = 252/520 
(48%), Gaps = 19/520 (3%) Frame = +3 Query: 3 VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161 + AI W D + +N + +++ D +N + T M + P LPP G + Sbjct: 762 LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817 Query: 162 GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341 + ++ + + V ++ D + P+ S E + + H Sbjct: 818 NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877 Query: 342 SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491 Q+ GS S Q +P K ++L S GL Q C + R Sbjct: 878 FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931 Query: 492 ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671 AS+ Q G Y+NYY+F + AS + ELM K SE N + KS+E+I +Q+K I KS Sbjct: 932 ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990 Query: 672 KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848 + N +D +KENCGWCF CR +D DCLFK+ E SK Sbjct: 991 RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038 Query: 849 EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028 S+ VG S+ N + H+ + H SIE LLSGPW P + + W K+++KASD Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094 Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205 S EW K VD A + SAS +VT+ R S+ + +RK+G+ Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154 Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385 N E + TS A ICWWRGGR+SRQLF K+LPR LASK RQ G +KIP ILYP Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212 Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 +SSD A+RSK +AWR AVE S S+ QL Q ++ DSNI+W Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252 >ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, putative isoform 2 [Theobroma cacao] gi|508785525|gb|EOY32781.1| DNA binding,zinc ion binding,DNA binding, putative isoform 2 [Theobroma cacao] Length = 1647 Score = 256 bits (654), 
Expect = 4e-65 Identities = 181/520 (34%), Positives = 252/520 (48%), Gaps = 19/520 (3%) Frame = +3 Query: 3 VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161 + AI W D + +N + +++ D +N + T M + P LPP G + Sbjct: 762 LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817 Query: 162 GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341 + ++ + + V ++ D + P+ S E + + H Sbjct: 818 NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877 Query: 342 SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491 Q+ GS S Q +P K ++L S GL Q C + R Sbjct: 878 FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931 Query: 492 ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671 AS+ Q G Y+NYY+F + AS + ELM K SE N + KS+E+I +Q+K I KS Sbjct: 932 ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990 Query: 672 KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848 + N +D +KENCGWCF CR +D DCLFK+ E SK Sbjct: 991 RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038 Query: 849 EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028 S+ VG S+ N + H+ + H SIE LLSGPW P + + W K+++KASD Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094 Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205 S EW K VD A + SAS +VT+ R S+ + +RK+G+ Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154 Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385 N E + TS A ICWWRGGR+SRQLF K+LPR LASK RQ G +KIP ILYP Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212 Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 +SSD A+RSK +AWR AVE S S+ QL Q ++ DSNI+W Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252 >ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1 [Theobroma cacao] gi|508785524|gb|EOY32780.1| DNA binding,zinc ion binding,DNA binding, putative isoform 
1 [Theobroma cacao] Length = 1931 Score = 256 bits (654), Expect = 4e-65 Identities = 181/520 (34%), Positives = 252/520 (48%), Gaps = 19/520 (3%) Frame = +3 Query: 3 VNAISMCWVPVDASYANNQSMNDIHDDINDIA--THMYSEPP-----LPPKLDGLSDCNA 161 + AI W D + +N + +++ D +N + T M + P LPP G + Sbjct: 762 LKAIHKQW---DVAVGSNGASSNL-DSLNSVCSETLMKGQIPTASTVLPPLASGETSAIK 817 Query: 162 GSLPHENAASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALLDEKVHAATSSHP 341 + ++ + + V ++ D + P+ S E + + H Sbjct: 818 NETVDDGKQEDKEVAGNSGHLDVEVTESANLLDSVAGTEIPYISSEGSAETMQMGSVIHN 877 Query: 342 SQETDTDCPMGSIAPSKQV-IPSKHTDLTVASENCIGL---------QGRGCITDRNRFG 491 Q+ GS S Q +P K ++L S GL Q C + R Sbjct: 878 FQK------QGSAEFSNQSEVPGKSSNLEDCSLISKGLYQESKIKLAQQTLCAINAKRGD 931 Query: 492 ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKST 671 AS+ Q G Y+NYY+F + AS + ELM K SE N + KS+E+I +Q+K I KS Sbjct: 932 ASQTQPGTG-YLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMKVILKKSN 990 Query: 672 KCLRYAYQNPSMDVQKENCGWCFSCR-TSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNL 848 + N +D +KENCGWCF CR +D DCLFK+ E SK Sbjct: 991 RFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQEVSK------------ 1038 Query: 849 EESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASD 1028 S+ VG S+ N + H+ + H SIE LLSGPW P + + W K+++KASD Sbjct: 1039 ----SEMVGLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASD 1094 Query: 1029 XXXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKK 1205 S EW K VD A + SAS +VT+ R S+ + +RK+G+ Sbjct: 1095 VASLKHFLLMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRS 1154 Query: 1206 NISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYP 1385 N E + TS A ICWWRGGR+SRQLF K+LPR LASK RQ G +KIP ILYP Sbjct: 1155 N--DGESNPTSNPAAGPSICWWRGGRVSRQLFNWKVLPRSLASKAARQGGGKKIPGILYP 1212 Query: 1386 DSSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 +SSD A+RSK +AWR AVE S S+ QL Q ++ DSNI+W Sbjct: 1213 ESSDFARRSKSMAWRAAVESSTSIEQLALQVRELDSNIRW 1252 >ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor] gi|241926713|gb|EER99857.1| hypothetical 
protein SORBIDRAFT_02g042000 [Sorghum bicolor] Length = 1688 Score = 255 bits (651), Expect = 8e-65 Identities = 140/337 (41%), Positives = 186/337 (55%), Gaps = 1/337 (0%) Frame = +3 Query: 498 ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677 +L SDP YINYY+FG+IA++ EL HK SE K+ KK ++D+ + L+ I K Sbjct: 661 QLHSDPARYINYYSFGQIAANAAEELKHKLSENKDG--KKPVQDVLSFHLRTICKKYANI 718 Query: 678 LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857 Q S ++ KE CGWC SC+ S DC+F+V K +EG K Sbjct: 719 FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRVTDIKYMEGPK--------------- 763 Query: 858 RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037 T+ +E N +SHI AMH+ILSIE LLSGPW+ P +S WR+ V+KASD Sbjct: 764 -PHTLDLRAESNMDSHIILAMHNILSIEERLNGLLSGPWQNPQYSICWRETVLKASDVSS 822 Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIS 1214 + EW KP D + SA+ I+ +S S+ +RK G+K Sbjct: 823 LKKPLLTLESSLRRVAITAEWQKPADSVEVVGSAAHILVRSSNKSLSHGSARKPGRKPSP 882 Query: 1215 TNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSS 1394 E SR + WWRGG+LSRQ+F K LP+ L +K RQAGRRKIP ILY D S Sbjct: 883 NGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQTLVNKAARQAGRRKIPTILYTDGS 939 Query: 1395 DLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 A+R KY+AW+ AVEM+++ AQLI Q K+ + NIKW Sbjct: 940 QFARRFKYIAWQAAVEMAENAAQLILQIKELEWNIKW 976 >gb|EMT20858.1| hypothetical protein F775_52258 [Aegilops tauschii] Length = 1851 Score = 253 bits (645), Expect = 4e-64 Identities = 146/336 (43%), Positives = 185/336 (55%), Gaps = 1/336 (0%) Frame = +3 Query: 501 LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680 L+S YINYY+FG+IA+S EL HK SE N E KK D + +LK I K Sbjct: 632 LRSGNAMYINYYSFGQIAASAAEELKHKLSE--NEEGKKHGPDAVSFRLKTICKKYVNVF 689 Query: 681 RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860 Q S+++ KE CGWC SC+ S DC+F+ K +E K Sbjct: 690 ALTDQKLSVELLKEKCGWCNSCQISGGSDCIFRFTDVKCMESPK---------------- 733 Query: 861 SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040 VGP SEKN ESHI A H +LSIE 
LLSGPW+ P +S +WRKAV+ ASD Sbjct: 734 PCAVGPLSEKNKESHIVLATHSMLSIEKRLNGLLSGPWQNPQYSMYWRKAVLMASDVSSL 793 Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217 S EW KP D + SA+ I+ +S+ Y +RK G+K ++ Sbjct: 794 KQPLLTLESSLRRVAFSGEWQKPADSVEVVGSAAHILVRTSNKSAGYAIARKPGRKPLAI 853 Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397 E R + WWRGG LSRQ+F K LP+ LA K RQAGR+KIP I+YPD S Sbjct: 854 -ELKVDFRDV---GVYWWRGGTLSRQVFHWKRLPQSLACKSARQAGRKKIPTIVYPDGSQ 909 Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 A+RSKY+AWR AVEM+Q+V+QLI Q K+ + NIKW Sbjct: 910 FARRSKYIAWRAAVEMAQNVSQLILQIKELELNIKW 945 >gb|KQK14817.1| hypothetical protein BRADI_1g18770 [Brachypodium distachyon] gi|944079466|gb|KQK14818.1| hypothetical protein BRADI_1g18770 [Brachypodium distachyon] Length = 1723 Score = 249 bits (637), Expect = 3e-63 Identities = 143/336 (42%), Positives = 188/336 (55%), Gaps = 1/336 (0%) Frame = +3 Query: 501 LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680 L SDP YINYY+FG+IA+S EL HK SE N E KK +D + +LK I K Sbjct: 663 LHSDPTRYINYYSFGQIAASAARELKHKLSE--NEEGKKHGQDAVSFRLKTICKKYVNVF 720 Query: 681 RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860 Q S+++ KE CGWC SC+ S+ DC+F+V ++ + Sbjct: 721 ALTDQKLSVELLKEKCGWCNSCQISSGTDCIFRV---------------------VDGLK 759 Query: 861 SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040 +G SEKN ESHI AMH+ILSIE LLSGPW+ P +S +WRKAV++ASD Sbjct: 760 PCNLGLLSEKNKESHIVLAMHNILSIEERLNGLLSGPWQNPQYSIYWRKAVLRASDLSSL 819 Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217 +W KP D + SA+ I+ +S SY +RK G+K S Sbjct: 820 KQPLLMLESSLRRVAFFGDWQKPADSVEVVGSAAHILVRSSNKSKSYASARKPGRKP-SI 878 Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397 +E S + WWRGG LSRQ+F K LP+ LAS+ RQAGR+KI I+YP+ S Sbjct: 879 DELKVDSPDV---GVYWWRGGTLSRQVFHWKRLPQSLASRAARQAGRKKISTIVYPEGSQ 935 Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 A+R KY+AWR AVEM+Q+V+QLI Q K+ + 
NIKW Sbjct: 936 FARRLKYIAWRAAVEMAQNVSQLILQIKELELNIKW 971 >ref|XP_010234427.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium distachyon] Length = 1748 Score = 249 bits (637), Expect = 3e-63 Identities = 143/336 (42%), Positives = 188/336 (55%), Gaps = 1/336 (0%) Frame = +3 Query: 501 LQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKCL 680 L SDP YINYY+FG+IA+S EL HK SE N E KK +D + +LK I K Sbjct: 688 LHSDPTRYINYYSFGQIAASAARELKHKLSE--NEEGKKHGQDAVSFRLKTICKKYVNVF 745 Query: 681 RYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESR 860 Q S+++ KE CGWC SC+ S+ DC+F+V ++ + Sbjct: 746 ALTDQKLSVELLKEKCGWCNSCQISSGTDCIFRV---------------------VDGLK 784 Query: 861 SQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXX 1040 +G SEKN ESHI AMH+ILSIE LLSGPW+ P +S +WRKAV++ASD Sbjct: 785 PCNLGLLSEKNKESHIVLAMHNILSIEERLNGLLSGPWQNPQYSIYWRKAVLRASDLSSL 844 Query: 1041 XXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGSRKQGKKNIST 1217 +W KP D + SA+ I+ +S SY +RK G+K S Sbjct: 845 KQPLLMLESSLRRVAFFGDWQKPADSVEVVGSAAHILVRSSNKSKSYASARKPGRKP-SI 903 Query: 1218 NEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDSSD 1397 +E S + WWRGG LSRQ+F K LP+ LAS+ RQAGR+KI I+YP+ S Sbjct: 904 DELKVDSPDV---GVYWWRGGTLSRQVFHWKRLPQSLASRAARQAGRKKISTIVYPEGSQ 960 Query: 1398 LAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 A+R KY+AWR AVEM+Q+V+QLI Q K+ + NIKW Sbjct: 961 FARRLKYIAWRAAVEMAQNVSQLILQIKELELNIKW 996 >ref|XP_008653439.1| PREDICTED: uncharacterized protein LOC103633535 [Zea mays] gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays] gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays] Length = 1712 Score = 248 bits (633), Expect = 1e-62 Identities = 139/339 (41%), Positives = 186/339 (54%), Gaps = 3/339 (0%) Frame = +3 Query: 498 ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677 +L SDP YINYY+FG+IA+S EL HK SE N ++KK ++D+ + L+ I K Sbjct: 688 QLHSDPARYINYYSFGQIAASAAEELKHKLSE--NKDVKKPVQDVLSFHLRTICKKYANF 745 Query: 
678 LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857 Q S ++ KE CGWC SC+ S DC+F++ K +EG K Sbjct: 746 FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRLTDIKYMEGPK--------------- 790 Query: 858 RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037 T+ +E N ESHI AM++ILS+E LLSGPW+ P +S WR AV+KASD Sbjct: 791 -PHTLDLGAENNMESHIILAMYNILSVEERLNGLLSGPWQNPQYSICWRNAVLKASDVSS 849 Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGS--RKQGKKN 1208 + EW K D + SA+ I+ +S S+V + RK G+K Sbjct: 850 LKQPLLMLESSLRRVAITTEWQKAADSVEVVGSAAHILVRSSNKSLSHVSATARKPGRKP 909 Query: 1209 ISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPD 1388 E SR + WWRGG+LSRQ+F K LP+ L +K RQAGRR+IP I Y D Sbjct: 910 SPNGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQSLVNKAARQAGRRRIPTISYTD 966 Query: 1389 SSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 S A+R KY+AWR AVEM+++ AQLI Q K+ + NIKW Sbjct: 967 GSQFARRFKYIAWRAAVEMAENAAQLILQIKELEWNIKW 1005 >tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays] Length = 1679 Score = 248 bits (633), Expect = 1e-62 Identities = 139/339 (41%), Positives = 186/339 (54%), Gaps = 3/339 (0%) Frame = +3 Query: 498 ELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTKC 677 +L SDP YINYY+FG+IA+S EL HK SE N ++KK ++D+ + L+ I K Sbjct: 655 QLHSDPARYINYYSFGQIAASAAEELKHKLSE--NKDVKKPVQDVLSFHLRTICKKYANF 712 Query: 678 LRYAYQNPSMDVQKENCGWCFSCRTSNDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLEES 857 Q S ++ KE CGWC SC+ S DC+F++ K +EG K Sbjct: 713 FALTDQKLSAELLKEKCGWCNSCQISGGVDCIFRLTDIKYMEGPK--------------- 757 Query: 858 RSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDXXX 1037 T+ +E N ESHI AM++ILS+E LLSGPW+ P +S WR AV+KASD Sbjct: 758 -PHTLDLGAENNMESHIILAMYNILSVEERLNGLLSGPWQNPQYSICWRNAVLKASDVSS 816 Query: 1038 XXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSAS-IVTSPVRRSSSYVGS--RKQGKKN 1208 + EW K D + SA+ I+ +S S+V + RK G+K Sbjct: 817 LKQPLLMLESSLRRVAITTEWQKAADSVEVVGSAAHILVRSSNKSLSHVSATARKPGRKP 876 Query: 1209 ISTNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPD 1388 E SR + 
WWRGG+LSRQ+F K LP+ L +K RQAGRR+IP I Y D Sbjct: 877 SPNGELKVDSRDV---GVYWWRGGKLSRQVFHWKRLPQSLVNKAARQAGRRRIPTISYTD 933 Query: 1389 SSDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505 S A+R KY+AWR AVEM+++ AQLI Q K+ + NIKW Sbjct: 934 GSQFARRFKYIAWRAAVEMAENAAQLILQIKELEWNIKW 972 >gb|KHG13465.1| Nucleosome-remodeling factor subunit [Gossypium arboreum] Length = 867 Score = 246 bits (627), Expect = 5e-62 Identities = 146/369 (39%), Positives = 201/369 (54%), Gaps = 2/369 (0%) Frame = +3 Query: 405 SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584 S D S+ Q C+ + R AS+LQ G Y+N+Y+F + AS + EL+ K Sbjct: 112 SNDLDARQESKTKFASQQTPCVLNVKRRDASQLQPGTG-YVNHYSFAQTASLVVEELLRK 170 Query: 585 SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761 SE N++ KSLE+I Q+K I KS + N +D +KENCGWCFSCR +D Sbjct: 171 PSEKTNDDSLKSLEEIIGNQMKVILKKSNRFRWPDIYNLYVDARKENCGWCFSCRYPVDD 230 Query: 762 FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941 DCLF++ G + E S+S+ + N + H+ ++HI SIE Sbjct: 231 TDCLFRITS-------------GCVP---EVSKSEMLDLQLRWNKKGHVIDVIYHIFSIE 274 Query: 942 GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121 LLSGPW P + + W K+++ AS S +W K VD A Sbjct: 275 NRLSGLLSGPWLNPQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 334 Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298 + SAS +V + R S+ + +RK+G N + NE + TS A + ICWWRGGR+SRQL Sbjct: 335 VIMGSASHVVIASSRGSAKHGIARKRG--NCNDNESNPTSNPAVGASICWWRGGRVSRQL 392 Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478 F K+LP L SK RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q Sbjct: 393 FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 452 Query: 1479 KDFDSNIKW 1505 ++ DSNI+W Sbjct: 453 RELDSNIRW 461 >gb|KDO50419.1| hypothetical protein CISIN_1g000462mg [Citrus sinensis] Length = 1306 Score = 242 bits (618), Expect = 6e-61 Identities = 176/518 (33%), Positives = 254/518 (49%), Gaps = 17/518 (3%) Frame = +3 Query: 3 
VNAISMCWVPVDASYANNQSMNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLPHEN 182
            +NAI W D + ++N +++ + ++ HM +E P ++D N L EN
Sbjct: 449  INAICKQW---DITVSSNGVRSNLALNTVSLSRHMKAEVPTISEID-----NEQKL-EEN 499

Query: 183  AASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALL-------------DEKVHA 323
            + P + A L+SVTA EL ++ S D + A
Sbjct: 500  FLAGYSNRPDNALSKSANLLDSVTAMELPNISSEGSAETTQMNSGFDNFQKEGPDNSIRA 559

Query: 324  ATSSHPSQETDTDCPMGSI-APSKQVIPSKHTDLT--VASENCIGLQGRGCITDRNRFGA 494
            A S+ S+ G + AP + S +D+ AS C T+ + A
Sbjct: 560  AEFSNQSEIA------GKLPAPGHNSMTSSTSDIKQKFASSGC-----NSSPTNSRKGDA 608

Query: 495  ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTK 674
            +LQ + Y+N Y+F + ASS+ ELMHKSS + E S E+I + Q+KAI K K
Sbjct: 609  LQLQPEIA-YMNRYSFAQTASSVAEELMHKSSNEISKEPINSNEEIISKQMKAILKKWDK 667

Query: 675  CLRYAYQNPSMDVQKENCGWCFSCRTS-NDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLE 851
            Q + D QKE CGWCFSC+++ +D DCLF + + L S+
Sbjct: 668  FYWPNTQKLNADTQKEKCGWCFSCKSATDDMDCLFYMNNGRVLGSSE------------- 714

Query: 852  ESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDX 1031
            S+ G S++N + H+ + HILSIE LL GPW PH+++ WRK+ +KA+D
Sbjct: 715  ---SEVAGLLSKRNKKGHLVDVICHILSIEDRLLGLLLGPWLNPHYTKLWRKSALKAADM 771

Query: 1032 XXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSASIVTSPVRRSSSYVGSRKQGKKNI 1211
            S EW K VD + SAS + R++S G+ ++ ++
Sbjct: 772  ASVKHLLLTLEANLQHLALSAEWFKHVDPVVTVGSASHIVIASSRANSKAGAGRKKARDF 831

Query: 1212 STNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDS 1391
            N +++ A +CWWRGGRLS QLF K LPR L SK RQAG KIP ILYP++
Sbjct: 832  DGNP---STKAAGGLSLCWWRGGRLSCQLFSWKRLPRSLVSKAARQAGCMKIPGILYPEN 888

Query: 1392 SDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            SD A+RS+ VAWR AVE S SV QL Q ++FDSN++W
Sbjct: 889  SDFARRSRTVAWRAAVESSTSVEQLAIQVREFDSNVRW 926

>gb|KDO50418.1| hypothetical protein CISIN_1g000462mg [Citrus sinensis]
          Length = 1482

 Score = 242 bits (618), Expect = 6e-61
 Identities = 176/518 (33%), Positives = 254/518 (49%), Gaps = 17/518 (3%)
 Frame = +3

Query: 3    VNAISMCWVPVDASYANNQSMNDIHDDINDIATHMYSEPPLPPKLDGLSDCNAGSLPHEN 182
            +NAI W D + ++N +++ + ++ HM +E P ++D N L EN
Sbjct: 449  INAICKQW---DITVSSNGVRSNLALNTVSLSRHMKAEVPTISEID-----NEQKL-EEN 499

Query: 183  AASSEQCEPRASQISDAVHLNSVTADELVAMNCPFSCSALL-------------DEKVHA 323
            + P + A L+SVTA EL ++ S D + A
Sbjct: 500  FLAGYSNRPDNALSKSANLLDSVTAMELPNISSEGSAETTQMNSGFDNFQKEGPDNSIRA 559

Query: 324  ATSSHPSQETDTDCPMGSI-APSKQVIPSKHTDLT--VASENCIGLQGRGCITDRNRFGA 494
            A S+ S+ G + AP + S +D+ AS C T+ + A
Sbjct: 560  AEFSNQSEIA------GKLPAPGHNSMTSSTSDIKQKFASSGC-----NSSPTNSRKGDA 608

Query: 495  ASELQSDPGNYINYYTFGRIASSIYAELMHKSSEAKNNELKKSLEDIKTVQLKAISNKSTK 674
            +LQ + Y+N Y+F + ASS+ ELMHKSS + E S E+I + Q+KAI K K
Sbjct: 609  LQLQPEIA-YMNRYSFAQTASSVAEELMHKSSNEISKEPINSNEEIISKQMKAILKKWDK 667

Query: 675  CLRYAYQNPSMDVQKENCGWCFSCRTS-NDFDCLFKVAGDKNLEGSKNKDRKGSLAKNLE 851
            Q + D QKE CGWCFSC+++ +D DCLF + + L S+
Sbjct: 668  FYWPNTQKLNADTQKEKCGWCFSCKSATDDMDCLFYMNNGRVLGSSE------------- 714

Query: 852  ESRSQTVGPHSEKNTESHITTAMHHILSIEGCAGSLLSGPWEKPHHSQHWRKAVMKASDX 1031
            S+ G S++N + H+ + HILSIE LL GPW PH+++ WRK+ +KA+D
Sbjct: 715  ---SEVAGLLSKRNKKGHLVDVICHILSIEDRLLGLLLGPWLNPHYTKLWRKSALKAADM 771

Query: 1032 XXXXXXXXXXXXXXXXXXXSVEWTKPVDDARAIVSASIVTSPVRRSSSYVGSRKQGKKNI 1211
            S EW K VD + SAS + R++S G+ ++ ++
Sbjct: 772  ASVKHLLLTLEANLQHLALSAEWFKHVDPVVTVGSASHIVIASSRANSKAGAGRKKARDF 831

Query: 1212 STNEFSFTSRRATISDICWWRGGRLSRQLFQCKMLPRPLASKGGRQAGRRKIPHILYPDS 1391
            N +++ A +CWWRGGRLS QLF K LPR L SK RQAG KIP ILYP++
Sbjct: 832  DGNP---STKAAGGLSLCWWRGGRLSCQLFSWKRLPRSLVSKAARQAGCMKIPGILYPEN 888

Query: 1392 SDLAKRSKYVAWRVAVEMSQSVAQLIYQAKDFDSNIKW 1505
            SD A+RS+ VAWR AVE S SV QL Q ++FDSN++W
Sbjct: 889  SDFARRSRTVAWRAAVESSTSVEQLAIQVREFDSNVRW 926

>gb|KJB09356.1| hypothetical protein B456_001G136300 [Gossypium raimondii]
          Length = 1620

 Score = 241 bits (614), Expect = 2e-60
 Identities = 145/369 (39%), Positives = 199/369 (53%), Gaps = 2/369 (0%)
 Frame = +3

Query: 405  SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584
            S D S+ + Q + + R AS+L G Y+N+Y+F + AS + EL+HK
Sbjct: 865  SNDLDARQESKTKLASQQTPRVLNAKRGDASQLLPGTG-YVNHYSFAQTASLVVEELLHK 923

Query: 585  SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761
            SE N++ KSLE+I +Q+K I KS + N +D +KENCGWCFSCR +D
Sbjct: 924  PSEKTNDDSLKSLEEIIGIQMKVILKKSNRLHWPDIHNLYVDARKENCGWCFSCRYPVDD 983

Query: 762  FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941
            DCLF++ G + E S+S+ V S N + H+ ++HI SIE
Sbjct: 984  TDCLFRITS-------------GCVP---EVSKSEMVDLQSRWNKKGHVIDVIYHIFSIE 1027

Query: 942  GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121
            LLSGPW + + W K+++ AS S +W K VD A
Sbjct: 1028 NRLSGLLSGPWLNLQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 1087

Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298
            + SAS +V + R S+ + +RK+G N NE + TS A ICWWRGGR+SRQL
Sbjct: 1088 VIMGSASHVVIASSRGSAKHGIARKRGSCN--DNESNPTSNPAVGPSICWWRGGRVSRQL 1145

Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478
            F K+LP L SK RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q
Sbjct: 1146 FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 1205

Query: 1479 KDFDSNIKW 1505
            ++ SNI+W
Sbjct: 1206 RELGSNIRW 1214

>gb|KJB09354.1| hypothetical protein B456_001G136300 [Gossypium raimondii]
          Length = 1653

 Score = 241 bits (614), Expect = 2e-60
 Identities = 145/369 (39%), Positives = 199/369 (53%), Gaps = 2/369 (0%)
 Frame = +3

Query: 405  SKHTDLTVASENCIGLQGRGCITDRNRFGASELQSDPGNYINYYTFGRIASSIYAELMHK 584
            S D S+ + Q + + R AS+L G Y+N+Y+F + AS + EL+HK
Sbjct: 898  SNDLDARQESKTKLASQQTPRVLNAKRGDASQLLPGTG-YVNHYSFAQTASLVVEELLHK 956

Query: 585  SSEAKNNELKKSLEDIKTVQLKAISNKSTKCLRYAYQNPSMDVQKENCGWCFSCRTS-ND 761
            SE N++ KSLE+I +Q+K I KS + N +D +KENCGWCFSCR +D
Sbjct: 957  PSEKTNDDSLKSLEEIIGIQMKVILKKSNRLHWPDIHNLYVDARKENCGWCFSCRYPVDD 1016

Query: 762  FDCLFKVAGDKNLEGSKNKDRKGSLAKNLEESRSQTVGPHSEKNTESHITTAMHHILSIE 941
            DCLF++ G + E S+S+ V S N + H+ ++HI SIE
Sbjct: 1017 TDCLFRITS-------------GCVP---EVSKSEMVDLQSRWNKKGHVIDVIYHIFSIE 1060

Query: 942  GCAGSLLSGPWEKPHHSQHWRKAVMKASDXXXXXXXXXXXXXXXXXXXXSVEWTKPVDDA 1121
            LLSGPW + + W K+++ AS S +W K VD A
Sbjct: 1061 NRLSGLLSGPWLNLQYMKIWHKSILNASGIASVKHLLLTLEANLHHLALSTDWMKHVDSA 1120

Query: 1122 RAIVSAS-IVTSPVRRSSSYVGSRKQGKKNISTNEFSFTSRRATISDICWWRGGRLSRQL 1298
            + SAS +V + R S+ + +RK+G N NE + TS A ICWWRGGR+SRQL
Sbjct: 1121 VIMGSASHVVIASSRGSAKHGIARKRGSCN--DNESNPTSNPAVGPSICWWRGGRVSRQL 1178

Query: 1299 FQCKMLPRPLASKGGRQAGRRKIPHILYPDSSDLAKRSKYVAWRVAVEMSQSVAQLIYQA 1478
            F K+LP L SK RQ G +KIP ILYP+SSD AKRS+ +AWR AVE S S+ QL +Q
Sbjct: 1179 FNWKVLPCSLVSKAARQGGGKKIPGILYPESSDFAKRSRSIAWRAAVESSTSIEQLAFQV 1238

Query: 1479 KDFDSNIKW 1505
            ++ SNI+W
Sbjct: 1239 RELGSNIRW 1247