BLASTX nr result
ID: Sinomenium21_contig00009611
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00009611 (2467 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248... 507 e-140 emb|CBI32817.3| unnamed protein product [Vitis vinifera] 475 e-131 ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutr... 441 e-121 ref|XP_002526200.1| transcription factor hy5, putative [Ricinus ... 440 e-120 ref|XP_006430509.1| hypothetical protein CICLE_v10011169mg [Citr... 439 e-120 ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thalia... 439 e-120 gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding... 438 e-120 ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Caps... 437 e-119 gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1... 431 e-117 ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629... 428 e-117 ref|XP_002323223.2| bZIP transcription factor family protein [Po... 424 e-116 ref|XP_002881751.1| bZIP transcription factor family protein [Ar... 424 e-116 ref|XP_007028261.1| Transcription factor hy5, putative [Theobrom... 424 e-115 ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127... 423 e-115 ref|XP_003521109.2| PREDICTED: uncharacterized protein LOC100101... 418 e-114 ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504... 416 e-113 ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Popu... 412 e-112 ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299... 410 e-111 gb|AGO05994.1| bZIP transcription factor family protein 10 [Came... 409 e-111 ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, par... 407 e-111 >ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248184 [Vitis vinifera] Length = 768 Score = 507 bits (1305), Expect = e-140 Identities = 351/776 (45%), Positives = 433/776 (55%), Gaps = 71/776 (9%) Frame = +3 Query: 129 SSNGDLTADFDDLQFPPLDVDYLS---NDLMIPEGLMEELGFDP-DFEFSLDNLSFPPEN 296 S N + +AD + L PPLD D+ S ND + E M +LG D DF+F+ D+L FP E+ Sbjct: 10 SPNPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSES 69 Query: 297 EGF------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSC---------- 428 E F EGS G S S DV+ L+ PSPESG+C Sbjct: 70 EDFLADFPLPEEGSGGHDSA----------DRSFDVSKVLNSPSPESGNCGVESSLPCQV 119 Query: 429 ----------------DREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQ--SGPAS 554 D++ P PV+SQ S+ N PSP+SG + SGP S Sbjct: 120 SGDRNSDVSSIELGCCDQKLSP-PVASQSSSDQNLDGARVLNVPSPESGSCDRGFSGPES 178 Query: 555 DVRSG-------AVVVEDDEQKVKLEEGGXXXXXXXXXXXXXXSFSDNARSCKFRWA-IQ 710 SG V +QKVKLE+ G + +RS KFR + I Sbjct: 179 SQGSGNGGSGVPGAVNCVVDQKVKLEDSGKNSVPKRKKEQDDST--TESRSSKFRRSSIC 236 Query: 711 SENANSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFF 890 SE AN+ DE++K+KARL RNRESAQLSRQRKKHYVEELE+K+RSMHS I DL GKIS Sbjct: 237 SETANASNDEEEKKKARLMRNRESAQLSRQRKKHYVEELEEKIRSMHSTIQDLTGKISII 296 Query: 891 MAENVSLRQQLSSGAVC---XXXXXXXXXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKL 1061 MAEN +LRQQ G +C Y +KP+GSQVPLVPIP+L Sbjct: 297 MAENANLRQQFGGGGMCPPPHAGMYPHPSMAPMAYPWVPCAPYVVKPQGSQVPLVPIPRL 356 Query: 1062 KPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVP 1241 KPQ P+SA K K+E+KK ++K+KKV SVS VPFVN++YGG +E VP Sbjct: 357 KPQAPVSAPKVKKTENKKNETKSKKVVSVSLLGMLSFMFLMGCLVPFVNIKYGGIKETVP 416 Query: 1242 SGFGSRFTNGGFDRWSQGRVLTV-----NSSHESG---KRELHT-----GNSGFRERCTN 1382 S + + F + R+LTV S++ G +H+ G SG + Sbjct: 417 G--RSDYISNRFSDMHRRRILTVKDDLNGSNYGMGVGFDDRIHSERGRGGGSGSEVKQKG 474 Query: 1383 GXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTA 1562 G R N+SEPLV SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S A Sbjct: 475 GGSKPLPGSDGYAHSR-NASEPLVASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-SHAA 532 Query: 1563 SGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSS 1724 K+ K ++S +N RETGL I NL+ A VS GRN ++ + EQ KAL+S Sbjct: 533 LAKKSPKPSVSLANDVRETGLAIAGNLATAFPVSEVGRNKGRHPHLFRNPAEQHKALASG 592 Query: 1725 SGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANI 1898 S DT K N + ++ DG LQ+WFREGL GP+L SGMCTEVF F++ ANI Sbjct: 593 SSDTLKENLQPTSTDGKLQQWFREGLAGPMLSSGMCTEVFQFDVSPAPGAIVPVSSVANI 652 Query: 1899 SAKDHKNYTDPTKRRKNRRILDHLPIPLA-ELNATGQEGSRASHSHDGSSNGNRSSSPMV 2075 SA++ +N T K R NRRIL LPIPLA + +EG + D N++ S MV Sbjct: 653 SAENQQNATHLNKGR-NRRILHGLPIPLAGSTHNITEEGMGRNSQKDNFQGSNKNVSSMV 711 Query: 2076 VSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 VSVLFDPREAGDS+ +GM+ PKS SRIFVVVLLDSVKYVTYSC LP KAS+ HLVT Sbjct: 712 VSVLFDPREAGDSDGDGMMGPKSLSRIFVVVLLDSVKYVTYSCGLPLKASAPHLVT 767 >emb|CBI32817.3| unnamed protein product [Vitis vinifera] Length = 680 Score = 475 bits (1223), Expect = e-131 Identities = 332/732 (45%), Positives = 405/732 (55%), Gaps = 27/732 (3%) Frame = +3 Query: 129 SSNGDLTADFDDLQFPPLDVDYLS---NDLMIPEGLMEELGFDP-DFEFSLDNLSFPPEN 296 S N + +AD + L PPLD D+ S ND + E M +LG D DF+F+ D+L FP E+ Sbjct: 10 SPNPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSES 69 Query: 297 EGF---------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPG 449 E F GS G D + V SGD +SD S E G CD++ P Sbjct: 70 EDFLADFPLPEEGSGGHDSADRSFDV----SGDRNSD-------VSSIELGCCDQKLSP- 117 Query: 450 PVSSQDSAGCRSVVDGFFNSPSPDSGV--HSQSGPAS----DVRSGAVVVEDDEQKVKLE 611 PV+SQ S+ V NSP DSG HS P+S D G V +QKVKLE Sbjct: 118 PVASQSSSDQNLDV----NSPLLDSGNSDHSSWVPSSPNLADNSWGVV-----DQKVKLE 168 Query: 612 EGGXXXXXXXXXXXXXXSFSDNARSCKFRWA-IQSENANSMPDEKDKRKARLTRNRESAQ 788 + G + +RS KFR + I SE AN+ DE++K+KARL RNRESAQ Sbjct: 169 DSGKNSVPKRKKEQDDST--TESRSSKFRRSSICSETANASNDEEEKKKARLMRNRESAQ 226 Query: 789 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXX 968 LSRQRKKHYVEELE+K+RSMHS I DL GKIS MAEN +LRQQ G +C Sbjct: 227 LSRQRKKHYVEELEEKIRSMHSTIQDLTGKISIIMAENANLRQQFGGGGMCPPPHAGMYP 286 Query: 969 XXXXXXXXXXX---CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKV 1139 Y +KP+GSQVPLVPIP+LKPQ P+SA K K+E+KK ++K+KKV Sbjct: 287 HPSMAPMAYPWVPCAPYVVKPQGSQVPLVPIPRLKPQAPVSAPKVKKTENKKNETKSKKV 346 Query: 1140 ASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSS 1319 SVS VPFVN++YGG +E VP S + + F + R+LTV Sbjct: 347 VSVSLLGMLSFMFLMGCLVPFVNIKYGGIKETVPGR--SDYISNRFSDMHRRRILTVKDD 404 Query: 1320 HESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKID 1499 + G F +R G N+SEPLV SLYVPRNDKLVKID Sbjct: 405 LNGSNYGMGVG---FDDRIHRGSKPLPGSDGYAHS--RNASEPLVASLYVPRNDKLVKID 459 Query: 1500 GNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGRN 1676 GNLIIHSVLASEKAMA S A K+ K ++S +N RETGL I NL+ A VS Sbjct: 460 GNLIIHSVLASEKAMA-SHAALAKKSPKPSVSLANDVRETGLAIAGNLATAFPVS----- 513 Query: 1677 CMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI 1856 + ++ DG LQ+WFREGL GP+L SGMCTEVF F++ Sbjct: 514 -------------------------EPTSTDGKLQQWFREGLAGPMLSSGMCTEVFQFDV 548 Query: 1857 XXXXXXXXXXX--ANISAKDHKNYTDPTKRRKNRRILDHLPIPLA-ELNATGQEGSRASH 2027 ANISA++ +N T K R NRRIL LPIPLA + +EG + Sbjct: 549 SPAPGAIVPVSSVANISAENQQNATHLNKGR-NRRILHGLPIPLAGSTHNITEEGMGRNS 607 Query: 2028 SHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCI 2207 D N++ S MVVSVLFDPREAGDS+ +GM+ PKS SRIFVVVLLDSVKYVTYSC Sbjct: 608 QKDNFQGSNKNVSSMVVSVLFDPREAGDSDGDGMMGPKSLSRIFVVVLLDSVKYVTYSCG 667 Query: 2208 LPFKASSHHLVT 2243 LP KAS+ HLVT Sbjct: 668 LPLKASAPHLVT 679 >ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutrema salsugineum] gi|557112529|gb|ESQ52813.1| hypothetical protein EUTSA_v10016317mg [Eutrema salsugineum] Length = 722 Score = 441 bits (1135), Expect = e-121 Identities = 312/738 (42%), Positives = 392/738 (53%), Gaps = 41/738 (5%) Frame = +3 Query: 153 DFDDLQFPPLDVDYLSNDLMIPEG-LMEELGF--DPDFEFSL-----DNLSFPPENEGFG 308 DFD + PP D Y S +P G LM +LGF D D EF L D+L FP ENE F Sbjct: 23 DFDSIPIPPFDQFYHSGSDQVPIGELMSDLGFPVDADGEFELTFDGMDDLYFPAENETFL 82 Query: 309 SE-GSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAP-------------- 443 + ++ G G S D SG C+R++P Sbjct: 83 IPVNASNQEQFGDFTPESEGSGISGDSLPKGDADKSTSGCCNRDSPRDSGDRCSGADRTL 142 Query: 444 --PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 617 P P+SSQ S C S V N SP +S VVV+ QKVK+EE Sbjct: 143 DLPTPLSSQGSGNCGSDVSEATNESSP--------------KSVNVVVD---QKVKVEEA 185 Query: 618 GXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQLS 794 SD +RS K+R + + +A+++ E+D K++ARL RNRESAQLS Sbjct: 186 ATASITKRKKEIEE-DMSDESRSSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQLS 244 Query: 795 RQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXXX 962 RQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 245 RQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHHPPPPMGM 304 Query: 963 XXXXXXXXXXXXXC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKV 1139 C Y +K +GSQVPL+PIP+LKPQ PL ASK KSESKK ++KTKKV Sbjct: 305 YPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNPLGASKAKKSESKKSEAKTKKV 364 Query: 1140 ASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSS 1319 AS+S P VNV YGG A + S + ++Q R + +S Sbjct: 365 ASISFLGLLLCLFLFGALAPIVNVNYGGISGAFYGNYRSNYVTDQI--YNQHRDRVLETS 422 Query: 1320 HESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKID 1499 ++ N R + GN SEPLV SL+VPRNDKLVKID Sbjct: 423 RSGAGTGVYNSNGMHCGRDCDRGPGKNMSATESSVPPGNGSEPLVASLFVPRNDKLVKID 482 Query: 1500 GNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR-- 1673 GNLII+S+LASEKA+A SR A S SN + +IP + SPAL + GR Sbjct: 483 GNLIINSILASEKAVA-SRKA----------SESNERKADLVIPKDYSPALPLPGVGRIE 531 Query: 1674 ---NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVF 1844 +Y S TE+QKALSS S DT K KT +G +Q+WFREG GP+ SGMCTEVF Sbjct: 532 DMAKHLYRSKTEKQKALSSGSADTLKDQIKTKAANGEMQQWFREGGAGPMFSSGMCTEVF 591 Query: 1845 HFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEG 2012 F++ N+SA+ KN T+ T+ RKNRR L LPIPL ++ N T + Sbjct: 592 QFDVSSTSGAIIPASPATNVSAEHSKNATN-TRSRKNRRTLRGLPIPLPGSDFNFTKE-- 648 Query: 2013 SRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKY 2189 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVL+DS KY Sbjct: 649 ----HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLVDSAKY 704 Query: 2190 VTYSCILPFKASSHHLVT 2243 VTYSC+LP ++ + HLVT Sbjct: 705 VTYSCVLP-RSGAPHLVT 721 >ref|XP_002526200.1| transcription factor hy5, putative [Ricinus communis] gi|223534478|gb|EEF36179.1| transcription factor hy5, putative [Ricinus communis] Length = 702 Score = 440 bits (1132), Expect = e-120 Identities = 321/735 (43%), Positives = 407/735 (55%), Gaps = 36/735 (4%) Frame = +3 Query: 147 TADFDDLQFPPLDVDYLSN-----DLMIPEGLMEELGFDPDFEFSLDNL---SFPPENEG 302 T DFD L PPLD +LS + + L L + DF+ + D+L + P +N+ Sbjct: 19 TDDFDSLAIPPLDPMFLSEQSSGENYNLVSDLQFSLDDNYDFDITFDDLVDFNLPSDNDH 78 Query: 303 FGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPGPVSSQDSAGCR 482 G D S A G VA +L+ P +S + C Sbjct: 79 --DHGHDRFSIDPKSASPELGISGDHHVATYLN--------------SSPSASNSTTTCS 122 Query: 483 SVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXX 662 S N SP S S +G + S VV+ QKVKLEE G Sbjct: 123 S--GDQLNVSSPVSSQGSGNGGSGVSDSVNFVVD---QKVKLEEEGSNSKNKNGSLSKRK 177 Query: 663 --SFSDNARSCKFRWAIQSENANS----MPDEKDKRKARLTRNRESAQLSRQRKKHYVEE 824 + S++ R+ K+R +SEN+N+ + DE +KRKARL RNRESAQLSRQRKKHYVEE Sbjct: 178 KENGSEDTRNQKYR---RSENSNANTQCVSDEDEKRKARLMRNRESAQLSRQRKKHYVEE 234 Query: 825 LEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSG-AVCXXXXXXXXXXXXXXXXXXXX 1001 LEDKV++MHS IADLN KISFFMAEN +LRQQLS G +C Sbjct: 235 LEDKVKTMHSTIADLNSKISFFMAENATLRQQLSGGNGMC-----PPPMYAPMPYPWVPC 289 Query: 1002 CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXX 1181 Y +K +GSQVPLVPIP+LK Q+P+SA+K KS+ KK + KTKKVASVS Sbjct: 290 APYVVKAQGSQVPLVPIPRLKSQQPVSAAKSKKSDPKKAEGKTKKVASVSFLGLLFFVLL 349 Query: 1182 XXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV----NSSHESGKRELHT 1349 VP VNV++GG E +G F + F +GRVL V N SHE+ T Sbjct: 350 FGGLVPIVNVKFGGVGENGANG----FVSDKFYNRHRGRVLRVDGHSNGSHENVDVGFST 405 Query: 1350 G--NSGFRERCTNG---------XXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKI 1496 G +S FR +C +G RGN+S+PL SLYVPRNDKLVKI Sbjct: 406 GDFDSCFRIQCGSGRNGCLAEKKGRLEHLPEADELVRRGNNSKPLAASLYVPRNDKLVKI 465 Query: 1497 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGR 1673 DGNLIIHSVLASE+AM+ + +N ++ETGL IP +LSP+ + GR Sbjct: 466 DGNLIIHSVLASERAMSSNEN-----------PEANKSKETGLAIPRDLSPSPTI--PGR 512 Query: 1674 -NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHF 1850 + +YG E+QKAL+S S DT + K++ DG LQ+WF EGL GP+L SGMC+EVF F Sbjct: 513 YSHLYGHHNERQKALTSGSSDTLNDHKKSAAADGKLQQWFHEGLAGPLLSSGMCSEVFQF 572 Query: 1851 EI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGSR 2018 + +NI+A+ +N T+ K+ KNRRIL LPIPL ++LN TG+ Sbjct: 573 DALPTPGAIIPASSVSNITAEGQQNATN-HKKGKNRRILHGLPIPLTGSDLNITGE---H 628 Query: 2019 ASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTY 2198 +S + GN+S SPMVVSVL DPREAGD E +G+I+PKS SRIFVVVLLDSVKYVTY Sbjct: 629 VGNSQKENFQGNKSVSPMVVSVLVDPREAGDIEVDGVIAPKSISRIFVVVLLDSVKYVTY 688 Query: 2199 SCILPFKASSHHLVT 2243 SC+LP S LVT Sbjct: 689 SCVLP--RSGPQLVT 701 >ref|XP_006430509.1| hypothetical protein CICLE_v10011169mg [Citrus clementina] gi|557532566|gb|ESR43749.1| hypothetical protein CICLE_v10011169mg [Citrus clementina] Length = 727 Score = 439 bits (1128), Expect = e-120 Identities = 326/753 (43%), Positives = 405/753 (53%), Gaps = 55/753 (7%) Frame = +3 Query: 153 DFDDLQFPPLDVDYLSNDLMIPEGLMEELGF----DPDFEFSLDNLSFPPENEGFG---- 308 DFD L PPLD YL++ + P ++L F + DF+F++D+L F E++ F Sbjct: 15 DFDALSIPPLDPPYLNSQIPHPCASSDDLDFFLDDNCDFDFTIDDLYFASEDDTFFLPSE 74 Query: 309 ----------SEGSDGLSSTVSVAWQNSG---DGSSDDVAGFLSYPSPESGSCDREAPPG 449 S G DG ++ S +SG + +S DV +L+Y S S +R Sbjct: 75 DPQDGEFGGFSPGVDGGAAAASPGSGSSGILGNPASLDVESYLNYSSSPQNSGNR----- 129 Query: 450 PVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEE---GG 620 +S +S G S + SGV S + A SG +VV+ QK+K+EE G Sbjct: 130 -ISHLNSIGISGG-----RSENSGSGVSSDNTDAPSPDSGNLVVD---QKIKMEEVSKKG 180 Query: 621 XXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQ 800 S S+ R +++N +++ +E+ KRKARL RNRESAQLSRQ Sbjct: 181 IFKRKKDIEETNNESRSNKYRKSSSLSVNEADNDHNLGEEEMKRKARLMRNRESAQLSRQ 240 Query: 801 RKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAV------CXXXXXXX 962 RKKHYVEELEDKVR+MHS IADLN KISFFMAEN SL+QQLS Sbjct: 241 RKKHYVEELEDKVRNMHSTIADLNSKISFFMAENASLKQQLSGSNAMPPPLGMYPPPPHM 300 Query: 963 XXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKLKPQK-----PLSASKPNKSESKKIQSK 1127 Y +KP+GSQVPLVPIP+LKPQ P K + ++SK SK Sbjct: 301 AAAPMPYGWMPCAAPYMVKPQGSQVPLVPIPRLKPQAAAAAVPSRTKKSDGNKSKSDGSK 360 Query: 1128 TKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSG-FGSRFTNGGFDRWSQGRVL 1304 TKKVASVS VP V+V+YGG R+ V G FGS GF +GRVL Sbjct: 361 TKKVASVSFLGLLFFILLFGGLVPLVDVKYGGIRDGVSGGHFGS-----GFYNQHRGRVL 415 Query: 1305 TV----NSSHESGKRELHTGNSGFRER--CTNG----XXXXXXXXXXXXXVR-GNSSEPL 1451 T+ N S ES G GF R C VR N+SEPL Sbjct: 416 TINGYSNGSGESMGIGFPNGRVGFDNRIHCARAVESKEKESQPAPDSDEFVRPRNASEPL 475 Query: 1452 VVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-I 1628 V SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S +N TGL I Sbjct: 476 VASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-----------SHDASKANSKEATGLAI 524 Query: 1629 PVNLSPALAV----SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFR 1793 P + SPALA+ N R+ Y + E+Q+A+SS S D K + K+S +G LQ+WF+ Sbjct: 525 PKDFSPALAIPDVRGNGARHSHFYRNPAERQRAISSGSTDALKDHMKSSAANGKLQQWFQ 584 Query: 1794 EGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDH 1967 EGL GP+L SGMCTEVF F+ AN++A+ +N T R +NRRIL Sbjct: 585 EGLSGPLLSSGMCTEVFQFDASPAPGAIIPASSVANMTAEHRQNATQ-VNRGRNRRILHR 643 Query: 1968 LPIPLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSF 2147 LP+PL N TG+ ++ S GN+S+S MVVSVL DPRE GD + EGMISPKS Sbjct: 644 LPVPLT--NFTGERKAQKE-----SFAGNKSASSMVVSVLVDPRETGDGDVEGMISPKSL 696 Query: 2148 SRIFVVVLLDSVKYVTYSCILPFKASSHHLVTN 2246 SRIFVVVLLDSVKYVTYSC LP S HLVT+ Sbjct: 697 SRIFVVVLLDSVKYVTYSCGLP--RSGLHLVTS 727 >ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thaliana] gi|20196934|gb|AAB86455.2| bZIP family transcription factor [Arabidopsis thaliana] gi|330254811|gb|AEC09905.1| Basic-leucine zipper (bZIP) transcription factor family protein [Arabidopsis thaliana] Length = 721 Score = 439 bits (1128), Expect = e-120 Identities = 310/737 (42%), Positives = 400/737 (54%), Gaps = 39/737 (5%) Frame = +3 Query: 150 ADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF--- 305 +DFD + PPLD D+ S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPLD-DHFSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESFLIP 81 Query: 306 ---GSEGSDGLSSTVSVAWQNSGD-------GSSDDVAGFLSYPSPE------SGSCDRE 437 ++ G + S + SGD + +G ++ SP SG+ Sbjct: 82 INTSNQEQFGDFTPESESSGISGDCIVPKDADKTITTSGCINRESPRDSDDRCSGADHNL 141 Query: 438 APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 617 P P+SSQ S C S V N SP S R+ AV +QKVK+EE Sbjct: 142 DLPTPLSSQGSGNCGSDVSEATNESSPKS------------RNVAV-----DQKVKVEEA 184 Query: 618 GXXXXXXXXXXXXXXS-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQL 791 +D +R+ K+R + + +A+++ E+D K++ARL RNRESAQL Sbjct: 185 ATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQL 244 Query: 792 SRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXX 959 SRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 245 SRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMG 304 Query: 960 XXXXXXXXXXXXXXC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKK 1136 C Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KTKK Sbjct: 305 MYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKTKK 364 Query: 1137 VASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS 1316 VAS+S P VNV YGG A + S + +SQ R +++ Sbjct: 365 VASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQI--YSQHRDRVLDT 422 Query: 1317 SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKI 1496 S + N R R ++ GN SEPLV SL+VPRNDKLVKI Sbjct: 423 SRSGAGTGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKI 482 Query: 1497 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR- 1673 DGNLII+S+LASEKA+A SR AS K K + +I + +PAL + + GR Sbjct: 483 DGNLIINSILASEKAVA-SRKASESKERKADL----------MISKDYTPALPLPDVGRT 531 Query: 1674 ----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEV 1841 +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ SGMCTEV Sbjct: 532 EELAKHLYRSKAEKQKALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEV 591 Query: 1842 FHFEIXXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGS 2015 F F++ N+SA+ KN TD T +++NRRIL LPIPL ++ N T + Sbjct: 592 FQFDVSSTSGAIIPAATNVSAEHGKNTTD-THKQQNRRILRGLPIPLPGSDFNLTKE--- 647 Query: 2016 RASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKYV 2192 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVLLDS KYV Sbjct: 648 ---HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLLDSAKYV 704 Query: 2193 TYSCILPFKASSHHLVT 2243 TYSC+LP ++ + HLVT Sbjct: 705 TYSCVLP-RSGAPHLVT 720 >gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding protein [Arabidopsis thaliana] gi|23198400|gb|AAN15727.1| putative TGACG-sequence-specific bZIP DNA-binding protein [Arabidopsis thaliana] Length = 721 Score = 438 bits (1127), Expect = e-120 Identities = 309/737 (41%), Positives = 400/737 (54%), Gaps = 39/737 (5%) Frame = +3 Query: 150 ADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF--- 305 +DFD + PPLD D+ S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPLD-DHFSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESFLIP 81 Query: 306 ---GSEGSDGLSSTVSVAWQNSGD-------GSSDDVAGFLSYPSPE------SGSCDRE 437 ++ G + S + SGD + +G ++ SP SG+ Sbjct: 82 INTSNQEQFGDFTPESESSGISGDCIVPKDADKTITTSGCINRESPRDSDDRCSGADHNL 141 Query: 438 APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 617 P P+SSQ S C S V N SP S R+ AV +QKVK+EE Sbjct: 142 DLPTPLSSQGSGNCGSDVSEATNESSPKS------------RNVAV-----DQKVKVEEA 184 Query: 618 GXXXXXXXXXXXXXXS-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQL 791 +D +R+ K+R + + +A+++ E+D K++ARL RNRESAQL Sbjct: 185 ATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQL 244 Query: 792 SRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXX 959 SRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 245 SRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMG 304 Query: 960 XXXXXXXXXXXXXXC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKK 1136 C Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KTKK Sbjct: 305 MYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKTKK 364 Query: 1137 VASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS 1316 VAS+S P VNV YGG A + S + +SQ R +++ Sbjct: 365 VASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQI--YSQHRDRVLDT 422 Query: 1317 SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKI 1496 S + N R R ++ GN SEPLV SL+VPRNDKLVKI Sbjct: 423 SRSGAGTGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKI 482 Query: 1497 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR- 1673 DGNL+I+S+LASEKA+A SR AS K K + +I + +PAL + + GR Sbjct: 483 DGNLVINSILASEKAVA-SRKASESKERKADL----------MISKDYTPALPLPDVGRT 531 Query: 1674 ----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEV 1841 +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ SGMCTEV Sbjct: 532 EELAKHLYRSKAEKQKALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEV 591 Query: 1842 FHFEIXXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGS 2015 F F++ N+SA+ KN TD T +++NRRIL LPIPL ++ N T + Sbjct: 592 FQFDVSSTSGAIIPAATNVSAEHGKNTTD-THKQQNRRILRGLPIPLPGSDFNLTKE--- 647 Query: 2016 RASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKYV 2192 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVLLDS KYV Sbjct: 648 ---HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLLDSAKYV 704 Query: 2193 TYSCILPFKASSHHLVT 2243 TYSC+LP ++ + HLVT Sbjct: 705 TYSCVLP-RSGAPHLVT 720 >ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Capsella rubella] gi|482562470|gb|EOA26660.1| hypothetical protein CARUB_v10022722mg [Capsella rubella] Length = 725 Score = 437 bits (1124), Expect = e-119 Identities = 313/746 (41%), Positives = 399/746 (53%), Gaps = 48/746 (6%) Frame = +3 Query: 150 ADFDDLQFPPLDVDYL---SNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF 305 +DFD + PP D + S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 26 SDFDSISIPPFDDQFYHPGSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESF 83 Query: 306 GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPE-------SGSCDREAP------- 443 L + + + GD + D +S + SG +RE+P Sbjct: 84 -------LIPVNTSSQEQFGDFTPDSEGSGISGDPKDVFKNITTSGCSNRESPRDSDDRC 136 Query: 444 ---------PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQ 596 P P+SSQ S C S V N SP +S VVV+ Q Sbjct: 137 SGADPSLDLPTPLSSQGSGNCASDVSEATNESSP--------------KSRNVVVD---Q 179 Query: 597 KVKLEEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSE-NANSMPDEKD-KRKARLTR 770 KVK+EE S +RS K+R + + + +A+++ E+D K+KARL R Sbjct: 180 KVKVEEAATTTSITKRKKEIEEDLSGESRSSKYRRSGEEDIDASAVTGEEDEKKKARLMR 239 Query: 771 NRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC--- 941 NRESAQLSRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 240 NRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPH 299 Query: 942 -XXXXXXXXXXXXXXXXXXXXC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKK 1115 C Y +K +GSQVPL+PIP+LKPQ PL SK KSESKK Sbjct: 300 HPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNPLGTSKAKKSESKK 359 Query: 1116 IQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQG 1295 ++KTKKVAS+S P VNV YGG A + + +SQ Sbjct: 360 SEAKTKKVASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRPNYITDQI--YSQH 417 Query: 1296 RVLTVNSSHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPR 1475 R +++S + N R ++ GN SEPLV SL+VPR Sbjct: 418 RDRVLDTSRSGAGTGVSNSNGMDCGRDSDRGTRNNISATESSVPPGNGSEPLVASLFVPR 477 Query: 1476 NDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALA 1655 NDKLVKIDGNLII+S+LASEKA+A SR A S SN + +IP + SPAL Sbjct: 478 NDKLVKIDGNLIINSILASEKAVA-SRKA----------SESNERKADLVIPKDYSPALP 526 Query: 1656 VSNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILG 1820 + + GR +Y S TE+QKALSS S D+ K KT +G +Q+WFREG+ GP+ Sbjct: 527 LPDVGRTEEMAKHLYRSKTEKQKALSSGSADSLKDQFKTKAANGEMQQWFREGVAGPMFS 586 Query: 1821 SGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AE 1988 SGMCTEVF F++ N+SA+ KN TD T++RKNRRIL LPIPL ++ Sbjct: 587 SGMCTEVFQFDVSSTSGAIIPASPATNVSAEHSKNTTD-TRKRKNRRILRGLPIPLPGSD 645 Query: 1989 LNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVV 2165 N T + H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVV Sbjct: 646 FNLTKE------HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVV 699 Query: 2166 VLLDSVKYVTYSCILPFKASSHHLVT 2243 VLLDS KYVTYSC+LP ++ + HLVT Sbjct: 700 VLLDSAKYVTYSCVLP-RSGAPHLVT 724 >gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1B [Morus notabilis] Length = 797 Score = 431 bits (1107), Expect = e-117 Identities = 323/803 (40%), Positives = 412/803 (51%), Gaps = 102/803 (12%) Frame = +3 Query: 141 DLTADFDDLQFPPLDVDYL-SNDLMIPEGLMEELGF------DPDFEFS--LDNLSFPPE 293 D +A+F+ L PPLD + S+D + E +LG D DF F D+L P E Sbjct: 21 DFSAEFEPLSIPPLDHQFFSSDDAALREDFFSDLGLGLEENCDYDFTFDDIGDDLYLPSE 80 Query: 294 NEGF----GSE-GSDGLS-------------STVSVAWQNSGDGSSD---------DVAG 392 E F G + G + LS S VA +++ S DVAG Sbjct: 81 TEEFLIPDGLDIGPNSLSPNGTNSDRDVNPISEADVAAKSASPESESSTVSGVRDYDVAG 140 Query: 393 FLSYPSPESGSCDREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSG----------VHSQ- 539 FL+ S ESG C+ E S++ A +S +DG +SPSPD G V SQ Sbjct: 141 FLNCQSSESGGCNSE------YSRNLADRKSKIDGVMDSPSPDCGNCDQECSGEAVSSQG 194 Query: 540 -----SGPASDVRSGAVVVEDD---------EQKVKLEEGGXXXXXXXXXXXXXXSFSDN 677 SG + S A D +QKVK+EE G + Sbjct: 195 SGNCGSGVSEGANSPAHSGNSDKDVSSCVFVDQKVKVEEVGKNYMSKRKKEPEEG--NAE 252 Query: 678 ARSCKF-RWAIQSENA------NSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDK 836 +R+ K+ R + +EN N + DE++KRKARL RNRESAQLSRQRKKHYVEELEDK Sbjct: 253 SRTPKYRRSSAPAENTHSQSTLNPLSDEEEKRKARLMRNRESAQLSRQRKKHYVEELEDK 312 Query: 837 VRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC-----XXXXXXXXXXXXXXXXXXXX 1001 +RSM+S I DLN +IS+ M EN SLRQQLS G +C Sbjct: 313 LRSMNSTITDLNSRISYIMVENASLRQQLSGGGICPPPPPTPGMYPHPPMGPMPYPWVPY 372 Query: 1002 CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQ-SKTKKVASVSXXXXXXXXX 1178 Y +KP+GSQVPLVPIP+LKPQ+ +SASK KSE KK + KTKKVAS+S Sbjct: 373 APYVVKPQGSQVPLVPIPRLKPQQTVSASKAKKSEGKKSEGGKTKKVASISFLGLLFFVF 432 Query: 1179 XXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS-------------- 1316 VP VNV +GG P G +T+G +G VLT + Sbjct: 433 LFGGLVPMVNVNFGGLTNNAPGGL--VYTSGRLYDQHRGSVLTADHLLNGSGENMRVGSF 490 Query: 1317 ---SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKL 1487 HE G+ + G +ER + GN SEPLV SLYVPRNDKL Sbjct: 491 NSVQHERGREQGEKLECGEKERGSQALPGSGEFIRL-----GNDSEPLVASLYVPRNDKL 545 Query: 1488 VKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAV--- 1658 VKIDGNLIIHSVLASEKA A + +T+++ I +++P+ AV Sbjct: 546 VKIDGNLIIHSVLASEKAKASLAHSEMKSKTETSLA----------IARDVAPSYAVPEV 595 Query: 1659 -SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMC 1832 N GR+ +Y + E+ KALSS + D K+S DG LQ+WFREGL GP+L SGMC Sbjct: 596 GGNRGRHAPLYRNPVERHKALSSGATDATNDRLKSSAADGKLQQWFREGLAGPMLSSGMC 655 Query: 1833 TEVFHFEIXXXXXXXXXXXA----NISAKDHKNYTDPTKRRK--NRRILDHLPIPLAELN 1994 TEVF F++ A N+SAK +N T +R K NRRIL LP PL++ N Sbjct: 656 TEVFQFDVSPASTSGAIVPASSISNVSAKQRQNTTQNGRRLKGVNRRILRRLPAPLSDSN 715 Query: 1995 ATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLL 2174 E + + G+R+ S MVVSVL DPREAGD++ +G++ PKS SRIFVVVL+ Sbjct: 716 FNISEERTSRNLRKDEFQGSRNVSSMVVSVLVDPREAGDNDVDGVMKPKSLSRIFVVVLM 775 Query: 2175 DSVKYVTYSCILPFKASSHHLVT 2243 DSV+YVTYSC+LP S HLVT Sbjct: 776 DSVRYVTYSCVLP--RSGPHLVT 796 >ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629395 [Citrus sinensis] Length = 719 Score = 428 bits (1101), Expect = e-117 Identities = 323/747 (43%), Positives = 404/747 (54%), Gaps = 49/747 (6%) Frame = +3 Query: 153 DFDDLQFPPLDVDYLSNDLMIPEGLMEELGF----DPDFEFSLDNLSFPPENEGF----- 305 DFD L PPLD YL++ + P ++L F + DF+F++D+L F E++ F Sbjct: 15 DFDALSIPPLDPPYLNSQIPHPCASSDDLDFVLDDNCDFDFTIDDLYFASEDDTFFLPSE 74 Query: 306 ----GSEGS-----DGLSSTVSVAWQNSG---DGSSDDVAGFLSYPSPESGSCDREAPPG 449 G G DG ++ VS +SG + +S DV +L+Y S S +R Sbjct: 75 DPHDGQFGDFSPDVDGGAAAVSPGSGSSGILGNPASLDVESYLNYSSSPQNSGNR----- 129 Query: 450 PVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEE---GG 620 +S + G V G S + SGV S + SG +VV+ QK+K+EE G Sbjct: 130 -ISHLNYIG----VSGG-RSENSGSGVSSDNTDDPSPDSGNLVVD---QKIKMEEVSKKG 180 Query: 621 XXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQ 800 S S+ R +++N +++ +E+ KRKARL RNRESAQLSRQ Sbjct: 181 IFKRKKDIEETNNESRSNKYRKSSSLSVNEADNDHNLGEEEMKRKARLMRNRESAQLSRQ 240 Query: 801 RKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAV------CXXXXXXX 962 RKKHYVEELEDKVR+MHS IADLN KISFFMAEN SL+QQLS Sbjct: 241 RKKHYVEELEDKVRNMHSTIADLNSKISFFMAENASLKQQLSGSNAMPPPLGMYPPPPHM 300 Query: 963 XXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVA 1142 Y +KP+GSQVPLVPIP+LKPQ +A+ P +++ K SKTKKVA Sbjct: 301 AAAPMPYGWMPCAAPYMVKPQGSQVPLVPIPRLKPQ--AAAAVPPRTK-KSDGSKTKKVA 357 Query: 1143 SVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV---- 1310 SVS VP V+V+YGG R+ V G+ S GF +GRVLT+ Sbjct: 358 SVSFLGLLFFILLFGGLVPLVDVKYGGIRDGVSGGYFS----SGFYNQHRGRVLTINGYS 413 Query: 1311 NSSHESGKRELHTGNSGFRER--CTNG----XXXXXXXXXXXXXVR-GNSSEPLVVSLYV 1469 N S ES G GF R C VR N+SEPLV SLYV Sbjct: 414 NGSGESMGIGFPNGRVGFDNRIHCARAVESKEKESQPAPDSDEFVRPRNASEPLVASLYV 473 Query: 1470 PRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSP 1646 PRNDKLVKIDGNLIIHSVLA EKAMA S +N TGL IP + SP Sbjct: 474 PRNDKLVKIDGNLIIHSVLAGEKAMA-----------SHDASKANSKEATGLAIPKDFSP 522 Query: 1647 ALAV----SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGP 1811 ALA+ N R+ Y + E+Q+A+SS S D K + K+S +G LQ+WF+EGL GP Sbjct: 523 ALAIPDVRGNGARHSHFYRNPAERQRAISSGSTDALKDHMKSSAANGKLQQWFQEGLSGP 582 Query: 1812 ILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPLA 1985 +L SGMCTEVF F+ AN++A+ +N T R +NRRIL LP+PL Sbjct: 583 LLSSGMCTEVFQFDASPAPGAIIPASSVANMTAEHRQNATQ-VNRGRNRRILHRLPVPLT 641 Query: 1986 ELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVV 2165 N TG+ + S GN+S+S MVVSVL DPRE GD + EGMISPKS SRIFVV Sbjct: 642 --NITGERKVQKE-----SFAGNKSASSMVVSVLVDPRETGDGDVEGMISPKSLSRIFVV 694 Query: 2166 VLLDSVKYVTYSCILPFKASSHHLVTN 2246 VLLDSVKYVTYSC LP S HLVT+ Sbjct: 695 VLLDSVKYVTYSCGLP--RSGLHLVTS 719 >ref|XP_002323223.2| bZIP transcription factor family protein [Populus trichocarpa] gi|550320719|gb|EEF04984.2| bZIP transcription factor family protein [Populus trichocarpa] Length = 640 Score = 424 bits (1091), Expect = e-116 Identities = 300/717 (41%), Positives = 381/717 (53%), Gaps = 24/717 (3%) Frame = +3 Query: 165 LQFPPLDVDYLS-----NDLMIPEGLMEELGFDPDFEFSLDNLS---FPPENEGFGSEGS 320 L PPLD + + ND + L + DF+ + D+L+ FP ENE F Sbjct: 9 LPTPPLDPLFFNQNSDQNDNLNVPDLSSDFEDMSDFDITFDDLTDLYFPSENEQFLIP-- 66 Query: 321 DGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCD------REAPPGPVSSQDSAGCR 482 D +S S GD +V +L+ E+GSCD R + GP SS S Sbjct: 67 DNNASPESGGSGICGDQGGLEVDKYLNPSPSEAGSCDSGGSDSRSSDLGPASSHGSGNSG 126 Query: 483 SVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXX 662 S + G G DV + + + V + GG Sbjct: 127 S-------GRKKEMG----DGENGDVMRNFKSRKAEGEDVSVNVGGG------------- 162 Query: 663 SFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVR 842 + SE E++KR+ARL RNRESA LSRQRKKHYVEELEDKVR Sbjct: 163 -------------VVSSE-------EEEKRRARLVRNRESAHLSRQRKKHYVEELEDKVR 202 Query: 843 SMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXXXXXXXXXXXXXCGYPMKP 1022 +MHS IADLNGK+S+FMAEN +LRQQL+ + C Y +KP Sbjct: 203 AMHSTIADLNGKVSYFMAENATLRQQLNGNSAC----PPPMYAPMAPYPWVPCAPYVVKP 258 Query: 1023 RGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPF 1202 +GSQVPLVPIP+LKPQ+ + +K K ESKK + KTKKVASVS P Sbjct: 259 QGSQVPLVPIPRLKPQQAVPMAKTKKVESKKGEGKTKKVASVSLIGLVFFILLFGGLAPM 318 Query: 1203 VNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKRELH-TGNSGFRERCT 1379 V+V++GG RE+ SGFG F + F +GRVL V+ H +G E H + N G E Sbjct: 319 VDVKFGGVRESGISGFG--FGSERFLDQHKGRVLIVD-GHSNGSHENHDSANKGAAEHLP 375 Query: 1380 NGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRT 1559 GN+SE LV SLYVPRNDKLVKIDGNLIIHS+LASE+AMA Sbjct: 376 GSDEFGQF---------GNASEQLVASLYVPRNDKLVKIDGNLIIHSILASERAMA---- 422 Query: 1560 ASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSS 1724 + E+ + + ALA+ + G N +Y + E+QKAL+S Sbjct: 423 ----------------SHESPEVNITKQTALAIPDVGNNRGRHSHVYRTHAERQKALASG 466 Query: 1725 SGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANI 1898 S DT K N K+S G LQ+WFREGL GP+L SGMCTEVF F++ AN+ Sbjct: 467 SADTSKDNLKSSAAKGKLQQWFREGLAGPLLSSGMCTEVFQFDVSPTPGAIVPASSVANV 526 Query: 1899 SAKDHKNYTDPTKRRKNRRILDHLPIPLA--ELNATGQEGSRASHSHDGSSNGNRSSSPM 2072 +A+ KN + + +NRRIL LPIPLA +LN TG+ R +H S GN+S SPM Sbjct: 527 TAEHQKNNSTRLNKGRNRRILRGLPIPLAGSDLNITGEHVGRKTHKE--SFQGNKSVSPM 584 Query: 2073 VVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 VVSVL DPREAGDS+ +G+I+PKS SRIFVVVL+DS+KYVTYSC+LP + HLVT Sbjct: 585 VVSVLVDPREAGDSDVDGVITPKSLSRIFVVVLVDSIKYVTYSCVLP--SIGPHLVT 639 >ref|XP_002881751.1| bZIP transcription factor family protein [Arabidopsis lyrata subsp. lyrata] gi|297327590|gb|EFH58010.1| bZIP transcription factor family protein [Arabidopsis lyrata subsp. lyrata] Length = 724 Score = 424 bits (1091), Expect = e-116 Identities = 314/745 (42%), Positives = 396/745 (53%), Gaps = 47/745 (6%) Frame = +3 Query: 150 ADFDDLQFPPLDVD-YLSNDLMIPEG-LMEELGFDPDFEFSL-----DNLSFPPENEGF- 305 +DFD + PP D Y S P G LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPFDDHFYHSGSDHTPIGELMSDLGF-PDGEFELTFDGMDDLYFPAENESFL 83 Query: 306 ----------------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGS-CDR 434 SEGS G+S V +++ S +G ++ S + S DR Sbjct: 84 IPVNTSNQEQFGDFTPESEGS-GISGDCPVLPKDAD--KSITTSGCINRDSDDRCSGADR 140 Query: 435 EAP-PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLE 611 P P+SSQ S C S V N SP +S VVV+ QKVK+E Sbjct: 141 SLDLPTPLSSQGSGNCGSDVSEATNESSP--------------KSRNVVVD---QKVKVE 183 Query: 612 EGGXXXXXXXXXXXXXXS-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESA 785 E +D +R+ K+R + + +A+++ E+D K+KARL RNRESA Sbjct: 184 EAATTTSIITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKKARLMRNRESA 243 Query: 786 QLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXX 953 QLSRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 244 QLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHIPPPP 303 Query: 954 XXXXXXXXXXXXXXXXC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKT 1130 C Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KT Sbjct: 304 MGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKT 363 Query: 1131 KKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV 1310 KKVAS+S P VNV YGG A + S + + RVL Sbjct: 364 KKVASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQIYSQHRDRVLDT 423 Query: 1311 NSSHE----SGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRN 1478 + S S +H G R N GN SEPLV SL+VPRN Sbjct: 424 SRSGTGTGVSNSNGMHCGRDSDRGARKN------ISATESSVPPGNGSEPLVASLFVPRN 477 Query: 1479 DKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAV 1658 DKLVKIDGNLII+S+LASE+A+AL R AS K K + +I + SPAL + Sbjct: 478 DKLVKIDGNLIINSILASERAVAL-RKASESKERKADL----------VISKDYSPALPL 526 Query: 1659 SNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGS 1823 + G+ +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ S Sbjct: 527 PDVGKTEEMAKHLYRSKAEKQKALSSGSTDTLKDQFKTKAANGEMQQWFREGVAGPMFSS 586 Query: 1824 GMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AEL 1991 GMCTEVF F++ N+S + KN TD T ++KNRRIL LPIPL ++ Sbjct: 587 GMCTEVFQFDVSSTSGAIIPASPATNVSTEHGKNTTD-THKQKNRRILRGLPIPLPGSDF 645 Query: 1992 NATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVV 2168 N T + H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVV Sbjct: 646 NLTKE------HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVV 699 Query: 2169 LLDSVKYVTYSCILPFKASSHHLVT 2243 LLDS KYVTYSC+LP ++ + HLVT Sbjct: 700 LLDSAKYVTYSCVLP-RSGAPHLVT 723 >ref|XP_007028261.1| Transcription factor hy5, putative [Theobroma cacao] gi|508716866|gb|EOY08763.1| Transcription factor hy5, putative [Theobroma cacao] Length = 687 Score = 424 bits (1089), Expect = e-115 Identities = 305/749 (40%), Positives = 388/749 (51%), Gaps = 32/749 (4%) Frame = +3 Query: 93 SSVPGDTITGIGSSNGDLTADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLD 272 + P +T+ G ++ + L PPLD YLS DL L DF+ + D Sbjct: 2 AEAPAETVMG---------SELESLAIPPLDPLYLSTDLGF------SLDDHDDFQITFD 46 Query: 273 NLS---FPPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAP 443 + FP ++E + +S DV +L+ SPE GSC+ Sbjct: 47 DFDQFCFPSDSE--------------HLLIPDSSTTPDSDVERYLNSSSPELGSCNGPDS 92 Query: 444 PG----PVSSQDSAGCRSVVDGFFNSPSPDS-GVHSQSGPASDVRSGAVVVEDDEQKVKL 608 G P+SS S C S V N+ SPDS + Q ++ V +++ + Sbjct: 93 SGNSHSPLSSSGSGNCASAVSEAMNATSPDSENIVDQKISVEEIGKRRVSKRKKDRE-ET 151 Query: 609 EEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQ 788 + S SDN + N+N+ +E++KR+ARL RNRESAQ Sbjct: 152 DSSKCRRSSLTPSVNNSNSNSDNNNN---------NNSNAPSEEEEKRRARLMRNRESAQ 202 Query: 789 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSS--------GAVCX 944 LSRQRKKHYVEELEDKVR+MHS IADLN KI++FMAEN +LRQQLS+ GAV Sbjct: 203 LSRQRKKHYVEELEDKVRTMHSTIADLNNKIAYFMAENATLRQQLSTAGGGGGGGGAVMC 262 Query: 945 XXXXXXXXXXXXXXXXXXXCG--YPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKI 1118 C Y MKP GSQVPLVPIP+LKPQ+P + S++KK Sbjct: 263 PPQPLPMPMYPPMAYPWVPCAPPYVMKPPGSQVPLVPIPRLKPQQPPVPA----SKAKKN 318 Query: 1119 QSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGR 1298 +SKTKKVASVS P VN RY + GS F GF +GR Sbjct: 319 ESKTKKVASVSLLGMLFFILLFGGLAPIVNDRYDN------TPVGSGFVGDGFYEVHRGR 372 Query: 1299 VLTV----NSSHESGKRELHTGNSGFRER-----CTNGXXXXXXXXXXXXXVRGNSSEPL 1451 VL V N S+ S G R R +G N EPL Sbjct: 373 VLRVDGHLNGSNNSRDVAFSYGKFDRRNRVHGRGSESGVEQKEKGAHSVPGYMSNGGEPL 432 Query: 1452 VVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIP 1631 SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S AS +KN++T ++ IP Sbjct: 433 TASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-SHKASQIKNEETGLA----------IP 481 Query: 1632 VNLSPALAV----SNDG-RNCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFRE 1796 N SPALA+ N G R+ Y + E+Q ALSS + D K + K++ DG +Q+WFRE Sbjct: 482 NNFSPALAIPDARENGGKRSREYRNPAERQMALSSGNADALKDHFKSTVADGKMQQWFRE 541 Query: 1797 GLEGPILGSGMCTEVFHFEIXXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPI 1976 GL GP+L SGMCTEVF F++ N+SA+ H+N T K R NRRIL P+ Sbjct: 542 GLAGPMLSSGMCTEVFQFDV-SAAIVPASSVTNVSAEHHQNATRHNKGR-NRRILHGHPV 599 Query: 1977 PLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRI 2156 PL+ + E +S + GN+++S MVVSVLFDPREAGD + + MI+PK SRI Sbjct: 600 PLSRSDVNITEQHVGRNSPKENFKGNKTASSMVVSVLFDPREAGDGDIDDMIAPKPLSRI 659 Query: 2157 FVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 FVVVL+DSVKYVTYSC+LP HL+T Sbjct: 660 FVVVLVDSVKYVTYSCMLPLPGL--HLMT 686 >ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127362 [Glycine max] Length = 784 Score = 423 bits (1088), Expect = e-115 Identities = 313/768 (40%), Positives = 406/768 (52%), Gaps = 66/768 (8%) Frame = +3 Query: 138 GDLTADFDDLQFPPLDVDYLSND-LMIPEGLMEELGFDPDFEFS--------LDNLSFPP 290 GD +++F+ P +D + + D L L + FD + EF LD++ P Sbjct: 43 GDFSSNFNAFLIPSMDSLFNTTDALPFASDLEFGMDFDNNGEFEITFDDLDELDDIFIPS 102 Query: 291 ENEGF-----------------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPES 419 + E F ++ SD S VS SG+G S D S PSPE+ Sbjct: 103 DAEDFLLPDVCNSNYDSASPPIDAKNSDSPDSDVSAV---SGEGDSADNVRVSSVPSPEA 159 Query: 420 GSCDRE-APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQ---SGPASDVRSGAVVVED 587 CDRE + GPVSSQ S S V +SPSPDSG + + S A V + V +E+ Sbjct: 160 EFCDREESSNGPVSSQGSGNGGSGVYEAMHSPSPDSGPYERDITSSHAHAVTNNGVKMEE 219 Query: 588 D---EQKVKLEEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENA-NSMPDEKDKRK 755 + K K E FS + + QS++ N + DE +KRK Sbjct: 220 TPAFDLKRKKES-------CDGSATKHRRFSSSVENNNNNTEKQSQSGLNGIDDEDEKRK 272 Query: 756 ARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGA 935 ARL RNRESAQLSRQRKKHYVEELE+KVRS++S+IAD++ K+S+ +AEN +LRQQ+ + Sbjct: 273 ARLMRNRESAQLSRQRKKHYVEELEEKVRSLNSIIADMSSKMSYVVAENATLRQQVGAAG 332 Query: 936 VCXXXXXXXXXXXXXXXXXXXXCGYP--------MKPRGSQVPLVPIPKLKPQKPLSASK 1091 V YP +KP+GSQVPLVPIP+LKPQ+P SA K Sbjct: 333 VMCPPPPAPAPGMYPHHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQPASAPK 392 Query: 1092 PNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNG 1271 KSE+KK + KT KVAS+S VP V+ R+GG E VP S + + Sbjct: 393 GKKSENKKSEGKTTKVASISLLGLFFFIMLFGGLVPLVDFRFGGLVENVPGTGRSNYVSD 452 Query: 1272 GFDRWSQGRVLTVNSSHESGKRELHTGNSGF------------RERCTNGXXXXXXXXXX 1415 G+V ++N +R+ G S R R Sbjct: 453 RVYGQGGGKVWSLNGRRNGSERDEDVGFSNGGRFSVSDRVNYERGRNFREERHDRRKGSD 512 Query: 1416 XXXVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTIS 1595 +GN+SEPLV SLYVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA K DK Sbjct: 513 DFGRQGNASEPLVASLYVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAE-AKKDK---- 566 Query: 1596 SSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNSKT 1757 RETGL IP +L ALA+ GR+ +Y + EQ+KAL S S K + K+ Sbjct: 567 -----RETGLAIPKDLDSALAIPGVGRSRGQHPHVYSVSPEQRKALGSGSTKVLKDHMKS 621 Query: 1758 SNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDP 1931 S DG +Q+WFREGL GP+L SGMCTEVF F++ AN+S ++ +N T Sbjct: 622 SVTDGKMQQWFREGLVGPMLSSGMCTEVFQFDVSPSPGAIVPATSVANVSTENRQNATS- 680 Query: 1932 TKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREA 2105 K+ +NRR L LP PL + LN T + H +GN+SS MVVSVL DP+EA Sbjct: 681 VKKTRNRRTLHELPEPLNGSSLNITEERVKNLQKDH---LHGNKSS--MVVSVLVDPKEA 735 Query: 2106 GDS--ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 GD + +GM+ PKS SRIFVVVL+DSVKYVTYSC LP +S HLVT Sbjct: 736 GDGDVDVDGMMRPKSLSRIFVVVLIDSVKYVTYSCGLP--RASPHLVT 781 >ref|XP_003521109.2| PREDICTED: uncharacterized protein LOC100101871 [Glycine max] Length = 812 Score = 418 bits (1075), Expect = e-114 Identities = 313/770 (40%), Positives = 412/770 (53%), Gaps = 49/770 (6%) Frame = +3 Query: 81 TMDASSVPGDTITGI--GSSNGDLTADFDDLQ--FPPLDVDYLSNDLMIPEGLMEELGFD 248 T D P D G+ ++NG+ FDDL + P D + D ++P+ + Sbjct: 79 TTDGLPFPSDLEFGMDFNNNNGEFEITFDDLDDIYIPSDAE----DFLLPDAC------N 128 Query: 249 PDFEF---SLDNLSFPPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPES 419 P++ +D+ S + + DG+S + Q S S+D+V S PSPE+ Sbjct: 129 PNYASVSPPIDDSSAKNSDSDASAVSGDGVSRFFNS--QVSESDSADNVR-VPSVPSPEA 185 Query: 420 GSCDRE-APPGPVSSQDSAGCRSVVDGFFNSPSPDSG--------VHSQSGPASDVRSGA 572 C+RE + GPVSSQ S S V +SPSPDSG H+ + + V+ Sbjct: 186 EFCEREESSNGPVSSQGSGNGGSGVYEAMHSPSPDSGPYERDITSFHAHAATNNGVKMEE 245 Query: 573 VVVEDDEQKVKLEEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKDKR 752 V D ++K EG S +N + K QS+ N + DE +KR Sbjct: 246 VPAFDLKRKKGSCEGSATKHRRFS------SSVENNNNNKTEKQFQSD-LNGIEDEDEKR 298 Query: 753 KARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQL--S 926 KARL RNRESAQLSRQRKKHYVEELE+KVRS++S+IAD++ K+S+ +AE +LRQQ+ + Sbjct: 299 KARLMRNRESAQLSRQRKKHYVEELEEKVRSLNSIIADMSSKMSYMVAEIATLRQQVGAA 358 Query: 927 SGAVC-------XXXXXXXXXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKLKPQKPLSA 1085 +G +C Y +KP+GSQVPLVPIP+LKPQ+P SA Sbjct: 359 AGVMCPPPPPPAPGMYPHHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQPASA 418 Query: 1086 SKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFT 1265 K KSESKK + KTKKVAS+S VP V+ R+GG + VP S + Sbjct: 419 PKSKKSESKKSEGKTKKVASISLLGLFFFIMLFGGLVPVVDFRFGGLVDNVPGTGSSNYV 478 Query: 1266 NGGFDRWSQGRVLTVNSSHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXXVR----- 1430 + G+V ++N R+ G S R ++ R Sbjct: 479 SDRVYGHGGGKVWSLNGPRNGSGRDGDVGFSNGRFSVSDRVKNYEKRGRNLREERHDRKG 538 Query: 1431 -------GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTT 1589 GN+SEPLV SLYVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA K DK Sbjct: 539 PDDSSRQGNASEPLVASLYVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAE-AKKDK-- 594 Query: 1590 ISSSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNS 1751 RETGL IP +L ALA+ GR+ +Y + EQ+KAL S S K + Sbjct: 595 -------RETGLAIPKDLDSALAIPGVGRSRDQHPHVYRVSPEQRKALGSGSTKALKDHM 647 Query: 1752 KTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYT 1925 K+S DG +Q+WFREGL GP+L SGMCTEVF F+ AN+S ++H+N T Sbjct: 648 KSSATDGKMQQWFREGLAGPMLSSGMCTEVFQFDASPSPGAIVPATSVANVSTENHQNAT 707 Query: 1926 DPTKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPR 2099 K+ +NRR L LP PL + LN T ++ H +GN+SS MVVSVL DPR Sbjct: 708 S-VKKTRNRRTLHELPEPLNGSSLNITEEQVKNLQKDH---FHGNKSS--MVVSVLVDPR 761 Query: 2100 EAGDS--ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 EAGD + +GM+ PKS SRIFVVVL+DSVKYVTYSC LP +S HLVT Sbjct: 762 EAGDGDVDVDGMMRPKSLSRIFVVVLIDSVKYVTYSCGLP--RASPHLVT 809 >ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504999 [Cicer arietinum] Length = 786 Score = 416 bits (1070), Expect = e-113 Identities = 315/753 (41%), Positives = 393/753 (52%), Gaps = 61/753 (8%) Frame = +3 Query: 138 GDLTADFDDLQ---FPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEGFG 308 GD FDDL P D+L D P GL D +++ D +N +G Sbjct: 54 GDFEITFDDLDTLCIPSDTDDFLLPDAWNPNGLPISPLTDNHGDYNGDG-DCSAKNSDYG 112 Query: 309 SEGSDGLSSTVSVAWQN--------------SGDGSSDDVAGFLSYPSPESGSCDRE-AP 443 D S SV + S D +S DV S PSPE+ S DRE + Sbjct: 113 VANFDSPESGASVVSSDQSPDVSRFFNSESVSADDNSVDVK-ISSMPSPETESSDREESS 171 Query: 444 PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGX 623 GP+SSQ S S V NSPSPDSG + + S A+V E+ VKLE Sbjct: 172 NGPISSQGSGNGGSGVYEAMNSPSPDSGRYERD--ISSSHKHAIV----EEGVKLEGIVK 225 Query: 624 XXXXXXXXXXXXXSFSDNARSC---------KFRWAIQSENANS----MPDEKDKRKARL 764 S + C K + +Q + A S + DE +KRKARL Sbjct: 226 GCDLKRKKENCIESAENRTPKCSRRSSSMENKTQQQLQQQQAQSGFDGIEDEDEKRKARL 285 Query: 765 TRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC- 941 RNRESAQLSRQRKKHYVEELE+KVRSMHS IADL+ KI+F MAEN +LRQQL G +C Sbjct: 286 MRNRESAQLSRQRKKHYVEELEEKVRSMHSTIADLSSKITFVMAENATLRQQLGGGMMCP 345 Query: 942 -----XXXXXXXXXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSE 1106 Y +KP+GSQVPLVPIP+LKPQ+P S+SK K+E Sbjct: 346 PPPPAGSGMYPHPPMPPMPYPWMPYAPYVVKPQGSQVPLVPIPRLKPQQPASSSKSKKNE 405 Query: 1107 SKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRW 1286 SKK + KTKKVAS+S VP V+ ++GG + V SG S + DRW Sbjct: 406 SKKSEVKTKKVASISLLGLFFFIMLFGGLVPLVDFKFGGLVDNV-SGRSSYVS----DRW 460 Query: 1287 ----SQGRVLTVNSSHESGKRELHTGNSGFRERCTN----------GXXXXXXXXXXXXX 1424 GR+ V+ +R+ G S R ++ G Sbjct: 461 LYGQGGGRIWPVSGYRNESERDEELGFSNGRFGISDRNNYERGRKLGEEMNGWKDTSCFG 520 Query: 1425 VRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSN 1604 R N+SEPL+ SLYVPRNDKLVKIDGNLIIHS++ASEKAMA S+ A K Sbjct: 521 HRDNASEPLLASLYVPRNDKLVKIDGNLIIHSIMASEKAMA-SQDAQEKKVKS------- 572 Query: 1605 GARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNSKTSNN 1766 ETGL IP + ALA+ GRN +Y + EQ++A+ S S T K + K+S Sbjct: 573 ---ETGLAIPKDWDSALAIPEVGRNRGPHPNVYRVSAEQRRAIGSGSAKTLKDHMKSSAT 629 Query: 1767 DGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKR 1940 DG +Q+WFREGL GP+L SGMCTEVF F++ ANISA++ +N T K Sbjct: 630 DGKMQQWFREGLAGPMLSSGMCTEVFQFDVSPAPGAIVPATAVANISAENRRNATTVNKS 689 Query: 1941 RKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS 2114 R NRRIL LP PL + LN T + A + G GN+SS MVVSVL DP+E GD Sbjct: 690 R-NRRILHTLPDPLPGSTLNITEE---HARNLPKGHLPGNKSS--MVVSVLVDPKEVGDG 743 Query: 2115 ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILP 2213 + +GM++PKS +RIFVVVL+DSVKYVTYSC LP Sbjct: 744 DVDGMMAPKSLTRIFVVVLIDSVKYVTYSCGLP 776 >ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Populus trichocarpa] gi|550335363|gb|EEE92390.2| hypothetical protein POPTR_0006s03300g [Populus trichocarpa] Length = 729 Score = 412 bits (1058), Expect = e-112 Identities = 285/669 (42%), Positives = 358/669 (53%), Gaps = 16/669 (2%) Frame = +3 Query: 285 PPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPGPVSSQ 464 P E E S GSD SS +S S GS + +G LS SPESG+ Sbjct: 146 PSEAESCDSGGSDYRSSVLSPV---SSHGSGNSGSGVLSAGSPESGT------------- 189 Query: 465 DSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXX 644 + C VVD F V +++ A +S + V ++++ EE G Sbjct: 190 NVNPCNFVVDKKF--------VKTETESAKKRKSAKIAVAKRKKEMGDEENG-------- 233 Query: 645 XXXXXXSFSDNARSCKFRWAIQSENAN-------SMPDEKDKRKARLTRNRESAQLSRQR 803 + R+ K R A +SEN + S+ E+D+RKARL RNRESAQLSRQR Sbjct: 234 ---------EIMRNLKSRKA-ESENVSVNVSGSASLSGEEDRRKARLMRNRESAQLSRQR 283 Query: 804 KKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXXXXXXX 983 KKHYVEELEDKVR MHS IA LNGK+S+FMAEN +LR+QLS C Sbjct: 284 KKHYVEELEDKVRMMHSTIAQLNGKVSYFMAENATLRRQLSGNGAC----PPPMYAPMAP 339 Query: 984 XXXXXXCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXX 1163 Y +KP+GSQVPLVPIP+LKPQ+ + +KP K ESKK + KTKKVASVS Sbjct: 340 YPWVPCAPYVVKPQGSQVPLVPIPRLKPQQTVPLAKPKKGESKKGEGKTKKVASVSLFGF 399 Query: 1164 XXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKREL 1343 VP V+V++ GGF +GRVL V+ H +G E Sbjct: 400 LFFILLFRCLVPIVDVKF-----------------GGFFDQHKGRVLIVD-GHTNGSHEK 441 Query: 1344 HTGNSGFRERCTNGXXXXXXXXXXXXXVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSV 1523 N N GN+SE LV SLYVPRNDKLVKIDGNLIIHSV Sbjct: 442 RGHNGCLEHDSANKGASERLPGSDEFGQFGNASEHLVASLYVPRNDKLVKIDGNLIIHSV 501 Query: 1524 LASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRN-----CMYG 1688 LASE+ MA + E+ + + ALA+ G N +Y Sbjct: 502 LASERPMA--------------------SHESPEVNITKETALAIPGVGNNRGRHSHVYR 541 Query: 1689 SATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XX 1862 + TE+QKAL S S DT K N K+S G LQ+WFREGL GP+L GMCTEVF F++ Sbjct: 542 THTERQKALDSGSADTSKDNLKSSAAKGKLQQWFREGLAGPLLSHGMCTEVFQFDVSPAP 601 Query: 1863 XXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHD 2036 AN++A+ +N + K+ NRRIL LPIPL ++LN TG+ R ++ Sbjct: 602 GAIVPASSVANMTAERQQNNSTHLKKGNNRRILRGLPIPLPGSDLNITGEHVGR--NTQK 659 Query: 2037 GSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPF 2216 + +GN+S SPMVVSVL DPRE+ D E +G+I+PKS SRIFVVVLLDS+KYVTYSC+LP Sbjct: 660 ENFHGNKSVSPMVVSVLVDPRESSDREVDGVITPKSLSRIFVVVLLDSIKYVTYSCVLPS 719 Query: 2217 KASSHHLVT 2243 HLVT Sbjct: 720 AGPLLHLVT 728 >ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299380 isoform 1 [Fragaria vesca subsp. vesca] Length = 711 Score = 410 bits (1055), Expect = e-111 Identities = 307/766 (40%), Positives = 395/766 (51%), Gaps = 46/766 (6%) Frame = +3 Query: 84 MDASSVPGDTIT---GIGSSNGDLTADFDDLQFPPLDVDYLSNDL----MIPEGLMEELG 242 M+ S V GD + + D DF+ L PPLD + S+D M + M +LG Sbjct: 1 MEDSVVAGDPPIPHPDLAPNCSDSGEDFESLPIPPLDPQFFSSDAGMATMAADSFMSDLG 60 Query: 243 F------DPDFEFS---LDNLSFPPEN------EGFGSEGSDGLSSTVSVAWQNSGDGSS 377 F + D+E + LDNL P E EGF S+V + ++ GSS Sbjct: 61 FGFGSDDNCDYELTFDDLDNLYIPSEADDFLLPEGFDPAAQPSSDSSVILKSESPESGSS 120 Query: 378 DD-------VAGFLSYPSPESGSCDREAPP---GPVSSQDSAGCRSVVDGFFNSPSPDSG 527 V+GFL+YPS ESG D+E GP+SSQ S G + +S + D Sbjct: 121 GVSKGSDGVVSGFLNYPSSESGGHDQEFSENSGGPLSSQGS-GIPEAANSPTHSGNSDRD 179 Query: 528 VHSQSGPASD--------VRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXXSFSDNAR 683 V S A + RSG V K K E GG +R Sbjct: 180 VSSNVTTADEKVKIEEEVTRSGFVA------KRKKESGGGEEGNM------------ESR 221 Query: 684 SCKFRWAIQSENANS-MPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVI 860 S KFR + S + + DE ++RKARL RNRESAQLSRQRKKHYVEELEDKVR+MH+ I Sbjct: 222 SSKFRRSESSGGSGGCLDDEDERRKARLMRNRESAQLSRQRKKHYVEELEDKVRAMHTTI 281 Query: 861 ADLNGKISFFMAENVSLRQQLSSGA-VCXXXXXXXXXXXXXXXXXXXXCG-YPMKPRGSQ 1034 ADLN K+S+ MAEN +L+QQLSSG+ +C Y +KP+GSQ Sbjct: 282 ADLNNKMSYIMAENATLKQQLSSGSGICPPPPPPGMYPMPPMGYPWMPYSPYVVKPQGSQ 341 Query: 1035 VPLVPIPKLKPQKPLSASKP-NKSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNV 1211 VPLVPIP+LKPQ+P +A KP KSESK SKTKKVAS+S VP +NV Sbjct: 342 VPLVPIPRLKPQQPAAAPKPKKKSESK---SKTKKVASISFLGLLFFLLLFGGLVPMLNV 398 Query: 1212 RYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKRELHTGNSGFRERCTNGXX 1391 +G GS + F + +VL V + + G SG + +N Sbjct: 399 GFG----------GSSYVRDRFYDQQRAKVLKVPGHLNGSEGNVPLGVSGGKFDVSNKIH 448 Query: 1392 XXXXXXXXXXXVR-GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASG 1568 GN+SEPLV SLYVPRNDKLVKIDGNLIIHSVLASEKA A Sbjct: 449 ERAHKQKEQGLPGVGNASEPLVASLYVPRNDKLVKIDGNLIIHSVLASEKAKA------- 501 Query: 1569 VKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRNCMYGSATEQQKALSSSSGDTYKVN 1748 + K+ + GA+ G + P V+ R +Y + Q+KAL++ S Sbjct: 502 --HKKSREARVEGAK--GFVSALAIPEAGVNRGRRAPLYRTPAGQRKALTAGSA------ 551 Query: 1749 SKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEIXXXXXXXXXXXANI-SAKDHKNYT 1925 DG LQ+WFREGL G +L SGMCTEVF F++ +++ + +H + Sbjct: 552 ------DGKLQQWFREGLAGSLLSSGMCTEVFQFDVSAANSGGIIPASSVANVSEHNSNA 605 Query: 1926 DPTKRRKNRRILDHLPIPLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREA 2105 R NRRIL IPLA N + RA ++ S+N S+S +VVSVL DPREA Sbjct: 606 TRLNRGGNRRILGGRAIPLAGSNHNATDDERAIRNNQSSNNFQVSNSSVVVSVLVDPREA 665 Query: 2106 GDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 2243 GD + +GMI PKS SR+FVV+LLDSVKYVTYSC+LP +++ HLVT Sbjct: 666 GDIDVDGMIKPKSLSRVFVVLLLDSVKYVTYSCVLP-RSAPPHLVT 710 >gb|AGO05994.1| bZIP transcription factor family protein 10 [Camellia sinensis] Length = 718 Score = 409 bits (1050), Expect = e-111 Identities = 305/760 (40%), Positives = 400/760 (52%), Gaps = 57/760 (7%) Frame = +3 Query: 123 IGSSNGDLTADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEG 302 + S+ T D D L PPLD S+ + G +++L +F+ D+L P + Sbjct: 2 VDPSSNSTTTDSDSLPIPPLDPSIFSDSFLAGGGDIDDL------DFTFDDLYLPSDTPH 55 Query: 303 FGSE------GSDGLSSTVSVAWQNSGDG----SSDDVAGFLSYPSPESGS--------C 428 F + SD + + S S D ++ FL+ SPES Sbjct: 56 FLNSLPPPHFSSDWIPDFPIPSDHTSTPSRVFNSDDLISDFLNVSSPESSHESANKASIV 115 Query: 429 DREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKL 608 R P SSQ S SVV N SPDS +S + + +QK++L Sbjct: 116 ARVLDPEVSSSQGSGNSGSVVSEPLNYTSPDSANNS-------------IHDFVDQKIEL 162 Query: 609 EEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANS---------MPDEKDKRKAR 761 +E G S+ R+ K++ + EN N + ++ +K+KAR Sbjct: 163 KEEGTNCLLKRKKESEEDVNSE-FRTSKYQRSNSGENPNQSYGYTSNTGISEDDEKKKAR 221 Query: 762 LTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC 941 L RNRESAQLSRQRKKHYVEELEDK+R+MHS + DLN KIS+ MAEN SLRQQLS GA+C Sbjct: 222 LMRNRESAQLSRQRKKHYVEELEDKLRTMHSTVQDLNSKISYIMAENASLRQQLSGGAMC 281 Query: 942 XXXXXXXXXXXXXXXXXXXXCGYP--------MKPRGSQVPLVPIPKLKPQKPLSASKPN 1097 GYP +KP+GSQVPLVPIP+LK Q P A K Sbjct: 282 ---PPPVPPPGMYPHPPMAPMGYPWMPCPPYVVKPQGSQVPLVPIPRLKSQNPSPAPKAK 338 Query: 1098 KSESKKIQSKTKKVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGF 1277 K ESKK +KTKKVASVS VP VNV +GG R G + F NG + Sbjct: 339 KVESKK--TKTKKVASVSFLGLLFFILFFGGLVPMVNVNFGGIRRDTVLGGSNYFGNGFY 396 Query: 1278 DRWSQGRVLTVNSSHESGKRELHTG-NSGF--------RERCTNG---XXXXXXXXXXXX 1421 D+ GRV+TVN +++ G ++GF R+R + Sbjct: 397 DQ-HHGRVVTVNGHLNGSDQKIGMGLSNGFTNTTIHCGRDRAESNVEQIEGSQAFPGSDE 455 Query: 1422 XVR-GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISS 1598 VR NSS PLV SLYVPRNDKLVKIDGNLIIHS+LASEK+MA Sbjct: 456 FVRPDNSSMPLVASLYVPRNDKLVKIDGNLIIHSILASEKSMASGN------------GG 503 Query: 1599 SNGARETGL-IPVNLSPALAVS--NDGRN-CMYGSATEQQKALSSSSGDTYKVNSKTSNN 1766 +N + ETGL + N+ PA+ ++ N+G++ +Y S +E ++AL S S D K N K++ Sbjct: 504 TNSSEETGLAVARNMPPAIPLTERNNGKHPHLYRSTSEPKRALGSGSAD--KDNLKSTPA 561 Query: 1767 DGLLQKWFREGLEGPILGSGMCTEVFHFEIXXXXXXXXXXXA--NISAKDHKNYTDPTKR 1940 DG LQ+WF+EGL GP+L SGMCTEVF F++ + N+SA+ KN T K Sbjct: 562 DGKLQQWFQEGLAGPMLSSGMCTEVFQFDVSPVPGAIVPATSVVNVSAEHRKNATHIIK- 620 Query: 1941 RKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS 2114 NRRIL +PIPL ++ N + + R D +GN+S S MVVSVL DPR+AGD Sbjct: 621 GLNRRILHGVPIPLPGSQNNISKEHVGRNPEKDD--FHGNKSLSSMVVSVLVDPRDAGDI 678 Query: 2115 ESEGMIS-PKSFSRIFVVVLLDSVKYVTYSCILPFKASSH 2231 +S+G++ PKS SRIFVVVL+DSVKYVTYSC+LP S H Sbjct: 679 DSDGVMGPPKSLSRIFVVVLIDSVKYVTYSCMLPLMGSYH 718 >ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, partial [Phaseolus vulgaris] gi|561035512|gb|ESW34042.1| hypothetical protein PHAVU_001G1193000g, partial [Phaseolus vulgaris] Length = 779 Score = 407 bits (1047), Expect = e-111 Identities = 314/748 (41%), Positives = 400/748 (53%), Gaps = 48/748 (6%) Frame = +3 Query: 132 SNGDLTADFDDLQ---FPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEG 302 +NG+ FDDL P D+L D P+ LG P E S N P + Sbjct: 63 NNGEFEITFDDLDDICIPSDAEDFLLTDACNPDNT-SVLG--PIEESSAKNSDSPRSDAS 119 Query: 303 FGS-EGSDGLS------STVSVAWQNS-GDGSSDDVAGFLS-YPSPESGSCDRE-APPGP 452 S + S G+S ++ SV+ NS +GS D V +S PSPES CDRE + GP Sbjct: 120 VVSGDRSSGVSRFFNSQASDSVSEGNSCKEGSLDAVDVRVSNIPSPESEFCDREESSSGP 179 Query: 453 VSSQDSAGCRSVVDGFFNSPSPDSGVHSQ---SGPASDVRSGAVVVE-----DDEQKVKL 608 VSSQ S S V NSPSPDS + S A +V V +E D ++K + Sbjct: 180 VSSQGSGNAGSGVYEAINSPSPDSVSFERDITSSHAHEVMDKGVKLEEISGCDLKRKKES 239 Query: 609 EEGGXXXXXXXXXXXXXXSFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQ 788 EG FS ++ K S+ N++ D+ +KRKARL RNRESAQ Sbjct: 240 CEGSATKHRR---------FSSSSVDTKTEKQTPSD-VNAIDDDDEKRKARLMRNRESAQ 289 Query: 789 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC-----XXXX 953 LSRQRKKHYVEELE+KVRSM+S+IADL+ KIS+ +AEN +LRQQ+ +G +C Sbjct: 290 LSRQRKKHYVEELEEKVRSMNSIIADLSSKISYMVAENATLRQQVGAGVMCAPPPPAPGI 349 Query: 954 XXXXXXXXXXXXXXXXCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTK 1133 Y +KP+GSQVPLVPIP+LKPQ+ SA K KSESKK + KTK Sbjct: 350 YPHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQHTSAPKGKKSESKKSEGKTK 409 Query: 1134 KVASVSXXXXXXXXXXXXXXVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVN 1313 KVAS+S VP V+ ++GG + VP S + + G+V +VN Sbjct: 410 KVASISFLGLFFFIMLFGGLVPLVDFKFGGLVDNVPDTGLSSYVSDRVHGHGGGKVWSVN 469 Query: 1314 SSHESGKRELHTGNSGFR---------ERCTN-GXXXXXXXXXXXXXVRGNSSEPLVVSL 1463 +R+ G S R ER + G +GN+SEPLV SL Sbjct: 470 GPRNGSERDEEVGFSNERFSVKDKMNYERGRHLGEERGERQGPDDFGRQGNASEPLVASL 529 Query: 1464 YVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNL 1640 YVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA + +ETGL IP + Sbjct: 530 YVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAEAKEK-----------KETGLAIPKDS 577 Query: 1641 SPALAVSNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLE 1805 ALA+ GR +Y EQ+KAL S S K + K+S DG +Q+WFREGL Sbjct: 578 DSALAIPEVGRLRGQHPHVYRVPAEQRKALGSGSTKALKDHMKSSATDGKMQQWFREGLA 637 Query: 1806 GPILGSGMCTEVFHFEI--XXXXXXXXXXXANISAKDHKNYTDPTKRRKNRRILDHLPIP 1979 GP+L SGMCTEVF F++ AN+S + +N T K+ +NRR L LP Sbjct: 638 GPMLSSGMCTEVFQFDVSPSPGAIVPATSVANLSTEKRQNATS-VKKTRNRRTLHGLPDS 696 Query: 1980 L--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS--ESEGMISPKSF 2147 L + LN T + H +GN SS MVVSVL DP+EAGD + +GM+ PKS Sbjct: 697 LTGSSLNITEEHVKNLQKDH---LHGNESS--MVVSVLVDPKEAGDGDVDVDGMMRPKSL 751 Query: 2148 SRIFVVVLLDSVKYVTYSCILPFKASSH 2231 SRIFVVVL+DSVKYVTYSC LP +AS H Sbjct: 752 SRIFVVVLIDSVKYVTYSCGLP-RASPH 778