BLASTX nr result
ID: Sinomenium22_contig00016884
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00016884 (2469 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248...  507  e-140
emb|CBI32817.3| unnamed protein product [Vitis vinifera]             475  e-131
ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutr...  441  e-121
ref|XP_002526200.1| transcription factor hy5, putative [Ricinus ...  440  e-120
ref|XP_006430509.1| hypothetical protein CICLE_v10011169mg [Citr...  439  e-120
ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thalia...  439  e-120
gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding...  438  e-120
ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Caps...  437  e-119
gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1...  431  e-117
ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629...  428  e-117
ref|XP_002323223.2| bZIP transcription factor family protein [Po...  424  e-116
ref|XP_002881751.1| bZIP transcription factor family protein [Ar...  424  e-116
ref|XP_007028261.1| Transcription factor hy5, putative [Theobrom...  424  e-115
ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127...  423  e-115
ref|XP_003521109.2| PREDICTED: uncharacterized protein LOC100101...  418  e-114
ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504...  416  e-113
ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Popu...  412  e-112
ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299...  410  e-111
gb|AGO05994.1| bZIP transcription factor family protein 10 [Came...
409 e-111 ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, par... 407 e-111 >ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248184 [Vitis vinifera] Length = 768 Score = 507 bits (1305), Expect = e-140 Identities = 353/776 (45%), Positives = 435/776 (56%), Gaps = 71/776 (9%) Frame = -2 Query: 2339 SSNGDLTADFDDLQFPPLDVDYLS---NDLMIPEGLMEELGFDP-DFEFSLDNLSFPPEN 2172 S N + +AD + L PPLD D+ S ND + E M +LG D DF+F+ D+L FP E+ Sbjct: 10 SPNPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSES 69 Query: 2171 EGF------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSC---------- 2040 E F EGS G S S DV+ L+ PSPESG+C Sbjct: 70 EDFLADFPLPEEGSGGHDSA----------DRSFDVSKVLNSPSPESGNCGVESSLPCQV 119 Query: 2039 ----------------DREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQ--SGPAS 1914 D++ P PV+SQ S+ N PSP+SG + SGP S Sbjct: 120 SGDRNSDVSSIELGCCDQKLSP-PVASQSSSDQNLDGARVLNVPSPESGSCDRGFSGPES 178 Query: 1913 DVRSG-------AVVVEDDEQKVKLEEGGXXXXXXXXXXXXXESFSDNARSCKFRWA-IQ 1758 SG V +QKVKLE+ G + +RS KFR + I Sbjct: 179 SQGSGNGGSGVPGAVNCVVDQKVKLEDSGKNSVPKRKKEQDDST--TESRSSKFRRSSIC 236 Query: 1757 SENANSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFF 1578 SE AN+ DE++K+KARL RNRESAQLSRQRKKHYVEELE+K+RSMHS I DL GKIS Sbjct: 237 SETANASNDEEEKKKARLMRNRESAQLSRQRKKHYVEELEEKIRSMHSTIQDLTGKISII 296 Query: 1577 MAENVSLRQQLSSGAVC---XXXXXXXXXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKL 1407 MAEN +LRQQ G +C Y +KP+GSQVPLVPIP+L Sbjct: 297 MAENANLRQQFGGGGMCPPPHAGMYPHPSMAPMAYPWVPCAPYVVKPQGSQVPLVPIPRL 356 Query: 1406 KPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVP 1227 KPQ P+SA K K+E+KK ++K+KKV SVS LVPFVN++YGG +E VP Sbjct: 357 KPQAPVSAPKVKKTENKKNETKSKKVVSVSLLGMLSFMFLMGCLVPFVNIKYGGIKETVP 416 Query: 1226 SGFGSRFTNGGFDRWSQGRVLTV-----NSSHESG---KRELHT-----GNSGFRERCTN 1086 S + + F + R+LTV S++ G +H+ G SG + Sbjct: 417 G--RSDYISNRFSDMHRRRILTVKDDLNGSNYGMGVGFDDRIHSERGRGGGSGSEVKQKG 474 Query: 1085 GXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTA 906 G R N+SEPLV SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S A Sbjct: 475 
GGSKPLPGSDGYAHSR-NASEPLVASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-SHAA 532 Query: 905 SGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSS 744 K+ K ++S +N RETGL I NL+ A VS GRN ++ + EQ KAL+S Sbjct: 533 LAKKSPKPSVSLANDVRETGLAIAGNLATAFPVSEVGRNKGRHPHLFRNPAEQHKALASG 592 Query: 743 SGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANI 570 S DT K N + ++ DG LQ+WFREGL GP+L SGMCTEVF F++ VANI Sbjct: 593 SSDTLKENLQPTSTDGKLQQWFREGLAGPMLSSGMCTEVFQFDVSPAPGAIVPVSSVANI 652 Query: 569 SAKDHKNYTDPTKRRKNRRILDHLPIPLA-ELNATGQEGSRASHSHDGSSNGNRSSSPMV 393 SA++ +N T K R NRRIL LPIPLA + +EG + D N++ S MV Sbjct: 653 SAENQQNATHLNKGR-NRRILHGLPIPLAGSTHNITEEGMGRNSQKDNFQGSNKNVSSMV 711 Query: 392 VSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 225 VSVLFDPREAGDS+ +GM+ PKS SRIFVVVLLDSVKYVTYSC LP KAS+ HLVT Sbjct: 712 VSVLFDPREAGDSDGDGMMGPKSLSRIFVVVLLDSVKYVTYSCGLPLKASAPHLVT 767 >emb|CBI32817.3| unnamed protein product [Vitis vinifera] Length = 680 Score = 475 bits (1223), Expect = e-131 Identities = 333/732 (45%), Positives = 406/732 (55%), Gaps = 27/732 (3%) Frame = -2 Query: 2339 SSNGDLTADFDDLQFPPLDVDYLS---NDLMIPEGLMEELGFDP-DFEFSLDNLSFPPEN 2172 S N + +AD + L PPLD D+ S ND + E M +LG D DF+F+ D+L FP E+ Sbjct: 10 SPNPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSES 69 Query: 2171 EGF---------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPG 2019 E F GS G D + V SGD +SD S E G CD++ P Sbjct: 70 EDFLADFPLPEEGSGGHDSADRSFDV----SGDRNSD-------VSSIELGCCDQKLSP- 117 Query: 2018 PVSSQDSAGCRSVVDGFFNSPSPDSGV--HSQSGPAS----DVRSGAVVVEDDEQKVKLE 1857 PV+SQ S+ V NSP DSG HS P+S D G V +QKVKLE Sbjct: 118 PVASQSSSDQNLDV----NSPLLDSGNSDHSSWVPSSPNLADNSWGVV-----DQKVKLE 168 Query: 1856 EGGXXXXXXXXXXXXXESFSDNARSCKFRWA-IQSENANSMPDEKDKRKARLTRNRESAQ 1680 + G + +RS KFR + I SE AN+ DE++K+KARL RNRESAQ Sbjct: 169 DSGKNSVPKRKKEQDDST--TESRSSKFRRSSICSETANASNDEEEKKKARLMRNRESAQ 226 Query: 1679 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXX 1500 LSRQRKKHYVEELE+K+RSMHS I DL GKIS MAEN +LRQQ G +C Sbjct: 227 
LSRQRKKHYVEELEEKIRSMHSTIQDLTGKISIIMAENANLRQQFGGGGMCPPPHAGMYP 286 Query: 1499 XXXXXXXXXXP---CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKV 1329 Y +KP+GSQVPLVPIP+LKPQ P+SA K K+E+KK ++K+KKV Sbjct: 287 HPSMAPMAYPWVPCAPYVVKPQGSQVPLVPIPRLKPQAPVSAPKVKKTENKKNETKSKKV 346 Query: 1328 ASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSS 1149 SVS LVPFVN++YGG +E VP S + + F + R+LTV Sbjct: 347 VSVSLLGMLSFMFLMGCLVPFVNIKYGGIKETVPGR--SDYISNRFSDMHRRRILTVKDD 404 Query: 1148 HESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKID 969 + G F +R G N+SEPLV SLYVPRNDKLVKID Sbjct: 405 LNGSNYGMGVG---FDDRIHRGSKPLPGSDGYAHS--RNASEPLVASLYVPRNDKLVKID 459 Query: 968 GNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGRN 792 GNLIIHSVLASEKAMA S A K+ K ++S +N RETGL I NL+ A VS Sbjct: 460 GNLIIHSVLASEKAMA-SHAALAKKSPKPSVSLANDVRETGLAIAGNLATAFPVS----- 513 Query: 791 CMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI 612 + ++ DG LQ+WFREGL GP+L SGMCTEVF F++ Sbjct: 514 -------------------------EPTSTDGKLQQWFREGLAGPMLSSGMCTEVFQFDV 548 Query: 611 XXXXXXXXXXV--ANISAKDHKNYTDPTKRRKNRRILDHLPIPLA-ELNATGQEGSRASH 441 ANISA++ +N T K R NRRIL LPIPLA + +EG + Sbjct: 549 SPAPGAIVPVSSVANISAENQQNATHLNKGR-NRRILHGLPIPLAGSTHNITEEGMGRNS 607 Query: 440 SHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCI 261 D N++ S MVVSVLFDPREAGDS+ +GM+ PKS SRIFVVVLLDSVKYVTYSC Sbjct: 608 QKDNFQGSNKNVSSMVVSVLFDPREAGDSDGDGMMGPKSLSRIFVVVLLDSVKYVTYSCG 667 Query: 260 LPFKASSHHLVT 225 LP KAS+ HLVT Sbjct: 668 LPLKASAPHLVT 679 >ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutrema salsugineum] gi|557112529|gb|ESQ52813.1| hypothetical protein EUTSA_v10016317mg [Eutrema salsugineum] Length = 722 Score = 441 bits (1135), Expect = e-121 Identities = 314/738 (42%), Positives = 394/738 (53%), Gaps = 41/738 (5%) Frame = -2 Query: 2315 DFDDLQFPPLDVDYLSNDLMIPEG-LMEELGF--DPDFEFSL-----DNLSFPPENEGFG 2160 DFD + PP D Y S +P G LM +LGF D D EF L D+L FP ENE F Sbjct: 23 
DFDSIPIPPFDQFYHSGSDQVPIGELMSDLGFPVDADGEFELTFDGMDDLYFPAENETFL 82 Query: 2159 SE-GSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAP-------------- 2025 + ++ G G S D SG C+R++P Sbjct: 83 IPVNASNQEQFGDFTPESEGSGISGDSLPKGDADKSTSGCCNRDSPRDSGDRCSGADRTL 142 Query: 2024 --PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 1851 P P+SSQ S C S V N SP +S VVV+ QKVK+EE Sbjct: 143 DLPTPLSSQGSGNCGSDVSEATNESSP--------------KSVNVVVD---QKVKVEEA 185 Query: 1850 GXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQLS 1674 SD +RS K+R + + +A+++ E+D K++ARL RNRESAQLS Sbjct: 186 ATASITKRKKEIEE-DMSDESRSSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQLS 244 Query: 1673 RQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXXX 1506 RQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 245 RQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHHPPPPMGM 304 Query: 1505 XXXXXXXXXXXXPC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKV 1329 PC Y +K +GSQVPL+PIP+LKPQ PL ASK KSESKK ++KTKKV Sbjct: 305 YPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNPLGASKAKKSESKKSEAKTKKV 364 Query: 1328 ASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSS 1149 AS+S L P VNV YGG A + S + ++Q R + +S Sbjct: 365 ASISFLGLLLCLFLFGALAPIVNVNYGGISGAFYGNYRSNYVTDQI--YNQHRDRVLETS 422 Query: 1148 HESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKID 969 ++ N R + GN SEPLV SL+VPRNDKLVKID Sbjct: 423 RSGAGTGVYNSNGMHCGRDCDRGPGKNMSATESSVPPGNGSEPLVASLFVPRNDKLVKID 482 Query: 968 GNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR-- 795 GNLII+S+LASEKA+A SR A S SN + +IP + SPAL + GR Sbjct: 483 GNLIINSILASEKAVA-SRKA----------SESNERKADLVIPKDYSPALPLPGVGRIE 531 Query: 794 ---NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVF 624 +Y S TE+QKALSS S DT K KT +G +Q+WFREG GP+ SGMCTEVF Sbjct: 532 DMAKHLYRSKTEKQKALSSGSADTLKDQIKTKAANGEMQQWFREGGAGPMFSSGMCTEVF 591 Query: 623 HFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEG 456 F++ N+SA+ KN T+ T+ RKNRR L LPIPL ++ N T + Sbjct: 592 QFDVSSTSGAIIPASPATNVSAEHSKNATN-TRSRKNRRTLRGLPIPLPGSDFNFTKE-- 648 
Query: 455 SRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKY 279 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVL+DS KY Sbjct: 649 ----HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLVDSAKY 704 Query: 278 VTYSCILPFKASSHHLVT 225 VTYSC+LP ++ + HLVT Sbjct: 705 VTYSCVLP-RSGAPHLVT 721 >ref|XP_002526200.1| transcription factor hy5, putative [Ricinus communis] gi|223534478|gb|EEF36179.1| transcription factor hy5, putative [Ricinus communis] Length = 702 Score = 440 bits (1132), Expect = e-120 Identities = 323/735 (43%), Positives = 410/735 (55%), Gaps = 36/735 (4%) Frame = -2 Query: 2321 TADFDDLQFPPLDVDYLSN-----DLMIPEGLMEELGFDPDFEFSLDNL---SFPPENEG 2166 T DFD L PPLD +LS + + L L + DF+ + D+L + P +N+ Sbjct: 19 TDDFDSLAIPPLDPMFLSEQSSGENYNLVSDLQFSLDDNYDFDITFDDLVDFNLPSDNDH 78 Query: 2165 FGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPGPVSSQDSAGCR 1986 G D S A G VA +L+ P +S + C Sbjct: 79 --DHGHDRFSIDPKSASPELGISGDHHVATYLN--------------SSPSASNSTTTCS 122 Query: 1985 SVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXE 1806 S N SP S S +G + S VV+ QKVKLEE G + Sbjct: 123 S--GDQLNVSSPVSSQGSGNGGSGVSDSVNFVVD---QKVKLEEEGSNSKNKNGSLSKRK 177 Query: 1805 --SFSDNARSCKFRWAIQSENANS----MPDEKDKRKARLTRNRESAQLSRQRKKHYVEE 1644 + S++ R+ K+R +SEN+N+ + DE +KRKARL RNRESAQLSRQRKKHYVEE Sbjct: 178 KENGSEDTRNQKYR---RSENSNANTQCVSDEDEKRKARLMRNRESAQLSRQRKKHYVEE 234 Query: 1643 LEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSG-AVCXXXXXXXXXXXXXXXXXXXP 1467 LEDKV++MHS IADLN KISFFMAEN +LRQQLS G +C Sbjct: 235 LEDKVKTMHSTIADLNSKISFFMAENATLRQQLSGGNGMC-----PPPMYAPMPYPWVPC 289 Query: 1466 CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXX 1287 Y +K +GSQVPLVPIP+LK Q+P+SA+K KS+ KK + KTKKVASVS Sbjct: 290 APYVVKAQGSQVPLVPIPRLKSQQPVSAAKSKKSDPKKAEGKTKKVASVSFLGLLFFVLL 349 Query: 1286 XXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV----NSSHESGKRELHT 1119 LVP VNV++GG E +G F + F +GRVL V N SHE+ T Sbjct: 350 FGGLVPIVNVKFGGVGENGANG----FVSDKFYNRHRGRVLRVDGHSNGSHENVDVGFST 405 Query: 1118 
G--NSGFRERCTNG---------XXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKI 972 G +S FR +C +G RGN+S+PL SLYVPRNDKLVKI Sbjct: 406 GDFDSCFRIQCGSGRNGCLAEKKGRLEHLPEADELVRRGNNSKPLAASLYVPRNDKLVKI 465 Query: 971 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSPALAVSNDGR 795 DGNLIIHSVLASE+AM+ + +N ++ETGL IP +LSP+ + GR Sbjct: 466 DGNLIIHSVLASERAMSSNEN-----------PEANKSKETGLAIPRDLSPSPTI--PGR 512 Query: 794 -NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHF 618 + +YG E+QKAL+S S DT + K++ DG LQ+WF EGL GP+L SGMC+EVF F Sbjct: 513 YSHLYGHHNERQKALTSGSSDTLNDHKKSAAADGKLQQWFHEGLAGPLLSSGMCSEVFQF 572 Query: 617 EI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGSR 450 + V+NI+A+ +N T+ K+ KNRRIL LPIPL ++LN TG+ Sbjct: 573 DALPTPGAIIPASSVSNITAEGQQNATN-HKKGKNRRILHGLPIPLTGSDLNITGE---H 628 Query: 449 ASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTY 270 +S + GN+S SPMVVSVL DPREAGD E +G+I+PKS SRIFVVVLLDSVKYVTY Sbjct: 629 VGNSQKENFQGNKSVSPMVVSVLVDPREAGDIEVDGVIAPKSISRIFVVVLLDSVKYVTY 688 Query: 269 SCILPFKASSHHLVT 225 SC+LP S LVT Sbjct: 689 SCVLP--RSGPQLVT 701 >ref|XP_006430509.1| hypothetical protein CICLE_v10011169mg [Citrus clementina] gi|557532566|gb|ESR43749.1| hypothetical protein CICLE_v10011169mg [Citrus clementina] Length = 727 Score = 439 bits (1128), Expect = e-120 Identities = 329/753 (43%), Positives = 408/753 (54%), Gaps = 55/753 (7%) Frame = -2 Query: 2315 DFDDLQFPPLDVDYLSNDLMIPEGLMEELGF----DPDFEFSLDNLSFPPENEGFG---- 2160 DFD L PPLD YL++ + P ++L F + DF+F++D+L F E++ F Sbjct: 15 DFDALSIPPLDPPYLNSQIPHPCASSDDLDFFLDDNCDFDFTIDDLYFASEDDTFFLPSE 74 Query: 2159 ----------SEGSDGLSSTVSVAWQNSG---DGSSDDVAGFLSYPSPESGSCDREAPPG 2019 S G DG ++ S +SG + +S DV +L+Y S S +R Sbjct: 75 DPQDGEFGGFSPGVDGGAAAASPGSGSSGILGNPASLDVESYLNYSSSPQNSGNR----- 129 Query: 2018 PVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEE---GG 1848 +S +S G S + SGV S + A SG +VV+ QK+K+EE G Sbjct: 130 -ISHLNSIGISGG-----RSENSGSGVSSDNTDAPSPDSGNLVVD---QKIKMEEVSKKG 180 Query: 1847 
XXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQ 1668 ES S+ R +++N +++ +E+ KRKARL RNRESAQLSRQ Sbjct: 181 IFKRKKDIEETNNESRSNKYRKSSSLSVNEADNDHNLGEEEMKRKARLMRNRESAQLSRQ 240 Query: 1667 RKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAV------CXXXXXXX 1506 RKKHYVEELEDKVR+MHS IADLN KISFFMAEN SL+QQLS Sbjct: 241 RKKHYVEELEDKVRNMHSTIADLNSKISFFMAENASLKQQLSGSNAMPPPLGMYPPPPHM 300 Query: 1505 XXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKLKPQK-----PLSASKPNKSESKKIQSK 1341 Y +KP+GSQVPLVPIP+LKPQ P K + ++SK SK Sbjct: 301 AAAPMPYGWMPCAAPYMVKPQGSQVPLVPIPRLKPQAAAAAVPSRTKKSDGNKSKSDGSK 360 Query: 1340 TKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSG-FGSRFTNGGFDRWSQGRVL 1164 TKKVASVS LVP V+V+YGG R+ V G FGS GF +GRVL Sbjct: 361 TKKVASVSFLGLLFFILLFGGLVPLVDVKYGGIRDGVSGGHFGS-----GFYNQHRGRVL 415 Query: 1163 TV----NSSHESGKRELHTGNSGFRER--CTNG----XXXXXXXXXXXXEVR-GNSSEPL 1017 T+ N S ES G GF R C VR N+SEPL Sbjct: 416 TINGYSNGSGESMGIGFPNGRVGFDNRIHCARAVESKEKESQPAPDSDEFVRPRNASEPL 475 Query: 1016 VVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-I 840 V SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S +N TGL I Sbjct: 476 VASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-----------SHDASKANSKEATGLAI 524 Query: 839 PVNLSPALAV----SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFR 675 P + SPALA+ N R+ Y + E+Q+A+SS S D K + K+S +G LQ+WF+ Sbjct: 525 PKDFSPALAIPDVRGNGARHSHFYRNPAERQRAISSGSTDALKDHMKSSAANGKLQQWFQ 584 Query: 674 EGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDH 501 EGL GP+L SGMCTEVF F+ VAN++A+ +N T R +NRRIL Sbjct: 585 EGLSGPLLSSGMCTEVFQFDASPAPGAIIPASSVANMTAEHRQNATQ-VNRGRNRRILHR 643 Query: 500 LPIPLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSF 321 LP+PL N TG+ ++ S GN+S+S MVVSVL DPRE GD + EGMISPKS Sbjct: 644 LPVPLT--NFTGERKAQKE-----SFAGNKSASSMVVSVLVDPRETGDGDVEGMISPKSL 696 Query: 320 SRIFVVVLLDSVKYVTYSCILPFKASSHHLVTN 222 SRIFVVVLLDSVKYVTYSC LP S HLVT+ Sbjct: 697 SRIFVVVLLDSVKYVTYSCGLP--RSGLHLVTS 727 >ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thaliana] gi|20196934|gb|AAB86455.2| bZIP family transcription 
factor [Arabidopsis thaliana] gi|330254811|gb|AEC09905.1| Basic-leucine zipper (bZIP) transcription factor family protein [Arabidopsis thaliana] Length = 721 Score = 439 bits (1128), Expect = e-120 Identities = 312/737 (42%), Positives = 403/737 (54%), Gaps = 39/737 (5%) Frame = -2 Query: 2318 ADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF--- 2163 +DFD + PPLD D+ S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPLD-DHFSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESFLIP 81 Query: 2162 ---GSEGSDGLSSTVSVAWQNSGD-------GSSDDVAGFLSYPSPE------SGSCDRE 2031 ++ G + S + SGD + +G ++ SP SG+ Sbjct: 82 INTSNQEQFGDFTPESESSGISGDCIVPKDADKTITTSGCINRESPRDSDDRCSGADHNL 141 Query: 2030 APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 1851 P P+SSQ S C S V N SP S R+ AV +QKVK+EE Sbjct: 142 DLPTPLSSQGSGNCGSDVSEATNESSPKS------------RNVAV-----DQKVKVEEA 184 Query: 1850 GXXXXXXXXXXXXXES-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQL 1677 + +D +R+ K+R + + +A+++ E+D K++ARL RNRESAQL Sbjct: 185 ATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQL 244 Query: 1676 SRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXX 1509 SRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 245 SRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMG 304 Query: 1508 XXXXXXXXXXXXXPC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKK 1332 PC Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KTKK Sbjct: 305 MYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKTKK 364 Query: 1331 VASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS 1152 VAS+S L P VNV YGG A + S + +SQ R +++ Sbjct: 365 VASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQI--YSQHRDRVLDT 422 Query: 1151 SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKI 972 S + N R R ++ GN SEPLV SL+VPRNDKLVKI Sbjct: 423 SRSGAGTGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKI 482 Query: 971 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR- 795 DGNLII+S+LASEKA+A SR AS K K + +I + +PAL + + GR Sbjct: 483 
DGNLIINSILASEKAVA-SRKASESKERKADL----------MISKDYTPALPLPDVGRT 531 Query: 794 ----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEV 627 +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ SGMCTEV Sbjct: 532 EELAKHLYRSKAEKQKALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEV 591 Query: 626 FHFEIXXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGS 453 F F++ N+SA+ KN TD T +++NRRIL LPIPL ++ N T + Sbjct: 592 FQFDVSSTSGAIIPAATNVSAEHGKNTTD-THKQQNRRILRGLPIPLPGSDFNLTKE--- 647 Query: 452 RASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKYV 276 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVLLDS KYV Sbjct: 648 ---HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLLDSAKYV 704 Query: 275 TYSCILPFKASSHHLVT 225 TYSC+LP ++ + HLVT Sbjct: 705 TYSCVLP-RSGAPHLVT 720 >gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding protein [Arabidopsis thaliana] gi|23198400|gb|AAN15727.1| putative TGACG-sequence-specific bZIP DNA-binding protein [Arabidopsis thaliana] Length = 721 Score = 438 bits (1127), Expect = e-120 Identities = 311/737 (42%), Positives = 403/737 (54%), Gaps = 39/737 (5%) Frame = -2 Query: 2318 ADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF--- 2163 +DFD + PPLD D+ S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPLD-DHFSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESFLIP 81 Query: 2162 ---GSEGSDGLSSTVSVAWQNSGD-------GSSDDVAGFLSYPSPE------SGSCDRE 2031 ++ G + S + SGD + +G ++ SP SG+ Sbjct: 82 INTSNQEQFGDFTPESESSGISGDCIVPKDADKTITTSGCINRESPRDSDDRCSGADHNL 141 Query: 2030 APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEG 1851 P P+SSQ S C S V N SP S R+ AV +QKVK+EE Sbjct: 142 DLPTPLSSQGSGNCGSDVSEATNESSPKS------------RNVAV-----DQKVKVEEA 184 Query: 1850 GXXXXXXXXXXXXXES-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESAQL 1677 + +D +R+ K+R + + +A+++ E+D K++ARL RNRESAQL Sbjct: 185 ATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKRARLMRNRESAQL 244 Query: 1676 SRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXXXX 1509 SRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C 
Sbjct: 245 SRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMG 304 Query: 1508 XXXXXXXXXXXXXPC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKK 1332 PC Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KTKK Sbjct: 305 MYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKTKK 364 Query: 1331 VASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS 1152 VAS+S L P VNV YGG A + S + +SQ R +++ Sbjct: 365 VASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQI--YSQHRDRVLDT 422 Query: 1151 SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKI 972 S + N R R ++ GN SEPLV SL+VPRNDKLVKI Sbjct: 423 SRSGAGTGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKI 482 Query: 971 DGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGR- 795 DGNL+I+S+LASEKA+A SR AS K K + +I + +PAL + + GR Sbjct: 483 DGNLVINSILASEKAVA-SRKASESKERKADL----------MISKDYTPALPLPDVGRT 531 Query: 794 ----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEV 627 +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ SGMCTEV Sbjct: 532 EELAKHLYRSKAEKQKALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEV 591 Query: 626 FHFEIXXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGS 453 F F++ N+SA+ KN TD T +++NRRIL LPIPL ++ N T + Sbjct: 592 FQFDVSSTSGAIIPAATNVSAEHGKNTTD-THKQQNRRILRGLPIPLPGSDFNLTKE--- 647 Query: 452 RASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVVLLDSVKYV 276 H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVVLLDS KYV Sbjct: 648 ---HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVVLLDSAKYV 704 Query: 275 TYSCILPFKASSHHLVT 225 TYSC+LP ++ + HLVT Sbjct: 705 TYSCVLP-RSGAPHLVT 720 >ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Capsella rubella] gi|482562470|gb|EOA26660.1| hypothetical protein CARUB_v10022722mg [Capsella rubella] Length = 725 Score = 437 bits (1124), Expect = e-119 Identities = 316/746 (42%), Positives = 402/746 (53%), Gaps = 48/746 (6%) Frame = -2 Query: 2318 ADFDDLQFPPLDVDYL---SNDLMIPEGLMEELGFDPDFEFSL-----DNLSFPPENEGF 2163 +DFD + PP D + S+ I E LM +LGF PD EF L D+L FP ENE F Sbjct: 26 
SDFDSISIPPFDDQFYHPGSDQTPIGE-LMSDLGF-PDGEFELTFDGMDDLYFPAENESF 83 Query: 2162 GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPE-------SGSCDREAP------- 2025 L + + + GD + D +S + SG +RE+P Sbjct: 84 -------LIPVNTSSQEQFGDFTPDSEGSGISGDPKDVFKNITTSGCSNRESPRDSDDRC 136 Query: 2024 ---------PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQ 1872 P P+SSQ S C S V N SP +S VVV+ Q Sbjct: 137 SGADPSLDLPTPLSSQGSGNCASDVSEATNESSP--------------KSRNVVVD---Q 179 Query: 1871 KVKLEEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSE-NANSMPDEKD-KRKARLTR 1698 KVK+EE E S +RS K+R + + + +A+++ E+D K+KARL R Sbjct: 180 KVKVEEAATTTSITKRKKEIEEDLSGESRSSKYRRSGEEDIDASAVTGEEDEKKKARLMR 239 Query: 1697 NRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC--- 1527 NRESAQLSRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 240 NRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPH 299 Query: 1526 -XXXXXXXXXXXXXXXXXXXPC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKK 1353 PC Y +K +GSQVPL+PIP+LKPQ PL SK KSESKK Sbjct: 300 HPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNPLGTSKAKKSESKK 359 Query: 1352 IQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQG 1173 ++KTKKVAS+S L P VNV YGG A + + +SQ Sbjct: 360 SEAKTKKVASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRPNYITDQI--YSQH 417 Query: 1172 RVLTVNSSHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPR 993 R +++S + N R ++ GN SEPLV SL+VPR Sbjct: 418 RDRVLDTSRSGAGTGVSNSNGMDCGRDSDRGTRNNISATESSVPPGNGSEPLVASLFVPR 477 Query: 992 NDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALA 813 NDKLVKIDGNLII+S+LASEKA+A SR A S SN + +IP + SPAL Sbjct: 478 NDKLVKIDGNLIINSILASEKAVA-SRKA----------SESNERKADLVIPKDYSPALP 526 Query: 812 VSNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILG 648 + + GR +Y S TE+QKALSS S D+ K KT +G +Q+WFREG+ GP+ Sbjct: 527 LPDVGRTEEMAKHLYRSKTEKQKALSSGSADSLKDQFKTKAANGEMQQWFREGVAGPMFS 586 Query: 647 SGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AE 480 SGMCTEVF F++ N+SA+ KN TD T++RKNRRIL LPIPL ++ Sbjct: 587 SGMCTEVFQFDVSSTSGAIIPASPATNVSAEHSKNTTD-TRKRKNRRILRGLPIPLPGSD 
645 Query: 479 LNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVV 303 N T + H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVV Sbjct: 646 FNLTKE------HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVV 699 Query: 302 VLLDSVKYVTYSCILPFKASSHHLVT 225 VLLDS KYVTYSC+LP ++ + HLVT Sbjct: 700 VLLDSAKYVTYSCVLP-RSGAPHLVT 724 >gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1B [Morus notabilis] Length = 797 Score = 431 bits (1107), Expect = e-117 Identities = 324/803 (40%), Positives = 413/803 (51%), Gaps = 102/803 (12%) Frame = -2 Query: 2327 DLTADFDDLQFPPLDVDYL-SNDLMIPEGLMEELGF------DPDFEFS--LDNLSFPPE 2175 D +A+F+ L PPLD + S+D + E +LG D DF F D+L P E Sbjct: 21 DFSAEFEPLSIPPLDHQFFSSDDAALREDFFSDLGLGLEENCDYDFTFDDIGDDLYLPSE 80 Query: 2174 NEGF----GSE-GSDGLS-------------STVSVAWQNSGDGSSD---------DVAG 2076 E F G + G + LS S VA +++ S DVAG Sbjct: 81 TEEFLIPDGLDIGPNSLSPNGTNSDRDVNPISEADVAAKSASPESESSTVSGVRDYDVAG 140 Query: 2075 FLSYPSPESGSCDREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSG----------VHSQ- 1929 FL+ S ESG C+ E S++ A +S +DG +SPSPD G V SQ Sbjct: 141 FLNCQSSESGGCNSE------YSRNLADRKSKIDGVMDSPSPDCGNCDQECSGEAVSSQG 194 Query: 1928 -----SGPASDVRSGAVVVEDD---------EQKVKLEEGGXXXXXXXXXXXXXESFSDN 1791 SG + S A D +QKVK+EE G + Sbjct: 195 SGNCGSGVSEGANSPAHSGNSDKDVSSCVFVDQKVKVEEVGKNYMSKRKKEPEEG--NAE 252 Query: 1790 ARSCKF-RWAIQSENA------NSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDK 1632 +R+ K+ R + +EN N + DE++KRKARL RNRESAQLSRQRKKHYVEELEDK Sbjct: 253 SRTPKYRRSSAPAENTHSQSTLNPLSDEEEKRKARLMRNRESAQLSRQRKKHYVEELEDK 312 Query: 1631 VRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC-----XXXXXXXXXXXXXXXXXXXP 1467 +RSM+S I DLN +IS+ M EN SLRQQLS G +C Sbjct: 313 LRSMNSTITDLNSRISYIMVENASLRQQLSGGGICPPPPPTPGMYPHPPMGPMPYPWVPY 372 Query: 1466 CGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQ-SKTKKVASVSXXXXXXXXX 1290 Y +KP+GSQVPLVPIP+LKPQ+ +SASK KSE KK + KTKKVAS+S Sbjct: 373 APYVVKPQGSQVPLVPIPRLKPQQTVSASKAKKSEGKKSEGGKTKKVASISFLGLLFFVF 432 Query: 1289 XXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNS-------------- 1152 LVP VNV +GG P G +T+G +G 
VLT + Sbjct: 433 LFGGLVPMVNVNFGGLTNNAPGGL--VYTSGRLYDQHRGSVLTADHLLNGSGENMRVGSF 490 Query: 1151 ---SHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKL 981 HE G+ + G +ER + GN SEPLV SLYVPRNDKL Sbjct: 491 NSVQHERGREQGEKLECGEKERGSQALPGSGEFIRL-----GNDSEPLVASLYVPRNDKL 545 Query: 980 VKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAV--- 810 VKIDGNLIIHSVLASEKA A + +T+++ I +++P+ AV Sbjct: 546 VKIDGNLIIHSVLASEKAKASLAHSEMKSKTETSLA----------IARDVAPSYAVPEV 595 Query: 809 -SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMC 636 N GR+ +Y + E+ KALSS + D K+S DG LQ+WFREGL GP+L SGMC Sbjct: 596 GGNRGRHAPLYRNPVERHKALSSGATDATNDRLKSSAADGKLQQWFREGLAGPMLSSGMC 655 Query: 635 TEVFHFEIXXXXXXXXXXVA----NISAKDHKNYTDPTKRRK--NRRILDHLPIPLAELN 474 TEVF F++ A N+SAK +N T +R K NRRIL LP PL++ N Sbjct: 656 TEVFQFDVSPASTSGAIVPASSISNVSAKQRQNTTQNGRRLKGVNRRILRRLPAPLSDSN 715 Query: 473 ATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLL 294 E + + G+R+ S MVVSVL DPREAGD++ +G++ PKS SRIFVVVL+ Sbjct: 716 FNISEERTSRNLRKDEFQGSRNVSSMVVSVLVDPREAGDNDVDGVMKPKSLSRIFVVVLM 775 Query: 293 DSVKYVTYSCILPFKASSHHLVT 225 DSV+YVTYSC+LP S HLVT Sbjct: 776 DSVRYVTYSCVLP--RSGPHLVT 796 >ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629395 [Citrus sinensis] Length = 719 Score = 428 bits (1101), Expect = e-117 Identities = 326/747 (43%), Positives = 407/747 (54%), Gaps = 49/747 (6%) Frame = -2 Query: 2315 DFDDLQFPPLDVDYLSNDLMIPEGLMEELGF----DPDFEFSLDNLSFPPENEGF----- 2163 DFD L PPLD YL++ + P ++L F + DF+F++D+L F E++ F Sbjct: 15 DFDALSIPPLDPPYLNSQIPHPCASSDDLDFVLDDNCDFDFTIDDLYFASEDDTFFLPSE 74 Query: 2162 ----GSEGS-----DGLSSTVSVAWQNSG---DGSSDDVAGFLSYPSPESGSCDREAPPG 2019 G G DG ++ VS +SG + +S DV +L+Y S S +R Sbjct: 75 DPHDGQFGDFSPDVDGGAAAVSPGSGSSGILGNPASLDVESYLNYSSSPQNSGNR----- 129 Query: 2018 PVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEE---GG 1848 +S + G V G S + SGV S + SG +VV+ QK+K+EE G Sbjct: 130 -ISHLNYIG----VSGG-RSENSGSGVSSDNTDDPSPDSGNLVVD---QKIKMEEVSKKG 180 Query: 1847 
XXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQ 1668 ES S+ R +++N +++ +E+ KRKARL RNRESAQLSRQ Sbjct: 181 IFKRKKDIEETNNESRSNKYRKSSSLSVNEADNDHNLGEEEMKRKARLMRNRESAQLSRQ 240 Query: 1667 RKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAV------CXXXXXXX 1506 RKKHYVEELEDKVR+MHS IADLN KISFFMAEN SL+QQLS Sbjct: 241 RKKHYVEELEDKVRNMHSTIADLNSKISFFMAENASLKQQLSGSNAMPPPLGMYPPPPHM 300 Query: 1505 XXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVA 1326 Y +KP+GSQVPLVPIP+LKPQ +A+ P +++ K SKTKKVA Sbjct: 301 AAAPMPYGWMPCAAPYMVKPQGSQVPLVPIPRLKPQ--AAAAVPPRTK-KSDGSKTKKVA 357 Query: 1325 SVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV---- 1158 SVS LVP V+V+YGG R+ V G+ S GF +GRVLT+ Sbjct: 358 SVSFLGLLFFILLFGGLVPLVDVKYGGIRDGVSGGYFS----SGFYNQHRGRVLTINGYS 413 Query: 1157 NSSHESGKRELHTGNSGFRER--CTNG----XXXXXXXXXXXXEVR-GNSSEPLVVSLYV 999 N S ES G GF R C VR N+SEPLV SLYV Sbjct: 414 NGSGESMGIGFPNGRVGFDNRIHCARAVESKEKESQPAPDSDEFVRPRNASEPLVASLYV 473 Query: 998 PRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNLSP 822 PRNDKLVKIDGNLIIHSVLA EKAMA S +N TGL IP + SP Sbjct: 474 PRNDKLVKIDGNLIIHSVLAGEKAMA-----------SHDASKANSKEATGLAIPKDFSP 522 Query: 821 ALAV----SNDGRNC-MYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGP 657 ALA+ N R+ Y + E+Q+A+SS S D K + K+S +G LQ+WF+EGL GP Sbjct: 523 ALAIPDVRGNGARHSHFYRNPAERQRAISSGSTDALKDHMKSSAANGKLQQWFQEGLSGP 582 Query: 656 ILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPLA 483 +L SGMCTEVF F+ VAN++A+ +N T R +NRRIL LP+PL Sbjct: 583 LLSSGMCTEVFQFDASPAPGAIIPASSVANMTAEHRQNATQ-VNRGRNRRILHRLPVPLT 641 Query: 482 ELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVV 303 N TG+ + S GN+S+S MVVSVL DPRE GD + EGMISPKS SRIFVV Sbjct: 642 --NITGERKVQKE-----SFAGNKSASSMVVSVLVDPRETGDGDVEGMISPKSLSRIFVV 694 Query: 302 VLLDSVKYVTYSCILPFKASSHHLVTN 222 VLLDSVKYVTYSC LP S HLVT+ Sbjct: 695 VLLDSVKYVTYSCGLP--RSGLHLVTS 719 >ref|XP_002323223.2| bZIP transcription factor family protein [Populus trichocarpa] gi|550320719|gb|EEF04984.2| bZIP transcription factor 
family protein [Populus trichocarpa] Length = 640 Score = 424 bits (1091), Expect = e-116 Identities = 302/717 (42%), Positives = 383/717 (53%), Gaps = 24/717 (3%) Frame = -2 Query: 2303 LQFPPLDVDYLS-----NDLMIPEGLMEELGFDPDFEFSLDNLS---FPPENEGFGSEGS 2148 L PPLD + + ND + L + DF+ + D+L+ FP ENE F Sbjct: 9 LPTPPLDPLFFNQNSDQNDNLNVPDLSSDFEDMSDFDITFDDLTDLYFPSENEQFLIP-- 66 Query: 2147 DGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCD------REAPPGPVSSQDSAGCR 1986 D +S S GD +V +L+ E+GSCD R + GP SS S Sbjct: 67 DNNASPESGGSGICGDQGGLEVDKYLNPSPSEAGSCDSGGSDSRSSDLGPASSHGSGNSG 126 Query: 1985 SVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXE 1806 S + G G DV + + + V + GG Sbjct: 127 S-------GRKKEMG----DGENGDVMRNFKSRKAEGEDVSVNVGGG------------- 162 Query: 1805 SFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVR 1626 + SE E++KR+ARL RNRESA LSRQRKKHYVEELEDKVR Sbjct: 163 -------------VVSSE-------EEEKRRARLVRNRESAHLSRQRKKHYVEELEDKVR 202 Query: 1625 SMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXXXXXXXXXXXXPCGYPMKP 1446 +MHS IADLNGK+S+FMAEN +LRQQL+ + C Y +KP Sbjct: 203 AMHSTIADLNGKVSYFMAENATLRQQLNGNSAC----PPPMYAPMAPYPWVPCAPYVVKP 258 Query: 1445 RGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPF 1266 +GSQVPLVPIP+LKPQ+ + +K K ESKK + KTKKVASVS L P Sbjct: 259 QGSQVPLVPIPRLKPQQAVPMAKTKKVESKKGEGKTKKVASVSLIGLVFFILLFGGLAPM 318 Query: 1265 VNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKRELH-TGNSGFRERCT 1089 V+V++GG RE+ SGFG F + F +GRVL V+ H +G E H + N G E Sbjct: 319 VDVKFGGVRESGISGFG--FGSERFLDQHKGRVLIVD-GHSNGSHENHDSANKGAAEHLP 375 Query: 1088 NGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRT 909 GN+SE LV SLYVPRNDKLVKIDGNLIIHS+LASE+AMA Sbjct: 376 GSDEFGQF---------GNASEQLVASLYVPRNDKLVKIDGNLIIHSILASERAMA---- 422 Query: 908 ASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSS 744 + E+ + + ALA+ + G N +Y + E+QKAL+S Sbjct: 423 ----------------SHESPEVNITKQTALAIPDVGNNRGRHSHVYRTHAERQKALASG 466 Query: 743 SGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANI 570 S DT K N K+S G 
LQ+WFREGL GP+L SGMCTEVF F++ VAN+ Sbjct: 467 SADTSKDNLKSSAAKGKLQQWFREGLAGPLLSSGMCTEVFQFDVSPTPGAIVPASSVANV 526 Query: 569 SAKDHKNYTDPTKRRKNRRILDHLPIPLA--ELNATGQEGSRASHSHDGSSNGNRSSSPM 396 +A+ KN + + +NRRIL LPIPLA +LN TG+ R +H S GN+S SPM Sbjct: 527 TAEHQKNNSTRLNKGRNRRILRGLPIPLAGSDLNITGEHVGRKTHKE--SFQGNKSVSPM 584 Query: 395 VVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 225 VVSVL DPREAGDS+ +G+I+PKS SRIFVVVL+DS+KYVTYSC+LP + HLVT Sbjct: 585 VVSVLVDPREAGDSDVDGVITPKSLSRIFVVVLVDSIKYVTYSCVLP--SIGPHLVT 639 >ref|XP_002881751.1| bZIP transcription factor family protein [Arabidopsis lyrata subsp. lyrata] gi|297327590|gb|EFH58010.1| bZIP transcription factor family protein [Arabidopsis lyrata subsp. lyrata] Length = 724 Score = 424 bits (1091), Expect = e-116 Identities = 316/745 (42%), Positives = 399/745 (53%), Gaps = 47/745 (6%) Frame = -2 Query: 2318 ADFDDLQFPPLDVD-YLSNDLMIPEG-LMEELGFDPDFEFSL-----DNLSFPPENEGF- 2163 +DFD + PP D Y S P G LM +LGF PD EF L D+L FP ENE F Sbjct: 25 SDFDSISIPPFDDHFYHSGSDHTPIGELMSDLGF-PDGEFELTFDGMDDLYFPAENESFL 83 Query: 2162 ----------------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGS-CDR 2034 SEGS G+S V +++ S +G ++ S + S DR Sbjct: 84 IPVNTSNQEQFGDFTPESEGS-GISGDCPVLPKDAD--KSITTSGCINRDSDDRCSGADR 140 Query: 2033 EAP-PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLE 1857 P P+SSQ S C S V N SP +S VVV+ QKVK+E Sbjct: 141 SLDLPTPLSSQGSGNCGSDVSEATNESSP--------------KSRNVVVD---QKVKVE 183 Query: 1856 EGGXXXXXXXXXXXXXES-FSDNARSCKFRWAIQSENANSMPDEKD-KRKARLTRNRESA 1683 E + +D +R+ K+R + + +A+++ E+D K+KARL RNRESA Sbjct: 184 EAATTTSIITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKKARLMRNRESA 243 Query: 1682 QLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC----XXXX 1515 QLSRQRKKHYVEELE+KVR+MHS I DLNGKIS+FMAEN +LRQQL +C Sbjct: 244 QLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHIPPPP 303 Query: 1514 XXXXXXXXXXXXXXXPC-GYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKT 1338 PC Y +K +GSQVPL+PIP+LKPQ L SK KSESKK ++KT Sbjct: 304 
MGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNTLGTSKAKKSESKKSEAKT 363 Query: 1337 KKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTV 1158 KKVAS+S L P VNV YGG A + S + + RVL Sbjct: 364 KKVASISFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQIYSQHRDRVLDT 423 Query: 1157 NSSHE----SGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRN 990 + S S +H G R N GN SEPLV SL+VPRN Sbjct: 424 SRSGTGTGVSNSNGMHCGRDSDRGARKN------ISATESSVPPGNGSEPLVASLFVPRN 477 Query: 989 DKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAV 810 DKLVKIDGNLII+S+LASE+A+AL R AS K K + +I + SPAL + Sbjct: 478 DKLVKIDGNLIINSILASERAVAL-RKASESKERKADL----------VISKDYSPALPL 526 Query: 809 SNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGS 645 + G+ +Y S E+QKALSS S DT K KT +G +Q+WFREG+ GP+ S Sbjct: 527 PDVGKTEEMAKHLYRSKAEKQKALSSGSTDTLKDQFKTKAANGEMQQWFREGVAGPMFSS 586 Query: 644 GMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AEL 477 GMCTEVF F++ N+S + KN TD T ++KNRRIL LPIPL ++ Sbjct: 587 GMCTEVFQFDVSSTSGAIIPASPATNVSTEHGKNTTD-THKQKNRRILRGLPIPLPGSDF 645 Query: 476 NATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMI-SPKSFSRIFVVV 300 N T + H + SS + +S MVVSVL DPRE GD + +GMI PKS SR+FVVV Sbjct: 646 NLTKE------HQRNSSSKEIKPASSMVVSVLVDPREGGDGDIDGMIGGPKSLSRVFVVV 699 Query: 299 LLDSVKYVTYSCILPFKASSHHLVT 225 LLDS KYVTYSC+LP ++ + HLVT Sbjct: 700 LLDSAKYVTYSCVLP-RSGAPHLVT 723 >ref|XP_007028261.1| Transcription factor hy5, putative [Theobroma cacao] gi|508716866|gb|EOY08763.1| Transcription factor hy5, putative [Theobroma cacao] Length = 687 Score = 424 bits (1089), Expect = e-115 Identities = 308/749 (41%), Positives = 391/749 (52%), Gaps = 32/749 (4%) Frame = -2 Query: 2375 SSVPGDTITGIGSSNGDLTADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLD 2196 + P +T+ G ++ + L PPLD YLS DL L DF+ + D Sbjct: 2 AEAPAETVMG---------SELESLAIPPLDPLYLSTDLGF------SLDDHDDFQITFD 46 Query: 2195 NLS---FPPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAP 2025 + FP ++E + +S DV +L+ SPE GSC+ Sbjct: 47 
DFDQFCFPSDSE--------------HLLIPDSSTTPDSDVERYLNSSSPELGSCNGPDS 92 Query: 2024 PG----PVSSQDSAGCRSVVDGFFNSPSPDS-GVHSQSGPASDVRSGAVVVEDDEQKVKL 1860 G P+SS S C S V N+ SPDS + Q ++ V +++ + Sbjct: 93 SGNSHSPLSSSGSGNCASAVSEAMNATSPDSENIVDQKISVEEIGKRRVSKRKKDRE-ET 151 Query: 1859 EEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQ 1680 + S SDN + N+N+ +E++KR+ARL RNRESAQ Sbjct: 152 DSSKCRRSSLTPSVNNSNSNSDNNNN---------NNSNAPSEEEEKRRARLMRNRESAQ 202 Query: 1679 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSS--------GAVCX 1524 LSRQRKKHYVEELEDKVR+MHS IADLN KI++FMAEN +LRQQLS+ GAV Sbjct: 203 LSRQRKKHYVEELEDKVRTMHSTIADLNNKIAYFMAENATLRQQLSTAGGGGGGGGAVMC 262 Query: 1523 XXXXXXXXXXXXXXXXXXPCG--YPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKI 1350 PC Y MKP GSQVPLVPIP+LKPQ+P + S++KK Sbjct: 263 PPQPLPMPMYPPMAYPWVPCAPPYVMKPPGSQVPLVPIPRLKPQQPPVPA----SKAKKN 318 Query: 1349 QSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGR 1170 +SKTKKVASVS L P VN RY + GS F GF +GR Sbjct: 319 ESKTKKVASVSLLGMLFFILLFGGLAPIVNDRYDN------TPVGSGFVGDGFYEVHRGR 372 Query: 1169 VLTV----NSSHESGKRELHTGNSGFRER-----CTNGXXXXXXXXXXXXEVRGNSSEPL 1017 VL V N S+ S G R R +G N EPL Sbjct: 373 VLRVDGHLNGSNNSRDVAFSYGKFDRRNRVHGRGSESGVEQKEKGAHSVPGYMSNGGEPL 432 Query: 1016 VVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGLIP 837 SLYVPRNDKLVKIDGNLIIHSVLASEKAMA S AS +KN++T ++ IP Sbjct: 433 TASLYVPRNDKLVKIDGNLIIHSVLASEKAMA-SHKASQIKNEETGLA----------IP 481 Query: 836 VNLSPALAV----SNDG-RNCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFRE 672 N SPALA+ N G R+ Y + E+Q ALSS + D K + K++ DG +Q+WFRE Sbjct: 482 NNFSPALAIPDARENGGKRSREYRNPAERQMALSSGNADALKDHFKSTVADGKMQQWFRE 541 Query: 671 GLEGPILGSGMCTEVFHFEIXXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPI 492 GL GP+L SGMCTEVF F++ V N+SA+ H+N T K R NRRIL P+ Sbjct: 542 GLAGPMLSSGMCTEVFQFDV-SAAIVPASSVTNVSAEHHQNATRHNKGR-NRRILHGHPV 599 Query: 491 PLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRI 312 PL+ + E +S + GN+++S MVVSVLFDPREAGD + + MI+PK SRI Sbjct: 600 
PLSRSDVNITEQHVGRNSPKENFKGNKTASSMVVSVLFDPREAGDGDIDDMIAPKPLSRI 659 Query: 311 FVVVLLDSVKYVTYSCILPFKASSHHLVT 225 FVVVL+DSVKYVTYSC+LP HL+T Sbjct: 660 FVVVLVDSVKYVTYSCMLPLPGL--HLMT 686 >ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127362 [Glycine max] Length = 784 Score = 423 bits (1088), Expect = e-115 Identities = 316/768 (41%), Positives = 409/768 (53%), Gaps = 66/768 (8%) Frame = -2 Query: 2330 GDLTADFDDLQFPPLDVDYLSND-LMIPEGLMEELGFDPDFEFS--------LDNLSFPP 2178 GD +++F+ P +D + + D L L + FD + EF LD++ P Sbjct: 43 GDFSSNFNAFLIPSMDSLFNTTDALPFASDLEFGMDFDNNGEFEITFDDLDELDDIFIPS 102 Query: 2177 ENEGF-----------------GSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPES 2049 + E F ++ SD S VS SG+G S D S PSPE+ Sbjct: 103 DAEDFLLPDVCNSNYDSASPPIDAKNSDSPDSDVSAV---SGEGDSADNVRVSSVPSPEA 159 Query: 2048 GSCDRE-APPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQ---SGPASDVRSGAVVVED 1881 CDRE + GPVSSQ S S V +SPSPDSG + + S A V + V +E+ Sbjct: 160 EFCDREESSNGPVSSQGSGNGGSGVYEAMHSPSPDSGPYERDITSSHAHAVTNNGVKMEE 219 Query: 1880 D---EQKVKLEEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENA-NSMPDEKDKRK 1713 + K K E FS + + QS++ N + DE +KRK Sbjct: 220 TPAFDLKRKKES-------CDGSATKHRRFSSSVENNNNNTEKQSQSGLNGIDDEDEKRK 272 Query: 1712 ARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGA 1533 ARL RNRESAQLSRQRKKHYVEELE+KVRS++S+IAD++ K+S+ +AEN +LRQQ+ + Sbjct: 273 ARLMRNRESAQLSRQRKKHYVEELEEKVRSLNSIIADMSSKMSYVVAENATLRQQVGAAG 332 Query: 1532 VCXXXXXXXXXXXXXXXXXXXPCGYP--------MKPRGSQVPLVPIPKLKPQKPLSASK 1377 V P YP +KP+GSQVPLVPIP+LKPQ+P SA K Sbjct: 333 VMCPPPPAPAPGMYPHHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQPASAPK 392 Query: 1376 PNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNG 1197 KSE+KK + KT KVAS+S LVP V+ R+GG E VP S + + Sbjct: 393 GKKSENKKSEGKTTKVASISLLGLFFFIMLFGGLVPLVDFRFGGLVENVPGTGRSNYVSD 452 Query: 1196 GFDRWSQGRVLTVNSSHESGKRELHTGNSGF------------RERCTNGXXXXXXXXXX 1053 G+V ++N +R+ G S R R Sbjct: 453 RVYGQGGGKVWSLNGRRNGSERDEDVGFSNGGRFSVSDRVNYERGRNFREERHDRRKGSD 512 Query: 1052 
XXEVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTIS 873 +GN+SEPLV SLYVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA K DK Sbjct: 513 DFGRQGNASEPLVASLYVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAE-AKKDK---- 566 Query: 872 SSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNSKT 711 RETGL IP +L ALA+ GR+ +Y + EQ+KAL S S K + K+ Sbjct: 567 -----RETGLAIPKDLDSALAIPGVGRSRGQHPHVYSVSPEQRKALGSGSTKVLKDHMKS 621 Query: 710 SNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDP 537 S DG +Q+WFREGL GP+L SGMCTEVF F++ VAN+S ++ +N T Sbjct: 622 SVTDGKMQQWFREGLVGPMLSSGMCTEVFQFDVSPSPGAIVPATSVANVSTENRQNATS- 680 Query: 536 TKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREA 363 K+ +NRR L LP PL + LN T + H +GN+SS MVVSVL DP+EA Sbjct: 681 VKKTRNRRTLHELPEPLNGSSLNITEERVKNLQKDH---LHGNKSS--MVVSVLVDPKEA 735 Query: 362 GDS--ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 225 GD + +GM+ PKS SRIFVVVL+DSVKYVTYSC LP +S HLVT Sbjct: 736 GDGDVDVDGMMRPKSLSRIFVVVLIDSVKYVTYSCGLP--RASPHLVT 781 >ref|XP_003521109.2| PREDICTED: uncharacterized protein LOC100101871 [Glycine max] Length = 812 Score = 418 bits (1075), Expect = e-114 Identities = 316/770 (41%), Positives = 415/770 (53%), Gaps = 49/770 (6%) Frame = -2 Query: 2387 TMDASSVPGDTITGI--GSSNGDLTADFDDLQ--FPPLDVDYLSNDLMIPEGLMEELGFD 2220 T D P D G+ ++NG+ FDDL + P D + D ++P+ + Sbjct: 79 TTDGLPFPSDLEFGMDFNNNNGEFEITFDDLDDIYIPSDAE----DFLLPDAC------N 128 Query: 2219 PDFEF---SLDNLSFPPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPES 2049 P++ +D+ S + + DG+S + Q S S+D+V S PSPE+ Sbjct: 129 PNYASVSPPIDDSSAKNSDSDASAVSGDGVSRFFNS--QVSESDSADNVR-VPSVPSPEA 185 Query: 2048 GSCDRE-APPGPVSSQDSAGCRSVVDGFFNSPSPDSG--------VHSQSGPASDVRSGA 1896 C+RE + GPVSSQ S S V +SPSPDSG H+ + + V+ Sbjct: 186 EFCEREESSNGPVSSQGSGNGGSGVYEAMHSPSPDSGPYERDITSFHAHAATNNGVKMEE 245 Query: 1895 VVVEDDEQKVKLEEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKDKR 1716 V D ++K EG S +N + K QS+ N + DE +KR Sbjct: 246 VPAFDLKRKKGSCEGSATKHRRFS------SSVENNNNNKTEKQFQSD-LNGIEDEDEKR 298 Query: 1715 
KARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQL--S 1542 KARL RNRESAQLSRQRKKHYVEELE+KVRS++S+IAD++ K+S+ +AE +LRQQ+ + Sbjct: 299 KARLMRNRESAQLSRQRKKHYVEELEEKVRSLNSIIADMSSKMSYMVAEIATLRQQVGAA 358 Query: 1541 SGAVC-------XXXXXXXXXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKLKPQKPLSA 1383 +G +C Y +KP+GSQVPLVPIP+LKPQ+P SA Sbjct: 359 AGVMCPPPPPPAPGMYPHHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQPASA 418 Query: 1382 SKPNKSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFT 1203 K KSESKK + KTKKVAS+S LVP V+ R+GG + VP S + Sbjct: 419 PKSKKSESKKSEGKTKKVASISLLGLFFFIMLFGGLVPVVDFRFGGLVDNVPGTGSSNYV 478 Query: 1202 NGGFDRWSQGRVLTVNSSHESGKRELHTGNSGFRERCTNGXXXXXXXXXXXXEVR----- 1038 + G+V ++N R+ G S R ++ E R Sbjct: 479 SDRVYGHGGGKVWSLNGPRNGSGRDGDVGFSNGRFSVSDRVKNYEKRGRNLREERHDRKG 538 Query: 1037 -------GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTT 879 GN+SEPLV SLYVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA K DK Sbjct: 539 PDDSSRQGNASEPLVASLYVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAE-AKKDK-- 594 Query: 878 ISSSNGARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNS 717 RETGL IP +L ALA+ GR+ +Y + EQ+KAL S S K + Sbjct: 595 -------RETGLAIPKDLDSALAIPGVGRSRDQHPHVYRVSPEQRKALGSGSTKALKDHM 647 Query: 716 KTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYT 543 K+S DG +Q+WFREGL GP+L SGMCTEVF F+ VAN+S ++H+N T Sbjct: 648 KSSATDGKMQQWFREGLAGPMLSSGMCTEVFQFDASPSPGAIVPATSVANVSTENHQNAT 707 Query: 542 DPTKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPR 369 K+ +NRR L LP PL + LN T ++ H +GN+SS MVVSVL DPR Sbjct: 708 S-VKKTRNRRTLHELPEPLNGSSLNITEEQVKNLQKDH---FHGNKSS--MVVSVLVDPR 761 Query: 368 EAGDS--ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 225 EAGD + +GM+ PKS SRIFVVVL+DSVKYVTYSC LP +S HLVT Sbjct: 762 EAGDGDVDVDGMMRPKSLSRIFVVVLIDSVKYVTYSCGLP--RASPHLVT 809 >ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504999 [Cicer arietinum] Length = 786 Score = 416 bits (1070), Expect = e-113 Identities = 318/753 (42%), Positives = 396/753 (52%), Gaps = 61/753 (8%) Frame = -2 Query: 2330 
GDLTADFDDLQ---FPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEGFG 2160 GD FDDL P D+L D P GL D +++ D +N +G Sbjct: 54 GDFEITFDDLDTLCIPSDTDDFLLPDAWNPNGLPISPLTDNHGDYNGDG-DCSAKNSDYG 112 Query: 2159 SEGSDGLSSTVSVAWQN--------------SGDGSSDDVAGFLSYPSPESGSCDRE-AP 2025 D S SV + S D +S DV S PSPE+ S DRE + Sbjct: 113 VANFDSPESGASVVSSDQSPDVSRFFNSESVSADDNSVDVK-ISSMPSPETESSDREESS 171 Query: 2024 PGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGX 1845 GP+SSQ S S V NSPSPDSG + + S A+V E+ VKLE Sbjct: 172 NGPISSQGSGNGGSGVYEAMNSPSPDSGRYERD--ISSSHKHAIV----EEGVKLEGIVK 225 Query: 1844 XXXXXXXXXXXXESFSDNARSC---------KFRWAIQSENANS----MPDEKDKRKARL 1704 ES + C K + +Q + A S + DE +KRKARL Sbjct: 226 GCDLKRKKENCIESAENRTPKCSRRSSSMENKTQQQLQQQQAQSGFDGIEDEDEKRKARL 285 Query: 1703 TRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC- 1527 RNRESAQLSRQRKKHYVEELE+KVRSMHS IADL+ KI+F MAEN +LRQQL G +C Sbjct: 286 MRNRESAQLSRQRKKHYVEELEEKVRSMHSTIADLSSKITFVMAENATLRQQLGGGMMCP 345 Query: 1526 -----XXXXXXXXXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSE 1362 Y +KP+GSQVPLVPIP+LKPQ+P S+SK K+E Sbjct: 346 PPPPAGSGMYPHPPMPPMPYPWMPYAPYVVKPQGSQVPLVPIPRLKPQQPASSSKSKKNE 405 Query: 1361 SKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRW 1182 SKK + KTKKVAS+S LVP V+ ++GG + V SG S + DRW Sbjct: 406 SKKSEVKTKKVASISLLGLFFFIMLFGGLVPLVDFKFGGLVDNV-SGRSSYVS----DRW 460 Query: 1181 ----SQGRVLTVNSSHESGKRELHTGNSGFRERCTN----------GXXXXXXXXXXXXE 1044 GR+ V+ +R+ G S R ++ G Sbjct: 461 LYGQGGGRIWPVSGYRNESERDEELGFSNGRFGISDRNNYERGRKLGEEMNGWKDTSCFG 520 Query: 1043 VRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSN 864 R N+SEPL+ SLYVPRNDKLVKIDGNLIIHS++ASEKAMA S+ A K Sbjct: 521 HRDNASEPLLASLYVPRNDKLVKIDGNLIIHSIMASEKAMA-SQDAQEKKVKS------- 572 Query: 863 GARETGL-IPVNLSPALAVSNDGRN-----CMYGSATEQQKALSSSSGDTYKVNSKTSNN 702 ETGL IP + ALA+ GRN +Y + EQ++A+ S S T K + K+S Sbjct: 573 ---ETGLAIPKDWDSALAIPEVGRNRGPHPNVYRVSAEQRRAIGSGSAKTLKDHMKSSAT 629 Query: 701 DGLLQKWFREGLEGPILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKR 528 DG 
+Q+WFREGL GP+L SGMCTEVF F++ VANISA++ +N T K Sbjct: 630 DGKMQQWFREGLAGPMLSSGMCTEVFQFDVSPAPGAIVPATAVANISAENRRNATTVNKS 689 Query: 527 RKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS 354 R NRRIL LP PL + LN T + A + G GN+SS MVVSVL DP+E GD Sbjct: 690 R-NRRILHTLPDPLPGSTLNITEE---HARNLPKGHLPGNKSS--MVVSVLVDPKEVGDG 743 Query: 353 ESEGMISPKSFSRIFVVVLLDSVKYVTYSCILP 255 + +GM++PKS +RIFVVVL+DSVKYVTYSC LP Sbjct: 744 DVDGMMAPKSLTRIFVVVLIDSVKYVTYSCGLP 776 >ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Populus trichocarpa] gi|550335363|gb|EEE92390.2| hypothetical protein POPTR_0006s03300g [Populus trichocarpa] Length = 729 Score = 412 bits (1058), Expect = e-112 Identities = 287/669 (42%), Positives = 360/669 (53%), Gaps = 16/669 (2%) Frame = -2 Query: 2183 PPENEGFGSEGSDGLSSTVSVAWQNSGDGSSDDVAGFLSYPSPESGSCDREAPPGPVSSQ 2004 P E E S GSD SS +S S GS + +G LS SPESG+ Sbjct: 146 PSEAESCDSGGSDYRSSVLSPV---SSHGSGNSGSGVLSAGSPESGT------------- 189 Query: 2003 DSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKLEEGGXXXXXXXX 1824 + C VVD F V +++ A +S + V ++++ EE G Sbjct: 190 NVNPCNFVVDKKF--------VKTETESAKKRKSAKIAVAKRKKEMGDEENG-------- 233 Query: 1823 XXXXXESFSDNARSCKFRWAIQSENAN-------SMPDEKDKRKARLTRNRESAQLSRQR 1665 + R+ K R A +SEN + S+ E+D+RKARL RNRESAQLSRQR Sbjct: 234 ---------EIMRNLKSRKA-ESENVSVNVSGSASLSGEEDRRKARLMRNRESAQLSRQR 283 Query: 1664 KKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVCXXXXXXXXXXXXXX 1485 KKHYVEELEDKVR MHS IA LNGK+S+FMAEN +LR+QLS C Sbjct: 284 KKHYVEELEDKVRMMHSTIAQLNGKVSYFMAENATLRRQLSGNGAC----PPPMYAPMAP 339 Query: 1484 XXXXXPCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTKKVASVSXXXX 1305 Y +KP+GSQVPLVPIP+LKPQ+ + +KP K ESKK + KTKKVASVS Sbjct: 340 YPWVPCAPYVVKPQGSQVPLVPIPRLKPQQTVPLAKPKKGESKKGEGKTKKVASVSLFGF 399 Query: 1304 XXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKREL 1125 LVP V+V++ GGF +GRVL V+ H +G E Sbjct: 400 LFFILLFRCLVPIVDVKF-----------------GGFFDQHKGRVLIVD-GHTNGSHEK 441 Query: 1124 HTGNSGFRERCTNGXXXXXXXXXXXXEVRGNSSEPLVVSLYVPRNDKLVKIDGNLIIHSV 
945 N N GN+SE LV SLYVPRNDKLVKIDGNLIIHSV Sbjct: 442 RGHNGCLEHDSANKGASERLPGSDEFGQFGNASEHLVASLYVPRNDKLVKIDGNLIIHSV 501 Query: 944 LASEKAMALSRTASGVKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRN-----CMYG 780 LASE+ MA + E+ + + ALA+ G N +Y Sbjct: 502 LASERPMA--------------------SHESPEVNITKETALAIPGVGNNRGRHSHVYR 541 Query: 779 SATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEI--XX 606 + TE+QKAL S S DT K N K+S G LQ+WFREGL GP+L GMCTEVF F++ Sbjct: 542 THTERQKALDSGSADTSKDNLKSSAAKGKLQQWFREGLAGPLLSHGMCTEVFQFDVSPAP 601 Query: 605 XXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIPL--AELNATGQEGSRASHSHD 432 VAN++A+ +N + K+ NRRIL LPIPL ++LN TG+ R ++ Sbjct: 602 GAIVPASSVANMTAERQQNNSTHLKKGNNRRILRGLPIPLPGSDLNITGEHVGR--NTQK 659 Query: 431 GSSNGNRSSSPMVVSVLFDPREAGDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPF 252 + +GN+S SPMVVSVL DPRE+ D E +G+I+PKS SRIFVVVLLDS+KYVTYSC+LP Sbjct: 660 ENFHGNKSVSPMVVSVLVDPRESSDREVDGVITPKSLSRIFVVVLLDSIKYVTYSCVLPS 719 Query: 251 KASSHHLVT 225 HLVT Sbjct: 720 AGPLLHLVT 728 >ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299380 isoform 1 [Fragaria vesca subsp. 
vesca] Length = 711 Score = 410 bits (1055), Expect = e-111 Identities = 309/766 (40%), Positives = 397/766 (51%), Gaps = 46/766 (6%) Frame = -2 Query: 2384 MDASSVPGDTIT---GIGSSNGDLTADFDDLQFPPLDVDYLSNDL----MIPEGLMEELG 2226 M+ S V GD + + D DF+ L PPLD + S+D M + M +LG Sbjct: 1 MEDSVVAGDPPIPHPDLAPNCSDSGEDFESLPIPPLDPQFFSSDAGMATMAADSFMSDLG 60 Query: 2225 F------DPDFEFS---LDNLSFPPEN------EGFGSEGSDGLSSTVSVAWQNSGDGSS 2091 F + D+E + LDNL P E EGF S+V + ++ GSS Sbjct: 61 FGFGSDDNCDYELTFDDLDNLYIPSEADDFLLPEGFDPAAQPSSDSSVILKSESPESGSS 120 Query: 2090 DD-------VAGFLSYPSPESGSCDREAPP---GPVSSQDSAGCRSVVDGFFNSPSPDSG 1941 V+GFL+YPS ESG D+E GP+SSQ S G + +S + D Sbjct: 121 GVSKGSDGVVSGFLNYPSSESGGHDQEFSENSGGPLSSQGS-GIPEAANSPTHSGNSDRD 179 Query: 1940 VHSQSGPASD--------VRSGAVVVEDDEQKVKLEEGGXXXXXXXXXXXXXESFSDNAR 1785 V S A + RSG V K K E GG +R Sbjct: 180 VSSNVTTADEKVKIEEEVTRSGFVA------KRKKESGGGEEGNM------------ESR 221 Query: 1784 SCKFRWAIQSENANS-MPDEKDKRKARLTRNRESAQLSRQRKKHYVEELEDKVRSMHSVI 1608 S KFR + S + + DE ++RKARL RNRESAQLSRQRKKHYVEELEDKVR+MH+ I Sbjct: 222 SSKFRRSESSGGSGGCLDDEDERRKARLMRNRESAQLSRQRKKHYVEELEDKVRAMHTTI 281 Query: 1607 ADLNGKISFFMAENVSLRQQLSSGA-VCXXXXXXXXXXXXXXXXXXXPCG-YPMKPRGSQ 1434 ADLN K+S+ MAEN +L+QQLSSG+ +C P Y +KP+GSQ Sbjct: 282 ADLNNKMSYIMAENATLKQQLSSGSGICPPPPPPGMYPMPPMGYPWMPYSPYVVKPQGSQ 341 Query: 1433 VPLVPIPKLKPQKPLSASKP-NKSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNV 1257 VPLVPIP+LKPQ+P +A KP KSESK SKTKKVAS+S LVP +NV Sbjct: 342 VPLVPIPRLKPQQPAAAPKPKKKSESK---SKTKKVASISFLGLLFFLLLFGGLVPMLNV 398 Query: 1256 RYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVNSSHESGKRELHTGNSGFRERCTNGXX 1077 +G GS + F + +VL V + + G SG + +N Sbjct: 399 GFG----------GSSYVRDRFYDQQRAKVLKVPGHLNGSEGNVPLGVSGGKFDVSNKIH 448 Query: 1076 XXXXXXXXXXEVR-GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASG 900 GN+SEPLV SLYVPRNDKLVKIDGNLIIHSVLASEKA A Sbjct: 449 ERAHKQKEQGLPGVGNASEPLVASLYVPRNDKLVKIDGNLIIHSVLASEKAKA------- 501 Query: 899 VKNDKTTISSSNGARETGLIPVNLSPALAVSNDGRNCMYGSATEQQKALSSSSGDTYKVN 720 + K+ + GA+ G + P V+ R +Y + Q+KAL++ S 
Sbjct: 502 --HKKSREARVEGAK--GFVSALAIPEAGVNRGRRAPLYRTPAGQRKALTAGSA------ 551 Query: 719 SKTSNNDGLLQKWFREGLEGPILGSGMCTEVFHFEIXXXXXXXXXXVANI-SAKDHKNYT 543 DG LQ+WFREGL G +L SGMCTEVF F++ +++ + +H + Sbjct: 552 ------DGKLQQWFREGLAGSLLSSGMCTEVFQFDVSAANSGGIIPASSVANVSEHNSNA 605 Query: 542 DPTKRRKNRRILDHLPIPLAELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREA 363 R NRRIL IPLA N + RA ++ S+N S+S +VVSVL DPREA Sbjct: 606 TRLNRGGNRRILGGRAIPLAGSNHNATDDERAIRNNQSSNNFQVSNSSVVVSVLVDPREA 665 Query: 362 GDSESEGMISPKSFSRIFVVVLLDSVKYVTYSCILPFKASSHHLVT 225 GD + +GMI PKS SR+FVV+LLDSVKYVTYSC+LP +++ HLVT Sbjct: 666 GDIDVDGMIKPKSLSRVFVVLLLDSVKYVTYSCVLP-RSAPPHLVT 710 >gb|AGO05994.1| bZIP transcription factor family protein 10 [Camellia sinensis] Length = 718 Score = 409 bits (1050), Expect = e-111 Identities = 307/760 (40%), Positives = 403/760 (53%), Gaps = 57/760 (7%) Frame = -2 Query: 2345 IGSSNGDLTADFDDLQFPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEG 2166 + S+ T D D L PPLD S+ + G +++L +F+ D+L P + Sbjct: 2 VDPSSNSTTTDSDSLPIPPLDPSIFSDSFLAGGGDIDDL------DFTFDDLYLPSDTPH 55 Query: 2165 FGSE------GSDGLSSTVSVAWQNSGDG----SSDDVAGFLSYPSPESGS--------C 2040 F + SD + + S S D ++ FL+ SPES Sbjct: 56 FLNSLPPPHFSSDWIPDFPIPSDHTSTPSRVFNSDDLISDFLNVSSPESSHESANKASIV 115 Query: 2039 DREAPPGPVSSQDSAGCRSVVDGFFNSPSPDSGVHSQSGPASDVRSGAVVVEDDEQKVKL 1860 R P SSQ S SVV N SPDS +S + + +QK++L Sbjct: 116 ARVLDPEVSSSQGSGNSGSVVSEPLNYTSPDSANNS-------------IHDFVDQKIEL 162 Query: 1859 EEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANS---------MPDEKDKRKAR 1707 +E G + S+ R+ K++ + EN N + ++ +K+KAR Sbjct: 163 KEEGTNCLLKRKKESEEDVNSE-FRTSKYQRSNSGENPNQSYGYTSNTGISEDDEKKKAR 221 Query: 1706 LTRNRESAQLSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC 1527 L RNRESAQLSRQRKKHYVEELEDK+R+MHS + DLN KIS+ MAEN SLRQQLS GA+C Sbjct: 222 LMRNRESAQLSRQRKKHYVEELEDKLRTMHSTVQDLNSKISYIMAENASLRQQLSGGAMC 281 Query: 1526 XXXXXXXXXXXXXXXXXXXPCGYP--------MKPRGSQVPLVPIPKLKPQKPLSASKPN 1371 P GYP +KP+GSQVPLVPIP+LK Q P A K Sbjct: 282 
---PPPVPPPGMYPHPPMAPMGYPWMPCPPYVVKPQGSQVPLVPIPRLKSQNPSPAPKAK 338 Query: 1370 KSESKKIQSKTKKVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGF 1191 K ESKK +KTKKVASVS LVP VNV +GG R G + F NG + Sbjct: 339 KVESKK--TKTKKVASVSFLGLLFFILFFGGLVPMVNVNFGGIRRDTVLGGSNYFGNGFY 396 Query: 1190 DRWSQGRVLTVNSSHESGKRELHTG-NSGF--------RERCTNG---XXXXXXXXXXXX 1047 D+ GRV+TVN +++ G ++GF R+R + Sbjct: 397 DQ-HHGRVVTVNGHLNGSDQKIGMGLSNGFTNTTIHCGRDRAESNVEQIEGSQAFPGSDE 455 Query: 1046 EVR-GNSSEPLVVSLYVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISS 870 VR NSS PLV SLYVPRNDKLVKIDGNLIIHS+LASEK+MA Sbjct: 456 FVRPDNSSMPLVASLYVPRNDKLVKIDGNLIIHSILASEKSMASGN------------GG 503 Query: 869 SNGARETGL-IPVNLSPALAVS--NDGRN-CMYGSATEQQKALSSSSGDTYKVNSKTSNN 702 +N + ETGL + N+ PA+ ++ N+G++ +Y S +E ++AL S S D K N K++ Sbjct: 504 TNSSEETGLAVARNMPPAIPLTERNNGKHPHLYRSTSEPKRALGSGSAD--KDNLKSTPA 561 Query: 701 DGLLQKWFREGLEGPILGSGMCTEVFHFEIXXXXXXXXXXVA--NISAKDHKNYTDPTKR 528 DG LQ+WF+EGL GP+L SGMCTEVF F++ + N+SA+ KN T K Sbjct: 562 DGKLQQWFQEGLAGPMLSSGMCTEVFQFDVSPVPGAIVPATSVVNVSAEHRKNATHIIK- 620 Query: 527 RKNRRILDHLPIPL--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS 354 NRRIL +PIPL ++ N + + R D +GN+S S MVVSVL DPR+AGD Sbjct: 621 GLNRRILHGVPIPLPGSQNNISKEHVGRNPEKDD--FHGNKSLSSMVVSVLVDPRDAGDI 678 Query: 353 ESEGMIS-PKSFSRIFVVVLLDSVKYVTYSCILPFKASSH 237 +S+G++ PKS SRIFVVVL+DSVKYVTYSC+LP S H Sbjct: 679 DSDGVMGPPKSLSRIFVVVLIDSVKYVTYSCMLPLMGSYH 718 >ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, partial [Phaseolus vulgaris] gi|561035512|gb|ESW34042.1| hypothetical protein PHAVU_001G1193000g, partial [Phaseolus vulgaris] Length = 779 Score = 407 bits (1047), Expect = e-111 Identities = 316/748 (42%), Positives = 402/748 (53%), Gaps = 48/748 (6%) Frame = -2 Query: 2336 SNGDLTADFDDLQ---FPPLDVDYLSNDLMIPEGLMEELGFDPDFEFSLDNLSFPPENEG 2166 +NG+ FDDL P D+L D P+ LG P E S N P + Sbjct: 63 NNGEFEITFDDLDDICIPSDAEDFLLTDACNPDNT-SVLG--PIEESSAKNSDSPRSDAS 119 Query: 2165 FGS-EGSDGLS------STVSVAWQNS-GDGSSDDVAGFLS-YPSPESGSCDRE-APPGP 2016 S + S G+S 
++ SV+ NS +GS D V +S PSPES CDRE + GP Sbjct: 120 VVSGDRSSGVSRFFNSQASDSVSEGNSCKEGSLDAVDVRVSNIPSPESEFCDREESSSGP 179 Query: 2015 VSSQDSAGCRSVVDGFFNSPSPDSGVHSQ---SGPASDVRSGAVVVE-----DDEQKVKL 1860 VSSQ S S V NSPSPDS + S A +V V +E D ++K + Sbjct: 180 VSSQGSGNAGSGVYEAINSPSPDSVSFERDITSSHAHEVMDKGVKLEEISGCDLKRKKES 239 Query: 1859 EEGGXXXXXXXXXXXXXESFSDNARSCKFRWAIQSENANSMPDEKDKRKARLTRNRESAQ 1680 EG FS ++ K S+ N++ D+ +KRKARL RNRESAQ Sbjct: 240 CEGSATKHRR---------FSSSSVDTKTEKQTPSD-VNAIDDDDEKRKARLMRNRESAQ 289 Query: 1679 LSRQRKKHYVEELEDKVRSMHSVIADLNGKISFFMAENVSLRQQLSSGAVC-----XXXX 1515 LSRQRKKHYVEELE+KVRSM+S+IADL+ KIS+ +AEN +LRQQ+ +G +C Sbjct: 290 LSRQRKKHYVEELEEKVRSMNSIIADLSSKISYMVAENATLRQQVGAGVMCAPPPPAPGI 349 Query: 1514 XXXXXXXXXXXXXXXPCGYPMKPRGSQVPLVPIPKLKPQKPLSASKPNKSESKKIQSKTK 1335 Y +KP+GSQVPLVPIP+LKPQ+ SA K KSESKK + KTK Sbjct: 350 YPHPPMAPMPYPWMPCAPYVVKPQGSQVPLVPIPRLKPQQHTSAPKGKKSESKKSEGKTK 409 Query: 1334 KVASVSXXXXXXXXXXXXXLVPFVNVRYGGRREAVPSGFGSRFTNGGFDRWSQGRVLTVN 1155 KVAS+S LVP V+ ++GG + VP S + + G+V +VN Sbjct: 410 KVASISFLGLFFFIMLFGGLVPLVDFKFGGLVDNVPDTGLSSYVSDRVHGHGGGKVWSVN 469 Query: 1154 SSHESGKRELHTGNSGFR---------ERCTN-GXXXXXXXXXXXXEVRGNSSEPLVVSL 1005 +R+ G S R ER + G +GN+SEPLV SL Sbjct: 470 GPRNGSERDEEVGFSNERFSVKDKMNYERGRHLGEERGERQGPDDFGRQGNASEPLVASL 529 Query: 1004 YVPRNDKLVKIDGNLIIHSVLASEKAMALSRTASGVKNDKTTISSSNGARETGL-IPVNL 828 YVPRNDK+VKIDGNLIIHS++ASEKAMA S+TA + +ETGL IP + Sbjct: 530 YVPRNDKMVKIDGNLIIHSIMASEKAMA-SQTAEAKEK-----------KETGLAIPKDS 577 Query: 827 SPALAVSNDGR-----NCMYGSATEQQKALSSSSGDTYKVNSKTSNNDGLLQKWFREGLE 663 ALA+ GR +Y EQ+KAL S S K + K+S DG +Q+WFREGL Sbjct: 578 DSALAIPEVGRLRGQHPHVYRVPAEQRKALGSGSTKALKDHMKSSATDGKMQQWFREGLA 637 Query: 662 GPILGSGMCTEVFHFEI--XXXXXXXXXXVANISAKDHKNYTDPTKRRKNRRILDHLPIP 489 GP+L SGMCTEVF F++ VAN+S + +N T K+ +NRR L LP Sbjct: 638 GPMLSSGMCTEVFQFDVSPSPGAIVPATSVANLSTEKRQNATS-VKKTRNRRTLHGLPDS 696 Query: 488 L--AELNATGQEGSRASHSHDGSSNGNRSSSPMVVSVLFDPREAGDS--ESEGMISPKSF 321 L + LN T + H +GN SS MVVSVL DP+EAGD + +GM+ PKS Sbjct: 697 
LTGSSLNITEEHVKNLQKDH---LHGNESS--MVVSVLVDPKEAGDGDVDVDGMMRPKSL 751 Query: 320 SRIFVVVLLDSVKYVTYSCILPFKASSH 237 SRIFVVVL+DSVKYVTYSC LP +AS H Sbjct: 752 SRIFVVVLIDSVKYVTYSCGLP-RASPH 778