BLASTX nr result
ID: Coptis25_contig00002793
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis25_contig00002793 (6072 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002273712.2| PREDICTED: transcription initiation factor T... 1875 0.0 ref|XP_002522626.1| transcription initiation factor tfiid, putat... 1785 0.0 ref|XP_003533464.1| PREDICTED: transcription initiation factor T... 1755 0.0 ref|XP_003523903.1| PREDICTED: transcription initiation factor T... 1750 0.0 ref|XP_002323740.1| histone acetyltransferase [Populus trichocar... 1733 0.0 >ref|XP_002273712.2| PREDICTED: transcription initiation factor TFIID subunit 1-A-like [Vitis vinifera] Length = 2068 Score = 1875 bits (4857), Expect = 0.0 Identities = 1035/1770 (58%), Positives = 1214/1770 (68%), Gaps = 70/1770 (3%) Frame = +2 Query: 803 EKDENAELFVEEEKSFEDTVHSPGSFEDDKLAVGLEDFQEDESEVFQEPLNEKSMSSLPV 982 E+ ++ + V+ E+ F+ + E +GL +E E +EP K + LP+ Sbjct: 319 EQCSSSAVIVQHERQFKYGLQLAVLHES--FTIGL--VPNEEPEGLEEPFEGKRSAPLPI 374 Query: 983 LYREDETVVLRFSEIFGIHEPKKKGEKRIHRYPIPIERYKVIDAYDTVEEDEEVFFRGSC 1162 L ED V+LRFSEIFGIH P KKGEKR RY IP ERYK +DA D VEEDEE F +G C Sbjct: 375 LCVEDGMVILRFSEIFGIHGPLKKGEKRDRRYTIPKERYKSMDAPDNVEEDEEAFLKGGC 434 Query: 1163 QSSFNTEELGSLSDDSMVKILDE-ESQISDVREGVSEKTVQTVERRKDSCLLAEPMKESV 1339 Q+ T+ DD+ V + DE E + V +G + +Q E+RK SC+ AEPMKE + Sbjct: 435 QAFSFTKHKLLTQDDASVFMEDEAELKKVGVVQGTATMELQNDEQRKHSCISAEPMKEDM 494 Query: 1340 TEKLAAGWGSPSCPKFCPLDQQDWEDGIVWGDSP-FSSRGXXXXXXXXXXXXXDINKETV 1516 L+ W SP PKF PLDQQDWED I+W +SP S ++KET Sbjct: 495 PVDLSEFWLSPLSPKFYPLDQQDWEDKIIWDNSPEVSDNSAESCEISGPDSEVVVDKETE 554 Query: 1517 SEVGQQCSGLELQMEADEKYHHLFLSSYPIELESFGSRNFSDHVNLPFTERK-HPQQLRL 1693 Q + Q+ DEK H +FL S P+ +E+FGSRN S +N +E K HPQ LRL Sbjct: 555 LVTKAQNQRPKFQVAVDEKDHGVFLGSSPVLIEAFGSRNSSALINHSLSEIKYHPQLLRL 614 Query: 1694 DSFLKVDGSTYSEGGKENETKHLDGNGAIKNFTKLSLQNKDMLEDSWLDKIIWDPNEAVL 1873 ++ L++D S+ S KE+ + G+ AI+ F KL+LQN+DMLE SW+D+IIW+P++ + Sbjct: 615 ETRLEMDNSSQSAVRKEDAIEDPRGSEAIRRFNKLTLQNRDMLEGSWVDRIIWEPHKPIS 674 Query: 1874 KPKLILDLQDEQMLFEISETRDVKHLQAHAGAMIVTRSVKPNGGDYFDLPGHLGTSVGRF 2053 KPKLILDLQDEQMLFEI + +D K+L HAGAM++TR VK + GD +LP H G S GRF Sbjct: 675 KPKLILDLQDEQMLFEILDDKDGKNLGLHAGAMLITRPVKSSTGDSIELPVHGGPSGGRF 734 Query: 2054 NISNDKYYSNRXXXXXXXXXXXXRAAHGVKVLHSIPALKLQTMKPKLSNKDIANFHRPRA 2233 NI+NDK+Y NR R AHGVK+LHSIPALKLQTMK KLSNKDIANFHRP+A Sbjct: 735 NIANDKFYLNRKTSQQLKSHSKKRTAHGVKILHSIPALKLQTMKLKLSNKDIANFHRPKA 794 Query: 2234 LWYPHDNEMVAKEQGKLSTQGPIKLILKSMGGKGSKLHVDAEETISSVKAKASKKLDFKP 2413 LWYPHD EM KEQGKL TQGP+K+ILKS+GGKGSKLHVDAEET+SSVK KASKKLDFKP Sbjct: 795 LWYPHDIEMAVKEQGKLPTQGPMKIILKSLGGKGSKLHVDAEETVSSVKLKASKKLDFKP 854 Query: 2414 SEKVKLIYSGKELEDDKSLAAENVRPNSVLHLVRSKIHLWPKAQKLPGENKSMRPPGAFK 2593 SE VK+ Y+GKELED KSLAA+NV+PNS+LHLVR+KIHLWP+AQKLPGENKS+RPPGAFK Sbjct: 855 SEMVKIFYTGKELEDHKSLAAQNVQPNSLLHLVRTKIHLWPRAQKLPGENKSLRPPGAFK 914 Query: 2594 KKSDLSLKDGHVFLMEYCEERPLMLGNVGMGARLCTYYQKXXXXXXXXXXXXXXXXXXXX 2773 KKSDLS+KDGHVFLMEYCEERPL+LGNVGMGARLCTYYQK Sbjct: 915 KKSDLSVKDGHVFLMEYCEERPLLLGNVGMGARLCTYYQKSAPGDHTGAFMRNGNSSLGT 974 Query: 2774 VLTLDPADKSPFLGDIRPGFSQSCLETNMYRAPIFPHKLCSTDYLLVRSAKGKLSLRRID 2953 VLTLDPADKSPFLGDI+PG SQS LETNMYRAP+FPHK+ STDYLLVRSAKGKLS+RRID Sbjct: 975 VLTLDPADKSPFLGDIKPGCSQSSLETNMYRAPLFPHKVSSTDYLLVRSAKGKLSIRRID 1034 Query: 2954 RIDVVGQQEPHMEVISPGTKNLQTYIWNRLSVYIYREFVANEKRGLRPFIRADDLSAQFP 3133 RIDVVGQQEPHMEV+SPGTK LQTYI NRL VY+YREF A EKRG P IRAD+LSAQFP Sbjct: 1035 RIDVVGQQEPHMEVMSPGTKGLQTYIMNRLLVYMYREFRAGEKRGSLPCIRADELSAQFP 1094 Query: 3134 NISEPFIRKRLKHCADLQRGQNGHLLWVMRRNFRIPLEEEIRRMVTPENVCAYESMQAGL 3313 NISEPF+RKRLKHCADLQ+G NG L WVMRRNFRIPLEEE+RRMVTPENVCAYESMQAGL Sbjct: 1095 NISEPFLRKRLKHCADLQKGSNGLLFWVMRRNFRIPLEEELRRMVTPENVCAYESMQAGL 1154 Query: 3314 YRLKRLGISRLTQPTGLSSAMNQLPDEAIALAAASHIERELQITPWNLSSNFVSCTNQDR 3493 YRLK LGI+RLT PTGLSSAMNQLP EAIALAAASHIERELQITPWNLSSNFV+CTNQDR Sbjct: 1155 YRLKHLGITRLTLPTGLSSAMNQLPYEAIALAAASHIERELQITPWNLSSNFVACTNQDR 1214 Query: 3494 ENIERLEITGVGDPSGRGLGFSYXXXXXXXXXXXXXXXXXXXXXXGGSTVTGTDADLRRL 3673 ENIERLEITGVGDPSGRGLGFSY GGSTVTGTDADLRRL Sbjct: 1215 ENIERLEITGVGDPSGRGLGFSYVRTAPKAPISNAMVKKKITVGRGGSTVTGTDADLRRL 1274 Query: 3674 SMDAAREVLLKFNVPEEQIAKLTRWHRIAMIRKLSSEQATSGVKVDSTTISKYARGQRMS 3853 SM+AAREVLLKFNVPEE IAK TRWHRIAMIRKLSSEQA SGVKVD TTISKYARGQRMS Sbjct: 1275 SMEAAREVLLKFNVPEELIAKQTRWHRIAMIRKLSSEQAASGVKVDPTTISKYARGQRMS 1334 Query: 3854 FLQLQQQTREKCQEIWDRQVQCXXXXXXXXXXXXXXXXXXXXXFAGDLENLLDAEECGVD 4033 FLQLQQQTREKCQEIWDRQVQ FAGDLENLLDAEEC Sbjct: 1335 FLQLQQQTREKCQEIWDRQVQSLSAVDSDEIESDSEANSDLDSFAGDLENLLDAEECEDG 1394 Query: 4034 EEGSFEFEHNKEDGIKGLKMRRRPSQPQXXXXXXXXXXXXXXLCRMLMDEDESEXXXXXX 4213 EEG+ E +H++ DG++GLKMRRRPSQ Q LCRMLMD+DE+E Sbjct: 1395 EEGNCESKHDRTDGVRGLKMRRRPSQAQAEEEIEDEAAEAAELCRMLMDDDEAERKKKKK 1454 Query: 4214 T----------AXXXXXXXXXXXXXTKKINTIVKKIFRTPQPDGSFILKEQIITDPKEVE 4363 T KK + +VK++ QPDGS+ KE+ D KEVE Sbjct: 1455 TRPVGEEEGLALGSQLNFGFENGQKMKKNSAVVKQLVSKVQPDGSYSQKEKAFRDSKEVE 1514 Query: 4364 KILAKRALXXXXXXXXXXXXXXXAPIGLLKKKYTAMGDGLKVIKEKKQTDKPVRENFECG 4543 L KR + A +G+L KK MGDG+K+ KEKK RE+F CG Sbjct: 1515 SFLPKRNI---SGKAKILKKNDAARMGVLHKKIKIMGDGIKMFKEKKS----ARESFVCG 1567 Query: 4544 ACGQKGHMRTNKNCPRYGEDLD---DSTEPGK-----SNFLDPPNLHQKTSVKKLIPKSA 4699 ACGQ GHMRTNKNCP+YGEDL+ + TEP K S+ L Q+T +KK+IPKSA Sbjct: 1568 ACGQLGHMRTNKNCPKYGEDLEAQVEITEPEKASVKSSSLEHSAQLQQRTLIKKIIPKSA 1627 Query: 4700 TRLALVETSENSEKAGSKTK---VKVKCGPAEKISEKIAPEESQNSDKQVTVDSETGTRS 4870 T++ALVETSE EK+ K K VK KCG A+++ +K+AP + D+ V D+ETG + Sbjct: 1628 TKMALVETSE-GEKSSLKAKNLPVKFKCGSADRLPDKVAPGTTHGPDQPVISDAETGNKF 1686 Query: 4871 VNRISRIVFSNKLKPEDTQVE--------------------------------------- 4933 V ++++I+ SNK+KPED+QVE Sbjct: 1687 V-KVNKIIISNKMKPEDSQVESHKPSIVIRPPTETDKEHVESHKPSIVIRPPSEIDRDQV 1745 Query: 4934 ---PPKLVIKPPAEVDRDQPHKKLIIKQPKGNISVDQVKEESSSGLQENYRKTKTMRELS 5104 P +VI+PP + DRDQP KK+IIK+PK IS+DQV ++ S+GL+ YRKTK + ELS Sbjct: 1746 ESHKPSIVIRPPVDTDRDQPRKKIIIKRPK-EISLDQVSQDGSTGLE--YRKTKKIVELS 1802 Query: 5105 SSEKLRKQEGKRLSXXXXXXXXXXXXXXLWEEEDKXXXXXXXXXXXXXXXXXXXXXXMQA 5284 S EK +K E K L+ LWEEE+K ++ Sbjct: 1803 SFEKHKKPETKHLN-EDAAKRKAREDKRLWEEEEKRRNAERLREERAKRLYEEEMRMLE- 1860 Query: 5285 EQQRLAELRR-QEAMWXXXXXXXXXXXXXXXXG--PEMVDEYFE-AKPSRTDRRLPGRER 5452 EQ+RLAE+R+ +EA+ PEM D + E + R DRR+P R+R Sbjct: 1861 EQERLAEIRKFEEAIRREREEEERQKARKKKKKKMPEMRDNHLEDYRTRRNDRRIPERDR 1920 Query: 5453 ATKRRSVVELGNYGAEYAPQSKRRRGEAGEVGLANILEGIVDSLKDATEVSYLFLKPVSK 5632 +TKRR VVELG +GA+Y P +KRRRG GEVGL+N+LE IVDSL+D EVSYLFLKPVSK Sbjct: 1921 STKRRPVVELGKFGADYGPPTKRRRG--GEVGLSNVLESIVDSLRDRYEVSYLFLKPVSK 1978 Query: 5633 KEAPDYHDIIEHPMDLSTIKEKVRKMKYKDREVFRHDVWQITFNAHRYNDGRNPGIPPLA 5812 KEAPDY DII PMDLSTI+EKVRKM+YK+RE FRHDVWQIT+NAH+YNDGRNPGIPPLA Sbjct: 1979 KEAPDYLDIIRDPMDLSTIREKVRKMEYKNREDFRHDVWQITYNAHKYNDGRNPGIPPLA 2038 Query: 5813 DQLLELCDYLLDQNDERLAEAEAGIEIRPT 5902 DQLLELCDYLL +ND L EAEAGIE R T Sbjct: 2039 DQLLELCDYLLSENDASLTEAEAGIEYRDT 2068 Score = 211 bits (537), Expect = 2e-51 Identities = 122/221 (55%), Positives = 144/221 (65%), Gaps = 6/221 (2%) Frame = +2 Query: 278 NRLLGFMFGNVDNAGDLDVDYLGE------DAKEHLAALADKLGSSLTDINLSIKSAHTR 439 NRLLGFMFGNVD AGDLDVDYL E DAKEHLAALADKLG SLTDI+LS+KS T Sbjct: 28 NRLLGFMFGNVDGAGDLDVDYLDEYNYVFQDAKEHLAALADKLGPSLTDIDLSVKSPQTP 87 Query: 440 ADASEQDYDEKAEDAVDYEDIDEQYEGPEIFTTTEEDHILPKTEYFSAEVSLSVLGQRNS 619 AD +EQDYDEKAEDAV+YEDIDEQYEGPEI TEED++L K EYFSA+VS++ L S Sbjct: 88 ADGAEQDYDEKAEDAVNYEDIDEQYEGPEIQAATEEDYLLSKKEYFSADVSVASLDHSAS 147 Query: 620 VFDEENYDEDVEFDEEQKVVNDCSEAQPIITSAVLVESLDPDCGSRLPCDASLGKNNLSS 799 VFD++NYDED EF++E +VV++ SE Q I SS Sbjct: 148 VFDDDNYDEDEEFEKEHEVVDNNSEVQAI-----------------------------SS 178 Query: 800 PEKDENAELFVEEEKSFEDTVHSPGSFEDDKLAVGLEDFQE 922 E+ E+ + E EKS +D + PG E + L LED E Sbjct: 179 GEQGEHLSVVSEGEKSPDDDLF-PGLLEPENLTGDLEDIPE 218 >ref|XP_002522626.1| transcription initiation factor tfiid, putative [Ricinus communis] gi|223538102|gb|EEF39713.1| transcription initiation factor tfiid, putative [Ricinus communis] Length = 1885 Score = 1785 bits (4624), Expect = 0.0 Identities = 1024/1940 (52%), Positives = 1238/1940 (63%), Gaps = 69/1940 (3%) Frame = +2 Query: 278 NRLLGFMFGNVDNAGDLDVDYLGEDAKEHLAALADKLGSSLTDINLSIKSAHTRADASEQ 457 +RLLGFMFGNVDN+GDLDVDYL EDAKEHLAALADKLGSSLTDI++ +KS ADA+EQ Sbjct: 29 SRLLGFMFGNVDNSGDLDVDYLDEDAKEHLAALADKLGSSLTDIDV-LKSPQITADAAEQ 87 Query: 458 DYDEKAEDAVDYEDIDEQYEGPEIFTTTEEDHILPKTEYFSAEVSLSVLGQRNSVFDEEN 637 DYDEKAE+AVDYED DEQYEGPEI +EED++LPK EYFS+EVSLS L SVFD+EN Sbjct: 88 DYDEKAENAVDYEDFDEQYEGPEIQAASEEDYLLPKKEYFSSEVSLSTLKPTTSVFDDEN 147 Query: 638 YDEDVEFDEEQKVVNDCSEAQPIITSAVLVESLDPDCGSRLPCDASLGKNNLSSPEKDEN 817 YDE+ EE+K + E + E++E Sbjct: 148 YDEE----EEEKGGGEGEEEE--------------------------------EEEEEEE 171 Query: 818 AELFVEEEKSFEDTVHSPGSFEDDKLAVGLEDFQEDESEVFQEPLNEKSMSSLPVLYRED 997 AE EE+++ ++ + ED +++ S+ +EP + KS + LPVL ED Sbjct: 172 AEEEEEEKEAEKEHIAVDEKLEDQCISL---------SDAMEEPPDGKSSAPLPVLCVED 222 Query: 998 ETVVLRFSEIFGIHEPKKKGEKRIHRYPIPIERYKVIDAYDTVEEDEEVFFRGSCQ---- 1165 V+LRFSEIFGIHEP KKGEKR RY I ERYK +D D VE+DEE F +GS Q Sbjct: 223 GLVILRFSEIFGIHEPLKKGEKRDRRYSIFKERYKSMDVSDFVEDDEEAFLKGSSQVFQL 282 Query: 1166 -SSFNTEELGSLSDDSMVKILDEESQISDVREGVSEKTVQTVERRKDSCLLAEPMKESVT 1342 S N E+ + +D + GV +++ Q E+R SC+ EPM + ++ Sbjct: 283 HSHVNQYEIAASNDGGS----------ESGKFGVMQRSAQNEEQRS-SCVSGEPMNKDLS 331 Query: 1343 EKLAAGWGSPSCPKFCPLDQQDWEDGIVWGDSPFSSRGXXXXXXXXXXXXXDINKETVSE 1522 + GW SP F PLDQQDWE+ I W +SP S D + + E Sbjct: 332 INIGTGWQSPL---FYPLDQQDWENRICWDNSPAVSENSVESCGLSGPDLADSYTKEM-E 387 Query: 1523 VGQQCSGLE--LQMEADEKYHHLFLSSYPIELESFGSRNFSDHVNLPFTERKHPQQLRLD 1696 +G Q ++ L ++ DEK H+ FL S PI +ESFGS + S +LP + HPQ LRL+ Sbjct: 388 LGSQPQNIQSYLPVQPDEKDHNCFLHSSPILVESFGSLDSSGPSDLPLSVTFHPQLLRLE 447 Query: 1697 SFLKVDGSTYSEGGKENETKHLDGNGAIKNFTKLSLQNKDMLEDSWLDKIIWDPNEAVLK 1876 S ++ + +++ +EN + N A + F+KL+LQNKDM++ SWLD IIW+PN+ +K Sbjct: 448 SHMEAEKHYHADDRRENNAVEVFQNDAFRRFSKLTLQNKDMMDGSWLDNIIWEPNKTNMK 507 Query: 1877 PKLILDLQDEQMLFEISETRDVKHLQAHAGAMIVTRSVKPNGGDYFDLPGHLGTSVGRFN 2056 PKLILDLQDEQMLFE+ + +D KHLQ HAGAMI+TRS+KP +L GH S +FN Sbjct: 508 PKLILDLQDEQMLFEVLDNKDSKHLQLHAGAMIMTRSLKPRVSP--ELSGHGYESGWQFN 565 Query: 2057 ISNDKYYSNRXXXXXXXXXXXXRAAHGVKVLHSIPALKLQTMKPKLSNKDIANFHRPRAL 2236 I+NDK+Y NR R+A+G +V HS PA+KLQTMK KLSNKD+ NFHRP+AL Sbjct: 566 IANDKFYMNRKISQQLQSSSTKRSAYGNRVHHSAPAIKLQTMKLKLSNKDLGNFHRPKAL 625 Query: 2237 WYPHDNEMVAKEQGKLSTQGPIKLILKSMGGKGSKLHVDAEETISSVKAKASKKLDFKPS 2416 WYPHDNE+ KEQ KL TQGP+K+ILKS+GGKGSKLHVDAEETISSVKAKASKKLDFKP Sbjct: 626 WYPHDNEVAVKEQKKLPTQGPMKIILKSLGGKGSKLHVDAEETISSVKAKASKKLDFKPL 685 Query: 2417 EKVKLIYSGKELEDDKSLAAENVRPNSVLHLVRSKIHLWPKAQKLPGENKSMRPPGAFKK 2596 E VK+ Y GKELED KSLAA+NV+PNS+LHLVR+KIHL P+AQ++PGENKS+RPPGAFKK Sbjct: 686 EMVKIFYLGKELEDHKSLAAQNVQPNSLLHLVRTKIHLLPRAQRIPGENKSLRPPGAFKK 745 Query: 2597 KSDLSLKDGHVFLMEYCEERPLMLGNVGMGARLCTYYQKXXXXXXXXXXXXXXXXXXXXV 2776 KSDLS+KDGH+FLMEYCEERPL+L N+GMGA LCTYYQK V Sbjct: 746 KSDLSVKDGHIFLMEYCEERPLLLSNIGMGANLCTYYQKSSPSDQTGVSLRSGNNSLGNV 805 Query: 2777 LTLDPADKSPFLGDIRPGFSQSCLETNMYRAPIFPHKLCSTDYLLVRSAKGKLSLRRIDR 2956 + L+P DKSPFLGDI+ G SQ LETNMY+APIF HK+ STDYLLVRSAKGKLS+RRIDR Sbjct: 806 VVLEPTDKSPFLGDIKAGCSQLSLETNMYKAPIFSHKVASTDYLLVRSAKGKLSIRRIDR 865 Query: 2957 IDVVGQQEPHMEVISPGTKNLQTYIWNRLSVYIYREFVANEKRGLRPFIRADDLSAQFPN 3136 I VVGQQEP MEV+SP +KNLQ YI NRL VY+YRE+ A EKRG P+IRAD+LSA FP Sbjct: 866 IAVVGQQEPLMEVLSPASKNLQAYIINRLLVYVYREYRAAEKRGTIPWIRADELSALFPY 925 Query: 3137 ISEPFIRKRLKHCADLQRGQNGHLLWVMRRNFRIPLEEEIRRMVTPENVCAYESMQAGLY 3316 +SE +RK+LK CA L++ NGHL W +R+F IP EEE+++MV PENVCAYESMQAGLY Sbjct: 926 VSETILRKKLKECAVLRKDANGHLFWSKKRDFIIPSEEELKKMVLPENVCAYESMQAGLY 985 Query: 3317 RLKRLGISRLTQPTGLSSAMNQLPDEAIALAAASHIERELQITPWNLSSNFVSCTNQDRE 3496 RLK LGI+RLT PT +S+AM+QLPDEAIALAAASHIERELQITPW+LSSNFV+CT+QDRE Sbjct: 986 RLKHLGITRLTLPTSVSTAMSQLPDEAIALAAASHIERELQITPWSLSSNFVACTSQDRE 1045 Query: 3497 NIERLEITGVGDPSGRGLGFSYXXXXXXXXXXXXXXXXXXXXXXGGSTVTGTDADLRRLS 3676 NIERLEITGVGDPSGRGLGFSY GGSTVTGTDADLRRLS Sbjct: 1046 NIERLEITGVGDPSGRGLGFSY-VRAAPKAPMSNAMAKKKAAARGGSTVTGTDADLRRLS 1104 Query: 3677 MDAAREVLLKFNVPEEQIAKLTRWHRIAMIRKLSSEQATSGVKVDSTTISKYARGQRMSF 3856 M+AAREVLLKFNVPEEQIAK TRWHRIAMIRKLSSEQA SGVKVD TTISKYARGQRMSF Sbjct: 1105 MEAAREVLLKFNVPEEQIAKQTRWHRIAMIRKLSSEQAASGVKVDPTTISKYARGQRMSF 1164 Query: 3857 LQLQQQTREKCQEIWDRQVQCXXXXXXXXXXXXXXXXXXXXXFAGDLENLLDAEECGVDE 4036 LQLQQQTREKCQEIWDRQVQ FAGDLENLLDAEEC D Sbjct: 1165 LQLQQQTREKCQEIWDRQVQSLSAVDGDELESDSEANSDLDSFAGDLENLLDAEECEGD- 1223 Query: 4037 EGSFEFEHNKEDGIKGLKMRRRPSQPQXXXXXXXXXXXXXXLCRMLMDEDESEXXXXXXT 4216 E ++E + +K DG+KG+KMRR PSQ Q LCR+LMD+DE+E T Sbjct: 1224 ESNYESKQDKADGVKGIKMRRHPSQAQAEEEIEDEAAEAAELCRLLMDDDEAEQKKKKKT 1283 Query: 4217 AXXXXXXXXXXXXXTKKINTIVK-KIFRTPQPDGSFILKEQIITDPKEVEKILAKRALXX 4393 + +N+ K P+GSF+ KE I D KEVE + K+ Sbjct: 1284 KTAGLVAGLLPGLKSNFVNSTEHIKQKDKGHPNGSFVPKESSIKDSKEVEALFIKKKKSE 1343 Query: 4394 XXXXXXXXXXXXXAPIGLLKKKYTAMGDGLKVIKEKKQTDKPVRENFECGACGQKGHMRT 4573 + L K ++ KEKK + RE F CGACGQ GHMRT Sbjct: 1344 KVKALKKNGFQDSSTPPLTKN---------QIFKEKKSS----REKFVCGACGQLGHMRT 1390 Query: 4574 NKNCPRYGEDLDDSTE-------PGKSNFLDPP-NLHQKTSVKKLIPKSATRLALVETSE 4729 NKNCP+YGE+ + E GKSN LDP QK KK + K+A ++ E + Sbjct: 1391 NKNCPKYGEEPEAQVEITDLEKSSGKSNSLDPLFKSQQKLQKKKSMLKTAAKVEDPEGEK 1450 Query: 4730 NSEKAGSKTKVKVKCGPAEKISEKIAPEESQNSDKQVTVD-------SETGTRSVNRISR 4888 +S KA VK C EK S+K A +Q+S++ +T D ETG+ V +IS+ Sbjct: 1451 SSLKA-KLLPVKFVCSSTEKNSDKPADGAAQSSERPITSDVRPDSSEMETGSMPVAKISK 1509 Query: 4889 IVFSNKLKPEDTQVE------------------------------------------PPK 4942 I SNK KPED Q++ P Sbjct: 1510 IKISNKAKPEDVQMDVHKPAIVIRPPMDTDKGQNEYHKPSIVIRPPANTERDHVESHKPS 1569 Query: 4943 LVIKPPAEVDRDQPHKKLIIKQPKGNISVDQVKEESSSGLQENYRKTKTMRELSSSEKLR 5122 +VI+PPA DR QPHKKL+I +PK I +DQV ++ S+GL+ YRK K + ELS +K R Sbjct: 1570 IVIRPPAVKDRGQPHKKLVIIKPKEVIDLDQVSQDGSTGLE--YRKIKKIAELSGVDKQR 1627 Query: 5123 KQEGKRLSXXXXXXXXXXXXXXLWEEEDKXXXXXXXXXXXXXXXXXXXXXXMQAEQQRLA 5302 K LWEEE+K + E+ LA Sbjct: 1628 KPLTWHF-PGESAKKKAREERRLWEEEEKRRNTEKLREERARRSYGEENRGV-VERGALA 1685 Query: 5303 ELRRQEAM----WXXXXXXXXXXXXXXXXGPEMVDEYFEAKPSRTDRRLPGRERATKRRS 5470 ELRR E PE+ D+Y E R RR+ R+R KRRS Sbjct: 1686 ELRRYEEAVREEREEEEQQKAKKKKKKKIRPEISDDYLE--DYRASRRMRERDRGAKRRS 1743 Query: 5471 VVELGNYGAEYAPQSKRRRGEAGEVGLANILEGIVDSLKDATEVSYLFLKPVSKKEAPDY 5650 +VEL YG E+A +KRRRG GEVGLANILEG+VD+L+ EVSYLFLKPV+KKEAPDY Sbjct: 1744 IVELSKYGTEHASATKRRRG--GEVGLANILEGVVDALRGRLEVSYLFLKPVTKKEAPDY 1801 Query: 5651 HDIIEHPMDLSTIKEKVRKMKYKDREVFRHDVWQITFNAHRYNDGRNPGIPPLADQLLEL 5830 DII+ PMDLSTI++KVRKM+YK RE FRHDVWQI +NAH YND RNPGIPPLADQLLE+ Sbjct: 1802 LDIIKRPMDLSTIRDKVRKMEYKHREEFRHDVWQIAYNAHLYNDRRNPGIPPLADQLLEI 1861 Query: 5831 CDYLLDQNDERLAEAEAGIE 5890 CDYLL + + LAEAE GIE Sbjct: 1862 CDYLLAEQNSSLAEAEEGIE 1881 >ref|XP_003533464.1| PREDICTED: transcription initiation factor TFIID subunit 1-A-like [Glycine max] Length = 1910 Score = 1755 bits (4545), Expect = 0.0 Identities = 1022/1926 (53%), Positives = 1249/1926 (64%), Gaps = 53/1926 (2%) Frame = +2 Query: 278 NRLLGFMFGNVDNAGDLDVDYLGEDAKEHLAALADKLGSSLTDINLSIKSAHTRADASEQ 457 NR LGFMFGNVDN+GDLDVDYL EDAKEHL+ALADKLG SLTDI++ KS T D EQ Sbjct: 28 NRFLGFMFGNVDNSGDLDVDYLDEDAKEHLSALADKLGPSLTDIDVR-KSPQTPPDVVEQ 86 Query: 458 DYDEKAEDAVDYEDIDEQYEGPEIFTTTEEDHILPKTEYFSAEVSLSVLGQRNSVFDEEN 637 D D KAEDAVDYEDIDE+Y+GPE EED++LPK E+FS+E S+ L + SVFD+EN Sbjct: 87 DCDVKAEDAVDYEDIDEEYDGPETEAANEEDYLLPKKEFFSSEASVC-LESKASVFDDEN 145 Query: 638 YDEDVEFDEEQKVVNDCSEAQ--PIITSAVLVESLDPDCGSRLPCDASLGKNNLSSPEKD 811 YDE+ E +EQ VND S+ P+ V++ S+ + S Sbjct: 146 YDEESE--KEQDFVNDDSKVYNIPLADLHVMLVSV------------LIKFTYASFVLSG 191 Query: 812 ENAELFVE---EEKSFEDTVHSPGSFEDDKLAVGLEDFQEDESEVFQEPLNEKSMSS-LP 979 E E FV+ EE S E +H S + ++L ++ +ED EV ++SM+ LP Sbjct: 192 EQEESFVDASKEESSLEHELHVD-SPQTEELDADVQKLEEDGPEV-----QKRSMAMPLP 245 Query: 980 VLYREDETVVLRFSEIFGIHEPKKKGEKRIHRYPIPIERYKVIDAYDT-VEEDEEVFFRG 1156 VL ED +LRFSEIFGIHEP +KGEKR HR+ IP + YK D D VEEDEE F +G Sbjct: 246 VLCVEDGVAILRFSEIFGIHEPLRKGEKREHRHSIPRDIYKSFDLTDDFVEEDEEEFLKG 305 Query: 1157 SCQSSFNTEELGSLSDD-SMVKILDEESQISDVREGVSEKTVQTVERRKDSCLLAEPMKE 1333 QS ++++ + +D S +D E + + ++ KDSC AEPMK Sbjct: 306 FSQSLSLSKQVCVVHNDVSESNDVDLEFPKFGFLHADASVDRKDDQQSKDSCHSAEPMKG 365 Query: 1334 SVTEKLAAGWGSPSCPKFCPLDQQDWEDGIVWGDSPFSS-RGXXXXXXXXXXXXXDINKE 1510 E F PLDQQDWED I+WG+SP S E Sbjct: 366 DFVEDHFWKDHPFMLANFYPLDQQDWEDKILWGNSPVPSYNNVESCEISGPELGASGGSE 425 Query: 1511 TVSEVGQQCSGLELQMEADEKYHHLFLSSYPIELESFGSRNFSD-HVNLPFTERKHPQQL 1687 E G +E Q ++K H++ + S P++LE FGSR+ S NL HPQ L Sbjct: 426 IEIESGIHNIQMEPQKVLEDKNHNVLMRSSPVKLEPFGSRDSSGAKTNLISRSLFHPQLL 485 Query: 1688 RLDSFLKVDGSTYSEGGKENETKHLDGNGAIKNFTKLSLQNKDMLEDSWLDKIIWDP-NE 1864 RL+S +VD S+ ++G ++H + +G +K FTK+ QN+DM+E SWLDKIIW+ ++ Sbjct: 486 RLESRSEVDSSSLADGRDAEISEH-NQSGQVKRFTKVISQNRDMMEGSWLDKIIWEELDQ 544 Query: 1865 AVLKPKLILDLQDEQMLFEISETRDVKHLQAHAGAMIVTRSVKPNGGDYFDLPGHLGTSV 2044 +KPKLI DLQD+QM FE+ +T+D HL HAGAMI+T S+K + GD +LPGH G+ Sbjct: 545 PSVKPKLIFDLQDDQMHFEVLDTKDGTHLCLHAGAMILTHSLKLSSGDSSELPGH-GSQY 603 Query: 2045 GRFNISNDKYYSNRXXXXXXXXXXXXRAAHGVKVLHSIPALKLQTMKPKLSNKDIANFHR 2224 G ++NDK+YSNR R+AHGVKV HS PALKLQTMK KLSNKDIANFHR Sbjct: 604 GWRYVANDKHYSNRKTSQQLKSNSKKRSAHGVKVFHSQPALKLQTMKLKLSNKDIANFHR 663 Query: 2225 PRALWYPHDNEMVAKEQGKLSTQGPIKLILKSMGGKGSKLHVDAEETISSVKAKASKKLD 2404 P+ALWYPHDNE+ KEQGKL TQGP+K+I+KS+GGKGSKLHVD EET+SSVKAKASKKLD Sbjct: 664 PKALWYPHDNEVAVKEQGKLPTQGPMKIIIKSLGGKGSKLHVDVEETLSSVKAKASKKLD 723 Query: 2405 FKPSEKVKLIYSGKELEDDKSLAAENVRPNSVLHLVRSKIHLWPKAQKLPGENKSMRPPG 2584 FK SE VK+ Y G+ELED KSLAA+NV+PNS+LHLVR+KIHLWPKAQ++PGENKS+RPPG Sbjct: 724 FKVSETVKIFYLGRELEDHKSLAAQNVQPNSLLHLVRTKIHLWPKAQRVPGENKSLRPPG 783 Query: 2585 AFKKKSDLSLKDGHVFLMEYCEERPLMLGNVGMGARLCTYYQKXXXXXXXXXXXXXXXXX 2764 AFKKKSDLS+KDGHVFLMEYCEERPL+L NVGMGARLCTYYQK Sbjct: 784 AFKKKSDLSVKDGHVFLMEYCEERPLLLSNVGMGARLCTYYQKCSPDDQSGSLLRNTDSR 843 Query: 2765 XXXVLTLDPADKSPFLGDIRPGFSQSCLETNMYRAPIFPHKLCSTDYLLVRSAKGKLSLR 2944 +++LDPADK PFLGD++PG SQS LETNMYRAPIFPHK+ TDYLLVRS+KGKLSLR Sbjct: 844 LGHIISLDPADKFPFLGDLKPGCSQSSLETNMYRAPIFPHKVPLTDYLLVRSSKGKLSLR 903 Query: 2945 RIDRIDVVGQQEPHMEVISPGTKNLQTYIWNRLSVYIYREFVANEKRGLRPFIRADDLSA 3124 RID+I+VVGQQEP MEV+SPG+KNLQTY+ NRL V++ REF A EKR L P+I D+ + Sbjct: 904 RIDKINVVGQQEPLMEVLSPGSKNLQTYMMNRLLVHMCREFQAAEKRHLPPYIGVDEFLS 963 Query: 3125 QFPNISEPFIRKRLKHCADLQRGQNGHLLWVMRRNFRIPLEEEIRRMVTPENVCAYESMQ 3304 QFP SE RK++K A+LQRG NG + V +RNFRI E+E+R+MVTPE VCAYESMQ Sbjct: 964 QFPYQSEASFRKKIKEYANLQRGTNGQSILVKKRNFRIWSEDELRKMVTPELVCAYESMQ 1023 Query: 3305 AGLYRLKRLGISRLTQPTGLSSAMNQLPDEAIALAAASHIERELQITPWNLSSNFVSCTN 3484 A LYRLK LGI+ T PT +SSAM++LPDEAIALAAASHIERELQITPWNLS NFV+CT+ Sbjct: 1024 ASLYRLKHLGITE-THPTNISSAMSRLPDEAIALAAASHIERELQITPWNLSCNFVACTS 1082 Query: 3485 QDRENIERLEITGVGDPSGRGLGFSYXXXXXXXXXXXXXXXXXXXXXXGGSTVTGTDADL 3664 Q +ENIER+EITGVGDPSGRG+GFSY GGSTVTGTDADL Sbjct: 1083 QGKENIERMEITGVGDPSGRGMGFSYARAPPKAPVSSAMVKKKAAANRGGSTVTGTDADL 1142 Query: 3665 RRLSMDAAREVLLKFNVPEEQIAKLTRWHRIAMIRKLSSEQATSGVKVDSTTISKYARGQ 3844 RRLSMDAAREVLLKFNVPEE IAK TRWHRIAMIRKLSSEQATSGVKVD TTISKYARGQ Sbjct: 1143 RRLSMDAAREVLLKFNVPEEVIAKQTRWHRIAMIRKLSSEQATSGVKVDPTTISKYARGQ 1202 Query: 3845 RMSFLQLQQQTREKCQEIWDRQVQCXXXXXXXXXXXXXXXXXXXXXFAGDLENLLDAEEC 4024 RMSFLQLQQQTREKCQEIWDRQVQ FAGDLENLLDAEEC Sbjct: 1203 RMSFLQLQQQTREKCQEIWDRQVQSLSAVNGDENESDSEGNSDLDSFAGDLENLLDAEEC 1262 Query: 4025 GVDEEGSFEFEHNKEDGIKGLKMRRRPSQPQXXXXXXXXXXXXXXLCRMLMDEDESEXXX 4204 EEG+ + + +K DG+KGLKMRRRP+ Q LCR+LMD+ E++ Sbjct: 1263 EEGEEGTNDLKRDKGDGVKGLKMRRRPTLAQAEEEIEDEAAEAAELCRLLMDDYEADRKK 1322 Query: 4205 XXXTAXXXXXXXXXXXXXTK---KINTIVKKIFRTPQPDGSFILKEQIITDPKEVEKILA 4375 +K VK+I T Q DG+ LKE ITD +E E + A Sbjct: 1323 KKKAKVMVGEARLVPKMQSKFSFDNAEQVKQITNTLQLDGTNHLKEDAITDLREEENVPA 1382 Query: 4376 KRALXXXXXXXXXXXXXXXAPIGLLKKKYTA-MGDGLK--VIKEKKQTDKPVRENFECGA 4546 K++ PI + KK MG+G+K V KEK KP RE F CGA Sbjct: 1383 KKS---KSLKVNKAKKNDIMPISIPNKKIKLNMGEGIKNQVFKEK----KPSRETFVCGA 1435 Query: 4547 CGQKGHMRTNKNCPRYGEDLDDSTE-------PGKSNFLDPPNLHQ-KTSVKKLIPKSAT 4702 CG+ GHMRTNKNCP+YGEDL+ E GKS+F+DP +L Q K KK + KSAT Sbjct: 1436 CGKAGHMRTNKNCPKYGEDLETQLESADMEKSSGKSSFVDPSSLSQHKAPSKKSMSKSAT 1495 Query: 4703 RLALVETSENSEKAGSKTKVKVKCGPAEKISEKIAPEESQNSDKQVTVDSETGTRSVNRI 4882 ++A V+ S +K +K KC EK S+K A E Q+SDK VT DSET + ++ Sbjct: 1496 KVAPVDNS-------TKIPLKFKCSSTEKSSDKPAVETLQSSDKPVTSDSETAKSA--KV 1546 Query: 4883 SRIVFSNKLKPEDTQVE---------PP-------------KLVIKPPAEVDRDQPHKKL 4996 ++I+ K+KP+DT E PP + I+PP E+DR+Q HKK+ Sbjct: 1547 NKIIIPKKVKPDDTLAESRKHAIVIRPPTDSGRGQVDSHKFPIKIRPPTEIDREQSHKKI 1606 Query: 4997 IIKQPKGNISVDQVKEESSSGLQENYRKTKTMRELSSSEKLRKQEGKRLSXXXXXXXXXX 5176 +IK+ K I ++ ++GLQ +RKTK + ELS+ EK +KQE + Sbjct: 1607 VIKRTKEVIDLELDSPGGNTGLQ--HRKTKRIVELSNFEKQKKQE--TVYGTEGFKKWNS 1662 Query: 5177 XXXXLW-EEEDKXXXXXXXXXXXXXXXXXXXXXXMQAEQQRLAELRRQE---AMWXXXXX 5344 W EE++K M EQ+RL E++R E Sbjct: 1663 KEDRRWREEQEKWRNDARLREEDRARRHHKEEIRMLKEQERLDEIKRFEEDIRREREEEE 1722 Query: 5345 XXXXXXXXXXXGPEMVDEYF-EAKPSRTDRRLPGRERATKRRSVVELGNYGAEYAPQSKR 5521 PE+ DEY + + R D+R+P R+R+ KRRSV ELG GA+Y P +KR Sbjct: 1723 RQKAKKKKKKKKPELRDEYLDDPRARRHDKRMPERDRSGKRRSVTELGKIGADYMPPTKR 1782 Query: 5522 RRGEAGEVGLANILEGIVDSL-KDATEVSYLFLKPVSKKEAPDYHDIIEHPMDLSTIKEK 5698 RRG GEVGLANILE +VD++ KD ++SYLFLKPVSKKEAPDY D+IE PMDLS I+E+ Sbjct: 1783 RRGGGGEVGLANILESVVDTIVKDRYDLSYLFLKPVSKKEAPDYLDVIERPMDLSRIRER 1842 Query: 5699 VRKMKYKDREVFRHDVWQITFNAHRYNDGRNPGIPPLADQLLELCDYLLDQNDERLAEAE 5878 VR M+YK RE FRHD+WQITFNAH+YNDGRNPGIPPLAD LLE CDYLL++ND+ L EAE Sbjct: 1843 VRNMEYKSREDFRHDMWQITFNAHKYNDGRNPGIPPLADMLLEYCDYLLNENDDSLTEAE 1902 Query: 5879 AGIEIR 5896 AGIEIR Sbjct: 1903 AGIEIR 1908 >ref|XP_003523903.1| PREDICTED: transcription initiation factor TFIID subunit 1-A-like [Glycine max] Length = 1919 Score = 1750 bits (4532), Expect = 0.0 Identities = 1014/1953 (51%), Positives = 1250/1953 (64%), Gaps = 80/1953 (4%) Frame = +2 Query: 278 NRLLGFMFGNVDNAGDLDVDYLGEDAKEHLAALADKLGSSLTDINLSIKSAHTRADASEQ 457 NR LGFMFGNVDN+GDLDVDYL EDAKEHL+ALADKLG SLTDI+LS KS T D EQ Sbjct: 28 NRFLGFMFGNVDNSGDLDVDYLDEDAKEHLSALADKLGPSLTDIDLSGKSPQTPPDVVEQ 87 Query: 458 DYDEKAEDAVDYEDIDEQYEGPEIFTTTEEDHILPKTEYFSAEVSLSVLGQRNSVFDEEN 637 D KAEDAVDYEDIDE+Y+GPE EED++LPK E+FSAE S+ L + SVFD+EN Sbjct: 88 GCDVKAEDAVDYEDIDEEYDGPETEAANEEDYLLPKKEFFSAEASVC-LESKASVFDDEN 146 Query: 638 YDEDVEFDEEQKVVNDCSEAQPIITSAVLVESLDPDCGSRLPCDASLGKNNLSSPEKDEN 817 YDED E +EQ VND + I P E Sbjct: 147 YDEDSE--KEQDFVNDDCKVDNI-------------------------------PLAGEQ 173 Query: 818 AELFVE---EEKSFEDTVHSPGSFEDDKLAVGLEDFQEDESEVFQEPLNEKSMSSLPVLY 988 E FV+ EE S E +H S + ++L ++ +E+ EV P +M LPVL Sbjct: 174 KESFVDASKEESSLEHELHVD-SPQTEELDADVQKLEEESPEV---PKRSMAMP-LPVLC 228 Query: 989 REDETVVLRFSEIFGIHEPKKKGEKRIHRYPIPIERYKVIDAYDT-VEEDEEVFFRGSCQ 1165 ED +LRFSEIFGIHEP +KGEKR HR+ IP +RYK +D D +EEDEE F +G Q Sbjct: 229 VEDGVTILRFSEIFGIHEPLRKGEKREHRHSIPRDRYKSLDLIDDFIEEDEEEFLKGFSQ 288 Query: 1166 SSFNTEELGSLSDDSMVKILDEESQISDVREGVSEKTVQTVE--RRKDSCLLAEPMKESV 1339 S T+++ + +D + + D + + +++ +V + + KDSC AEPMK Sbjct: 289 SLSLTKQVCVVHND-VSESNDVDLEFPKFGFLLADASVARKDDHQSKDSCHSAEPMKGDF 347 Query: 1340 TEKLAAGWGSPSCPKFCPLDQQDWEDGIVWGDSPF-SSRGXXXXXXXXXXXXXDINKETV 1516 E + F PLDQQDWED I+WG+SP S+ E Sbjct: 348 AEDHSRKDHPFMLANFYPLDQQDWEDEILWGNSPVPSNNNVESCEISGPELGASGGSEIE 407 Query: 1517 SEVGQQCSGLELQMEADEKYHHLFLSSYPIELESFGS-RNFSDHVNLPFTERKHPQQLRL 1693 E G Q +E Q + ++K H++ + S P+++E FGS +F NL HPQ LRL Sbjct: 408 IESGIQSIQMEPQKKLEDKDHNVLMCSSPVKVEPFGSWDSFGAKTNLISRSLFHPQLLRL 467 Query: 1694 DSFLKVDGSTYSEGGKENETKHLDGNGAIKNFTKLSLQNKDMLEDSWLDKIIWDP-NEAV 1870 +S +VD S+ ++G +E E + +G +K FTK+ QN+DM+E SWLDKIIW+ ++ + Sbjct: 468 ESRSEVDSSSLADG-REAEISEHNQSGQVKRFTKVISQNRDMMEGSWLDKIIWEELDQPM 526 Query: 1871 LKPKLILDLQDEQMLFEISETRDVKHLQAHAGAMIVTRSVKPNGGDYFDLPGHLGTSVGR 2050 +KPKLI DLQD+QM FE+ +++D HL+ HAGAMI+TRS++ GD +LPGH G+ G Sbjct: 527 VKPKLIFDLQDDQMHFEVLDSKDGTHLRLHAGAMILTRSLQSISGDSSELPGH-GSQYGW 585 Query: 2051 FNISNDKYYSNRXXXXXXXXXXXXRAAHGVKVLHSIPALKLQTMKPKLSNKDIANFHRPR 2230 +++NDK+YSNR R+AHGVKV HS PALKLQTMK KLSNKDIANFHRP+ Sbjct: 586 RHVANDKHYSNRKTSQQLKSNSKKRSAHGVKVFHSQPALKLQTMKLKLSNKDIANFHRPK 645 Query: 2231 ALWYPHDNEMVAKEQGKLSTQGPIKLILKSMGGKGSKLHVDAEETISSVKAKASKKLDFK 2410 ALWYPHDNE+ KEQGKL TQGP+K+I+KS+GGKGSKLHVDAEET+SSVKAKASKKLDFK Sbjct: 646 ALWYPHDNEVAVKEQGKLPTQGPMKIIIKSLGGKGSKLHVDAEETLSSVKAKASKKLDFK 705 Query: 2411 PSEKVKLIYSGKELEDDKSLAAENVRPNSVLHLVRSKIHLWPKAQKLPGENKSMRPPGAF 2590 SE VK+ Y G+ELED KSLAA+NV+PNS+LHLVR+KIHLWPKAQ++PGENKS+RPPGAF Sbjct: 706 VSETVKIFYLGRELEDHKSLAAQNVQPNSLLHLVRTKIHLWPKAQRVPGENKSLRPPGAF 765 Query: 2591 KKKSDLSLKDGHVFLMEYCEERPLMLGNVGMGARLCTYYQKXXXXXXXXXXXXXXXXXXX 2770 KKKSDLS+KDGHVFLME+CEERPL+L NVGMGARLCTYYQK Sbjct: 766 KKKSDLSVKDGHVFLMEHCEERPLLLSNVGMGARLCTYYQKCSPDDQSGSLLRNTDNSLG 825 Query: 2771 XVLTLDPADKSPFLGDIRPGFSQSCLETNMYRAPIFPHKLCSTDYLLVRSAKGKLSLRRI 2950 +++LDPADKSPFLGD++PG +QS LETNMYRAP+FPHK+ TDYLLVRS+KGKLSLRRI Sbjct: 826 HIISLDPADKSPFLGDLKPGCTQSSLETNMYRAPVFPHKVPLTDYLLVRSSKGKLSLRRI 885 Query: 2951 DRIDVVGQQEPHMEVISPGTKNLQTYIWNRLSVYIYREFVANEKRGLRPFIRADDLSAQF 3130 D+I+VVGQQEP MEV+SPG+KNLQ Y+ NRL V++ REF A EKR + P+IR D+ +QF Sbjct: 886 DKINVVGQQEPLMEVLSPGSKNLQNYMINRLLVHMCREFQAAEKRHMPPYIRVDEFLSQF 945 Query: 3131 PNISEPFIRKRLKHCADLQRGQNGHLLWVMRRNFRIPLEEEIRRMVTPENVCAYESMQAG 3310 P SE RK++K A+LQRG NG + V +RNFRI E+E+R+MVTPE VCAYESMQAG Sbjct: 946 PYQSEASFRKKIKEYANLQRGTNGQSILVKKRNFRIWSEDELRKMVTPELVCAYESMQAG 1005 Query: 3311 LYRLKRLGISRLTQPTGLSSAMNQLPDEAIALAAASHIERELQITPWNLSSNFVSCTNQD 3490 LYRLK LGI+ T PT +SSAM++LPDEAIALAAASHIERELQITPWNLSSNFV+CT+Q Sbjct: 1006 LYRLKHLGITE-THPTNISSAMSRLPDEAIALAAASHIERELQITPWNLSSNFVACTSQG 1064 Query: 3491 RENIERLEITGVGDPSGRGLGFSYXXXXXXXXXXXXXXXXXXXXXXGGSTVTGTDADLRR 3670 +ENIER+EITGVGDPSGRG+GFSY GGSTVTGTDADLRR Sbjct: 1065 KENIERMEITGVGDPSGRGMGFSYARAPPKAPVSSAMVKKKAAANRGGSTVTGTDADLRR 1124 Query: 3671 LSMDAARE------------------------------VLLKFNVPEEQIAKLTRWHRIA 3760 LSMDAARE VLLKFNVP+E IAK TRWHRIA Sbjct: 1125 LSMDAAREVCGILAICSELIAYVIPRVPVFNLKYGLIQVLLKFNVPDEVIAKQTRWHRIA 1184 Query: 3761 MIRKLSSEQATSGVKVDSTTISKYARGQRMSFLQLQQQTREKCQEIWDRQVQCXXXXXXX 3940 MIRKLSSEQATSGVKVD TTISKYARGQRMSFLQLQQQTREKCQEIWDRQVQ Sbjct: 1185 MIRKLSSEQATSGVKVDPTTISKYARGQRMSFLQLQQQTREKCQEIWDRQVQSLSAVNGD 1244 Query: 3941 XXXXXXXXXXXXXXFAGDLENLLDAEECGVDEEGSFEFEHNKEDGIKGLKMRRRPSQPQX 4120 FAGDLENLLDAEEC EE + + + +K DG+KGLKMRR P+ Q Sbjct: 1245 ENESDLEGNSDLDSFAGDLENLLDAEECEEGEESTNDLKRDKGDGVKGLKMRRHPTLAQA 1304 Query: 4121 XXXXXXXXXXXXXLCRMLMDEDESEXXXXXXTAXXXXXXXXXXXXXTK---KINTIVKKI 4291 LCR+LMD+DE++ +K VK+I Sbjct: 1305 EEEIEDDAAEAAELCRLLMDDDEADKKKKKKAKVIVGEARLVPKMQSKFSFDNAEQVKQI 1364 Query: 4292 FRTPQPDGSFILKEQIITDPKEVEKILAKRALXXXXXXXXXXXXXXXAPIGLLKKKYTA- 4468 T Q DG+ KE ITD +E E K++ PI + KK Sbjct: 1365 TNTLQLDGTNHWKEDAITDLREEENFPTKKS---KSLKVNKVKKNDITPISIPNKKIKLN 1421 Query: 4469 MGDGLK--VIKEKKQTDKPVRENFECGACGQKGHMRTNKNCPRYGEDLDDSTE------- 4621 MG+G+K V KEK KP RE F CGACG+ GHMRTNKNCP+YGEDL+ E Sbjct: 1422 MGEGIKNQVFKEK----KPSRETFVCGACGKAGHMRTNKNCPKYGEDLETQLESTDMEKS 1477 Query: 4622 PGKSNFLDPPNLHQ-KTSVKKLIPKSATRLALVETSENSEKAGSKTKVKVKCGPAEKISE 4798 GKS+F+DP +L Q K KK + K T++A V+ S SK +K KC EK S+ Sbjct: 1478 SGKSSFVDPSSLSQHKAPSKKSMSKGTTKIAPVDNS-------SKIPLKFKCSSTEKSSD 1530 Query: 4799 KIAPEESQNSDKQVTVDSETGTRSVNRISRIVFSNKLKPEDTQVE---------PP---- 4939 K A E Q+SDK VT DSET + ++++I+ K+KP+DTQ E PP Sbjct: 1531 KPAIESLQSSDKPVTSDSETAKSA--KVNKIIIPKKVKPDDTQAESGKHAIVIRPPTDSG 1588 Query: 4940 ---------KLVIKPPAEVDRDQPHKKLIIKQPKGNISVDQVKEESSSGLQENYRKTKTM 5092 + I+PP E+DR+Q HKK++IK+ K I ++ ++GLQ +RKTK + Sbjct: 1589 RGQVDSHKFPIKIRPPTEIDREQNHKKIVIKRTKEVIDLELDSPGGNTGLQ--HRKTKRI 1646 Query: 5093 RELSSSEKLRKQEGKRLSXXXXXXXXXXXXXXLW-EEEDKXXXXXXXXXXXXXXXXXXXX 5269 ELS+ EK +KQE + W EE++K Sbjct: 1647 VELSNFEKQKKQE--TVYGTEGFKKWNSKEDRRWQEEQEKWRNDARLREEDRARRHRKEE 1704 Query: 5270 XXMQAEQQRLAELRR--QEAMWXXXXXXXXXXXXXXXXGPEMVDEYF-EAKPSRTDRRLP 5440 M EQ+RL E++R ++ PE+ DEY + + R D+R+P Sbjct: 1705 IRMLKEQERLDEIKRFEEDIRREREEEEQQKAKKKKKKKPELRDEYLDDLRARRHDKRMP 1764 Query: 5441 GRERATKRRSVVELGNYGAEYAPQSKRRRGEAGEVGLANILEGIVDSL-KDATEVSYLFL 5617 R+R+ KRRS+ ELG GA+Y P +KRRRG GEVGLANILE +VD++ KD ++SYLFL Sbjct: 1765 ERDRSGKRRSITELGKIGADYMPPTKRRRGGGGEVGLANILESVVDTIVKDRYDLSYLFL 1824 Query: 5618 KPVSKKEAPDYHDIIEHPMDLSTIKEKVRKMKYKDREVFRHDVWQITFNAHRYNDGRNPG 5797 KPVSKKEAPDY DIIE PMDLS I+E+VR M+YK RE FRHD+WQITFNAH+YNDGRNPG Sbjct: 1825 KPVSKKEAPDYLDIIERPMDLSRIRERVRNMEYKSREDFRHDMWQITFNAHKYNDGRNPG 1884 Query: 5798 IPPLADQLLELCDYLLDQNDERLAEAEAGIEIR 5896 IPPLAD LLE CDYLL++ND+ L EAE GIEIR Sbjct: 1885 IPPLADMLLEYCDYLLNENDDSLTEAETGIEIR 1917 >ref|XP_002323740.1| histone acetyltransferase [Populus trichocarpa] gi|222866742|gb|EEF03873.1| histone acetyltransferase [Populus trichocarpa] Length = 1857 Score = 1733 bits (4489), Expect = 0.0 Identities = 997/1950 (51%), Positives = 1226/1950 (62%), Gaps = 86/1950 (4%) Frame = +2 Query: 296 MFGNVDNAGDLDVDYLGEDAKEHLAALADKLGSSLTDINLSIKSAHTRADASEQDYDEKA 475 MFGNVDN+GDLD DYL EDAKEHLAALADKLGSSLT+I+LS+KS T DA+EQDYD KA Sbjct: 1 MFGNVDNSGDLDADYLDEDAKEHLAALADKLGSSLTEIDLSVKSHQTSTDAAEQDYDAKA 60 Query: 476 EDAVDYEDIDEQYEGPEIFTTTEEDHILPKTEYFSAEVSLSVLGQRNSVFDEENYDEDVE 655 EDAVDYED DEQYEGPEI +EED++L K Y +E +L + D E+YDEDVE Sbjct: 61 EDAVDYEDFDEQYEGPEIQGVSEEDYLLSKKNYILSESTL-----QPPTSDNEDYDEDVE 115 Query: 656 FDEEQKVVNDCSEAQPIITSAVLVESLDPDCGSRLPCDASLGKNNLSSPEKDENAELFVE 835 + E+ +P+++ +L E + V Sbjct: 116 EELEK---------EPVVSDKIL--------------------------EFQTASLTGVG 140 Query: 836 EEKSFEDTVHSPGSFEDDKLAVGLEDFQEDESEVFQEPLNEKSMSSLPVLYREDETVVLR 1015 EKS +D V GS + + ED E+E++ + PL+ K S LP+L+ ED +L+ Sbjct: 141 VEKSSQDDVEL-GSMDSESSDAKSEDIHEEEADHVKGPLDGKGPSPLPILFIEDGMEILK 199 Query: 1016 FSEIFGIHEPKKKGEKRIHRYPIPIERYKVIDAYDTVEEDEEVFFRGSCQ-----SSFNT 1180 FSEIF IHEP KKG+KR HRY I E+Y +DA D VEEDEEVF + S Q N Sbjct: 200 FSEIFSIHEPLKKGQKRDHRYSIFKEKYTSMDASDIVEEDEEVFLKDSGQLFPSHLLVNQ 259 Query: 1181 EELGSLSDDSMVKILDEESQISDVREGVSEKTVQTVERRKDSCLLAEPMKESVTEKLAAG 1360 ++ LS+D+ E ++ V G + +VQ E+RK+S L AEPM E V Sbjct: 260 HDISILSEDAA-----ELARFGTV-HGAIKTSVQIEEQRKNSYLSAEPMNEEVE------ 307 Query: 1361 WGSPSCPKFCPLDQQDWEDGIVWGDSP-FSSRGXXXXXXXXXXXXXDINKETVSEVGQQC 1537 W SP KF PLDQQDWE+ I+W +SP S +E+ Q Sbjct: 308 WKSPVHSKFYPLDQQDWEERILWDNSPAISDNSVESFDLSGPDTGSSFIRESEQVTSPQN 367 Query: 1538 SGLELQMEADEKYHHLFLSSYPIELESFGSRNFSDHVNLPFTE-RKHPQQLRLDSFLKVD 1714 EL +E +E + + + LESFGS + S+ NLPF+E R HPQ LRL+S ++VD Sbjct: 368 LCSELPVELNENTSNFLRNRSSVLLESFGSEDSSEPGNLPFSESRCHPQLLRLESQMEVD 427 Query: 1715 GSTYSEGGKENETKHLDGNGAIKNFTKLSLQNKDMLEDSWLDKIIWDPNEAVLKPKLILD 1894 S++ + +EN + L + A++ F+KL+LQN+D++E SWLD IIW+PNE +KPKLILD Sbjct: 428 SSSHVDDRRENNSAELHESDAVRRFSKLTLQNRDLMEGSWLDNIIWEPNETNIKPKLILD 487 Query: 1895 LQDEQMLFEISETRDVKHLQAHAGAMIVTRSVKPNGGDYFDLPGHLGTSVGRFNISNDKY 2074 LQD+QMLFEI + RD KHLQ HAGAMI+TR++K +L GH S +FNI+NDK+ Sbjct: 488 LQDKQMLFEILDHRDSKHLQLHAGAMIITRTLKQRVS--HELLGHGNRSGWQFNIANDKF 545 Query: 2075 YSNRXXXXXXXXXXXXRAAHGVKVLHSIPALKLQTMKPKLSNKDIANFHRPRALWYPHDN 2254 Y NR R A+G+K+ HS PA+KLQTMK KLSNKD+ANFHRP+ALWYPHD+ Sbjct: 546 YMNRKISQQLQSNSNKRTAYGIKIHHSAPAIKLQTMKLKLSNKDLANFHRPKALWYPHDH 605 Query: 2255 EMVAKEQGKLSTQGPIKLILKSMGGKGSKLHVDAEETISSVKAKASKKLDFKPSEKVKLI 2434 E+ KE+GKL T GP+K+ILKS+GGKGSK+HVDAEET+SSVKAKASKKLDFKPSE VK+ Sbjct: 606 EVAVKERGKLPTVGPMKIILKSLGGKGSKVHVDAEETVSSVKAKASKKLDFKPSETVKIF 665 Query: 2435 YSGKELEDDKSLAAENVRPNSVLHLVRSKIHLWPKAQKLPGENKSMRPPGAFKKKSDLSL 2614 Y KELED SLAA+NV+PNS+LHLVR+KIHLWP+AQK+PGENKS+RPPGAFKKKSDLS+ Sbjct: 666 YLRKELEDHMSLAAQNVQPNSLLHLVRTKIHLWPRAQKIPGENKSLRPPGAFKKKSDLSV 725 Query: 2615 KDGHVFLMEYCEERPLMLGNVGMGARLCTYYQKXXXXXXXXXXXXXXXXXXXXVLTLDPA 2794 KDGH+FLMEYCEERPL+L NVGMGA L TYYQK V+ L+ Sbjct: 726 KDGHIFLMEYCEERPLLLSNVGMGANLRTYYQKSSPGDQTGISLRNEKRSLGNVVILEQT 785 Query: 2795 DKSPFLGDIRPGFSQSCLETNMYRAPIFPHKLCSTDYLLVRSAKGKLSLRRIDRIDVVGQ 2974 DKSPFLGDI+ G SQS LETNMY+APIFPHK+ TDYLLVRSAKGKL LRRIDR+ V+GQ Sbjct: 786 DKSPFLGDIKAGCSQSSLETNMYKAPIFPHKVPPTDYLLVRSAKGKLCLRRIDRVAVIGQ 845 Query: 2975 QEPHMEVISPGTKNLQTYIWNRLSVYIYREFVANEKRGLRPFIRADDLSAQFPNISEPFI 3154 QEP MEV++P +KNLQ YI NRL +Y+YRE A EKRG P+IRAD+LSA FP+I E + Sbjct: 846 QEPLMEVLAPASKNLQAYIINRLLLYLYRELRAAEKRGTPPWIRADELSALFPSIPETIL 905 Query: 3155 RKRLKHCADLQRGQNGHLLWVMRRNFRIPLEEEIRRMVTPENVCAYESMQAGLYRLKRLG 3334 RK+LK CA L++ NGHL W +R+F IP EEE+++MV PENVCAYESMQAGLYRLK LG Sbjct: 906 RKKLKECAVLRKDANGHLFWAKKRDFIIPSEEELKKMVLPENVCAYESMQAGLYRLKHLG 965 Query: 3335 ISRLTQPTGLSSAMNQLPDEAIALAAASHIERELQITPWNLSSNFVSCTNQDRENIERLE 3514 I++LT P +S+AM+QLPDEAIALAAASHIERELQITPW+LSSNFV+CTNQDR NIERLE Sbjct: 966 ITKLTLPASVSTAMSQLPDEAIALAAASHIERELQITPWSLSSNFVACTNQDRANIERLE 1025 Query: 3515 ITGVGDPSGRGLGFSYXXXXXXXXXXXXXXXXXXXXXXGGSTVTGTDADLRRLSMDAARE 3694 ITGVGDPSGRGLGFSY GGSTVTGTDADLRRLSM+AARE Sbjct: 1026 ITGVGDPSGRGLGFSYVRAAPKAPMSNAMMKKKAGAGRGGSTVTGTDADLRRLSMEAARE 1085 Query: 3695 VLLKFNVPEEQIAKLTRWHRIAMIRKLSSEQATSGVKVDSTTISKYARGQRMSFLQLQQQ 3874 VLLKFNVP+EQIAK TRWHRIAMIRKLSSEQA+ GVKVD TTISKYARGQRMSFLQLQQQ Sbjct: 1086 VLLKFNVPDEQIAKQTRWHRIAMIRKLSSEQASCGVKVDPTTISKYARGQRMSFLQLQQQ 1145 Query: 3875 TREKCQEIWDRQVQCXXXXXXXXXXXXXXXXXXXXXFAGDLENLLDAEECGVDEEGSFEF 4054 TREKCQEIWDRQVQ FAGDLENLLDAEE D E ++E Sbjct: 1146 TREKCQEIWDRQVQSLSALDGDEIESDSEANSDLDSFAGDLENLLDAEEFEGD-ESNYES 1204 Query: 4055 EHNKEDGIKGLKMRRRPSQPQXXXXXXXXXXXXXXLCRMLMDEDESEXXXXXXTAXXXXX 4234 +H+K D +KG+KMRRRPSQ Q LCR+LMD+DE+ Sbjct: 1205 KHDKGDCVKGIKMRRRPSQAQAEEEFEDEAAEAAELCRLLMDDDEAGQKKKKKIKTGGLN 1264 Query: 4235 XXXXXXXXTKKINTIVKKIFRTPQPDGSFILKEQIITDPKEVEKILAKRALXXXXXXXXX 4414 + N K QP GS+ KE I D KEV Sbjct: 1265 AVLAPKKPSFVDNVHRGKKMNKTQPSGSYTPKENSIRDSKEV------------------ 1306 Query: 4415 XXXXXXAPIGLLKKK---------YTAMGDGLK-----VIKEKKQTDKPVRENFECGACG 4552 + + +LKKK G G+ + KEKK RE F CGACG Sbjct: 1307 ---IIHSTLTMLKKKKKKKNNNNNKKRQGKGISISRKIIFKEKKS----AREKFVCGACG 1359 Query: 4553 QKGHMRTNKNCPRYGEDLDDSTE-------PGKSNFLDPPNLHQ-KTSVKKLIPKSATRL 4708 Q GHM+TNKNCP+YG++ + E KS D N+ Q K K+++ KSAT+ Sbjct: 1360 QLGHMKTNKNCPKYGKEPETPVETTDLEKASRKSTSQDLLNVSQHKLQKKRMVSKSATK- 1418 Query: 4709 ALVETSENSEKAGSKT-KVKVKCGPAEKISEKIAPEESQNSD-------KQVTVDSETGT 4864 VE SE + + +K+ VK KCG EK S+K A + +SD + V+ D +TG+ Sbjct: 1419 --VEVSEGEKSSLAKSLPVKFKCGSTEKFSDKPADGAADHSDQPTTSDVRPVSSDIDTGS 1476 Query: 4865 RSVNRISRIVFSNKLKPEDTQVE------------------------------------- 4933 RS ++++I NK KPE+ QVE Sbjct: 1477 RSTAKVNKIKIFNKAKPENIQVESHKPSIVIRPPMDIERSQIESHKPSIVIRPPTYRDRN 1536 Query: 4934 -----PPKLVIKPPAEVDRDQPHKKLIIKQPKGNISVDQVKEESSSGLQENYRKTKTMRE 5098 P +VI+PPAE DR++ KK++IKQ K I D+V ++ +G +RKTK + E Sbjct: 1537 HVDPHKPSIVIRPPAEKDREKTQKKIVIKQSKEIIDPDRVSQDGRTG--REHRKTKKIAE 1594 Query: 5099 LSSSEKLRKQEGKRLSXXXXXXXXXXXXXXLWEEEDKXXXXXXXXXXXXXXXXXXXXXXM 5278 LSS EK GK + WEEE+K + Sbjct: 1595 LSSFEK----HGKTMHFSRESAKRKAEDRSWWEEEEKRRTAERLREERARRIYAEEMRSL 1650 Query: 5279 QAEQQRLAELRR--QEAMW----XXXXXXXXXXXXXXXXGPEMVDEYF-EAKPSRTDRRL 5437 + EQ++LA+++R + W PE+ D+Y + + +R RR+ Sbjct: 1651 E-EQEKLADIKRYTETIRWDWDEEERQKAKKKKKKMKMKKPEISDDYLDDYRGARNGRRM 1709 Query: 5438 PGRERATKRRSVVELGNYGAEYAPQSKRRRGEAGEVGLANILEGIVDSLKDATEVSYLFL 5617 P R+R KRR VV++G YGA+Y P +KRRR GEVGLANILEGIVD+LKD EVSYLFL Sbjct: 1710 PERDRGAKRRPVVDVGTYGADYTPATKRRR--VGEVGLANILEGIVDALKDRVEVSYLFL 1767 Query: 5618 KPVSKKEAPDYHDIIEHPMDLSTIKEKVRKMKYKDREVFRHDVWQITFNAHRYNDGRNPG 5797 KPV KKEAPDY DII+ PMDLSTI++K RKM+YKDR FRHD+WQI +NAH YNDGRNPG Sbjct: 1768 KPVPKKEAPDYLDIIKRPMDLSTIRDKARKMEYKDRNEFRHDMWQIAYNAHVYNDGRNPG 1827 Query: 5798 IPPLADQLLELCDYLLDQNDERLAEAEAGI 5887 IPPLADQLLELCDYLL + E L+EAEAGI Sbjct: 1828 IPPLADQLLELCDYLLMEKQESLSEAEAGI 1857