BLASTX nr result
ID: Akebia23_contig00007893
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00007893
         (1934 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260...   417   e-113
ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246...   403   e-109
emb|CBI21214.3| unnamed protein product [Vitis vinifera]              390   e-105
emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera]   387   e-105
ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma...   359   3e-96
ref|XP_006354362.1| PREDICTED: transcription initiation factor T...   352   3e-94
ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313...   350   1e-93
ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [A...   350   2e-93
ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264...   350   2e-93
ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citr...   349   2e-93
gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis]     348   5e-93
ref|XP_007215547.1| hypothetical protein PRUPE_ppa007206mg [Prun...   342   4e-91
ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Popu...   339   2e-90
ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Popu...   338   5e-90
ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215...   328   5e-87
ref|XP_002519508.1| conserved hypothetical protein [Ricinus comm...   323   2e-85
ref|XP_003552582.1| PREDICTED: transcription initiation factor T...   322   4e-85
ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292...   317   9e-84
ref|XP_003531863.1| PREDICTED: transcription initiation factor T...
314 8e-83 ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus... 314 1e-82 >ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260255 [Vitis vinifera] Length = 368 Score = 417 bits (1071), Expect = e-113 Identities = 211/368 (57%), Positives = 267/368 (72%), Gaps = 14/368 (3%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIR 711 MSDGG ++ R ++++ ++ G D F RA++++AVAQICE+ GF+ F +SAL+ALSNI +R Sbjct: 1 MSDGGEDDRRNSDNNAPKRAGPDEFGRAVSKIAVAQICESVGFEGFQDSALQALSNIAVR 60 Query: 712 YLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDIIQ 852 YL D+GKTANF ANLAGRT CNVFD+I+ LEDLGSS+ S +++I++ Sbjct: 61 YLCDVGKTANFCANLAGRTQCNVFDVIRGLEDLGSSEGFSGASGVDQCIVSSGTVREIVE 120 Query: 853 YVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVW 1032 YV+ A+EIPFA+ +PRFPV+RN K PSF+Q+GETP GKHIPPWLPAFPDSHTY+ TP+W Sbjct: 121 YVNSAKEIPFAQPVPRFPVVRNCKATPSFVQMGETPVGKHIPPWLPAFPDSHTYIQTPMW 180 Query: 1033 NERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKE-KWVVKNNP 1209 NER TDPR DK+ LLSLQQRL CN S S + R E + NP Sbjct: 181 NERATDPRADKLEQARQRRKAERSLLSLQQRLVCNGSASASTSVGRCDDAEASRAAEGNP 240 Query: 1210 FLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAKHGLCDSGDGERKVL 1389 +L PL+FGEK+VS VVLPAKLL++ V+N VSVLETFAPAIEA K+ DSG+ E+ V+ Sbjct: 241 YLASPLQFGEKDVSTVVLPAKLLDDLVVDNHVSVLETFAPAIEAVKNSFVDSGESEKNVV 300 Query: 1390 PNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKASME 1569 P KR VHFK GKK LG ++DL L++K VG+V S GRDEE+DDKKRRAE IL+ SME Sbjct: 301 PEKRSAVHFKLRTGKKILGESVDLRLKNKSVGKVVSLIGRDEERDDKKRRAEYILRQSME 360 Query: 1570 NPHDLAQL 1593 NP +L QL Sbjct: 361 NPQELTQL 368 >ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246447 [Vitis vinifera] Length = 377 Score = 403 bits (1035), Expect = e-109 Identities = 213/377 (56%), Positives = 260/377 (68%), Gaps = 23/377 (6%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIR 711 MSDGGGE+GRE++ +RK+ +F +AIA++AVAQICE+ GFQ F +SALE LS +V+R Sbjct: 1 
MSDGGGESGRESDRATKRKSSDRDFPQAIAKIAVAQICESAGFQGFQQSALETLSEVVVR 60 Query: 712 YLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDIIQ 852 Y+R+LGKTA+ YAN A RT CN+FDIIQ LEDL S Q S +++I+Q Sbjct: 61 YIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVREIVQ 120 Query: 853 YVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVW 1032 YVS AEEIPFA ++P FPV+R+ K PSFLQIGE P G HIP WLPAFPD TYVH+PV Sbjct: 121 YVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHSPVL 180 Query: 1033 NERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNNPF 1212 NER DP I LL+LQQ+LACN E P+++D + K + + NPF Sbjct: 181 NERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAETNPF 240 Query: 1213 LTPPLKFGEKEVSPVVLPAKLLNETNVENR----------VSVLETFAPAIEAAKHGLCD 1362 L+ PL FGEK VSPV LPAKL NE VEN+ VSVLETFAPAIE K C+ Sbjct: 241 LSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGENHAVANHVSVLETFAPAIELMKSRSCE 300 Query: 1363 SGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRA 1542 S +G +KVL N+R V FK IGKK GTALDLS Q+K V ++ SWFG+D EKDDKKRRA Sbjct: 301 SEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKDDKKRRA 360 Query: 1543 EQILKASMENPHDLAQL 1593 E+ILK SM+NP +LAQL Sbjct: 361 EKILKESMKNPQELAQL 377 >emb|CBI21214.3| unnamed protein product [Vitis vinifera] Length = 357 Score = 390 bits (1001), Expect = e-105 Identities = 205/367 (55%), Positives = 251/367 (68%), Gaps = 13/367 (3%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIR 711 MSDGGGE+GRE++ +RK+ +F +AIA++AVAQICE+ GFQ F +SALE LS +V+R Sbjct: 1 MSDGGGESGRESDRATKRKSSDRDFPQAIAKIAVAQICESAGFQGFQQSALETLSEVVVR 60 Query: 712 YLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDIIQ 852 Y+R+LGKTA+ YAN A RT CN+FDIIQ LEDL S Q S +++I+Q Sbjct: 61 YIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVREIVQ 120 Query: 853 YVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVW 1032 YVS AEEIPFA ++P FPV+R+ K PSFLQIGE P G HIP WLPAFPD TYVH+PV Sbjct: 121 YVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHSPVL 180 Query: 1033 
NERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNNPF 1212 NER DP I LL+LQQ+LACN E P+++D + K + + NPF Sbjct: 181 NERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAETNPF 240 Query: 1213 LTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAKHGLCDSGDGERKVLP 1392 L+ PL FGEK VSPV LPAKL NE TFAPAIE K C+S +G +KVL Sbjct: 241 LSAPLHFGEKGVSPVFLPAKLSNEA----------TFAPAIELMKSRSCESEEGRKKVLS 290 Query: 1393 NKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKASMEN 1572 N+R V FK IGKK GTALDLS Q+K V ++ SWFG+D EKDDKKRRAE+ILK SM+N Sbjct: 291 NQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKDDKKRRAEKILKESMKN 350 Query: 1573 PHDLAQL 1593 P +LAQL Sbjct: 351 PQELAQL 357 >emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera] Length = 366 Score = 387 bits (995), Expect = e-105 Identities = 208/377 (55%), Positives = 258/377 (68%), Gaps = 23/377 (6%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIR 711 MSDGGGE+GRE++ +RK+ +F +AIA++AVAQICE+ GFQ F +SALE LS +V+R Sbjct: 1 MSDGGGESGRESDRATKRKSSDRDFPQAIAKIAVAQICESAGFQGFQQSALETLSEVVVR 60 Query: 712 YLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDIIQ 852 Y+R+LGKTA+ YAN A RT CN+FDIIQ LEDL S Q S +++I+Q Sbjct: 61 YIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVREIVQ 120 Query: 853 YVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVW 1032 YVS AEEIPFA ++P FPV+R+ K PSFLQIGE P G HIP WLPAFPD TYVH+PV Sbjct: 121 YVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHSPVT 180 Query: 1033 NERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNNPF 1212 E+ + + LL+LQQ+LACN E P+++D + K + + NPF Sbjct: 181 LEQARQHKKAE-----------WSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAETNPF 229 Query: 1213 LTPPLKFGEKEVSPVVLPAKLLNETNVENR----------VSVLETFAPAIEAAKHGLCD 1362 L+ PL FGEK VSPV LPAKL NE VEN+ VSVLETFAPAIE K C+ Sbjct: 230 LSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGENHAVANHVSVLETFAPAIELMKSRSCE 289 Query: 1363 SGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRA 1542 S +G +KVL N+R V FK IGKK GTALDLS Q+K V ++ SWFG+D 
EKDDKKRRA Sbjct: 290 SEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKDDKKRRA 349 Query: 1543 EQILKASMENPHDLAQL 1593 E+ILK SM+NP +LAQL Sbjct: 350 EKILKESMKNPQELAQL 366 >ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma cacao] gi|508715998|gb|EOY07895.1| TBP-associated factor 8, putative [Theobroma cacao] Length = 373 Score = 359 bits (921), Expect = 3e-96 Identities = 203/376 (53%), Positives = 257/376 (68%), Gaps = 23/376 (6%) Frame = +1 Query: 532 MSDGGGENGRET-EHDGER-----KTGGDNFSRAIARVAVAQICENNGFQSFHESALEAL 693 MS GG E+ R+T E +G+R + D+F RA+++++VAQICE G+Q F ESALEAL Sbjct: 1 MSHGGVESTRDTRESEGQRSLPLGRPKADDFGRAVSKISVAQICECVGYQGFKESALEAL 60 Query: 694 SNIVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------V 834 ++I IRYL DLGKT++F+ANLAGRT CN+FDI Q LE+LG+S S Sbjct: 61 ADIAIRYLCDLGKTSSFHANLAGRTECNMFDITQSLEELGASYGFSGASEIGHCLAGSGA 120 Query: 835 MQDIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTY 1014 +++IIQ+V EEIPFA+ +P+FPV+RN K IPSF + ETP GKHIP WLPAFPD HTY Sbjct: 121 VREIIQFVGSKEEIPFAQPVPQFPVVRNRKLIPSFEHMNETPPGKHIPAWLPAFPDPHTY 180 Query: 1015 VHTPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVS--ESPALVDDRVSGKEK 1188 +HTP+WNER +DPR DKI LLSLQQRL CN S S +LV V K++ Sbjct: 181 IHTPMWNERASDPRADKIEQARQRRKAERALLSLQQRLVCNGSTETSASLV---VDAKKE 237 Query: 1189 WVVK--NNPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAKHGLCD 1362 + + NN FL PL+ GEK+V+ VVLPAKL +E + +N VS+LE FAPAIEA K G Sbjct: 238 TIQEAGNNAFLAAPLQPGEKDVARVVLPAKLSDEVSKDNHVSLLEAFAPAIEAMKGGPSG 297 Query: 1363 SGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRA 1542 DGE+ +LP +R VHFKF GKK LG +LDLSLQ KG ++F RDEE+DDKKRRA Sbjct: 298 ELDGEKMLLPERRPAVHFKFRTGKKILGESLDLSLQKKGERST-TFFLRDEERDDKKRRA 356 Query: 1543 EQILKASMENPHDLAQ 1590 E IL+ + E P +L Q Sbjct: 357 EFILRQTTEYPMELNQ 372 >ref|XP_006354362.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Solanum tuberosum] Length = 374 Score = 352 bits (903), Expect = 3e-94 Identities = 192/370 (51%), Positives = 241/370 (65%), Gaps = 19/370 
(5%) Frame = +1 Query: 541 GGGENGRETE----HDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVI 708 G E+ RE E + E + G D+F RAI+R AVAQICE+ GF+ F+ESALE+L++I I Sbjct: 5 GNAEDKREKESTVDNTREERAGTDDFGRAISRTAVAQICESIGFEIFNESALESLADIAI 64 Query: 709 RYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYV-------------SVMQDII 849 +Y+ DLGKTA+ ANLAGRT CNVFDII LED+ +S ++ +++ Sbjct: 65 KYILDLGKTASSSANLAGRTQCNVFDIIHGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124 Query: 850 QYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPV 1029 +YV AEEIPF++ +P FPV+++P IPSFLQIGETP KHIPPWLPAFPD HTYV TP Sbjct: 125 EYVESAEEIPFSQPLPHFPVVKHPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184 Query: 1030 WNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSE--SPALVDDRVSGKEKWVVKN 1203 WNER +DPR DKI LL+LQQRL CN S S + D V Sbjct: 185 WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVGSTSRQPDDVGITSSASKSE 244 Query: 1204 NPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAKHGLCDSGDGERK 1383 NPFL P + GEK+V PV LP KL +E + +N VS+LETF+PAI+A K GL ++ +G K Sbjct: 245 NPFLAKPFQAGEKDVDPVALPTKLSSEVDDKNHVSLLETFSPAIQAMKDGLSETVNGTEK 304 Query: 1384 VLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKAS 1563 LP+KR V +F GKK LG +LDL L KG G S F RDE++DDKKRRAE IL+ S Sbjct: 305 TLPDKRPAVCLEFRPGKKALGDSLDLRLWKKGSGRNASLFRRDEDRDDKKRRAELILRQS 364 Query: 1564 MENPHDLAQL 1593 EN +L QL Sbjct: 365 RENQQELTQL 374 >ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313446 [Fragaria vesca subsp. 
vesca] Length = 390 Score = 350 bits (898), Expect = 1e-93 Identities = 199/389 (51%), Positives = 251/389 (64%), Gaps = 36/389 (9%) Frame = +1 Query: 532 MSDGGGENGRETEH-----DGERKT------GGDNFSRAIARVAVAQICENNGFQSFHES 678 MS G E+ R E D R+ GGD F RA+++VAVAQICE GF ES Sbjct: 1 MSHGDAESSRVNESGSGEDDAPRRAQQLSGGGGDEFGRAVSKVAVAQICEGVGFLGCKES 60 Query: 679 ALEALSNIVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS--------- 831 AL++L++I IRYLRDLGK AN+YANLAGRT NVFD+++ LEDL +SQ S Sbjct: 61 ALDSLADIAIRYLRDLGKMANYYANLAGRTESNVFDVVRGLEDLEASQGFSGAAEVRHCL 120 Query: 832 ----VMQDIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFP 999 M+ ++QYV AEEIPFA+++PRFPV+++ + I SF ++GE P GKH+P WLPAFP Sbjct: 121 AGSGTMKGLVQYVGTAEEIPFAQSLPRFPVVKDRRLILSFERMGEAPPGKHLPNWLPAFP 180 Query: 1000 DSHTYVHTPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVS----ESP----A 1155 D HTY+H+P+WNER+TDPR DKI LLSLQQRL CN S SP + Sbjct: 181 DPHTYIHSPMWNERKTDPREDKIEQARQRRKAERSLLSLQQRLLCNGSAPGLASPSAPVS 240 Query: 1156 LVDDRVSGKEKWVVKNNPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAI 1335 +V + G + ++NPFL PPL+ GEK+VSPVVLP+K N SVLE FAPAI Sbjct: 241 VVGNDGKGLKLQGGESNPFLEPPLQPGEKDVSPVVLPSKFSEVLAKGNSSSVLEAFAPAI 300 Query: 1336 EAAKHGLCDSGDG----ERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWF 1503 +A K+G+ G+G E K+LPN R VH KF KK LG + DLSLQ KG G +W Sbjct: 301 QAVKNGVWMDGEGDVEEESKLLPNSRPPVHLKFRPVKKFLGESSDLSLQKKGSGRPANWV 360 Query: 1504 GRDEEKDDKKRRAEQILKASMENPHDLAQ 1590 RDEE+D+KKRRAE IL+ SM+NP +L Q Sbjct: 361 LRDEERDEKKRRAEFILRQSMQNPQELNQ 389 >ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [Amborella trichopoda] gi|548848527|gb|ERN07558.1| hypothetical protein AMTR_s00154p00079940 [Amborella trichopoda] Length = 375 Score = 350 bits (897), Expect = 2e-93 Identities = 201/383 (52%), Positives = 252/383 (65%), Gaps = 29/383 (7%) Frame = +1 Query: 532 MSDGGGENGR-----ETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALS 696 M+DGGGE+ R ++E GE++ D F RA+ RV+VAQICE+ G+ +F SALEAL+ Sbjct: 1 MNDGGGESRRNIDECKSERGGEQEE--DEFGRAVTRVSVAQICESAGYHTFQRSALEALA 58 Query: 697 
NIVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VM 837 +I +RYLRDLG++A F+ANLAGRT CNVFD+IQ LEDLGSSQ + + Sbjct: 59 DIALRYLRDLGRSARFHANLAGRTACNVFDVIQALEDLGSSQGFAGASDVNHPLAASGAL 118 Query: 838 QDIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYV 1017 +DII+Y ++AEEIPFAR +PRFP+ + KP PSFLQ+GETP KHIP WLPAFPD HTY+ Sbjct: 119 KDIIRYTNIAEEIPFARAVPRFPIPKTRKPTPSFLQLGETPPHKHIPSWLPAFPDPHTYI 178 Query: 1018 HTPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVV 1197 HTPVWNER +DPRT+K+ L+SLQQRLACN + A +D + GK + Sbjct: 179 HTPVWNERGSDPRTEKLEQARQRRKAEKSLVSLQQRLACN-GATMASMDGELKGKRP-LD 236 Query: 1198 KNNPFLTPPLKFGEKEVSPVVLPAKL---LNETNVENR---VSVLETFAPAIEAAK-HGL 1356 NNPFL PPL GEKE S V +PA L + N+E + +SV+ FAPA EAAK GL Sbjct: 237 GNNPFLAPPLLSGEKEASLVPMPAGLSLKSPDENIEKKPGGLSVVNAFAPANEAAKGGGL 296 Query: 1357 CDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDL----SLQDKGVGEVGSWFGRDEEKD 1524 D E + L KR V FKFG+ K+ + A L + G SWF RDEEKD Sbjct: 297 ID----EARQLKPKRPVVQFKFGLDKRTVNPAPLLFGNRYNRTGGNATDMSWFSRDEEKD 352 Query: 1525 DKKRRAEQILKASMENPHDLAQL 1593 DKK+RAEQILK +MENP +L QL Sbjct: 353 DKKKRAEQILKEAMENPQELVQL 375 >ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264247 [Solanum lycopersicum] Length = 373 Score = 350 bits (897), Expect = 2e-93 Identities = 192/370 (51%), Positives = 240/370 (64%), Gaps = 19/370 (5%) Frame = +1 Query: 541 GGGENGRETE----HDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVI 708 G E+ RE E + E + G D+F RA++R AVAQICE+ GF+ F+ESALE+L++I I Sbjct: 5 GNAEDKREKESTVDNTREERIGTDDFGRAVSRTAVAQICESIGFEIFNESALESLADIAI 64 Query: 709 RYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYV-------------SVMQDII 849 +Y+ DLGKTAN AN+AGRT CNVFDIIQ LED+ +S ++ +++ Sbjct: 65 KYILDLGKTANSKANIAGRTQCNVFDIIQGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124 Query: 850 QYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPV 1029 +YV AEEIPF++ +P FPV++ P IPSFLQIGETP KHIPPWLPAFPD HTYV TP Sbjct: 125 EYVESAEEIPFSQPLPHFPVVKQPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184 Query: 1030 
WNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVS--ESPALVDDRVSGKEKWVVKN 1203 WNER +DPR DKI LL+LQQRL CN S S + D V Sbjct: 185 WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVASTSRQPDDVGITSSASKSE 244 Query: 1204 NPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAKHGLCDSGDGERK 1383 NPFL P + GEK+V PV LP KL +E + +N VS+LETF+PAI+A K GL ++ DG K Sbjct: 245 NPFLAKPFQAGEKDVDPVALPTKLSSEVDDKNHVSLLETFSPAIQAMKDGLSETVDGTEK 304 Query: 1384 VLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKAS 1563 LP+KR V +F GKK LG +LDL L KG S F RDE++DDKKRRAE IL+ S Sbjct: 305 TLPDKRPAVCLEFRPGKKALGDSLDLRLWKKG-SRNASLFRRDEDRDDKKRRAELILRQS 363 Query: 1564 MENPHDLAQL 1593 EN +L QL Sbjct: 364 RENQQELTQL 373 >ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citrus clementina] gi|568880174|ref|XP_006493009.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Citrus sinensis] gi|568885488|ref|XP_006495304.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Citrus sinensis] gi|557530450|gb|ESR41633.1| hypothetical protein CICLE_v10012002mg [Citrus clementina] Length = 370 Score = 349 bits (896), Expect = 2e-93 Identities = 190/371 (51%), Positives = 249/371 (67%), Gaps = 17/371 (4%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTG---GDNFSRAIARVAVAQICENNGFQSFHESALEALSNI 702 M+ GGGE+ +E + + ++FSRA++++AVAQICE+ GFQ F +SAL+AL +I Sbjct: 1 MNHGGGESTSRSESRTDTSSDRPKAEDFSRAVSKMAVAQICESVGFQGFKDSALDALLDI 60 Query: 703 VIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDL-------GSSQY------VSVMQD 843 IRY+ DLGKT++F ANLA RT CN+FDII+ +EDL G+++ ++++ Sbjct: 61 AIRYICDLGKTSSFQANLACRTECNLFDIIRGIEDLEVLKGFMGAAEIGKCLVGSGIVKE 120 Query: 844 IIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHT 1023 II +V EEIPFA+ IP++PV+R+ + IPSF ++ ETP GKHIP WLPAFPD HTY++T Sbjct: 121 IIDFVESKEEIPFAQPIPQYPVIRSRRLIPSFEEMNETPPGKHIPSWLPAFPDPHTYIYT 180 Query: 1024 PVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKN 1203 P+WNER++DPR DKI LLSLQQRL CN + +E + Sbjct: 181 PMWNERKSDPRADKIELARQRRKAEMALLSLQQRLVCNGETGTSASRPANDEEELLKTGS 240 Query: 
1204 NPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAK-HGLCDSGDGER 1380 NPF PL+ GEK++SPV LPAKL ++ + N +SV+E FAPAIEA K G D DG+R Sbjct: 241 NPFFAKPLQSGEKDISPVGLPAKLKDKMSGGNHMSVMEAFAPAIEAVKVSGFSDDADGDR 300 Query: 1381 KVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKA 1560 + LP KR VHFKF GKK LG LD SLQ KG G + F RDEEKDDKKRRAE ILK Sbjct: 301 RYLPEKRPAVHFKFRAGKKFLGEILDSSLQKKG-GRRSASFWRDEEKDDKKRRAEFILKQ 359 Query: 1561 SMENPHDLAQL 1593 S+ENP +L+QL Sbjct: 360 SIENPQELSQL 370 >gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis] Length = 372 Score = 348 bits (893), Expect = 5e-93 Identities = 198/376 (52%), Positives = 237/376 (63%), Gaps = 22/376 (5%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIR 711 M G R EH G G D+F RA++++ VAQICE+ GFQS ESAL+AL+NI IR Sbjct: 1 MGHGEANGTRVNEHGGG---GADDFGRAVSKIVVAQICESVGFQSSKESALDALANIAIR 57 Query: 712 YLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYV-------------SVMQDIIQ 852 YL DLGK AN YANL GRT CNVFDII+ LE L +SQ M++I Sbjct: 58 YLCDLGKIANSYANLTGRTECNVFDIIRALEVLEASQGFPGAGDVGHCLVRSGAMKEIAT 117 Query: 853 YVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVW 1032 YV AEEIPFA+ +PRFPVL+N + I SF Q+GE P G+HIP WLPA PD HTY+H+P+W Sbjct: 118 YVDSAEEIPFAQPVPRFPVLKNRRLILSFEQMGENPLGQHIPTWLPALPDPHTYIHSPMW 177 Query: 1033 NERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRV------SGKEKWV 1194 NER T+PR K+ LLSLQQRLA NV + A V G E Sbjct: 178 NERNTEPRLHKLEHARQRRKAERSLLSLQQRLARNVGYAGASTSAAVPPLVGGDGNESKQ 237 Query: 1195 VKNNPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAA-KHGLCDSGD 1371 V+ N FL PPL GEK+VSP+V P K+L+E + SVLE FAPAIEA K G + G+ Sbjct: 238 VERNLFLEPPLHPGEKDVSPIVFPGKILDERGKGDHASVLEAFAPAIEAVKKSGFSEYGE 297 Query: 1372 GERKVLP--NKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAE 1545 ER+VLP R + FKF KK G +LDLSL+ K VG WFGRDEE+DDKKRRAE Sbjct: 298 DERRVLPGIEARPAIQFKFRTAKKYFGESLDLSLK-KAVGRPAFWFGRDEERDDKKRRAE 356 Query: 1546 QILKASMENPHDLAQL 1593 IL+ SMENP +L QL Sbjct: 357 FILRQSMENPQELNQL 372 >ref|XP_007215547.1| 
hypothetical protein PRUPE_ppa007206mg [Prunus persica] gi|462411697|gb|EMJ16746.1| hypothetical protein PRUPE_ppa007206mg [Prunus persica] Length = 378 Score = 342 bits (877), Expect = 4e-91 Identities = 190/379 (50%), Positives = 248/379 (65%), Gaps = 25/379 (6%) Frame = +1 Query: 532 MSDGGGENGRETEHDG--ERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIV 705 MSDGGGE+GRE E +RK+ GD+F+RAIA++AVAQ+CE GFQ++ SALE LS++ Sbjct: 1 MSDGGGESGREHEQHNRTQRKSSGDDFARAIAKIAVAQVCEIVGFQTYQLSALETLSDVA 60 Query: 706 IRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDI 846 + Y+ ++GKTA+FYANL+GR CNVFDIIQ LEDLG +Q + +++I Sbjct: 61 VHYIHNIGKTAHFYANLSGRMDCNVFDIIQGLEDLGLAQGFAGASDVDHCLASSGTVREI 120 Query: 847 IQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTP 1026 QYV E IPF+ +IP+FPV+++ K PSFLQ G G+HIP WLPAFP+ HTYV +P Sbjct: 121 AQYVGETEHIPFSYSIPQFPVVKDRKLTPSFLQSGVETLGEHIPIWLPAFPEPHTYVPSP 180 Query: 1027 VWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNN 1206 + NER + TD I L +LQ+RL CN E P+ +D + K K ++N Sbjct: 181 ISNERARELHTDMIEQKKKQRNVERSLFNLQRRLVCNGLEGPS-IDPGDADKAKQARESN 239 Query: 1207 PFLTPPLKFGEKEVSPVVLPAKLLNETNV-----ENRV-----SVLETFAPAIEAAKHGL 1356 PFL PL++GE EVS V LPAKL +E V ENRV SVLETFAPAIEA K Sbjct: 240 PFLAAPLQYGETEVSHVALPAKLSSEATVEKLVAENRVAEKCSSVLETFAPAIEAMKSSS 299 Query: 1357 CDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKR 1536 C+S + +++L ++R TV FK GI K T L S +KG + SWFGR+ EKD+KK+ Sbjct: 300 CESQEEHKEILLSRRPTVQFKIGIAKTSFSTMLHSSPHNKGFQKNYSWFGRENEKDEKKK 359 Query: 1537 RAEQILKASMENPHDLAQL 1593 RAE+ILK SMEN +LAQL Sbjct: 360 RAEKILKNSMENSQELAQL 378 >ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Populus trichocarpa] gi|222848349|gb|EEE85896.1| hypothetical protein POPTR_0004s11520g [Populus trichocarpa] Length = 394 Score = 339 bits (870), Expect = 2e-90 Identities = 188/379 (49%), Positives = 240/379 (63%), Gaps = 28/379 (7%) Frame = +1 Query: 541 GGGENGRETE---HDGERKT--GGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIV 705 GGGE+GR E H+G+RK+ GD F+RAI 
++AVAQ+CE+ GFQSF +SALE L+++ Sbjct: 16 GGGESGRLHEKVGHNGKRKSRASGDEFARAIGKIAVAQMCESMGFQSFQQSALETLTDVT 75 Query: 706 IRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDI 846 Y+R++GK A ANLAGRT NVFD+IQ LE+LG Q + ++++I Sbjct: 76 TWYIRNIGKAAQLCANLAGRTEGNVFDVIQGLEELGLPQGFAGASDVDHCLASSGIVREI 135 Query: 847 IQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTP 1026 QY+ A++IPFA +IP FPV R KP PSF QIGE P +HIP WLPAFPD TY P Sbjct: 136 AQYIGDADDIPFAYSIPPFPVARERKPAPSFSQIGEEPPEEHIPAWLPAFPDPQTYAQLP 195 Query: 1027 VWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNN 1206 NE D D I ++L Q+ CN SE P+ V S K +N Sbjct: 196 EGNEGRADLNADNIESVRQHQKMDVSYMNLPQQFNCNGSEGPSSVAFGDSAKATQRTVSN 255 Query: 1207 PFLTPPLKFGEKEVSPVVLPAKLLNETNV----------ENRVSVLETFAPAIEAAKHGL 1356 PFL PL+FG KEVS VV PAKL +E V +N +SV++TFAPAIEA K L Sbjct: 256 PFLAAPLQFGVKEVSHVVPPAKLSDEAAVRYPVEQTRTMDNNMSVMKTFAPAIEAMKSRL 315 Query: 1357 CDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKR 1536 CDSG+G++KV N+R V FK G+GK L A DLSLQ+KG+ ++ W G+D E DD+KR Sbjct: 316 CDSGEGQKKVFFNQRPAVQFKIGVGKNSLDGAPDLSLQNKGIKKISMWSGKDSENDDQKR 375 Query: 1537 RAEQILKASMENPHDLAQL 1593 RAE+ILK SMENP +LAQL Sbjct: 376 RAEKILKQSMENPGELAQL 394 >ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|566213067|ref|XP_006373367.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|222866906|gb|EEF04037.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|550320186|gb|ERP51164.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] Length = 382 Score = 338 bits (867), Expect = 5e-90 Identities = 187/382 (48%), Positives = 240/382 (62%), Gaps = 28/382 (7%) Frame = +1 Query: 532 MSDGGGENGR---ETEHDGERKT--GGDNFSRAIARVAVAQICENNGFQSFHESALEALS 696 MS GGGE+GR + G+RK+ GD F+RAIA++AVAQ+CE GFQSF +SALE LS Sbjct: 1 MSHGGGESGRLHDKAGDSGKRKSRVSGDEFTRAIAKIAVAQMCETVGFQSFQQSALEKLS 60 Query: 697 NIVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VM 837 ++ Y+R+LGKTA FYANLAGRT NVFD+IQ 
+E+LG SQ + ++ Sbjct: 61 DVTTWYIRNLGKTAQFYANLAGRTEGNVFDVIQGMEELGLSQGFAGASNVDHCLASSGIV 120 Query: 838 QDIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYV 1017 ++I+QY+ AE+IPF +IP FPV R KP+PSF QI E +HIP WLPAFPD T+V Sbjct: 121 REIVQYIGDAEDIPFVYSIPPFPVARERKPVPSFFQICEESPAEHIPAWLPAFPDPQTHV 180 Query: 1018 HTPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVV 1197 P NE + DKI ++L Q CN S P+ V S + Sbjct: 181 QLPAGNEGDAVFNADKIEPARHHLKMDMSSMNLPQHFTCNGSGGPSSVTFGNSARATQGT 240 Query: 1198 KNNPFLTPPLKFGEKEVSPVVLPAKLLNETNV----------ENRVSVLETFAPAIEAAK 1347 ++NPFL PL+FGEKEVS +V PA+L +E V +N +SVLETFAPAIEA K Sbjct: 241 ESNPFLAAPLQFGEKEVSHLVPPARLSDEAAVRYPVEQNRIMDNHISVLETFAPAIEAMK 300 Query: 1348 HGLCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDD 1527 CDS +G++KVL N+R V FK +GK L A DLS Q G+ ++ WFG+D E DD Sbjct: 301 SRFCDSEEGQKKVLLNQRPAVQFKIQVGKNSLAGAPDLSPQKIGIEKISKWFGKDSENDD 360 Query: 1528 KKRRAEQILKASMENPHDLAQL 1593 KKRRAE+ILK SMENP +L +L Sbjct: 361 KKRRAEKILKQSMENPSELGEL 382 >ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215115 [Cucumis sativus] Length = 376 Score = 328 bits (841), Expect = 5e-87 Identities = 180/380 (47%), Positives = 241/380 (63%), Gaps = 26/380 (6%) Frame = +1 Query: 532 MSDGGGENGRETEHDGERKT-GGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVI 708 MSDGGGE+G+ E RK G ++F RA+A++AVAQICE+ GFQ F +SALE L+++ + Sbjct: 1 MSDGGGESGKVHERPKTRKNLGSEDFPRALAKIAVAQICESEGFQIFQQSALETLADVAV 60 Query: 709 RYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQ-------------YVSVMQDII 849 RY++++G TANF AN AGRT CN+FDIIQ LEDLGS Q S +++ Sbjct: 61 RYVQNMGSTANFCANFAGRTECNLFDIIQALEDLGSVQGFAGASDIEHCLASSSTVKEFA 120 Query: 850 QYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPV 1029 +YV+ AEE+PFA ++P+FPV++ K PSFLQIGE P G+HIP WLPA PD TY+ +P+ Sbjct: 121 RYVAQAEEVPFAYSVPKFPVVKERKLRPSFLQIGEEPPGEHIPSWLPALPDPETYIESPI 180 Query: 1030 WNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVKNNP 1209 E +P+T K +LQQ L CN E D R + K + ++NP Sbjct: 181 
VKEEVVEPQTIK-TEPEKQCRTEKSFWNLQQWLFCNGLEGSQREDPRNAAMTKQIQESNP 239 Query: 1210 FLTPPLKFGEKEVSPVVLPAKLLNETN------------VENRVSVLETFAPAIEAAKHG 1353 FL PPL+FGEKEVS +VLP K+LN ++ V+ VSVLETFAPAIE+ K+ Sbjct: 240 FLAPPLQFGEKEVSSIVLPDKVLNNSSTEYHVPVMENCQVDTHVSVLETFAPAIESIKNN 299 Query: 1354 LCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKK 1533 S E K N++ TV FK G GKK G ++L + GV + SWF ++EKDDKK Sbjct: 300 FHMS---EEKYSLNRKSTVQFKIGTGKKAAGNMIELRALNNGVKKSSSWFVGEDEKDDKK 356 Query: 1534 RRAEQILKASMENPHDLAQL 1593 R+AE+ILK SMEN ++L+ L Sbjct: 357 RKAEKILKDSMENSNELSHL 376 >ref|XP_002519508.1| conserved hypothetical protein [Ricinus communis] gi|223541371|gb|EEF42922.1| conserved hypothetical protein [Ricinus communis] Length = 356 Score = 323 bits (828), Expect = 2e-85 Identities = 182/362 (50%), Positives = 228/362 (62%), Gaps = 15/362 (4%) Frame = +1 Query: 553 NGRETEHDGERKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSNIVIRYLRDLGK 732 NG E RK D+F RA++R+AVAQICE+ GF ESAL++L+ + IRY+ DLGK Sbjct: 3 NGDEESTSARRKA--DDFGRAVSRMAVAQICESVGFHGCKESALDSLTEVAIRYIIDLGK 60 Query: 733 TANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQDIIQYVSVAEE 873 AN +ANL+GRT CN+FDI++ ED+G+ S +++II++V EE Sbjct: 61 IANSHANLSGRTQCNLFDIVRGFEDVGAPLGFSGASNSGNCVVCSGTVKEIIEFVESTEE 120 Query: 874 IPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVHTPVWNERETDP 1053 IPFA+ +P FPV+R+ + IPSFL +GE P GKHIP WLPA PD HTYVHTP+WNER DP Sbjct: 121 IPFAQPVPPFPVVRDKRLIPSFLNMGEIPPGKHIPAWLPALPDPHTYVHTPMWNERVVDP 180 Query: 1054 RTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPAL-VDDRVSGKEKWVVKNNPFLTPPLK 1230 R +KI LLSLQQRL N S + V +E V ++N FL PLK Sbjct: 181 RAEKIEQARQRRKAERALLSLQQRLLSNGSAGASTSVASNHYVQELGVGESNRFLARPLK 240 Query: 1231 FGEKEVSPVVLPAKLLNETNVENRVSVLETFAPAIEAAK-HGLCDSGDGERKVLPNKRHT 1407 GEK VS VV+P KL + V +++ F PAIEAAK G D + ERK+LP KR Sbjct: 241 PGEKAVSTVVVPDKL------KTSVPLIKAFEPAIEAAKGGGFADDEESERKLLPEKRPA 294 Query: 1408 VHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDKKRRAEQILKASMENPHDLA 1587 V+FKF GKK LG LDLSL K G G W G +E+DDKKRRAE IL+ SMENP +L Sbjct: 295 
VNFKFKTGKKMLGEPLDLSLSRKSGGTAGHWLGPVDERDDKKRRAEYILRQSMENPQELT 354 Query: 1588 QL 1593 QL Sbjct: 355 QL 356 >ref|XP_003552582.1| PREDICTED: transcription initiation factor TFIID subunit 8-like isoform X1 [Glycine max] Length = 381 Score = 322 bits (825), Expect = 4e-85 Identities = 175/381 (45%), Positives = 240/381 (62%), Gaps = 27/381 (7%) Frame = +1 Query: 532 MSDGGGENGRETEHDG---ERKTGG-DNFSRAIARVAVAQICENNGFQSFHESALEALSN 699 MS+GGG+ GR+ E G RK GG D+++RAIA++AVAQ+CE GFQ+F +SALEALS+ Sbjct: 1 MSNGGGKTGRQLEQPGTWRRRKVGGGDDYARAIAKIAVAQVCEGEGFQAFQQSALEALSD 60 Query: 700 IVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQ 840 +V+RY+ ++GK+A+ +ANL+GRT CN FD+IQ LED+GS Q + V++ Sbjct: 61 VVVRYILNVGKSAHCHANLSGRTECNAFDVIQGLEDMGSVQGFAGAADVDHCLESSGVIR 120 Query: 841 DIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVH 1020 +I+ +V+ AE + FA IPRFPV++ P PSFLQ GE P G+HIP WLPAFPD TY Sbjct: 121 EIVHFVNDAEPVMFAHPIPRFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDPQTYSQ 180 Query: 1021 TPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVK 1200 +P N R T+PR K L+LQQ++ N+ E A +D + ++ + Sbjct: 181 SPAVNGRGTEPRAVKFDQERESGKGEWPALNLQQQMVSNMFEKSASIDPADAKAKRVAAE 240 Query: 1201 NNPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRV----------SVLETFAPAIEAAKH 1350 NPFL PLK +KEV+ V PAKL N+ ++N V S LETFAPAIEA K Sbjct: 241 GNPFLAAPLKIEDKEVASVPPPAKLFNDEALDNPVVENLVENEPISALETFAPAIEAMKS 300 Query: 1351 GLCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDK 1530 +CDS + + K N++ TV FK GI K LG ++ L Q + + WF ++EKDD+ Sbjct: 301 TICDSKEDQTKFCANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHEKTLPWFAMEDEKDDR 360 Query: 1531 KRRAEQILKASMENPHDLAQL 1593 KRRAE+IL+ S+ENP L QL Sbjct: 361 KRRAEKILRESLENPDQLVQL 381 >ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292232 [Fragaria vesca subsp. 
vesca] Length = 379 Score = 317 bits (813), Expect = 9e-84 Identities = 183/382 (47%), Positives = 241/382 (63%), Gaps = 28/382 (7%) Frame = +1 Query: 532 MSDGGGENGRETEHDGE----RKTGGDNFSRAIARVAVAQICENNGFQSFHESALEALSN 699 MSDGGGE+ RE E + + GD+F+RA++++AVAQ+CE G+QSF SALE LS+ Sbjct: 1 MSDGGGESAREHEQSNRITLRKPSCGDDFARAVSKIAVAQVCEVVGYQSFQLSALETLSD 60 Query: 700 IVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQ 840 + ++Y+R++GKTA+ YANL+GRT CNVFDIIQ LEDL ++Q + ++ Sbjct: 61 VAVQYIRNVGKTAHLYANLSGRTDCNVFDIIQGLEDLSAAQGFAGASDINHCLASSGTIK 120 Query: 841 DIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVH 1020 +I QYV+ AE +PFA TIPRFPV+++ K PSF Q GE G+HIP WLPAFP+ HTY Sbjct: 121 EISQYVAEAEHVPFAYTIPRFPVVKDRKLTPSFWQSGEETPGEHIPTWLPAFPEPHTYSR 180 Query: 1021 TPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPAL-VDDRVSGKEKWVV 1197 + NE T+P + + +L+ RL CN E P+L D V+ K+ Sbjct: 181 STTCNEGATEPDSALVEQEKQQRNVERAMLNFHHRLVCNGMEGPSLDPGDGVNAKQ--AR 238 Query: 1198 KNNPFLTPPLKFGEKEVSPVVLPAKLLNET-----NVENRV-----SVLETFAPAIEAAK 1347 ++NPFL PL+FGE EVS V LPAKL E EN SVLETFAPAIEA K Sbjct: 239 ESNPFLATPLQFGETEVSQVTLPAKLSIEATEETLKAENHAKDKCSSVLETFAPAIEAIK 298 Query: 1348 HGLCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDD 1527 + + + ++K L +++ TV FK G+ KK LGT L KG EV WFGR+ EKD+ Sbjct: 299 NKPFEV-EEDQKTLLSRKPTVQFKIGMSKKSLGTMLYSGPHKKGFEEVYPWFGRENEKDE 357 Query: 1528 KKRRAEQILKASMENPHDLAQL 1593 KKRRAE+ILK SMEN +LAQL Sbjct: 358 KKRRAEKILKNSMENSQELAQL 379 >ref|XP_003531863.1| PREDICTED: transcription initiation factor TFIID subunit 8-like isoform 1 [Glycine max] Length = 381 Score = 314 bits (805), Expect = 8e-83 Identities = 171/381 (44%), Positives = 238/381 (62%), Gaps = 27/381 (7%) Frame = +1 Query: 532 MSDGGGENGRETEHDG---ERKTGG-DNFSRAIARVAVAQICENNGFQSFHESALEALSN 699 MS+GGG+ GR+ E G RK GG D+++RAIA++AVAQ+CE+ GFQ+F +SALEALS+ Sbjct: 1 MSNGGGKTGRQLEQPGTWGRRKVGGGDDYARAIAKIAVAQVCESEGFQAFQQSALEALSD 60 Query: 700 IVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQ 840 +V RY+ ++GK+A+ +ANL+GRT 
C+ FD+IQ LED+GS Q + V++ Sbjct: 61 VVARYILNVGKSAHCHANLSGRTECHAFDVIQGLEDMGSVQGFAGASDVDHCLESSGVIR 120 Query: 841 DIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVH 1020 +I+ +V+ AE + FA IP+FPV++ P PSFLQ GE P G+HIP WLPAFPD TY Sbjct: 121 EIVHFVNDAEPVMFAHPIPQFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDLQTYSE 180 Query: 1021 TPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVK 1200 +PV N R T+PR K ++ QQ++ N+ E AL+D + ++ + Sbjct: 181 SPVVNGRGTEPRAVKFDQERENGKGEWPAMNFQQQMVSNMFEKSALIDPADAKAKRVAAE 240 Query: 1201 NNPFLTPPLKFGEKEVSPVVLPAKLLNETNVENRV----------SVLETFAPAIEAAKH 1350 NPFL PLK +KEV+ V PAKL N+ ++N V S +ETFAPAIEA K Sbjct: 241 GNPFLAAPLKIEDKEVASVPPPAKLFNDVALDNPVVENFVENEPISAMETFAPAIEAMKS 300 Query: 1351 GLCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDK 1530 CDS + + K N++ TV FK GI K LG ++ L Q + WF ++ KDD+ Sbjct: 301 TCCDSNEDQTKFRANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHKNTLPWFAMEDGKDDR 360 Query: 1531 KRRAEQILKASMENPHDLAQL 1593 KRRAE+IL+ S+ENP L QL Sbjct: 361 KRRAEKILRESLENPDQLVQL 381 >ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus communis] gi|223533005|gb|EEF34770.1| tbp-associated factor taf, putative [Ricinus communis] Length = 379 Score = 314 bits (804), Expect = 1e-82 Identities = 182/381 (47%), Positives = 235/381 (61%), Gaps = 27/381 (7%) Frame = +1 Query: 532 MSDGGGENGRETEHD--GERKTG--GDNFSRAIARVAVAQICENNGFQSFHESALEALSN 699 MS GGG++GR E +RK+G GD F+R+IA++AVAQICE GFQ+F +SALE LS+ Sbjct: 1 MSHGGGQSGRVQEKSQLAKRKSGSSGDEFARSIAKIAVAQICECTGFQTFQQSALETLSD 60 Query: 700 IVIRYLRDLGKTANFYANLAGRTHCNVFDIIQVLEDLGSSQYVS-------------VMQ 840 + +RY+ +LGK A AN AGR N FDIIQ LE+L SSQ + +++ Sbjct: 61 VTVRYICNLGKLAQGNANSAGRIEGNAFDIIQALEELCSSQGFASASDVDHCIASSGIVR 120 Query: 841 DIIQYVSVAEEIPFARTIPRFPVLRNPKPIPSFLQIGETPSGKHIPPWLPAFPDSHTYVH 1020 DI QYVS A+++PFA +IP FP++R K P F QIGE P +HIP WLPAFPD Y+ Sbjct: 121 DIAQYVSDADDVPFAYSIPPFPIVRERKLAPIFSQIGEKPPWEHIPDWLPAFPDPQIYLQ 180 Query: 1021 TPVWNERETDPRTDKIXXXXXXXXXXXXLLSLQQRLACNVSESPALVDDRVSGKEKWVVK 1200 +P NE TD K LL QQ + 
S+ P+ + K +V+ Sbjct: 181 SPTVNEGATDLNMQKFEPARLHPKIDRSLL--QQPFTSSGSQGPSSNVPAGGYEGKLIVE 238 Query: 1201 NNPFLTPPLKFGEKEVSPVVLPAKLLNETNV----------ENRVSVLETFAPAIEAAKH 1350 NPF+ PL+ GEKEVS VV PAKL NET V +N VSVL TFAPAI+A Sbjct: 239 GNPFVAAPLQCGEKEVSHVVPPAKLSNETAVRNPIEHNRLADNHVSVLNTFAPAIKAMNS 298 Query: 1351 GLCDSGDGERKVLPNKRHTVHFKFGIGKKPLGTALDLSLQDKGVGEVGSWFGRDEEKDDK 1530 LCDS +G++KVL N+R + FK IGKK L T+L+L Q+K ++ W +D E DDK Sbjct: 299 RLCDSEEGQKKVLLNQRPAIQFKIAIGKKSLRTSLELGSQNKSAEKISPWSEKDNENDDK 358 Query: 1531 KRRAEQILKASMENPHDLAQL 1593 KRRAE+ILK S+ENP +LAQL Sbjct: 359 KRRAEKILKQSIENPGELAQL 379