BLASTX nr result
ID: Mentha26_contig00020867
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00020867 (1327 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU36365.1| hypothetical protein MIMGU_mgv1a002134mg [Mimulus...  456   e-125
ref|XP_002283285.2| PREDICTED: uncharacterized protein LOC100252...  367   8e-99
ref|XP_006433616.1| hypothetical protein CICLE_v10000177mg [Citr...  345   3e-92
ref|XP_006472283.1| PREDICTED: transcription initiation factor T...  343   1e-91
ref|XP_006362063.1| PREDICTED: transcription initiation factor T...  327   9e-87
ref|XP_004238100.1| PREDICTED: uncharacterized protein LOC101262...  326   2e-86
ref|XP_007210386.1| hypothetical protein PRUPE_ppa001063mg [Prun...  320   7e-85
ref|XP_002520510.1| transcription initiation factor, putative [R...  318   4e-84
ref|XP_007018537.1| TBP-associated factor 4, putative isoform 2 ...  313   8e-83
ref|XP_007018536.1| TBP-associated factor 4, putative isoform 1 ...  313   8e-83
ref|XP_002510115.1| transcription initiation factor, putative [R...  311   3e-82
ref|XP_002320699.1| hypothetical protein POPTR_0014s01830g [Popu...  291   3e-76
emb|CBI19420.3| unnamed protein product [Vitis vinifera]             291   6e-76
gb|EXB38469.1| Transcription initiation factor TFIID subunit 4B ...  290   8e-76
ref|XP_004300119.1| PREDICTED: uncharacterized protein LOC101295...  285   2e-74
ref|XP_006581260.1| PREDICTED: transcription initiation factor T...  268   3e-69
ref|XP_003527732.1| PREDICTED: transcription initiation factor T...  268   3e-69
ref|XP_007018538.1| TBP-associated factor 4, putative isoform 3,...  268   4e-69
ref|XP_007160898.1| hypothetical protein PHAVU_001G026300g [Phas...
267 7e-69 gb|EPS64696.1| hypothetical protein M569_10084, partial [Genlise... 265 3e-68 >gb|EYU36365.1| hypothetical protein MIMGU_mgv1a002134mg [Mimulus guttatus] Length = 709 Score = 456 bits (1173), Expect = e-125 Identities = 259/449 (57%), Positives = 296/449 (65%), Gaps = 7/449 (1%) Frame = +1 Query: 1 QAARNSQTAPNQSQTQPQASGRQMQMPS-AQLPTDLSNSISDNNAAKSHEMERQSNSHGA 177 QA RN+QTA NQ Q+QPQ S RQMQ+ S AQ+ TDLS+S D+N AKS E+E Q+ S G Sbjct: 58 QANRNAQTASNQFQSQPQISARQMQVASSAQMATDLSSSTGDSNTAKSREVESQAESQGG 117 Query: 178 LVGQMXXXXXXXXXHERMHPAFQAQGLNKQQHMQFSQTSFPTYGSAGSGYTQFPTNSAAS 357 QM ER HP+F GLN QQHM F QTSFP+YGS G+GY+ F +AAS Sbjct: 118 QASQMSSSGSGALIQERKHPSFPTHGLNNQQHMHFPQTSFPSYGSGGTGYSPFSATNAAS 177 Query: 358 SAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLT 537 S RPQ +H N AVNH+G R MN+TNM FDRPHSLSD KK+ GS+ Sbjct: 178 STPLRPQAQ--------AHQNSAVNHMGPTPRAMNMTNMPKFDRPHSLSDHKKMQPGSMA 229 Query: 538 YVNS-NTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXX 711 ++NS N LQQNQ QWP+SASKEQK+ + S+++VKQEP DQ E Q + Sbjct: 230 HMNSSNNALQQNQVQWPASASKEQKSGAASSMSHVKQEPVDQPNEQQHRAQLSSSHGLSS 289 Query: 712 XXXXXXKLGST-APGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX---METNILSSSR 879 K GS APGN KDE+FE+H SR G METN S SR Sbjct: 290 LSPALNKQGSVVAPGNFKDESFEMHLSRTGFAPPTSAVPTNSVPSSIPSPMETNTQSVSR 349 Query: 880 MSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXXFADQSIEHLNDVT 1059 M SLT P+GPGN +KAPPKKPL GQKKPMEA F DQSIEHLNDVT Sbjct: 350 MPSLTNPIGPGN-TKAPPKKPLIGQKKPMEAPGSSPPSSKKQKVSGGFLDQSIEHLNDVT 408 Query: 1060 AVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLK 1239 AVSGVNLREEEEQLFS KEDSRVSEASRRVVQEEEERLIL + PLQKKM E+MAK GLK Sbjct: 409 AVSGVNLREEEEQLFSAAKEDSRVSEASRRVVQEEEERLILNKTPLQKKMVELMAKKGLK 468 Query: 1240 NMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 NMS+DV RCLSLCVEER+RG+I NV+RLS Sbjct: 469 NMSSDVERCLSLCVEERLRGIIFNVVRLS 497 >ref|XP_002283285.2| PREDICTED: uncharacterized protein LOC100252311 [Vitis vinifera] Length = 922 Score = 367 bits (941), Expect = 8e-99 Identities = 222/459 (48%), Positives 
= 275/459 (59%), Gaps = 19/459 (4%) Frame = +1 Query: 7 ARNSQTAPNQSQTQPQASGRQMQ---------MPSA--QLPTDLSNSISDNNAAKSHEME 153 A N QT P+Q Q Q QAS Q MPS+ ++ TD S ++ N+ K EME Sbjct: 257 AWNYQTGPSQFQLQSQASALQQHLKTPSNSSHMPSSAMKVQTDSSYPTTETNSQKPREME 316 Query: 154 RQSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGY 327 RQS+SHG QM ER H QG NKQQ H+ FSQT F YGSAG Y Sbjct: 317 RQSDSHGMQGSQMSSSSLSSAKQEREHSVMPMQGPNKQQQQHLHFSQTPFTMYGSAGGNY 376 Query: 328 TQFP-TNSAASSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLS 504 + TN S+ +T+ QPHDSQMRQ P H N+ +G S+ MN ++ F+R S++ Sbjct: 377 HSYTGTNVNTSATSTKQQPHDSQMRQVPLHQNIGSTQMGGTSQAMNPMSVPKFERQSSVN 436 Query: 505 DPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQ-KX 681 DPK++ GSL + ++++ LQQ+ W SS +KEQ + S+ VKQEP+DQ E Q K Sbjct: 437 DPKRVQGGSLPHPSNSSTLQQSSVPWQSSTNKEQIS----SMAYVKQEPADQTNEQQQKS 492 Query: 682 XXXXXXXXXXXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX---M 852 + G+ PG +KDE+ E +SR G + Sbjct: 493 QLSTPQSLSSFPAVQVEKGNAIPGILKDESLEKQASRIGFSSSMSMLPPNSVSSSMGTHL 552 Query: 853 ETNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FAD 1029 + N+ SR+ S+T+PVG N++ PPKKP GQKKP+EA F D Sbjct: 553 DPNVTLGSRIPSVTSPVGI--NTRTPPKKPSIGQKKPLEALGSSPPLPSKKQKVSGAFLD 610 Query: 1030 QSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKM 1209 QSIE LNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQ+ PLQKK+ Sbjct: 611 QSIEQLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQKAPLQKKL 670 Query: 1210 AEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 AEIMA+C LKN+SNDV RCLSLCVEER+RG ISN+IRLS Sbjct: 671 AEIMARCSLKNISNDVERCLSLCVEERLRGFISNLIRLS 709 >ref|XP_006433616.1| hypothetical protein CICLE_v10000177mg [Citrus clementina] gi|557535738|gb|ESR46856.1| hypothetical protein CICLE_v10000177mg [Citrus clementina] Length = 954 Score = 345 bits (885), Expect = 3e-92 Identities = 220/480 (45%), Positives = 271/480 (56%), Gaps = 43/480 (8%) Frame = +1 Query: 16 SQTAPNQSQTQPQASGRQMQ--MPSAQL--------------------PTD--------- 102 SQ +Q +Q QAS RQ Q MPSA PTD Sbjct: 267 
SQMGSHQFPSQSQASARQQQLRMPSASAAASQFSDTHSFAQVNQKSNSPTDPIHGPASSA 326 Query: 103 -----LSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLNKQ 267 S I +N+A KS E+E QS SHG Q+ ER + QGLNKQ Sbjct: 327 HVQVGSSYPIKENSAQKSRELEHQSASHGIHGSQISSSTPSTVNQERERSSV-VQGLNKQ 385 Query: 268 Q--HMQFSQTSFPTYGSAGSGYTQFP-TNSAASSAATRPQPHDSQMRQGPSHPNLAVNHL 438 Q H+ F QTSF YGS + Y + TN ++ +PQPHDS MRQ H ++ L Sbjct: 386 QQQHLHFPQTSFSMYGSGSNSYHPYSGTNVNNPGSSLKPQPHDSAMRQITHHQSMGSTPL 445 Query: 439 GSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKTAN 618 G S+PMNV N+ F++ ++++DP K+ GS++ SN+ LQQ+ W +SA+KEQ + + Sbjct: 446 GGASQPMNVMNVPKFEKQNNMNDPGKVQGGSISQFTSNSTLQQSSVPWQASANKEQSSGS 505 Query: 619 SPSITNVKQEPSDQGIELQKXXXXXXXXXXXXXXXXXKLGSTAPGNMKDEAFEIHSSRAG 798 PS+ VK EP DQG + + + GST PG +KDEA E S R G Sbjct: 506 LPSMAYVKPEPIDQGTD--QPYKLHSSTPQGFSVAQVEPGSTVPGTLKDEASEKQSPRMG 563 Query: 799 XXXXXXXXXXXXXXXXX---METNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKKPME 969 +++N LSS RM ++T+P G N++ PPKKP QKKP+E Sbjct: 564 FSASTSIVPSNSVSPSTTTLLDSNALSS-RMPAVTSPAGV--NARTPPKKPSVSQKKPVE 620 Query: 970 AXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASR 1146 F+DQSIE LNDVTAVSGVNLREEEEQLFSG KEDSRVSEASR Sbjct: 621 PPGSSPPMPSKKQKVSGAFSDQSIEQLNDVTAVSGVNLREEEEQLFSGTKEDSRVSEASR 680 Query: 1147 RVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 RVVQEEEERLILQ+ PLQKK+AEIM KCGLKNMSNDV RCLSLCVEERMRGL+ N+IRLS Sbjct: 681 RVVQEEEERLILQKNPLQKKLAEIMVKCGLKNMSNDVERCLSLCVEERMRGLLCNLIRLS 740 >ref|XP_006472283.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like [Citrus sinensis] Length = 955 Score = 343 bits (879), Expect = 1e-91 Identities = 205/422 (48%), Positives = 255/422 (60%), Gaps = 7/422 (1%) Frame = +1 Query: 82 SAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLN 261 SA + S I +N+A KS E+E QS SHG Q+ ER + QGLN Sbjct: 325 SAHVQVGSSYPIKENSAQKSRELEHQSASHGIHGSQISSSTPSTVNQERERSSV-VQGLN 383 Query: 262 KQQ--HMQFSQTSFPTYGSAGSGYTQFP-TNSAASSAATRPQPHDSQMRQGPSHPNLAVN 432 KQQ H+ F QTSF YGS + Y + TN ++ +PQPHDS 
MRQ H ++ Sbjct: 384 KQQQQHLHFPQTSFSMYGSGSNSYHPYSGTNVNNPGSSLKPQPHDSAMRQITHHQSMGST 443 Query: 433 HLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKT 612 LG S+PMNV N+ F++ ++++DP K+ GS++ SN+ LQQ+ W +SA+KEQ + Sbjct: 444 PLGGASQPMNVMNVPKFEKQNNMNDPGKMQGGSISQFTSNSTLQQSSVPWQASANKEQSS 503 Query: 613 ANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXXXXXXXKLGSTAPGNMKDEAFEIHSSR 792 + PS+ VK EP DQG + + + GST PG +KDEA E S R Sbjct: 504 GSLPSMAYVKPEPIDQGTD--QPYKLHSSTPQGFSVAQVEPGSTVPGTLKDEASEKQSPR 561 Query: 793 AGXXXXXXXXXXXXXXXXX---METNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKKP 963 G +++N LSS RM ++T+P G N++ PPKKP QKKP Sbjct: 562 MGFSASTSIVPSNSVSPSTTTLLDSNALSS-RMPAVTSPAGV--NARTPPKKPSVSQKKP 618 Query: 964 MEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEA 1140 +E F+DQSIE LNDVTAVSGVNLREEEEQLFSG KEDSRVSEA Sbjct: 619 VEPPGSSPPMPSKKQKVSGAFSDQSIEQLNDVTAVSGVNLREEEEQLFSGTKEDSRVSEA 678 Query: 1141 SRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVIR 1320 SRRVVQEEEERLILQ+ PLQKK+AEIM KCGLKNMSNDV RCLSLCVEERMRGL+ N+IR Sbjct: 679 SRRVVQEEEERLILQKNPLQKKLAEIMVKCGLKNMSNDVERCLSLCVEERMRGLLCNLIR 738 Query: 1321 LS 1326 LS Sbjct: 739 LS 740 >ref|XP_006362063.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like [Solanum tuberosum] Length = 934 Score = 327 bits (837), Expect = 9e-87 Identities = 203/450 (45%), Positives = 262/450 (58%), Gaps = 8/450 (1%) Frame = +1 Query: 1 QAARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGAL 180 QA++NSQ+ P Q Q QAS +Q + A D SN ++ A K HE+E Q++ GA Sbjct: 277 QASKNSQSVPGQFP-QSQASQQQHSLMPAD---DSSNMAIESKAQKLHEVENQADLRGAQ 332 Query: 181 VGQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSAA 354 QM ER H F QGLN+QQ H+ FSQ SFPT+ +AG+ Y+ + ++ Sbjct: 333 GSQMPSSGLTSVKQERDHTPFPIQGLNRQQQQHLHFSQASFPTFANAGNNYSAYSASNVN 392 Query: 355 SSAAT--RPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAG 528 SS + Q D+QMRQ N G ++ M + + F++ ++ + K++ G Sbjct: 393 SSTTQPLKQQSDDAQMRQISVQQNRNATQFGVPTQAMGIMSAPKFEKQNTFGEAKRLPGG 452 Query: 529 
SLTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXX 708 L ++S + +QQ QW SA+KEQK+ S +TN K EP D + Sbjct: 453 GLN-ISSTSRIQQTSVQWQPSANKEQKSILSSPMTNPKPEPIDHFHD-----QLHRSQLS 506 Query: 709 XXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX---METNILSSSR 879 G++ + +DE+ E +SR G M+T+ L +SR Sbjct: 507 PFSSVQVDQGNSTSESSRDESIE-QTSRIGLSSTTSMKPSNSASSSMSSHMDTSTLLTSR 565 Query: 880 MSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDV 1056 S+T+P+G GNN K P KKP GQKKP++ F DQSIE LNDV Sbjct: 566 TLSVTSPLGLGNNGKIPVKKPSIGQKKPLDVLGSSPPPSGKKQKVSGGFLDQSIEQLNDV 625 Query: 1057 TAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGL 1236 TAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQ+IPLQKK+AEIMAKCGL Sbjct: 626 TAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQKIPLQKKLAEIMAKCGL 685 Query: 1237 KNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 KNMS+DV RCLSLCVEERMRGLIS++IRLS Sbjct: 686 KNMSSDVERCLSLCVEERMRGLISSLIRLS 715 >ref|XP_004238100.1| PREDICTED: uncharacterized protein LOC101262209 [Solanum lycopersicum] Length = 934 Score = 326 bits (835), Expect = 2e-86 Identities = 204/451 (45%), Positives = 265/451 (58%), Gaps = 9/451 (1%) Frame = +1 Query: 1 QAARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGAL 180 QA++NSQ+ P Q Q QAS +Q + A D SN ++ A K HE+E Q++ GA Sbjct: 277 QASKNSQSVPGQFP-QSQASQQQHSLMPAD---DSSNMAIESKAQKLHEVENQADLRGAQ 332 Query: 181 VGQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSAA 354 QM ER H F QGLN+QQ H+ FSQ SFPT+ +AG+ Y+ + ++ Sbjct: 333 GSQMSSSSLTAVKQERDHTPFPIQGLNRQQQQHLHFSQASFPTFANAGNNYSAYSASNVN 392 Query: 355 SSAAT--RPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAG 528 SS + Q D+QMRQ N G ++ M + + F++ ++ + K++ G Sbjct: 393 SSTTQPLKQQSDDAQMRQISVQQNRNATQFGVPAQAMGIMSAPKFEKQNTFGEAKRLPGG 452 Query: 529 SLTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQ-GIELQKXXXXXXXXX 705 L ++S + +QQ QW SA+KEQK+ S +TN K EP D +LQ+ Sbjct: 453 GLN-MSSTSRIQQTSVQWQPSANKEQKSILSSPMTNPKPEPIDHFHDQLQRSQLSPFSSV 511 Query: 706 XXXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX---METNILSSS 876 G++ + +DE+ E 
+SR G M+T+ L +S Sbjct: 512 QVDQ------GNSTSESSRDESIE-QTSRIGLSSTTSMKPSNSASSSMSSHMDTSTLLTS 564 Query: 877 RMSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLND 1053 R S+T+P+G GNN K P KKP GQKKP++A F DQSIE LND Sbjct: 565 RTLSVTSPLGLGNNGKTPVKKPSIGQKKPLDALGSSPPPSGKKQKVSGGFLDQSIEQLND 624 Query: 1054 VTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCG 1233 VTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQ+IPLQKK+ EIMAKCG Sbjct: 625 VTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQKIPLQKKLTEIMAKCG 684 Query: 1234 LKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 LK+MS+DV RCLSLCVEERMRGLIS++IRLS Sbjct: 685 LKSMSSDVERCLSLCVEERMRGLISSLIRLS 715 >ref|XP_007210386.1| hypothetical protein PRUPE_ppa001063mg [Prunus persica] gi|462406121|gb|EMJ11585.1| hypothetical protein PRUPE_ppa001063mg [Prunus persica] Length = 920 Score = 320 bits (821), Expect = 7e-85 Identities = 199/448 (44%), Positives = 252/448 (56%), Gaps = 16/448 (3%) Frame = +1 Query: 31 NQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXX 210 +Q P + + Q+ +D S+S+ +N+A K E ER S+SHG V QM Sbjct: 262 SQRGANPPTDPSHIPSSAVQVQSDSSHSVIENSAKKLREAERPSDSHGMQVSQMPSSSAV 321 Query: 211 XXXHERMHPAFQAQGLNKQQHMQ---FSQTSFPTYGSAGSGYTQFP-TNSAASSAATRPQ 378 ER + Q LNKQQ Q + Q+SF YGS G Y + T+ S+ + Q Sbjct: 322 AGNQERERSSGPPQILNKQQQQQQLHYPQSSFAMYGSTGGNYHPYSGTSINTSTLPLKQQ 381 Query: 379 PHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTG 558 PHDSQ+RQ P H + G + +N+TN+S +R +SL+DP ++ GS+++ +N+ Sbjct: 382 PHDSQLRQIPQHQGMGSTQSGGEPQGVNITNVSKLERQNSLNDPSRLQGGSVSHFTNNSN 441 Query: 559 LQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXXXXXXXXKL 735 LQQN SS +KEQ S++ VKQEP DQ E Q K + Sbjct: 442 LQQNSVPRQSS-NKEQNPGPVSSMSYVKQEPIDQTAEQQQKPPLSNQQGLPSASAAQLEQ 500 Query: 736 GSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX----------METNILSSSRMS 885 GS PG DE+ E SSR G ++TN+ R+ Sbjct: 501 GSALPGISTDESIEKQSSRMGFATSGMVTSSSTGTVPPNSVSPSIMTQVDTNVSLGHRIP 560 Query: 886 SLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTA 1062 S TA G +++APPKKP GQKKP+E F DQSIE 
LNDVTA Sbjct: 561 SGTA----GISNRAPPKKPSIGQKKPLEVPGSSPPPSSKKQKLSGNFLDQSIEQLNDVTA 616 Query: 1063 VSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKN 1242 VSGVNLREEEEQLFSGPKEDSR SEASR+ VQEEEERLILQ+ PLQKK+AEIM KCGLK+ Sbjct: 617 VSGVNLREEEEQLFSGPKEDSRASEASRKFVQEEEERLILQKAPLQKKLAEIMVKCGLKS 676 Query: 1243 MSNDVSRCLSLCVEERMRGLISNVIRLS 1326 +SNDV RCLSLCVEERMRGLI+N+IRLS Sbjct: 677 ISNDVERCLSLCVEERMRGLINNLIRLS 704 >ref|XP_002520510.1| transcription initiation factor, putative [Ricinus communis] gi|223540352|gb|EEF41923.1| transcription initiation factor, putative [Ricinus communis] Length = 927 Score = 318 bits (814), Expect = 4e-84 Identities = 201/460 (43%), Positives = 256/460 (55%), Gaps = 19/460 (4%) Frame = +1 Query: 4 AARNSQTAPNQSQTQPQASGRQ--MQMP-------SAQLPTDLSNSISDNNAAKSHEMER 156 A SQ NQSQ Q QA Q ++MP + Q D S ++NNA KS E+E Sbjct: 221 AQLQSQPGSNQSQLQSQAFAPQHNVRMPISATVSSAVQAQADSSCPSAENNAQKSQEVEC 280 Query: 157 QSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLNKQQ-----HMQFSQTSFPTYGSAGS 321 Q NSHG V Q+ + G NKQQ H+ F Q SFP YG+ Sbjct: 281 QPNSHGMQVSQLSSSSTRALSQDSNCSLISVPGHNKQQQQEQQHLHFPQNSFPMYGNNSG 340 Query: 322 GYTQFP-TNSAASSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHS 498 + + TN S ++ RPQ HD QMR+ SHP +G ++ M++ +S F+RP+S Sbjct: 341 THRPYSGTNFNTSGSSMRPQSHDLQMRK-ISHPTTGATQIGGSAQAMDMIKVSKFERPNS 399 Query: 499 LSDPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQK 678 +DP K+ +GS + + LQ N A W +KEQK++ PS VKQEP +Q E + Sbjct: 400 GTDPNKVQSGSAAQYTNKSALQPNSAPWQPPTNKEQKSSPFPSKNYVKQEPVEQATEQHQ 459 Query: 679 XXXXXXXXXXXXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX--- 849 + G+ N+K ++ E SS+ G Sbjct: 460 KSQLSNPQDLSAAPV--EQGNAVTSNLKVDSLEKQSSKVGISIPSSMVPSSSVSTSIATR 517 Query: 850 METNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXXFA- 1026 ++ I S++ S+ A PG N++ PPKKPL GQKKP+EA + Sbjct: 518 LDPIIQVGSQIQSIAA--SPGVNARTPPKKPLIGQKKPLEALGSSPPMSSKKQKISVASS 575 Query: 1027 DQSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKK 1206 DQSIE LNDVTAVSGVNLREEEEQLFSG 
KEDSRVSEASRRVVQEEEERLILQ+ PLQKK Sbjct: 576 DQSIEQLNDVTAVSGVNLREEEEQLFSGSKEDSRVSEASRRVVQEEEERLILQKTPLQKK 635 Query: 1207 MAEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 +AEIMAKCGLK +++DV RCLSLCVEERMRGL+S +IRLS Sbjct: 636 VAEIMAKCGLKYINSDVERCLSLCVEERMRGLVSTLIRLS 675 >ref|XP_007018537.1| TBP-associated factor 4, putative isoform 2 [Theobroma cacao] gi|508723865|gb|EOY15762.1| TBP-associated factor 4, putative isoform 2 [Theobroma cacao] Length = 944 Score = 313 bits (803), Expect = 8e-83 Identities = 203/447 (45%), Positives = 255/447 (57%), Gaps = 7/447 (1%) Frame = +1 Query: 7 ARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSH-GALV 183 A+ Q PN T +A P+ + T+ S S ++N A KS EM+RQS+S G L Sbjct: 306 AQLQQKGPNSPATPSRAPS-----PAVPMQTNSSYSSTENKAPKSQEMDRQSDSRFGVLG 360 Query: 184 GQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSA-A 354 Q+ ER + QGLNKQQ H+ F QTSF +GS S Y + S A Sbjct: 361 SQISSFSTTTVNQERDRSSIPVQGLNKQQQQHLNFPQTSFSMHGS--SSYHPYSGPSVNA 418 Query: 355 SSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSL 534 S ++ +PQPHDSQMRQ H ++ N +G ++ MNV + F+R +S +DP ++ GSL Sbjct: 419 SGSSLKPQPHDSQMRQTALHQSMGSNPVGGPTQAMNVMSGPKFERQNSSNDPNRLQGGSL 478 Query: 535 TYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXX 714 ++ ++++ W +S+SKE S+T VKQE DQG E Q Sbjct: 479 SHFSNSS------VPWQASSSKETNPGPLSSVTYVKQESVDQGAEHQHKPHLSASQGLPT 532 Query: 715 XXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX--METNILSSSRMSS 888 + G+ KDE E SSR G +++N+L SR S Sbjct: 533 ALG--EQGNAVTTTPKDEPLEKQSSRIGFSTPNSMVPPNSVSPITTQVDSNVLLGSRNPS 590 Query: 889 LTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAV 1065 + P G NS+ P KKP GQKKP+E F DQSIE LNDVTAV Sbjct: 591 V--PSLAGANSRTPQKKPSVGQKKPLETLGSSPPPSSKKQKVSGAFLDQSIEQLNDVTAV 648 Query: 1066 SGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNM 1245 SGVNLREEEEQLFSGPK+DSRVSEASRRVVQEEEERLILQ+ PLQKK+AEIMAK GLKN+ Sbjct: 649 SGVNLREEEEQLFSGPKDDSRVSEASRRVVQEEEERLILQKTPLQKKLAEIMAKSGLKNI 708 Query: 1246 SNDVSRCLSLCVEERMRGLISNVIRLS 1326 SNDV 
RC+SLCVEERMRGLI N+IRLS Sbjct: 709 SNDVERCVSLCVEERMRGLICNLIRLS 735 >ref|XP_007018536.1| TBP-associated factor 4, putative isoform 1 [Theobroma cacao] gi|508723864|gb|EOY15761.1| TBP-associated factor 4, putative isoform 1 [Theobroma cacao] Length = 950 Score = 313 bits (803), Expect = 8e-83 Identities = 203/447 (45%), Positives = 255/447 (57%), Gaps = 7/447 (1%) Frame = +1 Query: 7 ARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSH-GALV 183 A+ Q PN T +A P+ + T+ S S ++N A KS EM+RQS+S G L Sbjct: 306 AQLQQKGPNSPATPSRAPS-----PAVPMQTNSSYSSTENKAPKSQEMDRQSDSRFGVLG 360 Query: 184 GQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSA-A 354 Q+ ER + QGLNKQQ H+ F QTSF +GS S Y + S A Sbjct: 361 SQISSFSTTTVNQERDRSSIPVQGLNKQQQQHLNFPQTSFSMHGS--SSYHPYSGPSVNA 418 Query: 355 SSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSL 534 S ++ +PQPHDSQMRQ H ++ N +G ++ MNV + F+R +S +DP ++ GSL Sbjct: 419 SGSSLKPQPHDSQMRQTALHQSMGSNPVGGPTQAMNVMSGPKFERQNSSNDPNRLQGGSL 478 Query: 535 TYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXX 714 ++ ++++ W +S+SKE S+T VKQE DQG E Q Sbjct: 479 SHFSNSS------VPWQASSSKETNPGPLSSVTYVKQESVDQGAEHQHKPHLSASQGLPT 532 Query: 715 XXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX--METNILSSSRMSS 888 + G+ KDE E SSR G +++N+L SR S Sbjct: 533 ALG--EQGNAVTTTPKDEPLEKQSSRIGFSTPNSMVPPNSVSPITTQVDSNVLLGSRNPS 590 Query: 889 LTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAV 1065 + P G NS+ P KKP GQKKP+E F DQSIE LNDVTAV Sbjct: 591 V--PSLAGANSRTPQKKPSVGQKKPLETLGSSPPPSSKKQKVSGAFLDQSIEQLNDVTAV 648 Query: 1066 SGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNM 1245 SGVNLREEEEQLFSGPK+DSRVSEASRRVVQEEEERLILQ+ PLQKK+AEIMAK GLKN+ Sbjct: 649 SGVNLREEEEQLFSGPKDDSRVSEASRRVVQEEEERLILQKTPLQKKLAEIMAKSGLKNI 708 Query: 1246 SNDVSRCLSLCVEERMRGLISNVIRLS 1326 SNDV RC+SLCVEERMRGLI N+IRLS Sbjct: 709 SNDVERCVSLCVEERMRGLICNLIRLS 735 >ref|XP_002510115.1| transcription initiation factor, putative [Ricinus communis] gi|223550816|gb|EEF52302.1| transcription initiation 
factor, putative [Ricinus communis] Length = 925 Score = 311 bits (798), Expect = 3e-82 Identities = 197/453 (43%), Positives = 255/453 (56%), Gaps = 16/453 (3%) Frame = +1 Query: 16 SQTAPNQSQTQPQASGRQ--MQMP-------SAQLPTDLSNSISDNNAAKSHEMERQSNS 168 SQ QSQ Q QA GRQ ++MP + Q+ D S ++ NA + +E +S Sbjct: 261 SQQGSRQSQLQSQAFGRQHNVRMPVSATASSAVQVLADSSYPPAEGNAHRPRGVEHLPDS 320 Query: 169 HGALVGQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSA-GSGYTQFP 339 HG Q +R + G +KQQ H+ F Q SF TYGS+ G+ + Sbjct: 321 HGMQASQFSSPSTSTLSQDRERSSISVPGHSKQQQQHLHFPQNSFSTYGSSSGTHHPYSG 380 Query: 340 TNSAASSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKI 519 TN S ++ + QPHD QMRQ SH +A +G + +N+ ++S F+RP+S+SDP ++ Sbjct: 381 TNINTSGSSMKTQPHDLQMRQ-ISHSTMASTQIGGSTPTLNMVHVSKFERPNSVSDPSRV 439 Query: 520 AAGSLTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXX 699 +GS++ N+ + L QN W + +KEQ + PS VKQEP +Q + Q+ Sbjct: 440 QSGSMSQYNNKSALPQNSIPWQAPTNKEQTSPLFPSTNYVKQEPLEQATDQQQKPQLSNP 499 Query: 700 XXXXXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXM---ETNILS 870 + G+ P N K+++ E SS+ G + NI + Sbjct: 500 QGLSAAPG--EQGNAVPVNSKEDSLEKPSSKVGFSNPSTAVPSNSVSPSIAIQPDPNIQA 557 Query: 871 SSRMSSLTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHL 1047 R S A VG N++ P KK GQKKP+EA F DQSIE L Sbjct: 558 GPRFPSGAASVGV--NARTPTKKLSIGQKKPLEALGSSPPMSSKKQKVSGAFLDQSIEQL 615 Query: 1048 NDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAK 1227 NDVTAVSGVNLREEEEQLFSG KEDSRVSEASRRVVQEEEERLILQ+ PLQKK+AEIM K Sbjct: 616 NDVTAVSGVNLREEEEQLFSGSKEDSRVSEASRRVVQEEEERLILQKTPLQKKLAEIMVK 675 Query: 1228 CGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 CGLKN++NDV RCLSLCVEERMRGLIS +IRLS Sbjct: 676 CGLKNINNDVERCLSLCVEERMRGLISTLIRLS 708 >ref|XP_002320699.1| hypothetical protein POPTR_0014s01830g [Populus trichocarpa] gi|222861472|gb|EEE99014.1| hypothetical protein POPTR_0014s01830g [Populus trichocarpa] Length = 875 Score = 291 bits (746), Expect = 3e-76 Identities = 185/431 (42%), Positives = 236/431 (54%), Gaps = 12/431 (2%) Frame = +1 Query: 70 
MQMPSAQLPTDLSNS--------ISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHE 225 +++ +AQL + SN+ S N+ KS +E + +S Q E Sbjct: 254 LRLAAAQLQSQASNAWAIQLQTDSSIVNSQKSKAVEWKPDSLVMQASQSHSSNASISNQE 313 Query: 226 RMHPAFQAQGLNKQQ-HMQFSQTSFPTYGSAGSGYTQFP-TNSAASSAATRPQPHDSQMR 399 R + QG NKQQ H+ F TSFP YGS+G Y + TN + S + +PQPHD Q R Sbjct: 314 RERSSISMQGQNKQQQHVNFPPTSFPMYGSSGGNYHPYSGTNVSTSGPSVKPQPHDPQTR 373 Query: 400 QGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQ 579 Q P H NL V +G M T F+R +S DP ++ +GS+++ + + LQQN A Sbjct: 374 QIPHHQNLGVTQIGGPMHSMIST--PKFERQNSADDPSRVHSGSVSHYTNKSALQQNSAP 431 Query: 580 WPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXXXXXXXKLG-STAPGN 756 W + +++E+ A+ S+ VK +Q E Q K+ ST P N Sbjct: 432 WQAPSNREKSPASFSSLNYVKPGLLEQAGEQQNKPQLSSPQDQSLDKQSTKIVFSTVPPN 491 Query: 757 MKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSLTAPVGPGNNSKAPPK 936 + M+ N + SR+SS+ +P G N++ PPK Sbjct: 492 SAPPSIATQ----------------------MDPNGQAGSRISSVASPAGV--NARTPPK 527 Query: 937 KPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAVSGVNLREEEEQLFSGP 1113 KP GQKKP EA F+DQSIE LNDVTAVSGVNLREEEEQLFSGP Sbjct: 528 KPSVGQKKPFEALGSSPPASTKKHKVSGAFSDQSIEQLNDVTAVSGVNLREEEEQLFSGP 587 Query: 1114 KEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERM 1293 KEDSRVSEASRR VQEEEERL+LQ+ PL+KK+ EIMAKCGLKN DV RCLSLCVEERM Sbjct: 588 KEDSRVSEASRRFVQEEEERLMLQKTPLKKKLGEIMAKCGLKNFGTDVERCLSLCVEERM 647 Query: 1294 RGLISNVIRLS 1326 RGLISN+IRLS Sbjct: 648 RGLISNMIRLS 658 >emb|CBI19420.3| unnamed protein product [Vitis vinifera] Length = 882 Score = 291 bits (744), Expect = 6e-76 Identities = 184/433 (42%), Positives = 235/433 (54%), Gaps = 2/433 (0%) Frame = +1 Query: 34 QSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXX 213 + Q+ P A M + ++ TD S ++ N+ K EMERQS+SHG QM Sbjct: 303 KGQSTP-ADSSHMPSSAMKVQTDSSYPTTETNSQKPREMERQSDSHGMQGSQMSSSSLSS 361 Query: 214 XXHERMHPAFQAQGLNKQQHMQFSQTSFPTYGSAGSGYTQFP-TNSAASSAATRPQPHDS 390 ER H T F YGSAG Y + TN S+ +T+ QPHDS Sbjct: 362 AKQEREH-----------------STPFTMYGSAGGNYHSYTGTNVNTSATSTKQQPHDS 
404 Query: 391 QMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQN 570 QMRQ P H N+ +G S+ MN ++ F+R S++DPK++ GSL + ++++ LQQ+ Sbjct: 405 QMRQVPLHQNIGSTQMGGTSQAMNPMSVPKFERQSSVNDPKRVQGGSLPHPSNSSTLQQS 464 Query: 571 QAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXXXXXXXKLGSTAP 750 Q S S +P +++++ S G S P Sbjct: 465 SQQQKSQLS-------TPQNESLEKQASRIGFSSSM--------------------SMLP 497 Query: 751 GNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSLTAPVGPGNNSKAP 930 N + H ++ N+ SR+ S+T+PVG N++ P Sbjct: 498 PNSVSSSMGTH----------------------LDPNVTLGSRIPSVTSPVGI--NTRTP 533 Query: 931 PKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAVSGVNLREEEEQLFS 1107 PKKP GQKKP+EA F DQSIE LNDVTAVSGVNLREEEEQLFS Sbjct: 534 PKKPSIGQKKPLEALGSSPPLPSKKQKVSGAFLDQSIEQLNDVTAVSGVNLREEEEQLFS 593 Query: 1108 GPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEE 1287 GPKEDSRVSEASRRVVQEEEERLILQ+ PLQKK+AEIMA+C LKN+SNDV RCLSLCVEE Sbjct: 594 GPKEDSRVSEASRRVVQEEEERLILQKAPLQKKLAEIMARCSLKNISNDVERCLSLCVEE 653 Query: 1288 RMRGLISNVIRLS 1326 R+RG ISN+IRLS Sbjct: 654 RLRGFISNLIRLS 666 >gb|EXB38469.1| Transcription initiation factor TFIID subunit 4B [Morus notabilis] Length = 961 Score = 290 bits (743), Expect = 8e-76 Identities = 193/454 (42%), Positives = 244/454 (53%), Gaps = 24/454 (5%) Frame = +1 Query: 37 SQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXX 216 SQ S Q+ D+S+ S +++ QS SHG QM Sbjct: 293 SQFTDPRSFAQVHQKGTSTSADVSHVPSSVGQVQTNPS--QSASHGLQASQMPSSGAGAT 350 Query: 217 XHERMHPAFQAQGLNKQQHMQ---FSQTSFPTYG-SAGSGYTQFPTNSAASSAATRPQPH 384 ER QGLNKQQ Q F QTSF YG ++G+ + TN S+ + QPH Sbjct: 351 NQERD----SMQGLNKQQQQQQLHFPQTSFGMYGGNSGNIHLYSGTNVNTSTLPLKLQPH 406 Query: 385 DSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQ 564 D+Q+R P H ++ LG ++ N+ + ++ +S++DP ++ GSL++ SN+ Q Sbjct: 407 DTQIRPIPQHQSVGSAQLGGETQGSNMLGLPKLEKQNSINDPSRMHIGSLSHFASNSANQ 466 Query: 565 QNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXXXXXXXXKLGS 741 Q A W S +K+Q S + +K EP DQ IELQ K + G+ Sbjct: 467 
QKPAPWQPSTNKDQTAGPLSSTSYIKPEPVDQAIELQHKPSPPNSQGLPSVSAVQIEHGN 526 Query: 742 TAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXMET------NILSSSRMSSL--TA 897 + G KDE+ E H SR G + N +SS+ L Sbjct: 527 MSSGTSKDESTEKHHSRMGFPTSASIVPSSSTSIVPSSSTSMAPHNTISSNMSMQLGPNI 586 Query: 898 PVGP---------GNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHL 1047 P+GP G N+K PPKKP GQKKP+EA F DQSIE L Sbjct: 587 PLGPRAPIGTPPVGTNNKTPPKKPSVGQKKPLEALGSSPPPAGKKQKVSGNFLDQSIEQL 646 Query: 1048 NDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEI-MA 1224 NDVTAVSGVNLREEEEQLFSGPKEDSRVSEASR+VVQEEEERLILQ+ PLQKK+AEI + Sbjct: 647 NDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRKVVQEEEERLILQKTPLQKKLAEITVV 706 Query: 1225 KCGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 KCGLK++SNDV RCLSLCVEERMRGLI N+IRLS Sbjct: 707 KCGLKSISNDVERCLSLCVEERMRGLIDNLIRLS 740 >ref|XP_004300119.1| PREDICTED: uncharacterized protein LOC101295421 [Fragaria vesca subsp. vesca] Length = 958 Score = 285 bits (730), Expect = 2e-74 Identities = 189/458 (41%), Positives = 248/458 (54%), Gaps = 16/458 (3%) Frame = +1 Query: 1 QAARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSHGAL 180 Q N T P+ T P + TD S+S +N+A K E ERQS+ HG Sbjct: 300 QRGVNPSTGPSHITTVP-------------VQTDSSHSAIENSAKKLREAERQSDPHGMQ 346 Query: 181 VGQMXXXXXXXXXHERMHPAFQAQ-GLNKQQH-MQFSQTSFPTYGSAGSGYTQFPTNSAA 354 + QM ER + Q N+QQH + + Q++F YGS G Y +P + Sbjct: 347 INQMSSSSTGASNQERDRSSVPMQVHSNQQQHQLHYPQSTFAMYGSTGGNYHPYPGTNV- 405 Query: 355 SSAATRPQPHDSQMRQGPSHPNL-AVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGS 531 S+ + QPHDS +R P H + + +G ++ N+ ++ +R +S++DP + GS Sbjct: 406 STMPIKQQPHDSHLRPIPQHQGMGSAQSVGGETQGTNIMSVPKLERQNSVNDPGRQQGGS 465 Query: 532 LTYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQ-KXXXXXXXXXX 708 L + +++ LQQ+Q W SS +KEQ + S S+ VKQEP DQ E Q K Sbjct: 466 LPHFTNSSTLQQHQIPWQSS-NKEQISGPSSSMAYVKQEPIDQSAEQQHKTPLSNNQRLP 524 Query: 709 XXXXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXMETNI--LSSSRM 882 + S +PG DE+ E SSR G + +SS+ M Sbjct: 525 YASSLQLEQISASPGVSMDESLEKQSSRMGFSSAGPPGSMVISSSTSTGPPLTPISSTTM 584 Query: 883 
SSLTAPVGP--------GNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX--FADQ 1032 + +G G N++ P KK GQKKP EA F+DQ Sbjct: 585 TQADPNLGSKIPSGTPAGTNNRIPAKKTSVGQKKPSEALGSPPPPSSGKKQKVSGAFSDQ 644 Query: 1033 SIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMA 1212 SIE LNDVTAVSGVNLREEEEQLFSGPK+DSR SEASRRVVQEEEERLILQ+ PLQKK+A Sbjct: 645 SIEQLNDVTAVSGVNLREEEEQLFSGPKDDSRASEASRRVVQEEEERLILQKTPLQKKLA 704 Query: 1213 EIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVIRLS 1326 EIM + GLK++S+DV RCLSLCVEERMRGLI+N+IRLS Sbjct: 705 EIMFRSGLKSISHDVERCLSLCVEERMRGLINNLIRLS 742 >ref|XP_006581260.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like isoform X4 [Glycine max] gi|571458910|ref|XP_006581261.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like isoform X5 [Glycine max] Length = 929 Score = 268 bits (686), Expect = 3e-69 Identities = 170/423 (40%), Positives = 239/423 (56%), Gaps = 8/423 (1%) Frame = +1 Query: 82 SAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLN 261 + Q+ + + D NA KS E++ Q S GA + Q+ E + QGLN Sbjct: 310 AVQVKNEPTYPTMDINAKKSRELDVQVESQGAQLNQLPSSSSNAVSQETERSSLHLQGLN 369 Query: 262 K--QQHMQFSQTSFPTYGSAGSGYTQFPTNSAASSAATRPQPHDSQMRQGPSHPNLAVNH 435 K QQH+ F YG++G Y F ++++S+++ RPQP DS MRQ P H +++ N Sbjct: 370 KEQQQHLHFPSA----YGNSGGNYNPFSGSTSSSTSSIRPQPFDSHMRQIP-HQSISPNQ 424 Query: 436 LGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKTA 615 LG ++ + ++ D+ +S +DPK++ G ++ V +NT QQ W SA+KEQ + Sbjct: 425 LGGSTQ--GLIGLTKLDQQNSFNDPKRMPGGFVSPVANNTTSQQTSNSWQPSANKEQSSG 482 Query: 616 NSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXXXXXXXXKLGSTA-PGNMKDE---AFEI 780 + S+ VK+EP+D E Q + + GS+A G +K+E F Sbjct: 483 SFSSVPYVKKEPNDLSTEQQHRHNLSKLHGLHSVNSVQNEQGSSANQGTLKEEFSRGFPA 542 Query: 781 HSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKK 960 +S ++ + S ++ S T+ + N++ P KKP GQKK Sbjct: 543 STSMPHTTSSLLPLNSASPSVSQLDPSATLSPQIPSNTSVI----NARTPLKKPSPGQKK 598 Query: 961 PMEAXXXXXXXXXXXXXXXXFA-DQSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSE 1137 P+EA + + SIE LNDVTAVSGV+LREEEEQLFSGPKEDSR SE 
Sbjct: 599 PIEALGSSPPPPSKKQKVSGASLEPSIEQLNDVTAVSGVDLREEEEQLFSGPKEDSRASE 658 Query: 1138 ASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVI 1317 ASRRVVQEEEE LILQ+ PLQ+K+ EI+ +CGLK +SND+ RCLSLCVEERMRG+ISNVI Sbjct: 659 ASRRVVQEEEESLILQKAPLQRKLIEIINECGLKGVSNDLERCLSLCVEERMRGVISNVI 718 Query: 1318 RLS 1326 R+S Sbjct: 719 RMS 721 >ref|XP_003527732.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like isoform X1 [Glycine max] gi|571458904|ref|XP_006581258.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like isoform X2 [Glycine max] gi|571458906|ref|XP_006581259.1| PREDICTED: transcription initiation factor TFIID subunit 4b-like isoform X3 [Glycine max] Length = 933 Score = 268 bits (686), Expect = 3e-69 Identities = 170/423 (40%), Positives = 239/423 (56%), Gaps = 8/423 (1%) Frame = +1 Query: 82 SAQLPTDLSNSISDNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHERMHPAFQAQGLN 261 + Q+ + + D NA KS E++ Q S GA + Q+ E + QGLN Sbjct: 310 AVQVKNEPTYPTMDINAKKSRELDVQVESQGAQLNQLPSSSSNAVSQETERSSLHLQGLN 369 Query: 262 K--QQHMQFSQTSFPTYGSAGSGYTQFPTNSAASSAATRPQPHDSQMRQGPSHPNLAVNH 435 K QQH+ F YG++G Y F ++++S+++ RPQP DS MRQ P H +++ N Sbjct: 370 KEQQQHLHFPSA----YGNSGGNYNPFSGSTSSSTSSIRPQPFDSHMRQIP-HQSISPNQ 424 Query: 436 LGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQWPSSASKEQKTA 615 LG ++ + ++ D+ +S +DPK++ G ++ V +NT QQ W SA+KEQ + Sbjct: 425 LGGSTQ--GLIGLTKLDQQNSFNDPKRMPGGFVSPVANNTTSQQTSNSWQPSANKEQSSG 482 Query: 616 NSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXXXXXXXXKLGSTA-PGNMKDE---AFEI 780 + S+ VK+EP+D E Q + + GS+A G +K+E F Sbjct: 483 SFSSVPYVKKEPNDLSTEQQHRHNLSKLHGLHSVNSVQNEQGSSANQGTLKEEFSRGFPA 542 Query: 781 HSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSLTAPVGPGNNSKAPPKKPLAGQKK 960 +S ++ + S ++ S T+ + N++ P KKP GQKK Sbjct: 543 STSMPHTTSSLLPLNSASPSVSQLDPSATLSPQIPSNTSVI----NARTPLKKPSPGQKK 598 Query: 961 PMEAXXXXXXXXXXXXXXXXFA-DQSIEHLNDVTAVSGVNLREEEEQLFSGPKEDSRVSE 1137 P+EA + + SIE LNDVTAVSGV+LREEEEQLFSGPKEDSR SE Sbjct: 599 PIEALGSSPPPPSKKQKVSGASLEPSIEQLNDVTAVSGVDLREEEEQLFSGPKEDSRASE 658 
Query: 1138 ASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERMRGLISNVI 1317 ASRRVVQEEEE LILQ+ PLQ+K+ EI+ +CGLK +SND+ RCLSLCVEERMRG+ISNVI Sbjct: 659 ASRRVVQEEEESLILQKAPLQRKLIEIINECGLKGVSNDLERCLSLCVEERMRGVISNVI 718 Query: 1318 RLS 1326 R+S Sbjct: 719 RMS 721 >ref|XP_007018538.1| TBP-associated factor 4, putative isoform 3, partial [Theobroma cacao] gi|508723866|gb|EOY15763.1| TBP-associated factor 4, putative isoform 3, partial [Theobroma cacao] Length = 707 Score = 268 bits (685), Expect = 4e-69 Identities = 180/420 (42%), Positives = 230/420 (54%), Gaps = 7/420 (1%) Frame = +1 Query: 7 ARNSQTAPNQSQTQPQASGRQMQMPSAQLPTDLSNSISDNNAAKSHEMERQSNSH-GALV 183 A+ Q PN T +A P+ + T+ S S ++N A KS EM+RQS+S G L Sbjct: 305 AQLQQKGPNSPATPSRAPS-----PAVPMQTNSSYSSTENKAPKSQEMDRQSDSRFGVLG 359 Query: 184 GQMXXXXXXXXXHERMHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSA-A 354 Q+ ER + QGLNKQQ H+ F QTSF +GS S Y + S A Sbjct: 360 SQISSFSTTTVNQERDRSSIPVQGLNKQQQQHLNFPQTSFSMHGS--SSYHPYSGPSVNA 417 Query: 355 SSAATRPQPHDSQMRQGPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSL 534 S ++ +PQPHDSQMRQ H ++ N +G ++ MNV + F+R +S +DP ++ GSL Sbjct: 418 SGSSLKPQPHDSQMRQTALHQSMGSNPVGGPTQAMNVMSGPKFERQNSSNDPNRLQGGSL 477 Query: 535 TYVNSNTGLQQNQAQWPSSASKEQKTANSPSITNVKQEPSDQGIELQKXXXXXXXXXXXX 714 ++ ++++ W +S+SKE S+T VKQE DQG E Q Sbjct: 478 SHFSNSS------VPWQASSSKETNPGPLSSVTYVKQESVDQGAEHQHKPHLSASQGLPT 531 Query: 715 XXXXXKLGSTAPGNMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXX--METNILSSSRMSS 888 + G+ KDE E SSR G +++N+L SR S Sbjct: 532 ALG--EQGNAVTTTPKDEPLEKQSSRIGFSTPNSMVPPNSVSPITTQVDSNVLLGSRNPS 589 Query: 889 LTAPVGPGNNSKAPPKKPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAV 1065 + P G NS+ P KKP GQKKP+E F DQSIE LNDVTAV Sbjct: 590 V--PSLAGANSRTPQKKPSVGQKKPLETLGSSPPPSSKKQKVSGAFLDQSIEQLNDVTAV 647 Query: 1066 SGVNLREEEEQLFSGPKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNM 1245 SGVNLREEEEQLFSGPK+DSRVSEASRRVVQEEEERLILQ+ PLQKK+AEIMAK GLKN+ Sbjct: 648 SGVNLREEEEQLFSGPKDDSRVSEASRRVVQEEEERLILQKTPLQKKLAEIMAKSGLKNI 707 >ref|XP_007160898.1| hypothetical protein 
PHAVU_001G026300g [Phaseolus vulgaris] gi|561034362|gb|ESW32892.1| hypothetical protein PHAVU_001G026300g [Phaseolus vulgaris] Length = 935 Score = 267 bits (683), Expect = 7e-69 Identities = 173/431 (40%), Positives = 233/431 (54%), Gaps = 11/431 (2%) Frame = +1 Query: 67 QMQMPSAQLPTDLSNSIS------DNNAAKSHEMERQSNSHGALVGQMXXXXXXXXXHER 228 QM S + D S S D+NA KS E + + S G Q+ E Sbjct: 295 QMHQRSMNVAVDQSRLSSSAGQTMDSNARKSQEFDVKIESQGLQPNQLTSSSSNTVGQET 354 Query: 229 MHPAFQAQGLNKQQ--HMQFSQTSFPTYGSAGSGYTQFPTNSAASSAATRPQPHDSQMRQ 402 + QGLNKQQ H+ F+ PTYG++G Y + +++SS++ + Q HDS M Q Sbjct: 355 ERTSVHIQGLNKQQQHHLHFA----PTYGNSGGNYNPYSGATSSSSSSIKLQSHDSHMSQ 410 Query: 403 GPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQW 582 P H ++ NHLG + ++VT M ++ +S +DPK++ GS++ +NT QQ W Sbjct: 411 IP-HQSIGSNHLGGSTHGLSVTGMPKVEQQNSFNDPKRLPGGSVSSSINNTASQQTSTAW 469 Query: 583 PSSASKEQKTANSPSITNVKQEPSDQGIELQ-KXXXXXXXXXXXXXXXXXKLGSTAPGNM 759 SS +KEQ S++ VK+EP+D E Q + + + G + Sbjct: 470 QSSTNKEQNLGLMSSVSYVKKEPTDLSTEQQNRHNLSKLHGYSSVNSAQLEQSGASQGTL 529 Query: 760 KDEAFE-IHSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSLTAPVGPGNNSKAPPK 936 KD+ + +S + T++ SS +SS G ++ K Sbjct: 530 KDDFSRGLPASTNMPPTTSTGLLPHSSGSSSIMTHLDSSVPLSSQVPSNASGIVARTSFK 589 Query: 937 KPLAGQKKPMEAXXXXXXXXXXXXXXXX-FADQSIEHLNDVTAVSGVNLREEEEQLFSGP 1113 K QKKP+EA + +QSIE LNDVTAVSGV+LREEEEQLFSGP Sbjct: 590 KSAVTQKKPLEALGSSPPPSSKKQKTSGGYVEQSIEQLNDVTAVSGVDLREEEEQLFSGP 649 Query: 1114 KEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEERM 1293 KEDSRVSEASR+ VQEEEERLILQ+ PLQKK+ +IMAK GLK MSNDV +CLSL VEERM Sbjct: 650 KEDSRVSEASRKAVQEEEERLILQKAPLQKKLIDIMAKSGLKGMSNDVEKCLSLSVEERM 709 Query: 1294 RGLISNVIRLS 1326 RGLISN+IR+S Sbjct: 710 RGLISNLIRIS 720 >gb|EPS64696.1| hypothetical protein M569_10084, partial [Genlisea aurea] Length = 633 Score = 265 bits (678), Expect = 3e-68 Identities = 170/372 (45%), Positives = 207/372 (55%), Gaps = 4/372 (1%) Frame = +1 Query: 223 ERMHPAFQAQGLNKQQHMQFSQTSFPTYGSAGSGYTQFPTNSAASSAATRPQPHDSQMRQ 402 +R HP F + GLNKQQ 
F +G+AGSG+ P+D QMR Sbjct: 122 DRRHPNFPSPGLNKQQLPSIPPVPFTQFGAAGSGHL----------------PYDPQMRP 165 Query: 403 GPSHPNLAVNHLGSISRPMNVTNMSTFDRPHSLSDPKKIAAGSLTYVNSNTGLQQNQAQW 582 SHP + VN LG+ ++PM + NMS F+ P G+ T NSNT +QQNQ QW Sbjct: 166 A-SHPGVNVNSLGAATQPMTMMNMSKFEMPK--------VQGAQT--NSNTSIQQNQVQW 214 Query: 583 PSSASKEQKTANSPSITNVKQEPSDQG---IELQKXXXXXXXXXXXXXXXXXKLGSTAPG 753 P + + K S+++VKQEP DQ ++Q + S+AP Sbjct: 215 PPG-NNDPKMGVVTSMSHVKQEPLDQSAASFDVQPSRVSLMPHTSITNSVPPPVPSSAPV 273 Query: 754 NMKDEAFEIHSSRAGXXXXXXXXXXXXXXXXXMETNILSSSRMSSL-TAPVGPGNNSKAP 930 ++ S+SR SL T P GPGN+SK+ Sbjct: 274 SLIS---------------------------------FSNSRAPSLSTNPPGPGNSSKST 300 Query: 931 PKKPLAGQKKPMEAXXXXXXXXXXXXXXXXFADQSIEHLNDVTAVSGVNLREEEEQLFSG 1110 PKK + GQKKPME F+DQSIEHLNDVTAVSGVNLREEEEQLFSG Sbjct: 301 PKKVVVGQKKPMEPPGSSPPSSKKQKVSGGFSDQSIEHLNDVTAVSGVNLREEEEQLFSG 360 Query: 1111 PKEDSRVSEASRRVVQEEEERLILQRIPLQKKMAEIMAKCGLKNMSNDVSRCLSLCVEER 1290 KED+RVSEASRRVVQEEEERLILQ++PLQKK+ EI+AK GLKNM NDV R LSLCVEER Sbjct: 361 SKEDARVSEASRRVVQEEEERLILQKVPLQKKVVEILAKFGLKNMGNDVERFLSLCVEER 420 Query: 1291 MRGLISNVIRLS 1326 +R LISN I+LS Sbjct: 421 LRSLISNSIKLS 432