BLASTX nr result
ID: Catharanthus23_contig00002943
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002943
         (1454 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_006347058.1| PREDICTED: general transcription factor IIH ...   524   0.0
ref|XP_004232855.1| PREDICTED: general transcription factor IIH ...   527   0.0
emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]   496   e-172
ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus commun...   495   e-166
ref|XP_004134713.1| PREDICTED: general transcription factor IIH ...   486   e-165
gb|EPS66674.1| hypothetical protein M569_08097 [Genlisea aurea]       500   e-165
gb|EOY28279.1| Transcription factor-related isoform 1 [Theobroma...   475   e-163
gb|EOY28280.1| Transcription factor-related isoform 2 [Theobroma...   475   e-163
ref|XP_006467817.1| PREDICTED: general transcription factor IIH ...   485   e-163
gb|EMJ15015.1| hypothetical protein PRUPE_ppa005553mg [Prunus pe...   477   e-162
ref|XP_004505045.1| PREDICTED: general transcription factor IIH ...   479   e-160
ref|XP_002305144.1| transcription factor-related family protein ...   488   e-159
gb|ESW31311.1| hypothetical protein PHAVU_002G227900g [Phaseolus...   476   e-159
ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arab...   472   e-159
ref|NP_974564.1| transcription factor-related protein [Arabidops...   471   e-159
dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]        471   e-159
ref|NP_193435.2| transcription factor-related protein [Arabidops...   471   e-159
ref|NP_001190745.1| transcription factor-related protein [Arabid...   471   e-158
gb|EOY28282.1| Transcription factor-related isoform 4 [Theobroma...
459 e-158 gb|EOY28281.1| Transcription factor-related isoform 3 [Theobroma... 459 e-158 >ref|XP_006347058.1| PREDICTED: general transcription factor IIH subunit 4-like [Solanum tuberosum] Length = 453 Score = 524 bits (1349), Expect(2) = 0.0 Identities = 255/305 (83%), Positives = 285/305 (93%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL+LISS E KTTNIS SMMK+FQRGLLSQ+DD+EPP LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLNLISSSEGGKTTNISSSMMKIFQRGLLSQRDDREPPRLTESGFQFLLMDTNAQLWY 208 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYIT++E+RGVDSADLI+F+LELSFHVTG+AYN NT+ D+QRSII+DL+DLGLV+LQ Sbjct: 209 IIREYITNAEERGVDSADLIAFLLELSFHVTGKAYNTNTITDLQRSIIKDLSDLGLVKLQ 268 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLSI LADT++RKQGF+V+ETNFR+YAYSTSKLHCEILRLFARVE Sbjct: 269 QGRKESWFIPTKLATNLSISLADTTSRKQGFIVIETNFRLYAYSTSKLHCEILRLFARVE 328 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAI KESLYKAF NGIT++QI+SFLQQNAHPRVAERIPSVPENVTDQIRLWE Sbjct: 329 YQLPNLIVGAINKESLYKAFQNGITSDQIVSFLQQNAHPRVAERIPSVPENVTDQIRLWE 388 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SDLNRVEM P+H DEFPS+DVFEAACDFARE GGLLWE S++MRLVVK + +MKEFL Sbjct: 389 SDLNRVEMEPAHFYDEFPSKDVFEAACDFAREYGGLLWEDSKRMRLVVKADILTEMKEFL 448 Query: 300 RRQKQ 286 RRQKQ Sbjct: 449 RRQKQ 453 Score = 145 bits (365), Expect(2) = 0.0 Identities = 67/81 (82%), Positives = 75/81 (92%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DGFSKHKVAIDRLIQLRV++ET DRKK+ TY+LNP FQ N+QKHIVHGGILPREPMPS Sbjct: 68 LPDGFSKHKVAIDRLIQLRVMTETFDRKKEATYQLNPKFQFNLQKHIVHGGILPREPMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+PSLEEL+AYA+ QWE Sbjct: 128 NITVRLPSLEELEAYAVEQWE 148 >ref|XP_004232855.1| PREDICTED: general transcription factor IIH subunit 4-like [Solanum lycopersicum] Length = 453 Score = 527 bits (1357), Expect(2) = 0.0 Identities 
= 257/305 (84%), Positives = 285/305 (93%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL+LISS EA KTTNIS SMMK+FQRGLLSQ+DD+EPP LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLNLISSSEAGKTTNISSSMMKIFQRGLLSQRDDREPPRLTESGFQFLLMDTNAQLWY 208 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYIT++E+RGVDSADLI+F+LELSFHVTG+AYN NT+ D+QRSII+DL+DLGLV+LQ Sbjct: 209 IIREYITNAEERGVDSADLIAFLLELSFHVTGKAYNTNTITDLQRSIIKDLSDLGLVKLQ 268 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLSI LADT+ RKQGF+V+ETNFR+YAYSTSKLHCEILRLFARVE Sbjct: 269 QGRKESWFIPTKLATNLSISLADTTTRKQGFIVIETNFRLYAYSTSKLHCEILRLFARVE 328 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAI KESLYKAF NGIT+EQI+SFLQQNAHPRVAERIPSVPENVTDQIRLWE Sbjct: 329 YQLPNLIVGAINKESLYKAFQNGITSEQIVSFLQQNAHPRVAERIPSVPENVTDQIRLWE 388 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SDLNRVEM P+H DEFPS+DVFEAACDFARE GGLLWE S++MRLVVK + +MKEFL Sbjct: 389 SDLNRVEMTPAHFYDEFPSKDVFEAACDFAREYGGLLWEDSKRMRLVVKADILAEMKEFL 448 Query: 300 RRQKQ 286 RRQKQ Sbjct: 449 RRQKQ 453 Score = 140 bits (352), Expect(2) = 0.0 Identities = 64/81 (79%), Positives = 74/81 (91%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DGFSKHKVAIDRLIQLRV++ET DRKK+ Y+LNP FQ N+QKHIV+GG+LPREPMPS Sbjct: 68 LPDGFSKHKVAIDRLIQLRVMTETFDRKKEAMYQLNPKFQFNLQKHIVYGGVLPREPMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+PSLEEL+AYA+ QWE Sbjct: 128 NITVRLPSLEELEAYAVEQWE 148 >emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera] Length = 451 Score = 496 bits (1276), Expect(2) = e-172 Identities = 247/304 (81%), Positives = 275/304 (90%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISS + EK TN S S+MKVFQRGLL+Q++ KE P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLISSTQTEKLTNFSSSLMKVFQRGLLTQRE-KEAPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 
IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 I+REYI++SE+RGVD ADLISF+LELSFHVTGEAYN+NTL + QR+ I+DL DLGLV+LQ Sbjct: 208 IMREYISNSEERGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ L+DTS+RKQGFVVVETNFR+YAYS+SKLHCEILRLF+RVE Sbjct: 268 QGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRVE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGITAEQIISFLQQNAHPRVAER P+VPENVTDQIRLWE Sbjct: 328 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAERTPAVPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVE +PSHL DEFPSRDVFEAACDFARE GGLLWE S+KMRLVVK E M+E+L Sbjct: 388 TDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREYL 447 Query: 300 RRQK 289 RR K Sbjct: 448 RRSK 451 Score = 139 bits (349), Expect(2) = e-172 Identities = 62/81 (76%), Positives = 76/81 (93%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADGFSKH+VAIDRLIQLRV +ET DRKK+T+YRLNP FQ N+QKH+++GG+LPREPMPS Sbjct: 68 LADGFSKHRVAIDRLIQLRVFTETSDRKKETSYRLNPTFQTNLQKHLIYGGVLPREPMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+PSL++L+AYA+ QWE Sbjct: 128 NITVRLPSLDDLEAYALGQWE 148 >ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis] gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis] Length = 451 Score = 495 bits (1274), Expect(2) = e-166 Identities = 243/304 (79%), Positives = 278/304 (91%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLLHLI+SG AE++TN S SMMK+FQRGLL+Q+D KE P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLHLINSGHAERSTNFSSSMMKIFQRGLLTQRD-KEAPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE+RG+DSADLISF+LELSFH+TGEAYN+ L + QR++I+DLADLGLV+LQ Sbjct: 208 IIREYISNSEERGLDSADLISFLLELSFHITGEAYNMIMLTEFQRNMIKDLADLGLVKLQ 267 Query: 
840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ L D+S+RKQGFVVVETNFRMYAYSTSKLHCEI+RLF+RVE Sbjct: 268 QGRKESWFIPTKLATNLSMSLTDSSSRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNL+VGA+TKESLY AF NGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE Sbjct: 328 YQLPNLVVGAMTKESLYSAFENGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SD+NRVEM P+HL DEFPSRDVFEAAC+FAR+ GLLWE S++MR+VVK E M+E+L Sbjct: 388 SDMNRVEMTPAHLYDEFPSRDVFEAACNFARDWNGLLWEDSKRMRMVVKAEIHLNMREYL 447 Query: 300 RRQK 289 R QK Sbjct: 448 RGQK 451 Score = 119 bits (298), Expect(2) = e-166 Identities = 53/81 (65%), Positives = 67/81 (82%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SKH+VAIDRL QLR+ +E VDRKK+ +Y+LNP FQ N+QKH++ GG+LP EP+ S Sbjct: 68 LPDGSSKHRVAIDRLTQLRIFTEIVDRKKEISYKLNPTFQTNLQKHLIDGGVLPGEPLAS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NI VR+P+LEELD YA+ QWE Sbjct: 128 NIAVRLPTLEELDTYALGQWE 148 >ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis sativus] Length = 451 Score = 486 bits (1252), Expect(2) = e-165 Identities = 241/304 (79%), Positives = 276/304 (90%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+AEK +NIS S+MKVFQ+GLLSQ+D KE P LTESGFQFLLM+TNAQLWY Sbjct: 149 CFLLQLINSGQAEKPSNISSSVMKVFQKGLLSQRD-KEAPRLTESGFQFLLMETNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI+++E+RGVD ADLISF+LELSFHVTGEAY+++TL+D QR I+DLADLGLV+LQ Sbjct: 208 IIREYISNAEERGVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ LAD+S+RK GFVVVETNFRMYAYSTSKLHCEILRLF+R+E Sbjct: 268 QGRKESWFIPTKLATNLSMSLADSSSRKLGFVVVETNFRMYAYSTSKLHCEILRLFSRIE 327 Query: 660 
YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGITAEQI++FLQQNAHPRVAERIPSVPENVTDQIRLWE Sbjct: 328 YQLPNLIVGAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SDLNRV++ P+H DEFPSR+VFEAACD+ARE GLLWE S+ +RLVVK + M+E L Sbjct: 388 SDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLLWEDSKNLRLVVKADIHTHMREHL 447 Query: 300 RRQK 289 RRQK Sbjct: 448 RRQK 451 Score = 125 bits (315), Expect(2) = e-165 Identities = 58/81 (71%), Positives = 70/81 (86%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SK+KVA+DRLIQLRV ET DRK++TTYRLNP FQ N+QK ++HG +L REPMPS Sbjct: 68 LPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVLAREPMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+PSLE+L+AYA+ QWE Sbjct: 128 NITVRLPSLEDLEAYALDQWE 148 >gb|EPS66674.1| hypothetical protein M569_08097 [Genlisea aurea] Length = 452 Score = 500 bits (1287), Expect(2) = e-165 Identities = 238/303 (78%), Positives = 283/303 (93%) Frame = -3 Query: 1197 FLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWYI 1018 FLLHL++S EAE++TN+S S+MKVFQRGLL+QKDDK+PP LTESGFQFLLMDTNAQLWYI Sbjct: 150 FLLHLMNSAEAERSTNLSSSIMKVFQRGLLTQKDDKQPPKLTESGFQFLLMDTNAQLWYI 209 Query: 1017 IREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQQ 838 IREYI++ EDRGVD+ADL+SFMLELSFH TG+AYN+NTL+D+QR II+DLADLGLV+LQQ Sbjct: 210 IREYISNCEDRGVDTADLLSFMLELSFHYTGKAYNLNTLSDVQRDIIKDLADLGLVKLQQ 269 Query: 837 GRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVEY 658 GRKD WFIPTKLATNLSI LA+TS+RK+GF++VETNFR+YAYS+SKLH E+LRLF+R+EY Sbjct: 270 GRKDSWFIPTKLATNLSISLAETSSRKEGFILVETNFRLYAYSSSKLHTEMLRLFSRIEY 329 Query: 657 QLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWES 478 QLPNL+V ITKESLYKAF NGITAEQII FLQQNAHPRVA R+P VPENVTDQIRLWES Sbjct: 330 QLPNLVVATITKESLYKAFQNGITAEQIIMFLQQNAHPRVALRLPCVPENVTDQIRLWES 389 Query: 477 DLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFLR 298 DLNR+EMIP+H+ 
DEFPSR+VF++AC++AR+ GGLLWESS+KM++VVK + F +MKEFLR Sbjct: 390 DLNRIEMIPAHIFDEFPSREVFDSACEYARDIGGLLWESSKKMQVVVKADVFTEMKEFLR 449 Query: 297 RQK 289 +QK Sbjct: 450 KQK 452 Score = 110 bits (274), Expect(2) = e-165 Identities = 53/81 (65%), Positives = 63/81 (77%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SKH VAIDRL+QLR+ SETV+RKK+T Y +NP FQ N+QK IVHGG+LPREPM Sbjct: 68 LRDGVSKHNVAIDRLLQLRIFSETVERKKETNYVMNPKFQHNLQKCIVHGGVLPREPMLP 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 T R+ S +LDAYA +QWE Sbjct: 128 TATGRLLSPNDLDAYAAQQWE 148 >gb|EOY28279.1| Transcription factor-related isoform 1 [Theobroma cacao] Length = 451 Score = 475 bits (1222), Expect(2) = e-163 Identities = 237/305 (77%), Positives = 270/305 (88%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISSG+AEK TN S SMM++FQRGLL Q++ KE P LTESGFQFLLMDTNAQLWY Sbjct: 148 CFLLQLISSGQAEKPTNFSSSMMRIFQRGLLCQRE-KEAPRLTESGFQFLLMDTNAQLWY 206 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE++GVD ADLI+F+LELSFH TGEAYN+NTL D QR++I+DL+DLGLV+LQ Sbjct: 207 IIREYISNSEEQGVDQADLIAFLLELSFHTTGEAYNMNTLTDDQRAMIKDLSDLGLVKLQ 266 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ L D+S+RKQGFVVVETNFRMYAYS+SKLHCEILRLF+RVE Sbjct: 267 QGRKDSWFIPTKLATNLSVSLTDSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRVE 326 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGI AEQII+FLQQNAHPRVAE++PSVPENVTDQIRLWE Sbjct: 327 YQLPNLIVGAITKESLYNAFENGIAAEQIITFLQQNAHPRVAEKLPSVPENVTDQIRLWE 386 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVEM P+H D+FPSRDVFEAA D AR GLLWE ++KMR+VVK E M+E L Sbjct: 387 TDLNRVEMTPAHFYDDFPSRDVFEAASDLARVHCGLLWEDAKKMRMVVKAEIHMLMREQL 446 Query: 300 RRQKQ 286 R Q + Sbjct: 447 RGQNK 451 Score = 129 bits (325), Expect(2) = e-163 Identities = 60/81 (74%), Positives = 73/81 (90%) Frame = -1 Query: 1454 
LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKHKVAIDRLIQLR+L E +DRKK+T+Y+LNP FQ N++KH+++GGILPREPMPS Sbjct: 68 LADGSSKHKVAIDRLIQLRIL-EVIDRKKETSYKLNPTFQTNLRKHLIYGGILPREPMPS 126 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 N+TVR+P+ EELDAYA QWE Sbjct: 127 NVTVRLPTSEELDAYAHEQWE 147 >gb|EOY28280.1| Transcription factor-related isoform 2 [Theobroma cacao] Length = 412 Score = 475 bits (1222), Expect(2) = e-163 Identities = 237/305 (77%), Positives = 270/305 (88%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISSG+AEK TN S SMM++FQRGLL Q++ KE P LTESGFQFLLMDTNAQLWY Sbjct: 109 CFLLQLISSGQAEKPTNFSSSMMRIFQRGLLCQRE-KEAPRLTESGFQFLLMDTNAQLWY 167 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE++GVD ADLI+F+LELSFH TGEAYN+NTL D QR++I+DL+DLGLV+LQ Sbjct: 168 IIREYISNSEEQGVDQADLIAFLLELSFHTTGEAYNMNTLTDDQRAMIKDLSDLGLVKLQ 227 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ L D+S+RKQGFVVVETNFRMYAYS+SKLHCEILRLF+RVE Sbjct: 228 QGRKDSWFIPTKLATNLSVSLTDSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRVE 287 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGI AEQII+FLQQNAHPRVAE++PSVPENVTDQIRLWE Sbjct: 288 YQLPNLIVGAITKESLYNAFENGIAAEQIITFLQQNAHPRVAEKLPSVPENVTDQIRLWE 347 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVEM P+H D+FPSRDVFEAA D AR GLLWE ++KMR+VVK E M+E L Sbjct: 348 TDLNRVEMTPAHFYDDFPSRDVFEAASDLARVHCGLLWEDAKKMRMVVKAEIHMLMREQL 407 Query: 300 RRQKQ 286 R Q + Sbjct: 408 RGQNK 412 Score = 129 bits (325), Expect(2) = e-163 Identities = 60/81 (74%), Positives = 73/81 (90%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKHKVAIDRLIQLR+L E +DRKK+T+Y+LNP FQ N++KH+++GGILPREPMPS Sbjct: 29 LADGSSKHKVAIDRLIQLRIL-EVIDRKKETSYKLNPTFQTNLRKHLIYGGILPREPMPS 87 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 N+TVR+P+ EELDAYA QWE Sbjct: 88 
NVTVRLPTSEELDAYAHEQWE 108 >ref|XP_006467817.1| PREDICTED: general transcription factor IIH subunit 4-like [Citrus sinensis] Length = 450 Score = 485 bits (1248), Expect(2) = e-163 Identities = 240/305 (78%), Positives = 274/305 (89%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISS +AE+ TN S SMMKVFQRGLLS++D KE P LTESGFQFLLMDTNAQLWY Sbjct: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWY 205 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 I+REYI++S++RG++ ADLISF+LELSFHV GEAYN+NTL++IQ+S+I+D ADLGLV+LQ Sbjct: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ L D+SARK+GF+VVETNFRMYAYSTSKLHCEILRLF++VE Sbjct: 266 QGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGITAEQIISFLQQNAHPRVA+R+PSVPENV DQIRLWE Sbjct: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWE 385 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SDLNRVEM P+H DEFPSRDVFEAACD+AR+ GLLWE +KMRLVVK E M+EFL Sbjct: 386 SDLNRVEMTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDPKKMRLVVKAEIHMHMREFL 445 Query: 300 RRQKQ 286 R Q + Sbjct: 446 RGQNK 450 Score = 118 bits (296), Expect(2) = e-163 Identities = 54/81 (66%), Positives = 70/81 (86%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DGF+KH+VAIDRL+QLR+ SE ++KK+TTYRLN FQ N++KH+++GG LPREPMPS Sbjct: 68 LPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 IT R+P+LE+L+AYAI QWE Sbjct: 126 GITARLPTLEDLEAYAIGQWE 146 >gb|EMJ15015.1| hypothetical protein PRUPE_ppa005553mg [Prunus persica] Length = 455 Score = 477 bits (1228), Expect(2) = e-162 Identities = 239/308 (77%), Positives = 273/308 (88%), Gaps = 3/308 (0%) Frame = -3 Query: 1200 
CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+S + E+ +NIS SMMK+FQRGLLSQ+D KE P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSSQIERPSNISSSMMKIFQRGLLSQRD-KEAPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE+RGVDSADLISF+LELSFHVTGEAYN+NTL ++Q++ I+DLADLGLV+LQ Sbjct: 208 IIREYISNSEERGVDSADLISFLLELSFHVTGEAYNINTLTEVQKNTIKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPT+LATNLS+ L D+S+RKQGFVVVETNFR+YAYSTSKLHCEILRLF+RVE Sbjct: 268 QGRKDSWFIPTRLATNLSVSLTDSSSRKQGFVVVETNFRLYAYSTSKLHCEILRLFSRVE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGITA+QIISFLQQNAHPRVAER+PSVPENVTDQIRLWE Sbjct: 328 YQLPNLIVGAITKESLYSAFENGITADQIISFLQQNAHPRVAERVPSVPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLW---ESSQKMRLVVKGESFPQMK 310 +DLNRVEM P++ D FPSRD+FE A DFAR LLW E S+KM LVVK E+ M+ Sbjct: 388 TDLNRVEMTPAYHYDGFPSRDLFEGASDFARGYNALLWEDPEDSKKMSLVVKAENHVHMR 447 Query: 309 EFLRRQKQ 286 E+L RQ+Q Sbjct: 448 EYLSRQRQ 455 Score = 122 bits (305), Expect(2) = e-162 Identities = 54/81 (66%), Positives = 72/81 (88%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SKH+VAIDRLIQLR+ +ETVDRK++TTY LNP+FQ N++K +++G +LPREPMPS Sbjct: 68 LPDGVSKHRVAIDRLIQLRIFTETVDRKRETTYTLNPIFQTNLKKLLLYGVVLPREPMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 N+TVR+PS E+L+A+A+ QWE Sbjct: 128 NVTVRLPSSEDLEAFALGQWE 148 >ref|XP_004505045.1| PREDICTED: general transcription factor IIH subunit 4-like [Cicer arietinum] Length = 451 Score = 479 bits (1234), Expect(2) = e-160 Identities = 240/304 (78%), Positives = 269/304 (88%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LIS EKT NIS S+MKVFQR LLSQ+D KE P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLISPSHVEKTLNISSSLMKVFQRRLLSQRD-KEAPKLTESGFQFLLMDTNAQLWY 207 Query: 1020 
IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE+RGVD+ DLISFMLELSFHV GEAYN+NTL D QR+II+DLADLGLV+LQ Sbjct: 208 IIREYISNSEERGVDAGDLISFMLELSFHVIGEAYNINTLTDFQRNIIKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ L ++S+RK+GFVVVETNFR+YAYSTSKLHCEILRLF+RVE Sbjct: 268 QGRKESWFIPTKLATNLSVSLTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY A NGITAEQI+SFL+QNAHPRVA+RIP+VPENVTDQIRLWE Sbjct: 328 YQLPNLIVGAITKESLYNALENGITAEQIVSFLRQNAHPRVAQRIPAVPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 SDLNRVEM ++ DEFPSRDVFE ACD ARE GLLWE+S+KM LVVK E +++FL Sbjct: 388 SDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWNGLLWENSKKMHLVVKSEVHSYVRDFL 447 Query: 300 RRQK 289 RRQK Sbjct: 448 RRQK 451 Score = 115 bits (287), Expect(2) = e-160 Identities = 52/81 (64%), Positives = 66/81 (81%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SKHKVAIDRL+QLRV E +DRK + TY++N +Q ++QK +VHGG LPRE MPS Sbjct: 68 LPDGLSKHKVAIDRLVQLRVFVEALDRKNEKTYKVNSTYQRSLQKLLVHGGTLPRESMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+P+LE+L+ YA+ QWE Sbjct: 128 NITVRLPTLEDLETYALEQWE 148 >ref|XP_002305144.1| transcription factor-related family protein [Populus trichocarpa] gi|222848108|gb|EEE85655.1| transcription factor-related family protein [Populus trichocarpa] Length = 449 Score = 488 bits (1256), Expect(2) = e-159 Identities = 240/304 (78%), Positives = 275/304 (90%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISSG+AEK T++S SMMK+FQRGLLSQ+D ++ P LTE GFQFLLMDTNAQLWY Sbjct: 147 CFLLLLISSGQAEKPTSLSSSMMKIFQRGLLSQRD-RDAPRLTEGGFQFLLMDTNAQLWY 205 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYIT+SE+RG + ADLISF+LELSFHVTGEAYN+NTL +IQR+ I+DLA+LGLV+LQ Sbjct: 206 
IIREYITNSEERGTEPADLISFLLELSFHVTGEAYNMNTLTEIQRNTIKDLAELGLVKLQ 265 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRK+ WFIPTKLATNLS+ L D+S+RKQG+VVVETNFR+YAYS+SKLHCEILRLF+++E Sbjct: 266 QGRKESWFIPTKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSSKLHCEILRLFSKIE 325 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGIT++QIISFLQQNAHPRVAER+PSVPENVTDQIRLWE Sbjct: 326 YQLPNLIVGAITKESLYTAFENGITSDQIISFLQQNAHPRVAERLPSVPENVTDQIRLWE 385 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVE+ PSH DEFPSRD FEAACDFARE GLLWE S+KMR+VVK E M+EFL Sbjct: 386 ADLNRVEITPSHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRVVVKAEIHMNMREFL 445 Query: 300 RRQK 289 R QK Sbjct: 446 RGQK 449 Score = 104 bits (259), Expect(2) = e-159 Identities = 47/81 (58%), Positives = 65/81 (80%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG +KH+VAIDRLIQLR+ E D+K++++Y+LN FQ N++KH+ +GG+LPRE M + Sbjct: 68 LADGVTKHRVAIDRLIQLRIFIEVSDKKRESSYKLNQTFQANLRKHLTNGGVLPRETMAA 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 V++PSLEELD YA+ QWE Sbjct: 128 --VVKLPSLEELDTYALEQWE 146 >gb|ESW31311.1| hypothetical protein PHAVU_002G227900g [Phaseolus vulgaris] Length = 451 Score = 476 bits (1226), Expect(2) = e-159 Identities = 240/303 (79%), Positives = 269/303 (88%) Frame = -3 Query: 1197 FLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWYI 1018 FLL LIS +A+K NIS S+MKVFQR LLSQ+D KE P LTESGFQFLLMDTNAQLWYI Sbjct: 150 FLLQLISPAQADKPLNISSSLMKVFQRRLLSQRD-KEAPKLTESGFQFLLMDTNAQLWYI 208 Query: 1017 IREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQQ 838 IREYI++SE+RGVD+ADLISFMLELSFHV GEAY+++TL D QRSII DLADLGLV+LQQ Sbjct: 209 IREYISNSEERGVDAADLISFMLELSFHVIGEAYSISTLTDFQRSIINDLADLGLVKLQQ 268 Query: 837 GRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVEY 658 GRK WFIPTKLATNLS+ LAD+S+RKQGFVVVETNFR+YAYSTSKLHCEILRLF+RVEY Sbjct: 269 GRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEY 328 Query: 657 
QLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWES 478 QLPNLIVGAITKESLY AF NGITA+QI++F+QQNAHPRVAERIP VPENVTDQIRLWE+ Sbjct: 329 QLPNLIVGAITKESLYNAFENGITADQIVTFIQQNAHPRVAERIPCVPENVTDQIRLWEA 388 Query: 477 DLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFLR 298 DLNRVEM ++ DEFPSRDVFE ACD ARE GLLWE S+KM +VVK E P +++FLR Sbjct: 389 DLNRVEMTDAYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHPYVRDFLR 448 Query: 297 RQK 289 RQK Sbjct: 449 RQK 451 Score = 115 bits (287), Expect(2) = e-159 Identities = 52/81 (64%), Positives = 66/81 (81%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 L DG SKH+VA+DRL+QLRV E VDRK + TY++NP +Q ++QK +V GG LPRE MPS Sbjct: 68 LPDGVSKHRVAVDRLVQLRVFLEAVDRKNEKTYKVNPTYQRSLQKLLVQGGTLPRESMPS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 NITVR+P+LE L+AYA+ QWE Sbjct: 128 NITVRLPTLENLEAYALEQWE 148 >ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp. lyrata] gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp. 
lyrata] Length = 452 Score = 472 bits (1215), Expect(2) = e-159 Identities = 231/303 (76%), Positives = 268/303 (88%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+ EK T IS SMM++FQRGLLSQ+D K+ P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSGQGEKLTGISSSMMRIFQRGLLSQRD-KDGPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI+++E+R V+ ADLISF+LELSFHVTGEAYN NTL ++Q + ++DLADLGLV+LQ Sbjct: 208 IIREYISNAEERDVEPADLISFLLELSFHVTGEAYNSNTLTEVQNNTLKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ LAD+SARK+GFVV+ETNFRMYAYSTSKL CEILRLFAR+E Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLI AITKESLY AF NGIT++QII+FLQQN+HPR A+R+PS+PENVTDQIRLWE Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DL R+EM +H DEFPS+DVFEAACDFARE GGLLWE S++MRLVVK E QM+EFL Sbjct: 388 TDLKRIEMTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRLVVKSEVHNQMREFL 447 Query: 300 RRQ 292 Q Sbjct: 448 HNQ 450 Score = 118 bits (295), Expect(2) = e-159 Identities = 53/81 (65%), Positives = 68/81 (83%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKH+VAIDRLIQLR+ SET DRK+ +Y LNP FQ N+QKHI+ GG+LPREPM S Sbjct: 68 LADGASKHRVAIDRLIQLRIFSETSDRKRGISYSLNPTFQNNLQKHIISGGVLPREPMHS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 + +++PSL+EL+ YA++QWE Sbjct: 128 DNAIKLPSLQELETYALKQWE 148 >ref|NP_974564.1| transcription factor-related protein [Arabidopsis thaliana] gi|332658438|gb|AEE83838.1| transcription factor-related protein [Arabidopsis thaliana] Length = 462 Score = 471 bits (1212), Expect(2) = e-159 Identities = 231/305 (75%), Positives = 268/305 (87%) Frame = -3 Query: 1200 
CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+ EK T IS SMMK+FQRGLLSQ+D K+ P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSGQGEKLTGISSSMMKIFQRGLLSQRD-KDGPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI ++E+R VD ADLISF+LELSFHVTG+AYN+NTL ++Q + ++DLADLGLV+LQ Sbjct: 208 IIREYILNAEERDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ LAD+SARK+GFVV+ETNFRMYAYSTSKL CEILRLFAR+E Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLI AITKESLY AF NGIT++QII+FLQQN+HPR A+R+PS+PENVTDQIRLWE Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DL R+EM +H DEFPS+DVFEAACDFARE GLLWE S++MRLVVK E QM+EFL Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447 Query: 300 RRQKQ 286 Q + Sbjct: 448 HTQSR 452 Score = 118 bits (295), Expect(2) = e-159 Identities = 53/81 (65%), Positives = 68/81 (83%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKH+VAIDRLIQLR+ SE DRK+ T+Y LNP FQ N+QKHI+ GG+LPREPM S Sbjct: 68 LADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVLPREPMNS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 + +++PSL+EL+ YA++QWE Sbjct: 128 DNAIKLPSLQELETYALKQWE 148 >dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana] Length = 452 Score = 471 bits (1212), Expect(2) = e-159 Identities = 231/305 (75%), Positives = 268/305 (87%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+ EK T IS SMMK+FQRGLLSQ+D K+ P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSGQGEKLTGISSSMMKIFQRGLLSQRD-KDGPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI ++E+R 
VD ADLISF+LELSFHVTG+AYN+NTL ++Q + ++DLADLGLV+LQ Sbjct: 208 IIREYILNAEERDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ LAD+SARK+GFVV+ETNFRMYAYSTSKL CEILRLFAR+E Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLI AITKESLY AF NGIT++QII+FLQQN+HPR A+R+PS+PENVTDQIRLWE Sbjct: 328 YQLPNLIACAITKESLYNAFGNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DL R+EM +H DEFPS+DVFEAACDFARE GLLWE S++MRLVVK E QM+EFL Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447 Query: 300 RRQKQ 286 Q + Sbjct: 448 HTQSR 452 Score = 118 bits (295), Expect(2) = e-159 Identities = 53/81 (65%), Positives = 68/81 (83%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKH+VAIDRLIQLR+ SE DRK+ T+Y LNP FQ N+QKHI+ GG+LPREPM S Sbjct: 68 LADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVLPREPMNS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 + +++PSL+EL+ YA++QWE Sbjct: 128 DNAIKLPSLQELETYALKQWE 148 >ref|NP_193435.2| transcription factor-related protein [Arabidopsis thaliana] gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana] gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana] gi|332658439|gb|AEE83839.1| transcription factor-related protein [Arabidopsis thaliana] Length = 452 Score = 471 bits (1212), Expect(2) = e-159 Identities = 231/305 (75%), Positives = 268/305 (87%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+ EK T IS SMMK+FQRGLLSQ+D K+ P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSGQGEKLTGISSSMMKIFQRGLLSQRD-KDGPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI ++E+R VD ADLISF+LELSFHVTG+AYN+NTL ++Q + ++DLADLGLV+LQ Sbjct: 
208 IIREYILNAEERDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ LAD+SARK+GFVV+ETNFRMYAYSTSKL CEILRLFAR+E Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLI AITKESLY AF NGIT++QII+FLQQN+HPR A+R+PS+PENVTDQIRLWE Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DL R+EM +H DEFPS+DVFEAACDFARE GLLWE S++MRLVVK E QM+EFL Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447 Query: 300 RRQKQ 286 Q + Sbjct: 448 HTQSR 452 Score = 118 bits (295), Expect(2) = e-159 Identities = 53/81 (65%), Positives = 68/81 (83%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKH+VAIDRLIQLR+ SE DRK+ T+Y LNP FQ N+QKHI+ GG+LPREPM S Sbjct: 68 LADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVLPREPMNS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 + +++PSL+EL+ YA++QWE Sbjct: 128 DNAIKLPSLQELETYALKQWE 148 >ref|NP_001190745.1| transcription factor-related protein [Arabidopsis thaliana] gi|332658440|gb|AEE83840.1| transcription factor-related protein [Arabidopsis thaliana] Length = 482 Score = 471 bits (1211), Expect(2) = e-158 Identities = 231/303 (76%), Positives = 267/303 (88%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LI+SG+ EK T IS SMMK+FQRGLLSQ+D K+ P LTESGFQFLLMDTNAQLWY Sbjct: 149 CFLLQLINSGQGEKLTGISSSMMKIFQRGLLSQRD-KDGPRLTESGFQFLLMDTNAQLWY 207 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI ++E+R VD ADLISF+LELSFHVTG+AYN+NTL ++Q + ++DLADLGLV+LQ Sbjct: 208 IIREYILNAEERDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQ 267 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ 
LAD+SARK+GFVV+ETNFRMYAYSTSKL CEILRLFAR+E Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLI AITKESLY AF NGIT++QII+FLQQN+HPR A+R+PS+PENVTDQIRLWE Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DL R+EM +H DEFPS+DVFEAACDFARE GLLWE S++MRLVVK E QM+EFL Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447 Query: 300 RRQ 292 Q Sbjct: 448 HTQ 450 Score = 118 bits (295), Expect(2) = e-158 Identities = 53/81 (65%), Positives = 68/81 (83%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKH+VAIDRLIQLR+ SE DRK+ T+Y LNP FQ N+QKHI+ GG+LPREPM S Sbjct: 68 LADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVLPREPMNS 127 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 + +++PSL+EL+ YA++QWE Sbjct: 128 DNAIKLPSLQELETYALKQWE 148 >gb|EOY28282.1| Transcription factor-related isoform 4 [Theobroma cacao] Length = 445 Score = 459 bits (1181), Expect(2) = e-158 Identities = 232/305 (76%), Positives = 264/305 (86%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISSG+AEK TN S SMM++FQRGLL Q++ KE P LTESGFQFLLMDTNAQLWY Sbjct: 148 CFLLQLISSGQAEKPTNFSSSMMRIFQRGLLCQRE-KEAPRLTESGFQFLLMDTNAQLWY 206 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE++GVD ADLI+F+LELSFH TGEAYN+NTL D QR++I+DL+DLGLV+LQ Sbjct: 207 IIREYISNSEEQGVDQADLIAFLLELSFHTTGEAYNMNTLTDDQRAMIKDLSDLGLVKLQ 266 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ L D+S+RKQGFVVVETNFRMYAYS+SKLHCEILRLF+RVE Sbjct: 267 QGRKDSWFIPTKLATNLSVSLTDSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRVE 326 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGI AE QQNAHPRVAE++PSVPENVTDQIRLWE Sbjct: 327 
YQLPNLIVGAITKESLYNAFENGIAAE------QQNAHPRVAEKLPSVPENVTDQIRLWE 380 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVEM P+H D+FPSRDVFEAA D AR GLLWE ++KMR+VVK E M+E L Sbjct: 381 TDLNRVEMTPAHFYDDFPSRDVFEAASDLARVHCGLLWEDAKKMRMVVKAEIHMLMREQL 440 Query: 300 RRQKQ 286 R Q + Sbjct: 441 RGQNK 445 Score = 129 bits (325), Expect(2) = e-158 Identities = 60/81 (74%), Positives = 73/81 (90%) Frame = -1 Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275 LADG SKHKVAIDRLIQLR+L E +DRKK+T+Y+LNP FQ N++KH+++GGILPREPMPS Sbjct: 68 LADGSSKHKVAIDRLIQLRIL-EVIDRKKETSYKLNPTFQTNLRKHLIYGGILPREPMPS 126 Query: 1274 NITVRIPSLEELDAYAIRQWE 1212 N+TVR+P+ EELDAYA QWE Sbjct: 127 NVTVRLPTSEELDAYAHEQWE 147 >gb|EOY28281.1| Transcription factor-related isoform 3 [Theobroma cacao] Length = 406 Score = 459 bits (1181), Expect(2) = e-158 Identities = 232/305 (76%), Positives = 264/305 (86%) Frame = -3 Query: 1200 CFLLHLISSGEAEKTTNISPSMMKVFQRGLLSQKDDKEPPMLTESGFQFLLMDTNAQLWY 1021 CFLL LISSG+AEK TN S SMM++FQRGLL Q++ KE P LTESGFQFLLMDTNAQLWY Sbjct: 109 CFLLQLISSGQAEKPTNFSSSMMRIFQRGLLCQRE-KEAPRLTESGFQFLLMDTNAQLWY 167 Query: 1020 IIREYITHSEDRGVDSADLISFMLELSFHVTGEAYNVNTLNDIQRSIIRDLADLGLVRLQ 841 IIREYI++SE++GVD ADLI+F+LELSFH TGEAYN+NTL D QR++I+DL+DLGLV+LQ Sbjct: 168 IIREYISNSEEQGVDQADLIAFLLELSFHTTGEAYNMNTLTDDQRAMIKDLSDLGLVKLQ 227 Query: 840 QGRKDCWFIPTKLATNLSICLADTSARKQGFVVVETNFRMYAYSTSKLHCEILRLFARVE 661 QGRKD WFIPTKLATNLS+ L D+S+RKQGFVVVETNFRMYAYS+SKLHCEILRLF+RVE Sbjct: 228 QGRKDSWFIPTKLATNLSVSLTDSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRVE 287 Query: 660 YQLPNLIVGAITKESLYKAFLNGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 481 YQLPNLIVGAITKESLY AF NGI AE QQNAHPRVAE++PSVPENVTDQIRLWE Sbjct: 288 YQLPNLIVGAITKESLYNAFENGIAAE------QQNAHPRVAEKLPSVPENVTDQIRLWE 341 Query: 480 SDLNRVEMIPSHLIDEFPSRDVFEAACDFARECGGLLWESSQKMRLVVKGESFPQMKEFL 301 +DLNRVEM P+H D+FPSRDVFEAA D AR GLLWE ++KMR+VVK E M+E L Sbjct: 342 TDLNRVEMTPAHFYDDFPSRDVFEAASDLARVHCGLLWEDAKKMRMVVKAEIHMLMREQL 401 Query: 300 
RRQKQ 286
R Q +
Sbjct: 402 RGQNK 406

 Score = 129 bits (325), Expect(2) = e-158
 Identities = 60/81 (74%), Positives = 73/81 (90%)
 Frame = -1

Query: 1454 LADGFSKHKVAIDRLIQLRVLSETVDRKKDTTYRLNPMFQLNIQKHIVHGGILPREPMPS 1275
            LADG SKHKVAIDRLIQLR+L E +DRKK+T+Y+LNP FQ N++KH+++GGILPREPMPS
Sbjct: 29   LADGSSKHKVAIDRLIQLRIL-EVIDRKKETSYKLNPTFQTNLRKHLIYGGILPREPMPS 87

Query: 1274 NITVRIPSLEELDAYAIRQWE 1212
            N+TVR+P+ EELDAYA  QWE
Sbjct: 88   NVTVRLPTSEELDAYAHEQWE 108