BLASTX nr result
ID: Akebia22_contig00007261
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00007261
         (2204 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007039139.1| Transcription factor IIIC, subunit 5, putati...  553   e-155
ref|XP_007039140.1| Transcription factor IIIC, subunit 5, putati...  552   e-154
ref|XP_002275875.1| PREDICTED: transcription factor tau subunit ...  547   e-153
emb|CBI24753.3| unnamed protein product [Vitis vinifera]             541   e-151
ref|XP_007039138.1| General transcription factor 3C polypeptide ...  508   e-141
gb|EYU34318.1| hypothetical protein MIMGU_mgv1a003054mg [Mimulus...  501   e-139
ref|XP_004251822.1| PREDICTED: general transcription factor 3C p...  498   e-138
ref|XP_006464858.1| PREDICTED: general transcription factor 3C p...  491   e-136
ref|XP_006350004.1| PREDICTED: general transcription factor 3C p...  481   e-133
ref|XP_004297697.1| PREDICTED: general transcription factor 3C p...  476   e-131
ref|XP_003537671.1| PREDICTED: general transcription factor 3C p...  476   e-131
ref|XP_002529107.1| conserved hypothetical protein [Ricinus comm...  464   e-128
gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis]    461   e-127
ref|XP_007203854.1| hypothetical protein PRUPE_ppa004640mg [Prun...  456   e-125
ref|XP_002323927.1| transcription factor-related family protein ...  452   e-124
ref|XP_003622988.1| General transcription factor 3C polypeptide ...  450   e-123
ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidops...  447   e-123
dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]          447   e-122
ref|NP_197833.2| transcription factor IIIC, subunit 5 [Arabidops...
446 e-122 ref|XP_006290824.1| hypothetical protein CARUB_v10016933mg [Caps... 444 e-121 >ref|XP_007039139.1| Transcription factor IIIC, subunit 5, putative isoform 2 [Theobroma cacao] gi|508776384|gb|EOY23640.1| Transcription factor IIIC, subunit 5, putative isoform 2 [Theobroma cacao] Length = 582 Score = 553 bits (1426), Expect = e-155 Identities = 305/611 (49%), Positives = 376/611 (61%), Gaps = 3/611 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FAVH+PGYP +T+RA+ETLGG EGI++ARSSQSN LELHFRPEDPYS PAFG Sbjct: 11 GTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRPEDPYSRPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELRPC +D Q A S + CS++ S P Sbjct: 71 ELRPCNNLLLKISKKKSADGQSAEASSKVRECSTS---------------GATDSENPKQ 115 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 A EVQI E+ + +L A+IV+RVSEAY+F+GM DYQHVL+VHAD AR+++ Sbjct: 116 PSQA-----EVQISEQEQT-NLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRK--- 166 Query: 1664 DVQPDIVNKSGFRN--ESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN E+ FEKGG MDV +E++M+++PPLFSPKDMPE Sbjct: 167 ------------RNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTI 214 Query: 1490 XXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFD 1311 KQE +VQ E+D+ P LAID NI+EIP KVNWE+ + RGS+ W WQM+V+KLFD Sbjct: 215 LSSKKKQEGVVQNTAEVDLEPGLAIDFNIKEIPKKVNWEELITRGSEQWEWQMIVSKLFD 274 Query: 1310 ERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESR 1131 ERPIW K S+ ERL D+ L F +LKRLL AYYFS GPF FWI+KGYDPRKDP+SR Sbjct: 275 ERPIWPKESVTERLLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYDPRKDPDSR 334 Query: 1130 IYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEI 951 IYQ +FRVP LR+ D NT +K KHKW DLC+F+VFP+KCQT QL EL DDYIQQEI Sbjct: 335 IYQRTEFRVPEPLRSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELDDDYIQQEI 394 Query: 950 RKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHA 771 RKPPK TC TGWFS VLD LRLRVAVRFLS+YPK+GA+ + KS SD FEKL+R+ Sbjct: 395 
RKPPKLATCDSKTGWFSECVLDCLRLRVAVRFLSVYPKDGAESIRKSYSDEFEKLKRSCI 454 Query: 770 LKRDLRP-EEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXX 594 K ++E + N++ + P + E Sbjct: 455 YKDVFNSHQQEIRRTNRELIGDEDKERPKSSDNEE--------------------DEIDA 494 Query: 593 XXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYAD 414 ++ +L G+D LQP +Y +N S YLQELFGSFPS G++ +Q AD Sbjct: 495 DDDEELDVYETLNLGGEDDEIPLQPDTYLDMENNSRTYLQELFGSFPSVVGGDA-IQAAD 553 Query: 413 SSDDEYQIFEQ 381 SD EYQI+EQ Sbjct: 554 ISDGEYQIYEQ 564 >ref|XP_007039140.1| Transcription factor IIIC, subunit 5, putative isoform 3 [Theobroma cacao] gi|508776385|gb|EOY23641.1| Transcription factor IIIC, subunit 5, putative isoform 3 [Theobroma cacao] Length = 579 Score = 552 bits (1423), Expect = e-154 Identities = 304/610 (49%), Positives = 374/610 (61%), Gaps = 2/610 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FAVH+PGYP +T+RA+ETLGG EGI++ARSSQSN LELHFRPEDPYS PAFG Sbjct: 11 GTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRPEDPYSRPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELRPC +D Q A S + CS++ S P Sbjct: 71 ELRPCNNLLLKISKKKSADGQSAEASSKVRECSTS---------------GATDSENPKQ 115 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 A EVQI E+ + +L A+IV+RVSEAY+F+GM DYQHVL+VHAD AR+++ Sbjct: 116 PSQA-----EVQISEQEQT-NLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRK--- 166 Query: 1664 DVQPDIVNKSGFRN--ESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN E+ FEKGG MDV +E++M+++PPLFSPKDMPE Sbjct: 167 ------------RNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTI 214 Query: 1490 XXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFD 1311 KQE +VQ E+D+ P LAID NI+EIP KVNWE+ + RGS+ W WQM+V+KLFD Sbjct: 215 LSSKKKQEGVVQNTAEVDLEPGLAIDFNIKEIPKKVNWEELITRGSEQWEWQMIVSKLFD 274 Query: 1310 ERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESR 1131 ERPIW K S+ ERL D+ L F +LKRLL AYYFS GPF FWI+KGYDPRKDP+SR Sbjct: 275 
ERPIWPKESVTERLLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYDPRKDPDSR 334 Query: 1130 IYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEI 951 IYQ +FRVP LR+ D NT +K KHKW DLC+F+VFP+KCQT QL EL DDYIQQEI Sbjct: 335 IYQRTEFRVPEPLRSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELDDDYIQQEI 394 Query: 950 RKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHA 771 RKPPK TC TGWFS VLD LRLRVAVRFLS+YPK+GA+ + KS SD FEKL+R+ Sbjct: 395 RKPPKLATCDSKTGWFSECVLDCLRLRVAVRFLSVYPKDGAESIRKSYSDEFEKLKRSCI 454 Query: 770 LKRDLRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXX 591 K + Q + + + + P + E Sbjct: 455 YKDVFNSHQ--QEIRRTNRGDEDKERPKSSDNEE--------------------DEIDAD 492 Query: 590 XXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYADS 411 ++ +L G+D LQP +Y +N S YLQELFGSFPS G++ +Q AD Sbjct: 493 DDEELDVYETLNLGGEDDEIPLQPDTYLDMENNSRTYLQELFGSFPSVVGGDA-IQAADI 551 Query: 410 SDDEYQIFEQ 381 SD EYQI+EQ Sbjct: 552 SDGEYQIYEQ 561 >ref|XP_002275875.1| PREDICTED: transcription factor tau subunit sfc1-like [Vitis vinifera] Length = 568 Score = 547 bits (1410), Expect = e-153 Identities = 308/610 (50%), Positives = 372/610 (60%), Gaps = 2/610 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G +P E F+VHYP YPSST+RA+ETLGG + I KARSSQSN LELHFRPEDPYSHPAFG Sbjct: 11 GYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 EL+PC +D Q SES++T + + Sbjct: 71 ELQPCNNLLLRISKKKSTDGQ----SESVATGEEVEAQI--------------------- 105 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCR- 1668 S EV I+ L A+I+ARVSEAY+FNGMVDYQHVL VHADVARRK+ Sbjct: 106 -------SGEVPIR-------LCADIIARVSEAYHFNGMVDYQHVLPVHADVARRKKRNW 151 Query: 1667 EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXX 1488 +V+P + EKG L+DV +E+LMIL+PPLFSPKD+PE Sbjct: 152 AEVEPHL---------------EKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTL 196 Query: 1487 XXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDE 1308 KQE 
+VQQRWEM I PCLAID I+EIP KVNWE Y+P+GS+ W WQM V+ LFDE Sbjct: 197 NLKKKQEGVVQQRWEMGIEPCLAIDFEIKEIPKKVNWEQYIPKGSEQWEWQMAVSNLFDE 256 Query: 1307 RPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRI 1128 RPIW K +L ERL D+ L+ G + L+RLLFRTAYYFS GPF FWIRKGYDPRK+P+S I Sbjct: 257 RPIWPKGALTERLLDKGLNVGDYTLRRLLFRTAYYFSNGPFLRFWIRKGYDPRKNPDSCI 316 Query: 1127 YQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIR 948 YQ +DFRVPPSLR+ D N + K +W D+C+F+VFP+KC TS QL EL DDYIQQEIR Sbjct: 317 YQRIDFRVPPSLRSYCDANAANGLKQRWEDICSFRVFPYKCHTSLQLFELADDYIQQEIR 376 Query: 947 KPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHAL 768 KP KQ TC+ +TGWFS VL+ LRL V VRFLSI P+ A+ LLKSASDRFEK +R H Sbjct: 377 KPLKQTTCTGATGWFSYRVLESLRLCVMVRFLSICPETSAEYLLKSASDRFEKSKRMHIY 436 Query: 767 KRDLRPEEEN-QYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXX 591 + +LRP EE Q VNK+ + P + E Sbjct: 437 ENNLRPNEEGIQEVNKELEGDKDKEEPNDVDDDE---------EDEMEAENGEEELDAYE 487 Query: 590 XXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYADS 411 S + FS+ +NIS +YLQ LFGSF T+AG +Q AD+ Sbjct: 488 ALDMKIVERSVNTLRSSFGFSIYILDLD-AENISRDYLQGLFGSFSFTKAGGGEVQDADT 546 Query: 410 SDDEYQIFEQ 381 SD EYQI+EQ Sbjct: 547 SDGEYQIYEQ 556 >emb|CBI24753.3| unnamed protein product [Vitis vinifera] Length = 597 Score = 541 bits (1393), Expect = e-151 Identities = 313/648 (48%), Positives = 377/648 (58%), Gaps = 40/648 (6%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G +P E F+VHYP YPSST+RA+ETLGG + I KARSSQSN LELHFRPEDPYSHPAFG Sbjct: 11 GYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 EL+PC +D Q A VS +S Q SG Sbjct: 71 ELQPCNNLLLRISKKKSTDGQSAEVSSKVSK---------------------SQISG--- 106 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCR- 1668 EV I+ L A+I+ARVSEAY+FNGMVDYQHVL VHADVARRK+ Sbjct: 107 ---------EVPIR-------LCADIIARVSEAYHFNGMVDYQHVLPVHADVARRKKRNW 150 Query: 1667 
EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXX 1488 +V+P + EKG L+DV +E+LMIL+PPLFSPKD+PE Sbjct: 151 AEVEPHL---------------EKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTL 195 Query: 1487 XXXXKQEAIVQQRWEMDIAPCLAIDCNIEEI----------------------------- 1395 KQE +VQQRWEM I PCLAID I++I Sbjct: 196 NLKKKQEGVVQQRWEMGIEPCLAIDFEIKDILIIYCLYRMCITSHMTSFSRIPLKLLVTP 255 Query: 1394 ---------PSKVNWEDYVPRGSDSWNWQMVVAKLFDERPIWTKHSLIERLHDESLHFGV 1242 P KVNWE Y+P+GS+ W WQM V+ LFDERPIW K +L ERL D+ L+ G Sbjct: 256 LLTKVVEIIPKKVNWEQYIPKGSEQWEWQMAVSNLFDERPIWPKGALTERLLDKGLNVGD 315 Query: 1241 HLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRIYQSVDFRVPPSLRNIEDVNTTD 1062 + L+RLLFRTAYYFS GPF FWIRKGYDPRK+P+S IYQ +DFRVPPSLR+ D N + Sbjct: 316 YTLRRLLFRTAYYFSNGPFLRFWIRKGYDPRKNPDSCIYQRIDFRVPPSLRSYCDANAAN 375 Query: 1061 KFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRKPPKQMTCSCSTGWFSSDVLDI 882 K +W D+C+F+VFP+KC TS QL EL DDYIQQEIRKP KQ TC+ +TGWFS VL+ Sbjct: 376 GLKQRWEDICSFRVFPYKCHTSLQLFELADDYIQQEIRKPLKQTTCTGATGWFSYRVLES 435 Query: 881 LRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHALKRDLRPEEEN-QYVNKDSSSCA 705 LRL V VRFLSI P+ A+ LLKSASDRFEK +R H + +LRP EE Q VNK+ Sbjct: 436 LRLCVMVRFLSICPETSAEYLLKSASDRFEKSKRMHIYENNLRPNEEGIQEVNKELEGDK 495 Query: 704 TRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSPHLDGDDGNFSL 525 + P + E ++ + G+D SL Sbjct: 496 DKEEPNDVDDDE------------------EDEMEAENGEEELDAYEALDMVGEDDEDSL 537 Query: 524 QPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYADSSDDEYQIFEQ 381 Q SY +NIS +YLQ LFGSF T+AG +Q AD+SD EYQI+EQ Sbjct: 538 QSRSYLDAENISRDYLQGLFGSFSFTKAGGGEVQDADTSDGEYQIYEQ 585 >ref|XP_007039138.1| General transcription factor 3C polypeptide 5, putative isoform 1 [Theobroma cacao] gi|508776383|gb|EOY23639.1| General transcription factor 3C polypeptide 5, putative isoform 1 [Theobroma cacao] Length = 630 Score = 508 bits (1307), Expect = e-141 Identities = 300/659 (45%), Positives = 371/659 (56%), Gaps = 51/659 (7%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FAVH+PGYP +T+RA+ETLGG 
EGI++ARSSQSN LELHFRPEDPYS PAFG Sbjct: 11 GTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRPEDPYSRPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELRPC +D Q A S + CS++ S P Sbjct: 71 ELRPCNNLLLKISKKKSADGQSAEASSKVRECSTS---------------GATDSENPKQ 115 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 A EVQI E+ + +L A+IV+RVSEAY+F+GM DYQHVL+VHAD AR+++ Sbjct: 116 PSQA-----EVQISEQEQT-NLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRK--- 166 Query: 1664 DVQPDIVNKSGFRN--ESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN E+ FEKGG MDV +E++M+++PPLFSPKDMPE Sbjct: 167 ------------RNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTI 214 Query: 1490 XXXXXKQEAIVQQRWE----MDIAPCL----AIDCNIEEIPSKVNWEDYVPRGSDSWNWQ 1335 KQE +VQ E +D L +D +IP KVNWE+ + RGS+ W WQ Sbjct: 215 LSSKKKQEGVVQNTAENVSNLDAVQILFSIFLLDLAFSQIPKKVNWEELITRGSEQWEWQ 274 Query: 1334 MVVAKLFDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYD 1155 M+V+KLFDERPIW K S+ ERL D+ L F +LKRLL AYYFS GPF FWI+KGYD Sbjct: 275 MIVSKLFDERPIWPKESVTERLLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYD 334 Query: 1154 PRKDPESRIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELV 975 PRKDP+SRIYQ +FRVP LR+ D NT +K KHKW DLC+F+VFP+KCQT QL EL Sbjct: 335 PRKDPDSRIYQRTEFRVPEPLRSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELD 394 Query: 974 DDYIQQEIRKPPKQMTC-------------------SCSTGWFSSDVLDILRLRVAVRFL 852 DDYIQQEIRKPPK TC TGWFS VLD LRLRVAVRFL Sbjct: 395 DDYIQQEIRKPPKLATCDGGCLWGVVIGVVGDLDTLQSKTGWFSECVLDCLRLRVAVRFL 454 Query: 851 SIYPKEGAKDLLKSASDRFEKLRRAHALKRDLRP-EEENQYVNKDSSSCATRVSPIKHNG 675 S+YPK+GA+ + KS SD FEKL+R+ K ++E + N++ + P + Sbjct: 455 SVYPKDGAESIRKSYSDEFEKLKRSCIYKDVFNSHQQEIRRTNRELIGDEDKERPKSSDN 514 Query: 674 TEMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSPHLDGDDGNFSLQP-------- 519 E ++ +L G+D LQP Sbjct: 515 EE--------------------DEIDADDDEELDVYETLNLGGEDDEIPLQPDTFFGFVR 554 Query: 518 -------CSYPI------GKNISTNYLQELFGSFPSTEAGNSNMQYADSSDDEYQIFEQ 381 +PI +N S YLQELFGSFPS G++ +Q AD SD EYQI+EQ Sbjct: 555 
IWMFFVCLRFPIYCLDLDMENNSRTYLQELFGSFPSVVGGDA-IQAADISDGEYQIYEQ 612 >gb|EYU34318.1| hypothetical protein MIMGU_mgv1a003054mg [Mimulus guttatus] Length = 611 Score = 501 bits (1291), Expect = e-139 Identities = 290/610 (47%), Positives = 361/610 (59%), Gaps = 3/610 (0%) Frame = -1 Query: 2204 GVLPEK-EGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAF 2028 GVLP E FAV YPGYP+S RA+ETLGG +GI KAR+ +SN LELHFRPEDPYSHP F Sbjct: 11 GVLPSSSEAFAVLYPGYPTSIGRAIETLGGDQGIAKARTDKSNRLELHFRPEDPYSHPLF 70 Query: 2027 GELRPCXXXXXXXXXXXXSDDQDALVSESMST-CSSTKTNLEPVSCSPETVQNGQQSSGP 1851 G+L+ C D D S+S S L S PE+ ++ + P Sbjct: 71 GKLKSCNNFLLKISKTKVKDTHDIKELNSLSEHASEDSLRLSNNSLIPESTESTAHIAQP 130 Query: 1850 VNSISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRC 1671 S + S++ QI+ A + LSA+IVARVSEAY+F GMVDYQHVL++HAD RRK+ Sbjct: 131 ECDFS--DPSDKAQIKNGA-QEQLSADIVARVSEAYHFKGMVDYQHVLAIHADRTRRKKR 187 Query: 1670 R-EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXX 1494 +V+P +FEKGGL+D+ +E+LMILVPPLFS KD+P+ Sbjct: 188 NWAEVEP---------------QFEKGGLVDIDQEDLMILVPPLFSLKDIPDTIVLKSSG 232 Query: 1493 XXXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLF 1314 KQ+ VQ R EM+I PCLAID NI+EIP +VNWE V R SD W+ M V +LF Sbjct: 233 EMSLKKKQKGDVQPREEMEIEPCLAIDFNIKEIPKRVNWEKSVTRNSDRWHGLMAVCELF 292 Query: 1313 DERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPES 1134 DERP+W K SL E+LHD L+ +LKR L AYYFS GP+ FWIRKGYDPRKDPES Sbjct: 293 DERPVWVKKSLAEQLHDRGLNVENKMLKRFLVVVAYYFSNGPYLRFWIRKGYDPRKDPES 352 Query: 1133 RIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQE 954 RIYQ DFRVPPSLR+ + K +W D+C F+VFP KCQ S QL EL DDYIQQE Sbjct: 353 RIYQRTDFRVPPSLRSYCYSDAVSGSKSRWEDICAFRVFPRKCQISLQLFELKDDYIQQE 412 Query: 953 IRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAH 774 IRKP + CS TGWFSS V+D LRLRVA RFLS YP+ GA+ LKSAS+RFEK +RAH Sbjct: 413 IRKPASEGNCSLQTGWFSSQVIDCLRLRVAQRFLSAYPETGAELFLKSASNRFEKSKRAH 472 Query: 773 ALKRDLRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXX 594 ++L+ + EN+ +K+ + + + T 
Sbjct: 473 LNVKNLKVDAENKPADKEVLESEDKEANDEEKETN---DEDKEANDEIEYEEEDEEDEMD 529 Query: 593 XXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYAD 414 ++ L D +F P SY ++IS YLQELFGSFP G +Q D Sbjct: 530 DDNLDMDADEAFDLVDQDWDFP-PPNSYTNHESISKGYLQELFGSFPFGGGGGGEVQDVD 588 Query: 413 SSDDEYQIFE 384 D E+QI+E Sbjct: 589 PDDGEFQIYE 598 >ref|XP_004251822.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Solanum lycopersicum] Length = 597 Score = 498 bits (1281), Expect = e-138 Identities = 283/611 (46%), Positives = 364/611 (59%), Gaps = 3/611 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G+LP E FAVHYP YPSS RAVETLGGI+GI+KAR+SQSN LELHFRPEDPYSHP FG Sbjct: 11 GILPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSQSNKLELHFRPEDPYSHPTFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSST-KTNLEPVSCSPETVQNGQQSSGPV 1848 EL+ D + A + S+C +++ V+C E N Sbjct: 71 ELKHSNNFLLKISKCKVRDVRSA--DSADSSCGIVIQSSRSLVNCEQE---NAAPKLNEP 125 Query: 1847 NSISAVNKSNEVQIQEEA-VSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRC 1671 +SA S E+++Q + + +HLSA IV+ VSEAY+FNGMVDYQHVL+VHAD ARRK+ Sbjct: 126 RCLSA-GASKEIEMQTDTNLQEHLSANIVSHVSEAYHFNGMVDYQHVLAVHADDARRKKR 184 Query: 1670 R-EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXX 1494 + +V+P +FEKGGLMDV +E++MIL+P LF+ KDMP+ Sbjct: 185 QWAEVEP---------------KFEKGGLMDVDQEDMMILLPSLFASKDMPDNIVLKSCT 229 Query: 1493 XXXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLF 1314 KQE + WE ++ P LAID I+EIP V+WE Y+P+GSD W WQ V++LF Sbjct: 230 TVGSKRKQEG--RHNWEREMEPSLAIDFAIKEIPKPVDWEKYIPQGSDRWRWQKAVSELF 287 Query: 1313 DERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPES 1134 +ER IW K SL ERLHD L F ++LKRLL AYYF GPFR FWI+KGYDPRKDPES Sbjct: 288 EERKIWAKESLAERLHDRGLKFRDNMLKRLLCGVAYYFLNGPFRRFWIKKGYDPRKDPES 347 Query: 1133 RIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQE 954 RIYQ++DFRV LR+ + ++ +H+W D+C F+VFP KCQ + QL EL DDYIQQE Sbjct: 348 RIYQNIDFRVHHELRSYCESRSSSGLQHRWDDICAFRVFPCKCQLALQLCELKDDYIQQE 407 Query: 953 
IRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAH 774 I KP K+ TC+ TGWFS +D LR R+ VRF+S+ P A+ LL S S RFEK +R H Sbjct: 408 ISKPSKEETCNNVTGWFSFHTIDCLRRRIDVRFMSVCPHPRAESLLNSMSTRFEKSKRTH 467 Query: 773 ALKRDLRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXX 594 + RPEE+ + NKD+ + H+ + Sbjct: 468 TYVKVARPEEQEK-TNKDAENNEVDEQAENHDVDD-----------PDDLEDYEDEFDDD 515 Query: 593 XXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYAD 414 +S L +GN SL + N+S +YLQELFG+FPS AG +Q D Sbjct: 516 NVEEEMDAYESLDLAVQEGNVSLHDDPHTNHDNVSRDYLQELFGNFPSNTAGMDEVQ-DD 574 Query: 413 SSDDEYQIFEQ 381 S EYQI++Q Sbjct: 575 QSLGEYQIYDQ 585 >ref|XP_006464858.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Citrus sinensis] Length = 605 Score = 491 bits (1263), Expect = e-136 Identities = 290/625 (46%), Positives = 369/625 (59%), Gaps = 17/625 (2%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FAVHYPGY SSTSRA++TLGG E I+KARSS+SN LEL FRPEDPYSHPAFG Sbjct: 11 GNLPSNEVFAVHYPGYSSSTSRAIQTLGGSEAILKARSSKSNKLELRFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E+RPC + L+ S S P S +T ++ + V Sbjct: 71 EVRPC---------------NNLLLKMSKKKTSQPCDGQSP-KLSNQTFKHPLHDAADVG 114 Query: 1844 SISAVNK--SNEVQIQEEAVSK------HLSAEIVARVSEAYNFNGMVDYQHVLSVHADV 1689 ++ +++ S+ V ++EA + +L A+IVARVSEAY+F+GM DYQHV++VHADV Sbjct: 115 NVPEIHQLESDSVVSRKEAEKQKSEDQVNLFADIVARVSEAYHFDGMADYQHVVAVHADV 174 Query: 1688 ARRKRCREDVQPDIVNKSGFRN--ESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEX 1515 ARRK+ RN E +FEKGGL+D+ +++M+++PPLF+PKD+PE Sbjct: 175 ARRKK---------------RNWTEVEEPQFEKGGLIDLDEDDVMMILPPLFAPKDVPEN 219 Query: 1514 XXXXXXXXXXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEI------PSKVNWEDYVPRGS 1353 K+ + Q E DI LAID NI++I S WE+++ R S Sbjct: 220 LVLRPSVIPSSLKKEARVEQNISEKDIESGLAIDFNIKDILLFYLCSSAPPWEEFISRDS 279 Query: 1352 DSWNWQMVVAKLFDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFW 1173 + W WQM V+KLFDE+PIW K S+ +R+ DE L F +LKRLL AYYFS+GPF FW Sbjct: 280 
EQWKWQMAVSKLFDEQPIWPKSSINDRMLDEGLKFNSIMLKRLLLGIAYYFSSGPFLRFW 339 Query: 1172 IRKGYDPRKDPESRIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSF 993 IRKGYDPRKDPESRIYQ DFRV P LR+ D N + K++W+DLC FQVFP KC TS Sbjct: 340 IRKGYDPRKDPESRIYQRTDFRVKPPLRSYCDSNADTELKYRWKDLCAFQVFPTKCSTSL 399 Query: 992 QLSELVDDYIQQEIRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLK 813 QL ELVDDYIQQEIRKP K+ TCS TGWFSS VL +R RV VRFLS++P GA+ LLK Sbjct: 400 QLFELVDDYIQQEIRKPVKRTTCSLQTGWFSSHVLAAIRRRVEVRFLSVFPGTGAQKLLK 459 Query: 812 SASDRFEKLRRAHALKRDLRP-EEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXX 636 +AS+ FEKL+R K L+P +EEN +NK R P + E Sbjct: 460 NASESFEKLKRICIYKDTLKPDQEENLQINKGDGD--NREKPEAVDDEE---------DR 508 Query: 635 XXXXXXXXXXXXXXXXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSF 456 ++ + G+D SLQ SY ++ S YLQELFGSF Sbjct: 509 IEVDDEEEDRIEVDAGEEESDADETLDMVGEDDEISLQSHSYLGLESNSRIYLQELFGSF 568 Query: 455 PSTEAGNSNMQYADSSDDEYQIFEQ 381 ST+ +Q SD EYQI+EQ Sbjct: 569 SSTDVDVDKIQDNGISDGEYQIYEQ 593 >ref|XP_006350004.1| PREDICTED: general transcription factor 3C polypeptide 5-like isoform X1 [Solanum tuberosum] gi|565366663|ref|XP_006350006.1| PREDICTED: general transcription factor 3C polypeptide 5-like isoform X3 [Solanum tuberosum] Length = 561 Score = 481 bits (1237), Expect = e-133 Identities = 277/609 (45%), Positives = 351/609 (57%), Gaps = 1/609 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FAVHYP YPSS RAVETLGGI+GI+KAR+S+SN LELHFRPEDPYSHPAFG Sbjct: 11 GRLPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSESNKLELHFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 EL+ + L+ S ++ PV+C E N Sbjct: 71 ELK---------------HSNNFLLKISKCKVRDVQSADSPVNCEQE------------N 103 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCR- 1668 S++ A + L+A IV+ VSE Y+FNGMVDYQHVL+VHAD ARRK+ + Sbjct: 104 SLA-------------APKERLAANIVSHVSEGYHFNGMVDYQHVLAVHADDARRKKRQW 150 Query: 1667 EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXX 1488 
+V+P +FEKGGLMDV +E+LMIL+PPLF+ KDMP+ Sbjct: 151 AEVEP---------------KFEKGGLMDVDQEDLMILLPPLFASKDMPDNIVLKSCTTL 195 Query: 1487 XXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDE 1308 KQE + WE ++ P LAID I+EIP V+WE Y+P+ SD W WQ V++LF+E Sbjct: 196 GSKRKQEG--RHNWEREMEPSLAIDFTIKEIPKPVDWEKYIPQSSDRWRWQKAVSELFEE 253 Query: 1307 RPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRI 1128 IW K SL ERLHD L F ++LKRLL AYYF GPFR FWI+KGYDPRKDPESRI Sbjct: 254 CKIWPKESLAERLHDGGLKFRDNMLKRLLCGVAYYFLNGPFRRFWIKKGYDPRKDPESRI 313 Query: 1127 YQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIR 948 YQ++DFRV LR+ + + +H+W D+C F+VFP KCQ + QL EL DDYIQQEIR Sbjct: 314 YQNIDFRVHHELRSYCESRLSSGLQHRWDDICAFRVFPCKCQLALQLCELKDDYIQQEIR 373 Query: 947 KPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHAL 768 KP K+ TC+ TGWFS +D LR + VRF+S+ P A+ LL S S RFEK +R H Sbjct: 374 KPSKEKTCNSVTGWFSFHTVDCLRRCIDVRFMSVCPHPRAESLLNSISTRFEKSKRTHTY 433 Query: 767 KRDLRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXX 588 + RPEE+ + VNKD+ + H+ E Sbjct: 434 LKVARPEEQEK-VNKDAENNEVDEQAENHDVDE-----------PDDLEDYEDEFDDDNV 481 Query: 587 XXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYADSS 408 S L +G+ SL + N+S +YLQELFG+FPS+ AG +Q D S Sbjct: 482 EEEMDAYVSLDLAVQEGDVSLHDDPHTNHDNVSRDYLQELFGNFPSSTAGTDEVQ-DDQS 540 Query: 407 DDEYQIFEQ 381 EYQI++Q Sbjct: 541 LGEYQIYDQ 549 >ref|XP_004297697.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Fragaria vesca subsp. 
vesca] Length = 553 Score = 476 bits (1225), Expect = e-131 Identities = 266/612 (43%), Positives = 351/612 (57%), Gaps = 4/612 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNS----LELHFRPEDPYSH 2037 G LP + F VHYPGYPSS SRA++TLGG + I KA SS SN+ LEL FR +DPYSH Sbjct: 11 GFLPRTQVFGVHYPGYPSSMSRAIDTLGGTQAIHKAHSSASNNNNNRLELRFRHDDPYSH 70 Query: 2036 PAFGELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSS 1857 PAFG+LRPC +S S++++L +PET Q Sbjct: 71 PAFGDLRPCNSFLL-----------------KISKSKSSESDLLAAKLTPETDQ------ 107 Query: 1856 GPVNSISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRK 1677 ++ A+IVARV +AY+F+GM DYQHV++VHADVAR++ Sbjct: 108 -----------------------VNVCADIVARVPKAYHFDGMADYQHVIAVHADVARKR 144 Query: 1676 RCREDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXX 1497 + R E+ ++GGLMD+ +E++MIL+P F+PKD+P+ Sbjct: 145 KRN-------------RVETEEPHSDRGGLMDIDQEDVMILLPQFFAPKDVPDNLVLRPS 191 Query: 1496 XXXXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKL 1317 QE VQ + EMD+ P LAID I EIP + NWE+Y+P+ SD W QM V+ L Sbjct: 192 GTLSVKKNQEEPVQHQLEMDMEPVLAIDFGITEIPKRTNWEEYIPQDSDQWESQMAVSSL 251 Query: 1316 FDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPE 1137 FDERP+W K S+ ERL ++ F H+L+RLL R AYYFS GPF FWI+KG+DPRKDP+ Sbjct: 252 FDERPVWPKDSVTERLLNKGFIFSDHMLRRLLSRVAYYFSRGPFLRFWIKKGFDPRKDPD 311 Query: 1136 SRIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQ 957 SRIYQ +D+RV P L + N+ ++ KHKW DLC F+VFP+KC T+ QL EL D+YIQ+ Sbjct: 312 SRIYQKIDYRVKPPLHGYCEANSANQLKHKWSDLCAFRVFPYKCHTTLQLFELDDNYIQE 371 Query: 956 EIRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRA 777 +IRK P Q TCS TGWFS +VL+ L+ RV VRFLS+YPK GA+ LLK+A++ F+K ++ Sbjct: 372 QIRKAPAQTTCSPETGWFSYNVLENLKYRVQVRFLSVYPKPGAERLLKAATESFKKSKKI 431 Query: 776 HALKRDLRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXX 597 +R E Q N + + P N E Sbjct: 432 CNKDNLVRDEMVQQQTNAELTGDVDAEEP---NNVE-----------------DDEDDIE 471 Query: 596 XXXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAGNSNMQYA 417 H 
+DG SLQP SY +NIS +LQELFGSFP EAG+ N+Q A Sbjct: 472 VDNGEEALDTYVGHDLAEDGEISLQPHSYLNMENISRTHLQELFGSFPPPEAGDDNIQDA 531 Query: 416 DSSDDEYQIFEQ 381 +SD+EYQI+EQ Sbjct: 532 YTSDEEYQIYEQ 543 >ref|XP_003537671.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Glycine max] Length = 547 Score = 476 bits (1224), Expect = e-131 Identities = 274/619 (44%), Positives = 356/619 (57%), Gaps = 13/619 (2%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 GVLPE +GF VHYP YPSS SRAV+TLGGI+ I KAR S+SN LEL FRPEDPYSHPAFG Sbjct: 11 GVLPEPQGFMVHYPAYPSSISRAVDTLGGIQAIQKARCSKSNKLELRFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELRP + + S TK P V + + SS N Sbjct: 71 ELRP--------------------TNSLLLKISKTKP--------PPPVHDAEASSSSTN 102 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 E+ L A+IVAR EAY F GM DYQHV+ VHADVARRK+ Sbjct: 103 G-------------EQDQEGSLCADIVARFPEAYFFYGMADYQHVIPVHADVARRKK--- 146 Query: 1664 DVQPDIVNKSGFRNESASGE--FEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN S E F+KGG MD+ E++MI+VPP+F+PKD+PE Sbjct: 147 ------------RNWSELEELHFDKGGFMDLDHEDVMIIVPPIFAPKDVPENLVLRPATM 194 Query: 1490 XXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFD 1311 K E +VQ +EMD+ P LAID +I+EIP KVNWE+Y+P+GSD W QMVV+++FD Sbjct: 195 SSSKKKPEEVVQPHFEMDMEPVLAIDFDIKEIPKKVNWEEYIPQGSDQWELQMVVSRMFD 254 Query: 1310 ERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESR 1131 ERPIW+K+SL E L D+ L F +L+RLL R +YYFS+GPF FWI+KGYDPRKDP SR Sbjct: 255 ERPIWSKNSLTELLLDKGLSFSHSMLRRLLSRISYYFSSGPFLRFWIKKGYDPRKDPNSR 314 Query: 1130 IYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEI 951 IYQ +D+RVP LR+ D ++ +K KH+W+D+C F+VFP+K QTS Q +LVDDYIQ EI Sbjct: 315 IYQRIDYRVPVPLRSYCDAHSANKSKHRWKDICAFRVFPYKFQTSLQFFDLVDDYIQSEI 374 Query: 950 RKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRR--- 780 KPP + TC+ TGWFS +++ +R R+ VR+LS++PK GA++LL++A+ +FEKL+R Sbjct: 375 
NKPPFRPTCTSGTGWFSQHMINCIRQRLMVRYLSVFPKPGAENLLRAATLKFEKLKRECY 434 Query: 779 AHALKRD--------LRPEEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXX 624 HA+K D L EE + N + A + E Sbjct: 435 RHAMKLDGEECQQANLGLEENEELDNGEDEEEAAEGNDSDEEWEE--------------- 479 Query: 623 XXXXXXXXXXXXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTE 444 H D L SY +N+S +LQ+LF +FP E Sbjct: 480 ---------------------EHDLAGDNEMPLPSDSYINFENLSRTHLQDLFVNFPPNE 518 Query: 443 AGNSNMQYADSSDDEYQIF 387 N+Q A+ S++EYQI+ Sbjct: 519 IDCDNVQ-ANGSEEEYQIY 536 >ref|XP_002529107.1| conserved hypothetical protein [Ricinus communis] gi|223531458|gb|EEF33291.1| conserved hypothetical protein [Ricinus communis] Length = 540 Score = 464 bits (1195), Expect = e-128 Identities = 274/610 (44%), Positives = 340/610 (55%), Gaps = 2/610 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G++P E FAVHYPGYPSS SRA++TLGG + I+KAR+SQSN LEL+FRPEDPYSHPAFG Sbjct: 11 GIIPSNEAFAVHYPGYPSSISRAIQTLGGTDAILKARTSQSNKLELYFRPEDPYSHPAFG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELR C NL + Sbjct: 71 ELRAC-------------------------------NNL-------------------LL 80 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 IS K Q Q E LSA++VAR+ EAY+F+GMVDYQHV++VHAD A +KR R Sbjct: 81 KISKKKKKTNSQCQTE-----LSADVVARIPEAYHFDGMVDYQHVVAVHADAAAQKRKRN 135 Query: 1664 DVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXXX 1485 Q + F+K GLMD+ +E++MILVPP F+ KDMP Sbjct: 136 WTQME------------EPHFDKAGLMDLDQEDVMILVPPHFTSKDMPVNLALKATSIPS 183 Query: 1484 XXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDER 1305 QE V+ E+ + +IP ++NW+ ++ +G++ W WQ+ V++LFDER Sbjct: 184 SKKIQEEAVENHIELHLT--------FVQIPKEINWKLFIAQGTELWGWQIAVSELFDER 235 Query: 1304 PIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRIY 1125 PIW K +L RL ++L F L+RLL AYYFS GPF FWIRKGYDPRKDP+SRIY Sbjct: 236 PIWPKDALTGRLLVKNLKFTHQTLRRLLLAVAYYFSGGPFLRFWIRKGYDPRKDPDSRIY 295 Query: 1124 
QSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRK 945 Q +DFRVPP LR+ D N KHKW DLC FQVFP+K QTS QL EL DDYIQQEI+K Sbjct: 296 QRIDFRVPPPLRSFSDANAAKGLKHKWEDLCKFQVFPYKFQTSLQLCELDDDYIQQEIKK 355 Query: 944 PPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHALK 765 PPKQ TC+ TGWF V D R RV VRFLS+YPK GA LLK+AS+ FEK +RA K Sbjct: 356 PPKQTTCTYGTGWFLQQVHDSFRHRVMVRFLSVYPKSGAAKLLKAASEDFEKSKRACIYK 415 Query: 764 RDLRPEE-ENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXX 588 L+ ++ E Q +NK S + I E Sbjct: 416 EVLKSDQVERQKINKGILSDKANENQINVAEGE------------------ADDIEADDP 457 Query: 587 XXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAG-NSNMQYADS 411 ++ L G+D SLQ SY +N S +YLQELF SFPS + +Q AD Sbjct: 458 EEELDADEALDLAGEDDETSLQSHSYL--ENNSKSYLQELFDSFPSADPTIGDRIQDADI 515 Query: 410 SDDEYQIFEQ 381 SD+EYQIFEQ Sbjct: 516 SDEEYQIFEQ 525 >gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis] Length = 553 Score = 461 bits (1185), Expect = e-127 Identities = 265/505 (52%), Positives = 318/505 (62%), Gaps = 12/505 (2%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G +P KE FAV+YPGYPSS SRAVETLGG+E I KARS QSN LELHFRPEDPYSHPAFG Sbjct: 33 GFVPSKEAFAVNYPGYPSSISRAVETLGGLEAIHKARSLQSNRLELHFRPEDPYSHPAFG 92 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQ------- 1866 +LRPC S+ QDA VS P +QNG Sbjct: 93 DLRPCNHLLLKLSRIKSSNGQDAQVS------------------GPSALQNGNNLDYTYT 134 Query: 1865 -QSSGPVNSISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADV 1689 ++SG +S V +VQI E+ + + A+IVARV EAY+F+GMVDYQHV +VHADV Sbjct: 135 TRASGSTSSAKQV----DVQIPEDDQT-NFCADIVARVLEAYHFDGMVDYQHVTAVHADV 189 Query: 1688 ARRKRCREDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXX 1509 ARRK+ K E S EK GLMDV +++M+LVPPLF+PKD PE Sbjct: 190 ARRKK----------RKWLELEEPLS---EKNGLMDVDEDDVMMLVPPLFAPKDFPENLV 236 Query: 1508 XXXXXXXXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKV-NWEDYVPRGSDSWNWQM 1332 +EAI P L EIP ++ NWE Y+P+GS W QM Sbjct: 237 
LRPSVILSSKKNEEAINH--------PDL-------EIPKRIINWEQYIPKGSYQWELQM 281 Query: 1331 VVAKLFDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDP 1152 V+KLFDERPIW KHS+ ERL D+ + H+L+RLL R AYYFS+GPF FWI+KGYDP Sbjct: 282 AVSKLFDERPIWIKHSVNERLVDKGYNVVDHMLRRLLSRVAYYFSSGPFLRFWIKKGYDP 341 Query: 1151 RKDPESRIYQSVDFRVPPSLRNIEDVNTTD---KFKHKWRDLCTFQVFPWKCQTSFQLSE 981 RKDP+SRIYQ +DFRV PSLR+ D N T+ K K +W D+CTFQVFP KCQTS QL E Sbjct: 342 RKDPDSRIYQRIDFRVHPSLRSYCDANVTNQGKKEKQRWGDICTFQVFPVKCQTSLQLFE 401 Query: 980 LVDDYIQQEIRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASD 801 L DDYIQQEIRKPP Q TC+ TGWFSS V D LR R+++RFLS YPK GA+ LLK A++ Sbjct: 402 LADDYIQQEIRKPPSQKTCTPGTGWFSSTVHDSLRHRISIRFLSTYPKPGAEHLLKEATE 461 Query: 800 RFEKLRRAHALKRDLRPEEENQYVN 726 FEK +R + + EEE Q V+ Sbjct: 462 NFEKSKRRLSKDCVMLHEEERQEVD 486 >ref|XP_007203854.1| hypothetical protein PRUPE_ppa004640mg [Prunus persica] gi|462399385|gb|EMJ05053.1| hypothetical protein PRUPE_ppa004640mg [Prunus persica] Length = 498 Score = 456 bits (1172), Expect = e-125 Identities = 240/490 (48%), Positives = 310/490 (63%), Gaps = 15/490 (3%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP E FA+HYPGYPSS SRA+ETLGG +GI KA SSQSN LELHFR ++PYSHPAFG Sbjct: 12 GFLPSSEVFAIHYPGYPSSMSRAIETLGGTQGIRKAHSSQSNRLELHFRHQEPYSHPAFG 71 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 +LRPC + + S TK+N E + Sbjct: 72 DLRPC--------------------NNLLLKISKTKSNAGQTQPQSELL----------- 100 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 +K +EVQI E + + +IVARV EAY+F+GMVDYQHV+ VHADVAR+K+ Sbjct: 101 ----ASKQDEVQIPE---NDRVHFDIVARVPEAYHFDGMVDYQHVVPVHADVARKKK--- 150 Query: 1664 DVQPDIVNKSGFRN--ESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN E +KGGLMD+ +E+ MIL+P LF+PKD+P+ Sbjct: 151 ------------RNWIEIKDPHSDKGGLMDIDQEDAMILLPQLFAPKDVPDNLVLKPSVT 198 Query: 1490 XXXXXKQEAIVQQRWEMDIAPCLAIDCNIEEI-------------PSKVNWEDYVPRGSD 1350 QE VQ +WEMD+ P LAID I +I P + NWE+Y+P+GSD 
Sbjct: 199 LSAKKNQEEPVQHQWEMDMEPVLAIDFGISDILSFVIFFLDLIMIPKRTNWEEYIPQGSD 258 Query: 1349 SWNWQMVVAKLFDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWI 1170 W QM V+ LFDERP+W K SL+ERL D+ +F HLL+RLL R AYYFS GPF FWI Sbjct: 259 QWESQMAVSHLFDERPVWPKDSLLERLVDKGFNFSDHLLRRLLSRVAYYFSRGPFLRFWI 318 Query: 1169 RKGYDPRKDPESRIYQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQ 990 +KGYDPRKDPESRI+Q +DFRV P L++ D N+ ++ KH+W D+C F+VFP+KC T+ Q Sbjct: 319 KKGYDPRKDPESRIFQKIDFRVRPPLQSYCDANSANQPKHRWEDICAFRVFPYKCHTTLQ 378 Query: 989 LSELVDDYIQQEIRKPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKS 810 L EL DDYIQ++IRKPP Q TCS TGWFS ++L+ L+ V VRFLS++P+ GA+ LLK+ Sbjct: 379 LFELGDDYIQEQIRKPPAQTTCSSETGWFSYNMLENLKDCVKVRFLSVFPEPGAEPLLKA 438 Query: 809 ASDRFEKLRR 780 A++ F+K ++ Sbjct: 439 ATESFKKSKK 448 >ref|XP_002323927.1| transcription factor-related family protein [Populus trichocarpa] gi|222866929|gb|EEF04060.1| transcription factor-related family protein [Populus trichocarpa] Length = 527 Score = 452 bits (1162), Expect = e-124 Identities = 271/612 (44%), Positives = 343/612 (56%), Gaps = 4/612 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G++P KEGFAVHYPGYPSS SRA++TLGG E I+KARSSQSN LEL+FRPEDPYSHP G Sbjct: 11 GLIPSKEGFAVHYPGYPSSISRAIQTLGGTESILKARSSQSNKLELYFRPEDPYSHPVSG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 ELR C SM S K +++S P+N Sbjct: 71 ELRSC---------------------HSMLLKISRK----------------KKNSSPIN 93 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 + +EE+ H A+IVAR+ EAY F GM DYQHV+ VHAD+ARRKR Sbjct: 94 -----------EAKEESEEFH--ADIVARIPEAYYFEGMADYQHVVPVHADIARRKRKNP 140 Query: 1664 DVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXXX 1485 +K GL+D+ E++M+L PPLFS KD+PE Sbjct: 141 ---------------------KKPGLIDMGPEDVMMLSPPLFSLKDVPENIVLRPPSTSS 179 Query: 1484 XXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDER 1305 KQ+ + E P I +IP K+NW++++ G+ W WQ+ V++LF+ER Sbjct: 180 
SKKKQD----EPPETHSKPLAFI-----QIPKKINWKEFITEGTPMWEWQIAVSELFEER 230 Query: 1304 PIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRIY 1125 PIW K+SLIERL D++L F LKRLL YYFS GPF+ FWIRKGYDPRKDP+SRIY Sbjct: 231 PIWPKYSLIERLLDKNLKFTYQTLKRLLLTVGYYFSGGPFQKFWIRKGYDPRKDPDSRIY 290 Query: 1124 QSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRK 945 QSV FRVPP L++ D N KH+W DLC F+ FP++ Q SFQL EL DDYIQQEI+K Sbjct: 291 QSVAFRVPPELKSYCDDNAAKGLKHRWEDLCKFRFFPYRNQYSFQLYELDDDYIQQEIQK 350 Query: 944 PPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHALK 765 PPKQ +C+ TGWFS V D LRL V VRFLSI+P+ GA+ LK+AS++F K +RA K Sbjct: 351 PPKQTSCTYETGWFSQHVHDSLRLCVKVRFLSIFPETGAEKFLKAASEKFMKSKRACIFK 410 Query: 764 RDLRP-EEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXX 588 +P +EE+Q +N+D + N TE V Sbjct: 411 DAPKPVQEEHQQINEDHETL--------KNDTEAVDEAIENQIDTDDVEV---------- 452 Query: 587 XXXXXXXDSPHLDGDDG--NFSLQPCSYPIGKNISTNYLQELFGSFPSTEA-GNSNMQYA 417 LD DDG F + +N ST+YLQ+L GSFPS + G+ Sbjct: 453 ---------DELDSDDGEEEFDVYGMDSADMENTSTSYLQQLLGSFPSMDTNGDKKQDGG 503 Query: 416 DSSDDEYQIFEQ 381 +SSD EYQI+EQ Sbjct: 504 ESSDGEYQIYEQ 515 >ref|XP_003622988.1| General transcription factor 3C polypeptide [Medicago truncatula] gi|355498003|gb|AES79206.1| General transcription factor 3C polypeptide [Medicago truncatula] Length = 612 Score = 450 bits (1157), Expect = e-123 Identities = 277/666 (41%), Positives = 357/666 (53%), Gaps = 58/666 (8%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 GVLPE +GF VHYPGYPS+TSRAV+TLGG +GI+KARSSQ+N LEL FRPEDPY HPAFG Sbjct: 16 GVLPEPQGFLVHYPGYPSTTSRAVDTLGGSQGILKARSSQANKLELRFRPEDPYCHPAFG 75 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E RP DD A S SM C E +G Q+ + Sbjct: 76 ERRPTNALLLKISKRKLPDDDGATTSNSM--------------CGME---HGMQADNVES 118 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 A +K + EEA +L A+IV RV EAY F GM DYQ+V+ VHADVA+RK+ Sbjct: 119 
EHGAADK-----VDEEA---NLCADIVGRVPEAYFFEGMADYQYVVPVHADVAKRKK--- 167 Query: 1664 DVQPDIVNKSGFRNESASGE--FEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXX 1491 RN S E KGG +DV E++MI+VPP+F+PKDMPE Sbjct: 168 ------------RNWSEPEETHLAKGGRIDVDHEDIMIIVPPIFAPKDMPEDLLLRPPTV 215 Query: 1490 XXXXXKQEAIVQQRWEMDIAPCLAIDC---------NIE---------------EIPSKV 1383 K+E IV +E+D+ P LA+D NI +IP KV Sbjct: 216 SSSKKKEEEIVHPHFEIDMEPVLALDFFQIKDILKENISKHIALLWFSFDLAVLQIPKKV 275 Query: 1382 NWEDYVPRGSDSWNWQMVVAKLFDERPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYY 1203 NWE+Y+P+GS+ W QM V+++FDE+PIW+K+SL ERL D+ L F + +RLL R AYY Sbjct: 276 NWEEYIPQGSEQWESQMAVSRMFDEKPIWSKNSLTERLLDKGLSFSHGMFRRLLSRIAYY 335 Query: 1202 FSTGPFRLFWIRKGYDPRKDPESR------------IYQSVDFRVPPSLRNIEDVNTTDK 1059 FS+GPF+ FWI+KGYDPRKDP SR +YQ +D+RVP LR+ D + DK Sbjct: 336 FSSGPFQRFWIKKGYDPRKDPGSRMIGTVPLVRKLLLYQRIDYRVPVPLRSFCDTYSADK 395 Query: 1058 FKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRKPPKQMTCSCSTGWFSSDVLDIL 879 KHKW D+C F+ FP+K QTS Q EL+DDYIQ EI KPP Q TC+ +GWFS + ++ L Sbjct: 396 LKHKWGDICAFRAFPYKFQTSLQFVELIDDYIQSEINKPPMQDTCTFESGWFSLNKINCL 455 Query: 878 RLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRR---AHALK----------RDLRPEEEN 738 R R+ VR+LSI+PK GA+ LL+ A+ +FEKL+R A+K L EE Sbjct: 456 RQRLMVRYLSIFPKPGAESLLRVAASKFEKLKRECNREAVKLCVEERQQANTGLEESEEP 515 Query: 737 QYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSP 558 + V D A + + + E+ Sbjct: 516 ENVEDDDGEAAEANNSDEESEEEL------------------------------------ 539 Query: 557 HLDGDDGNFSLQPCSYPIGKN-------ISTNYLQELFGSFPSTEAGNSNMQYADSSDDE 399 L GD P Y + IS +LQELFGSFPS E Q + S++E Sbjct: 540 DLTGDTEMPLPSPSRYRTRHSTCLSYPNISMTHLQELFGSFPSDEIDGDKAQ-ENGSEEE 598 Query: 398 YQIFEQ 381 Y I+E+ Sbjct: 599 YHIYEE 604 >ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] gi|332645018|gb|AEE78539.1| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] Length = 574 Score = 447 bits (1151), Expect = e-123 Identities = 252/611 (41%), Positives = 343/611 (56%), Gaps = 3/611 (0%) Frame = -1 Query: 2204 
GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP KE F VH+PGYPSS SRA+ETLGGI+GI +AR S SN LEL FRPEDPY+HPA G Sbjct: 11 GTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRPEDPYAHPALG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E RPC S ++ +Q Sbjct: 71 EQRPC---------------------------------------SGFLLRISKQDIKKPE 91 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCR- 1668 S S ++ S +V ++E S L A+IVAR+SE+++F+GM DYQHV+ +HAD+A++K+ + Sbjct: 92 SQSVLDTSRDVCLEE--ASPVLCADIVARLSESFHFDGMADYQHVIPIHADIAQQKKRKW 149 Query: 1667 EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXX 1488 DV P +G+ + GL D E++M+L+P F+PKD+P+ Sbjct: 150 MDVDP------------LTGKSDLMGLAD---EDVMMLLPQFFAPKDIPDNVALKPPATS 194 Query: 1487 XXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDE 1308 K +A Q +E+D+ P AID +++EIP K+ WED+V R S+ W WQ+ V+ LF+E Sbjct: 195 GPKKKDDAATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEE 254 Query: 1307 RPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRI 1128 RPIWT+ S+++RL D+ L H+L R L R AYYFS+GPF FWI++GYDPR DPESR+ Sbjct: 255 RPIWTRDSVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRV 314 Query: 1127 YQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIR 948 YQ ++FRVPP LR D N T+ K W D+C F++FP+KCQT QL EL D+YIQ+EIR Sbjct: 315 YQRMEFRVPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIR 374 Query: 947 KPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHAL 768 KPPKQ TCS +GWFS +LD LRLRVAVRF+S++P+ G +D+ KS + FE+ + Sbjct: 375 KPPKQTTCSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSEKVQIQ 434 Query: 767 KRDLRPE-EENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXX 591 K L+P +++ K S T S + N V Sbjct: 435 KETLKPSLVKHREATKGSEDMETFKS-VNENVDANVNEDGEDENLDDEDEDEEEEEEL-- 491 Query: 590 XXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAG-NSNMQYAD 414 + D SL Y +N S YLQ LF SFPS+E + D Sbjct: 492 -----------DMAAGDNEISLDSHGYLDTENSSRTYLQGLFDSFPSSEPNLYGDFAVDD 540 Query: 413 SSDDEYQIFEQ 381 SD E+QI+E+ 
Sbjct: 541 GSDGEFQIYEE 551 >dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana] Length = 574 Score = 447 bits (1149), Expect = e-122 Identities = 251/611 (41%), Positives = 343/611 (56%), Gaps = 3/611 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP KE F VH+PGYPSS SRA+ETLGGI+GI +AR S SN LEL FRPEDPY+HPA G Sbjct: 11 GTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRPEDPYAHPALG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E RPC S ++ +Q Sbjct: 71 EQRPC---------------------------------------SGFLLRISKQDIKKPE 91 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCR- 1668 S S ++ S +V ++E S L A+IVAR+SE+++F+GM DYQHV+ +HAD+A++K+ + Sbjct: 92 SQSVLDTSRDVCLEE--ASPVLCADIVARLSESFHFDGMADYQHVIPIHADIAQQKKRKW 149 Query: 1667 EDVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXX 1488 DV P +G+ + GL D E++M+L+P F+PKD+P+ Sbjct: 150 MDVDP------------LTGKSDLMGLAD---EDVMMLLPQFFAPKDIPDNVALKPPATS 194 Query: 1487 XXXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDE 1308 K + Q +E+D+ P AID +++EIP K+ WED+V R S+ W WQ+ V+ LF+E Sbjct: 195 GPKKKDDVATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEE 254 Query: 1307 RPIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRI 1128 RPIWT+ S+++RL D+ L H+L R L R AYYFS+GPF FWI++GYDPR DPESR+ Sbjct: 255 RPIWTRDSVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRV 314 Query: 1127 YQSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIR 948 YQ ++FRVPP LR D N T+ K W D+C F++FP+KCQT QL EL D+YIQ+EIR Sbjct: 315 YQRMEFRVPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIR 374 Query: 947 KPPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHAL 768 KPPKQ TCS +GWFS +LD LRLRVAVRF+S++P+ G +D+ KS + FE+ ++ Sbjct: 375 KPPKQTTCSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSKKVQIQ 434 Query: 767 KRDLRPE-EENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXX 591 K L+P +++ K S T S + N V Sbjct: 435 
KETLKPSLVKHREATKGSEDIETFKS-VNENVDANVNEDGEDENLDDEDEDEEEEEEL-- 491 Query: 590 XXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAG-NSNMQYAD 414 + D SL Y +N S YLQ LF SFPS+E + D Sbjct: 492 -----------DMAAGDNEISLDSHGYLDTENSSRTYLQGLFDSFPSSEPNLYGDFAVDD 540 Query: 413 SSDDEYQIFEQ 381 SD E+QI+E+ Sbjct: 541 GSDGEFQIYEE 551 >ref|NP_197833.2| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] gi|332005929|gb|AED93312.1| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] Length = 554 Score = 446 bits (1147), Expect = e-122 Identities = 260/618 (42%), Positives = 340/618 (55%), Gaps = 10/618 (1%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP KE F VHYPGYPSS SRAVETLGGI+GI AR S SN LELHFRPEDP +HPA+G Sbjct: 11 GNLPSKEAFVVHYPGYPSSISRAVETLGGIQGITTARESTSNKLELHFRPEDPSAHPAYG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E R C D + ES S++ +C PE Sbjct: 71 ERRHCNGFLLKISKEDVKKDS---LPESQPVISTSD------ACLPE------------- 108 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 V L A+IVARVSE+Y F+GMVDYQHV+ +HAD+A++K+ Sbjct: 109 -----------------VRPALCADIVARVSESYCFDGMVDYQHVIPIHADIAQQKK--- 148 Query: 1664 DVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXXX 1485 + +S +G K LMD+ E++M+L+P FSPKD P+ Sbjct: 149 --------RKWMEVKSLAG---KNDLMDMADEDVMMLLPQFFSPKDRPDNLVLRLPVTSS 197 Query: 1484 XXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDER 1305 K E + Q +E+DI P AID +++EIP + WEDY+ S+ W WQ+ V+ LF+ER Sbjct: 198 PKKKDEELTQNLYEIDIGPVFAIDFSVKEIPKILKWEDYIVPTSNQWKWQVAVSALFEER 257 Query: 1304 PIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRIY 1125 P+WT+ S+++RL D+ L H+L R L R AYYFS GPF FWI++GYDPRKDPESR++ Sbjct: 258 PVWTRDSIVQRLLDKGLTCTHHMLNRFLLRAAYYFSGGPFLRFWIKRGYDPRKDPESRVF 317 Query: 1124 QSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRK 945 Q ++FRVPP L+ D N T+K K W D+C F+VFP+KCQT QL EL D+YIQQEIRK Sbjct: 318 
QRMEFRVPPELKGYCDSNATNKSKPSWDDICAFKVFPFKCQTFLQLFELDDEYIQQEIRK 377 Query: 944 PPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHALK 765 PPKQ TC+ TGWFS +LD LRLRVAVRF+S++P+ G +D+ KS + FE+ + K Sbjct: 378 PPKQTTCNYKTGWFSEALLDNLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKTRIQK 437 Query: 764 RDLRPEEEN-QYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXXXXX 588 L+P + N Q KD C K+ E Sbjct: 438 DALQPSQRNHQETTKDMKKC-------KNTNKE-------------------------KD 465 Query: 587 XXXXXXXDSPHLD------GDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEA---GN 435 DS LD +D + S+ Y +N S YLQ LF FPS+ + G+ Sbjct: 466 DDVNADEDSEDLDDEYEEAANDDDISISSHGYGDMENNSRTYLQGLFNRFPSSASALYGS 525 Query: 434 SNMQYADSSDDEYQIFEQ 381 +N + SD EY I+EQ Sbjct: 526 ANDD--NDSDGEYPIYEQ 541 >ref|XP_006290824.1| hypothetical protein CARUB_v10016933mg [Capsella rubella] gi|482559531|gb|EOA23722.1| hypothetical protein CARUB_v10016933mg [Capsella rubella] Length = 571 Score = 444 bits (1141), Expect = e-121 Identities = 247/613 (40%), Positives = 336/613 (54%), Gaps = 5/613 (0%) Frame = -1 Query: 2204 GVLPEKEGFAVHYPGYPSSTSRAVETLGGIEGIIKARSSQSNSLELHFRPEDPYSHPAFG 2025 G LP KE F +H+PGYPSS S+A+ETLGGI+GI +AR S SN LEL FRPEDPY+HP G Sbjct: 11 GTLPSKEAFVLHFPGYPSSISKAIETLGGIQGITQARESISNKLELRFRPEDPYAHPVLG 70 Query: 2024 ELRPCXXXXXXXXXXXXSDDQDALVSESMSTCSSTKTNLEPVSCSPETVQNGQQSSGPVN 1845 E RPC QD SES +++ CS E Sbjct: 71 EQRPCNGFLLRI------SKQDIKKSESQPVLATSDV------CSEEA------------ 106 Query: 1844 SISAVNKSNEVQIQEEAVSKHLSAEIVARVSEAYNFNGMVDYQHVLSVHADVARRKRCRE 1665 S L A+IVA VSE+++F+GM DYQHV+ +HAD+A++K+ Sbjct: 107 ------------------SPALCADIVAHVSESFHFDGMADYQHVIPIHADIAQQKK--- 145 Query: 1664 DVQPDIVNKSGFRNESASGEFEKGGLMDVYREELMILVPPLFSPKDMPEXXXXXXXXXXX 1485 + +S +G + GL D E++M+L+P F+PKD+P+ Sbjct: 146 --------RKWMEMDSLTGNTDLMGLAD---EDVMMLLPQFFAPKDIPDNVALKPPATTG 194 Query: 1484 XXXKQEAIVQQRWEMDIAPCLAIDCNIEEIPSKVNWEDYVPRGSDSWNWQMVVAKLFDER 1305 K +A Q +E+D+ P AI+ +++EIP K+NWE++V S W WQ+ V+ LF+ER Sbjct: 195 PKKKDDAEAQNFYEIDVGPVFAIEFSVKEIPKKLNWEEFVSPSSKHWQWQVSVSALFEER 254 Query: 
1304 PIWTKHSLIERLHDESLHFGVHLLKRLLFRTAYYFSTGPFRLFWIRKGYDPRKDPESRIY 1125 PIWT+ S+++RL D+ L H+L R L R AYYFS+GPF FWI++GYDPR DPESR+Y Sbjct: 255 PIWTRDSVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRDDPESRVY 314 Query: 1124 QSVDFRVPPSLRNIEDVNTTDKFKHKWRDLCTFQVFPWKCQTSFQLSELVDDYIQQEIRK 945 Q ++FRVPP LR+ D N T+ K W D+C F++FP+KCQT QL EL D+YIQ+EIRK Sbjct: 315 QRMEFRVPPELRSYCDANATNNSKPSWNDICAFKIFPFKCQTFLQLFELDDEYIQREIRK 374 Query: 944 PPKQMTCSCSTGWFSSDVLDILRLRVAVRFLSIYPKEGAKDLLKSASDRFEKLRRAHALK 765 PPKQ TCS TGWFS +LD LRLRVAVRF+S++P+ G +D+ KS + FE+ + LK Sbjct: 375 PPKQTTCSHKTGWFSEAMLDTLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKIQILK 434 Query: 764 RDLRP----EEENQYVNKDSSSCATRVSPIKHNGTEMVXXXXXXXXXXXXXXXXXXXXXX 597 L+P E+ +D C T + N E Sbjct: 435 ETLKPSLVKHRESTKGAEDMEKCKTVNEDVDANVNE-------------------DGSDE 475 Query: 596 XXXXXXXXXXDSPHLDGDDGNFSLQPCSYPIGKNISTNYLQELFGSFPSTEAG-NSNMQY 420 + + D S Y +N S YLQ LF SFP++E G + Sbjct: 476 NLDDEEEEEEEELDMAAGDNEKSFDSHGYLDNENSSRTYLQGLFDSFPTSEPGLYGDHAV 535 Query: 419 ADSSDDEYQIFEQ 381 D SD E+QI+E+ Sbjct: 536 DDGSDGEFQIYEE 548