BLASTX nr result
ID: Sinomenium21_contig00024485
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00024485
         (838 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002275875.1| PREDICTED: transcription factor tau subunit ...  284   3e-74
emb|CBI24753.3| unnamed protein product [Vitis vinifera]             283   4e-74
gb|EYU34318.1| hypothetical protein MIMGU_mgv1a003054mg [Mimulus...  272   1e-70
ref|XP_007039140.1| Transcription factor IIIC, subunit 5, putati...  271   2e-70
ref|XP_007039139.1| Transcription factor IIIC, subunit 5, putati...  271   2e-70
ref|XP_003537671.1| PREDICTED: general transcription factor 3C p...  256   7e-66
ref|XP_007039138.1| General transcription factor 3C polypeptide ...  256   1e-65
ref|XP_006464858.1| PREDICTED: general transcription factor 3C p...  252   1e-64
ref|XP_007157964.1| hypothetical protein PHAVU_002G1131001g, par...  249   9e-64
ref|XP_003622988.1| General transcription factor 3C polypeptide ...  248   2e-63
ref|XP_004251822.1| PREDICTED: general transcription factor 3C p...  246   8e-63
ref|XP_006427924.1| hypothetical protein CICLE_v100272412mg, par...  243   5e-62
ref|XP_007203854.1| hypothetical protein PRUPE_ppa004640mg [Prun...  241   2e-61
ref|XP_006350004.1| PREDICTED: general transcription factor 3C p...  239   7e-61
gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis]    234   4e-59
ref|XP_004297697.1| PREDICTED: general transcription factor 3C p...  221   3e-55
ref|XP_002529107.1| conserved hypothetical protein [Ricinus comm...  213   5e-53
ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidops...
213 5e-53 gb|EPS67527.1| hypothetical protein M569_07249 [Genlisea aurea] 213 9e-53 dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana] 212 2e-52 >ref|XP_002275875.1| PREDICTED: transcription factor tau subunit sfc1-like [Vitis vinifera] Length = 568 Score = 284 bits (727), Expect = 3e-74 Identities = 141/263 (53%), Positives = 173/263 (65%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+IE+G++SG +P E F+VHYP YPSS +RA+ETLGG + I KARSS SN LEL FRP Sbjct: 1 MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG +P +NLLL+I+KK++ DGQ Sbjct: 61 EDPYSHPAFGELQPCNNLLLRISKKKSTDGQS---------------------------- 92 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 ++V +G AQI V +CADI+ARV++ Y+FNGMVDYQHVL VHAD Sbjct: 93 ----------ESVATGEEVEAQISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHAD 142 Query: 588 VARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXXH 767 VAR+KKRNWAE+EPH EKG +D+DQEDLMIL+PPLFS KD+PE LV Sbjct: 143 VARRKKRNWAEVEPHLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQ 202 Query: 768 EAIVQQRWEMDIAPCLGLDFDIK 836 E +VQQRWEM I PCL +DF+IK Sbjct: 203 EGVVQQRWEMGIEPCLAIDFEIK 225 >emb|CBI24753.3| unnamed protein product [Vitis vinifera] Length = 597 Score = 283 bits (725), Expect = 4e-74 Identities = 141/263 (53%), Positives = 173/263 (65%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+IE+G++SG +P E F+VHYP YPSS +RA+ETLGG + I KARSS SN LEL FRP Sbjct: 1 MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG +P +NLLL+I+KK++ DGQ A +S + K Sbjct: 61 EDPYSHPAFGELQPCNNLLLRISKKKSTDGQSAEVSSKVSK------------------- 101 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 +QI V +CADI+ARV++ Y+FNGMVDYQHVL VHAD Sbjct: 102 
--------------------SQISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHAD 141 Query: 588 VARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXXH 767 VAR+KKRNWAE+EPH EKG +D+DQEDLMIL+PPLFS KD+PE LV Sbjct: 142 VARRKKRNWAEVEPHLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQ 201 Query: 768 EAIVQQRWEMDIAPCLGLDFDIK 836 E +VQQRWEM I PCL +DF+IK Sbjct: 202 EGVVQQRWEMGIEPCLAIDFEIK 224 >gb|EYU34318.1| hypothetical protein MIMGU_mgv1a003054mg [Mimulus guttatus] Length = 611 Score = 272 bits (696), Expect = 1e-70 Identities = 143/264 (54%), Positives = 179/264 (67%), Gaps = 1/264 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEK-EGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFR 224 MGIIEDG+VSGV+P E FAV YPGYP+SI RA+ETLGG++GI KAR+ SN LEL FR Sbjct: 1 MGIIEDGSVSGVLPSSSEAFAVLYPGYPTSIGRAIETLGGDQGIAKARTDKSNRLELHFR 60 Query: 225 PEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSET 404 PEDPYSHP FG + +N LLKI+K + D + +L + +S + L + E+ Sbjct: 61 PEDPYSHPLFGKLKSCNNFLLKISKTKVKDTHDIKELNSLSEHASEDSLRLSNNSLIPES 120 Query: 405 VENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHA 584 E+ + + S SD+AQIK + ADIVARV++ Y+F GMVDYQHVLA+HA Sbjct: 121 TESTAHIA-QPECDFSDPSDKAQIKNGAQEQLSADIVARVSEAYHFKGMVDYQHVLAIHA 179 Query: 585 DVARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 D R+KKRNWAE+EP FEKGG +DIDQEDLMILVPPLFSLKDIP+ +V Sbjct: 180 DRTRRKKRNWAEVEPQFEKGGLVDIDQEDLMILVPPLFSLKDIPDTIVLKSSGEMSLKKK 239 Query: 765 HEAIVQQRWEMDIAPCLGLDFDIK 836 + VQ R EM+I PCL +DF+IK Sbjct: 240 QKGDVQPREEMEIEPCLAIDFNIK 263 >ref|XP_007039140.1| Transcription factor IIIC, subunit 5, putative isoform 3 [Theobroma cacao] gi|508776385|gb|EOY23641.1| Transcription factor IIIC, subunit 5, putative isoform 3 [Theobroma cacao] Length = 579 Score = 271 bits (693), Expect = 2e-70 Identities = 140/264 (53%), Positives = 179/264 (67%), Gaps = 1/264 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I++G VSG +P E FAVH+PGYP + +RA+ETLGG EGIL+ARSS SN LEL FRP Sbjct: 1 
MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYS PAFG RP +NLLLKI+KK++ DGQ A S + E T + Sbjct: 61 EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKV----------RECSTSGATDS 110 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 EN ++ S ++ QI E ++CADIV+RV++ Y+F+GM DYQHVLAVHAD Sbjct: 111 ENPKQPSQAE----------VQISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHAD 160 Query: 588 VARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 ARK+KRNWAE EP FEKGGFMD+DQED+M+++PPLFS KD+PEN+V Sbjct: 161 AARKRKRNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKK 220 Query: 765 HEAIVQQRWEMDIAPCLGLDFDIK 836 E +VQ E+D+ P L +DF+IK Sbjct: 221 QEGVVQNTAEVDLEPGLAIDFNIK 244 >ref|XP_007039139.1| Transcription factor IIIC, subunit 5, putative isoform 2 [Theobroma cacao] gi|508776384|gb|EOY23640.1| Transcription factor IIIC, subunit 5, putative isoform 2 [Theobroma cacao] Length = 582 Score = 271 bits (693), Expect = 2e-70 Identities = 140/264 (53%), Positives = 179/264 (67%), Gaps = 1/264 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I++G VSG +P E FAVH+PGYP + +RA+ETLGG EGIL+ARSS SN LEL FRP Sbjct: 1 MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYS PAFG RP +NLLLKI+KK++ DGQ A S + E T + Sbjct: 61 EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKV----------RECSTSGATDS 110 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 EN ++ S ++ QI E ++CADIV+RV++ Y+F+GM DYQHVLAVHAD Sbjct: 111 ENPKQPSQAE----------VQISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHAD 160 Query: 588 VARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 ARK+KRNWAE EP FEKGGFMD+DQED+M+++PPLFS KD+PEN+V Sbjct: 161 AARKRKRNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKK 220 Query: 765 HEAIVQQRWEMDIAPCLGLDFDIK 836 E +VQ E+D+ P L +DF+IK Sbjct: 221 QEGVVQNTAEVDLEPGLAIDFNIK 244 
>ref|XP_003537671.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Glycine max] Length = 547 Score = 256 bits (654), Expect = 7e-66 Identities = 137/264 (51%), Positives = 170/264 (64%), Gaps = 1/264 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I+DGT+SGV+PE +GF VHYP YPSSISRAV+TLGG + I KAR S SN LELRFRP Sbjct: 1 MGVIKDGTISGVLPEPQGFMVHYPAYPSSISRAVDTLGGIQAIQKARCSKSNKLELRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG RP+++LLLKI+K T V Sbjct: 61 EDPYSHPAFGELRPTNSLLLKISK-----------------------------TKPPPPV 91 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 + + SS S T+G D+ +CADIVAR + Y+F GM DYQHV+ VHAD Sbjct: 92 HDAEASSSS----TNGEQDQ-------EGSLCADIVARFPEAYFFYGMADYQHVIPVHAD 140 Query: 588 VARKKKRNWAEMEP-HFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 VAR+KKRNW+E+E HF+KGGFMD+D ED+MI+VPP+F+ KD+PENLV Sbjct: 141 VARRKKRNWSELEELHFDKGGFMDLDHEDVMIIVPPIFAPKDVPENLVLRPATMSSSKKK 200 Query: 765 HEAIVQQRWEMDIAPCLGLDFDIK 836 E +VQ +EMD+ P L +DFDIK Sbjct: 201 PEEVVQPHFEMDMEPVLAIDFDIK 224 >ref|XP_007039138.1| General transcription factor 3C polypeptide 5, putative isoform 1 [Theobroma cacao] gi|508776383|gb|EOY23639.1| General transcription factor 3C polypeptide 5, putative isoform 1 [Theobroma cacao] Length = 630 Score = 256 bits (653), Expect = 1e-65 Identities = 133/250 (53%), Positives = 168/250 (67%), Gaps = 1/250 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I++G VSG +P E FAVH+PGYP + +RA+ETLGG EGIL+ARSS SN LEL FRP Sbjct: 1 MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYS PAFG RP +NLLLKI+KK++ DGQ A S + E T + Sbjct: 61 EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKV----------RECSTSGATDS 110 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 EN ++ S ++ QI E ++CADIV+RV++ Y+F+GM DYQHVLAVHAD 
Sbjct: 111 ENPKQPSQAE----------VQISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHAD 160 Query: 588 VARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 ARK+KRNWAE EP FEKGGFMD+DQED+M+++PPLFS KD+PEN+V Sbjct: 161 AARKRKRNWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKK 220 Query: 765 HEAIVQQRWE 794 E +VQ E Sbjct: 221 QEGVVQNTAE 230 >ref|XP_006464858.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Citrus sinensis] Length = 605 Score = 252 bits (644), Expect = 1e-64 Identities = 135/267 (50%), Positives = 175/267 (65%), Gaps = 4/267 (1%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I+DG VSG +P E FAVHYPGY SS SRA++TLGG+E ILKARSS SN LELRFRP Sbjct: 1 MGVIKDGKVSGNLPSNEVFAVHYPGYSSSTSRAIQTLGGSEAILKARSSKSNKLELRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNN---DGQEAVISKNLLKGSSTEVASLETMTCHS 398 EDPYSHPAFG RP +NLLLK++KK+ + DGQ +S K + A + + Sbjct: 61 EDPYSHPAFGEVRPCNNLLLKMSKKKTSQPCDGQSPKLSNQTFKHPLHDAADVGNVP--- 117 Query: 399 ETVENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAV 578 + + D+V S Q K ++ ADIVARV++ Y+F+GM DYQHV+AV Sbjct: 118 ------EIHQLESDSVVSRKEAEKQ-KSEDQVNLFADIVARVSEAYHFDGMADYQHVVAV 170 Query: 579 HADVARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXX 755 HADVAR+KKRNW E+ EP FEKGG +D+D++D+M+++PPLF+ KD+PENLV Sbjct: 171 HADVARRKKRNWTEVEEPQFEKGGLIDLDEDDVMMILPPLFAPKDVPENLVLRPSVIPSS 230 Query: 756 XXXHEAIVQQRWEMDIAPCLGLDFDIK 836 + Q E DI L +DF+IK Sbjct: 231 LKKEARVEQNISEKDIESGLAIDFNIK 257 >ref|XP_007157964.1| hypothetical protein PHAVU_002G1131001g, partial [Phaseolus vulgaris] gi|561031379|gb|ESW29958.1| hypothetical protein PHAVU_002G1131001g, partial [Phaseolus vulgaris] Length = 220 Score = 249 bits (636), Expect = 9e-64 Identities = 134/250 (53%), Positives = 170/250 (68%), Gaps = 1/250 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I+DGT+SGV+PE +GF VHYP YPSSISRAV+TLGG +GILKARSS SN LE RFRP Sbjct: 1 
MGVIKDGTISGVIPEPQGFLVHYPAYPSSISRAVDTLGGIQGILKARSSQSNKLEFRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG RP++ LLLKI+K+ K+ G + E +S S V Sbjct: 61 EDPYSHPAFGELRPTNTLLLKISKR-----------KSRCVGDAEEASS-------SSGV 102 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 +NG++ + + S+R Q +CADIVARV+D Y F+GM DYQHV+ +HAD Sbjct: 103 KNGEQENQPE-------SERKQ-----EESLCADIVARVSDAYSFDGMADYQHVIPIHAD 150 Query: 588 VARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXX 764 VAR+KKRNW+E+ EP F+K GFMD D ED+MI+VPP+F+ KD+PENLV Sbjct: 151 VARRKKRNWSELEEPLFDKVGFMDPDHEDVMIIVPPIFAPKDVPENLVLRPATMPCSKKK 210 Query: 765 HEAIVQQRWE 794 E +VQQ +E Sbjct: 211 QEEVVQQHFE 220 >ref|XP_003622988.1| General transcription factor 3C polypeptide [Medicago truncatula] gi|355498003|gb|AES79206.1| General transcription factor 3C polypeptide [Medicago truncatula] Length = 612 Score = 248 bits (634), Expect = 2e-63 Identities = 128/262 (48%), Positives = 171/262 (65%), Gaps = 1/262 (0%) Frame = +3 Query: 45 IMGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFR 224 +MG+I+DGT+SGV+PE +GF VHYPGYPS+ SRAV+TLGG++GILKARSS +N LELRFR Sbjct: 5 LMGVIKDGTISGVLPEPQGFLVHYPGYPSTTSRAVDTLGGSQGILKARSSQANKLELRFR 64 Query: 225 PEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSET 404 PEDPY HPAFG RP++ LLLKI+K++ D A S ++ Sbjct: 65 PEDPYCHPAFGERRPTNALLLKISKRKLPDDDGATTSNSMC------------------- 105 Query: 405 VENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHA 584 G + D V S + ++ E A++CADIV RV + Y+F GM DYQ+V+ VHA Sbjct: 106 ---GMEHGMQADNVESEHGAADKVDE--EANLCADIVGRVPEAYFFEGMADYQYVVPVHA 160 Query: 585 DVARKKKRNWAE-MEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXX 761 DVA++KKRNW+E E H KGG +D+D ED+MI+VPP+F+ KD+PE+L+ Sbjct: 161 DVAKRKKRNWSEPEETHLAKGGRIDVDHEDIMIIVPPIFAPKDMPEDLLLRPPTVSSSKK 220 Query: 762 XHEAIVQQRWEMDIAPCLGLDF 827 E IV +E+D+ P L LDF Sbjct: 221 KEEEIVHPHFEIDMEPVLALDF 242 >ref|XP_004251822.1| PREDICTED: general transcription factor 3C 
polypeptide 5-like [Solanum lycopersicum] Length = 597 Score = 246 bits (628), Expect = 8e-63 Identities = 132/268 (49%), Positives = 174/268 (64%), Gaps = 5/268 (1%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MGII+DG+VSG++P E FAVHYP YPSS+ RAVETLGG +GI+KAR+S SN LEL FRP Sbjct: 1 MGIIKDGSVSGILPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSQSNKLELHFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNND---GQEAVISKNLLKGSSTEVASLETMTCHS 398 EDPYSHP FG + S+N LLKI+K + D A S ++ SS + + E Sbjct: 61 EDPYSHPTFGELKHSNNFLLKISKCKVRDVRSADSADSSCGIVIQSSRSLVNCEQ----- 115 Query: 399 ETVENGQRSSVSDDAVTSGNSDRAQIK--EAVSAHVCADIVARVTDTYYFNGMVDYQHVL 572 EN +++G S +++ + H+ A+IV+ V++ Y+FNGMVDYQHVL Sbjct: 116 ---ENAAPKLNEPRCLSAGASKEIEMQTDTNLQEHLSANIVSHVSEAYHFNGMVDYQHVL 172 Query: 573 AVHADVARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXX 752 AVHAD AR+KKR WAE+EP FEKGG MD+DQED+MIL+P LF+ KD+P+N+V Sbjct: 173 AVHADDARRKKRQWAEVEPKFEKGGLMDVDQEDMMILLPSLFASKDMPDNIVLKSCTTVG 232 Query: 753 XXXXHEAIVQQRWEMDIAPCLGLDFDIK 836 E + WE ++ P L +DF IK Sbjct: 233 SKRKQEG--RHNWEREMEPSLAIDFAIK 258 >ref|XP_006427924.1| hypothetical protein CICLE_v100272412mg, partial [Citrus clementina] gi|557529914|gb|ESR41164.1| hypothetical protein CICLE_v100272412mg, partial [Citrus clementina] Length = 248 Score = 243 bits (621), Expect = 5e-62 Identities = 126/231 (54%), Positives = 163/231 (70%), Gaps = 4/231 (1%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I+DG VSG +P E FAVHYPGY SS SRA++TLGG+E ILKARSS SN LELRFRP Sbjct: 1 MGVIKDGKVSGNLPSNEVFAVHYPGYSSSTSRAIQTLGGSEAILKARSSKSNKLELRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNN---DGQEAVISKNLLKGSSTEVASLETMTCHS 398 EDPYSHPAFG RP +NLLLK++KK+ + DGQ +S K + A + + Sbjct: 61 EDPYSHPAFGEVRPCNNLLLKMSKKKTSQPCDGQSPKLSNQTFKHPLHDAADVGNVP--- 117 Query: 399 ETVENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAV 578 + + D+V S Q K ++ ADIVARV++ Y+F+GM DYQHV+AV Sbjct: 118 
------EIHQLESDSVVSRKEAEKQ-KSEDQVNLFADIVARVSEAYHFDGMADYQHVVAV 170 Query: 579 HADVARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLV 728 HADVAR+KKRNW E+ EP FEKGG +D+D++D+M+++PPLF+ KD+PENLV Sbjct: 171 HADVARRKKRNWTEVEEPQFEKGGLIDLDEDDVMMILPPLFAPKDVPENLV 221 >ref|XP_007203854.1| hypothetical protein PRUPE_ppa004640mg [Prunus persica] gi|462399385|gb|EMJ05053.1| hypothetical protein PRUPE_ppa004640mg [Prunus persica] Length = 498 Score = 241 bits (616), Expect = 2e-61 Identities = 132/264 (50%), Positives = 163/264 (61%), Gaps = 2/264 (0%) Frame = +3 Query: 48 MGIIEDG-TVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFR 224 MG+++DG T +G +P E FA+HYPGYPSS+SRA+ETLGG +GI KA SS SN LEL FR Sbjct: 1 MGVVKDGSTTTGFLPSSEVFAIHYPGYPSSMSRAIETLGGTQGIRKAHSSQSNRLELHFR 60 Query: 225 PEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSET 404 ++PYSHPAFG RP +NLLLKI+K ++N GQ S+ L Sbjct: 61 HQEPYSHPAFGDLRPCNNLLLKISKTKSNAGQTQPQSELL-------------------- 100 Query: 405 VENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHA 584 + D QI E H DIVARV + Y+F+GMVDYQHV+ VHA Sbjct: 101 ---------------ASKQDEVQIPENDRVHF--DIVARVPEAYHFDGMVDYQHVVPVHA 143 Query: 585 DVARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXX 761 DVARKKKRNW E+ +PH +KGG MDIDQED MIL+P LF+ KD+P+NLV Sbjct: 144 DVARKKKRNWIEIKDPHSDKGGLMDIDQEDAMILLPQLFAPKDVPDNLVLKPSVTLSAKK 203 Query: 762 XHEAIVQQRWEMDIAPCLGLDFDI 833 E VQ +WEMD+ P L +DF I Sbjct: 204 NQEEPVQHQWEMDMEPVLAIDFGI 227 >ref|XP_006350004.1| PREDICTED: general transcription factor 3C polypeptide 5-like isoform X1 [Solanum tuberosum] gi|565366663|ref|XP_006350006.1| PREDICTED: general transcription factor 3C polypeptide 5-like isoform X3 [Solanum tuberosum] Length = 561 Score = 239 bits (611), Expect = 7e-61 Identities = 131/263 (49%), Positives = 162/263 (61%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MGII+DG+VSG +P E FAVHYP YPSS+ RAVETLGG +GI+KAR+S SN LEL FRP Sbjct: 1 MGIIKDGSVSGRLPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSESNKLELHFRP 
60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG + S+N LLKI+K + D Q A Sbjct: 61 EDPYSHPAFGELKHSNNFLLKISKCKVRDVQSA--------------------------- 93 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 D V N ++ A + A+IV+ V++ Y+FNGMVDYQHVLAVHAD Sbjct: 94 ---------DSPV---NCEQENSLAAPKERLAANIVSHVSEGYHFNGMVDYQHVLAVHAD 141 Query: 588 VARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXXH 767 AR+KKR WAE+EP FEKGG MD+DQEDLMIL+PPLF+ KD+P+N+V Sbjct: 142 DARRKKRQWAEVEPKFEKGGLMDVDQEDLMILLPPLFASKDMPDNIVLKSCTTLGSKRKQ 201 Query: 768 EAIVQQRWEMDIAPCLGLDFDIK 836 E + WE ++ P L +DF IK Sbjct: 202 EG--RHNWEREMEPSLAIDFTIK 222 >gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis] Length = 553 Score = 234 bits (596), Expect = 4e-59 Identities = 132/250 (52%), Positives = 164/250 (65%), Gaps = 7/250 (2%) Frame = +3 Query: 48 MGIIE-DGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFR 224 MG+I+ DG VSG +P KE FAV+YPGYPSSISRAVETLGG E I KARS SN LEL FR Sbjct: 22 MGVIKKDGRVSGFVPSKEAFAVNYPGYPSSISRAVETLGGLEAIHKARSLQSNRLELHFR 81 Query: 225 PEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSET 404 PEDPYSHPAFG RP ++LLLK+++ ++++GQ+A +S G S Sbjct: 82 PEDPYSHPAFGDLRPCNHLLLKLSRIKSSNGQDAQVS-----GPS--------------A 122 Query: 405 VENGQRSSVSDDAVTSGNSDRA-----QIKEAVSAHVCADIVARVTDTYYFNGMVDYQHV 569 ++NG + SG++ A QI E + CADIVARV + Y+F+GMVDYQHV Sbjct: 123 LQNGNNLDYTYTTRASGSTSSAKQVDVQIPEDDQTNFCADIVARVLEAYHFDGMVDYQHV 182 Query: 570 LAVHADVARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXX 746 AVHADVAR+KKR W E+ EP EK G MD+D++D+M+LVPPLF+ KD PENLV Sbjct: 183 TAVHADVARRKKRKWLELEEPLSEKNGLMDVDEDDVMMLVPPLFAPKDFPENLVLRPSVI 242 Query: 747 XXXXXXHEAI 776 EAI Sbjct: 243 LSSKKNEEAI 252 >ref|XP_004297697.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Fragaria vesca subsp. 
vesca] Length = 553 Score = 221 bits (562), Expect = 3e-55 Identities = 123/267 (46%), Positives = 158/267 (59%), Gaps = 5/267 (1%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSS----NSNSLEL 215 MG+++DGT+SG +P + F VHYPGYPSS+SRA++TLGG + I KA SS N+N LEL Sbjct: 1 MGVVKDGTISGFLPRTQVFGVHYPGYPSSMSRAIDTLGGTQAIHKAHSSASNNNNNRLEL 60 Query: 216 RFRPEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCH 395 RFR +DPYSHPAFG RP ++ LLKI+K ++++ +LL T Sbjct: 61 RFRHDDPYSHPAFGDLRPCNSFLLKISKSKSSES-------DLLAAKLTP---------- 103 Query: 396 SETVENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLA 575 E +VCADIVARV Y+F+GM DYQHV+A Sbjct: 104 ----------------------------ETDQVNVCADIVARVPKAYHFDGMADYQHVIA 135 Query: 576 VHADVARKKKRNWAEME-PHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXX 752 VHADVARK+KRN E E PH ++GG MDIDQED+MIL+P F+ KD+P+NLV Sbjct: 136 VHADVARKRKRNRVETEEPHSDRGGLMDIDQEDVMILLPQFFAPKDVPDNLVLRPSGTLS 195 Query: 753 XXXXHEAIVQQRWEMDIAPCLGLDFDI 833 E VQ + EMD+ P L +DF I Sbjct: 196 VKKNQEEPVQHQLEMDMEPVLAIDFGI 222 >ref|XP_002529107.1| conserved hypothetical protein [Ricinus communis] gi|223531458|gb|EEF33291.1| conserved hypothetical protein [Ricinus communis] Length = 540 Score = 213 bits (543), Expect = 5e-53 Identities = 116/254 (45%), Positives = 148/254 (58%), Gaps = 2/254 (0%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MG+I++G SG++P E FAVHYPGYPSSISRA++TLGG + ILKAR+S SN LEL FRP Sbjct: 1 MGVIKEGEASGIIPSNEAFAVHYPGYPSSISRAIQTLGGTDAILKARTSQSNKLELYFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPYSHPAFG R +NLLLKI+KK+ + C +E Sbjct: 61 EDPYSHPAFGELRACNNLLLKISKKKKKTNSQ----------------------CQTE-- 96 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 + AD+VAR+ + Y+F+GMVDYQHV+AVHAD Sbjct: 97 ------------------------------LSADVVARIPEAYHFDGMVDYQHVVAVHAD 126 Query: 588 -VARKKKRNWAEM-EPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXX 761 A+K+KRNW +M EPHF+K G MD+DQED+MILVPP F+ 
KD+P NL Sbjct: 127 AAAQKRKRNWTQMEEPHFDKAGLMDLDQEDVMILVPPHFTSKDMPVNLALKATSIPSSKK 186 Query: 762 XHEAIVQQRWEMDI 803 E V+ E+ + Sbjct: 187 IQEEAVENHIELHL 200 >ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] gi|332645018|gb|AEE78539.1| transcription factor IIIC, subunit 5 [Arabidopsis thaliana] Length = 574 Score = 213 bits (543), Expect = 5e-53 Identities = 110/263 (41%), Positives = 154/263 (58%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MGIIE+GT+SG +P KE F VH+PGYPSSISRA+ETLGG +GI +AR S SN LELRFRP Sbjct: 1 MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPY+HPA G RP S LL+I+K + + Sbjct: 61 EDPYAHPALGEQRPCSGFLLRISK---------------------------------QDI 87 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 + + SV D + R E S +CADIVAR++++++F+GM DYQHV+ +HAD Sbjct: 88 KKPESQSVLD-------TSRDVCLEEASPVLCADIVARLSESFHFDGMADYQHVIPIHAD 140 Query: 588 VARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXXH 767 +A++KKR W +++P K M + ED+M+L+P F+ KDIP+N+ Sbjct: 141 IAQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKD 200 Query: 768 EAIVQQRWEMDIAPCLGLDFDIK 836 +A Q +E+D+ P +DF +K Sbjct: 201 DAATQNFYEIDVGPVFAIDFSVK 223 >gb|EPS67527.1| hypothetical protein M569_07249 [Genlisea aurea] Length = 548 Score = 213 bits (541), Expect = 9e-53 Identities = 120/274 (43%), Positives = 165/274 (60%), Gaps = 11/274 (4%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEG--FAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRF 221 MG+IE+G++SGV+ FAV+YPGYPSS+ RA+ETLGG+ GILK + S LELRF Sbjct: 1 MGLIEEGSISGVLAGSINGVFAVNYPGYPSSVERAIETLGGSHGILKVHADKSKKLELRF 60 Query: 222 RPEDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSE 401 RPEDPYSHPAFG + +N LLKI+KK+ D N GSS + SL E Sbjct: 61 RPEDPYSHPAFGERQSCNNFLLKISKKKAKD------VHNETSGSS-QAESLHV----RE 109 Query: 402 TVENGQRSSVSDDAVTSGNSDRAQIKE-AVSAHVCADIVARVTDTYYFNGMVDYQHVLAV 578 + G + +++ + + D 
A+ K+ + + A IV+R+++ Y+FNGM DYQHVL + Sbjct: 110 SSGKGTAAGNESESIPASSVDEARKKDGGIQDQLSACIVSRISEAYHFNGMADYQHVLPL 169 Query: 579 HADVARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXX 758 HAD + +KKR WAE+E K +D+D ED+MILVPPLFSLKD PE ++ Sbjct: 170 HADSSGRKKRTWAEVEKSVGKDDLLDVDLEDIMILVPPLFSLKDQPEKILLKPCVESNVK 229 Query: 759 XXHEAIVQQRWE--------MDIAPCLGLDFDIK 836 E + E M+I PCL +DF++K Sbjct: 230 KKPEENAEPPAEESSSVTKQMEIEPCLAIDFNVK 263 >dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana] Length = 574 Score = 212 bits (539), Expect = 2e-52 Identities = 109/263 (41%), Positives = 153/263 (58%) Frame = +3 Query: 48 MGIIEDGTVSGVMPEKEGFAVHYPGYPSSISRAVETLGGNEGILKARSSNSNSLELRFRP 227 MGIIE+GT+SG +P KE F VH+PGYPSSISRA+ETLGG +GI +AR S SN LELRFRP Sbjct: 1 MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60 Query: 228 EDPYSHPAFGVPRPSSNLLLKITKKRNNDGQEAVISKNLLKGSSTEVASLETMTCHSETV 407 EDPY+HPA G RP S LL+I+K + + Sbjct: 61 EDPYAHPALGEQRPCSGFLLRISK---------------------------------QDI 87 Query: 408 ENGQRSSVSDDAVTSGNSDRAQIKEAVSAHVCADIVARVTDTYYFNGMVDYQHVLAVHAD 587 + + SV D + R E S +CADIVAR++++++F+GM DYQHV+ +HAD Sbjct: 88 KKPESQSVLD-------TSRDVCLEEASPVLCADIVARLSESFHFDGMADYQHVIPIHAD 140 Query: 588 VARKKKRNWAEMEPHFEKGGFMDIDQEDLMILVPPLFSLKDIPENLVXXXXXXXXXXXXH 767 +A++KKR W +++P K M + ED+M+L+P F+ KDIP+N+ Sbjct: 141 IAQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKD 200 Query: 768 EAIVQQRWEMDIAPCLGLDFDIK 836 + Q +E+D+ P +DF +K Sbjct: 201 DVATQNFYEIDVGPVFAIDFSVK 223
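
Each alignment in the report above carries a statistics line of the form "Score = ... bits (...), Expect = ... Identities = a/b (..%), Positives = c/d (..%)", optionally followed by "Gaps = ...". As a minimal sketch, assuming that line layout holds for every hit, the fields can be pulled out and the percent identity recomputed as follows. The names `STATS_RE` and `parse_hit_stats` are hypothetical helpers written for this illustration and are not part of BLAST or of any library.

```python
import re

# Hypothetical pattern for the per-hit statistics line of a legacy
# BLAST text report; the "Gaps" field is optional because hits
# without gaps omit it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
)


def parse_hit_stats(text):
    """Return a dict of alignment statistics, or None if no stats line is found."""
    match = STATS_RE.search(text)
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # Legacy BLAST may print very small E-values as "e-100" with no mantissa.
    if evalue_text.lower().startswith("e"):
        evalue_text = "1" + evalue_text
    identities = int(match.group("ident"))
    length = int(match.group("length"))
    return {
        "bits": float(match.group("bits")),
        "raw_score": int(match.group("raw")),
        "evalue": float(evalue_text),
        "identities": identities,
        "align_length": length,
        "positives": int(match.group("pos")),
        "gaps": int(match.group("gaps") or 0),
        # Recomputed from the counts, so it is not rounded like the report's value.
        "pct_identity": 100.0 * identities / length,
    }


if __name__ == "__main__":
    line = ("Score = 272 bits (696), Expect = 1e-70 Identities = 143/264 (54%), "
            "Positives = 179/264 (67%), Gaps = 1/264 (0%) Frame = +3")
    print(parse_hit_stats(line))
```

The regular expression tolerates the collapsed whitespace seen in this dump, since it only requires some whitespace between fields rather than line breaks. For anything beyond a quick check, a maintained BLAST parser is likely a safer choice than a hand-written pattern.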