BLASTX nr result
ID: Akebia22_contig00014307
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00014307
         (1931 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246...   449   e-123
ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260...   441   e-121
emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera]   432   e-118
emb|CBI21214.3| unnamed protein product [Vitis vinifera]              431   e-118
ref|XP_007215547.1| hypothetical protein PRUPE_ppa007206mg [Prun...   397   e-108
ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citr...   391   e-106
ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma...   390   e-106
ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Popu...   383   e-103
ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313...   374   e-101
ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Popu...   370   2e-99
ref|XP_006354362.1| PREDICTED: transcription initiation factor T...   369   2e-99
ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292...   369   2e-99
ref|XP_002519508.1| conserved hypothetical protein [Ricinus comm...   368   5e-99
ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264...   366   2e-98
ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [A...   363   1e-97
ref|XP_003552582.1| PREDICTED: transcription initiation factor T...   360   1e-96
gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis]     358   6e-96
ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus...   354   9e-95
ref|XP_003531863.1| PREDICTED: transcription initiation factor T...
352 3e-94 ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215... 350 1e-93 >ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246447 [Vitis vinifera] Length = 377 Score = 449 bits (1156), Expect = e-123 Identities = 235/383 (61%), Positives = 279/383 (72%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGGGESGR ++ K+K DF +AIAKIAVAQICE+AGFQG QQSALETLS++ Sbjct: 1 MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+ LA SG V E Sbjct: 58 VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I+QYVS AEE+PFA +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD TYVH+ Sbjct: 118 IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P+ NER DP IEQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D KA++AAE+ Sbjct: 178 PVLNERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 237 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFLS+PL FGEK VSPV LP LSNE V + N AVAN SVLETFAP IE Sbjct: 238 NPFLSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGE--NHAVAN-HVSVLETFAPAIELM 294 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 KS C++E+G +KVL ++RP V FK +GKKS G LDL+ QN K SWFG Sbjct: 295 KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 354 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE+ILKESM+NPQEL QL Sbjct: 355 DKKRRAEKILKESMKNPQELAQL 377 >ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260255 [Vitis vinifera] Length = 368 Score = 441 bits (1135), Expect = e-121 Identities = 228/384 (59%), Positives = 280/384 (72%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGG + R ++++ K R G D+F RA++KIAVAQICE+ GF+G Q SAL+ LS+I Sbjct: 1 
MSDGGEDDRRNSDNNAPK---RAGPDEFGRAVSKIAVAQICESVGFEGFQDSALQALSNI 57 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RYLCD+GKTA+F ANLAGRT CNVFD+IRGLE+LG+++GF GAS V + SSG V E Sbjct: 58 AVRYLCDVGKTANFCANLAGRTQCNVFDVIRGLEDLGSSEGFSGASGVDQCIVSSGTVRE 117 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I++YV+ A+E+PFAQP+P FPV+R K TPSF+Q+GETP GKHIP WLPAFPDSHTY+ T Sbjct: 118 IVEYVNSAKEIPFAQPVPRFPVVRNCKATPSFVQMGETPVGKHIPPWLPAFPDSHTYIQT 177 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-EAPAPLNPEDEWKAKQAAE 732 PMWNER TDPR DK+EQARQRRKAE SLLSLQQRL C+GS A + D+ +A +AAE Sbjct: 178 PMWNERATDPRADKLEQARQRRKAERSLLSLQQRLVCNGSASASTSVGRCDDAEASRAAE 237 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 NP+L+SPLQFGEKDVS VV LPAKL ++ V + SVLETFAP IEA Sbjct: 238 GNPYLASPLQFGEKDVSTVV-------------LPAKLLDDLVVDNHVSVLETFAPAIEA 284 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 K+ F D+ + E+ V+P KR VHFK GKK LG +DL L+N + GK VS G Sbjct: 285 VKNSFVDSGESEKNVVPEKRSAVHFKLRTGKKILGESVDLRLKNKSVGKVVSLIGRDEER 344 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE IL++SMENPQEL QL Sbjct: 345 DDKKRRAEYILRQSMENPQELTQL 368 >emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera] Length = 366 Score = 432 bits (1111), Expect = e-118 Identities = 229/383 (59%), Positives = 274/383 (71%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGGGESGR ++ K+K DF +AIAKIAVAQICE+AGFQG QQSALETLS++ Sbjct: 1 MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+ LA SG V E Sbjct: 58 VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I+QYVS AEE+PFA +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD TYVH+ Sbjct: 118 
IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P+ +EQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D KA++AAE+ Sbjct: 178 PV-----------TLEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 226 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFLS+PL FGEK VSPV LP LSNE V + N AVAN SVLETFAP IE Sbjct: 227 NPFLSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGE--NHAVAN-HVSVLETFAPAIELM 283 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 KS C++E+G +KVL ++RP V FK +GKKS G LDL+ QN K SWFG Sbjct: 284 KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 343 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE+ILKESM+NPQEL QL Sbjct: 344 DKKRRAEKILKESMKNPQELAQL 366 >emb|CBI21214.3| unnamed protein product [Vitis vinifera] Length = 357 Score = 431 bits (1109), Expect = e-118 Identities = 228/383 (59%), Positives = 271/383 (70%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGGGESGR ++ K+K DF +AIAKIAVAQICE+AGFQG QQSALETLS++ Sbjct: 1 MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+ LA SG V E Sbjct: 58 VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I+QYVS AEE+PFA +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD TYVH+ Sbjct: 118 IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P+ NER DP IEQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D KA++AAE+ Sbjct: 178 PVLNERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 237 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFLS+PL FGEK VSPV LPAKLSNEA TFAP IE Sbjct: 238 NPFLSAPLHFGEKGVSPVF-------------LPAKLSNEA----------TFAPAIELM 274 Query: 548 
KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 KS C++E+G +KVL ++RP V FK +GKKS G LDL+ QN K SWFG Sbjct: 275 KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 334 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE+ILKESM+NPQEL QL Sbjct: 335 DKKRRAEKILKESMKNPQELAQL 357 >ref|XP_007215547.1| hypothetical protein PRUPE_ppa007206mg [Prunus persica] gi|462411697|gb|EMJ16746.1| hypothetical protein PRUPE_ppa007206mg [Prunus persica] Length = 378 Score = 397 bits (1020), Expect = e-108 Identities = 212/383 (55%), Positives = 263/383 (68%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGGGESGR +E + ++K GDDF+RAIAKIAVAQ+CE GFQ Q SALETLSD+ Sbjct: 1 MSDGGGESGREHEQHNRTQRK-SSGDDFARAIAKIAVAQVCEIVGFQTYQLSALETLSDV 59 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 + Y+ ++GKTAHFYANL+GR DCNVFDII+GLE+LG AQGF GAS+V LASSG V E Sbjct: 60 AVHYIHNIGKTAHFYANLSGRMDCNVFDIIQGLEDLGLAQGFAGASDVDHCLASSGTVRE 119 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I QYV E +PF+ IP FPV++ +K TPSFLQ G G+HIP WLPAFP+ HTYV + Sbjct: 120 IAQYVGETEHIPFSYSIPQFPVVKDRKLTPSFLQSGVETLGEHIPIWLPAFPEPHTYVPS 179 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P+ NER + TD IEQ +++R E SL +LQ+RL C+G E P+ ++P D KAKQA ES Sbjct: 180 PISNERARELHTDMIEQKKKQRNVERSLFNLQRRLVCNGLEGPS-IDPGDADKAKQARES 238 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFL++PLQ+GE +VS V LP LS+E L A+ VA SVLETFAP IEA Sbjct: 239 NPFLAAPLQYGETEVSHVALPAKLSSEATVEKLVAE---NRVAEKCSSVLETFAPAIEAM 295 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 KS C++++ +++L S+RP V FK G+ K S L + N K SWFG Sbjct: 296 KSSSCESQEEHKEILLSRRPTVQFKIGIAKTSFSTMLHSSPHNKGFQKNYSWFGRENEKD 355 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE+ILK SMEN QEL QL Sbjct: 356 EKKKRAEKILKNSMENSQELAQL 378 >ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citrus clementina] 
gi|568880174|ref|XP_006493009.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Citrus sinensis] gi|568885488|ref|XP_006495304.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Citrus sinensis] gi|557530450|gb|ESR41633.1| hypothetical protein CICLE_v10012002mg [Citrus clementina] Length = 370 Score = 391 bits (1005), Expect = e-106 Identities = 207/384 (53%), Positives = 256/384 (66%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 M+ GGGES +E R +DFSRA++K+AVAQICE+ GFQG + SAL+ L DI Sbjct: 1 MNHGGGESTSRSESRTDTSSDRPKAEDFSRAVSKMAVAQICESVGFQGFKDSALDALLDI 60 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 IRY+CDLGKT+ F ANLA RT+CN+FDIIRG+E+L +GF GA+ + L SG+V E Sbjct: 61 AIRYICDLGKTSSFQANLACRTECNLFDIIRGIEDLEVLKGFMGAAEIGKCLVGSGIVKE 120 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I+ +V EE+PFAQPIP +PVIR ++ PSF ++ ETP GKHIP+WLPAFPD HTY++T Sbjct: 121 IIDFVESKEEIPFAQPIPQYPVIRSRRLIPSFEEMNETPPGKHIPSWLPAFPDPHTYIYT 180 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 PMWNER +DPR DKIE ARQRRKAE +LLSLQQRL C+G + P ++ + S Sbjct: 181 PMWNERKSDPRADKIELARQRRKAEMALLSLQQRLVCNGETGTSASRPANDEEELLKTGS 240 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPF + PLQ GEKD+SPV LPAKL ++ SV+E FAP IEA Sbjct: 241 NPFFAKPLQSGEKDISPV-------------GLPAKLKDKMSGGNHMSVMEAFAPAIEAV 287 Query: 548 K-SGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 K SGF D DG+R+ LP KRP VHFKF GKK LG LD +LQ G+ + F Sbjct: 288 KVSGFSDDADGDRRYLPEKRPAVHFKFRAGKKFLGEILDSSLQKKG-GRRSASFWRDEEK 346 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE ILK+S+ENPQEL QL Sbjct: 347 DDKKRRAEFILKQSIENPQELSQL 370 >ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma cacao] gi|508715998|gb|EOY07895.1| TBP-associated factor 8, putative [Theobroma cacao] Length = 373 Score = 390 bits (1003), Expect = e-106 Identities = 213/386 
(55%), Positives = 261/386 (67%), Gaps = 4/386 (1%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKK---REGGDDFSRAIAKIAVAQICETAGFQGIQQSALETL 1278 MS GG ES R ++ R DDF RA++KI+VAQICE G+QG ++SALE L Sbjct: 1 MSHGGVESTRDTRESEGQRSLPLGRPKADDFGRAVSKISVAQICECVGYQGFKESALEAL 60 Query: 1277 SDITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGM 1098 +DI IRYLCDLGKT+ F+ANLAGRT+CN+FDI + LEELG + GF GAS + LA SG Sbjct: 61 ADIAIRYLCDLGKTSSFHANLAGRTECNMFDITQSLEELGASYGFSGASEIGHCLAGSGA 120 Query: 1097 VGEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTY 918 V EI+Q+V EE+PFAQP+P FPV+R +K PSF + ETP GKHIPAWLPAFPD HTY Sbjct: 121 VREIIQFVGSKEEIPFAQPVPQFPVVRNRKLIPSFEHMNETPPGKHIPAWLPAFPDPHTY 180 Query: 917 VHTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-EAPAPLNPEDEWKAKQ 741 +HTPMWNER +DPR DKIEQARQRRKAE +LLSLQQRL C+GS E A L + + + Q Sbjct: 181 IHTPMWNERASDPRADKIEQARQRRKAERALLSLQQRLVCNGSTETSASLVVDAKKETIQ 240 Query: 740 AAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPV 561 A +N FL++PLQ GEKDV+ VV LPAKLS+E + S+LE FAP Sbjct: 241 EAGNNAFLAAPLQPGEKDVARVV-------------LPAKLSDEVSKDNHVSLLEAFAPA 287 Query: 560 IEAAKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXX 381 IEA K G DGE+ +LP +RP VHFKF GKK LG LDL+LQ E ++ ++F Sbjct: 288 IEAMKGGPSGELDGEKMLLPERRPAVHFKFRTGKKILGESLDLSLQKKGE-RSTTFFLRD 346 Query: 380 XXXXXXXXRAEQILKESMENPQELVQ 303 RAE IL+++ E P EL Q Sbjct: 347 EERDDKKRRAEFILRQTTEYPMELNQ 372 >ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|566213067|ref|XP_006373367.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|222866906|gb|EEF04037.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] gi|550320186|gb|ERP51164.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa] Length = 382 Score = 383 bits (984), Expect = e-103 Identities = 208/385 (54%), Positives = 258/385 (67%), Gaps = 2/385 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEH--DVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLS 1275 MS GGGESGR+++ D K+K R 
GD+F+RAIAKIAVAQ+CET GFQ QQSALE LS Sbjct: 1 MSHGGGESGRLHDKAGDSGKRKSRVSGDEFTRAIAKIAVAQMCETVGFQSFQQSALEKLS 60 Query: 1274 DITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMV 1095 D+T Y+ +LGKTA FYANLAGRT+ NVFD+I+G+EELG +QGF GASNV LASSG+V Sbjct: 61 DVTTWYIRNLGKTAQFYANLAGRTEGNVFDVIQGMEELGLSQGFAGASNVDHCLASSGIV 120 Query: 1094 GEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYV 915 EI+QY+ AE++PF IP FPV R++KP PSF QI E +HIPAWLPAFPD T+V Sbjct: 121 REIVQYIGDAEDIPFVYSIPPFPVARERKPVPSFFQICEESPAEHIPAWLPAFPDPQTHV 180 Query: 914 HTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAA 735 P NE DKIE AR K + S ++L Q TC+GS P+ + + +A Q Sbjct: 181 QLPAGNEGDAVFNADKIEPARHHLKMDMSSMNLPQHFTCNGSGGPSSVTFGNSARATQGT 240 Query: 734 ESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIE 555 ESNPFL++PLQFGEK+VS +V P LS+E + + + SVLETFAP IE Sbjct: 241 ESNPFLAAPLQFGEKEVSHLVPPARLSDEAAVR---YPVEQNRIMDNHISVLETFAPAIE 297 Query: 554 AAKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXX 375 A KS FCD+E+G++KVL ++RP V FK VGK SL DL+ Q K WFG Sbjct: 298 AMKSRFCDSEEGQKKVLLNQRPAVQFKIQVGKNSLAGAPDLSPQKIGIEKISKWFGKDSE 357 Query: 374 XXXXXXRAEQILKESMENPQELVQL 300 RAE+ILK+SMENP EL +L Sbjct: 358 NDDKKRRAEKILKQSMENPSELGEL 382 >ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313446 [Fragaria vesca subsp. 
vesca] Length = 390 Score = 374 bits (961), Expect = e-101 Identities = 212/402 (52%), Positives = 262/402 (65%), Gaps = 20/402 (4%) Frame = -1 Query: 1448 MSDGGGESGRVNEH-----DVKKKKKR---EGGDDFSRAIAKIAVAQICETAGFQGIQQS 1293 MS G ES RVNE D ++ ++ GGD+F RA++K+AVAQICE GF G ++S Sbjct: 1 MSHGDAESSRVNESGSGEDDAPRRAQQLSGGGGDEFGRAVSKVAVAQICEGVGFLGCKES 60 Query: 1292 ALETLSDITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSL 1113 AL++L+DI IRYL DLGK A++YANLAGRT+ NVFD++RGLE+L +QGF GA+ V L Sbjct: 61 ALDSLADIAIRYLRDLGKMANYYANLAGRTESNVFDVVRGLEDLEASQGFSGAAEVRHCL 120 Query: 1112 ASSGMVGEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFP 933 A SG + ++QYV AEE+PFAQ +P FPV++ ++ SF ++GE P GKH+P WLPAFP Sbjct: 121 AGSGTMKGLVQYVGTAEEIPFAQSLPRFPVVKDRRLILSFERMGEAPPGKHLPNWLPAFP 180 Query: 932 DSHTYVHTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-------EAPAP 774 D HTY+H+PMWNER TDPR DKIEQARQRRKAE SLLSLQQRL C+GS AP Sbjct: 181 DPHTYIHSPMWNERKTDPREDKIEQARQRRKAERSLLSLQQRLLCNGSAPGLASPSAPVS 240 Query: 773 LNPEDEWKAK-QAAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVAN 597 + D K Q ESNPFL PLQ GEKDVSPVVLP+ S EV+A Sbjct: 241 VVGNDGKGLKLQGGESNPFLEPPLQPGEKDVSPVVLPSKFS-EVLAK------------G 287 Query: 596 TQPSVLETFAPVIEAAKSGFCDTEDG----ERKVLPSKRPVVHFKFGVGKKSLGVPLDLT 429 SVLE FAP I+A K+G +G E K+LP+ RP VH KF KK LG DL+ Sbjct: 288 NSSSVLEAFAPAIQAVKNGVWMDGEGDVEEESKLLPNSRPPVHLKFRPVKKFLGESSDLS 347 Query: 428 LQNNAEGKTVSWFGXXXXXXXXXXRAEQILKESMENPQELVQ 303 LQ G+ +W RAE IL++SM+NPQEL Q Sbjct: 348 LQKKGSGRPANWVLRDEERDEKKRRAEFILRQSMQNPQELNQ 389 >ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Populus trichocarpa] gi|222848349|gb|EEE85896.1| hypothetical protein POPTR_0004s11520g [Populus trichocarpa] Length = 394 Score = 370 bits (949), Expect = 2e-99 Identities = 202/383 (52%), Positives = 252/383 (65%), Gaps = 3/383 (0%) Frame = -1 Query: 1439 GGGESGRVNE---HDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 GGGESGR++E H+ K+K R GD+F+RAI KIAVAQ+CE+ GFQ QQSALETL+D+ Sbjct: 16 
GGGESGRLHEKVGHN-GKRKSRASGDEFARAIGKIAVAQMCESMGFQSFQQSALETLTDV 74 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 T Y+ ++GK A ANLAGRT+ NVFD+I+GLEELG QGF GAS+V LASSG+V E Sbjct: 75 TTWYIRNIGKAAQLCANLAGRTEGNVFDVIQGLEELGLPQGFAGASDVDHCLASSGIVRE 134 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I QY+ A+++PFA IP FPV R++KP PSF QIGE P +HIPAWLPAFPD TY Sbjct: 135 IAQYIGDADDIPFAYSIPPFPVARERKPAPSFSQIGEEPPEEHIPAWLPAFPDPQTYAQL 194 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P NE D D IE RQ +K + S ++L Q+ C+GSE P+ + D KA Q S Sbjct: 195 PEGNEGRADLNADNIESVRQHQKMDVSYMNLPQQFNCNGSEGPSSVAFGDSAKATQRTVS 254 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFL++PLQFG K+VS VV P LS+E + + SV++TFAP IEA Sbjct: 255 NPFLAAPLQFGVKEVSHVVPPAKLSDEAAVR---YPVEQTRTMDNNMSVMKTFAPAIEAM 311 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 KS CD+ +G++KV ++RP V FK GVGK SL DL+LQN K W G Sbjct: 312 KSRLCDSGEGQKKVFFNQRPAVQFKIGVGKNSLDGAPDLSLQNKGIKKISMWSGKDSEND 371 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE+ILK+SMENP EL QL Sbjct: 372 DQKRRAEKILKQSMENPGELAQL 394 >ref|XP_006354362.1| PREDICTED: transcription initiation factor TFIID subunit 8-like [Solanum tuberosum] Length = 374 Score = 369 bits (948), Expect = 2e-99 Identities = 204/383 (53%), Positives = 246/383 (64%), Gaps = 3/383 (0%) Frame = -1 Query: 1439 GGGESGRVNEHDVKK-KKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITI 1263 G E R E V +++R G DDF RAI++ AVAQICE+ GF+ +SALE+L+DI I Sbjct: 5 GNAEDKREKESTVDNTREERAGTDDFGRAISRTAVAQICESIGFEIFNESALESLADIAI 64 Query: 1262 RYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIM 1083 +Y+ DLGKTA ANLAGRT CNVFDII GLE++ + GF AS V SSG+V E++ Sbjct: 65 KYILDLGKTASSSANLAGRTQCNVFDIIHGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124 Query: 1082 QYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPM 903 +YV AEE+PF+QP+PHFPV++ PSFLQIGETP KHIP WLPAFPD HTYV TP Sbjct: 125 
EYVESAEEIPFSQPLPHFPVVKHPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184 Query: 902 WNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSE-APAPLNPEDEWKAKQAAES- 729 WNER +DPR DKIE ARQRRKAE SLL+LQQRL C+GS P+D A++S Sbjct: 185 WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVGSTSRQPDDVGITSSASKSE 244 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFL+ P Q GEKDV PV LPT KLS+E S+LETF+P I+A Sbjct: 245 NPFLAKPFQAGEKDVDPVALPT-------------KLSSEVDDKNHVSLLETFSPAIQAM 291 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 K G +T +G K LP KRP V +F GKK+LG LDL L G+ S F Sbjct: 292 KDGLSETVNGTEKTLPDKRPAVCLEFRPGKKALGDSLDLRLWKKGSGRNASLFRRDEDRD 351 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE IL++S EN QEL QL Sbjct: 352 DKKRRAELILRQSRENQQELTQL 374 >ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292232 [Fragaria vesca subsp. vesca] Length = 379 Score = 369 bits (948), Expect = 2e-99 Identities = 203/384 (52%), Positives = 255/384 (66%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVK-KKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272 MSDGGGES R +E + +K GDDF+RA++KIAVAQ+CE G+Q Q SALETLSD Sbjct: 1 MSDGGGESAREHEQSNRITLRKPSCGDDFARAVSKIAVAQVCEVVGYQSFQLSALETLSD 60 Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092 + ++Y+ ++GKTAH YANL+GRTDCNVFDII+GLE+L AQGF GAS++ LASSG + Sbjct: 61 VAVQYIRNVGKTAHLYANLSGRTDCNVFDIIQGLEDLSAAQGFAGASDINHCLASSGTIK 120 Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912 EI QYV+ AE VPFA IP FPV++ +K TPSF Q GE G+HIP WLPAFP+ HTY Sbjct: 121 EISQYVAEAEHVPFAYTIPRFPVVKDRKLTPSFWQSGEETPGEHIPTWLPAFPEPHTYSR 180 Query: 911 TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732 + NE T+P + +EQ +Q+R E ++L+ RL C+G E P+ L+P D AKQA E Sbjct: 181 STTCNEGATEPDSALVEQEKQQRNVERAMLNFHHRLVCNGMEGPS-LDPGDGVNAKQARE 239 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 SNPFL++PLQFGE +VS V LP LS E TL K N A + SVLETFAP IEA Sbjct: 240 
SNPFLATPLQFGETEVSQVTLPAKLSIEATEETL--KAENHA-KDKCSSVLETFAPAIEA 296 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 K+ + E+ ++K L S++P V FK G+ KKSLG L + WFG Sbjct: 297 IKNKPFEVEE-DQKTLLSRKPTVQFKIGMSKKSLGTMLYSGPHKKGFEEVYPWFGRENEK 355 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE+ILK SMEN QEL QL Sbjct: 356 DEKKRRAEKILKNSMENSQELAQL 379 >ref|XP_002519508.1| conserved hypothetical protein [Ricinus communis] gi|223541371|gb|EEF42922.1| conserved hypothetical protein [Ricinus communis] Length = 356 Score = 368 bits (945), Expect = 5e-99 Identities = 193/371 (52%), Positives = 244/371 (65%), Gaps = 2/371 (0%) Frame = -1 Query: 1406 DVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITIRYLCDLGKTAHF 1227 D + R DDF RA++++AVAQICE+ GF G ++SAL++L+++ IRY+ DLGK A+ Sbjct: 5 DEESTSARRKADDFGRAVSRMAVAQICESVGFHGCKESALDSLTEVAIRYIIDLGKIANS 64 Query: 1226 YANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIMQYVSLAEEVPFA 1047 +ANL+GRT CN+FDI+RG E++G GF GASN + SG V EI+++V EE+PFA Sbjct: 65 HANLSGRTQCNLFDIVRGFEDVGAPLGFSGASNSGNCVVCSGTVKEIIEFVESTEEIPFA 124 Query: 1046 QPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPMWNERVTDPRTDK 867 QP+P FPV+R ++ PSFL +GE P GKHIPAWLPA PD HTYVHTPMWNERV DPR +K Sbjct: 125 QPVPPFPVVRDKRLIPSFLNMGEIPPGKHIPAWLPALPDPHTYVHTPMWNERVVDPRAEK 184 Query: 866 IEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEW-KAKQAAESNPFLSSPLQFGEK 690 IEQARQRRKAE +LLSLQQRL +GS + + + + ESN FL+ PL+ GEK Sbjct: 185 IEQARQRRKAERALLSLQQRLLSNGSAGASTSVASNHYVQELGVGESNRFLARPLKPGEK 244 Query: 689 DVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAAK-SGFCDTEDGER 513 VS VV+P L V +++ F P IEAAK GF D E+ ER Sbjct: 245 AVSTVVVPDKLKTSV-------------------PLIKAFEPAIEAAKGGGFADDEESER 285 Query: 512 KVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXXXXXXRAEQILKE 333 K+LP KRP V+FKF GKK LG PLDL+L + G W G RAE IL++ Sbjct: 286 KLLPEKRPAVNFKFKTGKKMLGEPLDLSLSRKSGGTAGHWLGPVDERDDKKRRAEYILRQ 345 Query: 332 SMENPQELVQL 300 SMENPQEL QL Sbjct: 346 SMENPQELTQL 356 >ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264247 
[Solanum lycopersicum] Length = 373 Score = 366 bits (939), Expect = 2e-98 Identities = 203/383 (53%), Positives = 249/383 (65%), Gaps = 3/383 (0%) Frame = -1 Query: 1439 GGGESGRVNEHDVKK-KKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITI 1263 G E R E V +++R G DDF RA+++ AVAQICE+ GF+ +SALE+L+DI I Sbjct: 5 GNAEDKREKESTVDNTREERIGTDDFGRAVSRTAVAQICESIGFEIFNESALESLADIAI 64 Query: 1262 RYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIM 1083 +Y+ DLGKTA+ AN+AGRT CNVFDII+GLE++ + GF AS V SSG+V E++ Sbjct: 65 KYILDLGKTANSKANIAGRTQCNVFDIIQGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124 Query: 1082 QYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPM 903 +YV AEE+PF+QP+PHFPV+++ PSFLQIGETP KHIP WLPAFPD HTYV TP Sbjct: 125 EYVESAEEIPFSQPLPHFPVVKQPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184 Query: 902 WNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSE-APAPLNPEDEWKAKQAAES- 729 WNER +DPR DKIE ARQRRKAE SLL+LQQRL C+GS A P+D A++S Sbjct: 185 WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVASTSRQPDDVGITSSASKSE 244 Query: 728 NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549 NPFL+ P Q GEKDV PV LPT KLS+E S+LETF+P I+A Sbjct: 245 NPFLAKPFQAGEKDVDPVALPT-------------KLSSEVDDKNHVSLLETFSPAIQAM 291 Query: 548 KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369 K G +T DG K LP KRP V +F GKK+LG LDL L + S F Sbjct: 292 KDGLSETVDGTEKTLPDKRPAVCLEFRPGKKALGDSLDLRLWKKG-SRNASLFRRDEDRD 350 Query: 368 XXXXRAEQILKESMENPQELVQL 300 RAE IL++S EN QEL QL Sbjct: 351 DKKRRAELILRQSRENQQELTQL 373 >ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [Amborella trichopoda] gi|548848527|gb|ERN07558.1| hypothetical protein AMTR_s00154p00079940 [Amborella trichopoda] Length = 375 Score = 363 bits (933), Expect = 1e-97 Identities = 199/388 (51%), Positives = 266/388 (68%), Gaps = 5/388 (1%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 M+DGGGES R + ++ + D+F RA+ +++VAQICE+AG+ Q+SALE L+DI Sbjct: 1 MNDGGGESRRNIDECKSERGGEQEEDEFGRAVTRVSVAQICESAGYHTFQRSALEALADI 60 Query: 1268 
TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RYL DLG++A F+ANLAGRT CNVFD+I+ LE+LG++QGF GAS+V LA+SG + + Sbjct: 61 ALRYLRDLGRSARFHANLAGRTACNVFDVIQALEDLGSSQGFAGASDVNHPLAASGALKD 120 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I++Y ++AEE+PFA+ +P FP+ + +KPTPSFLQ+GETP KHIP+WLPAFPD HTY+HT Sbjct: 121 IIRYTNIAEEIPFARAVPRFPIPKTRKPTPSFLQLGETPPHKHIPSWLPAFPDPHTYIHT 180 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE- 732 P+WNER +DPRT+K+EQARQRRKAE SL+SLQQRL C+G+ + + E K K+ + Sbjct: 181 PVWNERGSDPRTEKLEQARQRRKAEKSLVSLQQRLACNGA---TMASMDGELKGKRPLDG 237 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 +NPFL+ PL GEK+ S V +P LS + + K +V N FAP EA Sbjct: 238 NNPFLAPPLLSGEKEASLVPMPAGLSLKSPDENIEKKPGGLSVVN-------AFAPANEA 290 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLG-VPLDLTLQNNAEG---KTVSWFGX 384 AK G D R++ P KRPVV FKFG+ K+++ PL + N G +SWF Sbjct: 291 AKGG--GLIDEARQLKP-KRPVVQFKFGLDKRTVNPAPLLFGNRYNRTGGNATDMSWFSR 347 Query: 383 XXXXXXXXXRAEQILKESMENPQELVQL 300 RAEQILKE+MENPQELVQL Sbjct: 348 DEEKDDKKKRAEQILKEAMENPQELVQL 375 >ref|XP_003552582.1| PREDICTED: transcription initiation factor TFIID subunit 8-like isoform X1 [Glycine max] Length = 381 Score = 360 bits (925), Expect = 1e-96 Identities = 185/384 (48%), Positives = 255/384 (66%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGR-VNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272 MS+GGG++GR + + +++K GGDD++RAIAKIAVAQ+CE GFQ QQSALE LSD Sbjct: 1 MSNGGGKTGRQLEQPGTWRRRKVGGGDDYARAIAKIAVAQVCEGEGFQAFQQSALEALSD 60 Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092 + +RY+ ++GK+AH +ANL+GRT+CN FD+I+GLE++G+ QGF GA++V L SSG++ Sbjct: 61 VVVRYILNVGKSAHCHANLSGRTECNAFDVIQGLEDMGSVQGFAGAADVDHCLESSGVIR 120 Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912 EI+ +V+ AE V FA PIP FPV++++ P PSFLQ GE P G+HIPAWLPAFPD TY Sbjct: 121 EIVHFVNDAEPVMFAHPIPRFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDPQTYSQ 180 Query: 
911 TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732 +P N R T+PR K +Q R+ K EW L+LQQ++ + E A ++P D + AAE Sbjct: 181 SPAVNGRGTEPRAVKFDQERESGKGEWPALNLQQQMVSNMFEKSASIDPADAKAKRVAAE 240 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 NPFL++PL+ +K+V+ V P L N+ L + V N S LETFAP IEA Sbjct: 241 GNPFLAAPLKIEDKEVASVPPPAKLFND---EALDNPVVENLVENEPISALETFAPAIEA 297 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 KS CD+++ + K +++P V FK G+ K LG + L Q KT+ WF Sbjct: 298 MKSTICDSKEDQTKFCANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHEKTLPWFAMEDEK 357 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE+IL+ES+ENP +LVQL Sbjct: 358 DDRKRRAEKILRESLENPDQLVQL 381 >gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis] Length = 372 Score = 358 bits (918), Expect = 6e-96 Identities = 207/393 (52%), Positives = 246/393 (62%), Gaps = 10/393 (2%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 M G RVNEH G DDF RA++KI VAQICE+ GFQ ++SAL+ L++I Sbjct: 1 MGHGEANGTRVNEHG------GGGADDFGRAVSKIVVAQICESVGFQSSKESALDALANI 54 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 IRYLCDLGK A+ YANL GRT+CNVFDIIR LE L +QGFPGA +V L SG + E Sbjct: 55 AIRYLCDLGKIANSYANLTGRTECNVFDIIRALEVLEASQGFPGAGDVGHCLVRSGAMKE 114 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 I YV AEE+PFAQP+P FPV++ ++ SF Q+GE P G+HIP WLPA PD HTY+H+ Sbjct: 115 IATYVDSAEEIPFAQPVPRFPVLKNRRLILSFEQMGENPLGQHIPTWLPALPDPHTYIHS 174 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLT-------CSGSEAPAPLNPEDEWK 750 PMWNER T+PR K+E ARQRRKAE SLLSLQQRL S S A PL D + Sbjct: 175 PMWNERNTEPRLHKLEHARQRRKAERSLLSLQQRLARNVGYAGASTSAAVPPLVGGDGNE 234 Query: 749 AKQAAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETF 570 +KQ E N FL PL GEKDVSP+V P K+ +E SVLE F Sbjct: 235 SKQ-VERNLFLEPPLHPGEKDVSPIV-------------FPGKILDERGKGDHASVLEAF 280 Query: 569 APVIEAA-KSGFCDTEDGERKVLP--SKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTV 399 AP IEA KSGF + + 
ER+VLP RP + FKF KK G LDL+L+ A G+ Sbjct: 281 APAIEAVKKSGFSEYGEDERRVLPGIEARPAIQFKFRTAKKYFGESLDLSLK-KAVGRPA 339 Query: 398 SWFGXXXXXXXXXXRAEQILKESMENPQELVQL 300 WFG RAE IL++SMENPQEL QL Sbjct: 340 FWFGRDEERDDKKRRAEFILRQSMENPQELNQL 372 >ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus communis] gi|223533005|gb|EEF34770.1| tbp-associated factor taf, putative [Ricinus communis] Length = 379 Score = 354 bits (908), Expect = 9e-95 Identities = 195/384 (50%), Positives = 249/384 (64%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEHD-VKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272 MS GGG+SGRV E + K+K GD+F+R+IAKIAVAQICE GFQ QQSALETLSD Sbjct: 1 MSHGGGQSGRVQEKSQLAKRKSGSSGDEFARSIAKIAVAQICECTGFQTFQQSALETLSD 60 Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092 +T+RY+C+LGK A AN AGR + N FDII+ LEEL ++QGF AS+V +ASSG+V Sbjct: 61 VTVRYICNLGKLAQGNANSAGRIEGNAFDIIQALEELCSSQGFASASDVDHCIASSGIVR 120 Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912 +I QYVS A++VPFA IP FP++R++K P F QIGE P +HIP WLPAFPD Y+ Sbjct: 121 DIAQYVSDADDVPFAYSIPPFPIVRERKLAPIFSQIGEKPPWEHIPDWLPAFPDPQIYLQ 180 Query: 911 TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732 +P NE TD K E AR K + SL LQQ T SGS+ P+ P ++ K E Sbjct: 181 SPTVNEGATDLNMQKFEPARLHPKIDRSL--LQQPFTSSGSQGPSSNVPAGGYEGKLIVE 238 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 NPF+++PLQ GEK+VS VV P LSNE + + +A+ SVL TFAP I+A Sbjct: 239 GNPFVAAPLQCGEKEVSHVVPPAKLSNETAVRN---PIEHNRLADNHVSVLNTFAPAIKA 295 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 S CD+E+G++KVL ++RP + FK +GKKSL L+L QN + K W Sbjct: 296 MNSRLCDSEEGQKKVLLNQRPAIQFKIAIGKKSLRTSLELGSQNKSAEKISPWSEKDNEN 355 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE+ILK+S+ENP EL QL Sbjct: 356 DDKKRRAEKILKQSIENPGELAQL 379 >ref|XP_003531863.1| PREDICTED: transcription initiation factor TFIID subunit 8-like isoform 1 [Glycine max] Length = 381 Score = 352 bits (904), Expect = 
3e-94 Identities = 182/384 (47%), Positives = 253/384 (65%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGR-VNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272 MS+GGG++GR + + ++K GGDD++RAIAKIAVAQ+CE+ GFQ QQSALE LSD Sbjct: 1 MSNGGGKTGRQLEQPGTWGRRKVGGGDDYARAIAKIAVAQVCESEGFQAFQQSALEALSD 60 Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092 + RY+ ++GK+AH +ANL+GRT+C+ FD+I+GLE++G+ QGF GAS+V L SSG++ Sbjct: 61 VVARYILNVGKSAHCHANLSGRTECHAFDVIQGLEDMGSVQGFAGASDVDHCLESSGVIR 120 Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912 EI+ +V+ AE V FA PIP FPV++++ P PSFLQ GE P G+HIPAWLPAFPD TY Sbjct: 121 EIVHFVNDAEPVMFAHPIPQFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDLQTYSE 180 Query: 911 TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732 +P+ N R T+PR K +Q R+ K EW ++ QQ++ + E A ++P D + AAE Sbjct: 181 SPVVNGRGTEPRAVKFDQERENGKGEWPAMNFQQQMVSNMFEKSALIDPADAKAKRVAAE 240 Query: 731 SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 NPFL++PL+ +K+V+ V P L N+V L + V N S +ETFAP IEA Sbjct: 241 GNPFLAAPLKIEDKEVASVPPPAKLFNDV---ALDNPVVENFVENEPISAMETFAPAIEA 297 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 KS CD+ + + K +++P V FK G+ K LG + L Q T+ WF Sbjct: 298 MKSTCCDSNEDQTKFRANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHKNTLPWFAMEDGK 357 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 RAE+IL+ES+ENP +LVQL Sbjct: 358 DDRKRRAEKILRESLENPDQLVQL 381 >ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215115 [Cucumis sativus] Length = 376 Score = 350 bits (899), Expect = 1e-93 Identities = 193/384 (50%), Positives = 252/384 (65%), Gaps = 1/384 (0%) Frame = -1 Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269 MSDGGGESG+V H+ K +K G +DF RA+AKIAVAQICE+ GFQ QQSALETL+D+ Sbjct: 1 MSDGGGESGKV--HERPKTRKNLGSEDFPRALAKIAVAQICESEGFQIFQQSALETLADV 58 Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089 +RY+ ++G TA+F AN AGRT+CN+FDII+ LE+LG+ QGF GAS++ LASS V E Sbjct: 59 
AVRYVQNMGSTANFCANFAGRTECNLFDIIQALEDLGSVQGFAGASDIEHCLASSSTVKE 118 Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909 +YV+ AEEVPFA +P FPV++++K PSFLQIGE P G+HIP+WLPA PD TY+ + Sbjct: 119 FARYVAQAEEVPFAYSVPKFPVVKERKLRPSFLQIGEEPPGEHIPSWLPALPDPETYIES 178 Query: 908 PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729 P+ E V +P+T K E +Q R E S +LQQ L C+G E +P + KQ ES Sbjct: 179 PIVKEEVVEPQTIKTEPEKQCR-TEKSFWNLQQWLFCNGLEGSQREDPRNAAMTKQIQES 237 Query: 728 NPFLSSPLQFGEKDVSPVVLP-TMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552 NPFL+ PLQFGEK+VS +VLP +L+N +P + +T SVLETFAP IE+ Sbjct: 238 NPFLAPPLQFGEKEVSSIVLPDKVLNNSSTEYHVP--VMENCQVDTHVSVLETFAPAIES 295 Query: 551 AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372 K+ F E K +++ V FK G GKK+ G ++L NN K+ SWF Sbjct: 296 IKNNF---HMSEEKYSLNRKSTVQFKIGTGKKAAGNMIELRALNNGVKKSSSWFVGEDEK 352 Query: 371 XXXXXRAEQILKESMENPQELVQL 300 +AE+ILK+SMEN EL L Sbjct: 353 DDKKRKAEKILKDSMENSNELSHL 376