BLASTX nr result

ID: Akebia22_contig00014307 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00014307
         (1931 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246...   449   e-123
ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260...   441   e-121
emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera]   432   e-118
emb|CBI21214.3| unnamed protein product [Vitis vinifera]              431   e-118
ref|XP_007215547.1| hypothetical protein PRUPE_ppa007206mg [Prun...   397   e-108
ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citr...   391   e-106
ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma...   390   e-106
ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Popu...   383   e-103
ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313...   374   e-101
ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Popu...   370   2e-99
ref|XP_006354362.1| PREDICTED: transcription initiation factor T...   369   2e-99
ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292...   369   2e-99
ref|XP_002519508.1| conserved hypothetical protein [Ricinus comm...   368   5e-99
ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264...   366   2e-98
ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [A...   363   1e-97
ref|XP_003552582.1| PREDICTED: transcription initiation factor T...   360   1e-96
gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis]     358   6e-96
ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus...   354   9e-95
ref|XP_003531863.1| PREDICTED: transcription initiation factor T...   352   3e-94
ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215...   350   1e-93

>ref|XP_002277032.1| PREDICTED: uncharacterized protein LOC100246447 [Vitis vinifera]
          Length = 377

 Score =  449 bits (1156), Expect = e-123
 Identities = 235/383 (61%), Positives = 279/383 (72%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGGGESGR ++   K+K       DF +AIAKIAVAQICE+AGFQG QQSALETLS++
Sbjct: 1    MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+    LA SG V E
Sbjct: 58   VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I+QYVS AEE+PFA  +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD  TYVH+
Sbjct: 118  IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P+ NER  DP    IEQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D  KA++AAE+
Sbjct: 178  PVLNERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 237

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFLS+PL FGEK VSPV LP  LSNE V      +  N AVAN   SVLETFAP IE  
Sbjct: 238  NPFLSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGE--NHAVAN-HVSVLETFAPAIELM 294

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            KS  C++E+G +KVL ++RP V FK  +GKKS G  LDL+ QN    K  SWFG      
Sbjct: 295  KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 354

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE+ILKESM+NPQEL QL
Sbjct: 355  DKKRRAEKILKESMKNPQELAQL 377


>ref|XP_002282259.1| PREDICTED: uncharacterized protein LOC100260255 [Vitis vinifera]
          Length = 368

 Score =  441 bits (1135), Expect = e-121
 Identities = 228/384 (59%), Positives = 280/384 (72%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGG +  R ++++  K   R G D+F RA++KIAVAQICE+ GF+G Q SAL+ LS+I
Sbjct: 1    MSDGGEDDRRNSDNNAPK---RAGPDEFGRAVSKIAVAQICESVGFEGFQDSALQALSNI 57

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RYLCD+GKTA+F ANLAGRT CNVFD+IRGLE+LG+++GF GAS V   + SSG V E
Sbjct: 58   AVRYLCDVGKTANFCANLAGRTQCNVFDVIRGLEDLGSSEGFSGASGVDQCIVSSGTVRE 117

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I++YV+ A+E+PFAQP+P FPV+R  K TPSF+Q+GETP GKHIP WLPAFPDSHTY+ T
Sbjct: 118  IVEYVNSAKEIPFAQPVPRFPVVRNCKATPSFVQMGETPVGKHIPPWLPAFPDSHTYIQT 177

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-EAPAPLNPEDEWKAKQAAE 732
            PMWNER TDPR DK+EQARQRRKAE SLLSLQQRL C+GS  A   +   D+ +A +AAE
Sbjct: 178  PMWNERATDPRADKLEQARQRRKAERSLLSLQQRLVCNGSASASTSVGRCDDAEASRAAE 237

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
             NP+L+SPLQFGEKDVS VV             LPAKL ++ V +   SVLETFAP IEA
Sbjct: 238  GNPYLASPLQFGEKDVSTVV-------------LPAKLLDDLVVDNHVSVLETFAPAIEA 284

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
             K+ F D+ + E+ V+P KR  VHFK   GKK LG  +DL L+N + GK VS  G     
Sbjct: 285  VKNSFVDSGESEKNVVPEKRSAVHFKLRTGKKILGESVDLRLKNKSVGKVVSLIGRDEER 344

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE IL++SMENPQEL QL
Sbjct: 345  DDKKRRAEYILRQSMENPQELTQL 368


>emb|CAN79809.1| hypothetical protein VITISV_014912 [Vitis vinifera]
          Length = 366

 Score =  432 bits (1111), Expect = e-118
 Identities = 229/383 (59%), Positives = 274/383 (71%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGGGESGR ++   K+K       DF +AIAKIAVAQICE+AGFQG QQSALETLS++
Sbjct: 1    MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+    LA SG V E
Sbjct: 58   VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I+QYVS AEE+PFA  +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD  TYVH+
Sbjct: 118  IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P+            +EQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D  KA++AAE+
Sbjct: 178  PV-----------TLEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 226

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFLS+PL FGEK VSPV LP  LSNE V      +  N AVAN   SVLETFAP IE  
Sbjct: 227  NPFLSAPLHFGEKGVSPVFLPAKLSNEAVVENQAGE--NHAVAN-HVSVLETFAPAIELM 283

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            KS  C++E+G +KVL ++RP V FK  +GKKS G  LDL+ QN    K  SWFG      
Sbjct: 284  KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 343

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE+ILKESM+NPQEL QL
Sbjct: 344  DKKRRAEKILKESMKNPQELAQL 366


>emb|CBI21214.3| unnamed protein product [Vitis vinifera]
          Length = 357

 Score =  431 bits (1109), Expect = e-118
 Identities = 228/383 (59%), Positives = 271/383 (70%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGGGESGR ++   K+K       DF +AIAKIAVAQICE+AGFQG QQSALETLS++
Sbjct: 1    MSDGGGESGRESDRATKRKSSDR---DFPQAIAKIAVAQICESAGFQGFQQSALETLSEV 57

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RY+ +LGKTAH YAN A RT+CN+FDII+GLE+L + QGF GAS+    LA SG V E
Sbjct: 58   VVRYIRELGKTAHTYANSACRTECNIFDIIQGLEDLASLQGFSGASDSDHCLAGSGTVRE 117

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I+QYVS AEE+PFA  +PHFPVIR +K TPSFLQIGE P G HIP WLPAFPD  TYVH+
Sbjct: 118  IVQYVSEAEEIPFAHSVPHFPVIRDRKQTPSFLQIGEEPPGDHIPDWLPAFPDPQTYVHS 177

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P+ NER  DP    IEQARQ +KAEWSLL+LQQ+L C+G E P+ ++P D  KA++AAE+
Sbjct: 178  PVLNERGADPCAGNIEQARQHKKAEWSLLNLQQQLACNGLEGPSMIDPGDAAKARRAAET 237

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFLS+PL FGEK VSPV              LPAKLSNEA          TFAP IE  
Sbjct: 238  NPFLSAPLHFGEKGVSPVF-------------LPAKLSNEA----------TFAPAIELM 274

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            KS  C++E+G +KVL ++RP V FK  +GKKS G  LDL+ QN    K  SWFG      
Sbjct: 275  KSRSCESEEGRKKVLSNQRPAVQFKIEIGKKSTGTALDLSFQNKDVEKITSWFGKDNEKD 334

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE+ILKESM+NPQEL QL
Sbjct: 335  DKKRRAEKILKESMKNPQELAQL 357


>ref|XP_007215547.1| hypothetical protein PRUPE_ppa007206mg [Prunus persica]
            gi|462411697|gb|EMJ16746.1| hypothetical protein
            PRUPE_ppa007206mg [Prunus persica]
          Length = 378

 Score =  397 bits (1020), Expect = e-108
 Identities = 212/383 (55%), Positives = 263/383 (68%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGGGESGR +E   + ++K   GDDF+RAIAKIAVAQ+CE  GFQ  Q SALETLSD+
Sbjct: 1    MSDGGGESGREHEQHNRTQRK-SSGDDFARAIAKIAVAQVCEIVGFQTYQLSALETLSDV 59

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             + Y+ ++GKTAHFYANL+GR DCNVFDII+GLE+LG AQGF GAS+V   LASSG V E
Sbjct: 60   AVHYIHNIGKTAHFYANLSGRMDCNVFDIIQGLEDLGLAQGFAGASDVDHCLASSGTVRE 119

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I QYV   E +PF+  IP FPV++ +K TPSFLQ G    G+HIP WLPAFP+ HTYV +
Sbjct: 120  IAQYVGETEHIPFSYSIPQFPVVKDRKLTPSFLQSGVETLGEHIPIWLPAFPEPHTYVPS 179

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P+ NER  +  TD IEQ +++R  E SL +LQ+RL C+G E P+ ++P D  KAKQA ES
Sbjct: 180  PISNERARELHTDMIEQKKKQRNVERSLFNLQRRLVCNGLEGPS-IDPGDADKAKQARES 238

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFL++PLQ+GE +VS V LP  LS+E     L A+     VA    SVLETFAP IEA 
Sbjct: 239  NPFLAAPLQYGETEVSHVALPAKLSSEATVEKLVAE---NRVAEKCSSVLETFAPAIEAM 295

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            KS  C++++  +++L S+RP V FK G+ K S    L  +  N    K  SWFG      
Sbjct: 296  KSSSCESQEEHKEILLSRRPTVQFKIGIAKTSFSTMLHSSPHNKGFQKNYSWFGRENEKD 355

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE+ILK SMEN QEL QL
Sbjct: 356  EKKKRAEKILKNSMENSQELAQL 378


>ref|XP_006428393.1| hypothetical protein CICLE_v10012002mg [Citrus clementina]
            gi|568880174|ref|XP_006493009.1| PREDICTED: transcription
            initiation factor TFIID subunit 8-like [Citrus sinensis]
            gi|568885488|ref|XP_006495304.1| PREDICTED: transcription
            initiation factor TFIID subunit 8-like [Citrus sinensis]
            gi|557530450|gb|ESR41633.1| hypothetical protein
            CICLE_v10012002mg [Citrus clementina]
          Length = 370

 Score =  391 bits (1005), Expect = e-106
 Identities = 207/384 (53%), Positives = 256/384 (66%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            M+ GGGES   +E        R   +DFSRA++K+AVAQICE+ GFQG + SAL+ L DI
Sbjct: 1    MNHGGGESTSRSESRTDTSSDRPKAEDFSRAVSKMAVAQICESVGFQGFKDSALDALLDI 60

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             IRY+CDLGKT+ F ANLA RT+CN+FDIIRG+E+L   +GF GA+ +   L  SG+V E
Sbjct: 61   AIRYICDLGKTSSFQANLACRTECNLFDIIRGIEDLEVLKGFMGAAEIGKCLVGSGIVKE 120

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I+ +V   EE+PFAQPIP +PVIR ++  PSF ++ ETP GKHIP+WLPAFPD HTY++T
Sbjct: 121  IIDFVESKEEIPFAQPIPQYPVIRSRRLIPSFEEMNETPPGKHIPSWLPAFPDPHTYIYT 180

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            PMWNER +DPR DKIE ARQRRKAE +LLSLQQRL C+G    +   P ++ +      S
Sbjct: 181  PMWNERKSDPRADKIELARQRRKAEMALLSLQQRLVCNGETGTSASRPANDEEELLKTGS 240

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPF + PLQ GEKD+SPV              LPAKL ++       SV+E FAP IEA 
Sbjct: 241  NPFFAKPLQSGEKDISPV-------------GLPAKLKDKMSGGNHMSVMEAFAPAIEAV 287

Query: 548  K-SGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
            K SGF D  DG+R+ LP KRP VHFKF  GKK LG  LD +LQ    G+  + F      
Sbjct: 288  KVSGFSDDADGDRRYLPEKRPAVHFKFRAGKKFLGEILDSSLQKKG-GRRSASFWRDEEK 346

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE ILK+S+ENPQEL QL
Sbjct: 347  DDKKRRAEFILKQSIENPQELSQL 370


>ref|XP_007027393.1| TBP-associated factor 8, putative [Theobroma cacao]
            gi|508715998|gb|EOY07895.1| TBP-associated factor 8,
            putative [Theobroma cacao]
          Length = 373

 Score =  390 bits (1003), Expect = e-106
 Identities = 213/386 (55%), Positives = 261/386 (67%), Gaps = 4/386 (1%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKK---REGGDDFSRAIAKIAVAQICETAGFQGIQQSALETL 1278
            MS GG ES R       ++     R   DDF RA++KI+VAQICE  G+QG ++SALE L
Sbjct: 1    MSHGGVESTRDTRESEGQRSLPLGRPKADDFGRAVSKISVAQICECVGYQGFKESALEAL 60

Query: 1277 SDITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGM 1098
            +DI IRYLCDLGKT+ F+ANLAGRT+CN+FDI + LEELG + GF GAS +   LA SG 
Sbjct: 61   ADIAIRYLCDLGKTSSFHANLAGRTECNMFDITQSLEELGASYGFSGASEIGHCLAGSGA 120

Query: 1097 VGEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTY 918
            V EI+Q+V   EE+PFAQP+P FPV+R +K  PSF  + ETP GKHIPAWLPAFPD HTY
Sbjct: 121  VREIIQFVGSKEEIPFAQPVPQFPVVRNRKLIPSFEHMNETPPGKHIPAWLPAFPDPHTY 180

Query: 917  VHTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-EAPAPLNPEDEWKAKQ 741
            +HTPMWNER +DPR DKIEQARQRRKAE +LLSLQQRL C+GS E  A L  + + +  Q
Sbjct: 181  IHTPMWNERASDPRADKIEQARQRRKAERALLSLQQRLVCNGSTETSASLVVDAKKETIQ 240

Query: 740  AAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPV 561
             A +N FL++PLQ GEKDV+ VV             LPAKLS+E   +   S+LE FAP 
Sbjct: 241  EAGNNAFLAAPLQPGEKDVARVV-------------LPAKLSDEVSKDNHVSLLEAFAPA 287

Query: 560  IEAAKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXX 381
            IEA K G     DGE+ +LP +RP VHFKF  GKK LG  LDL+LQ   E ++ ++F   
Sbjct: 288  IEAMKGGPSGELDGEKMLLPERRPAVHFKFRTGKKILGESLDLSLQKKGE-RSTTFFLRD 346

Query: 380  XXXXXXXXRAEQILKESMENPQELVQ 303
                    RAE IL+++ E P EL Q
Sbjct: 347  EERDDKKRRAEFILRQTTEYPMELNQ 372


>ref|XP_002323904.1| hypothetical protein POPTR_0017s13060g [Populus trichocarpa]
            gi|566213067|ref|XP_006373367.1| hypothetical protein
            POPTR_0017s13060g [Populus trichocarpa]
            gi|222866906|gb|EEF04037.1| hypothetical protein
            POPTR_0017s13060g [Populus trichocarpa]
            gi|550320186|gb|ERP51164.1| hypothetical protein
            POPTR_0017s13060g [Populus trichocarpa]
          Length = 382

 Score =  383 bits (984), Expect = e-103
 Identities = 208/385 (54%), Positives = 258/385 (67%), Gaps = 2/385 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEH--DVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLS 1275
            MS GGGESGR+++   D  K+K R  GD+F+RAIAKIAVAQ+CET GFQ  QQSALE LS
Sbjct: 1    MSHGGGESGRLHDKAGDSGKRKSRVSGDEFTRAIAKIAVAQMCETVGFQSFQQSALEKLS 60

Query: 1274 DITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMV 1095
            D+T  Y+ +LGKTA FYANLAGRT+ NVFD+I+G+EELG +QGF GASNV   LASSG+V
Sbjct: 61   DVTTWYIRNLGKTAQFYANLAGRTEGNVFDVIQGMEELGLSQGFAGASNVDHCLASSGIV 120

Query: 1094 GEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYV 915
             EI+QY+  AE++PF   IP FPV R++KP PSF QI E    +HIPAWLPAFPD  T+V
Sbjct: 121  REIVQYIGDAEDIPFVYSIPPFPVARERKPVPSFFQICEESPAEHIPAWLPAFPDPQTHV 180

Query: 914  HTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAA 735
              P  NE       DKIE AR   K + S ++L Q  TC+GS  P+ +   +  +A Q  
Sbjct: 181  QLPAGNEGDAVFNADKIEPARHHLKMDMSSMNLPQHFTCNGSGGPSSVTFGNSARATQGT 240

Query: 734  ESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIE 555
            ESNPFL++PLQFGEK+VS +V P  LS+E         +    + +   SVLETFAP IE
Sbjct: 241  ESNPFLAAPLQFGEKEVSHLVPPARLSDEAAVR---YPVEQNRIMDNHISVLETFAPAIE 297

Query: 554  AAKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXX 375
            A KS FCD+E+G++KVL ++RP V FK  VGK SL    DL+ Q     K   WFG    
Sbjct: 298  AMKSRFCDSEEGQKKVLLNQRPAVQFKIQVGKNSLAGAPDLSPQKIGIEKISKWFGKDSE 357

Query: 374  XXXXXXRAEQILKESMENPQELVQL 300
                  RAE+ILK+SMENP EL +L
Sbjct: 358  NDDKKRRAEKILKQSMENPSELGEL 382


>ref|XP_004306253.1| PREDICTED: uncharacterized protein LOC101313446 [Fragaria vesca
            subsp. vesca]
          Length = 390

 Score =  374 bits (961), Expect = e-101
 Identities = 212/402 (52%), Positives = 262/402 (65%), Gaps = 20/402 (4%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEH-----DVKKKKKR---EGGDDFSRAIAKIAVAQICETAGFQGIQQS 1293
            MS G  ES RVNE      D  ++ ++    GGD+F RA++K+AVAQICE  GF G ++S
Sbjct: 1    MSHGDAESSRVNESGSGEDDAPRRAQQLSGGGGDEFGRAVSKVAVAQICEGVGFLGCKES 60

Query: 1292 ALETLSDITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSL 1113
            AL++L+DI IRYL DLGK A++YANLAGRT+ NVFD++RGLE+L  +QGF GA+ V   L
Sbjct: 61   ALDSLADIAIRYLRDLGKMANYYANLAGRTESNVFDVVRGLEDLEASQGFSGAAEVRHCL 120

Query: 1112 ASSGMVGEIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFP 933
            A SG +  ++QYV  AEE+PFAQ +P FPV++ ++   SF ++GE P GKH+P WLPAFP
Sbjct: 121  AGSGTMKGLVQYVGTAEEIPFAQSLPRFPVVKDRRLILSFERMGEAPPGKHLPNWLPAFP 180

Query: 932  DSHTYVHTPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGS-------EAPAP 774
            D HTY+H+PMWNER TDPR DKIEQARQRRKAE SLLSLQQRL C+GS        AP  
Sbjct: 181  DPHTYIHSPMWNERKTDPREDKIEQARQRRKAERSLLSLQQRLLCNGSAPGLASPSAPVS 240

Query: 773  LNPEDEWKAK-QAAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVAN 597
            +   D    K Q  ESNPFL  PLQ GEKDVSPVVLP+  S EV+A              
Sbjct: 241  VVGNDGKGLKLQGGESNPFLEPPLQPGEKDVSPVVLPSKFS-EVLAK------------G 287

Query: 596  TQPSVLETFAPVIEAAKSGFCDTEDG----ERKVLPSKRPVVHFKFGVGKKSLGVPLDLT 429
               SVLE FAP I+A K+G     +G    E K+LP+ RP VH KF   KK LG   DL+
Sbjct: 288  NSSSVLEAFAPAIQAVKNGVWMDGEGDVEEESKLLPNSRPPVHLKFRPVKKFLGESSDLS 347

Query: 428  LQNNAEGKTVSWFGXXXXXXXXXXRAEQILKESMENPQELVQ 303
            LQ    G+  +W            RAE IL++SM+NPQEL Q
Sbjct: 348  LQKKGSGRPANWVLRDEERDEKKRRAEFILRQSMQNPQELNQ 389


>ref|XP_002305385.1| hypothetical protein POPTR_0004s11520g [Populus trichocarpa]
            gi|222848349|gb|EEE85896.1| hypothetical protein
            POPTR_0004s11520g [Populus trichocarpa]
          Length = 394

 Score =  370 bits (949), Expect = 2e-99
 Identities = 202/383 (52%), Positives = 252/383 (65%), Gaps = 3/383 (0%)
 Frame = -1

Query: 1439 GGGESGRVNE---HDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            GGGESGR++E   H+  K+K R  GD+F+RAI KIAVAQ+CE+ GFQ  QQSALETL+D+
Sbjct: 16   GGGESGRLHEKVGHN-GKRKSRASGDEFARAIGKIAVAQMCESMGFQSFQQSALETLTDV 74

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
            T  Y+ ++GK A   ANLAGRT+ NVFD+I+GLEELG  QGF GAS+V   LASSG+V E
Sbjct: 75   TTWYIRNIGKAAQLCANLAGRTEGNVFDVIQGLEELGLPQGFAGASDVDHCLASSGIVRE 134

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I QY+  A+++PFA  IP FPV R++KP PSF QIGE P  +HIPAWLPAFPD  TY   
Sbjct: 135  IAQYIGDADDIPFAYSIPPFPVARERKPAPSFSQIGEEPPEEHIPAWLPAFPDPQTYAQL 194

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P  NE   D   D IE  RQ +K + S ++L Q+  C+GSE P+ +   D  KA Q   S
Sbjct: 195  PEGNEGRADLNADNIESVRQHQKMDVSYMNLPQQFNCNGSEGPSSVAFGDSAKATQRTVS 254

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFL++PLQFG K+VS VV P  LS+E         +      +   SV++TFAP IEA 
Sbjct: 255  NPFLAAPLQFGVKEVSHVVPPAKLSDEAAVR---YPVEQTRTMDNNMSVMKTFAPAIEAM 311

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            KS  CD+ +G++KV  ++RP V FK GVGK SL    DL+LQN    K   W G      
Sbjct: 312  KSRLCDSGEGQKKVFFNQRPAVQFKIGVGKNSLDGAPDLSLQNKGIKKISMWSGKDSEND 371

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE+ILK+SMENP EL QL
Sbjct: 372  DQKRRAEKILKQSMENPGELAQL 394


>ref|XP_006354362.1| PREDICTED: transcription initiation factor TFIID subunit 8-like
            [Solanum tuberosum]
          Length = 374

 Score =  369 bits (948), Expect = 2e-99
 Identities = 204/383 (53%), Positives = 246/383 (64%), Gaps = 3/383 (0%)
 Frame = -1

Query: 1439 GGGESGRVNEHDVKK-KKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITI 1263
            G  E  R  E  V   +++R G DDF RAI++ AVAQICE+ GF+   +SALE+L+DI I
Sbjct: 5    GNAEDKREKESTVDNTREERAGTDDFGRAISRTAVAQICESIGFEIFNESALESLADIAI 64

Query: 1262 RYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIM 1083
            +Y+ DLGKTA   ANLAGRT CNVFDII GLE++  + GF  AS V     SSG+V E++
Sbjct: 65   KYILDLGKTASSSANLAGRTQCNVFDIIHGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124

Query: 1082 QYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPM 903
            +YV  AEE+PF+QP+PHFPV++     PSFLQIGETP  KHIP WLPAFPD HTYV TP 
Sbjct: 125  EYVESAEEIPFSQPLPHFPVVKHPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184

Query: 902  WNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSE-APAPLNPEDEWKAKQAAES- 729
            WNER +DPR DKIE ARQRRKAE SLL+LQQRL C+GS        P+D      A++S 
Sbjct: 185  WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVGSTSRQPDDVGITSSASKSE 244

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFL+ P Q GEKDV PV LPT             KLS+E       S+LETF+P I+A 
Sbjct: 245  NPFLAKPFQAGEKDVDPVALPT-------------KLSSEVDDKNHVSLLETFSPAIQAM 291

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            K G  +T +G  K LP KRP V  +F  GKK+LG  LDL L     G+  S F       
Sbjct: 292  KDGLSETVNGTEKTLPDKRPAVCLEFRPGKKALGDSLDLRLWKKGSGRNASLFRRDEDRD 351

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE IL++S EN QEL QL
Sbjct: 352  DKKRRAELILRQSRENQQELTQL 374


>ref|XP_004304222.1| PREDICTED: uncharacterized protein LOC101292232 [Fragaria vesca
            subsp. vesca]
          Length = 379

 Score =  369 bits (948), Expect = 2e-99
 Identities = 203/384 (52%), Positives = 255/384 (66%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVK-KKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272
            MSDGGGES R +E   +   +K   GDDF+RA++KIAVAQ+CE  G+Q  Q SALETLSD
Sbjct: 1    MSDGGGESAREHEQSNRITLRKPSCGDDFARAVSKIAVAQVCEVVGYQSFQLSALETLSD 60

Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092
            + ++Y+ ++GKTAH YANL+GRTDCNVFDII+GLE+L  AQGF GAS++   LASSG + 
Sbjct: 61   VAVQYIRNVGKTAHLYANLSGRTDCNVFDIIQGLEDLSAAQGFAGASDINHCLASSGTIK 120

Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912
            EI QYV+ AE VPFA  IP FPV++ +K TPSF Q GE   G+HIP WLPAFP+ HTY  
Sbjct: 121  EISQYVAEAEHVPFAYTIPRFPVVKDRKLTPSFWQSGEETPGEHIPTWLPAFPEPHTYSR 180

Query: 911  TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732
            +   NE  T+P +  +EQ +Q+R  E ++L+   RL C+G E P+ L+P D   AKQA E
Sbjct: 181  STTCNEGATEPDSALVEQEKQQRNVERAMLNFHHRLVCNGMEGPS-LDPGDGVNAKQARE 239

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
            SNPFL++PLQFGE +VS V LP  LS E    TL  K  N A  +   SVLETFAP IEA
Sbjct: 240  SNPFLATPLQFGETEVSQVTLPAKLSIEATEETL--KAENHA-KDKCSSVLETFAPAIEA 296

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
             K+   + E+ ++K L S++P V FK G+ KKSLG  L          +   WFG     
Sbjct: 297  IKNKPFEVEE-DQKTLLSRKPTVQFKIGMSKKSLGTMLYSGPHKKGFEEVYPWFGRENEK 355

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE+ILK SMEN QEL QL
Sbjct: 356  DEKKRRAEKILKNSMENSQELAQL 379


>ref|XP_002519508.1| conserved hypothetical protein [Ricinus communis]
            gi|223541371|gb|EEF42922.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 356

 Score =  368 bits (945), Expect = 5e-99
 Identities = 193/371 (52%), Positives = 244/371 (65%), Gaps = 2/371 (0%)
 Frame = -1

Query: 1406 DVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITIRYLCDLGKTAHF 1227
            D +    R   DDF RA++++AVAQICE+ GF G ++SAL++L+++ IRY+ DLGK A+ 
Sbjct: 5    DEESTSARRKADDFGRAVSRMAVAQICESVGFHGCKESALDSLTEVAIRYIIDLGKIANS 64

Query: 1226 YANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIMQYVSLAEEVPFA 1047
            +ANL+GRT CN+FDI+RG E++G   GF GASN    +  SG V EI+++V   EE+PFA
Sbjct: 65   HANLSGRTQCNLFDIVRGFEDVGAPLGFSGASNSGNCVVCSGTVKEIIEFVESTEEIPFA 124

Query: 1046 QPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPMWNERVTDPRTDK 867
            QP+P FPV+R ++  PSFL +GE P GKHIPAWLPA PD HTYVHTPMWNERV DPR +K
Sbjct: 125  QPVPPFPVVRDKRLIPSFLNMGEIPPGKHIPAWLPALPDPHTYVHTPMWNERVVDPRAEK 184

Query: 866  IEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEW-KAKQAAESNPFLSSPLQFGEK 690
            IEQARQRRKAE +LLSLQQRL  +GS   +     + + +     ESN FL+ PL+ GEK
Sbjct: 185  IEQARQRRKAERALLSLQQRLLSNGSAGASTSVASNHYVQELGVGESNRFLARPLKPGEK 244

Query: 689  DVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAAK-SGFCDTEDGER 513
             VS VV+P  L   V                    +++ F P IEAAK  GF D E+ ER
Sbjct: 245  AVSTVVVPDKLKTSV-------------------PLIKAFEPAIEAAKGGGFADDEESER 285

Query: 512  KVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXXXXXXRAEQILKE 333
            K+LP KRP V+FKF  GKK LG PLDL+L   + G    W G          RAE IL++
Sbjct: 286  KLLPEKRPAVNFKFKTGKKMLGEPLDLSLSRKSGGTAGHWLGPVDERDDKKRRAEYILRQ 345

Query: 332  SMENPQELVQL 300
            SMENPQEL QL
Sbjct: 346  SMENPQELTQL 356


>ref|XP_004246634.1| PREDICTED: uncharacterized protein LOC101264247 [Solanum
            lycopersicum]
          Length = 373

 Score =  366 bits (939), Expect = 2e-98
 Identities = 203/383 (53%), Positives = 249/383 (65%), Gaps = 3/383 (0%)
 Frame = -1

Query: 1439 GGGESGRVNEHDVKK-KKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDITI 1263
            G  E  R  E  V   +++R G DDF RA+++ AVAQICE+ GF+   +SALE+L+DI I
Sbjct: 5    GNAEDKREKESTVDNTREERIGTDDFGRAVSRTAVAQICESIGFEIFNESALESLADIAI 64

Query: 1262 RYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGEIM 1083
            +Y+ DLGKTA+  AN+AGRT CNVFDII+GLE++  + GF  AS V     SSG+V E++
Sbjct: 65   KYILDLGKTANSKANIAGRTQCNVFDIIQGLEDMCASTGFLRASEVNRCGLSSGIVSEMV 124

Query: 1082 QYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHTPM 903
            +YV  AEE+PF+QP+PHFPV+++    PSFLQIGETP  KHIP WLPAFPD HTYV TP 
Sbjct: 125  EYVESAEEIPFSQPLPHFPVVKQPNLIPSFLQIGETPPFKHIPPWLPAFPDPHTYVRTPT 184

Query: 902  WNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSE-APAPLNPEDEWKAKQAAES- 729
            WNER +DPR DKIE ARQRRKAE SLL+LQQRL C+GS  A     P+D      A++S 
Sbjct: 185  WNERASDPRADKIELARQRRKAERSLLNLQQRLVCNGSAVASTSRQPDDVGITSSASKSE 244

Query: 728  NPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEAA 549
            NPFL+ P Q GEKDV PV LPT             KLS+E       S+LETF+P I+A 
Sbjct: 245  NPFLAKPFQAGEKDVDPVALPT-------------KLSSEVDDKNHVSLLETFSPAIQAM 291

Query: 548  KSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXXX 369
            K G  +T DG  K LP KRP V  +F  GKK+LG  LDL L      +  S F       
Sbjct: 292  KDGLSETVDGTEKTLPDKRPAVCLEFRPGKKALGDSLDLRLWKKG-SRNASLFRRDEDRD 350

Query: 368  XXXXRAEQILKESMENPQELVQL 300
                RAE IL++S EN QEL QL
Sbjct: 351  DKKRRAELILRQSRENQQELTQL 373


>ref|XP_006845883.1| hypothetical protein AMTR_s00154p00079940 [Amborella trichopoda]
            gi|548848527|gb|ERN07558.1| hypothetical protein
            AMTR_s00154p00079940 [Amborella trichopoda]
          Length = 375

 Score =  363 bits (933), Expect = 1e-97
 Identities = 199/388 (51%), Positives = 266/388 (68%), Gaps = 5/388 (1%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            M+DGGGES R  +    ++   +  D+F RA+ +++VAQICE+AG+   Q+SALE L+DI
Sbjct: 1    MNDGGGESRRNIDECKSERGGEQEEDEFGRAVTRVSVAQICESAGYHTFQRSALEALADI 60

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RYL DLG++A F+ANLAGRT CNVFD+I+ LE+LG++QGF GAS+V   LA+SG + +
Sbjct: 61   ALRYLRDLGRSARFHANLAGRTACNVFDVIQALEDLGSSQGFAGASDVNHPLAASGALKD 120

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I++Y ++AEE+PFA+ +P FP+ + +KPTPSFLQ+GETP  KHIP+WLPAFPD HTY+HT
Sbjct: 121  IIRYTNIAEEIPFARAVPRFPIPKTRKPTPSFLQLGETPPHKHIPSWLPAFPDPHTYIHT 180

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE- 732
            P+WNER +DPRT+K+EQARQRRKAE SL+SLQQRL C+G+      + + E K K+  + 
Sbjct: 181  PVWNERGSDPRTEKLEQARQRRKAEKSLVSLQQRLACNGA---TMASMDGELKGKRPLDG 237

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
            +NPFL+ PL  GEK+ S V +P  LS +     +  K    +V N        FAP  EA
Sbjct: 238  NNPFLAPPLLSGEKEASLVPMPAGLSLKSPDENIEKKPGGLSVVN-------AFAPANEA 290

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLG-VPLDLTLQNNAEG---KTVSWFGX 384
            AK G     D  R++ P KRPVV FKFG+ K+++   PL    + N  G     +SWF  
Sbjct: 291  AKGG--GLIDEARQLKP-KRPVVQFKFGLDKRTVNPAPLLFGNRYNRTGGNATDMSWFSR 347

Query: 383  XXXXXXXXXRAEQILKESMENPQELVQL 300
                     RAEQILKE+MENPQELVQL
Sbjct: 348  DEEKDDKKKRAEQILKEAMENPQELVQL 375


>ref|XP_003552582.1| PREDICTED: transcription initiation factor TFIID subunit 8-like
            isoform X1 [Glycine max]
          Length = 381

 Score =  360 bits (925), Expect = 1e-96
 Identities = 185/384 (48%), Positives = 255/384 (66%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGR-VNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272
            MS+GGG++GR + +    +++K  GGDD++RAIAKIAVAQ+CE  GFQ  QQSALE LSD
Sbjct: 1    MSNGGGKTGRQLEQPGTWRRRKVGGGDDYARAIAKIAVAQVCEGEGFQAFQQSALEALSD 60

Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092
            + +RY+ ++GK+AH +ANL+GRT+CN FD+I+GLE++G+ QGF GA++V   L SSG++ 
Sbjct: 61   VVVRYILNVGKSAHCHANLSGRTECNAFDVIQGLEDMGSVQGFAGAADVDHCLESSGVIR 120

Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912
            EI+ +V+ AE V FA PIP FPV++++ P PSFLQ GE P G+HIPAWLPAFPD  TY  
Sbjct: 121  EIVHFVNDAEPVMFAHPIPRFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDPQTYSQ 180

Query: 911  TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732
            +P  N R T+PR  K +Q R+  K EW  L+LQQ++  +  E  A ++P D    + AAE
Sbjct: 181  SPAVNGRGTEPRAVKFDQERESGKGEWPALNLQQQMVSNMFEKSASIDPADAKAKRVAAE 240

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
             NPFL++PL+  +K+V+ V  P  L N+     L   +    V N   S LETFAP IEA
Sbjct: 241  GNPFLAAPLKIEDKEVASVPPPAKLFND---EALDNPVVENLVENEPISALETFAPAIEA 297

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
             KS  CD+++ + K   +++P V FK G+  K LG  + L  Q     KT+ WF      
Sbjct: 298  MKSTICDSKEDQTKFCANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHEKTLPWFAMEDEK 357

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE+IL+ES+ENP +LVQL
Sbjct: 358  DDRKRRAEKILRESLENPDQLVQL 381


>gb|EXC16168.1| hypothetical protein L484_024336 [Morus notabilis]
          Length = 372

 Score =  358 bits (918), Expect = 6e-96
 Identities = 207/393 (52%), Positives = 246/393 (62%), Gaps = 10/393 (2%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            M  G     RVNEH         G DDF RA++KI VAQICE+ GFQ  ++SAL+ L++I
Sbjct: 1    MGHGEANGTRVNEHG------GGGADDFGRAVSKIVVAQICESVGFQSSKESALDALANI 54

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             IRYLCDLGK A+ YANL GRT+CNVFDIIR LE L  +QGFPGA +V   L  SG + E
Sbjct: 55   AIRYLCDLGKIANSYANLTGRTECNVFDIIRALEVLEASQGFPGAGDVGHCLVRSGAMKE 114

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
            I  YV  AEE+PFAQP+P FPV++ ++   SF Q+GE P G+HIP WLPA PD HTY+H+
Sbjct: 115  IATYVDSAEEIPFAQPVPRFPVLKNRRLILSFEQMGENPLGQHIPTWLPALPDPHTYIHS 174

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLT-------CSGSEAPAPLNPEDEWK 750
            PMWNER T+PR  K+E ARQRRKAE SLLSLQQRL         S S A  PL   D  +
Sbjct: 175  PMWNERNTEPRLHKLEHARQRRKAERSLLSLQQRLARNVGYAGASTSAAVPPLVGGDGNE 234

Query: 749  AKQAAESNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETF 570
            +KQ  E N FL  PL  GEKDVSP+V              P K+ +E       SVLE F
Sbjct: 235  SKQ-VERNLFLEPPLHPGEKDVSPIV-------------FPGKILDERGKGDHASVLEAF 280

Query: 569  APVIEAA-KSGFCDTEDGERKVLP--SKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTV 399
            AP IEA  KSGF +  + ER+VLP    RP + FKF   KK  G  LDL+L+  A G+  
Sbjct: 281  APAIEAVKKSGFSEYGEDERRVLPGIEARPAIQFKFRTAKKYFGESLDLSLK-KAVGRPA 339

Query: 398  SWFGXXXXXXXXXXRAEQILKESMENPQELVQL 300
             WFG          RAE IL++SMENPQEL QL
Sbjct: 340  FWFGRDEERDDKKRRAEFILRQSMENPQELNQL 372


>ref|XP_002527631.1| tbp-associated factor taf, putative [Ricinus communis]
            gi|223533005|gb|EEF34770.1| tbp-associated factor taf,
            putative [Ricinus communis]
          Length = 379

 Score =  354 bits (908), Expect = 9e-95
 Identities = 195/384 (50%), Positives = 249/384 (64%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHD-VKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272
            MS GGG+SGRV E   + K+K    GD+F+R+IAKIAVAQICE  GFQ  QQSALETLSD
Sbjct: 1    MSHGGGQSGRVQEKSQLAKRKSGSSGDEFARSIAKIAVAQICECTGFQTFQQSALETLSD 60

Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092
            +T+RY+C+LGK A   AN AGR + N FDII+ LEEL ++QGF  AS+V   +ASSG+V 
Sbjct: 61   VTVRYICNLGKLAQGNANSAGRIEGNAFDIIQALEELCSSQGFASASDVDHCIASSGIVR 120

Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912
            +I QYVS A++VPFA  IP FP++R++K  P F QIGE P  +HIP WLPAFPD   Y+ 
Sbjct: 121  DIAQYVSDADDVPFAYSIPPFPIVRERKLAPIFSQIGEKPPWEHIPDWLPAFPDPQIYLQ 180

Query: 911  TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732
            +P  NE  TD    K E AR   K + SL  LQQ  T SGS+ P+   P   ++ K   E
Sbjct: 181  SPTVNEGATDLNMQKFEPARLHPKIDRSL--LQQPFTSSGSQGPSSNVPAGGYEGKLIVE 238

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
             NPF+++PLQ GEK+VS VV P  LSNE         + +  +A+   SVL TFAP I+A
Sbjct: 239  GNPFVAAPLQCGEKEVSHVVPPAKLSNETAVRN---PIEHNRLADNHVSVLNTFAPAIKA 295

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
              S  CD+E+G++KVL ++RP + FK  +GKKSL   L+L  QN +  K   W       
Sbjct: 296  MNSRLCDSEEGQKKVLLNQRPAIQFKIAIGKKSLRTSLELGSQNKSAEKISPWSEKDNEN 355

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE+ILK+S+ENP EL QL
Sbjct: 356  DDKKRRAEKILKQSIENPGELAQL 379


>ref|XP_003531863.1| PREDICTED: transcription initiation factor TFIID subunit 8-like
            isoform 1 [Glycine max]
          Length = 381

 Score =  352 bits (904), Expect = 3e-94
 Identities = 182/384 (47%), Positives = 253/384 (65%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGR-VNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSD 1272
            MS+GGG++GR + +     ++K  GGDD++RAIAKIAVAQ+CE+ GFQ  QQSALE LSD
Sbjct: 1    MSNGGGKTGRQLEQPGTWGRRKVGGGDDYARAIAKIAVAQVCESEGFQAFQQSALEALSD 60

Query: 1271 ITIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVG 1092
            +  RY+ ++GK+AH +ANL+GRT+C+ FD+I+GLE++G+ QGF GAS+V   L SSG++ 
Sbjct: 61   VVARYILNVGKSAHCHANLSGRTECHAFDVIQGLEDMGSVQGFAGASDVDHCLESSGVIR 120

Query: 1091 EIMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVH 912
            EI+ +V+ AE V FA PIP FPV++++ P PSFLQ GE P G+HIPAWLPAFPD  TY  
Sbjct: 121  EIVHFVNDAEPVMFAHPIPQFPVVKERVPNPSFLQKGEEPPGEHIPAWLPAFPDLQTYSE 180

Query: 911  TPMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAE 732
            +P+ N R T+PR  K +Q R+  K EW  ++ QQ++  +  E  A ++P D    + AAE
Sbjct: 181  SPVVNGRGTEPRAVKFDQERENGKGEWPAMNFQQQMVSNMFEKSALIDPADAKAKRVAAE 240

Query: 731  SNPFLSSPLQFGEKDVSPVVLPTMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
             NPFL++PL+  +K+V+ V  P  L N+V    L   +    V N   S +ETFAP IEA
Sbjct: 241  GNPFLAAPLKIEDKEVASVPPPAKLFNDV---ALDNPVVENFVENEPISAMETFAPAIEA 297

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
             KS  CD+ + + K   +++P V FK G+  K LG  + L  Q      T+ WF      
Sbjct: 298  MKSTCCDSNEDQTKFRANEKPTVRFKIGIKNKLLGKSIGLIPQKEEHKNTLPWFAMEDGK 357

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 RAE+IL+ES+ENP +LVQL
Sbjct: 358  DDRKRRAEKILRESLENPDQLVQL 381


>ref|XP_004141587.1| PREDICTED: uncharacterized protein LOC101215115 [Cucumis sativus]
          Length = 376

 Score =  350 bits (899), Expect = 1e-93
 Identities = 193/384 (50%), Positives = 252/384 (65%), Gaps = 1/384 (0%)
 Frame = -1

Query: 1448 MSDGGGESGRVNEHDVKKKKKREGGDDFSRAIAKIAVAQICETAGFQGIQQSALETLSDI 1269
            MSDGGGESG+V  H+  K +K  G +DF RA+AKIAVAQICE+ GFQ  QQSALETL+D+
Sbjct: 1    MSDGGGESGKV--HERPKTRKNLGSEDFPRALAKIAVAQICESEGFQIFQQSALETLADV 58

Query: 1268 TIRYLCDLGKTAHFYANLAGRTDCNVFDIIRGLEELGTAQGFPGASNVTCSLASSGMVGE 1089
             +RY+ ++G TA+F AN AGRT+CN+FDII+ LE+LG+ QGF GAS++   LASS  V E
Sbjct: 59   AVRYVQNMGSTANFCANFAGRTECNLFDIIQALEDLGSVQGFAGASDIEHCLASSSTVKE 118

Query: 1088 IMQYVSLAEEVPFAQPIPHFPVIRKQKPTPSFLQIGETPDGKHIPAWLPAFPDSHTYVHT 909
              +YV+ AEEVPFA  +P FPV++++K  PSFLQIGE P G+HIP+WLPA PD  TY+ +
Sbjct: 119  FARYVAQAEEVPFAYSVPKFPVVKERKLRPSFLQIGEEPPGEHIPSWLPALPDPETYIES 178

Query: 908  PMWNERVTDPRTDKIEQARQRRKAEWSLLSLQQRLTCSGSEAPAPLNPEDEWKAKQAAES 729
            P+  E V +P+T K E  +Q R  E S  +LQQ L C+G E     +P +    KQ  ES
Sbjct: 179  PIVKEEVVEPQTIKTEPEKQCR-TEKSFWNLQQWLFCNGLEGSQREDPRNAAMTKQIQES 237

Query: 728  NPFLSSPLQFGEKDVSPVVLP-TMLSNEVVANTLPAKLSNEAVANTQPSVLETFAPVIEA 552
            NPFL+ PLQFGEK+VS +VLP  +L+N      +P  +      +T  SVLETFAP IE+
Sbjct: 238  NPFLAPPLQFGEKEVSSIVLPDKVLNNSSTEYHVP--VMENCQVDTHVSVLETFAPAIES 295

Query: 551  AKSGFCDTEDGERKVLPSKRPVVHFKFGVGKKSLGVPLDLTLQNNAEGKTVSWFGXXXXX 372
             K+ F      E K   +++  V FK G GKK+ G  ++L   NN   K+ SWF      
Sbjct: 296  IKNNF---HMSEEKYSLNRKSTVQFKIGTGKKAAGNMIELRALNNGVKKSSSWFVGEDEK 352

Query: 371  XXXXXRAEQILKESMENPQELVQL 300
                 +AE+ILK+SMEN  EL  L
Sbjct: 353  DDKKRKAEKILKDSMENSNELSHL 376

