BLASTX nr result

ID: Mentha28_contig00016154 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00016154
         (1814 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277150.2| PREDICTED: uncharacterized protein LOC100242...   386   e-104
ref|XP_006341120.1| PREDICTED: transcription initiation factor T...   384   e-104
ref|XP_004246524.1| PREDICTED: uncharacterized protein LOC101257...   376   e-101
emb|CBI30264.3| unnamed protein product [Vitis vinifera]              363   2e-97
ref|XP_007027689.1| Transcription initiation factor TFIID subuni...   355   3e-95
ref|XP_006481807.1| PREDICTED: transcription initiation factor T...   352   4e-94
ref|XP_006430244.1| hypothetical protein CICLE_v10011440mg [Citr...   351   5e-94
ref|XP_007027691.1| Transcription initiation factor TFIID subuni...   351   7e-94
ref|XP_004516193.1| PREDICTED: transcription initiation factor T...   338   5e-90
ref|XP_007027692.1| Transcription initiation factor TFIID subuni...   335   5e-89
ref|XP_007203746.1| hypothetical protein PRUPE_ppa017159mg [Prun...   330   1e-87
ref|XP_003542594.1| PREDICTED: transcription initiation factor T...   328   5e-87
ref|XP_007145316.1| hypothetical protein PHAVU_007G228800g [Phas...   325   3e-86
ref|XP_004156257.1| PREDICTED: uncharacterized protein LOC101226...   323   2e-85
gb|EXB88734.1| Transcription initiation factor TFIID subunit 12 ...   321   6e-85
ref|XP_003537062.1| PREDICTED: transcription initiation factor T...   320   1e-84
ref|XP_004305348.1| PREDICTED: uncharacterized protein LOC101314...   317   9e-84
ref|XP_002884775.1| tata-associated factor II 58 [Arabidopsis ly...   314   1e-82
ref|XP_004143314.1| PREDICTED: uncharacterized protein LOC101211...   313   1e-82
ref|XP_006297380.1| hypothetical protein CARUB_v10013405mg [Caps...   313   2e-82

>ref|XP_002277150.2| PREDICTED: uncharacterized protein LOC100242486 [Vitis vinifera]
          Length = 496

 Score =  386 bits (991), Expect = e-104
 Identities = 229/420 (54%), Positives = 263/420 (62%), Gaps = 31/420 (7%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSSLTPPSYGQQ--------------------SQIRQP 503
            RGG+AIGVPA     G P P  FSSL PP++GQQ                    SQ+R  
Sbjct: 87   RGGIAIGVPA-----GQPTP--FSSLNPPTFGQQYGGLGRSAVNVPESVANTNTSQVRPS 139

Query: 504  VQG---MGMAGALGATSSMRPAGVSPN-QLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLR 671
            +QG   MGM G LG+ S MRP G+S + Q RP QSSLRPQST + QSPA QNFQGHG+LR
Sbjct: 140  IQGSQGMGMMGTLGSGSQMRPGGISAHHQQRPVQSSLRPQSTVNNQSPATQNFQGHGLLR 199

Query: 672  VXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQ 851
                                 N PWLSS + QGKPPLP+ S RPQ   QS  QRSHIP Q
Sbjct: 200  ASSVGSPGAPSPNTSQSMQPHNQPWLSSGS-QGKPPLPSPSFRPQMTAQSLPQRSHIPQQ 258

Query: 852  LHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFSQQFPP---PRSITHQMQMSKGPGIGA 1022
             HH                           EH+ QQFPP   P+S+ H  Q+ +  G G 
Sbjct: 259  HHHPLPTASQQQQMSTAQQPQQPLLSHQQQEHYGQQFPPSRVPQSLPHPQQIGRVQGSGN 318

Query: 1023 QRPP-LGTTLSGASQPGALSKPAAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDI 1199
            Q+P  L        Q G  S+ A+A+  ES NRIL+KRSI ELVNQIDPSEKLDPEVEDI
Sbjct: 319  QKPSSLAIVQPSTPQLGPHSRTASAEASESGNRILSKRSIHELVNQIDPSEKLDPEVEDI 378

Query: 1200 LVDIAEDFVESITTFGCSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPL 1379
            LVDIAEDFVESITTFGCSLAKHRKS TLEAKDILLHLERNWNM LPGFGG+EI+ FKKP 
Sbjct: 379  LVDIAEDFVESITTFGCSLAKHRKSPTLEAKDILLHLERNWNMTLPGFGGDEIKTFKKPF 438

Query: 1380 VNEIHRERLAAVKKSML---AADTMSKNPSGASGSAKGHLAKGPANTISSPPNPKIREAA 1550
            V++IH+ERLAA+KKS +   +A+T S +  GA G+ KGH AK  AN +SS PN KIREAA
Sbjct: 439  VSDIHKERLAAIKKSAVGTESANTKSSSGQGA-GNTKGHPAKTSANVLSS-PNLKIREAA 496


>ref|XP_006341120.1| PREDICTED: transcription initiation factor TFIID subunit 12-like
            [Solanum tuberosum]
          Length = 561

 Score =  384 bits (987), Expect = e-104
 Identities = 233/453 (51%), Positives = 271/453 (59%), Gaps = 38/453 (8%)
 Frame = +3

Query: 279  RPWQQP-PYSHFSLXXXXXXXXXXXXXXXXXXXXXX--------RGGMAIGVPAHNPSTG 431
            RPWQQP P+ HFSL                              RGGMA+GVPAH+PS+ 
Sbjct: 120  RPWQQPSPFQHFSLPPPPPPPPPHSSSSSSSISSSLVSMQNQNPRGGMAMGVPAHHPSS- 178

Query: 432  APPPASFSSLTPPSYGQQ------------------SQIRQPVQGM---GMAGALGATSS 548
                 SFSSLT PSYGQQ                  SQ+RQP+QGM   GM G+LG+TS 
Sbjct: 179  -----SFSSLTTPSYGQQFGGLGRNLPDSTPPTSTTSQVRQPIQGMHGMGMMGSLGSTSP 233

Query: 549  MRPAGVSPNQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXX 728
            MRPAGVSP QLRP  SS+RPQ++ S+QS A QNFQGH MLRV                  
Sbjct: 234  MRPAGVSPQQLRPVSSSIRPQTSISSQSAATQNFQGHSMLRVQSVGFPPSQSHTTSQSPR 293

Query: 729  XXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHI--PHQLHHXXXXXXXXXXXXXX 902
                PWLSS A  GKPPLPT SLRPQ NPQS  QRSHI  PHQ                 
Sbjct: 294  TQTQPWLSSGAP-GKPPLPTPSLRPQINPQSLHQRSHILAPHQ-------HTVTTTSSAQ 345

Query: 903  XXXXXXXXXXXXXEHFSQQFPPPR---SITHQMQMSKGPGIGAQRPPLGTTLSG-ASQPG 1070
                         +H  QQ PP R   S+++Q  + +G G+G QRP     +   A QPG
Sbjct: 346  QSQSQPSASSQSQDHLGQQMPPSRIQQSLSNQ-PLGRGQGLGIQRPSSHALMQPTAVQPG 404

Query: 1071 ALSKPAAA-DPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFG 1247
              SK A   +  +   RIL+KRSIQE+V QIDPSEKLD EVEDILVDIAE+FVESITTFG
Sbjct: 405  PPSKAATTLEMGDPCTRILSKRSIQEIVTQIDPSEKLDAEVEDILVDIAEEFVESITTFG 464

Query: 1248 CSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSM 1427
            CSLAKHRKS+TLEAKDILLHLERNWNM LPGFGG+EIR +KKPL ++IH+ER+A +KKS 
Sbjct: 465  CSLAKHRKSNTLEAKDILLHLERNWNMTLPGFGGDEIRAYKKPLTSDIHKERIAVIKKST 524

Query: 1428 LAAD-TMSKNPSGASGSAKGHLAKGPANTISSP 1523
            L A+ T +K P+   G+ K HLAK  AN + SP
Sbjct: 525  LVAEMTNAKGPTQTGGNMKSHLAKSAANILGSP 557


>ref|XP_004246524.1| PREDICTED: uncharacterized protein LOC101257205 [Solanum
            lycopersicum]
          Length = 566

 Score =  376 bits (965), Expect = e-101
 Identities = 229/452 (50%), Positives = 269/452 (59%), Gaps = 37/452 (8%)
 Frame = +3

Query: 279  RPWQQP-PYSHFSLXXXXXXXXXXXXXXXXXXXXXX--------RGGMAIGVPAHNPSTG 431
            RPWQQP P+ HFSL                              RGGMA+GVPAH+PS+ 
Sbjct: 126  RPWQQPSPFQHFSLPPPPPPPPPHSSSSSSSVSSSSVSMQNQNPRGGMAMGVPAHHPSS- 184

Query: 432  APPPASFSSLTPPSYGQQ------------------SQIRQPVQGM---GMAGALGATSS 548
                 SFSSLT PSYGQQ                  SQ+RQP+QGM   GM G+LG+TS 
Sbjct: 185  -----SFSSLTTPSYGQQFGGLGRNLPDSTPPTSTTSQVRQPIQGMHGMGMMGSLGSTSP 239

Query: 549  MRPAGVSPNQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXX 728
            MRPAG+SP QLRP  S++RPQ++  +QS A QNFQGH MLRV                  
Sbjct: 240  MRPAGISPQQLRPVSSAIRPQTSIGSQSAATQNFQGHSMLRVQSVGFPPSQSHTTTSQSP 299

Query: 729  XXNP-PWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXX 905
                 PWLSS A  GKPPLPT SLRPQ NPQS  QRSHI     H               
Sbjct: 300  KTQTQPWLSSGAP-GKPPLPTPSLRPQINPQSLHQRSHILAAHQHTVTTSSSAQQSQPST 358

Query: 906  XXXXXXXXXXXXEHFSQQFPPPR---SITHQMQMSKGPGIGAQRPPLGTTLSG-ASQPGA 1073
                        +H  QQ PP R   S+++Q  + +G G+G QRP     +   A QPG 
Sbjct: 359  SSQSQ-------DHLGQQMPPSRIQQSLSNQA-LGRGQGLGIQRPSSHALMQPTAVQPGL 410

Query: 1074 LSKPAAA-DPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGC 1250
             SK A   +  +   RIL+KRSIQE+V QIDPSEKLD EVEDILVDIAE+FVESITTFGC
Sbjct: 411  PSKAATTLEMGDPCTRILSKRSIQEIVTQIDPSEKLDAEVEDILVDIAEEFVESITTFGC 470

Query: 1251 SLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSML 1430
            SLAKHRKS+TLEAKDILLHLERNWNM LPGFGG+EIR +KKPL ++IH+ER+AA+KKS L
Sbjct: 471  SLAKHRKSNTLEAKDILLHLERNWNMTLPGFGGDEIRAYKKPLTSDIHKERIAAIKKSTL 530

Query: 1431 AAD-TMSKNPSGASGSAKGHLAKGPANTISSP 1523
             A+ T +K P+   G+ K HLAK  AN + SP
Sbjct: 531  VAEMTNAKGPTQTGGNMKSHLAKNAANIMGSP 562


>emb|CBI30264.3| unnamed protein product [Vitis vinifera]
          Length = 372

 Score =  363 bits (931), Expect = 2e-97
 Identities = 206/359 (57%), Positives = 235/359 (65%), Gaps = 8/359 (2%)
 Frame = +3

Query: 498  QPVQGMGMAGALGATSSMRPAGVSPN-QLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRV 674
            Q  QGMGM G LG+ S MRP G+S + Q RP QSSLRPQST + QSPA QNFQGHG+LR 
Sbjct: 17   QGSQGMGMMGTLGSGSQMRPGGISAHHQQRPVQSSLRPQSTVNNQSPATQNFQGHGLLRA 76

Query: 675  XXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQL 854
                                N PWLSS + QGKPPLP+ S RPQ   QS  QRSHIP Q 
Sbjct: 77   SSVGSPGAPSPNTSQSMQPHNQPWLSSGS-QGKPPLPSPSFRPQMTAQSLPQRSHIPQQH 135

Query: 855  HHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFSQQFPP---PRSITHQMQMSKGPGIGAQ 1025
            HH                           EH+ QQFPP   P+S+ H  Q+ +  G G Q
Sbjct: 136  HHPLPTASQQQQMSTAQQPQQPLLSHQQQEHYGQQFPPSRVPQSLPHPQQIGRVQGSGNQ 195

Query: 1026 RPP-LGTTLSGASQPGALSKPAAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDIL 1202
            +P  L        Q G  S+ A+A+  ES NRIL+KRSI ELVNQIDPSEKLDPEVEDIL
Sbjct: 196  KPSSLAIVQPSTPQLGPHSRTASAEASESGNRILSKRSIHELVNQIDPSEKLDPEVEDIL 255

Query: 1203 VDIAEDFVESITTFGCSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLV 1382
            VDIAEDFVESITTFGCSLAKHRKS TLEAKDILLHLERNWNM LPGFGG+EI+ FKKP V
Sbjct: 256  VDIAEDFVESITTFGCSLAKHRKSPTLEAKDILLHLERNWNMTLPGFGGDEIKTFKKPFV 315

Query: 1383 NEIHRERLAAVKKSML---AADTMSKNPSGASGSAKGHLAKGPANTISSPPNPKIREAA 1550
            ++IH+ERLAA+KKS +   +A+T S +  GA G+ KGH AK  AN +SS PN KIREAA
Sbjct: 316  SDIHKERLAAIKKSAVGTESANTKSSSGQGA-GNTKGHPAKTSANVLSS-PNLKIREAA 372


>ref|XP_007027689.1| Transcription initiation factor TFIID subunit 12, putative isoform 1
            [Theobroma cacao] gi|590631895|ref|XP_007027690.1|
            Transcription initiation factor TFIID subunit 12,
            putative isoform 1 [Theobroma cacao]
            gi|508716294|gb|EOY08191.1| Transcription initiation
            factor TFIID subunit 12, putative isoform 1 [Theobroma
            cacao] gi|508716295|gb|EOY08192.1| Transcription
            initiation factor TFIID subunit 12, putative isoform 1
            [Theobroma cacao]
          Length = 560

 Score =  355 bits (912), Expect = 3e-95
 Identities = 217/443 (48%), Positives = 262/443 (59%), Gaps = 20/443 (4%)
 Frame = +3

Query: 276  TRPWQQ--PPYSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPA- 446
            +RPWQQ    ++HFS                       RG  AIGVP+ + S   P P+ 
Sbjct: 125  SRPWQQHSSQFTHFS-----SSSPSVSSSPSPTLSSQPRGSFAIGVPSSHSSPSPPTPSP 179

Query: 447  ----SFSSLTPPSYG-----QQSQIRQPVQGMGMAGA-LGATSSMRPAGVSPN-QLRPSQ 593
                SFS     S+G       SQ RQP+QGMGM G+ +G++S MRP G+S + Q RP Q
Sbjct: 180  SQPTSFSGSFGHSFGGGSSSNVSQARQPIQGMGMVGSSIGSSSQMRPGGLSAHHQQRPVQ 239

Query: 594  SSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGK 773
            SSLRP S+ ++QSPA QNFQGHG++RV                    N PWLSS A QGK
Sbjct: 240  SSLRPPSSTNSQSPATQNFQGHGLMRVSAVGTSGSSTPSTPQTTQSLNQPWLSSGA-QGK 298

Query: 774  PPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFS 953
            PPLP  S RPQ N  S QQRSHI  Q  H                           EHF 
Sbjct: 299  PPLPPPSYRPQINSPSLQQRSHISQQ--HHSLPTVSQQQHVSSPQVPQPLPSHQQQEHFG 356

Query: 954  QQFPP---PRSITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESSNRI 1121
            QQF     P+S+ HQ Q+S+  G   Q+P  L        QP   +K A  + +ES  RI
Sbjct: 357  QQFSQSRVPQSLPHQQQVSRAQGSANQKPSSLAMIQPSIVQPLNQNKAAITESDESGGRI 416

Query: 1122 LTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDIL 1301
            L+KRS+ +LVNQIDPSEKLDPEVEDILVDIAEDFV+SITTFGCSLAKHRKS TLEAKDIL
Sbjct: 417  LSKRSVHDLVNQIDPSEKLDPEVEDILVDIAEDFVDSITTFGCSLAKHRKSDTLEAKDIL 476

Query: 1302 LHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPSG--ASGS 1475
            LHLERNW+M LPGF G+EI+ ++KPL NEIH+ERLAA+KKS+L  +  +    G  A+ +
Sbjct: 477  LHLERNWHMTLPGFCGDEIKTYRKPLTNEIHKERLAAIKKSILVTEATNTKHFGGQAAVN 536

Query: 1476 AKGHLAKGPANTISSPPNPKIRE 1544
            AKG+L K  AN + S PN KIRE
Sbjct: 537  AKGNLGKAAANILGS-PNVKIRE 558


>ref|XP_006481807.1| PREDICTED: transcription initiation factor TFIID subunit 12-like
            [Citrus sinensis]
          Length = 538

 Score =  352 bits (902), Expect = 4e-94
 Identities = 214/438 (48%), Positives = 257/438 (58%), Gaps = 23/438 (5%)
 Frame = +3

Query: 276  TRPWQQPP--YSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPAS 449
            +RPWQ P   +SHFS                       RGG+AIGVPA  P+  +P P+ 
Sbjct: 104  SRPWQPPQQHFSHFS-SLPSSSSATPSTSASPPIPPPPRGGIAIGVPAPRPTALSPQPSP 162

Query: 450  ---------FSSLTPPSYGQQSQIRQP-VQGMGMAGALGATSSMRPAGVSPNQLRP---S 590
                     F  L          +R P +QGMG+ G+LG++S MRPAG+S    +P    
Sbjct: 163  PFSSSFGQPFGGLGRSGVNVPDSVRPPAIQGMGVMGSLGSSSQMRPAGISVQHHQPRPVQ 222

Query: 591  QSSLRPQ-STPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQ 767
            QSSLRP  S+PS+QSP  QNFQG G++RV                    N PWLSS + Q
Sbjct: 223  QSSLRPPPSSPSSQSPGTQNFQGQGLMRVSQVGSPGSSSPNTSQSVQSFNQPWLSSGS-Q 281

Query: 768  GKPPL-PTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXE 944
            GKPPL P  + RPQ N  S QQRSHIP Q  H                           +
Sbjct: 282  GKPPLAPPSTYRPQMNTPSMQQRSHIPQQ--HSPLSTNLQQQHLSSVQPQQSKPSHQLPD 339

Query: 945  HFSQQFPPPR---SITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESS 1112
            H+ QQF  PR   S  HQ Q+++ PG   Q+P  L      A Q G  SK AA + +E  
Sbjct: 340  HYGQQFSSPRVPQSSPHQQQITRPPGSATQKPSSLALVQPNAVQTGNQSKIAATESDEFG 399

Query: 1113 NRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAK 1292
            NRILTKRSIQELVNQI PSE+LDP+VEDILVDIAEDFVESITTFGCSLAKHRKS TLEAK
Sbjct: 400  NRILTKRSIQELVNQIGPSERLDPDVEDILVDIAEDFVESITTFGCSLAKHRKSDTLEAK 459

Query: 1293 DILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPSG--A 1466
            DIL+HLERNWNM LPGF G+EI+ F+KPLV +IH+ERLAA+KKS++A +  S   +G  A
Sbjct: 460  DILVHLERNWNMTLPGFSGDEIKTFRKPLVCDIHKERLAAIKKSVMATEVASARTTGGQA 519

Query: 1467 SGSAKGHLAKGPANTISS 1520
            + SAKG+L K PAN I S
Sbjct: 520  AASAKGNLGKMPANIIGS 537


>ref|XP_006430244.1| hypothetical protein CICLE_v10011440mg [Citrus clementina]
            gi|557532301|gb|ESR43484.1| hypothetical protein
            CICLE_v10011440mg [Citrus clementina]
          Length = 537

 Score =  351 bits (901), Expect = 5e-94
 Identities = 213/438 (48%), Positives = 255/438 (58%), Gaps = 23/438 (5%)
 Frame = +3

Query: 276  TRPWQQPP--YSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPAS 449
            +RPWQ P   +SHFS                       RGG+AIGVPA  P+  +P P+ 
Sbjct: 103  SRPWQPPQQHFSHFS-SLPSSSSATPSTSASPPIPSPPRGGIAIGVPAPRPTASSPQPSP 161

Query: 450  ---------FSSLTPPSYGQQSQIRQP-VQGMGMAGALGATSSMRPAGVSPNQLRP---S 590
                     F  L          +R P +QGMG+ G+LG++S MRPAG+S    +P    
Sbjct: 162  PFSSSFGQPFGGLGRSGVNVPDSVRPPAIQGMGVMGSLGSSSQMRPAGISVQHHQPRPVQ 221

Query: 591  QSSLRPQ-STPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQ 767
            QSSLRP  S+PS+QSP  QNFQG G++RV                    N PWLSS + Q
Sbjct: 222  QSSLRPPPSSPSSQSPGTQNFQGQGLMRVSQVGSPGSSSPNTSQSVQSFNQPWLSSGS-Q 280

Query: 768  GKPPLPTQSL-RPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXE 944
            GKPPLP  S  RPQ N  S QQRSHIP Q  H                           +
Sbjct: 281  GKPPLPPPSTYRPQMNTPSMQQRSHIPQQ--HSPLSTNLQQQHLSSVQSQQSKPSHQLPD 338

Query: 945  HFSQQFPPPR---SITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESS 1112
            H+ QQF  PR   S  HQ Q+++ PG   Q+P  L      A Q G  SK A  + +E  
Sbjct: 339  HYGQQFSSPRVPQSSPHQQQITRPPGSATQKPSSLALVQPNAVQTGNQSKIAGTESDEFG 398

Query: 1113 NRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAK 1292
            NRILTKRSIQELVNQIDPSE+LDP+VEDILVDIAEDFVESIT FGCSLAKHRKS TLEAK
Sbjct: 399  NRILTKRSIQELVNQIDPSERLDPDVEDILVDIAEDFVESITMFGCSLAKHRKSDTLEAK 458

Query: 1293 DILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPSG--A 1466
            DIL+HLERNWNM LPGF G+EI+ F+KPLV +IH+ERLAA+KKS++A +  S   +G   
Sbjct: 459  DILVHLERNWNMTLPGFSGDEIKTFRKPLVCDIHKERLAAIKKSVMATEVASARTTGGQT 518

Query: 1467 SGSAKGHLAKGPANTISS 1520
            + SAKG+L K PAN I S
Sbjct: 519  AASAKGNLGKMPANIIGS 536


>ref|XP_007027691.1| Transcription initiation factor TFIID subunit 12, putative isoform 3
            [Theobroma cacao] gi|508716296|gb|EOY08193.1|
            Transcription initiation factor TFIID subunit 12,
            putative isoform 3 [Theobroma cacao]
          Length = 561

 Score =  351 bits (900), Expect = 7e-94
 Identities = 217/444 (48%), Positives = 262/444 (59%), Gaps = 21/444 (4%)
 Frame = +3

Query: 276  TRPWQQ--PPYSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPA- 446
            +RPWQQ    ++HFS                       RG  AIGVP+ + S   P P+ 
Sbjct: 125  SRPWQQHSSQFTHFS-----SSSPSVSSSPSPTLSSQPRGSFAIGVPSSHSSPSPPTPSP 179

Query: 447  ----SFSSLTPPSYG-----QQSQIRQPVQGMGMAGA-LGATSSMRPAGVSPN-QLRPSQ 593
                SFS     S+G       SQ RQP+QGMGM G+ +G++S MRP G+S + Q RP Q
Sbjct: 180  SQPTSFSGSFGHSFGGGSSSNVSQARQPIQGMGMVGSSIGSSSQMRPGGLSAHHQQRPVQ 239

Query: 594  SSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGK 773
            SSLRP S+ ++QSPA QNFQGHG++RV                    N PWLSS A QGK
Sbjct: 240  SSLRPPSSTNSQSPATQNFQGHGLMRVSAVGTSGSSTPSTPQTTQSLNQPWLSSGA-QGK 298

Query: 774  PPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFS 953
            PPLP  S RPQ N  S QQRSHI  Q  H                           EHF 
Sbjct: 299  PPLPPPSYRPQINSPSLQQRSHISQQ--HHSLPTVSQQQHVSSPQVPQPLPSHQQQEHFG 356

Query: 954  QQFPP---PRSITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESSNRI 1121
            QQF     P+S+ HQ Q+S+  G   Q+P  L        QP   +K A  + +ES  RI
Sbjct: 357  QQFSQSRVPQSLPHQQQVSRAQGSANQKPSSLAMIQPSIVQPLNQNKAAITESDESGGRI 416

Query: 1122 LTKRSIQELVNQ-IDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDI 1298
            L+KRS+ +LVNQ IDPSEKLDPEVEDILVDIAEDFV+SITTFGCSLAKHRKS TLEAKDI
Sbjct: 417  LSKRSVHDLVNQQIDPSEKLDPEVEDILVDIAEDFVDSITTFGCSLAKHRKSDTLEAKDI 476

Query: 1299 LLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPSG--ASG 1472
            LLHLERNW+M LPGF G+EI+ ++KPL NEIH+ERLAA+KKS+L  +  +    G  A+ 
Sbjct: 477  LLHLERNWHMTLPGFCGDEIKTYRKPLTNEIHKERLAAIKKSILVTEATNTKHFGGQAAV 536

Query: 1473 SAKGHLAKGPANTISSPPNPKIRE 1544
            +AKG+L K  AN + S PN KIRE
Sbjct: 537  NAKGNLGKAAANILGS-PNVKIRE 559


>ref|XP_004516193.1| PREDICTED: transcription initiation factor TFIID subunit 12-like
            isoform X1 [Cicer arietinum]
            gi|502178179|ref|XP_004516194.1| PREDICTED: transcription
            initiation factor TFIID subunit 12-like isoform X2 [Cicer
            arietinum]
          Length = 523

 Score =  338 bits (867), Expect = 5e-90
 Identities = 194/394 (49%), Positives = 242/394 (61%), Gaps = 14/394 (3%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSS-------LTPPSYGQQSQIRQPVQGMGMAGALGAT 542
            RGGMAIGVPAH+ +   P  +SFS            S    SQ+R P+QGMGM G+LG++
Sbjct: 126  RGGMAIGVPAHHQNPSPPFSSSFSQHFGGLARSDSTSNSNTSQVRAPMQGMGMLGSLGSS 185

Query: 543  SSMRPAGVSPNQLRPSQSSLRPQSTPSTQSPAA--QNFQGHGMLRVXXXXXXXXXXXXXX 716
            S MRP+G+  +  RP+QSSLRP  +     PA   Q+FQGHG+LR               
Sbjct: 186  SQMRPSGMPSHTQRPAQSSLRPPPSVQNNQPAGSQQSFQGHGILRPSSVGSPANPSPSAS 245

Query: 717  XXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXX 896
                  N PWLSS    GKPPLP+Q+ R Q NPQS QQR+H+  QL              
Sbjct: 246  QSMQTINQPWLSSGPP-GKPPLPSQAYRQQLNPQSLQQRTHVSQQLQPVPTVSQQQQPVP 304

Query: 897  XXXXXXXXXXXXXXXEHFSQQFPPPRSI--THQMQMSKGPGIGAQRPP-LGTTLSGASQP 1067
                           EHF QQ P  R++   HQ Q+++  G G Q+P  L    S A QP
Sbjct: 305  TASQQQQPLPSNQSQEHFGQQVPSSRAVHVPHQPQVTRLQGPGNQKPSSLVAAQSSAVQP 364

Query: 1068 GALSKPAAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFG 1247
             + S+   AD EES   +L+KRSI ELVNQ+DP EKLDPEV D+L DIAE+F++SI   G
Sbjct: 365  ASQSRLPIADTEESGKSVLSKRSIHELVNQVDPLEKLDPEVADVLGDIAENFLDSIIRSG 424

Query: 1248 CSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSM 1427
            CSLAKHRKS+TLEAKD+LLHLE+NWNM LPGFGGEEI+ ++KP+ ++IH+ERLAA+KKSM
Sbjct: 425  CSLAKHRKSTTLEAKDVLLHLEKNWNMTLPGFGGEEIKNYRKPVPSDIHKERLAAIKKSM 484

Query: 1428 LAADTM-SKNPSG-ASGSAKGHLAKGPANTISSP 1523
            +A +   SK  +G ASGSAKG  AK P N I SP
Sbjct: 485  VATEVAHSKGTAGQASGSAKGGQAKTPFNVIGSP 518


>ref|XP_007027692.1| Transcription initiation factor TFIID subunit 12, putative isoform 4
            [Theobroma cacao] gi|508716297|gb|EOY08194.1|
            Transcription initiation factor TFIID subunit 12,
            putative isoform 4 [Theobroma cacao]
          Length = 603

 Score =  335 bits (858), Expect = 5e-89
 Identities = 217/486 (44%), Positives = 262/486 (53%), Gaps = 63/486 (12%)
 Frame = +3

Query: 276  TRPWQQ--PPYSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPA- 446
            +RPWQQ    ++HFS                       RG  AIGVP+ + S   P P+ 
Sbjct: 125  SRPWQQHSSQFTHFS-----SSSPSVSSSPSPTLSSQPRGSFAIGVPSSHSSPSPPTPSP 179

Query: 447  ----SFSSLTPPSYG-----QQSQIRQPVQGMGMAGA-LGATSSMRPAGVSPN-QLRPSQ 593
                SFS     S+G       SQ RQP+QGMGM G+ +G++S MRP G+S + Q RP Q
Sbjct: 180  SQPTSFSGSFGHSFGGGSSSNVSQARQPIQGMGMVGSSIGSSSQMRPGGLSAHHQQRPVQ 239

Query: 594  SSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGK 773
            SSLRP S+ ++QSPA QNFQGHG++RV                    N PWLSS A QGK
Sbjct: 240  SSLRPPSSTNSQSPATQNFQGHGLMRVSAVGTSGSSTPSTPQTTQSLNQPWLSSGA-QGK 298

Query: 774  PPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFS 953
            PPLP  S RPQ N  S QQRSHI  Q  H                           EHF 
Sbjct: 299  PPLPPPSYRPQINSPSLQQRSHISQQ--HHSLPTVSQQQHVSSPQVPQPLPSHQQQEHFG 356

Query: 954  QQFPP---PRSITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESSNRI 1121
            QQF     P+S+ HQ Q+S+  G   Q+P  L        QP   +K A  + +ES  RI
Sbjct: 357  QQFSQSRVPQSLPHQQQVSRAQGSANQKPSSLAMIQPSIVQPLNQNKAAITESDESGGRI 416

Query: 1122 LTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDIL 1301
            L+KRS+ +LVNQIDPSEKLDPEVEDILVDIAEDFV+SITTFGCSLAKHRKS TLEAKDIL
Sbjct: 417  LSKRSVHDLVNQIDPSEKLDPEVEDILVDIAEDFVDSITTFGCSLAKHRKSDTLEAKDIL 476

Query: 1302 LHLERNWNMALPGFGGEEIRMFKKP----------------------------------- 1376
            LHLERNW+M LPGF G+EI+ ++KP                                   
Sbjct: 477  LHLERNWHMTLPGFCGDEIKTYRKPVKCSLLSIVHTGDVGCAVVCKRNLTVVDLDLQLFL 536

Query: 1377 --------LVNEIHRERLAAVKKSMLAADTMSKNPSG--ASGSAKGHLAKGPANTISSPP 1526
                    L NEIH+ERLAA+KKS+L  +  +    G  A+ +AKG+L K  AN + S P
Sbjct: 537  IVALLVMQLTNEIHKERLAAIKKSILVTEATNTKHFGGQAAVNAKGNLGKAAANILGS-P 595

Query: 1527 NPKIRE 1544
            N KIRE
Sbjct: 596  NVKIRE 601


>ref|XP_007203746.1| hypothetical protein PRUPE_ppa017159mg [Prunus persica]
            gi|462399277|gb|EMJ04945.1| hypothetical protein
            PRUPE_ppa017159mg [Prunus persica]
          Length = 509

 Score =  330 bits (846), Expect = 1e-87
 Identities = 205/446 (45%), Positives = 255/446 (57%), Gaps = 24/446 (5%)
 Frame = +3

Query: 279  RPWQQPPYSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPASFSS 458
            RPW Q  ++HF                        RG +AIGVPAH+PS   P PA FSS
Sbjct: 94   RPWPQSHFAHFP-SPSSSASSGPPPPPSSSAPPPPRGSIAIGVPAHHPSPSPPQPAPFSS 152

Query: 459  L----------------TPPSYGQQSQIR---QPVQGMGMAGALGATSSMRPAGVSPNQ- 578
                              P S    SQ+R   Q +QGM M G+LG++S MRPAGVS    
Sbjct: 153  SYGQHFGGLGRGGAAVPEPVSNSSASQVRPSVQGMQGMAMMGSLGSSSQMRPAGVSAQHP 212

Query: 579  LRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSS 758
             RP QSS RP ST S+QSP++QNFQGH +LRV                     P WLSS 
Sbjct: 213  QRPVQSSFRPPSTASSQSPSSQNFQGHSLLRVSSVGTPSTSPNTSQGLQPHTQP-WLSSG 271

Query: 759  ATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXX 938
            + QGKPP+P+ S R   +  S QQRSH P                               
Sbjct: 272  S-QGKPPIPSPSYRQHMSSPSMQQRSHGP------------------------------- 299

Query: 939  XEHFSQQFPPPR-SITHQMQMSKGPGIGAQRPPLGTTLS-GASQPGALSKPAAADPEESS 1112
             EHF Q   P R   T   Q+++   +  Q+     ++     Q G  +K A+A+ +ES 
Sbjct: 300  -EHFGQHVQPSRIPQTTPRQITRVQNLANQKSSSPASVQPNTVQSGPQNKSASAETDESC 358

Query: 1113 NRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAK 1292
            NRIL+KRSI ELVNQIDPSEKLDPEVEDIL+DIA++FVESITTF CSLAKHRKS+TLEAK
Sbjct: 359  NRILSKRSIHELVNQIDPSEKLDPEVEDILMDIADEFVESITTFSCSLAKHRKSTTLEAK 418

Query: 1293 DILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPS--GA 1466
            DILLH+E+NWN+ LPGFGG+EI+ F+KPL N+IH+ERL+ +KKS++A +T +   S   A
Sbjct: 419  DILLHIEKNWNITLPGFGGDEIKGFRKPLTNDIHKERLSVIKKSIVATETANARSSTGQA 478

Query: 1467 SGSAKGHLAKGPANTISSPPNPKIRE 1544
            +G+AKG L K PAN ISS  N K+RE
Sbjct: 479  TGNAKGSLVKAPANVISS-QNTKMRE 503


>ref|XP_003542594.1| PREDICTED: transcription initiation factor TFIID subunit 12-like
            [Glycine max]
          Length = 507

 Score =  328 bits (841), Expect = 5e-87
 Identities = 194/399 (48%), Positives = 239/399 (59%), Gaps = 19/399 (4%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSS------------LTPPSYGQQSQIRQPVQGMGMAG 527
            RGGMAIGVPAH+ S   P  +SF                  S    SQ R PVQGMGM G
Sbjct: 118  RGGMAIGVPAHHQSPSPPFSSSFGQHFGGLGRTAVNVAESTSNSSTSQARTPVQGMGMLG 177

Query: 528  ALGATSSMRPAGVSPNQLRPSQSSLRPQ-STPSTQSPAAQNFQGHGMLRVXXXXXXXXXX 704
                 S MRP+G+  +Q R  QSSLRP  S P+ Q   +Q+FQGHG++R           
Sbjct: 178  -----SQMRPSGIGSHQQRSVQSSLRPPTSAPNNQPAGSQSFQGHGLMRPSSVGSTATPS 232

Query: 705  XXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXX 884
                      N PWLSS + QGKPPLP+ + R Q NPQS QQRSHIP             
Sbjct: 233  PSSSQSMQSLNQPWLSSGS-QGKPPLPSAAYRQQLNPQSMQQRSHIPPMQQSTPTSSQQQ 291

Query: 885  XXXXXXXXXXXXXXXXXXXEHFSQQFPPPRS---ITHQMQMSKGPGIGAQRPP-LGTTLS 1052
                               EHF QQ PP R+   + HQ Q+++  G G Q+P  L    S
Sbjct: 292  QQQPLLSNQSQ--------EHFGQQVPPSRAPLHMPHQPQVTRLQGPGNQKPSSLVAAQS 343

Query: 1053 GASQPGALSKPAAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVES 1232
             A+QPG  S+   +D +ESSN IL+KRSI ELVNQ+DP EKL+PEV DILVDIAE+F+ES
Sbjct: 344  SAAQPGTQSRLTNSDTDESSNSILSKRSIHELVNQVDPLEKLEPEVADILVDIAENFLES 403

Query: 1233 ITTFGCSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAA 1412
            IT  GCSLAKHRKS+TLE+KDILLHLE+NWNM LPGFGG+EI+ +++P+ ++IH+ERLA 
Sbjct: 404  ITRSGCSLAKHRKSTTLESKDILLHLEKNWNMTLPGFGGDEIKSYRRPITSDIHKERLAV 463

Query: 1413 VKKSMLAADTM-SKNPSG-ASGSAKGHLAKGPANTISSP 1523
            +KKSM + +    K  +G ASGSAKG+  K P N I SP
Sbjct: 464  IKKSMASTEAAHGKGSAGQASGSAKGNQGKTPLNIIGSP 502


>ref|XP_007145316.1| hypothetical protein PHAVU_007G228800g [Phaseolus vulgaris]
            gi|561018506|gb|ESW17310.1| hypothetical protein
            PHAVU_007G228800g [Phaseolus vulgaris]
          Length = 506

 Score =  325 bits (834), Expect = 3e-86
 Identities = 201/435 (46%), Positives = 245/435 (56%), Gaps = 19/435 (4%)
 Frame = +3

Query: 276  TRPWQQPPYSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPASFS 455
            T P  QP + HFS                       RGGMAIGVPAH+ S   P  +SF 
Sbjct: 88   TLPPSQPQFQHFS--------SAPPPASAPGAAAAPRGGMAIGVPAHHQSPSPPFSSSFG 139

Query: 456  S------------LTPPSYGQQSQIRQPVQGMGMAGALGATSSMRPAGVSPNQLRPSQSS 599
                             S    SQ+R PVQGMGM G     S MRP G++ +Q RP QSS
Sbjct: 140  QHFGGLGRTGVNVAESASNSSTSQVRTPVQGMGMLG-----SQMRPGGIAAHQQRPVQSS 194

Query: 600  LRPQST-PSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLSSSATQGKP 776
            LRP S+ P+TQ   +Q+FQGHG++R                     N PWLSS    GKP
Sbjct: 195  LRPPSSAPNTQPGGSQSFQGHGIMRPSSVGSPATPSQGASQSVQSLNQPWLSSGPL-GKP 253

Query: 777  PLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFSQ 956
            PLP+ + R Q NP S QQRSHIP Q                              EHF Q
Sbjct: 254  PLPSTAYRQQLNPPSMQQRSHIPPQQQSTPISSQQQQQQQPSLSNQSQ-------EHFGQ 306

Query: 957  QFPP---PRSITHQMQMSKGPGIGAQRPP-LGTTLSGASQPGALSKPAAADPEESSNRIL 1124
            Q  P   P  + HQ Q+++  G G Q+P  L    + A Q G+ S+    D EES N IL
Sbjct: 307  QVQPSRAPHHVPHQQQVTRLQGPGNQKPSSLVAAQTSAVQTGSQSRLTNVDTEESCNSIL 366

Query: 1125 TKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDILL 1304
            +KRSI ELVNQ+DP EKLDPEV DILVDIAE+F+ESI   GCSLAKHRKS+TLEAKDILL
Sbjct: 367  SKRSIHELVNQVDPLEKLDPEVADILVDIAENFLESIIRSGCSLAKHRKSTTLEAKDILL 426

Query: 1305 HLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAAD-TMSKNPSG-ASGSA 1478
            HLE+NWNM LPGFGG+EI+ +++ + ++IH+ERL+A+KKSM A +   +K  +G ASGSA
Sbjct: 427  HLEKNWNMTLPGFGGDEIKSYRRQITSDIHKERLSAIKKSMTATELAHAKGSAGQASGSA 486

Query: 1479 KGHLAKGPANTISSP 1523
            KG+ AK P N I SP
Sbjct: 487  KGNQAKTPMNIIGSP 501


>ref|XP_004156257.1| PREDICTED: uncharacterized protein LOC101226357 [Cucumis sativus]
          Length = 511

 Score =  323 bits (828), Expect = 2e-85
 Identities = 203/437 (46%), Positives = 249/437 (56%), Gaps = 30/437 (6%)
 Frame = +3

Query: 285  WQQPP-YSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPASFSSL 461
            WQ P  +SHFS                       R G+AIGVPAH P T +P PA FS+ 
Sbjct: 92   WQPPSHFSHFSSPSPSASSSVPSPRPVSASSPAQRSGVAIGVPAHQP-TPSPQPAPFSA- 149

Query: 462  TPPSYGQQ--------------------SQIRQPVQGM---GMAGALGATSSMRPAGVSP 572
               SYGQ                     SQ+R P+QGM   GM G+ G++S M       
Sbjct: 150  ---SYGQHFGGLGRGGVSISDGASNSNPSQVRPPMQGMQGLGMLGSSGSSSQMLH----- 201

Query: 573  NQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLS 752
               RP QSSLRP STP++   A+QNFQGHG+LRV                    N PWL 
Sbjct: 202  ---RPVQSSLRPPSTPNS---ASQNFQGHGLLRVPSTSSPSSSLPNTSQGMQPTNQPWLP 255

Query: 753  SSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXX 932
            SS+ QGKPPLPT S RPQ N  + QQRSHIP Q +H                        
Sbjct: 256  SSS-QGKPPLPTPSYRPQANSPAMQQRSHIPQQQNHPLTPVSQQQQISSAPQQQPAQSHQ 314

Query: 933  XXXEHFSQQFPPPRS---ITHQMQMSKGPG-IGAQRPPLGTTLSGASQPGALSKPAAADP 1100
               EHF+QQF   RS   + HQ Q ++  G    +  PL    +  +Q    S+   A+ 
Sbjct: 315  PQ-EHFAQQFQQSRSSQGLPHQQQAARAQGPANPKASPLAPPQTNNAQALTPSRAITAEV 373

Query: 1101 EESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSST 1280
            EE  +RIL+KRSI +LVNQIDPSE+LDPEVEDILVD+AE+FVESITTFGCSLAKHRKS+T
Sbjct: 374  EEPCSRILSKRSIGKLVNQIDPSERLDPEVEDILVDLAEEFVESITTFGCSLAKHRKSTT 433

Query: 1281 LEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPS 1460
            LEAKDILLHLE+NWN+ LPGFG +EI++F+KPL N+ HRER+AAVKKS++A++  S   S
Sbjct: 434  LEAKDILLHLEKNWNLTLPGFGSDEIKIFRKPLTNDTHRERVAAVKKSIVASEMASTRSS 493

Query: 1461 G--ASGSAKGHLAKGPA 1505
               A+G+ K  L K PA
Sbjct: 494  AGQAAGNTKSSLTKTPA 510


>gb|EXB88734.1| Transcription initiation factor TFIID subunit 12 [Morus notabilis]
          Length = 603

 Score =  321 bits (823), Expect = 6e-85
 Identities = 195/441 (44%), Positives = 253/441 (57%), Gaps = 58/441 (13%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSSLTPPSYGQ-------------------QSQIRQPV 506
            RGG+AIGVPAH PS     PA FS+    SYG                     SQ+R  +
Sbjct: 142  RGGVAIGVPAHRPSPSPSQPAPFSA----SYGHFGGLGRSGVGLPESSPNSNASQVRPSM 197

Query: 507  QGMGMAGALGATSSMRPAGVSPN-QLRPSQSSLRPQSTPSTQSPAAQ-NFQGHGMLRVXX 680
            QG+GM G+L + + MRP G+ P+ Q RP Q SLRP S+PS Q P++Q N+Q H +LRV  
Sbjct: 198  QGIGMLGSLSSGAQMRPGGIPPHHQQRPVQPSLRPVSSPSNQLPSSQQNYQAHSLLRVPS 257

Query: 681  XXXXXXXXXXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIP----- 845
                              N PWL+S + QGKPPLP+ S R   NPQS QQRSH+      
Sbjct: 258  AGSPGSPSPSMSQSVQSLNQPWLASGS-QGKPPLPSPSYRQPMNPQSLQQRSHMQTQPQQ 316

Query: 846  ---------------------HQLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXEHFSQQF 962
                                  Q +H                           EH+ +Q 
Sbjct: 317  QMQPQLQPQSQLQPQPQQQQQQQQNHQLPMTAQQHMSALQQQQQQPSPSQQAHEHYGKQV 376

Query: 963  PP---PRSITHQMQMSKGPGIGAQRPPLGTTLS-GASQPGALSKPAAADPEESSNRILTK 1130
            P    P++++HQ Q+++    G Q+     ++     Q    ++ + A+ +ES NRIL+K
Sbjct: 377  PSSRVPQALSHQQQITRVQASGNQKSSTPASVQPNTVQSVPQNRISTAETDESCNRILSK 436

Query: 1131 RSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDILLHL 1310
            RSI ELV+Q+DPSEKLDPEVEDIL+DIA+DFVESITTFGCSLAKHRKS+TLEAKDILLHL
Sbjct: 437  RSIHELVSQVDPSEKLDPEVEDILMDIADDFVESITTFGCSLAKHRKSTTLEAKDILLHL 496

Query: 1311 ERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTM-SKNPSG-ASGSAKG 1484
            ERNWN+ LPGFGG+EI+ F+KP+VN+IH ERL A+KKSML  +T  ++NPSG A+G+AKG
Sbjct: 497  ERNWNITLPGFGGDEIKSFRKPVVNDIHMERLVAIKKSMLTTETANTRNPSGQAAGNAKG 556

Query: 1485 HLAKGPANTIS-----SPPNP 1532
             L +   + +S     S P P
Sbjct: 557  SLPRVVKHAVSGRHGTSDPRP 577


>ref|XP_003537062.1| PREDICTED: transcription initiation factor TFIID subunit 12-like
            [Glycine max]
          Length = 507

 Score =  320 bits (820), Expect = 1e-84
 Identities = 192/400 (48%), Positives = 233/400 (58%), Gaps = 20/400 (5%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSS------------LTPPSYGQQSQIRQPVQGMGMAG 527
            RGGMAIGVPAH+ S   P  +SF                  S    SQ+R PVQG GM G
Sbjct: 125  RGGMAIGVPAHHQSPSPPFSSSFGQHFGGLGRTGVNVAESTSNSSTSQVRTPVQGTGMLG 184

Query: 528  ALGATSSMRPAGVSPNQLRPSQSSLRPQ--STPSTQSPAAQNFQGHGMLRVXXXXXXXXX 701
                 S MRP+G+  +Q RP QSSLRP   S P+ Q   +Q+FQGHG++R          
Sbjct: 185  -----SQMRPSGIGAHQQRPVQSSLRPPPPSAPNNQPAGSQSFQGHGLMRSSSVGSPATP 239

Query: 702  XXXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXX 881
                       N PWLSS   QGKPPLP+ + R Q NPQS QQR HIP Q          
Sbjct: 240  SPSSSLSMQSLNQPWLSSGP-QGKPPLPSAAYRQQLNPQSMQQRPHIPLQQQSTPTPLLA 298

Query: 882  XXXXXXXXXXXXXXXXXXXXEHFSQQFPPPRS---ITHQMQMSKGPGIGAQRPP-LGTTL 1049
                                EHF QQ  PPR+   + HQ Q+ +  G G Q+P  L    
Sbjct: 299  NQSQ----------------EHFGQQVLPPRAPLHVPHQPQIMRVHGPGNQKPSSLVAAQ 342

Query: 1050 SGASQPGALSKPAAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVE 1229
            S A+QPG  S+    D +ESSN IL+KRSI ELVNQ+DP EKL+PEV DILVDIAE+F+E
Sbjct: 343  SSAAQPGTRSRLTNTDTDESSNSILSKRSIHELVNQVDPLEKLEPEVADILVDIAENFLE 402

Query: 1230 SITTFGCSLAKHRKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLA 1409
            SIT  GCSLAKHRKS+TLEAKDILLHLE+NWNM L GFGG++I+ +++P  ++IH+ERL 
Sbjct: 403  SITRSGCSLAKHRKSTTLEAKDILLHLEKNWNMTLLGFGGDDIKSYRRPTTSDIHKERLT 462

Query: 1410 AVKKSMLAADTM-SKNPSG-ASGSAKGHLAKGPANTISSP 1523
             +KKSM A +    K  +G ASGSAKG+  K P N I  P
Sbjct: 463  VIKKSMAATEAAHGKGSAGQASGSAKGNQGKTPLNIIGLP 502


>ref|XP_004305348.1| PREDICTED: uncharacterized protein LOC101314490 [Fragaria vesca
            subsp. vesca]
          Length = 538

 Score =  317 bits (813), Expect = 9e-84
 Identities = 192/428 (44%), Positives = 245/428 (57%), Gaps = 41/428 (9%)
 Frame = +3

Query: 384  RGGMAIGVPAHNPSTGAPPPASFSSL----------------TPPSYGQQSQIRQPV--- 506
            RGG+AIGVPAH+PS  +P P  + S                  P +    SQ+R P+   
Sbjct: 115  RGGIAIGVPAHHPSPPSPQPTPYPSSYGHFGGLGRGAGVSLPEPSANSNASQVRPPMPGM 174

Query: 507  QGMGMAGALGATSSMRPAGVSPNQ-LRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXX 683
            QGMGM G+LG++S MRPAG+SP+   RP QS LRP ST S QSP++QNFQGH + R    
Sbjct: 175  QGMGMLGSLGSSSQMRPAGISPHHPQRPVQSPLRPASTASNQSPSSQNFQGHNLSRASSL 234

Query: 684  XXXXXXXXXXXXXXXXXNPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHX 863
                             + PWLS S  QGKPP+P+   R Q  P S  QRSH+P Q    
Sbjct: 235  GSPSSPNTSQGLALH--SQPWLSGS--QGKPPIPSSPYRQQIAPTSMLQRSHLPQQQQQQ 290

Query: 864  XXXXXXXXXXXXXXXXXXXXXXXXXX---------------EHFSQQFPPPR----SITH 986
                                                     +HF QQ PPPR    S   
Sbjct: 291  QHLPLPTTPPQQQQGPITTASPQQHMPSVQHQQPSSSHQGHDHFGQQAPPPRITQASPRQ 350

Query: 987  QMQMSKGPGIGAQRPPLGTTLSGASQPGALSKPAAADPEESSNRILTKRSIQELVNQIDP 1166
            Q        +  +  PL      + Q    ++  +A+ EE  NRIL+KR+I ELVNQIDP
Sbjct: 351  QQNTRVQVPVNQKSFPLAAAQPNSVQSTQQNRITSAETEEPGNRILSKRTIHELVNQIDP 410

Query: 1167 SEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSSTLEAKDILLHLERNWNMALPGFG 1346
            SE+LDP+VEDIL+DIA++FVESITTF CSLAKHRKS+TLEAKDILLHLE+NWN+ LPGFG
Sbjct: 411  SERLDPDVEDILMDIADEFVESITTFSCSLAKHRKSTTLEAKDILLHLEKNWNITLPGFG 470

Query: 1347 GEEIRMFKKPLVNEIHRERLAAVKKSMLAADTM-SKNPSG-ASGSAKGHLAKGPANTISS 1520
            G+EI+ ++KP+ N+IH+ RLAA+KKSM+A +T  ++N +G A+G+AKG L K P N   S
Sbjct: 471  GDEIKGYRKPITNDIHKGRLAAIKKSMVATETANTRNSAGQATGNAKGGLVKVPVNI--S 528

Query: 1521 PPNPKIRE 1544
              N K+RE
Sbjct: 529  SQNVKMRE 536


>ref|XP_002884775.1| tata-associated factor II 58 [Arabidopsis lyrata subsp. lyrata]
            gi|297330615|gb|EFH61034.1| tata-associated factor II 58
            [Arabidopsis lyrata subsp. lyrata]
          Length = 541

 Score =  314 bits (804), Expect = 1e-82
 Identities = 198/447 (44%), Positives = 248/447 (55%), Gaps = 32/447 (7%)
 Frame = +3

Query: 276  TRPWQQ-PPYSHFSLXXXXXXXXXXXXXXXXXXXXXX---RGGMAIGVPAHNPSTGAPPP 443
            +RPWQQ   Y+HFS                          RGGMAIGVPA    + +P P
Sbjct: 98   SRPWQQHSSYTHFSSASSPLLSSSSAPASSSSSLPITGQQRGGMAIGVPASPIPSPSPTP 157

Query: 444  ASFS-SLTPPSYGQQ--------------------SQIR--QPVQGMGMAGALGATSSMR 554
            +  + S  P S+GQQ                     Q+R  Q  QG+GM G LG+ S MR
Sbjct: 158  SQHTPSAFPGSFGQQYGGLGRGTVGMSEATSNSSAPQVRMMQGTQGIGMMGTLGSGSQMR 217

Query: 555  PAGVSPNQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXX 734
            P+G++ +Q RP+QSSLRP S+PSTQSP AQNFQGH ++R                     
Sbjct: 218  PSGMAQHQQRPTQSSLRPASSPSTQSPVAQNFQGHSLMRPSPIGSPSVQSTGASQQSLQA 277

Query: 735  -NPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXX 911
             N PWLSS+  QGKPPLP  S RPQ N  S QQR HIP Q                    
Sbjct: 278  INQPWLSSTP-QGKPPLPPPSYRPQVNSPSMQQRPHIPQQ-------HLSTSAATSQPQQ 329

Query: 912  XXXXXXXXXXEHFSQQFPPPRSITHQMQMSKGPGIGAQR--PPLGTTLSGASQPGALSKP 1085
                      E   Q   P + + H  Q ++  G+  Q+   P+  +    +QPG  +K 
Sbjct: 330  LQSQQQHQPQEQLQQLRSPQQPLPHPHQPTRVQGLVNQKVTSPVMPSQPPVAQPGNHAKT 389

Query: 1086 AAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKH 1265
             +A+ E S +RIL KRSI EL+ QIDPSEKLDPEVEDIL DIAEDFVESITTFGCSLAKH
Sbjct: 390  VSAENETSDDRILGKRSIHELLQQIDPSEKLDPEVEDILADIAEDFVESITTFGCSLAKH 449

Query: 1266 RKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTM 1445
            RKS TLEAKDILLH+ERNWN+  PGF  +E + F+KPL  +IH+ERLAA+KKS+ A +  
Sbjct: 450  RKSDTLEAKDILLHVERNWNIRPPGFSSDEFKTFRKPLTTDIHKERLAAIKKSVTATEAA 509

Query: 1446 S-KNPSG-ASGSAKGHLAKGPANTISS 1520
            S +N  G  + +A+G  +K P+N + S
Sbjct: 510  SARNQFGHGTANARGGQSKTPSNPLGS 536


>ref|XP_004143314.1| PREDICTED: uncharacterized protein LOC101211513 [Cucumis sativus]
          Length = 511

 Score =  313 bits (803), Expect = 1e-82
 Identities = 198/437 (45%), Positives = 245/437 (56%), Gaps = 30/437 (6%)
 Frame = +3

Query: 285  WQQPP-YSHFSLXXXXXXXXXXXXXXXXXXXXXXRGGMAIGVPAHNPSTGAPPPASFSSL 461
            WQ P  +SHFS                       R G+AIGVPAH P T +P PA FS+ 
Sbjct: 92   WQPPSHFSHFSSPSPSASSSVPSPRPVSASSPAQRSGVAIGVPAHQP-TPSPQPAPFSA- 149

Query: 462  TPPSYGQQ--------------------SQIRQPVQGM---GMAGALGATSSMRPAGVSP 572
               SYGQ                     SQ+R P+QGM   GM G+ G++S M       
Sbjct: 150  ---SYGQHFGGLGRGGVSISDGASNSNPSQVRPPMQGMQGLGMLGSSGSSSQMLH----- 201

Query: 573  NQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXXNPPWLS 752
               RP QSSLRP STP++   A+QNFQGHG+LRV                    N PWL 
Sbjct: 202  ---RPVQSSLRPPSTPNS---ASQNFQGHGLLRVPSTSSPSSSLPNTSQGMQPTNQPWLP 255

Query: 753  SSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXXXXXXXXX 932
            SS+ QGKPPLPT S RPQ N  + QQRSHIP Q +H                        
Sbjct: 256  SSS-QGKPPLPTPSYRPQANSPAMQQRSHIPQQQNHPLTPVSQQQQISSAPQQQPAQSHQ 314

Query: 933  XXXEHFSQQFPPPRS---ITHQMQMSKGPG-IGAQRPPLGTTLSGASQPGALSKPAAADP 1100
               EHF+QQF   RS   + HQ Q ++  G    +  PL    +  +Q    S+   A+ 
Sbjct: 315  PQ-EHFAQQFQQSRSSQGLPHQQQAARAQGPANPKASPLAPPQTNNAQALTPSRAITAEM 373

Query: 1101 EESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKHRKSST 1280
            EE  +RIL+KRSI +LVNQIDPSE+LDPEVEDILVD+A+ +   ITTFGCSLAKHRKS+T
Sbjct: 374  EEPCSRILSKRSIGKLVNQIDPSERLDPEVEDILVDLADTYFVQITTFGCSLAKHRKSTT 433

Query: 1281 LEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTMSKNPS 1460
            LEAKDILLHLE+NWN+ LPGFG +EI++F+KPL N+ HRER+AAVKKS++A++  S   S
Sbjct: 434  LEAKDILLHLEKNWNLTLPGFGSDEIKIFRKPLTNDTHRERVAAVKKSIVASEMASTRSS 493

Query: 1461 G--ASGSAKGHLAKGPA 1505
               A+G+ K  L K PA
Sbjct: 494  AGQAAGNTKSSLTKTPA 510


>ref|XP_006297380.1| hypothetical protein CARUB_v10013405mg [Capsella rubella]
            gi|482566089|gb|EOA30278.1| hypothetical protein
            CARUB_v10013405mg [Capsella rubella]
          Length = 539

 Score =  313 bits (801), Expect = 2e-82
 Identities = 196/450 (43%), Positives = 245/450 (54%), Gaps = 32/450 (7%)
 Frame = +3

Query: 276  TRPWQQ-PPYSHFSLXXXXXXXXXXXXXXXXXXXXXX---RGGMAIGVPAHNPSTGAPPP 443
            +RPWQQ   Y+HFS                          RGGMAIGVPA      +P P
Sbjct: 97   SRPWQQHSSYTHFSSASSPLLSSTSAPASSSSSLPITGQQRGGMAIGVPASPIPNPSPSP 156

Query: 444  ASFS-SLTPPSYGQQ----------------------SQIRQPVQGMGMAGALGATSSMR 554
            +  S S  P S+GQQ                      +++ Q  QG+GM G LG++S MR
Sbjct: 157  SQHSPSAFPGSFGQQYGGLGRGTVGMSEATSNTSVPQARMMQGTQGIGMMGTLGSSSQMR 216

Query: 555  PAGVSPNQLRPSQSSLRPQSTPSTQSPAAQNFQGHGMLRVXXXXXXXXXXXXXXXXXXXX 734
            P G++ +Q RP+QSSLRP S+PSTQSP  QNFQGH ++R                     
Sbjct: 217  PTGMAQHQQRPTQSSLRPASSPSTQSPVTQNFQGHSLMRPSPIGSPSVHSTGASQQSLQG 276

Query: 735  -NPPWLSSSATQGKPPLPTQSLRPQTNPQSFQQRSHIPHQLHHXXXXXXXXXXXXXXXXX 911
             N PWLSS+  QGKPPLP  + RP  N  S QQR HIP Q                    
Sbjct: 277  INQPWLSSTP-QGKPPLPPPTYRPPVNSPSMQQRPHIPPQ-------HLSTPAATSQPQQ 328

Query: 912  XXXXXXXXXXEHFSQQFPPPRSITHQMQMSKGPGIGAQR--PPLGTTLSGASQPGALSKP 1085
                      E   Q   P +S+ H  Q ++  G   Q+   P+  +    +QPG  +K 
Sbjct: 329  QQSQQQHQPQEQLQQLRSPQQSLPHPHQPNRVQGSVNQKVTSPVMPSQPPVAQPGNQTKT 388

Query: 1086 AAADPEESSNRILTKRSIQELVNQIDPSEKLDPEVEDILVDIAEDFVESITTFGCSLAKH 1265
             +A+ E S +RIL KRSI EL+ QIDPSEKLDPEVEDIL DIAEDFVESITTFGCSLAKH
Sbjct: 389  VSAEIEASDDRILGKRSIHELLQQIDPSEKLDPEVEDILADIAEDFVESITTFGCSLAKH 448

Query: 1266 RKSSTLEAKDILLHLERNWNMALPGFGGEEIRMFKKPLVNEIHRERLAAVKKSMLAADTM 1445
            RKS TLEAKDILLH+ERNWN+  PGF  +E + F+KPL  +IH+ERLA +KKS++A +  
Sbjct: 449  RKSDTLEAKDILLHVERNWNIRPPGFSSDEFKTFRKPLTTDIHKERLATIKKSVMATEAA 508

Query: 1446 SKNPSGASG--SAKGHLAKGPANTISSPPN 1529
            +   S   G  +A+G  AK PAN ++S  N
Sbjct: 509  NARNSFGHGTANARGGQAKTPANPLASTFN 538

