BLASTX nr result

ID: Rheum21_contig00002224 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00002224
         (2974 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275875.1| PREDICTED: transcription factor tau subunit ...   560   e-156
emb|CBI24753.3| unnamed protein product [Vitis vinifera]              550   e-153
gb|EOY23640.1| Transcription factor IIIC, subunit 5, putative is...   527   e-146
gb|EOY23641.1| Transcription factor IIIC, subunit 5, putative is...   523   e-145
ref|XP_004251822.1| PREDICTED: general transcription factor 3C p...   509   e-141
ref|XP_004297697.1| PREDICTED: general transcription factor 3C p...   499   e-138
ref|XP_006350004.1| PREDICTED: general transcription factor 3C p...   497   e-137
ref|XP_004287180.1| PREDICTED: general transcription factor 3C p...   491   e-136
ref|XP_006464858.1| PREDICTED: general transcription factor 3C p...   486   e-134
ref|XP_003537671.1| PREDICTED: general transcription factor 3C p...   481   e-133
ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidops...   474   e-130
dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]           474   e-130
gb|EOY23639.1| General transcription factor 3C polypeptide 5, pu...   472   e-130
gb|EMJ05053.1| hypothetical protein PRUPE_ppa004640mg [Prunus pe...   471   e-130
ref|XP_006290824.1| hypothetical protein CARUB_v10016933mg [Caps...   470   e-129
ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arab...   466   e-128
ref|NP_197833.2| transcription factor IIIC, subunit 5 [Arabidops...   465   e-128
gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis]     465   e-128
ref|XP_006286747.1| hypothetical protein CARUB_v10003057mg [Caps...   457   e-125
gb|EPS67527.1| hypothetical protein M569_07249 [Genlisea aurea]       454   e-124

>ref|XP_002275875.1| PREDICTED: transcription factor tau subunit sfc1-like [Vitis
            vinifera]
          Length = 568

 Score =  560 bits (1442), Expect = e-156
 Identities = 314/589 (53%), Positives = 383/589 (65%), Gaps = 20/589 (3%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVIEEG+ISG +PS   FSV+YP YPSS  RAI+TLGG   I K R+S+SNKL+L+FRP
Sbjct: 1    MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPYSHPA G+L  C N LL+IS+K   +        QSE   +G             EE
Sbjct: 61   EDPYSHPAFGELQPCNNLLLRISKKKSTD-------GQSESVATG-------------EE 100

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
            V++          Q    +P  + ADI+A+V E YHFNGM DYQHVL VHADV   KKR 
Sbjct: 101  VEA----------QISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHADVARRKKRN 150

Query: 1477 WTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHR 1653
            W +VE  ++E GDL DVD E+LMIL PPLFS KD+PE LVLRPS+TL+ KKK E VVQ R
Sbjct: 151  WAEVEP-HLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQEGVVQQR 209

Query: 1654 WEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESL 1833
            WEM IEPCLAIDF IKEIP+KVNWE +I + S  WE Q+A+S LFDERPIW + ++ E L
Sbjct: 210  WEMGIEPCLAIDFEIKEIPKKVNWEQYIPKGSEQWEWQMAVSNLFDERPIWPKGALTERL 269

Query: 1834 DNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLR 2013
             + G  VG+  L+RLLFR AYYF +GPF RFWIRKGYDPRK+P+S IYQRIDFR+PP LR
Sbjct: 270  LDKGLNVGDYTLRRLLFRTAYYFSNGPFLRFWIRKGYDPRKNPDSCIYQRIDFRVPPSLR 329

Query: 2014 THCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATG 2187
            ++CD  A+ G KQRWE+ICSFR+FP K H  LQL EL+DD+IQ+EIRKP KQT CT ATG
Sbjct: 330  SYCDANAANGLKQRWEDICSFRVFPYKCHTSLQLFELADDYIQQEIRKPLKQTTCTGATG 389

Query: 2188 WFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSD----E 2355
            WF+  +L +LRL + +RFLS+ P   AE LLKS SDRFEKSKR+     N++P++    E
Sbjct: 390  WFSYRVLESLRLCVMVRFLSICPETSAEYLLKSASDRFEKSKRMHIYENNLRPNEEGIQE 449

Query: 2356 VSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDV-----------GF 2502
            V++ +E                           + AY    LD + V           GF
Sbjct: 450  VNKELEGDKDKEEPNDVDDDEEDEMEAENGEEELDAY--EALDMKIVERSVNTLRSSFGF 507

Query: 2503 SLEPDTYPDGENMSKNYLQELFGSFPL-DAGDGTSLLKPESDEEYHIYE 2646
            S+      D EN+S++YLQ LFGSF    AG G       SD EY IYE
Sbjct: 508  SIYILDL-DAENISRDYLQGLFGSFSFTKAGGGEVQDADTSDGEYQIYE 555


>emb|CBI24753.3| unnamed protein product [Vitis vinifera]
          Length = 597

 Score =  550 bits (1418), Expect = e-153
 Identities = 308/616 (50%), Positives = 378/616 (61%), Gaps = 47/616 (7%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVIEEG+ISG +PS   FSV+YP YPSS  RAI+TLGG   I K R+S+SNKL+L+FRP
Sbjct: 1    MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPYSHPA G+L  C N LL+IS+K                                  +
Sbjct: 61   EDPYSHPAFGELQPCNNLLLRISKKKST-------------------------------D 89

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
              SA   +K+   Q    +P  + ADI+A+V E YHFNGM DYQHVL VHADV   KKR 
Sbjct: 90   GQSAEVSSKVSKSQISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHADVARRKKRN 149

Query: 1477 WTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHR 1653
            W +VE  ++E GDL DVD E+LMIL PPLFS KD+PE LVLRPS+TL+ KKK E VVQ R
Sbjct: 150  WAEVEP-HLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQEGVVQQR 208

Query: 1654 WEMEIEPCLAIDFNIKEI--------------------------------------PRKV 1719
            WEM IEPCLAIDF IK+I                                      P+KV
Sbjct: 209  WEMGIEPCLAIDFEIKDILIIYCLYRMCITSHMTSFSRIPLKLLVTPLLTKVVEIIPKKV 268

Query: 1720 NWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLDNLGFKVGENMLKRLLFRVAYY 1899
            NWE +I + S  WE Q+A+S LFDERPIW + ++ E L + G  VG+  L+RLLFR AYY
Sbjct: 269  NWEQYIPKGSEQWEWQMAVSNLFDERPIWPKGALTERLLDKGLNVGDYTLRRLLFRTAYY 328

Query: 1900 FGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRTHCD--ASTGTKQRWEEICSFR 2073
            F +GPF RFWIRKGYDPRK+P+S IYQRIDFR+PP LR++CD  A+ G KQRWE+ICSFR
Sbjct: 329  FSNGPFLRFWIRKGYDPRKNPDSCIYQRIDFRVPPSLRSYCDANAANGLKQRWEDICSFR 388

Query: 2074 LFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGWFTSHILHTLRLRIALRFLSVS 2253
            +FP K H  LQL EL+DD+IQ+EIRKP KQT CT ATGWF+  +L +LRL + +RFLS+ 
Sbjct: 389  VFPYKCHTSLQLFELADDYIQQEIRKPLKQTTCTGATGWFSYRVLESLRLCVMVRFLSIC 448

Query: 2254 PLPGAEELLKSISDRFEKSKRIRSQVPNIKPSD----EVSQNIEEAVGXXXXXXXXXXXX 2421
            P   AE LLKS SDRFEKSKR+     N++P++    EV++ +E                
Sbjct: 449  PETSAEYLLKSASDRFEKSKRMHIYENNLRPNEEGIQEVNKELEGDKDKEEPNDVDDDEE 508

Query: 2422 XXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDGENMSKNYLQELFGSFPL-DAGDG 2598
                       + AY    +  ED   SL+  +Y D EN+S++YLQ LFGSF    AG G
Sbjct: 509  DEMEAENGEEELDAYEALDMVGEDDEDSLQSRSYLDAENISRDYLQGLFGSFSFTKAGGG 568

Query: 2599 TSLLKPESDEEYHIYE 2646
                   SD EY IYE
Sbjct: 569  EVQDADTSDGEYQIYE 584


>gb|EOY23640.1| Transcription factor IIIC, subunit 5, putative isoform 2 [Theobroma
            cacao]
          Length = 582

 Score =  527 bits (1357), Expect = e-146
 Identities = 292/577 (50%), Positives = 367/577 (63%), Gaps = 8/577 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVI+EG +SG LP+   F+V++PGYP +  RAI+TLGG + I + R+S+SNKL+L+FRP
Sbjct: 1    MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQ-SECAGSGVNNIHPEPVENELE 1296
            EDPYS PA G+L  C N LLKIS+K   +  + ++ ++  EC+ SG  +    P +    
Sbjct: 61   EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKVRECSTSGATDSE-NPKQPSQA 119

Query: 1297 EVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGCK-KR 1473
            EV            Q      T++ ADIV++V E YHF+GMADYQHVLAVHAD   K KR
Sbjct: 120  EV------------QISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRKR 167

Query: 1474 TWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQH 1650
             W + E    E G   DVD E++M++ PPLFS KD+PEN+VLRPS  LSSKKK E VVQ+
Sbjct: 168  NWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKKQEGVVQN 227

Query: 1651 RWEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIES 1830
              E+++EP LAIDFNIKEIP+KVNWE  I++ S  WE Q+ +SKLFDERPIW + SV E 
Sbjct: 228  TAEVDLEPGLAIDFNIKEIPKKVNWEELITRGSEQWEWQMIVSKLFDERPIWPKESVTER 287

Query: 1831 LDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDL 2010
            L + G K    MLKRLL  VAYYF +GPF RFWI+KGYDPRKDP+SRIYQR +FR+P  L
Sbjct: 288  LLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYDPRKDPDSRIYQRTEFRVPEPL 347

Query: 2011 RTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPAT 2184
            R++ DA+T    K +WE++CSFR+FP K   +LQL EL DD+IQ+EIRKP K   C   T
Sbjct: 348  RSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELDDDYIQQEIRKPPKLATCDSKT 407

Query: 2185 GWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKR--IRSQVPNIKPSDEV 2358
            GWF+  +L  LRLR+A+RFLSV P  GAE + KS SD FEK KR  I   V N     E+
Sbjct: 408  GWFSECVLDCLRLRVAVRFLSVYPKDGAESIRKSYSDEFEKLKRSCIYKDVFN-SHQQEI 466

Query: 2359 SQNIEEAVG-XXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDGE 2535
             +   E +G                        +  Y   +L  ED    L+PDTY D E
Sbjct: 467  RRTNRELIGDEDKERPKSSDNEEDEIDADDDEELDVYETLNLGGEDDEIPLQPDTYLDME 526

Query: 2536 NMSKNYLQELFGSFPLDAGDGTSLLKPESDEEYHIYE 2646
            N S+ YLQELFGSFP   G         SD EY IYE
Sbjct: 527  NNSRTYLQELFGSFPSVVGGDAIQAADISDGEYQIYE 563


>gb|EOY23641.1| Transcription factor IIIC, subunit 5, putative isoform 3 [Theobroma
            cacao]
          Length = 579

 Score =  523 bits (1347), Expect = e-145
 Identities = 289/581 (49%), Positives = 366/581 (62%), Gaps = 12/581 (2%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVI+EG +SG LP+   F+V++PGYP +  RAI+TLGG + I + R+S+SNKL+L+FRP
Sbjct: 1    MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQ-SECAGSGVNNIHPEPVENELE 1296
            EDPYS PA G+L  C N LLKIS+K   +  + ++ ++  EC+ SG  +    P +    
Sbjct: 61   EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKVRECSTSGATDSE-NPKQPSQA 119

Query: 1297 EVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGCK-KR 1473
            EV            Q      T++ ADIV++V E YHF+GMADYQHVLAVHAD   K KR
Sbjct: 120  EV------------QISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRKR 167

Query: 1474 TWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQH 1650
             W + E    E G   DVD E++M++ PPLFS KD+PEN+VLRPS  LSSKKK E VVQ+
Sbjct: 168  NWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKKQEGVVQN 227

Query: 1651 RWEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIES 1830
              E+++EP LAIDFNIKEIP+KVNWE  I++ S  WE Q+ +SKLFDERPIW + SV E 
Sbjct: 228  TAEVDLEPGLAIDFNIKEIPKKVNWEELITRGSEQWEWQMIVSKLFDERPIWPKESVTER 287

Query: 1831 LDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDL 2010
            L + G K    MLKRLL  VAYYF +GPF RFWI+KGYDPRKDP+SRIYQR +FR+P  L
Sbjct: 288  LLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYDPRKDPDSRIYQRTEFRVPEPL 347

Query: 2011 RTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPAT 2184
            R++ DA+T    K +WE++CSFR+FP K   +LQL EL DD+IQ+EIRKP K   C   T
Sbjct: 348  RSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELDDDYIQQEIRKPPKLATCDSKT 407

Query: 2185 GWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKR-------IRSQVPNIK 2343
            GWF+  +L  LRLR+A+RFLSV P  GAE + KS SD FEK KR         S    I+
Sbjct: 408  GWFSECVLDCLRLRVAVRFLSVYPKDGAESIRKSYSDEFEKLKRSCIYKDVFNSHQQEIR 467

Query: 2344 PSDEVSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTY 2523
             ++   ++ E                           +  Y   +L  ED    L+PDTY
Sbjct: 468  RTNRGDEDKER--------PKSSDNEEDEIDADDDEELDVYETLNLGGEDDEIPLQPDTY 519

Query: 2524 PDGENMSKNYLQELFGSFPLDAGDGTSLLKPESDEEYHIYE 2646
             D EN S+ YLQELFGSFP   G         SD EY IYE
Sbjct: 520  LDMENNSRTYLQELFGSFPSVVGGDAIQAADISDGEYQIYE 560


>ref|XP_004251822.1| PREDICTED: general transcription factor 3C polypeptide 5-like
            [Solanum lycopersicum]
          Length = 597

 Score =  509 bits (1311), Expect = e-141
 Identities = 278/588 (47%), Positives = 373/588 (63%), Gaps = 19/588 (3%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+I++G++SG LP+  VF+V+YP YPSS+ RA++TLGG+  I K RTS+SNKL+L+FRP
Sbjct: 1    MGIIKDGSVSGILPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSEC-----AGSGVNNIHPEPVE 1284
            EDPYSHP  G+L    NFLLKIS+    +  + DS + S C     +   + N   E   
Sbjct: 61   EDPYSHPTFGELKHSNNFLLKISKCKVRDVRSADSADSS-CGIVIQSSRSLVNCEQENAA 119

Query: 1285 NELEEVDSAS-DCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHAD-V 1458
             +L E    S   +K +  Q   ++  H++A+IV+ V E YHFNGM DYQHVLAVHAD  
Sbjct: 120  PKLNEPRCLSAGASKEIEMQTDTNLQEHLSANIVSHVSEAYHFNGMVDYQHVLAVHADDA 179

Query: 1459 GCKKRTWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLE 1635
              KKR W +VE    E G L DVD E++MIL P LF+ KD+P+N+VL+   T+ SK+K E
Sbjct: 180  RRKKRQWAEVEPK-FEKGGLMDVDQEDMMILLPSLFASKDMPDNIVLKSCTTVGSKRKQE 238

Query: 1636 AVVQHRWEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRA 1815
               +H WE E+EP LAIDF IKEIP+ V+WE +I Q S+ W  Q A+S+LF+ER IW + 
Sbjct: 239  G--RHNWEREMEPSLAIDFAIKEIPKPVDWEKYIPQGSDRWRWQKAVSELFEERKIWAKE 296

Query: 1816 SVIESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFR 1995
            S+ E L + G K  +NMLKRLL  VAYYF +GPF RFWI+KGYDPRKDPESRIYQ IDFR
Sbjct: 297  SLAERLHDRGLKFRDNMLKRLLCGVAYYFLNGPFRRFWIKKGYDPRKDPESRIYQNIDFR 356

Query: 1996 IPPDLRTHCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTV 2169
            +  +LR++C+  +S+G + RW++IC+FR+FP K  L LQLCEL DD+IQ+EI KPSK+  
Sbjct: 357  VHHELRSYCESRSSSGLQHRWDDICAFRVFPCKCQLALQLCELKDDYIQQEISKPSKEET 416

Query: 2170 CTPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPS 2349
            C   TGWF+ H +  LR RI +RF+SV P P AE LL S+S RFEKSKR  + V   +P 
Sbjct: 417  CNNVTGWFSFHTIDCLRRRIDVRFMSVCPHPRAESLLNSMSTRFEKSKRTHTYVKVARPE 476

Query: 2350 DEVSQN-------IEEAV--GXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGF 2502
            ++   N       ++E                            M AY    L  ++   
Sbjct: 477  EQEKTNKDAENNEVDEQAENHDVDDPDDLEDYEDEFDDDNVEEEMDAYESLDLAVQEGNV 536

Query: 2503 SLEPDTYPDGENMSKNYLQELFGSFPLDAGDGTSLLKPESDEEYHIYE 2646
            SL  D + + +N+S++YLQELFG+FP +      +   +S  EY IY+
Sbjct: 537  SLHDDPHTNHDNVSRDYLQELFGNFPSNTAGMDEVQDDQSLGEYQIYD 584


>ref|XP_004297697.1| PREDICTED: general transcription factor 3C polypeptide 5-like
            [Fragaria vesca subsp. vesca]
          Length = 553

 Score =  499 bits (1285), Expect = e-138
 Identities = 275/580 (47%), Positives = 360/580 (62%), Gaps = 11/580 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSN----KLDL 1107
            MGV+++GTISG LP   VF V+YPGYPSS++RAIDTLGG   I K  +S SN    +L+L
Sbjct: 1    MGVVKDGTISGFLPRTQVFGVHYPGYPSSMSRAIDTLGGTQAIHKAHSSASNNNNNRLEL 60

Query: 1108 YFRPEDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVEN 1287
             FR +DPYSHPA G L  C +FLLKIS+                                
Sbjct: 61   RFRHDDPYSHPAFGDLRPCNSFLLKISKSK------------------------------ 90

Query: 1288 ELEEVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGCK 1467
                   +S+   L     P +   ++ ADIVA+V + YHF+GMADYQHV+AVHADV  K
Sbjct: 91   -------SSESDLLAAKLTPETDQVNVCADIVARVPKAYHFDGMADYQHVIAVHADVARK 143

Query: 1468 -KRTWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAV 1641
             KR   + E  + + G L D+D E++MIL P  F+ KD+P+NLVLRPS TLS KK  E  
Sbjct: 144  RKRNRVETEEPHSDRGGLMDIDQEDVMILLPQFFAPKDVPDNLVLRPSGTLSVKKNQEEP 203

Query: 1642 VQHRWEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASV 1821
            VQH+ EM++EP LAIDF I EIP++ NWE +I Q S+ WE Q+A+S LFDERP+W + SV
Sbjct: 204  VQHQLEMDMEPVLAIDFGITEIPKRTNWEEYIPQDSDQWESQMAVSSLFDERPVWPKDSV 263

Query: 1822 IESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIP 2001
             E L N GF   ++ML+RLL RVAYYF  GPF RFWI+KG+DPRKDP+SRIYQ+ID+R+ 
Sbjct: 264  TERLLNKGFIFSDHMLRRLLSRVAYYFSRGPFLRFWIKKGFDPRKDPDSRIYQKIDYRVK 323

Query: 2002 PDLRTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCT 2175
            P L  +C+A++    K +W ++C+FR+FP K H  LQL EL D++IQE+IRK   QT C+
Sbjct: 324  PPLHGYCEANSANQLKHKWSDLCAFRVFPYKCHTTLQLFELDDNYIQEQIRKAPAQTTCS 383

Query: 2176 PATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDE 2355
            P TGWF+ ++L  L+ R+ +RFLSV P PGAE LLK+ ++ F+KSK+I ++  N+   + 
Sbjct: 384  PETGWFSYNVLENLKYRVQVRFLSVYPKPGAERLLKAATESFKKSKKICNK-DNLVRDEM 442

Query: 2356 VSQ--NIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPD 2529
            V Q  N E                               Y  H  AED   SL+P +Y +
Sbjct: 443  VQQQTNAELTGDVDAEEPNNVEDDEDDIEVDNGEEALDTYVGHDLAEDGEISLQPHSYLN 502

Query: 2530 GENMSKNYLQELFGSF-PLDAGDGTSLLKPESDEEYHIYE 2646
             EN+S+ +LQELFGSF P +AGD        SDEEY IYE
Sbjct: 503  MENISRTHLQELFGSFPPPEAGDDNIQDAYTSDEEYQIYE 542


>ref|XP_006350004.1| PREDICTED: general transcription factor 3C polypeptide 5-like isoform
            X1 [Solanum tuberosum] gi|565366663|ref|XP_006350006.1|
            PREDICTED: general transcription factor 3C polypeptide
            5-like isoform X3 [Solanum tuberosum]
          Length = 561

 Score =  497 bits (1279), Expect = e-137
 Identities = 274/582 (47%), Positives = 361/582 (62%), Gaps = 13/582 (2%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+I++G++SG LP+  VF+V+YP YPSS+ RA++TLGG+  I K RTS+SNKL+L+FRP
Sbjct: 1    MGIIKDGSVSGRLPTNEVFAVHYPAYPSSVERAVETLGGIQGIVKARTSESNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPYSHPA G+L    NFLLKIS+                                +++ 
Sbjct: 61   EDPYSHPAFGELKHSNNFLLKISKCK----------------------------VRDVQS 92

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHAD-VGCKKRT 1476
             DS  +C +     AP      + A+IV+ V E YHFNGM DYQHVLAVHAD    KKR 
Sbjct: 93   ADSPVNCEQENSLAAPKE---RLAANIVSHVSEGYHFNGMVDYQHVLAVHADDARRKKRQ 149

Query: 1477 WTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHR 1653
            W +VE    E G L DVD E+LMIL PPLF+ KD+P+N+VL+   TL SK+K E   +H 
Sbjct: 150  WAEVEPK-FEKGGLMDVDQEDLMILLPPLFASKDMPDNIVLKSCTTLGSKRKQEG--RHN 206

Query: 1654 WEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESL 1833
            WE E+EP LAIDF IKEIP+ V+WE +I Q+S+ W  Q A+S+LF+E  IW + S+ E L
Sbjct: 207  WEREMEPSLAIDFTIKEIPKPVDWEKYIPQSSDRWRWQKAVSELFEECKIWPKESLAERL 266

Query: 1834 DNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLR 2013
             + G K  +NMLKRLL  VAYYF +GPF RFWI+KGYDPRKDPESRIYQ IDFR+  +LR
Sbjct: 267  HDGGLKFRDNMLKRLLCGVAYYFLNGPFRRFWIKKGYDPRKDPESRIYQNIDFRVHHELR 326

Query: 2014 THCDA--STGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATG 2187
            ++C++  S+G + RW++IC+FR+FP K  L LQLCEL DD+IQ+EIRKPSK+  C   TG
Sbjct: 327  SYCESRLSSGLQHRWDDICAFRVFPCKCQLALQLCELKDDYIQQEIRKPSKEKTCNSVTG 386

Query: 2188 WFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDEVSQN 2367
            WF+ H +  LR  I +RF+SV P P AE LL SIS RFEKSKR  + +   +P ++   N
Sbjct: 387  WFSFHTVDCLRRCIDVRFMSVCPHPRAESLLNSISTRFEKSKRTHTYLKVARPEEQEKVN 446

Query: 2368 -------IEEAV--GXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDT 2520
                   ++E                            M AY    L  ++   SL  D 
Sbjct: 447  KDAENNEVDEQAENHDVDEPDDLEDYEDEFDDDNVEEEMDAYVSLDLAVQEGDVSLHDDP 506

Query: 2521 YPDGENMSKNYLQELFGSFPLDAGDGTSLLKPESDEEYHIYE 2646
            + + +N+S++YLQELFG+FP        +   +S  EY IY+
Sbjct: 507  HTNHDNVSRDYLQELFGNFPSSTAGTDEVQDDQSLGEYQIYD 548


>ref|XP_004287180.1| PREDICTED: general transcription factor 3C polypeptide 5-like
            [Fragaria vesca subsp. vesca]
          Length = 540

 Score =  491 bits (1265), Expect = e-136
 Identities = 271/578 (46%), Positives = 355/578 (61%), Gaps = 9/578 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSN--KLDLYF 1113
            MGV+++GTISG LPS   F V+YPGYPSS++RAIDTLGG   I K  +S SN  +L+L F
Sbjct: 1    MGVVKDGTISGFLPSTQAFGVHYPGYPSSMSRAIDTLGGTQAIHKAHSSASNNNRLELRF 60

Query: 1114 RPEDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENEL 1293
            R +DPYSHPA G L  C +FLLKIS+                              ++E 
Sbjct: 61   RHDDPYSHPAFGDLRPCNSFLLKISKS-----------------------------KSET 91

Query: 1294 EEVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KK 1470
            ++VD                    + ADIVA V E YHF+GMADYQHV+AVHADV   +K
Sbjct: 92   DQVD--------------------LCADIVAHVPEAYHFDGMADYQHVIAVHADVARNRK 131

Query: 1471 RTWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQ 1647
            R   + E  + + G L D+D E++MIL P LF+ KD+P+NLVLRPS TLS KK  E  VQ
Sbjct: 132  RNRVETEEPHSDRGGLMDIDQEDVMILLPQLFAPKDVPDNLVLRPSGTLSVKKNQEEPVQ 191

Query: 1648 HRWEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIE 1827
            H+ EM++EP LAIDF I EIP++ NWE +I Q S+ WE Q+A+S LFDERP+W + SV E
Sbjct: 192  HQLEMDMEPVLAIDFGISEIPKRTNWEEYIPQDSDQWESQMAVSSLFDERPVWPKDSVTE 251

Query: 1828 SLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPD 2007
             L N GF   ++ML+RLL RVAYYF  GPF RFWI+KG+DPRKDP+SRIYQ+ID+R+ P 
Sbjct: 252  RLLNKGFIFSDHMLRRLLSRVAYYFSRGPFLRFWIKKGFDPRKDPDSRIYQKIDYRVKPP 311

Query: 2008 LRTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPA 2181
            L  +C+A++    K +W ++C+FR+FP K H  LQL EL DD+IQE+IRK   QT+C+P 
Sbjct: 312  LHGYCEANSANQLKHKWSDLCAFRVFPYKCHTTLQLFELDDDYIQEQIRKAPAQTMCSPE 371

Query: 2182 TGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDEVS 2361
            TGWF+ ++L  L+ R+ +RFLSV P PGAE LLK+ ++ F KSK+I ++   ++      
Sbjct: 372  TGWFSYNLLENLKYRVQVRFLSVYPKPGAECLLKAATESFRKSKKICNKDNLVRDEMVQQ 431

Query: 2362 QNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAY--YPSHLDAEDVGFSLEPDTYPDGE 2535
            Q   E  G                             Y  H   ED   SL+P +Y + +
Sbjct: 432  QTNAELTGDVAAEEPNNVEDDEEDDIEVDNGEETLDTYGGHDLVEDGEISLQPHSYLNMD 491

Query: 2536 NMSKNYLQELFGSFPLDAGDGTSLLKP-ESDEEYHIYE 2646
            N+S+ +LQELFGSFP     G  +     SDEEY IYE
Sbjct: 492  NISRTHLQELFGSFPSPEARGDRIQDAYTSDEEYQIYE 529


>ref|XP_006464858.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Citrus
            sinensis]
          Length = 605

 Score =  486 bits (1252), Expect = e-134
 Identities = 282/599 (47%), Positives = 368/599 (61%), Gaps = 30/599 (5%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVI++G +SG LPS  VF+V+YPGY SS +RAI TLGG + I K R+SKSNKL+L FRP
Sbjct: 1    MGVIKDGKVSGNLPSNEVFAVHYPGYSSSTSRAIQTLGGSEAILKARSSKSNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVEN---- 1287
            EDPYSHPA G++  C N LLK+S+K  +   +G S   S       N     P+ +    
Sbjct: 61   EDPYSHPAFGEVRPCNNLLLKMSKKKTSQPCDGQSPKLS-------NQTFKHPLHDAADV 113

Query: 1288 ----ELEEVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHAD 1455
                E+ +++S S  ++   ++  +    ++ ADIVA+V E YHF+GMADYQHV+AVHAD
Sbjct: 114  GNVPEIHQLESDSVVSRKEAEKQKSEDQVNLFADIVARVSEAYHFDGMADYQHVVAVHAD 173

Query: 1456 VGC-KKRTWTDVETSNVETGDLADVDEN-LMILAPPLFSLKDIPENLVLRPSVTLSSKKK 1629
            V   KKR WT+VE    E G L D+DE+ +M++ PPLF+ KD+PENLVLRPSV  SS KK
Sbjct: 174  VARRKKRNWTEVEEPQFEKGGLIDLDEDDVMMILPPLFAPKDVPENLVLRPSVIPSSLKK 233

Query: 1630 LEAVVQHRWEMEIEPCLAIDFNIKEI------PRKVNWEVHISQTSNHWEPQIALSKLFD 1791
               V Q+  E +IE  LAIDFNIK+I           WE  IS+ S  W+ Q+A+SKLFD
Sbjct: 234  EARVEQNISEKDIESGLAIDFNIKDILLFYLCSSAPPWEEFISRDSEQWKWQMAVSKLFD 293

Query: 1792 ERPIWTRASVIESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESR 1971
            E+PIW ++S+ + + + G K    MLKRLL  +AYYF SGPF RFWIRKGYDPRKDPESR
Sbjct: 294  EQPIWPKSSINDRMLDEGLKFNSIMLKRLLLGIAYYFSSGPFLRFWIRKGYDPRKDPESR 353

Query: 1972 IYQRIDFRIPPDLRTHCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEI 2145
            IYQR DFR+ P LR++CD  A T  K RW+++C+F++FP+K    LQL EL DD+IQ+EI
Sbjct: 354  IYQRTDFRVKPPLRSYCDSNADTELKYRWKDLCAFQVFPTKCSTSLQLFELVDDYIQQEI 413

Query: 2146 RKPSKQTVCTPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRS 2325
            RKP K+T C+  TGWF+SH+L  +R R+ +RFLSV P  GA++LLK+ S+ FEK KRI  
Sbjct: 414  RKPVKRTTCSLQTGWFSSHVLAAIRRRVEVRFLSVFPGTGAQKLLKNASESFEKLKRICI 473

Query: 2326 QVPNIKPSDEVSQNIEEAVG-----------XXXXXXXXXXXXXXXXXXXXXXXMYAYYP 2472
                +KP  E +  I +  G                                    A   
Sbjct: 474  YKDTLKPDQEENLQINKGDGDNREKPEAVDDEEDRIEVDDEEEDRIEVDAGEEESDADET 533

Query: 2473 SHLDAEDVGFSLEPDTYPDGENMSKNYLQELFGSFPLDAGDGTSLL-KPESDEEYHIYE 2646
              +  ED   SL+  +Y   E+ S+ YLQELFGSF     D   +     SD EY IYE
Sbjct: 534  LDMVGEDDEISLQSHSYLGLESNSRIYLQELFGSFSSTDVDVDKIQDNGISDGEYQIYE 592


>ref|XP_003537671.1| PREDICTED: general transcription factor 3C polypeptide 5-like
            [Glycine max]
          Length = 547

 Score =  481 bits (1239), Expect = e-133
 Identities = 266/575 (46%), Positives = 359/575 (62%), Gaps = 7/575 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVI++GTISG LP    F V+YP YPSSI+RA+DTLGG+  I+K R SKSNKL+L FRP
Sbjct: 1    MGVIKDGTISGVLPEPQGFMVHYPAYPSSISRAVDTLGGIQAIQKARCSKSNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPYSHPA G+L    + LLKIS+                          P PV     +
Sbjct: 61   EDPYSHPAFGELRPTNSLLLKISKTKP-----------------------PPPVH----D 93

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
             +++S  T    DQ  +     + ADIVA+  E Y F GMADYQHV+ VHADV   KKR 
Sbjct: 94   AEASSSSTNGEQDQEGS-----LCADIVARFPEAYFFYGMADYQHVIPVHADVARRKKRN 148

Query: 1477 WTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHR 1653
            W+++E  + + G   D+D E++MI+ PP+F+ KD+PENLVLRP+   SSKKK E VVQ  
Sbjct: 149  WSELEELHFDKGGFMDLDHEDVMIIVPPIFAPKDVPENLVLRPATMSSSKKKPEEVVQPH 208

Query: 1654 WEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESL 1833
            +EM++EP LAIDF+IKEIP+KVNWE +I Q S+ WE Q+ +S++FDERPIW++ S+ E L
Sbjct: 209  FEMDMEPVLAIDFDIKEIPKKVNWEEYIPQGSDQWELQMVVSRMFDERPIWSKNSLTELL 268

Query: 1834 DNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLR 2013
             + G     +ML+RLL R++YYF SGPF RFWI+KGYDPRKDP SRIYQRID+R+P  LR
Sbjct: 269  LDKGLSFSHSMLRRLLSRISYYFSSGPFLRFWIKKGYDPRKDPNSRIYQRIDYRVPVPLR 328

Query: 2014 THCDASTG--TKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATG 2187
            ++CDA +   +K RW++IC+FR+FP K+   LQ  +L DD+IQ EI KP  +  CT  TG
Sbjct: 329  SYCDAHSANKSKHRWKDICAFRVFPYKFQTSLQFFDLVDDYIQSEINKPPFRPTCTSGTG 388

Query: 2188 WFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDEVSQ- 2364
            WF+ H+++ +R R+ +R+LSV P PGAE LL++ + +FEK KR   +       +E  Q 
Sbjct: 389  WFSQHMINCIRQRLMVRYLSVFPKPGAENLLRAATLKFEKLKRECYRHAMKLDGEECQQA 448

Query: 2365 --NIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDGEN 2538
               +EE                             +   H  A D    L  D+Y + EN
Sbjct: 449  NLGLEE-------NEELDNGEDEEEAAEGNDSDEEWEEEHDLAGDNEMPLPSDSYINFEN 501

Query: 2539 MSKNYLQELFGSFPLDAGDGTSLLKPESDEEYHIY 2643
            +S+ +LQ+LF +FP +  D  ++    S+EEY IY
Sbjct: 502  LSRTHLQDLFVNFPPNEIDCDNVQANGSEEEYQIY 536


>ref|NP_190510.3| transcription factor IIIC, subunit 5 [Arabidopsis thaliana]
            gi|332645018|gb|AEE78539.1| transcription factor IIIC,
            subunit 5 [Arabidopsis thaliana]
          Length = 574

 Score =  474 bits (1220), Expect = e-130
 Identities = 256/591 (43%), Positives = 354/591 (59%), Gaps = 22/591 (3%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IEEGTISG LPS   F V++PGYPSSI+RAI+TLGG+  I + R S SNKL+L FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPY+HPA G+   C  FLL+IS+++                                 E
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISKQDIKK-----------------------------PE 91

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
              S  D ++   D         + ADIVA++ E +HF+GMADYQHV+ +HAD+   KKR 
Sbjct: 92   SQSVLDTSR---DVCLEEASPVLCADIVARLSESFHFDGMADYQHVIPIHADIAQQKKRK 148

Query: 1477 WTDVETSNVETGDLADVDENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHRW 1656
            W DV+    ++  +   DE++M+L P  F+ KDIP+N+ L+P  T   KKK +A  Q+ +
Sbjct: 149  WMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDDAATQNFY 208

Query: 1657 EMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLD 1836
            E+++ P  AIDF++KEIP+K+ WE  +S++SNHW+ Q+A+S LF+ERPIWTR SV++ L 
Sbjct: 209  EIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRDSVVQRLL 268

Query: 1837 NLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRT 2016
            + G K   +ML R L R AYYF SGPF RFWI++GYDPR DPESR+YQR++FR+PP+LR 
Sbjct: 269  DKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFRVPPELRG 328

Query: 2017 HCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGW 2190
            +CD  A+  +K  W +IC+F+LFP K   +LQL EL D++IQ EIRKP KQT C+  +GW
Sbjct: 329  YCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTTCSHKSGW 388

Query: 2191 FTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPS------- 2349
            F+  +L TLRLR+A+RF+SV P  G E++ KSI + FE+S++++ Q   +KPS       
Sbjct: 389  FSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSEKVQIQKETLKPSLVKHREA 448

Query: 2350 ----------DEVSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVG 2499
                        V++N++  V                                + A D  
Sbjct: 449  TKGSEDMETFKSVNENVDANVNEDGEDENLDDEDEDEEEEEEL---------DMAAGDNE 499

Query: 2500 FSLEPDTYPDGENMSKNYLQELFGSFPLDAGD--GTSLLKPESDEEYHIYE 2646
             SL+   Y D EN S+ YLQ LF SFP    +  G   +   SD E+ IYE
Sbjct: 500  ISLDSHGYLDTENSSRTYLQGLFDSFPSSEPNLYGDFAVDDGSDGEFQIYE 550


>dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]
          Length = 574

 Score =  474 bits (1220), Expect = e-130
 Identities = 256/591 (43%), Positives = 353/591 (59%), Gaps = 22/591 (3%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IEEGTISG LPS   F V++PGYPSSI+RAI+TLGG+  I + R S SNKL+L FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPY+HPA G+   C  FLL+IS+++                                 E
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISKQDIKK-----------------------------PE 91

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
              S  D ++   D         + ADIVA++ E +HF+GMADYQHV+ +HAD+   KKR 
Sbjct: 92   SQSVLDTSR---DVCLEEASPVLCADIVARLSESFHFDGMADYQHVIPIHADIAQQKKRK 148

Query: 1477 WTDVETSNVETGDLADVDENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHRW 1656
            W DV+    ++  +   DE++M+L P  F+ KDIP+N+ L+P  T   KKK +   Q+ +
Sbjct: 149  WMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDDVATQNFY 208

Query: 1657 EMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLD 1836
            E+++ P  AIDF++KEIP+K+ WE  +S++SNHW+ Q+A+S LF+ERPIWTR SV++ L 
Sbjct: 209  EIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRDSVVQRLL 268

Query: 1837 NLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRT 2016
            + G K   +ML R L R AYYF SGPF RFWI++GYDPR DPESR+YQR++FR+PP+LR 
Sbjct: 269  DKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFRVPPELRG 328

Query: 2017 HCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGW 2190
            +CD  A+  +K  W +IC+F+LFP K   +LQL EL D++IQ EIRKP KQT C+  +GW
Sbjct: 329  YCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTTCSHKSGW 388

Query: 2191 FTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPS------- 2349
            F+  +L TLRLR+A+RF+SV P  G E++ KSI + FE+SK+++ Q   +KPS       
Sbjct: 389  FSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSKKVQIQKETLKPSLVKHREA 448

Query: 2350 ----------DEVSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVG 2499
                        V++N++  V                                + A D  
Sbjct: 449  TKGSEDIETFKSVNENVDANVNEDGEDENLDDEDEDEEEEEEL---------DMAAGDNE 499

Query: 2500 FSLEPDTYPDGENMSKNYLQELFGSFPLDAGD--GTSLLKPESDEEYHIYE 2646
             SL+   Y D EN S+ YLQ LF SFP    +  G   +   SD E+ IYE
Sbjct: 500  ISLDSHGYLDTENSSRTYLQGLFDSFPSSEPNLYGDFAVDDGSDGEFQIYE 550


>gb|EOY23639.1| General transcription factor 3C polypeptide 5, putative isoform 1
            [Theobroma cacao]
          Length = 630

 Score =  472 bits (1215), Expect = e-130
 Identities = 281/625 (44%), Positives = 357/625 (57%), Gaps = 56/625 (8%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MGVI+EG +SG LP+   F+V++PGYP +  RAI+TLGG + I + R+S+SNKL+L+FRP
Sbjct: 1    MGVIKEGRVSGTLPNDESFAVHFPGYPKTTARAIETLGGTEGILRARSSQSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQ-SECAGSGVNNIHPEPVENELE 1296
            EDPYS PA G+L  C N LLKIS+K   +  + ++ ++  EC+ SG  +    P +    
Sbjct: 61   EDPYSRPAFGELRPCNNLLLKISKKKSADGQSAEASSKVRECSTSGATDSE-NPKQPSQA 119

Query: 1297 EVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGCK-KR 1473
            EV            Q      T++ ADIV++V E YHF+GMADYQHVLAVHAD   K KR
Sbjct: 120  EV------------QISEQEQTNLCADIVSRVSEAYHFDGMADYQHVLAVHADAARKRKR 167

Query: 1474 TWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQH 1650
             W + E    E G   DVD E++M++ PPLFS KD+PEN+VLRPS  LSSKKK E VVQ+
Sbjct: 168  NWAEAEEPPFEKGGFMDVDQEDVMMILPPLFSPKDMPENIVLRPSTILSSKKKQEGVVQN 227

Query: 1651 RWE--------MEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIW 1806
              E          +     +D    +IP+KVNWE  I++ S  WE Q+ +SKLFDERPIW
Sbjct: 228  TAENVSNLDAVQILFSIFLLDLAFSQIPKKVNWEELITRGSEQWEWQMIVSKLFDERPIW 287

Query: 1807 TRASVIESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRI 1986
             + SV E L + G K    MLKRLL  VAYYF +GPF RFWI+KGYDPRKDP+SRIYQR 
Sbjct: 288  PKESVTERLLDKGLKFSHLMLKRLLLGVAYYFSNGPFLRFWIKKGYDPRKDPDSRIYQRT 347

Query: 1987 DFRIPPDLRTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSK 2160
            +FR+P  LR++ DA+T    K +WE++CSFR+FP K   +LQL EL DD+IQ+EIRKP K
Sbjct: 348  EFRVPEPLRSYSDANTANKLKHKWEDLCSFRVFPYKCQTFLQLFELDDDYIQQEIRKPPK 407

Query: 2161 QTVC-------------------TPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLK 2283
               C                      TGWF+  +L  LRLR+A+RFLSV P  GAE + K
Sbjct: 408  LATCDGGCLWGVVIGVVGDLDTLQSKTGWFSECVLDCLRLRVAVRFLSVYPKDGAESIRK 467

Query: 2284 SISDRFEKSKR--IRSQVPNIKPSDEVSQNIEEAVG-XXXXXXXXXXXXXXXXXXXXXXX 2454
            S SD FEK KR  I   V N     E+ +   E +G                        
Sbjct: 468  SYSDEFEKLKRSCIYKDVFN-SHQQEIRRTNRELIGDEDKERPKSSDNEEDEIDADDDEE 526

Query: 2455 MYAYYPSHLDAEDVGFSLEPDTY---------------------PDGENMSKNYLQELFG 2571
            +  Y   +L  ED    L+PDT+                      D EN S+ YLQELFG
Sbjct: 527  LDVYETLNLGGEDDEIPLQPDTFFGFVRIWMFFVCLRFPIYCLDLDMENNSRTYLQELFG 586

Query: 2572 SFPLDAGDGTSLLKPESDEEYHIYE 2646
            SFP   G         SD EY IYE
Sbjct: 587  SFPSVVGGDAIQAADISDGEYQIYE 611


>gb|EMJ05053.1| hypothetical protein PRUPE_ppa004640mg [Prunus persica]
          Length = 498

 Score =  471 bits (1213), Expect = e-130
 Identities = 244/500 (48%), Positives = 337/500 (67%), Gaps = 18/500 (3%)
 Frame = +1

Query: 940  MGVIEEG-TISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFR 1116
            MGV+++G T +G LPS+ VF+++YPGYPSS++RAI+TLGG   I K  +S+SN+L+L+FR
Sbjct: 1    MGVVKDGSTTTGFLPSSEVFAIHYPGYPSSMSRAIETLGGTQGIRKAHSSQSNRLELHFR 60

Query: 1117 PEDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELE 1296
             ++PYSHPA G L  C N LLKIS+   N    G +  QSE   S  + +          
Sbjct: 61   HQEPYSHPAFGDLRPCNNLLLKISKTKSNA---GQTQPQSELLASKQDEV---------- 107

Query: 1297 EVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGCKK-R 1473
                          Q P +   H   DIVA+V E YHF+GM DYQHV+ VHADV  KK R
Sbjct: 108  --------------QIPENDRVHF--DIVARVPEAYHFDGMVDYQHVVPVHADVARKKKR 151

Query: 1474 TWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQH 1650
             W +++  + + G L D+D E+ MIL P LF+ KD+P+NLVL+PSVTLS+KK  E  VQH
Sbjct: 152  NWIEIKDPHSDKGGLMDIDQEDAMILLPQLFAPKDVPDNLVLKPSVTLSAKKNQEEPVQH 211

Query: 1651 RWEMEIEPCLAIDFNIKEI-------------PRKVNWEVHISQTSNHWEPQIALSKLFD 1791
            +WEM++EP LAIDF I +I             P++ NWE +I Q S+ WE Q+A+S LFD
Sbjct: 212  QWEMDMEPVLAIDFGISDILSFVIFFLDLIMIPKRTNWEEYIPQGSDQWESQMAVSHLFD 271

Query: 1792 ERPIWTRASVIESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESR 1971
            ERP+W + S++E L + GF   +++L+RLL RVAYYF  GPF RFWI+KGYDPRKDPESR
Sbjct: 272  ERPVWPKDSLLERLVDKGFNFSDHLLRRLLSRVAYYFSRGPFLRFWIKKGYDPRKDPESR 331

Query: 1972 IYQRIDFRIPPDLRTHCDASTGT--KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEI 2145
            I+Q+IDFR+ P L+++CDA++    K RWE+IC+FR+FP K H  LQL EL DD+IQE+I
Sbjct: 332  IFQKIDFRVRPPLQSYCDANSANQPKHRWEDICAFRVFPYKCHTTLQLFELGDDYIQEQI 391

Query: 2146 RKPSKQTVCTPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRS 2325
            RKP  QT C+  TGWF+ ++L  L+  + +RFLSV P PGAE LLK+ ++ F+KSK++ S
Sbjct: 392  RKPPAQTTCSSETGWFSYNMLENLKDCVKVRFLSVFPEPGAEPLLKAATESFKKSKKM-S 450

Query: 2326 QVPNIKPSDEVSQNIEEAVG 2385
            +  +++  + V +  ++ +G
Sbjct: 451  RYEDVEEPNNVEEEEKDEIG 470


>ref|XP_006290824.1| hypothetical protein CARUB_v10016933mg [Capsella rubella]
            gi|482559531|gb|EOA23722.1| hypothetical protein
            CARUB_v10016933mg [Capsella rubella]
          Length = 571

 Score =  470 bits (1210), Expect = e-129
 Identities = 253/580 (43%), Positives = 351/580 (60%), Gaps = 11/580 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IE+GTISG LPS   F +++PGYPSSI++AI+TLGG+  I + R S SNKL+L FRP
Sbjct: 1    MGIIEDGTISGTLPSKEAFVLHFPGYPSSISKAIETLGGIQGITQARESISNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPY+HP  G+   C  FLL+IS+++        S +Q   A S V +    P       
Sbjct: 61   EDPYAHPVLGEQRPCNGFLLRISKQDIKK-----SESQPVLATSDVCSEEASPA------ 109

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
                                  + ADIVA V E +HF+GMADYQHV+ +HAD+   KKR 
Sbjct: 110  ----------------------LCADIVAHVSESFHFDGMADYQHVIPIHADIAQQKKRK 147

Query: 1477 WTDVETSNVETGDLADVDENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHRW 1656
            W ++++    T  +   DE++M+L P  F+ KDIP+N+ L+P  T   KKK +A  Q+ +
Sbjct: 148  WMEMDSLTGNTDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATTGPKKKDDAEAQNFY 207

Query: 1657 EMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLD 1836
            E+++ P  AI+F++KEIP+K+NWE  +S +S HW+ Q+++S LF+ERPIWTR SV++ L 
Sbjct: 208  EIDVGPVFAIEFSVKEIPKKLNWEEFVSPSSKHWQWQVSVSALFEERPIWTRDSVVQRLL 267

Query: 1837 NLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRT 2016
            + G K   +ML R L R AYYF SGPF RFWI++GYDPR DPESR+YQR++FR+PP+LR+
Sbjct: 268  DKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRDDPESRVYQRMEFRVPPELRS 327

Query: 2017 HCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGW 2190
            +CD  A+  +K  W +IC+F++FP K   +LQL EL D++IQ EIRKP KQT C+  TGW
Sbjct: 328  YCDANATNNSKPSWNDICAFKIFPFKCQTFLQLFELDDEYIQREIRKPPKQTTCSHKTGW 387

Query: 2191 FTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPS----DEV 2358
            F+  +L TLRLR+A+RF+SV P PG E++ KSI + FE+S++I+     +KPS     E 
Sbjct: 388  FSEAMLDTLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKIQILKETLKPSLVKHRES 447

Query: 2359 SQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLD--AEDVGFSLEPDTYPDG 2532
            ++  E+                                  LD  A D   S +   Y D 
Sbjct: 448  TKGAEDMEKCKTVNEDVDANVNEDGSDENLDDEEEEEEEELDMAAGDNEKSFDSHGYLDN 507

Query: 2533 ENMSKNYLQELFGSFPLDAGD--GTSLLKPESDEEYHIYE 2646
            EN S+ YLQ LF SFP       G   +   SD E+ IYE
Sbjct: 508  ENSSRTYLQGLFDSFPTSEPGLYGDHAVDDGSDGEFQIYE 547


>ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arabidopsis lyrata subsp.
            lyrata] gi|297321801|gb|EFH52222.1| hypothetical protein
            ARALYDRAFT_485301 [Arabidopsis lyrata subsp. lyrata]
          Length = 571

 Score =  466 bits (1198), Expect = e-128
 Identities = 254/584 (43%), Positives = 353/584 (60%), Gaps = 15/584 (2%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IEEG ISG LPS   F V++PGYPSSI+RAI+TLGG+  I + R S SNKL+L FRP
Sbjct: 1    MGIIEEGIISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGISQARESISNKLELRFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPY+HPA G+   C  FLL+IS+++                   +     +PV     +
Sbjct: 61   EDPYAHPALGEQRPCCGFLLRISKQD-------------------IKKPESQPVLATSSD 101

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
            V     C +           T + ADI+A+V E +HF+GMADYQHV+ +HAD+   KKR 
Sbjct: 102  V-----CLE--------EASTVLCADIIARVSESFHFDGMADYQHVIPIHADIAQQKKRK 148

Query: 1477 WTDVETSNVETGDLADVDENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHRW 1656
            W DV++    +  +   DE++M+L P  F+ KDIP+N+ L+P  T   KKK +A  Q+ +
Sbjct: 149  WMDVDSLTGNSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDDAATQNFY 208

Query: 1657 EMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLD 1836
            E+++ P  AIDF+I   P+K+ WE  +S++SNHW+ Q+++S LF+ERPIWTR SV++ L 
Sbjct: 209  EIDVGPVFAIDFSI---PKKLKWEDFVSRSSNHWQWQVSVSALFEERPIWTRDSVVQRLL 265

Query: 1837 NLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRT 2016
            + G K   +ML R L R AYYF SGPF RFWI++GYDPR DPESR+YQR++FR+PP+LR+
Sbjct: 266  DKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFRVPPELRS 325

Query: 2017 HCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGW 2190
            +CD  A+   K  W +IC+F+LFP K   +LQL EL D++IQ EIRKP KQT C+  +GW
Sbjct: 326  YCDANATNSAKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTTCSHKSGW 385

Query: 2191 FTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPS------- 2349
            F+  +L TLRLR+A+RF+SV P PG E++ KSI + FE+S++++SQ   +KPS       
Sbjct: 386  FSEALLDTLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKVQSQKETLKPSLVKHREA 445

Query: 2350 DEVSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSH---LDAEDVGFSLEPDT 2520
             + S+++E+                                     + A D   SL    
Sbjct: 446  TKSSEDMEKCKSVNEDVDANVNEDGDDENLDDEDEEEEEEEEEEVDMAAGDNEISLGSHG 505

Query: 2521 YPDGENMSKNYLQELFGSFPLDAGD--GTSLLKPESDEEYHIYE 2646
            Y D EN S+ YLQ LF SFP       G   +   SD E+ IYE
Sbjct: 506  YLDTENSSRTYLQGLFDSFPTSEPGLYGDFAVDDGSDGEFQIYE 549


>ref|NP_197833.2| transcription factor IIIC, subunit 5 [Arabidopsis thaliana]
            gi|332005929|gb|AED93312.1| transcription factor IIIC,
            subunit 5 [Arabidopsis thaliana]
          Length = 554

 Score =  465 bits (1197), Expect = e-128
 Identities = 261/575 (45%), Positives = 353/575 (61%), Gaps = 6/575 (1%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IE GTISG LPS   F V+YPGYPSSI+RA++TLGG+  I   R S SNKL+L+FRP
Sbjct: 1    MGIIENGTISGNLPSKEAFVVHYPGYPSSISRAVETLGGIQGITTARESTSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDP +HPA G+   C  FLLKIS+++       DS+ +S+                    
Sbjct: 61   EDPSAHPAYGERRHCNGFLLKISKEDVKK----DSLPESQ-------------------P 97

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
            V S SD         P   P  + ADIVA+V E Y F+GM DYQHV+ +HAD+   KKR 
Sbjct: 98   VISTSDAC------LPEVRPA-LCADIVARVSESYCFDGMVDYQHVIPIHADIAQQKKRK 150

Query: 1477 WTDVETSNVETGDLADV-DENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHR 1653
            W +V+ S     DL D+ DE++M+L P  FS KD P+NLVLR  VT S KKK E + Q+ 
Sbjct: 151  WMEVK-SLAGKNDLMDMADEDVMMLLPQFFSPKDRPDNLVLRLPVTSSPKKKDEELTQNL 209

Query: 1654 WEMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESL 1833
            +E++I P  AIDF++KEIP+ + WE +I  TSN W+ Q+A+S LF+ERP+WTR S+++ L
Sbjct: 210  YEIDIGPVFAIDFSVKEIPKILKWEDYIVPTSNQWKWQVAVSALFEERPVWTRDSIVQRL 269

Query: 1834 DNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLR 2013
             + G     +ML R L R AYYF  GPF RFWI++GYDPRKDPESR++QR++FR+PP+L+
Sbjct: 270  LDKGLTCTHHMLNRFLLRAAYYFSGGPFLRFWIKRGYDPRKDPESRVFQRMEFRVPPELK 329

Query: 2014 THCD--ASTGTKQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATG 2187
             +CD  A+  +K  W++IC+F++FP K   +LQL EL D++IQ+EIRKP KQT C   TG
Sbjct: 330  GYCDSNATNKSKPSWDDICAFKVFPFKCQTFLQLFELDDEYIQQEIRKPPKQTTCNYKTG 389

Query: 2188 WFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDEVSQN 2367
            WF+  +L  LRLR+A+RF+SV P PG E++ KSI + FE+S++ R Q   ++PS    Q 
Sbjct: 390  WFSEALLDNLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKTRIQKDALQPSQRNHQ- 448

Query: 2368 IEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDGENMSK 2547
             E                           +   Y    + +D+  S+    Y D EN S+
Sbjct: 449  -ETTKDMKKCKNTNKEKDDDVNADEDSEDLDDEYEEAANDDDI--SISSHGYGDMENNSR 505

Query: 2548 NYLQELFGSFPLDAGD--GTSLLKPESDEEYHIYE 2646
             YLQ LF  FP  A    G++    +SD EY IYE
Sbjct: 506  TYLQGLFNRFPSSASALYGSANDDNDSDGEYPIYE 540


>gb|EXB88280.1| hypothetical protein L484_020348 [Morus notabilis]
          Length = 553

 Score =  465 bits (1196), Expect = e-128
 Identities = 276/579 (47%), Positives = 349/579 (60%), Gaps = 10/579 (1%)
 Frame = +1

Query: 940  MGVIE-EGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFR 1116
            MGVI+ +G +SG +PS   F+V YPGYPSSI+RA++TLGG++ I K R+ +SN+L+L+FR
Sbjct: 22   MGVIKKDGRVSGFVPSKEAFAVNYPGYPSSISRAVETLGGLEAIHKARSLQSNRLELHFR 81

Query: 1117 PEDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELE 1296
            PEDPYSHPA G L  C + LLK+SR   +N    D+      A    NN+          
Sbjct: 82   PEDPYSHPAFGDLRPCNHLLLKLSRIKSSN--GQDAQVSGPSALQNGNNLDYTYTTRASG 139

Query: 1297 EVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKR 1473
               SA    K V  Q P    T+  ADIVA+V E YHF+GM DYQHV AVHADV   KKR
Sbjct: 140  STSSA----KQVDVQIPEDDQTNFCADIVARVLEAYHFDGMVDYQHVTAVHADVARRKKR 195

Query: 1474 TWTDVETSNVETGDLADVDEN-LMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQH 1650
             W ++E    E   L DVDE+ +M+L PPLF+ KD PENLVLRPSV LSSKK  EA+   
Sbjct: 196  KWLELEEPLSEKNGLMDVDEDDVMMLVPPLFAPKDFPENLVLRPSVILSSKKNEEAINH- 254

Query: 1651 RWEMEIEPCLAIDFNIKEIPRKV-NWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIE 1827
                   P L       EIP+++ NWE +I + S  WE Q+A+SKLFDERPIW + SV E
Sbjct: 255  -------PDL-------EIPKRIINWEQYIPKGSYQWELQMAVSKLFDERPIWIKHSVNE 300

Query: 1828 SLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPD 2007
             L + G+ V ++ML+RLL RVAYYF SGPF RFWI+KGYDPRKDP+SRIYQRIDFR+ P 
Sbjct: 301  RLVDKGYNVVDHMLRRLLSRVAYYFSSGPFLRFWIKKGYDPRKDPDSRIYQRIDFRVHPS 360

Query: 2008 LRTHCDASTGT-----KQRWEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVC 2172
            LR++CDA+        KQRW +IC+F++FP K    LQL EL+DD+IQ+EIRKP  Q  C
Sbjct: 361  LRSYCDANVTNQGKKEKQRWGDICTFQVFPVKCQTSLQLFELADDYIQQEIRKPPSQKTC 420

Query: 2173 TPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSD 2352
            TP TGWF+S +  +LR RI++RFLS  P PGAE LLK  ++ FEKSKR R     +   +
Sbjct: 421  TPGTGWFSSTVHDSLRHRISIRFLSTYPKPGAEHLLKEATENFEKSKR-RLSKDCVMLHE 479

Query: 2353 EVSQNIEEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDG 2532
            E  Q ++                                      EDV    EP+   D 
Sbjct: 480  EERQEVDSG-----------------------------------NEDV---QEPNIVEDE 501

Query: 2533 ENMSKNYLQELFGSFPLDAGDGTSLLKPE-SDEEYHIYE 2646
            E   +   +ELFGSFP     G  +   + SDEEY I+E
Sbjct: 502  EEEEEIDEEELFGSFPSTEAGGDKIQNADTSDEEYQIFE 540


>ref|XP_006286747.1| hypothetical protein CARUB_v10003057mg [Capsella rubella]
            gi|482555453|gb|EOA19645.1| hypothetical protein
            CARUB_v10003057mg [Capsella rubella]
          Length = 546

 Score =  457 bits (1175), Expect = e-125
 Identities = 261/573 (45%), Positives = 344/573 (60%), Gaps = 4/573 (0%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSATVFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYFRP 1119
            MG+IE GTISG LPS   F V+YPGYPSSI+RA+DTLGG+  I   R S SNKL+L+FRP
Sbjct: 1    MGIIENGTISGNLPSKEAFVVHYPGYPSSISRALDTLGGIQGITAARESTSNKLELHFRP 60

Query: 1120 EDPYSHPASGQLYSCKNFLLKISRKNHNNCFNGDSMNQSECAGSGVNNIHPEPVENELEE 1299
            EDPY+HPA G+   C  FLLKIS+++       DS+ +SE                    
Sbjct: 61   EDPYAHPAWGEQRPCNGFLLKISKEDVKK----DSLPESE-------------------P 97

Query: 1300 VDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHADVGC-KKRT 1476
            V + SD         P + P  ++ADIVA+V E Y F+GMADYQHV+ +HA +   KKR 
Sbjct: 98   VLATSDAC------LPEASPA-LSADIVARVSESYCFDGMADYQHVIPIHAGIAQQKKRK 150

Query: 1477 WTDVETSNVETGDLADVDENLMILAPPLFSLKDIPENLVLRPSVTLSSKKKLEAVVQHRW 1656
            W +V++       +   DE++M+L P  F+ KD P+NLVLR  VT S KKK E   Q  +
Sbjct: 151  WMEVKSLAGNNDLMGMADEDVMMLLPQFFAPKDRPDNLVLRLPVT-SPKKKEEEPTQTLY 209

Query: 1657 EMEIEPCLAIDFNIKEIPRKVNWEVHISQTSNHWEPQIALSKLFDERPIWTRASVIESLD 1836
            EM+I P  AIDF   +IP+ +NWE  I  +S+ W+ Q A+S LF+ERP+WTR S+++ L 
Sbjct: 210  EMDIGPVFAIDFASIQIPKILNWEDVIVPSSDQWKWQTAVSALFEERPVWTRDSIVQRLL 269

Query: 1837 NLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKGYDPRKDPESRIYQRIDFRIPPDLRT 2016
            + G K   +ML R L R AYYF  GPF RFWIR+GYDPRKDPESR++QR++FR+PP+LR 
Sbjct: 270  DKGLKCTHHMLNRFLLRAAYYFSGGPFLRFWIRRGYDPRKDPESRVFQRMEFRVPPELRG 329

Query: 2017 HCDASTGTKQR--WEEICSFRLFPSKYHLYLQLCELSDDFIQEEIRKPSKQTVCTPATGW 2190
            +CDA+   K +  W++IC+F++FP K   +LQL EL D++IQ EIRKP KQT C   TGW
Sbjct: 330  YCDANATNKSKPSWDDICAFKVFPFKCQTFLQLFELHDEYIQREIRKPPKQTTCNYKTGW 389

Query: 2191 FTSHILHTLRLRIALRFLSVSPLPGAEELLKSISDRFEKSKRIRSQVPNIKPSDEVSQNI 2370
            F+  +L  LRLR+A+RF+SV P PG E++ KSI + FE+S++ R Q   +KP     Q  
Sbjct: 390  FSEALLDNLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKTRIQKDALKPFHRNHQET 449

Query: 2371 EEAVGXXXXXXXXXXXXXXXXXXXXXXXMYAYYPSHLDAEDVGFSLEPDTYPDGENMSKN 2550
             + +                           Y    + A D   S+    Y D EN SK 
Sbjct: 450  TKDMKKHKDTNQEKDGDVNTDEDADDID-DEYEELDVAANDDEISISSHGYGDMENNSKT 508

Query: 2551 YLQELFGSFPLD-AGDGTSLLKPESDEEYHIYE 2646
            YLQ LF  FP   AGDG       SD EY IY+
Sbjct: 509  YLQGLFDRFPSSGAGDG-------SDGEYPIYD 534


>gb|EPS67527.1| hypothetical protein M569_07249 [Genlisea aurea]
          Length = 548

 Score =  454 bits (1167), Expect = e-124
 Identities = 257/510 (50%), Positives = 328/510 (64%), Gaps = 32/510 (6%)
 Frame = +1

Query: 940  MGVIEEGTISGCLPSAT--VFSVYYPGYPSSITRAIDTLGGVDKIEKVRTSKSNKLDLYF 1113
            MG+IEEG+ISG L  +   VF+V YPGYPSS+ RAI+TLGG   I KV   KS KL+L F
Sbjct: 1    MGLIEEGSISGVLAGSINGVFAVNYPGYPSSVERAIETLGGSHGILKVHADKSKKLELRF 60

Query: 1114 RPEDPYSHPASGQLYSCKNFLLKISRKNHNNCFN-------GDSMNQSECAGSGVNNIHP 1272
            RPEDPYSHPA G+  SC NFLLKIS+K   +  N        +S++  E +G G      
Sbjct: 61   RPEDPYSHPAFGERQSCNNFLLKISKKKAKDVHNETSGSSQAESLHVRESSGKGT----- 115

Query: 1273 EPVENELEEVDSASDCTKLVGDQAPNSMPTHITADIVAQVEEVYHFNGMADYQHVLAVHA 1452
                NE E + ++S       D     +   ++A IV+++ E YHFNGMADYQHVL +HA
Sbjct: 116  -AAGNESESIPASSVDEARKKD---GGIQDQLSACIVSRISEAYHFNGMADYQHVLPLHA 171

Query: 1453 DV-GCKKRTWTDVETSNVETGDLADVD-ENLMILAPPLFSLKDIPENLVLRPSVTLSSKK 1626
            D  G KKRTW +VE S V   DL DVD E++MIL PPLFSLKD PE ++L+P V  + KK
Sbjct: 172  DSSGRKKRTWAEVEKS-VGKDDLLDVDLEDIMILVPPLFSLKDQPEKILLKPCVESNVKK 230

Query: 1627 KLEAVVQHRWE--------MEIEPCLAIDFNIKEI-------PRKVNWEVHISQTSNHWE 1761
            K E   +   E        MEIEPCLAIDFN+K+I       P+ VNWE  I + S  W 
Sbjct: 231  KPEENAEPPAEESSSVTKQMEIEPCLAIDFNVKDILNFHLFVPKAVNWEELIPRNSKRWL 290

Query: 1762 PQIALSKLFDERPIWTRASVIESLDNLGFKVGENMLKRLLFRVAYYFGSGPFHRFWIRKG 1941
             Q A+  LFDE PIW ++S+ E L N G  V  N+L+RLLF  AYYF +GPF RFWIRKG
Sbjct: 291  LQRAVCDLFDEHPIWPKSSLAERLINRGMDVANNVLRRLLFIAAYYFSNGPFLRFWIRKG 350

Query: 1942 YDPRKDPESRIYQRIDFRIPPDLRTHC--DASTGTKQRWEEICSFRLFPSKYHLYLQLCE 2115
            YDPRKDP SR+YQR DFR+PP LR++C  DA +G   +WE+IC+FR+FP K  + LQL E
Sbjct: 351  YDPRKDPGSRVYQRTDFRVPPSLRSYCFSDAVSGLNDKWEDICAFRVFPRKCQISLQLFE 410

Query: 2116 LSDDFIQEEIRKP-SKQTVCTPATGWFTSHILHTLRLRIALRFLSVSPLPGAEELLKSIS 2292
            L DD+IQEEI KP  +++ C+  TGWF++  + + RLR+A RFLS+ P  G+E LLK +S
Sbjct: 411  LKDDYIQEEIVKPIHQESRCSLQTGWFSNQSIESFRLRVAQRFLSIYPEAGSETLLKHVS 470

Query: 2293 DRFEKSKR---IRSQVPNIKPSDEVSQNIE 2373
             RFE++KR   I    P +    +V+  IE
Sbjct: 471  FRFERTKRAHLIVKNPPKVGEKKDVAAEIE 500

