BLASTX nr result

ID: Mentha24_contig00009113 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00009113
         (682 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU43842.1| hypothetical protein MIMGU_mgv1a0002972mg, partia...   272   8e-71
ref|XP_006339413.1| PREDICTED: mediator of RNA polymerase II tra...   254   1e-65
ref|XP_004229451.1| PREDICTED: mediator of RNA polymerase II tra...   253   5e-65
ref|XP_007034585.1| REF4-related 1 [Theobroma cacao] gi|50871361...   247   3e-63
ref|XP_006419799.1| hypothetical protein CICLE_v10006738mg [Citr...   243   5e-62
ref|XP_002311827.1| hypothetical protein POPTR_0008s20610g [Popu...   241   1e-61
ref|XP_004165440.1| PREDICTED: mediator of RNA polymerase II tra...   240   3e-61
gb|EXB95840.1| hypothetical protein L484_010039 [Morus notabilis]     239   6e-61
ref|XP_002277484.1| PREDICTED: uncharacterized protein LOC100247...   239   8e-61
gb|EPS68585.1| hypothetical protein M569_06182 [Genlisea aurea]       236   5e-60
ref|XP_002516789.1| conserved hypothetical protein [Ricinus comm...   231   2e-58
dbj|BAC41797.1| unknown protein [Arabidopsis thaliana]                229   6e-58
ref|NP_189001.1| mediator of RNA polymerase II transcription sub...   229   6e-58
ref|XP_002883436.1| hypothetical protein ARALYDRAFT_479868 [Arab...   229   8e-58
ref|XP_007050679.1| Reduced epidermal fluorescence 4, putative i...   228   1e-57
ref|XP_007147785.1| hypothetical protein PHAVU_006G154900g [Phas...   226   4e-57
ref|XP_006406015.1| hypothetical protein EUTSA_v10019912mg [Eutr...   224   2e-56
ref|XP_004298175.1| PREDICTED: mediator of RNA polymerase II tra...   224   3e-56
dbj|BAF01832.1| hypothetical protein [Arabidopsis thaliana]           223   3e-56
ref|XP_003547235.2| PREDICTED: mediator of RNA polymerase II tra...   223   4e-56

>gb|EYU43842.1| hypothetical protein MIMGU_mgv1a0002972mg, partial [Mimulus guttatus]
          Length = 1207

 Score =  272 bits (695), Expect = 8e-71
 Identities = 156/236 (66%), Positives = 163/236 (69%), Gaps = 9/236 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYKLD+ASQRFLDLAGPALE+LAAGCP
Sbjct: 833  LAATGVDVPSLAAGVSSPAALPLPLAAFVSLTITYKLDKASQRFLDLAGPALETLAAGCP 892

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA---------IST 350
            WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAV+QLLRSCFSA         IS+
Sbjct: 893  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVVQLLRSCFSATLGLNTSCCISS 952

Query: 349  NXXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYG 170
            N           GSHFNGGISPVAPGILYLRVYRSIRD+MFLREEIVSLL+QTV+DIV  
Sbjct: 953  NGGIGALLGHGFGSHFNGGISPVAPGILYLRVYRSIRDVMFLREEIVSLLMQTVEDIVCL 1012

Query: 169  GPXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
             P                YGH               LGASVVFLTGGLGLVQSL K
Sbjct: 1013 KP------------KKSKYGHASFAAALTEVKVAASLGASVVFLTGGLGLVQSLFK 1056


>ref|XP_006339413.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            33A-like [Solanum tuberosum]
          Length = 1318

 Score =  254 bits (650), Expect = 1e-65
 Identities = 145/235 (61%), Positives = 162/235 (68%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYKLD+ASQRFL+LAGPALESLAAGCP
Sbjct: 934  LAATGVDVPSLAAGGSSPAILPLPLAAFVSLTITYKLDKASQRFLNLAGPALESLAAGCP 993

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCF--------SAISTN 347
            WPCMPIVASLWTQKAKRWSDFLVFSASRTVFL++N AVIQLL+SCF        S+IS+N
Sbjct: 994  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLNNNHAVIQLLKSCFNATLGLNSSSISSN 1053

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLRVYRSIRDIMFLREEIVSLL+Q++ DI    
Sbjct: 1054 GGIGALLGHGFGSHFYGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLMQSISDIARNE 1113

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNGK +G+               LGAS+++L+GG GLVQSL+K
Sbjct: 1114 LPRQRLNKLKIPKNGKKFGNVSLAATMTRVKLAALLGASLLWLSGGSGLVQSLIK 1168


>ref|XP_004229451.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            33A-like [Solanum lycopersicum]
          Length = 1303

 Score =  253 bits (645), Expect = 5e-65
 Identities = 144/235 (61%), Positives = 162/235 (68%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYKLD+ASQRFL+LAGPALESLAAGCP
Sbjct: 919  LAATGVDVPSLVAGGSSPAILPLPLAAFVSLTITYKLDKASQRFLNLAGPALESLAAGCP 978

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCF--------SAISTN 347
            WPCMPIVASLWTQKAKRWSDFLVFSASRTVFL+++ AVIQLL+SCF        S+IS+N
Sbjct: 979  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLNNHHAVIQLLKSCFNATLGLNSSSISSN 1038

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLRVYRSIRDIMFLREEIVSLL+Q++ DI    
Sbjct: 1039 GGIGALLGHGFGSHFYGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLMQSISDIARSE 1098

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNGK +G+               LGAS+++L+GG GLVQSL+K
Sbjct: 1099 LPRQRLNKLKILKNGKKFGNVSLAATMTRVKLAALLGASLLWLSGGSGLVQSLIK 1153


>ref|XP_007034585.1| REF4-related 1 [Theobroma cacao] gi|508713614|gb|EOY05511.1|
            REF4-related 1 [Theobroma cacao]
          Length = 1325

 Score =  247 bits (630), Expect = 3e-63
 Identities = 137/235 (58%), Positives = 159/235 (67%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYK+D+AS+RFL+LAGPALESLAA CP
Sbjct: 942  LAATGVDVPRLATGGSSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPALESLAADCP 1001

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+ DAV+QLL+SCF+A        IS+N
Sbjct: 1002 WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNRDAVVQLLKSCFTATLGLNVAPISSN 1061

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GG+SPVAPGILYLRVYRS+RDI+F+ EE+VSLL+ +V++I Y G
Sbjct: 1062 GGVGALLGHGFGSHFCGGLSPVAPGILYLRVYRSMRDIVFITEEVVSLLMDSVREIAYSG 1121

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                L AS+V+L+GGLGLVQSL+K
Sbjct: 1122 LLREKLEKLKTSKNGTKYGQVSLAAGMTRVKLAASLAASLVWLSGGLGLVQSLIK 1176


>ref|XP_006419799.1| hypothetical protein CICLE_v10006738mg [Citrus clementina]
            gi|568872251|ref|XP_006489285.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 33A-like [Citrus
            sinensis] gi|557521672|gb|ESR33039.1| hypothetical
            protein CICLE_v10006738mg [Citrus clementina]
          Length = 1331

 Score =  243 bits (619), Expect = 5e-62
 Identities = 133/234 (56%), Positives = 158/234 (67%), Gaps = 8/234 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LA TG+D+                  AF+SLTITYK+D+AS+RFL+LAGPALESLAAGCP
Sbjct: 948  LATTGIDIPSLAAGGTSPATLPLPLAAFLSLTITYKIDKASERFLNLAGPALESLAAGCP 1007

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH++DAV+QLL+SCF+A        IS+N
Sbjct: 1008 WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNSDAVVQLLKSCFTATLGLNSNPISSN 1067

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLRVYRS+RDI+F+ EEIVSLL+ +V++I + G
Sbjct: 1068 VGVGALLGHGFGSHFCGGISPVAPGILYLRVYRSMRDILFITEEIVSLLMHSVREIAFSG 1127

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLV 5
                        KNG  YG                LGAS+V+L+GGLG V SL+
Sbjct: 1128 LPQEKMEKLKASKNGMRYGQVSLAAAITRVKLAASLGASLVWLSGGLGSVHSLI 1181


>ref|XP_002311827.1| hypothetical protein POPTR_0008s20610g [Populus trichocarpa]
            gi|222851647|gb|EEE89194.1| hypothetical protein
            POPTR_0008s20610g [Populus trichocarpa]
          Length = 1304

 Score =  241 bits (616), Expect = 1e-61
 Identities = 137/235 (58%), Positives = 158/235 (67%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYK+D+AS+RFL+LAGPALESLAAGCP
Sbjct: 924  LAATGVDVPSLAAGVSSLATIPLPLAAFVSLTITYKIDKASERFLNLAGPALESLAAGCP 983

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFS--------AISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+NDAV QLL+SCFS        AIS+N
Sbjct: 984  WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNNDAVFQLLKSCFSATLGPNAAAISSN 1043

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF+GGISPVAPGILYLRVYRSIRDI+ L E+I+SL++ +V++I   G
Sbjct: 1044 GGVGALLGHGFGSHFSGGISPVAPGILYLRVYRSIRDIVSLMEDIISLMMLSVREIACTG 1103

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG   G                LGAS+++L+GGLGLVQ+L K
Sbjct: 1104 LPRERLEKLKRSKNGLRCGQFSLTAAMTRVKLAASLGASLIWLSGGLGLVQALFK 1158


>ref|XP_004165440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            33A-like [Cucumis sativus]
          Length = 1311

 Score =  240 bits (613), Expect = 3e-61
 Identities = 136/235 (57%), Positives = 156/235 (66%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYK+DRASQRFL+LAGPALESLAAGCP
Sbjct: 927  LAATGVDVPSLAAGGSSPATLPLPLAAFVSLTITYKIDRASQRFLNLAGPALESLAAGCP 986

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRWSDFLVFSASRTVFL + DAV+QLL+SCF+A        +S+N
Sbjct: 987  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLQNCDAVVQLLKSCFTATLGLTANPLSSN 1046

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGIL+LRVYRSIRD+  L EEI+SLL+ +V++I   G
Sbjct: 1047 GGVGALLGHGFGSHFCGGISPVAPGILFLRVYRSIRDVALLVEEILSLLMDSVREIACNG 1106

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                         N K YG                LGAS+V+L+GGL LVQS++K
Sbjct: 1107 AGKDKSGKLKTTNNAKRYGQISLSSAMTQVKLAASLGASLVWLSGGLVLVQSVIK 1161


>gb|EXB95840.1| hypothetical protein L484_010039 [Morus notabilis]
          Length = 1285

 Score =  239 bits (610), Expect = 6e-61
 Identities = 132/235 (56%), Positives = 159/235 (67%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AFVSLTITYK+D+AS+RFL+LAGP LE LAAGCP
Sbjct: 904  LAATGVDVPSLAAGGTSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPTLEILAAGCP 963

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRWSDFL+FSASRTVFLH++DAV+QLL+SCF+A        +S+N
Sbjct: 964  WPCMPIVASLWTQKAKRWSDFLIFSASRTVFLHNSDAVVQLLKSCFAATLGLNATPVSSN 1023

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       G+HF GG+SPVAPGILYLRVYRS+RDI+F+ E+IV++L+ +V++I   G
Sbjct: 1024 GGVGTLLGHGFGTHFCGGMSPVAPGILYLRVYRSMRDIVFMTEKIVAVLMHSVREIASSG 1083

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                LGAS+V+LTGGL LVQSL+K
Sbjct: 1084 LPRERSEKLKKTKNGVRYGQVSLAAAMTRVKLAASLGASLVWLTGGLVLVQSLIK 1138


>ref|XP_002277484.1| PREDICTED: uncharacterized protein LOC100247741 [Vitis vinifera]
            gi|297736973|emb|CBI26174.3| unnamed protein product
            [Vitis vinifera]
          Length = 1305

 Score =  239 bits (609), Expect = 8e-61
 Identities = 136/235 (57%), Positives = 155/235 (65%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AF SLTITYK+DRASQRFL+LAGPALE+LAA CP
Sbjct: 921  LAATGVDVPSLAAGGNSPATLPLPLAAFASLTITYKIDRASQRFLNLAGPALEALAADCP 980

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLH++DAV+QLL+SCF+A        IS+N
Sbjct: 981  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHNSDAVVQLLKSCFTATLGLKTTPISSN 1040

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLR YRSIRD++F+ EEIVSLL+  V++I    
Sbjct: 1041 GGVGALLGHGFGSHFCGGISPVAPGILYLRAYRSIRDVVFMAEEIVSLLMHFVREIASSQ 1100

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KN   YG                L AS+V+L+GGLGLVQSL+K
Sbjct: 1101 LSGERSEKLKKAKNEMKYGQISLGAALARVKLIASLAASLVWLSGGLGLVQSLIK 1155


>gb|EPS68585.1| hypothetical protein M569_06182 [Genlisea aurea]
          Length = 1279

 Score =  236 bits (602), Expect = 5e-60
 Identities = 139/244 (56%), Positives = 157/244 (64%), Gaps = 17/244 (6%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXA-FVSLTITYKLDRASQRFLDLAGPALESLAAGC 506
            LAATGVDV                  A FVSLTITYKLDRASQRFLDLAGPALESLAAGC
Sbjct: 892  LAATGVDVPSLEAGGGSSPAVLPLPLAAFVSLTITYKLDRASQRFLDLAGPALESLAAGC 951

Query: 505  PWPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA----------- 359
            PWPCMPIVASLWTQKAKRWSDFL+FSASRTVFLHS DA +QLLRSCF+A           
Sbjct: 952  PWPCMPIVASLWTQKAKRWSDFLIFSASRTVFLHSTDAAVQLLRSCFAATLGLNNVCGGG 1011

Query: 358  -----ISTNXXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQ 194
                 I++N           GSHF+GGISPVAPGILYLRVYRS+RD+ FLR+E+V+LL++
Sbjct: 1012 GGGAIITSNGGIGALLGHGFGSHFDGGISPVAPGILYLRVYRSMRDVTFLRDELVALLVK 1071

Query: 193  TVQDIVYGGPXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQ 14
            +V+DI                 +    G                LGASV+FLTGGLG VQ
Sbjct: 1072 SVEDIA-------AVAAVPVKLHRSRNGCPSMAAVMTKVKLAASLGASVLFLTGGLGAVQ 1124

Query: 13   SLVK 2
            SL+K
Sbjct: 1125 SLLK 1128


>ref|XP_002516789.1| conserved hypothetical protein [Ricinus communis]
            gi|223543877|gb|EEF45403.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1325

 Score =  231 bits (588), Expect = 2e-58
 Identities = 131/235 (55%), Positives = 154/235 (65%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVD+                  AFVSLTITYK+D+AS+RFL+LAGPALE LAAGCP
Sbjct: 942  LAATGVDIPSLASGGSSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPALECLAAGCP 1001

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCF--------SAISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH ++AV QLL+SCF        +AI +N
Sbjct: 1002 WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHDSNAVFQLLKSCFAATLGLSATAIYSN 1061

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLRVYRSIR+I+F+ EEI+SL++ +V++I   G
Sbjct: 1062 GGVGALLGHGFGSHFCGGISPVAPGILYLRVYRSIREIVFVTEEIISLIMLSVREIACSG 1121

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG   G                LGAS+V+L+GG+GLV SL K
Sbjct: 1122 LPREKLEKLKRSKNGLRCGQVSLTAAMTWVKVAASLGASLVWLSGGVGLVHSLFK 1176


>dbj|BAC41797.1| unknown protein [Arabidopsis thaliana]
          Length = 1309

 Score =  229 bits (584), Expect = 6e-58
 Identities = 129/235 (54%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LA TGVD+                  AFVSLTITYK+D+AS+RFL+LAGPALE LAAGCP
Sbjct: 924  LATTGVDIPSLAPGGSSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPALECLAAGCP 983

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+ DAVIQLLR+CFSA        +S +
Sbjct: 984  WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSND 1043

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLR+YR++RD + + EEI+SLL+ +V+DI    
Sbjct: 1044 GGVGALLGHGFGSHFYGGISPVAPGILYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNR 1103

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                L AS+V+LTGGLG+V  L+K
Sbjct: 1104 LSKEKLEKLKTVKNGSRYGQSSLATAMTQVKLAASLSASLVWLTGGLGVVHVLIK 1158


>ref|NP_189001.1| mediator of RNA polymerase II transcription subunit 33A [Arabidopsis
            thaliana] gi|75274224|sp|Q9LUG9.1|MD33A_ARATH RecName:
            Full=Mediator of RNA polymerase II transcription subunit
            33A; AltName: Full=REF4-related 1 protein; AltName:
            Full=REF4-resembling 1 protein gi|9294515|dbj|BAB02777.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|332643259|gb|AEE76780.1| mediator of RNA polymerase II
            transcription subunit 33A [Arabidopsis thaliana]
          Length = 1309

 Score =  229 bits (584), Expect = 6e-58
 Identities = 129/235 (54%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LA TGVD+                  AFVSLTITYK+D+AS+RFL+LAGPALE LAAGCP
Sbjct: 924  LATTGVDIPSLAPGGSSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPALECLAAGCP 983

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+ DAVIQLLR+CFSA        +S +
Sbjct: 984  WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSND 1043

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLR+YR++RD + + EEI+SLL+ +V+DI    
Sbjct: 1044 GGVGALLGHGFGSHFYGGISPVAPGILYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNR 1103

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                L AS+V+LTGGLG+V  L+K
Sbjct: 1104 LSKEKLEKLKTVKNGSRYGQSSLATAMTQVKLAASLSASLVWLTGGLGVVHVLIK 1158


>ref|XP_002883436.1| hypothetical protein ARALYDRAFT_479868 [Arabidopsis lyrata subsp.
            lyrata] gi|297329276|gb|EFH59695.1| hypothetical protein
            ARALYDRAFT_479868 [Arabidopsis lyrata subsp. lyrata]
          Length = 1309

 Score =  229 bits (583), Expect = 8e-58
 Identities = 129/235 (54%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LA TGVD+                  AFVSLTITYK+D+AS+RFL+LAGPALE LAAGCP
Sbjct: 924  LATTGVDIPSLAPGGSSPATLPLPLAAFVSLTITYKIDKASERFLNLAGPALECLAAGCP 983

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+ DAVIQLLR+CFSA        +S +
Sbjct: 984  WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSND 1043

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLR+YR++RD + + EEI+SLL+ +V+DI    
Sbjct: 1044 GGVGALLGHGFGSHFYGGISPVAPGILYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNR 1103

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                L AS+V+LTGGLG+V  L+K
Sbjct: 1104 LSKEKLERLKTVKNGTRYGQSSLATAMTQVKLAASLSASLVWLTGGLGVVHLLIK 1158


>ref|XP_007050679.1| Reduced epidermal fluorescence 4, putative isoform 1 [Theobroma
            cacao] gi|508702940|gb|EOX94836.1| Reduced epidermal
            fluorescence 4, putative isoform 1 [Theobroma cacao]
          Length = 1334

 Score =  228 bits (581), Expect = 1e-57
 Identities = 130/235 (55%), Positives = 150/235 (63%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  A VSLTITYKLD+ S+RFL L GPAL SLA GCP
Sbjct: 949  LAATGVDVPSLAVGGSSPTTLPLPLAALVSLTITYKLDKGSERFLILIGPALNSLAEGCP 1008

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCF--------SAISTN 347
            WPCMPI+ASLW QK KRW+DFLVFSASRTVF HS+DAV+QLLRSCF        S I +N
Sbjct: 1009 WPCMPIIASLWAQKVKRWNDFLVFSASRTVFHHSSDAVVQLLRSCFTSTLGLSPSIIYSN 1068

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF+GG+SPVAPGILYLRV+RS+RDIMF+ EEIVSLL+ +V++I   G
Sbjct: 1069 GGVGALLGHGFGSHFSGGMSPVAPGILYLRVHRSVRDIMFMTEEIVSLLMSSVREIASSG 1128

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        K G  YG                LGAS+V+L+GGL LVQSL+K
Sbjct: 1129 LSQEKSEKLKKTKFGLRYGQVSLGAAMTRVKLAASLGASLVWLSGGLSLVQSLIK 1183


>ref|XP_007147785.1| hypothetical protein PHAVU_006G154900g [Phaseolus vulgaris]
            gi|561021008|gb|ESW19779.1| hypothetical protein
            PHAVU_006G154900g [Phaseolus vulgaris]
          Length = 1332

 Score =  226 bits (577), Expect = 4e-57
 Identities = 128/235 (54%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AF SLTITYK+D+AS+RFL+LAG  LESLAAGCP
Sbjct: 948  LAATGVDVPSLASGDSSPATLPLPLAAFTSLTITYKVDKASERFLNLAGQTLESLAAGCP 1007

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWT KAKRWSDFL+FSASRTVFLH++DAV+QLL+SCF+A        IS N
Sbjct: 1008 WPCMPIVASLWTLKAKRWSDFLIFSASRTVFLHNSDAVVQLLKSCFTATLGTNTSPISCN 1067

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                         H  GG+ PVAPGILYLR YRSIRDI+FL EEIVS+L+ +V++IV  G
Sbjct: 1068 GGVGALLGHGFKYHLCGGLCPVAPGILYLRAYRSIRDIVFLTEEIVSILMHSVREIVCSG 1127

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        K+G  YG                LGAS+V+++GGL LVQ L+K
Sbjct: 1128 LVRERLEKLKATKDGIRYGQASLAASMTRVKLAAALGASLVWISGGLMLVQLLIK 1182


>ref|XP_006406015.1| hypothetical protein EUTSA_v10019912mg [Eutrema salsugineum]
            gi|557107161|gb|ESQ47468.1| hypothetical protein
            EUTSA_v10019912mg [Eutrema salsugineum]
          Length = 1295

 Score =  224 bits (571), Expect = 2e-56
 Identities = 124/235 (52%), Positives = 149/235 (63%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LA TGVD+                  AFVSLTITYK+D+ S+RFL+LAGPALE LAAGCP
Sbjct: 912  LATTGVDIPSLAPGGSSPATLPLPLAAFVSLTITYKIDKGSERFLNLAGPALECLAAGCP 971

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRW DFLVFSASRTVFLH+ DAV+QLLR+CFSA        +S +
Sbjct: 972  WPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNPDAVVQLLRNCFSATLGLNAGPMSND 1031

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF GGISPVAPGILYLR+YR++RD + + EEI S+L+ +V+DI    
Sbjct: 1032 GGVGALLGHGFGSHFYGGISPVAPGILYLRMYRALRDTVSVTEEIFSILIHSVEDIAQNR 1091

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        +NG  YG                L AS+V+LTGG+G+V  L+K
Sbjct: 1092 LSKENLQRLKTVRNGSRYGQSSLATAMTQVKLAASLSASLVWLTGGIGVVHLLIK 1146


>ref|XP_004298175.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            33A-like [Fragaria vesca subsp. vesca]
          Length = 1322

 Score =  224 bits (570), Expect = 3e-56
 Identities = 129/235 (54%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVD+                  AFVS+TITYK+DRAS+RFL LAGP LE LAAGCP
Sbjct: 939  LAATGVDIPSLAAERSSPATLPLPLAAFVSVTITYKIDRASERFLSLAGPTLECLAAGCP 998

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWTQKAKRWSDFL+FSASRTVFL +  +V+QLL+SCF+A         S+N
Sbjct: 999  WPCMPIVASLWTQKAKRWSDFLIFSASRTVFLQNRQSVVQLLKSCFTATLGLNATPTSSN 1058

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                       GSHF G ISPVAPGILYLRVYRSI DI+F+ EEIV++L+ +V++I    
Sbjct: 1059 GGVGALLGHGFGSHFCGEISPVAPGILYLRVYRSIADIVFMTEEIVTILMHSVREIAC-D 1117

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        KNG  YG                LGAS+V+LTGGL LVQSL+K
Sbjct: 1118 VLPKERLGKSKTKNGMRYGQVSLATAMTQVKLAASLGASLVWLTGGLCLVQSLIK 1172


>dbj|BAF01832.1| hypothetical protein [Arabidopsis thaliana]
          Length = 370

 Score =  223 bits (569), Expect = 3e-56
 Identities = 121/208 (58%), Positives = 144/208 (69%), Gaps = 8/208 (3%)
 Frame = -1

Query: 601 FVSLTITYKLDRASQRFLDLAGPALESLAAGCPWPCMPIVASLWTQKAKRWSDFLVFSAS 422
           FVSLTITYK+D+AS+RFL+LAGPALE LAAGCPWPCMPIVASLWTQKAKRW DFLVFSAS
Sbjct: 12  FVSLTITYKIDKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVFSAS 71

Query: 421 RTVFLHSNDAVIQLLRSCFSA--------ISTNXXXXXXXXXXXGSHFNGGISPVAPGIL 266
           RTVFLH+ DAVIQLLR+CFSA        +S +           GSHF GGISPVAPGIL
Sbjct: 72  RTVFLHNQDAVIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAPGIL 131

Query: 265 YLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGGPXXXXXXXXXXXKNGKNYGHXXXXXXX 86
           YLR+YR++RD + + EEI+SLL+ +V+DI                KNG  YG        
Sbjct: 132 YLRMYRALRDTVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLATAM 191

Query: 85  XXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                   L AS+V+LTGGLG+V+ L++
Sbjct: 192 TQVKLAASLSASLVWLTGGLGVVRVLIR 219


>ref|XP_003547235.2| PREDICTED: mediator of RNA polymerase II transcription subunit
            33A-like [Glycine max]
          Length = 1316

 Score =  223 bits (568), Expect = 4e-56
 Identities = 125/235 (53%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
 Frame = -1

Query: 682  LAATGVDVXXXXXXXXXXXXXXXXXXAFVSLTITYKLDRASQRFLDLAGPALESLAAGCP 503
            LAATGVDV                  AF SLTITYK+D+ S+RFL+LAG  LESLAAGCP
Sbjct: 932  LAATGVDVPSLASGDSCPAILPLPLAAFTSLTITYKVDKTSERFLNLAGQTLESLAAGCP 991

Query: 502  WPCMPIVASLWTQKAKRWSDFLVFSASRTVFLHSNDAVIQLLRSCFSA--------ISTN 347
            WPCMPIVASLWT KAKRWSDFL+FSASRTVFLH++DAV+QL++SCF+A        IS++
Sbjct: 992  WPCMPIVASLWTLKAKRWSDFLIFSASRTVFLHNSDAVVQLIKSCFTATLGMNSSPISSS 1051

Query: 346  XXXXXXXXXXXGSHFNGGISPVAPGILYLRVYRSIRDIMFLREEIVSLLLQTVQDIVYGG 167
                         H  GG+ PVAPGILYLR YRSIRDI+FL EEIVS+L+ +V++IV  G
Sbjct: 1052 GGVGALLGQGFKYHLCGGLCPVAPGILYLRAYRSIRDIVFLTEEIVSILMHSVREIVCSG 1111

Query: 166  PXXXXXXXXXXXKNGKNYGHXXXXXXXXXXXXXXXLGASVVFLTGGLGLVQSLVK 2
                        K+G  YG                LGAS+V+++GGL LVQ L+K
Sbjct: 1112 LPRERLEKLKATKDGIKYGQASLAASMTRVKLAAALGASLVWISGGLMLVQLLIK 1166


Top