BLASTX nr result

ID: Paeonia23_contig00007958 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00007958
         (1321 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB26151.1| Zinc finger protein CONSTANS-LIKE 2 [Morus notabi...   208   4e-51
ref|NP_001268172.1| zinc finger protein-like [Vitis vinifera] gi...   205   3e-50
ref|XP_006427763.1| hypothetical protein CICLE_v10026364mg [Citr...   200   1e-48
ref|XP_006465171.1| PREDICTED: uncharacterized protein LOC102616...   197   7e-48
ref|XP_002316440.1| zinc finger family protein [Populus trichoca...   197   9e-48
ref|XP_004303825.1| PREDICTED: uncharacterized protein LOC101291...   194   7e-47
ref|XP_002311907.2| zinc finger family protein [Populus trichoca...   184   8e-44
gb|ADL36668.1| COL domain class transcription factor [Malus dome...   184   8e-44
ref|XP_004507021.1| PREDICTED: uncharacterized protein LOC101494...   160   2e-36
ref|XP_006357444.1| PREDICTED: uncharacterized protein LOC102580...   159   3e-36
ref|XP_003530172.1| PREDICTED: uncharacterized protein LOC100781...   157   8e-36
ref|XP_006585659.1| PREDICTED: uncharacterized protein LOC102667...   157   1e-35
ref|XP_007142121.1| hypothetical protein PHAVU_008G254400g [Phas...   153   2e-34
ref|XP_004242261.1| PREDICTED: uncharacterized protein LOC101262...   151   5e-34
gb|EYU21215.1| hypothetical protein MIMGU_mgv1a009313mg [Mimulus...   148   5e-33
ref|NP_188752.1| B-box 32 protein [Arabidopsis thaliana] gi|1199...   132   3e-28
ref|XP_006299141.1| hypothetical protein CARUB_v10015283mg [Caps...   129   4e-27
ref|XP_002885422.1| hypothetical protein ARALYDRAFT_342260 [Arab...   127   1e-26
ref|XP_006406309.1| hypothetical protein EUTSA_v10021479mg [Eutr...   125   4e-26
ref|XP_007216707.1| hypothetical protein PRUPE_ppa026514mg, part...   102   3e-19

>gb|EXB26151.1| Zinc finger protein CONSTANS-LIKE 2 [Morus notabilis]
          Length = 294

 Score =  208 bits (530), Expect = 4e-51
 Identities = 126/289 (43%), Positives = 168/289 (58%), Gaps = 25/289 (8%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            MK   CELC+ EA++YC SD AFLC+TCD  VH ANFLVARHVR+ +CS CK F G  +S
Sbjct: 1    MKGMICELCSKEASVYCDSDHAFLCWTCDADVHQANFLVARHVREPLCSNCKGFTGGFIS 60

Query: 913  GTVVCLSR--PVCRSCXXXXXXXXXXXXXXXXXXXXXXSFIN------IKKNPRREMDKI 758
            G  +  +   P+CRSC                      + ++      ++   + ++ +I
Sbjct: 61   GEGLRRNPRFPICRSCSPDESSGEDQADSLSSSSSVSSACVSSNGLKTLQFVDQPKIARI 120

Query: 757  LSSSSVTENSGENMN----FCGEVTSKKNRA-----------KPITKILRSRVPAKVDAK 623
              S SVTE S E  +    F GEV SKK              K + + +R R P  VDAK
Sbjct: 121  GPSISVTELSSEESSLPAIFSGEVMSKKTSQSLKKKKNNKMIKKVNEQIRPRGPMSVDAK 180

Query: 622  AEDIFVNWCRKMGLDGNCNIPT-ASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAA 446
            AE IF NWCR++GL+GN  + + AS ALS+C+  S+V PFRV+LAASFW+ +R+C   +A
Sbjct: 181  AEGIFGNWCRELGLNGNSAVVSLASHALSLCVGRSSVLPFRVSLAASFWWGLRSCVDKSA 240

Query: 445  TTDQNLKKLEEISGVPAKLILTAQTKLARVLSVRNDQQ-DQEEGWAECS 302
                 LK+LE++S VPA+LILT   KL R LS R  ++ D  EGWAECS
Sbjct: 241  RALHCLKRLEDVSRVPARLILTVGFKLDRELSARKSRRHDLAEGWAECS 289


>ref|NP_001268172.1| zinc finger protein-like [Vitis vinifera] gi|307707121|gb|ADN87331.1|
            zinc finger protein-like protein [Vitis vinifera]
          Length = 260

 Score =  205 bits (522), Expect = 3e-50
 Identities = 123/281 (43%), Positives = 155/281 (55%), Gaps = 17/281 (6%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            MK R CELCN EA+LYC SDSAFLC++CD +VHGANFLVARHVR T+CS+C    G+   
Sbjct: 1    MKGRVCELCNEEASLYCGSDSAFLCWSCDARVHGANFLVARHVRHTLCSECNGLAGDTFF 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSS--- 743
            G      R +CRSC                          ++     + D   SSSS   
Sbjct: 61   GVGFQPHRLICRSCSS-----------------------EVESETSTDHDSKSSSSSCVS 97

Query: 742  ----------VTENSGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCR 593
                      V+    E   F   V++           LR+R  + VDAKAEDI VNWCR
Sbjct: 98   TTESAPRKGGVSRRKAERTGFTSSVSAVSGVDSRFPSKLRAR--SSVDAKAEDILVNWCR 155

Query: 592  KMGLDGNCNIPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEE 413
            K+GL+G+C    AS AL VCL + TV P RV+L A+   + +     +A T QNLK+L E
Sbjct: 156  KLGLNGSCT-SVASHALGVCLVKLTVLPLRVSLVAAISCAAKLSGDRSAYTPQNLKRLVE 214

Query: 412  ISGVPAKLILTAQTKLARVLSVRNDQ----QDQEEGWAECS 302
            ISGVPAKLIL A++KLARVL +   +    +D+ EGWAECS
Sbjct: 215  ISGVPAKLILAAESKLARVLKMERRRPRHVRDRVEGWAECS 255


>ref|XP_006427763.1| hypothetical protein CICLE_v10026364mg [Citrus clementina]
            gi|557529753|gb|ESR41003.1| hypothetical protein
            CICLE_v10026364mg [Citrus clementina]
          Length = 243

 Score =  200 bits (509), Expect = 1e-48
 Identities = 120/273 (43%), Positives = 152/273 (55%), Gaps = 11/273 (4%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            RACELC+ EAAL+CASD AFLCF CD +VH ANFLVARHVRQT+CSQCK   G  +SG  
Sbjct: 3    RACELCSQEAALHCASDEAFLCFDCDDRVHKANFLVARHVRQTLCSQCKSLTGKFISGER 62

Query: 904  VCLSR-PVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILS-SSSVTEN 731
               S  P+C SC                         +  +   RE  ++ + SSSV++ 
Sbjct: 63   SSSSLVPICPSCCSSTTSTSSDCISSTES--------SAAEKMGRERKRVRACSSSVSDI 114

Query: 730  SGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLDGN---CN-- 566
            SGE                        +  A  D+KAE IF  WCR++GL+GN   CN  
Sbjct: 115  SGE------------------------KAAAVTDSKAEGIFAIWCRRLGLNGNNSNCNSV 150

Query: 565  --IPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAK 392
              +  AS+AL +CL  +T  P R  LAASFWF +R C      T  NL++LE ISGVPAK
Sbjct: 151  VVVSLASRALGLCLERTTALPLRACLAASFWFGLRMCGDKTVATWPNLRRLEAISGVPAK 210

Query: 391  LILTAQTKLARVLSVRNDQQDQ--EEGWAECSV 299
            LI+  + K+ARV++VR  +  Q  EEGWAEC+V
Sbjct: 211  LIVAVEGKIARVMAVRRRRPRQVLEEGWAECNV 243


>ref|XP_006465171.1| PREDICTED: uncharacterized protein LOC102616615 [Citrus sinensis]
          Length = 243

 Score =  197 bits (502), Expect = 7e-48
 Identities = 119/273 (43%), Positives = 152/273 (55%), Gaps = 11/273 (4%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            RACELC+ EAAL+CASD AFLCF CD +VH ANFLVARHVRQT+CSQCK   G  +SG +
Sbjct: 3    RACELCSQEAALHCASDEAFLCFDCDDRVHKANFLVARHVRQTLCSQCKSLTGKFISGEL 62

Query: 904  VCLSR-PVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILS-SSSVTEN 731
               S  P+C SC                         +  +   RE  ++ + SSSV++ 
Sbjct: 63   SSSSLVPICPSCCSSTTSTSSDCISSTES--------SAAEKMGRERKRVRACSSSVSDI 114

Query: 730  SGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLDGN---CN-- 566
            SGE                        +  A  D+KAE IF  WCR++GL+GN   CN  
Sbjct: 115  SGE------------------------KAAAVADSKAEGIFAIWCRRLGLNGNNSNCNSV 150

Query: 565  --IPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAK 392
              +  AS+AL + L  +T  P R  LAASFWF +R C      T  NL++LE ISGVPAK
Sbjct: 151  VVVSLASRALGLFLERTTALPLRACLAASFWFGLRMCGDKTVATWPNLRRLEAISGVPAK 210

Query: 391  LILTAQTKLARVLSVRNDQQDQ--EEGWAECSV 299
            LI+  + K+ARV++VR  +  Q  EEGWAEC+V
Sbjct: 211  LIVAVEGKIARVMAVRRRRPRQVLEEGWAECNV 243


>ref|XP_002316440.1| zinc finger family protein [Populus trichocarpa]
            gi|222865480|gb|EEF02611.1| zinc finger family protein
            [Populus trichocarpa]
          Length = 249

 Score =  197 bits (501), Expect = 9e-48
 Identities = 112/278 (40%), Positives = 153/278 (55%), Gaps = 13/278 (4%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            M  + CELC  EA +YC SD+A+LCF CD  VH ANFLVARH R+ ICS C    GN  S
Sbjct: 1    MAVKVCELCRREAGVYCDSDAAYLCFDCDSNVHNANFLVARHARRVICSGCGSITGNPFS 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTE 734
            G    LSR  C SC                                +E+D I  SSS T 
Sbjct: 61   GHTPSLSRVTCCSCSPG----------------------------NKELDSISCSSSSTL 92

Query: 733  NSG----------ENMNFCGEVTSKKNRAKPIT-KILRSRVPAKVDAKAEDIFVNWCRKM 587
            +S           EN     + TS  +  K I  + LR R+    + ++E +FVNWC+++
Sbjct: 93   SSACISSTETTRFENTRKGVKATSSSSSVKNIPGRSLRDRLKRSRNLRSEGVFVNWCKRL 152

Query: 586  GLDGNCNIPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEIS 407
            GL+GN  +  A++A+++C     + PFRV+LAASFWF +R C   + TT +NL++LEE+S
Sbjct: 153  GLNGNLVVQRATRAMALCFGRLAL-PFRVSLAASFWFGLRLCGDKSVTTWENLRRLEEVS 211

Query: 406  GVPAKLILTAQTKLARVLSVR--NDQQDQEEGWAECSV 299
            GVP KLI+T + K+ + L  +    Q++ EEGWAECSV
Sbjct: 212  GVPNKLIVTVEMKIEQALRSKRLQLQKEMEEGWAECSV 249


>ref|XP_004303825.1| PREDICTED: uncharacterized protein LOC101291940 [Fragaria vesca
            subsp. vesca]
          Length = 264

 Score =  194 bits (493), Expect = 7e-47
 Identities = 114/281 (40%), Positives = 145/281 (51%), Gaps = 18/281 (6%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            MK R CELC+  AALYCASDSAFLCF CD +VH ANFLVARHVRQ +CS CK   G  +S
Sbjct: 1    MKDRMCELCDQRAALYCASDSAFLCFRCDSRVHSANFLVARHVRQPLCSNCKSLAGYPIS 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTE 734
            G  V     +C SC                                  +D + S+ + + 
Sbjct: 61   GDGVRTDHWLCSSCSPEDFSGDDDDSLLSS-----------------SLDSVGSACASST 103

Query: 733  NSGENMNFCGEVTSKKNRAKPITKILRSRVPAK---------------VDAKAEDIFVNW 599
            +        G+V  +++ +        S VPA+                DA+AE  FVNW
Sbjct: 104  DQSLATTTTGKVCPRRSGSSVTEVSKGSYVPARFSARFMRRRMQRVRSADARAEGTFVNW 163

Query: 598  CRKMGL---DGNCNIPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNL 428
            C+K+G+   D    + +AS AL  CL+     P RVALAASFWF VR C   A +T QNL
Sbjct: 164  CKKLGMSSGDSAVVVSSASHALGFCLARLPGVPLRVALAASFWFGVRICGDRAVSTCQNL 223

Query: 427  KKLEEISGVPAKLILTAQTKLARVLSVRNDQQDQEEGWAEC 305
            +++EEISGVP KLIL    KL R L  R  + + +EGWAEC
Sbjct: 224  RRVEEISGVPVKLILAVDAKLGRELRTRRGRPEIKEGWAEC 264


>ref|XP_002311907.2| zinc finger family protein [Populus trichocarpa]
            gi|550332086|gb|EEE89274.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 250

 Score =  184 bits (467), Expect = 8e-44
 Identities = 102/273 (37%), Positives = 142/273 (52%), Gaps = 11/273 (4%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            M  + CELC  EA LYC SD+AFLCF CD  VH ANF+V+RH+R+ ICS C    G   S
Sbjct: 1    MAVKVCELCQREAGLYCDSDAAFLCFECDSNVHNANFVVSRHLRRVICSACNSLTGGSFS 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTE 734
            GT   L R  C SC                                +E+D I  SSS + 
Sbjct: 61   GTAPSLRRVTCLSCSPE----------------------------NKELDSISCSSSCSS 92

Query: 733  NSGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDA---------KAEDIFVNWCRKMGL 581
                      E T  +N  K +     + +PA+            ++E +FVNWC ++GL
Sbjct: 93   TLSSACISTTETTRFENTRKGVETSCVTNIPARFSGGRLKRSRNLRSECVFVNWCERLGL 152

Query: 580  DGNCNIPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGV 401
            +GN  +  A++A+++C     V PFRV+LAASFWF VR+C   + TT Q+L++LEE+SGV
Sbjct: 153  NGNLVVQRATRAIALCFGR-LVLPFRVSLAASFWFGVRSCGDKSVTTWQDLRRLEEVSGV 211

Query: 400  PAKLILTAQTKLARVLSVRNDQ--QDQEEGWAE 308
            P K+I   + K+   L  R  +  ++ EEGWA+
Sbjct: 212  PRKIISAVEMKIEHALRSRRLELHKNMEEGWAD 244


>gb|ADL36668.1| COL domain class transcription factor [Malus domestica]
          Length = 271

 Score =  184 bits (467), Expect = 8e-44
 Identities = 113/276 (40%), Positives = 149/276 (53%), Gaps = 15/276 (5%)
 Frame = -1

Query: 1087 FRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGT 908
            +RACELC+ EA+ YC SDSAFLC  CD +VH ANFLVARH+RQ +CS CK      V+GT
Sbjct: 5    YRACELCDQEASFYCPSDSAFLCSRCDARVHQANFLVARHLRQPLCSNCKS-----VAGT 59

Query: 907  VVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKK--NPRREMDKILSSSSVTE 734
                S  +C SC                      + I+  +    +   +   S SSVT+
Sbjct: 60   RDLHS--LCSSCSPEFFSGDCDGDAKSSSSSDCSACISSTEMGTTKTGYENRKSESSVTD 117

Query: 733  NSGENMNFCGEVTSKKNRAKP---------ITKILRSRVPAKVDAKAEDIFVNWCRKMGL 581
             SG N+ +  + +  K    P           +  R+R    VDA+AE  FVNWC+++G+
Sbjct: 118  VSGSNVPY--KFSGMKRNILPKFSGAGRNNSVRRARARTSRSVDARAEGSFVNWCKRLGV 175

Query: 580  DGNCN---IPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEI 410
            +GN     + TAS A   CL      P RV LAASFWF +R C   +  T QNL+++EE+
Sbjct: 176  NGNLAESVVSTASNAFGFCLERLASVPPRVCLAASFWFGLRFCGDRSVFTCQNLRRVEEL 235

Query: 409  SGVPAKLILTAQTKLARVLSVRNDQQDQ-EEGWAEC 305
            SGVP KLIL  + KL   L VR  ++D  EEGWAEC
Sbjct: 236  SGVPVKLILAVEAKLGSELRVRRARRDDLEEGWAEC 271


>ref|XP_004507021.1| PREDICTED: uncharacterized protein LOC101494931 [Cicer arietinum]
          Length = 240

 Score =  160 bits (404), Expect = 2e-36
 Identities = 104/273 (38%), Positives = 138/273 (50%), Gaps = 9/273 (3%)
 Frame = -1

Query: 1099 LTMKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNC 920
            + +K + CELCN +A+LYC SDSAFLC  CD  VH AN LVARH RQ ICS+C  F G  
Sbjct: 1    MEIKCKTCELCNQQASLYCPSDSAFLCRNCDDAVHAANLLVARHHRQLICSKCNGFTGIH 60

Query: 919  VSGTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSV 740
            +SGT +      C+SC                          + +NP  + D  LSSS  
Sbjct: 61   ISGTELRRLPSTCQSC--------------------------LPENPADDTDSQLSSS-- 92

Query: 739  TENSGENMNFCGEVTSKKNRAKPITKILRSRVPAK--------VDAKAEDIFVNWCRKMG 584
                 E+     +    +   + ++ +     PAK        V + AE+IFV W R++ 
Sbjct: 93   ---PSESCTTAPKKMKSRRIKRSLSSVTDETSPAKKMKIGSKSVGSVAEEIFVKWRRELE 149

Query: 583  LDGNCN-IPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEIS 407
            LD   N      +AL+VCL +  + P  V  A SFWF +R C   +  T +NL +LE+IS
Sbjct: 150  LDLPVNGNRVVVEALNVCLRKWKLLPLEVVAATSFWFGLRFCGDVSFATSRNLIRLEKIS 209

Query: 406  GVPAKLILTAQTKLARVLSVRNDQQDQEEGWAE 308
             VPAKLIL A  KLARVL+   + Q   EGW E
Sbjct: 210  KVPAKLILAAHAKLARVLTHHFELQ---EGWDE 239


>ref|XP_006357444.1| PREDICTED: uncharacterized protein LOC102580785 [Solanum tuberosum]
          Length = 262

 Score =  159 bits (401), Expect = 3e-36
 Identities = 104/279 (37%), Positives = 141/279 (50%), Gaps = 15/279 (5%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            M  + CELCN +AAL+C SDSAFLCF CD KVH ANFLVARH+R T+CS C     N  S
Sbjct: 4    MSSKLCELCNDQAALFCPSDSAFLCFHCDAKVHQANFLVARHLRLTLCSHCNSLTKNRFS 63

Query: 913  GTVVCLSR--PVCRSCXXXXXXXXXXXXXXXXXXXXXXSF---------INIKKNPRREM 767
                C  R   +C SC                      S          INI  + R++ 
Sbjct: 64   P---CSPRRPALCPSCSRNSSADSDLRSLSSSSSSTCVSSTQSSAVTQKINISFSNRKQF 120

Query: 766  DKILSSSSVTE-NSGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAED-IFVNWCR 593
             +  ++ S+ E NSG +                   ++RSR     D +A   +F++WC 
Sbjct: 121  PEYSTNDSIGEVNSGSS------------------NLVRSRSAKLRDPRAATCVFMHWCT 162

Query: 592  KMGLDGNCNI-PTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLE 416
            K+G++G   +  TA   L++C       P RVALAA FW  ++     + +T Q+LKKLE
Sbjct: 163  KLGMNGEERVVQTACSVLAICFGRFRGLPLRVALAACFWLGLKNIEEKSKSTWQSLKKLE 222

Query: 415  EISGVPAKLILTAQTKLARVLSVRN-DQQDQEEGWAECS 302
            EISGVPAK+IL  + KL +++   N  +Q  EE WAE S
Sbjct: 223  EISGVPAKIILATELKLRKIVKTNNRRRQGMEESWAESS 261


>ref|XP_003530172.1| PREDICTED: uncharacterized protein LOC100781783 [Glycine max]
            gi|347666428|gb|AEP17825.1| B-box 53 protein [Expression
            vector pMON98939]
          Length = 243

 Score =  157 bits (398), Expect = 8e-36
 Identities = 103/271 (38%), Positives = 132/271 (48%), Gaps = 9/271 (3%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            MK + CELC+  A+LYC SDSAFLCF CD  VH ANFLVARH+R+ +CS+C  F    +S
Sbjct: 1    MKPKTCELCHQLASLYCPSDSAFLCFHCDAAVHAANFLVARHLRRLLCSKCNRFAAIHIS 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSS---S 743
            G +       C SC                              P  + D + SSS   S
Sbjct: 61   GAISRHLSSTCTSCSLEI--------------------------PSADSDSLPSSSTCVS 94

Query: 742  VTENSGENMNFCGEVTSKKNRAKPITKILRSRVPA-----KVDAKAEDIFVNWCRKMGLD 578
             +E+   N     +   ++ R+   + +     PA     +      ++F  W R++GL 
Sbjct: 95   SSESCSTNQIKAEKKRRRRRRSFSSSSVTDDASPAAKKRRRNGGSVAEVFEKWSREIGLG 154

Query: 577  GNCN-IPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGV 401
               N    AS ALSVCL +    PFRVA A SFW  +R C      T QNL +LE ISGV
Sbjct: 155  LGVNGNRVASNALSVCLGKWRSLPFRVAAATSFWLGLRFCGDRGLATCQNLARLEAISGV 214

Query: 400  PAKLILTAQTKLARVLSVRNDQQDQEEGWAE 308
            PAKLIL A   LARV + R + Q   EGW E
Sbjct: 215  PAKLILGAHANLARVFTHRRELQ---EGWGE 242


>ref|XP_006585659.1| PREDICTED: uncharacterized protein LOC102667703 [Glycine max]
            gi|347666424|gb|AEP17822.1| B-box 52 protein [Expression
            vector pMON108080]
          Length = 241

 Score =  157 bits (397), Expect = 1e-35
 Identities = 102/270 (37%), Positives = 136/270 (50%), Gaps = 8/270 (2%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDG-NCV 917
            MK + CELC+ +A+LYC SDSAFLC  CD  VH ANFLVARH+R+ +CS+C  F G +  
Sbjct: 1    MKGKTCELCDQQASLYCPSDSAFLCSDCDAAVHAANFLVARHLRRLLCSKCNRFAGFHIS 60

Query: 916  SGTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVT 737
            SG +       C SC                            +NP  +    L SSS  
Sbjct: 61   SGAISRHLSSTCSSCSP--------------------------ENPSADYSDSLPSSSTC 94

Query: 736  ENSGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAK----AEDIFVNWCRKMGLDGNC 569
             +S E+ +   ++  +K R+   + +     PA    +    +E++F  W R++GL    
Sbjct: 95   VSSSESCS-TKQIKVEKKRSWSGSSVTDDASPAAKKRQRSGGSEEVFEKWSREIGLGLGL 153

Query: 568  NIP---TASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVP 398
             +     AS ALSVCL +    PFRVA A SFW  +R C      + QNL +LE ISGVP
Sbjct: 154  GVNGNRVASNALSVCLGKWRWLPFRVAAATSFWLGLRFCGDRGLASCQNLARLEAISGVP 213

Query: 397  AKLILTAQTKLARVLSVRNDQQDQEEGWAE 308
             KLIL A   LARV + R + Q   EGW E
Sbjct: 214  VKLILAAHGDLARVFTHRRELQ---EGWGE 240


>ref|XP_007142121.1| hypothetical protein PHAVU_008G254400g [Phaseolus vulgaris]
            gi|561015254|gb|ESW14115.1| hypothetical protein
            PHAVU_008G254400g [Phaseolus vulgaris]
          Length = 251

 Score =  153 bits (386), Expect = 2e-34
 Identities = 102/279 (36%), Positives = 136/279 (48%), Gaps = 20/279 (7%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            +ACELC+  A+ YC SDSAFLC  CD  VH ANFLVARH R+ ICS+C  F G  +SG  
Sbjct: 3    KACELCSNRASFYCPSDSAFLCCDCDAAVHAANFLVARHFRRRICSECNRFTGIHISGAA 62

Query: 904  VCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENS- 728
            +      C SC                         + +K P  ++D + SSS+    S 
Sbjct: 63   L---PSTCTSC-------------------------SPEKPPSDDVDSLPSSSTCVSTSE 94

Query: 727  ---GENMNFCGEVTSKKNR----------AKPITKILRSRV-PAKVDAKAEDIFVNWCRK 590
                E +        KK R          A    K  R+ V   +++ + E++F  W R+
Sbjct: 95   SCAAEKIKATRAAAGKKRRRSFWSSVIDDASQEAKKKRNSVGSVELEQEQEEVFGKWSRE 154

Query: 589  MGLD-----GNCNIPTASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLK 425
            +GL      G      AS AL+VCL +  + PFRVA A S W  +R C   +  T QNL 
Sbjct: 155  IGLGLGLGLGENGNRVASHALNVCLGKWNLLPFRVAAATSLWQGLRFCGDRSLATWQNLA 214

Query: 424  KLEEISGVPAKLILTAQTKLARVLSVRNDQQDQEEGWAE 308
            +LE+ISGVPA LIL A   LARV ++    ++  EGW E
Sbjct: 215  RLEKISGVPANLILAAHANLARVFTL---PRELHEGWGE 250


>ref|XP_004242261.1| PREDICTED: uncharacterized protein LOC101262021 [Solanum
            lycopersicum]
          Length = 261

 Score =  151 bits (382), Expect = 5e-34
 Identities = 99/266 (37%), Positives = 137/266 (51%), Gaps = 5/266 (1%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            + CELCN +AAL+C SDSAFLCF CD KVH ANFLVARH+R T+CS C        S   
Sbjct: 7    KLCELCNDQAALFCPSDSAFLCFHCDAKVHQANFLVARHLRLTLCSHCNSLTKKRFSP-- 64

Query: 904  VCLSRP--VCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTEN 731
             C   P  +C SC                      + ++  ++        + SS+  + 
Sbjct: 65   -CSPPPPALCPSCSRNSSGDSDLRSVSTTSSSSSSTCVSSTQSSAITQKINIISSNRKQF 123

Query: 730  SGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAED-IFVNWCRKMGLDGNCNI-PT 557
               + N  GEV S +        ++RSR     D +A   +F++WC K+ ++    +  T
Sbjct: 124  PDSDSN--GEVNSGR------CNLVRSRSVKLRDPRAATCVFMHWCTKLQMNREERVVQT 175

Query: 556  ASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTA 377
            A   L +C S     P RVALAA FWF ++     + T+ Q+LKKLEEISGVPAK+IL  
Sbjct: 176  ACSVLGICFSRFRGLPLRVALAACFWFGLKTTEDKSKTS-QSLKKLEEISGVPAKIILAT 234

Query: 376  QTKLARVLSVRNDQ-QDQEEGWAECS 302
            + KL +++   + Q Q  EE WAE S
Sbjct: 235  ELKLRKIMKTNHGQPQAMEESWAESS 260


>gb|EYU21215.1| hypothetical protein MIMGU_mgv1a009313mg [Mimulus guttatus]
          Length = 307

 Score =  148 bits (374), Expect = 5e-33
 Identities = 94/242 (38%), Positives = 130/242 (53%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            R CELC+GEAA++C++D+A LC++CD +VH ANFLVARHVRQ +CS C +  G+ +SG  
Sbjct: 5    RHCELCSGEAAVFCSADNAHLCWSCDARVHSANFLVARHVRQFLCSACNNLTGHSISGVG 64

Query: 904  VCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENSG 725
              L    C SC                        I+   +P +++       SV  +S 
Sbjct: 65   SDLVPATCSSCPTADDVSSLSSDNSSVC-------ISSTTSPAKKLYCGGGGQSVDSSS- 116

Query: 724  ENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLDGNCNIPTASQA 545
                    VTS++ R        +SRV      +AE +FVNW  K+G+  +  +  AS+A
Sbjct: 117  ------SSVTSERER--------KSRVDI---FEAEGVFVNWYGKLGVGDDVAVRMASRA 159

Query: 544  LSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTAQTKL 365
            + V L   TV PFR+ LAAS W  +R     +  T Q LK+LEEISG PAK+IL A +KL
Sbjct: 160  MRVFLGRLTVLPFRICLAASVWHGLRF---GSVQTWQVLKRLEEISGAPAKIILAAASKL 216

Query: 364  AR 359
             R
Sbjct: 217  ER 218


>ref|NP_188752.1| B-box 32 protein [Arabidopsis thaliana] gi|11994275|dbj|BAB01458.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|26450753|dbj|BAC42485.1| unknown protein [Arabidopsis
            thaliana] gi|28950769|gb|AAO63308.1| At3g21150
            [Arabidopsis thaliana] gi|332642946|gb|AEE76467.1| B-box
            32 protein [Arabidopsis thaliana]
            gi|347666435|gb|AEP17830.1| B-box 32 protein [Expression
            vector pMON81312]
          Length = 225

 Score =  132 bits (333), Expect = 3e-28
 Identities = 95/262 (36%), Positives = 129/262 (49%), Gaps = 5/262 (1%)
 Frame = -1

Query: 1078 CELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTVVC 899
            CELC  EA L+CA+DSAFLC +CD K H +NFL ARH R+ IC  CK    N VSG +  
Sbjct: 5    CELCGAEADLHCAADSAFLCRSCDAKFHASNFLFARHFRRVICPNCKSLTQNFVSGPL-- 62

Query: 898  LSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENSGEN 719
            L  P   +C                                    +  SSS  +     +
Sbjct: 63   LPWPPRTTC----------------------------------CSESSSSSCCSSLDCVS 88

Query: 718  MNFCGEVTSKKNRAKPITKILRSRVPAKVDA--KAEDIFVNWCRKMGLDGNCNIPTASQA 545
             +     T   NRA+       +RV AK  A   A+ IFVNWC K+GL+ +      S A
Sbjct: 89   SSELSSTTRDVNRARG----RENRVNAKAVAVTVADGIFVNWCGKLGLNRDLTNAVVSYA 144

Query: 544  -LSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTAQTK 368
             L++ +     +  RV LAA+FWF V+       TT QNLKK+E+++GV A +I   ++K
Sbjct: 145  SLALAVETRPRATKRVFLAAAFWFGVK-----NTTTWQNLKKVEDVTGVSAGMIRAVESK 199

Query: 367  LARVLS--VRNDQQDQEEGWAE 308
            LAR ++  +R  + D EEGWAE
Sbjct: 200  LARAMTQQLRRWRVDSEEGWAE 221


>ref|XP_006299141.1| hypothetical protein CARUB_v10015283mg [Capsella rubella]
            gi|482567850|gb|EOA32039.1| hypothetical protein
            CARUB_v10015283mg [Capsella rubella]
          Length = 231

 Score =  129 bits (323), Expect = 4e-27
 Identities = 92/262 (35%), Positives = 128/262 (48%), Gaps = 5/262 (1%)
 Frame = -1

Query: 1078 CELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTVVC 899
            CELCN EA L+CA+DSAFLC +CD K H +NFL ARH R+ IC  CK    + VSG +  
Sbjct: 5    CELCNAEADLHCAADSAFLCRSCDAKFHASNFLFARHFRRVICPSCKSLTRDFVSGPL-- 62

Query: 898  LSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENSGEN 719
            L  P   SC                             +         SS S +E S   
Sbjct: 63   LPWPPRTSCC---------------------------SDSSSSSSSCCSSLSSSELSSTT 95

Query: 718  MNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLD---GNCNIPTASQ 548
                G   +++   + I K  +  V A     A+ +FV WC K+GL+    N  +  AS 
Sbjct: 96   R---GVNRAERGGEQSIAKANKKAVAAV--TVADGVFVKWCDKLGLNRDLTNAVVSYASL 150

Query: 547  ALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTAQTK 368
            AL+V +     +  RV LA++FWF V+        T Q+LKK+E+++GV A +I   ++K
Sbjct: 151  ALAVEMRPRPRATNRVVLASAFWFGVK-----NTMTWQSLKKVEDVTGVAAGMIRAVESK 205

Query: 367  LAR--VLSVRNDQQDQEEGWAE 308
            +AR   L +R  + D EEGWAE
Sbjct: 206  MARAMTLQLRRWRVDSEEGWAE 227


>ref|XP_002885422.1| hypothetical protein ARALYDRAFT_342260 [Arabidopsis lyrata subsp.
            lyrata] gi|297331262|gb|EFH61681.1| hypothetical protein
            ARALYDRAFT_342260 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  127 bits (319), Expect = 1e-26
 Identities = 92/262 (35%), Positives = 126/262 (48%), Gaps = 5/262 (1%)
 Frame = -1

Query: 1078 CELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTVVC 899
            CELC  EA ++CA+DSAFLC +CD K HG+NFL ARH R+ IC  CK    + VSG +  
Sbjct: 198  CELCGAEADIHCAADSAFLCRSCDAKFHGSNFLFARHFRRVICPNCKSLTQDFVSGPL-- 255

Query: 898  LSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENSGEN 719
            L  P   +C                         +   +    +D + SS   +   G N
Sbjct: 256  LPWPPRTTCCSESS--------------------SSSSSCCSSLDCVSSSELSSTTRGVN 295

Query: 718  MNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLD---GNCNIPTASQ 548
                     ++NR K           A     A+ IFVNWC K+GL     N  +  AS 
Sbjct: 296  -----RARGRENRVK---------AKAVAVTVADGIFVNWCGKLGLKRDLTNAVVSYASL 341

Query: 547  ALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTAQTK 368
            ALSV   +   +  RV LAA+FWF V+          Q+LKK+E+++GV A +I   ++K
Sbjct: 342  ALSVAERKPRATK-RVILAAAFWFGVK-----NTMKLQSLKKVEDVTGVSAGMIRAVESK 395

Query: 367  LAR--VLSVRNDQQDQEEGWAE 308
            +AR   L +R  + D EEGWAE
Sbjct: 396  MARAMTLQLRRWRVDSEEGWAE 417


>ref|XP_006406309.1| hypothetical protein EUTSA_v10021479mg [Eutrema salsugineum]
            gi|557107455|gb|ESQ47762.1| hypothetical protein
            EUTSA_v10021479mg [Eutrema salsugineum]
          Length = 222

 Score =  125 bits (314), Expect = 4e-26
 Identities = 89/265 (33%), Positives = 127/265 (47%), Gaps = 6/265 (2%)
 Frame = -1

Query: 1084 RACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVSGTV 905
            + CELC+ +A LYC +DSAFLC +CD K H +NFL +RH+R+ IC  C+   G+ VSG++
Sbjct: 3    KLCELCSAQADLYCDADSAFLCRSCDAKFHASNFLFSRHLRRIICPDCESLTGDFVSGSL 62

Query: 904  VCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTENSG 725
                 P   SC                                              +S 
Sbjct: 63   P--PWPPRTSCCSG------------------------------------------SHSS 78

Query: 724  ENMNFCGEVTSKKNRAKPITKILRSRVPAK-VDAKAEDIFVNWCRKMGLD---GNCNIPT 557
               + C E++S K R   +    R+R   K V A A  +FV WC ++GL+    N  +  
Sbjct: 79   SASSCCSELSSTKTRKTRVVVANRARGREKTVKAVAVGVFVKWCDRLGLNEGFRNAVVSL 138

Query: 556  ASQALSVCLSESTVSPFRVALAASFWFSVRACRGSAATTDQNLKKLEEISGVPAKLILTA 377
            AS AL+V   E      +V LAA+FW  V+  R   A T   LKK+E+++GV + +I   
Sbjct: 139  ASLALAV---EKPRLKTKVILAAAFWLGVKNSR--KAMTWPTLKKVEDVTGVASGMIRAV 193

Query: 376  QTKLAR--VLSVRNDQQDQEEGWAE 308
            ++KLAR   L +R  + D EEGWAE
Sbjct: 194  ESKLARAMTLQLRRWRVDSEEGWAE 218


>ref|XP_007216707.1| hypothetical protein PRUPE_ppa026514mg, partial [Prunus persica]
            gi|462412857|gb|EMJ17906.1| hypothetical protein
            PRUPE_ppa026514mg, partial [Prunus persica]
          Length = 169

 Score =  102 bits (255), Expect = 3e-19
 Identities = 61/172 (35%), Positives = 81/172 (47%)
 Frame = -1

Query: 1093 MKFRACELCNGEAALYCASDSAFLCFTCDGKVHGANFLVARHVRQTICSQCKDFDGNCVS 914
            MK R CELC+ EA+LYC SDSAFLC  CD +VH ANFLVARH+RQ IC  CK   G+   
Sbjct: 1    MKARVCELCDQEASLYCPSDSAFLCSRCDARVHQANFLVARHIRQYICYNCKGLTGSRNI 60

Query: 913  GTVVCLSRPVCRSCXXXXXXXXXXXXXXXXXXXXXXSFINIKKNPRREMDKILSSSSVTE 734
             +      P   S                                +   D + S SSVT+
Sbjct: 61   RSFCSSCSPDNFSGHGNGDGDTQSSSSACSACVSSTDSFGGTAATKAGFDNLKSESSVTQ 120

Query: 733  NSGENMNFCGEVTSKKNRAKPITKILRSRVPAKVDAKAEDIFVNWCRKMGLD 578
             SG+  N     +  K +     +  ++R     DAKA+  F+NWC ++GL+
Sbjct: 121  VSGKLSNIPARFSGAKRKC---VQRAQARTSTSADAKAKGSFINWCSQLGLN 169

